Lecture 5

In-class notes: CS 505 Spring 2025 Lecture 5

Cook-Levin Theorem Wrap-Up

Recall from last time that we are trying to prove that $SAT$ is $NP$-complete. To do so, we considered the single-tape non-deterministic Turing machine definition of $NP$. Our goal is to show that for any $NP$ language $L$, we have $L \leq_{p} SAT$. That is, $L$ is poly-time reducible to $SAT$.

From last time, we were able to construct an $n^{k} \times n^{k}$ table which encoded the execution of an NTM deciding $L$ on input $x$ . From this table, we constructed the Boolean formula

$ϕ = ϕ_{start} \land ϕ_{accept} \land ϕ_{cell} \land ϕ_{move}.$

The last thing to show is that our definition of $ϕ_{move}$ correctly captures the transitions of the NTM $M$ deciding $L$. Recall that $ϕ_{move}$ was defined with respect to $2 \times 3$ windows in the table, and it tried to capture the notion of a legal window. That is, $ϕ_{move} = \bigwedge_{0 \leq i < n^{k} - 1,\ 0 \leq j < n^{k} - 2} [\text{Window } W_{i, j} \text{ is legal}].$

Claim. If the table has a correct starting configuration, and all $2 \times 3$ windows are legal, then $row[i + 1]$ is a correct transition from $row[i]$ for all $0 \leq i < n^{k} - 1$.

Proof. To prove the claim, first consider any such $i$. Let $row[i]$ and $row[i + 1]$ be rows $i$ and $i + 1$ of the table. Call $row[i]$ the upper configuration and $row[i + 1]$ the lower configuration.

Consider all windows $W_{i, j}$ for $0 \leq j < n^{k} - 2$ . That is, we look at all windows in the upper and lower configuration. We now define when window $W_{i, j}$ is legal. Legal windows fall into two categories: windows which contain a state and those which do not.

No state in the window. Suppose window $W_{i, j}$ contains no state. Then we say that $W_{i, j}$ is legal if and only if the two elements in the center column are equal. The window below is an example. $W_{i, j} = \begin{bmatrix} a & b & c \\ c & b & c \end{bmatrix}$ Note that even though in the above example, the first column has $a$ then $c$, this would be a legal window because it is possible the tape head is just to the left of $a$ in the upper configuration, writes $c$ over $a$, then moves left.
State in the window. Suppose that window $W_{i, j}$ contains a state. Then window $W_{i, j}$ is legal if and only if the upper and lower configurations in this window are consistent with the transition function of the Turing machine. In particular, by our construction of the table and since the NTM is a single-tape NTM, a state in the window represents the current position of the tape head. First, we know that when transitioning from the upper configuration to the lower configuration, the state can move at most one position (left, right, or stay). This is easy to check for. Then we know that in the table, the tape head only touches the cell immediately to its right. That is, if the state is in $cell[i, j]$, then the tape head is reading from/writing to $cell[i, j + 1]$. In a nutshell, the computation of a Turing machine is highly local: it can’t jump large distances in a single time-step. Examples of legal windows are given below. $\begin{bmatrix} a & q_{1} & b \\ a & q_{2} & b \end{bmatrix} \quad \begin{bmatrix} q_{1} & a & b \\ c & q_{2} & b \end{bmatrix} \quad \begin{bmatrix} q_{1} & b & c \\ d & c & c \end{bmatrix}$
Special windows. There are two special windows in any pair of upper and lower configurations: $W_{i, 0}$ and $W_{i, n^{k} - 3}$. These represent the edges of the table. These windows are legal if and only if: (1) they satisfy both of the above constraints; and (2) they have the fixed $\#$ symbol on the edges. See the examples below.

$W_{i, 0} = \begin{bmatrix} \# & q_{1} & a \\ \# & a & q_{2} \end{bmatrix} \qquad W_{i, n^{k} - 3} = \begin{bmatrix} c & d & \# \\ q_{7} & d & \# \end{bmatrix}$
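To make the window indexing concrete, here is a minimal Python sketch that slides a $2 \times 3$ window across a pair of rows and applies the no-state rule. The function names, the toy state set, and the two sample rows are assumptions made for illustration. Checking windows that contain a state needs the machine's transition function, so it is left out.

```python
def windows(upper, lower):
    """Yield (j, window) for every 2x3 window over two equal-length rows."""
    assert len(upper) == len(lower)
    for j in range(len(upper) - 2):              # j = 0, ..., N - 3
        yield j, (tuple(upper[j:j + 3]), tuple(lower[j:j + 3]))


def no_state_window_is_legal(window, states):
    """Rule for windows without a state: the center column must be unchanged."""
    top, bottom = window
    assert not any(sym in states for sym in top + bottom), "window has a state"
    return top[1] == bottom[1]


STATES = {"q1", "q2"}                            # hypothetical state set
upper = ["#", "q1", "a", "b", "c", "#"]          # q1 is reading the cell to its right
lower = ["#", "a", "q2", "b", "c", "#"]          # a was rewritten as a, head moved right
all_windows = list(windows(upper, lower))
```

A row of length $N$ yields $N - 2$ windows, which matches the range $0 \leq j < n^{k} - 2$ used for $W_{i, j}$.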

By the above notion of legal windows, if all windows in the upper and lower configuration are legal, then it represents a correct transition to $ro w [i + 1]$ from $ro w [i]$ . Inductively, this means that if we start with a correct starting configuration, and every window in the table is legal, then each pair of upper and lower configurations represents a valid transition from $ro w [i]$ to $ro w [i + 1]$ , and hence the table correctly captures the computation of the decider $M$ for language $L$ .

We conclude by giving the Boolean formula for $ϕ_{move}$. To do so, we simply need to give a Boolean formula for the statement “$W_{i, j}$ is a legal window.” Define the set $S$ as follows. $S = \left\{(a_{1}, a_{2}, a_{3}, a_{4}, a_{5}, a_{6}) : a_{i} \in C \ \forall i \in [6] \ \land \ \begin{bmatrix} a_{1} & a_{2} & a_{3} \\ a_{4} & a_{5} & a_{6} \end{bmatrix} \text{ is a legal window}\right\}.$ Here, recall that $C$ is the cell alphabet of our table.

Given this set $S$, the Boolean formula for the statement “$W_{i, j}$ is a legal window” is expressed as $\bigvee_{(a_{1}, \dots, a_{6}) \in S} (x_{i, j, a_{1}} \land x_{i, j + 1, a_{2}} \land x_{i, j + 2, a_{3}} \land x_{i + 1, j, a_{4}} \land x_{i + 1, j + 1, a_{5}} \land x_{i + 1, j + 2, a_{6}})$ What is this formula saying? It says that given a tuple $(a_{1}, \dots, a_{6})$, which I know by the definition of $S$ represents some legal window, is the current window $W_{i, j}$ this legal window? That is, it is asking if $W_{i, j} \stackrel{?}{=} \begin{bmatrix} a_{1} & a_{2} & a_{3} \\ a_{4} & a_{5} & a_{6} \end{bmatrix}$ is true. We take a big OR over all legal windows to make sure that window $W_{i, j}$ is some legal window.

If this big OR is true, then we know $W_{i, j}$ is some legal window. This gives us the final expression for $ϕ_{move}$ as $ϕ_{move} = \bigwedge_{0 \leq i < n^{k} - 1,\ 0 \leq j < n^{k} - 2} \ \bigvee_{(a_{1}, \dots, a_{6}) \in S} (x_{i, j, a_{1}} \land x_{i, j + 1, a_{2}} \land x_{i, j + 2, a_{3}} \land x_{i + 1, j, a_{4}} \land x_{i + 1, j + 1, a_{5}} \land x_{i + 1, j + 2, a_{6}}).$ All together, we have that $ϕ = ϕ_{start} \land ϕ_{accept} \land ϕ_{cell} \land ϕ_{move}$ is satisfiable if and only if the NTM we are encoding in the table accepts its input $x$.
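The shape of $ϕ_{move}$ can be sketched in Python as follows. This is only an illustration under assumed conventions: a variable $x_{r, c, s}$ is represented by the triple `(r, c, s)`, the set $S$ is passed in as a list of 6-tuples, and the names `window_formula`, `phi_move`, and `window_holds` are hypothetical.

```python
def window_formula(i, j, S):
    """OR of ANDs for 'W_{i,j} is legal': one 6-variable AND per tuple in S."""
    return [
        [(i, j, a1), (i, j + 1, a2), (i, j + 2, a3),
         (i + 1, j, a4), (i + 1, j + 1, a5), (i + 1, j + 2, a6)]
        for (a1, a2, a3, a4, a5, a6) in S
    ]


def phi_move(N, S):
    """AND over all windows of an N x N table, keyed by the window index (i, j)."""
    return {(i, j): window_formula(i, j, S)
            for i in range(N - 1)        # 0 <= i < N - 1
            for j in range(N - 2)}       # 0 <= j < N - 2


def window_holds(dnf, table):
    """Evaluate one window's OR-of-ANDs against a filled table (list of rows)."""
    return any(all(table[r][c] == s for (r, c, s) in term) for term in dnf)
```

For a fixed $S$, every window gets a formula of the same constant size, and there are $(N - 1)(N - 2)$ windows in an $N \times N$ table.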

The final piece of the puzzle is arguing that we can construct $ϕ$ in polynomial time. Note that the cell alphabet $C = Γ \cup Q \cup \{\#\}$ is of constant size with respect to the input length by definition of Turing machines.

For $ϕ_{s t a r t}$ , given an input $x$ to the NTM $M$ for deciding the language $L$ , the starting configuration of the machine is fixed. Thus, the starting row of the table is fixed as well. For $n = ∣ x ∣$ , the starting row of the table contains $n^{k}$ cells, which corresponds to $n^{k}$ literals in $ϕ_{s t a r t}$ . This can clearly be constructed in $O (n^{k})$ time.
For $ϕ_{a cce pt}$ , recall that we are simply scanning the entire $n^{k} \times n^{k}$ table for an accepting state. The table has total size $n^{2 k}$ , so this formula clearly has size $n^{2 k}$ and can be constructed in $O (n^{2 k})$ time.
For $ϕ_{ce ll}$ , it is a big AND of $n^{2 k}$ pairs $i, j$ . Within this big AND, we have two constant sized subformulas. First, the formula checking that cell $i, j$ contains a valid symbol $s \in C$ . Since $∣ C ∣$ is constant, the size of this formula is constant. Then this subformula is AND’d with a big AND of an OR which checks that cell $i, j$ doesn’t contain both symbol $s \in C$ and $t \in C$ . Again, since $∣ C ∣$ is constant, this subformula is constant. So the total size of $ϕ_{ce ll}$ is $O (n^{2 k})$ and can be constructed in this much time as well.
Similarly, for $ϕ_{move}$, the size of the set $S$ is at most $∣ C ∣^{6}$ (each of the six entries of a window is some symbol of $C$, and symbols may repeat), which is constant since $∣ C ∣$ and $6$ are constants. So the inner formula has constant size, whereas the whole formula $ϕ_{move}$ is a big AND over pairs $i, j$, each ranging from $0$ to at most $n^{k} - 1$. So $ϕ_{move}$ has size $O (n^{2 k})$ and can be constructed in this much time.
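As a rough sanity check on these size bounds, the following sketch counts variable occurrences in each subformula for an $N \times N$ table with $N = n^{k}$ and a cell alphabet of size $c$. The function name is hypothetical. The count for $ϕ_{accept}$ assumes a single accepting state, the count for $ϕ_{cell}$ counts each unordered pair $s \neq t$ once, and the count for $ϕ_{move}$ uses the crude bound $∣ S ∣ \leq c^{6}$.

```python
from math import comb


def formula_sizes(N, c):
    """Upper bounds on the number of variable occurrences in each subformula."""
    return {
        "start": N,                                  # one variable per cell of row 0
        "accept": N * N,                             # one variable per cell (assumed single accept state)
        "cell": N * N * (c + 2 * comb(c, 2)),        # one big OR plus the pairwise clauses
        "move": (N - 1) * (N - 2) * 6 * c ** 6,      # windows * 6 variables * |S| <= c^6
    }


sizes = formula_sizes(4, 3)
```

Every entry is at most a constant (depending only on $c$) times $N^{2} = n^{2k}$.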

This completes the proof of the Cook-Levin theorem. $□$

Other NP-Complete Problems

SAT is a step-up from the (useless) $NP$ -Complete problem TMSAT. However, a general Boolean formula (like those given in SAT) may be difficult to handle when trying to understand specific $NP$ problems. Thus, we turn our attention to the wide variety of other $NP$ -complete problems.

First, we show that given any $NP$ -complete problem/language, if we want to show some other language is $NP$ -complete, we only need to reduce our known $NP$ -complete language to our new language.

Theorem 5.1. If $A$ is an $NP$ -complete language, and $B \in NP$ such that $A \leq_{p} B$ , then $B$ is $NP$ -complete.

Proof. Recall the transitive property of polynomial-time reducible languages. Let $L_{1}, L_{2}, L_{3}$ be languages such that $L_{1} \leq_{p} L_{2}$ and $L_{2} \leq_{p} L_{3}$ . Then we know that $L_{1} \leq_{p} L_{3}$ .

By our assumption, $A$ is $NP$-complete. This means that $A \in NP$ and $L \leq_{p} A$ for all $L \in NP$. By our other assumption, we know that $A \leq_{p} B$. By the transitive property above, we now know that $L \leq_{p} B$ for any $L \in NP$, so $B$ is $NP$-hard. Since $B \in NP$ by assumption, $B$ is $NP$-complete. $□$
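Transitivity is just function composition: if $f$ reduces $L_{1}$ to $L_{2}$ and $g$ reduces $L_{2}$ to $L_{3}$, then $x \mapsto g(f(x))$ reduces $L_{1}$ to $L_{3}$, and a polynomial of a polynomial is still a polynomial. The sketch below illustrates this with toy languages over the integers. These languages and maps are made up for demonstration and have nothing to do with $NP$-hardness.

```python
def compose(f, g):
    """If f reduces L1 to L2 and g reduces L2 to L3, x -> g(f(x)) reduces L1 to L3."""
    return lambda x: g(f(x))


# Toy languages over the integers (hypothetical, for illustration only).
in_L1 = lambda x: x % 2 == 0        # even numbers
in_L2 = lambda y: y % 4 == 0        # multiples of 4
in_L3 = lambda z: z % 4 == 1        # numbers congruent to 1 mod 4

f = lambda x: 2 * x                 # x in L1  <=>  f(x) in L2
g = lambda y: y + 1                 # y in L2  <=>  g(y) in L3
h = compose(f, g)                   # x in L1  <=>  h(x) in L3
```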

Now, rather than having to do a complete Cook-Levin Theorem style proof for new languages we want to show are $NP$ -complete, it suffices to just reduce from a language we know is $NP$ -complete!

3SAT

We turn to our next (and possibly favorite) $NP$ -complete language: 3SAT. First, we need to set up some terminology.

Let $ϕ$ be a Boolean formula. We say that $ϕ$ is in conjunctive normal form (or $ϕ$ is a CNF formula) if $ϕ = ϕ_{1} \land ϕ_{2} \land \dots \land ϕ_{n}$ such that $ϕ_{i}$ only contains ORs of literals/variables (and their negations). We call each $ϕ_{i}$ a clause of $ϕ$ . One example of a CNF formula $ϕ$ with $3$ clauses is given below.

$ϕ = (x_{1} \lor \overline{x}_{2}) \land x_{3} \land (\overline{x}_{1} \lor x_{4} \lor x_{5} \lor x_{6})$

We say that $ϕ$ is a $k$ -CNF formula if each clause $ϕ_{i}$ contains exactly $k$ literals. An example of a $2$ -CNF formula with $3$ clauses is given below.

$ϕ = (x_{1} \lor x_{2}) \land (\overline{x}_{3} \lor x_{4}) \land (\overline{x}_{2} \lor x_{3})$

Definition (3SAT). The language $3SAT$ is the set of all satisfiable $3$-CNF formulas. That is, $3SAT = \{⟨ ϕ ⟩ : ϕ \text{ is a 3-CNF and is satisfiable}\}.$
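One possible way to represent CNF formulas in code is sketched below. The encoding is an assumption made for illustration: a literal is a nonzero integer ($+v$ for $x_{v}$, $-v$ for $\overline{x}_{v}$), a clause is a list of literals, and a formula is a list of clauses. The satisfiability check is plain brute force over all assignments, so it takes exponential time and is only meant for tiny examples.

```python
from itertools import product


def is_k_cnf(cnf, k):
    """True if every clause has exactly k literals."""
    return all(len(clause) == k for clause in cnf)


def evaluate(cnf, assignment):
    """assignment maps a variable index to True/False."""
    return all(any(assignment[abs(lit)] == (lit > 0) for lit in clause)
               for clause in cnf)


def brute_force_sat(cnf):
    """Return a satisfying assignment, or None if there is none."""
    variables = sorted({abs(lit) for clause in cnf for lit in clause})
    for bits in product([False, True], repeat=len(variables)):
        assignment = dict(zip(variables, bits))
        if evaluate(cnf, assignment):
            return assignment
    return None


two_cnf = [[1, 2], [-3, 4], [-2, 3]]     # the 2-CNF example above
unsat = [[1, 1, 1], [-1, -1, -1]]        # x1 AND (NOT x1), each padded to 3 literals
```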

Complexity theorists prefer $3SAT$ over other NP-complete languages since it is simple, has very little combinatorial structure, and occurs in many different contexts such as constraint satisfaction problems.

The other part of the Cook-Levin Theorem (that I hid from you earlier) is that $3 S A T$ is NP-complete.

Theorem (Cook-Levin, Part 2). $3 S A T$ is $NP$ -complete.

Proof. $3 S A T \in NP$ is immediate. What remains to be shown is that $3 S A T$ is NP-hard. We could show that $S A T \leq_{p} 3 S A T$ using our above theorem, but it is actually simpler just to modify the proof of the Cook-Levin theorem directly to give us a 3-CNF formula.

Recall $ϕ = ϕ_{s t a r t} \land ϕ_{a cce pt} \land ϕ_{ce ll} \land ϕ_{m o v e}$ from the proof of the Cook-Levin theorem. First, we will change $ϕ$ slightly so that it is a CNF formula (we are almost there already). Once we have put $ϕ$ in CNF form, we will then transform it into a $3$ -CNF formula.

Since $ϕ$ is the AND of $4$ subformulas, we just need to make sure that each of these subformulas is itself a CNF formula, that is, an AND of clauses where each clause is an OR of 1 or more literals. First consider $ϕ_{start}$. Recall that it was simply ANDing $n^{k}$ literals together: $ϕ_{start} = x_{0, 0, \#} \land x_{0, 1, q_{0}} \land x_{0, 2, u_{1}} \land \dots \land x_{0, n + 1, u_{n}} \land x_{0, n + 2, □} \land \dots \land x_{0, n^{k} - 2, □} \land x_{0, n^{k} - 1, \#}.$ So $ϕ_{start}$ is already a CNF formula with $n^{k}$ clauses of single literals (there are no ORs).

Now consider $ϕ_{a cce pt}$ . Remember that this is simply a big OR over the entire table, checking if there is at least one accepting state. So $ϕ_{a cce pt}$ will be a single clause of our CNF formula.

Now consider $ϕ_{cell}$. From before, this is given by $ϕ_{cell} = \bigwedge_{0 \leq i, j \leq n^{k} - 1} \left[ \left( \bigvee_{s \in C} x_{i, j, s} \right) \land \bigwedge_{s, t \in C,\ s \neq t} (\overline{x}_{i, j, s} \lor \overline{x}_{i, j, t}) \right].$ Notice that $ϕ_{cell}$ is already in CNF form. The big OR over $s \in C$ is a single clause, which gets AND’d with the formula $\bigwedge_{s, t \in C,\ s \neq t} (\overline{x}_{i, j, s} \lor \overline{x}_{i, j, t})$, which is itself a CNF formula. Then all of these formulas are AND’d together, meaning the final formula $ϕ_{cell}$ is in CNF form.
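The clauses contributed by a single cell can be listed explicitly. The sketch below is an illustration under assumed conventions: a literal is a pair `(sign, (i, j, s))` where `sign` is `False` for a negated variable, and the name `cell_clauses` is hypothetical. It emits one clause per unordered pair $\{s, t\}$, which is equivalent to ranging over all ordered pairs $s \neq t$ since the two orderings give the same clause.

```python
from itertools import combinations


def cell_clauses(i, j, alphabet):
    """CNF clauses saying that cell (i, j) holds exactly one symbol."""
    at_least_one = [(True, (i, j, s)) for s in alphabet]           # the big OR over s
    at_most_one = [[(False, (i, j, s)), (False, (i, j, t))]        # NOT x_s OR NOT x_t
                   for s, t in combinations(alphabet, 2)]
    return [at_least_one] + at_most_one


clauses = cell_clauses(0, 0, ["a", "b", "#"])
```

Each element of the returned list is already a single OR of literals, so no conversion is needed.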

Finally, $ϕ_{move}$ is a big AND of a big OR of a constant number of ANDs (each AND joins 6 variables). Using Boolean equivalences (distributing OR over AND), we can convert the inner formula $\bigvee_{(a_{1}, \dots, a_{6}) \in S} (a_{1} \land a_{2} \land \dots \land a_{6})$ ¹ into a new formula which is a big AND of some (again constant) number of ORs. Since the inner formula has constant size, the converted formula also has constant size (independent of $n$), so $ϕ_{move}$ is still of polynomial size. All together, this transforms $ϕ_{move}$ into CNF form in polynomial time.
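One standard Boolean equivalence that achieves this conversion is distributing OR over AND: every way of picking one variable from each AND-term gives one clause of the CNF. The sketch below illustrates it on a tiny OR-of-ANDs. It assumes all literals are positive variables (as in $ϕ_{move}$), and the function names are hypothetical. With $m$ terms of 6 variables each this yields at most $6^{m}$ clauses, which is large but constant when $m = ∣ S ∣$ is constant.

```python
from itertools import product


def dnf_to_cnf(dnf):
    """Distribute OR over AND: choose one variable per term to form each clause."""
    return {frozenset(choice) for choice in product(*dnf)}


def eval_dnf(dnf, true_vars):
    return any(all(v in true_vars for v in term) for term in dnf)


def eval_cnf(cnf, true_vars):
    return all(any(v in true_vars for v in clause) for clause in cnf)


dnf = [["a", "b"], ["c", "d"]]      # (a AND b) OR (c AND d)
cnf = dnf_to_cnf(dnf)               # (a OR c) AND (a OR d) AND (b OR c) AND (b OR d)
```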

This together establishes that we can convert our original formula $ϕ = ϕ_{s t a r t} \land ϕ_{a cce pt} \land ϕ_{ce ll} \land ϕ_{m o v e}$ into an equivalent CNF formula, say $ϕ^{'} = ϕ_{1} \land \dots \land ϕ_{m}$ for some $m$ such that each $ϕ_{i}$ is the OR of one or more literals. We now convert $ϕ^{'}$ into a $3$ -CNF. This can be done as follows. For any $i \in [m]$ , consider the clause $ϕ_{i}$ .

If $ϕ_{i}$ has 3 literals, we are done and can move on to the next clause.
If $ϕ_{i}$ has fewer than 3 literals, we transform it into an equivalent formula $ϕ_{i}^{'}$ with exactly 3 literals. For example, if $ϕ_{i}$ has one literal, say $x_{j}$, we simply write $ϕ_{i}^{'} = (x_{j} \lor x_{j} \lor x_{j})$. If $ϕ_{i}$ has two literals, say $x_{j_{1}}$ and $x_{j_{2}}$, we pick one of the literals arbitrarily (e.g., always pick the first one) and repeat it, giving $ϕ_{i}^{'} = (x_{j_{1}} \lor x_{j_{1}} \lor x_{j_{2}})$. Clearly $ϕ_{i}$ and $ϕ_{i}^{'}$ are equivalent.
If $ϕ_{i}$ has more than 3 literals, we will split $ϕ_{i}$ into a $3$-CNF formula using extra variables. For example, if $ϕ_{i} = x_{1} \lor x_{2} \lor x_{3} \lor x_{4}$, we introduce the variable $z$ and convert $ϕ_{i}$ to the formula $ϕ_{i}^{'} = (x_{1} \lor x_{2} \lor z) \land (\overline{z} \lor x_{3} \lor x_{4}).$ This conversion has the property that if $ϕ_{i}$ has a satisfying assignment, then there exists an assignment to the variable $z$ such that $ϕ_{i}^{'}$ is also satisfied by $z$ plus the assignment for $ϕ_{i}$. Conversely, any satisfying assignment for $ϕ_{i}^{'}$ also satisfies $ϕ_{i}$ once we ignore $z$: if $z = 0$ the first clause forces $x_{1}$ or $x_{2}$ to be $1$, and if $z = 1$ the second clause forces $x_{3}$ or $x_{4}$ to be $1$. In our above example, the vector $(x_{1}, x_{2}, x_{3}, x_{4}) = (1, 0, 0, 0)$ is a satisfying assignment for $ϕ_{i}$, and so a satisfying assignment for $ϕ_{i}^{'}$ would be $(x_{1}, x_{2}, x_{3}, x_{4}, z) = (1, 0, 0, 0, 0)$ (set $\overline{z} = 1$).

In general, if $ϕ_{i}$ has $ℓ \geq 4$ literals, we introduce $ℓ - 3$ new variables $z_{1}, \dots, z_{ℓ - 3}$ and transform $ϕ_{i}$ into a 3CNF formula with $ℓ - 2$ clauses. If $ϕ_{i}$ has literals $a_{1}, \dots, a_{ℓ}$, then we construct $ϕ_{i}^{'}$ as $ϕ_{i}^{'} = (a_{1} \lor a_{2} \lor z_{1}) \land (\overline{z}_{1} \lor a_{3} \lor z_{2}) \land (\overline{z}_{2} \lor a_{4} \lor z_{3}) \land \dots \land (\overline{z}_{ℓ - 3} \lor a_{ℓ - 1} \lor a_{ℓ}).$ Clearly, $ϕ_{i}^{'}$ can be constructed in polynomial time from $ϕ_{i}$.
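The three cases can be sketched as one Python function. This is a minimal illustration under assumed conventions: literals are nonzero integers ($-v$ is the negation of $x_{v}$), and `fresh` is an iterator that hands out unused variable indices for the new $z$ variables. Both the names and the encoding are assumptions.

```python
from itertools import count, product


def to_3cnf_clause(clause, fresh):
    """Rewrite one clause as a list of clauses with exactly 3 literals each."""
    n_lits = len(clause)
    if n_lits == 1:
        return [[clause[0]] * 3]
    if n_lits == 2:
        return [[clause[0], clause[0], clause[1]]]
    if n_lits == 3:
        return [list(clause)]
    z = [next(fresh) for _ in range(n_lits - 3)]           # z_1, ..., z_{l-3}
    out = [[clause[0], clause[1], z[0]]]                    # (a_1 v a_2 v z_1)
    for t in range(1, n_lits - 3):
        out.append([-z[t - 1], clause[t + 1], z[t]])        # (~z_t v a_{t+2} v z_{t+1})
    out.append([-z[-1], clause[-2], clause[-1]])            # (~z_{l-3} v a_{l-1} v a_l)
    return out


def holds(cnf, assignment):
    """assignment maps a variable index to True/False."""
    return all(any(assignment[abs(lit)] == (lit > 0) for lit in c) for c in cnf)


split = to_3cnf_clause([1, 2, 3, 4], iter([5]))             # uses z = x_5
longer = to_3cnf_clause([1, 2, 3, 4, 5, 6], count(7))       # fresh variables 7, 8, 9
```

For the 4-literal example, `split` is `[[1, 2, 5], [-5, 3, 4]]`, which matches the chain construction with $ℓ = 4$.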

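All together, this gives us a 3CNF formula that is satisfiable if and only if the formula $ϕ$ we constructed in the Cook-Levin Theorem is satisfiable. (The two formulas are not logically equivalent, since the new one uses extra variables, but preserving satisfiability is all a reduction needs.) $□$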

Independent Set

The independent set problem on an undirected graph $G = (V, E)$ asks if there exists a set of nodes/vertices $S \subseteq V$ of size at least $k$ such that they are pairwise disconnected. That is, for every pair of distinct $u, v \in S$, $(u, v) \notin E$. As a set, this is written as $INDSET_{k} = \{⟨ G = (V, E), k ⟩ : \exists S \subseteq V \text{ s.t. } ∣ S ∣ \geq k \land (u, v) \notin E \ \forall u, v \in S\}.$

Theorem. $I N D SE T_{k}$ is NP-complete.

Proof. Clearly $INDSET_{k} \in NP$. To see this, one can simply specify a set $S$ of size at least $k$ as the certificate. Then, verification that $S$ is an independent set takes at most $O (n^{2})$ time per pair of $u, v \in S$ since in the worst case you must scan the entire set $E$ per check, and $∣ E ∣ \leq n^{2}$. There are at most $n^{2}$ pairs to check, so the total time is at most $O (n^{4})$ in the worst case, where $n = ∣ V ∣$. This is polynomial in the input size.
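A certificate check along these lines could look as follows. This is a sketch, not a tuned implementation: the name `verify_independent_set` and the edge-list input format are assumptions, and it stores the edges in a hash set instead of rescanning $E$ for every pair, which only makes the check faster than the bound above.

```python
def verify_independent_set(vertices, edges, S, k):
    """Accept iff S is a set of at least k vertices of G with no edge inside S."""
    S = set(S)
    if len(S) < k or not S <= set(vertices):
        return False
    edge_set = {frozenset(e) for e in edges}          # undirected edges
    return all(frozenset((u, v)) not in edge_set
               for u in S for v in S if u != v)


# Path graph 1 - 2 - 3 - 4.
V = [1, 2, 3, 4]
E = [(1, 2), (2, 3), (3, 4)]
```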

Now we show that $INDSET_{k}$ is NP-hard. We do this by giving a polynomial-time reduction from $3SAT$. That is, we show $3SAT \leq_{p} INDSET_{k}$.

Suppose that $ϕ$ is a 3CNF formula with $k$ clauses: $ϕ = ϕ_{1} \land ϕ_{2} \land \dots \land ϕ_{k}$. Assume that $ϕ$ has $ℓ$ variables $x_{1}, \dots, x_{ℓ}$ (each variable and its negation is a literal). For each clause $ϕ_{i}$, write $ϕ_{i} = (a_{i} \lor b_{i} \lor c_{i})$, where $a_{i}, b_{i}, c_{i}$ are the literals of clause $ϕ_{i}$. (For example, if $ϕ_{i} = (x_{2} \lor \overline{x}_{7} \lor x_{3})$, then $a_{i} = x_{2}$, $b_{i} = \overline{x}_{7}$, and $c_{i} = x_{3}$.)

For each clause $ϕ_{i}$ , we create a cluster of $3$ nodes/vertices in a graph $G$ . Label each vertex in this cluster with $a_{i}$ , $b_{i}$ , $c_{i}$ . This gives us $k$ clusters of $3$ nodes each, where each cluster of $3$ nodes is associated with $ϕ_{i}$ and labeled $a_{i}, b_{i}, c_{i}$ .

Now we connect nodes in this graph with $3 k$ vertices. First, create a triangle in each cluster. That is, for each $i$, connect $(a_{i}, b_{i})$, $(b_{i}, c_{i})$, and $(c_{i}, a_{i})$ (note the graph is undirected so $(b_{i}, a_{i})$, $(c_{i}, b_{i})$ and $(a_{i}, c_{i})$ are also edges). Next, we connect every pair of nodes whose labels are a literal and its negation. For example, if $a_{1} = x_{3}$ and $b_{7} = \overline{x}_{3}$, then we add the edge $(a_{1}, b_{7})$ to the graph. The graph has $3 k$ vertices and at most $(3 k)^{2}$ edges, so it can be built in time polynomial in the size of $ϕ$. We claim that the given 3CNF $ϕ$ is satisfiable if and only if our constructed graph $G$ above has an independent set of size $k$.
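The construction can be sketched in a few lines of Python. The encoding is an assumption made for illustration: literals are nonzero integers ($-v$ is the negation of $x_{v}$), and the vertex for position $p$ of clause $i$ is the pair `(i, p)`, so repeated literals inside a clause still get distinct vertices. The name `build_graph` is hypothetical.

```python
from itertools import combinations


def build_graph(cnf):
    """Map a 3CNF formula to (vertices, edges, k) for the independent set instance."""
    vertices = [(i, p) for i, clause in enumerate(cnf) for p in range(len(clause))]
    edges = set()
    for u, v in combinations(vertices, 2):
        same_cluster = u[0] == v[0]                          # triangle inside a clause
        complementary = cnf[u[0]][u[1]] == -cnf[v[0]][v[1]]  # literal vs. its negation
        if same_cluster or complementary:
            edges.add(frozenset((u, v)))
    return vertices, edges, len(cnf)


# (x1 v ~x2 v x3) ^ (~x1 v x2 v x3)
vertices, edges, k = build_graph([[1, -2, 3], [-1, 2, 3]])
```

The double loop over vertex pairs makes the quadratic running time of the reduction visible.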

First, suppose that $ϕ$ has a satisfying assignment. Let $u \in \{0, 1\}^{ℓ}$ be the satisfying assignment. That is, $x_{i} = u_{i}$ is our satisfying assignment for $i \in [ℓ]$. From $u$, we build a $k$-independent set in the graph $G$. Since $ϕ$ is satisfied, the assignment $u$ satisfies every clause $ϕ_{i}$ of $ϕ$. Since every clause $ϕ_{i}$ is satisfied, at least one of its literals $a_{i}$, $b_{i}$, or $c_{i}$ is equal to $1$. For each clause $ϕ_{i}$, choose exactly one of its satisfied literals and add it to the set $S$. For example, if $a_{i} = 1$, $b_{i} = 1$, and $c_{i} = 0$ under assignment $u$, then add $a_{i}$ or $b_{i}$ to $S$ (but not both; simply choose one of them).

We claim the set $S$ constructed in this manner is an independent set of size $k$. First, suppose that a literal $a_{j} \in S$ for some $j$ (e.g., $a_{1}$). By construction of the graph $G$, we know that $a_{j}$ is connected to both $b_{j}$ and $c_{j}$. But by our selection of the set $S$, we only choose a single node from each cluster to add to the set. So $b_{j}, c_{j} \notin S$. Now what if literals $c_{i}, a_{j} \in S$ for $i \neq j$? By construction of $G$, we know that $a_{j}$ and $c_{i}$ are connected if and only if $a_{j}$ is the negation of the literal represented by $c_{i}$. For example, $a_{j} = \overline{x}_{3}$ and $c_{i} = x_{3}$ would be connected in the graph. However, by assumption, the assignment $u$ is a satisfying assignment, and every literal we placed in $S$ equals $1$ under $u$. If $a_{j}$ were the negation of $c_{i}$, then $a_{j} = 1$ would force $c_{i} = 0$, so they could not both be in $S$. So every pair of distinct elements $s, s^{'} \in S$ satisfies the property that $(s, s^{'}) \notin E$. Since $S$ contains exactly one node from each of the $k$ clusters, $S$ is an independent set of size $k$.

Now suppose that our constructed graph $G$ has an independent set of size $k$. We reconstruct a satisfying assignment for the formula $ϕ$. Let $S$ be the $k$-independent set. Then for every pair of distinct $s, s^{'} \in S$, we know that $(s, s^{'}) \notin E$. We construct our satisfying assignment directly from the set $S$. Suppose $s \in S$. Then set literal $s$ in $ϕ$ to be $1$. For example if $s = a_{j} = \overline{x}_{3}$, then we set $a_{j} = \overline{x}_{3} = 1$ in the satisfying assignment (that is, $x_{3} = 0$). Any variable that is not fixed this way can be set arbitrarily (say, to $0$). We claim this is a satisfying assignment. This is by construction of the graph $G$.

Suppose $s \in S$ and $s$ is in cluster $i$ . Then we know $s$ is connected to every other node in its cluster ( $a_{i}, b_{i}, c_{i}$ are all connected in cluster $i$ ). So we know that the other nodes in this cluster are not in $S$ (otherwise it would not be an independent set).
Let $s, s^{'} \in S$ such that $s$ is in cluster $i$ and $s^{'}$ is in cluster $j$ . We know that $s, s^{'}$ are not connected. This implies that $s$ and $s^{'}$ are not a literal and their negation (e.g., $s = x_{2}$ and $s^{'} = \overline{x}_{2}$ is not possible). This means that we don’t obtain an assignment which sets both a literal and its negation to $1$ .

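Since $∣ S ∣ = k$ and each of the $k$ clusters contributes at most one node to $S$, every cluster contributes exactly one node, so every clause $ϕ_{i}$ contains a literal that is set to $1$. Thus, our constructed assignment is well defined and satisfying. This completes the proof. $□$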
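Both directions of the argument can be traced on a small formula. The sketch below is illustrative only: literals are nonzero integers ($-v$ is the negation of $x_{v}$), a node is a pair `(i, p)` meaning position `p` of clause `i`, and all function names are hypothetical. Independence is checked directly from the edge rule (same cluster, or complementary labels) without materializing the graph.

```python
from itertools import combinations


def holds(cnf, assignment):
    """assignment maps a variable index to True/False."""
    return all(any(assignment[abs(lit)] == (lit > 0) for lit in c) for c in cnf)


def set_from_assignment(cnf, assignment):
    """Forward direction: pick one satisfied literal per clause (assumes cnf is satisfied)."""
    return [(i, next(p for p, lit in enumerate(clause)
                     if assignment[abs(lit)] == (lit > 0)))
            for i, clause in enumerate(cnf)]


def is_independent(cnf, S):
    """No two chosen nodes share a cluster or carry complementary literals."""
    return all(u[0] != v[0] and cnf[u[0]][u[1]] != -cnf[v[0]][v[1]]
               for u, v in combinations(S, 2))


def assignment_from_set(cnf, S, num_vars):
    """Backward direction: set each chosen literal to 1; other variables default to False."""
    assignment = {v: False for v in range(1, num_vars + 1)}
    for (i, p) in S:
        lit = cnf[i][p]
        assignment[abs(lit)] = lit > 0
    return assignment


cnf = [[1, -2, 3], [-1, 2, 3], [-1, -2, -3]]
u = {1: True, 2: True, 3: False}             # a satisfying assignment
S = set_from_assignment(cnf, u)              # one node per clause
recovered = assignment_from_set(cnf, S, 3)   # assignment rebuilt from S
```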

¹ Please see the actual definition of $ϕ_{move}$ for the correct formula here. I am just using a shorthand for demonstration.

CS 505 - Computability and Complexity Theory (Spring 2025)
