P, NP and NP-Complete Reductions

In this section, we change the focus from “Is a decision problem decidable/solvable?” to “If a decision problem is decidable, is it efficiently decidable/solvable?” By efficiently decidable or solvable, we mean that there exists an executable program (i.e., an implementable Turing machine) which decides the decision problem in polynomial time. (Don’t worry if you’re rusty on your Big O notation.)

The classes of P and NP

If we restrict ourselves to the set of decidable problems, then a natural question that arises is “How efficiently can we decide a decision problem?” Two major classes of decidable decision problems are: 1. Decision problems which can be decided in polynomial time using a deterministic Turing machine, also known as P, and 2. Decision problems which can be decided in polynomial time using a non-deterministic Turing machine, also known as NP.

If you recall from the previous sections on NFA and (N)PDA, non-determinism is a nifty feature that we sometimes give to computational models to allow them to guess-and-verify. This nifty feature comes in handy when you have a “difficult” decision problem that you want to decide in polynomial time. For example, consider the decision problem of determining whether a graph \(G\) has a simple path of length \(k\) (that is, a path with \(k\) edges and no repeated vertices). One very slow - in fact exponential time in \(k\) - way of deciding this is to list all possible sequences of \(k + 1\) vertices in \(G\) and check whether any of them forms a simple path in \(G\). Another much faster - in fact polynomial time - approach would be to guess the path of length \(k\) and then check whether it’s indeed in the graph \(G\). The reason this solution runs in polynomial time is that the guess-and-verify feature allows the procedure to guess correctly whenever a correct guess exists, so the only real work left is the verification. While the guess-and-verify feature is great for theoretical polynomial-time procedures, it is not an implementable procedure - there’s no gadget or gizmo that lets you implement the step “Guess correctly” in an algorithm.¹
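To make the contrast concrete, here is a minimal Python sketch, assuming the path must be simple (no repeated vertices) and that the graph is stored as a dictionary of adjacency sets. The names `is_simple_path` and `has_simple_path_brute_force`, and the example graph, are hypothetical and chosen only for illustration. The first function is the "verify" half, which runs in polynomial time. The second replaces the "guess" with an exhaustive search, which is where the blow-up comes from.

```python
from itertools import permutations


def is_simple_path(graph, candidate, k):
    """Verify a guessed path: k edges, no repeated vertex, every step is an edge."""
    if len(candidate) != k + 1 or len(set(candidate)) != k + 1:
        return False
    if any(v not in graph for v in candidate):
        return False
    return all(b in graph[a] for a, b in zip(candidate, candidate[1:]))


def has_simple_path_brute_force(graph, k):
    """Deterministic stand-in for the guess: try every ordered choice of k + 1 vertices."""
    return any(is_simple_path(graph, p, k) for p in permutations(graph, k + 1))


# Hypothetical example graph: the path a - b - c - d, as adjacency sets.
graph = {"a": {"b"}, "b": {"a", "c"}, "c": {"b", "d"}, "d": {"c"}}

print(is_simple_path(graph, ["a", "b", "c", "d"], 3))  # checking one guess is cheap
print(has_simple_path_brute_force(graph, 3))           # trying all guesses is not
```

The verifier looks at each vertex and each consecutive pair once, whereas the brute-force search may call it on the order of \(|V|^{k+1}\) times.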

As a result, when we ask the question “If a decision problem is decidable, is it efficiently decidable/solvable?”, the question we are really asking is “Is the decision problem in P?” If the decision problem is in P, then we at least have a solution that does not blow up exponentially in time. If, instead, the answer is something like “It’s certainly in NP, but there’s no obvious way to show that it’s in P.”², then it’s unlikely that we can avoid a solution that blows up exponentially in time.

All of this is discussed in the next video!

A concrete example showing the difference between P and NP

In the next video, I give a more concrete example based on the Post correspondence problem to illustrate the difference between P and NP.

NP-Hardness and NP-Completeness

Now that you’re hopefully a bit more comfortable with the notions of P and NP, we are ready to discuss the notions of NP-Hardness and NP-Completeness. A problem Q is NP-Hard if it is at least as hard as every problem in NP (i.e., every problem in NP reduces to Q in polynomial time). A problem is NP-Complete if 1. It is NP-Hard and 2. It is in NP. Thus, if a problem Q is shown to be NP-Complete, the answer to the question “Is Q an efficiently decidable/solvable problem?” is most likely “No!”
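For readers who prefer symbols, these definitions can be written as follows. The notation \(A \le_p B\) for “\(A\) reduces to \(B\) in polynomial time” is an assumption of this sketch, not notation introduced in this section.

\[
A \le_p B \iff \text{there is a polynomial-time computable } f \text{ such that for all inputs } x:\; x \in A \Leftrightarrow f(x) \in B
\]

\[
Q \text{ is NP-Hard} \iff A \le_p Q \text{ for every } A \in \mathrm{NP}
\]

\[
Q \text{ is NP-Complete} \iff Q \in \mathrm{NP} \text{ and } Q \text{ is NP-Hard}
\]

Because \(\le_p\) is transitive (composing two polynomial-time maps gives a polynomial-time map), showing \(R \le_p Q\) for a single NP-Hard problem \(R\) is already enough to conclude that \(Q\) is NP-Hard.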

Showing that a problem is NP-Complete via reductions

Now that we’re aware of the definitions of NP-Hardness and NP-Completeness, we can start showing that a decision problem Q is NP-Complete. There are two common ways of establishing the NP-Hardness part: 1. Showing that an arbitrary problem in NP reduces to the decision problem Q in polynomial time and 2. Reducing a known NP-Complete problem to the decision problem Q in polynomial time. Approach 1 is difficult because we cannot assume anything about the structure of the problem we are reducing from. Approach 2 is more common and will be the one I present in the next video. In either case, we must also show that Q itself is in NP.
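As an illustration of what Approach 2 looks like in practice, here is a small Python sketch of one textbook reduction, from Independent-Set (are there \(k\) vertices with no edges between them?) to Clique (are there \(k\) vertices with every edge between them?). It is not necessarily the reduction shown in the video. The idea is to map \((G, k)\) to \((\overline{G}, k)\), where \(\overline{G}\) is the complement graph. If Independent-Set is already known to be NP-Hard, this map shows that Clique is NP-Hard too. All function names and the example instance are hypothetical, and the two brute-force solvers are exponential; they exist only to sanity-check the reduction on tiny inputs.

```python
from itertools import combinations


def complement(vertices, edges):
    """Edges of the complement graph: exactly the vertex pairs that are NOT edges of G."""
    present = {frozenset(e) for e in edges}
    return [(u, v) for u, v in combinations(vertices, 2)
            if frozenset((u, v)) not in present]


def reduce_independent_set_to_clique(vertices, edges, k):
    """Polynomial-time map from an Independent-Set instance to a Clique instance."""
    return vertices, complement(vertices, edges), k


def has_clique(vertices, edges, k):
    """Brute force over all k-subsets (exponential; for checking tiny instances only)."""
    present = {frozenset(e) for e in edges}
    return any(all(frozenset(pair) in present for pair in combinations(group, 2))
               for group in combinations(vertices, k))


def has_independent_set(vertices, edges, k):
    """Brute force over all k-subsets (exponential; for checking tiny instances only)."""
    present = {frozenset(e) for e in edges}
    return any(all(frozenset(pair) not in present for pair in combinations(group, 2))
               for group in combinations(vertices, k))


# Hypothetical instance: the 4-cycle 1 - 2 - 3 - 4 - 1.
vertices = [1, 2, 3, 4]
edges = [(1, 2), (2, 3), (3, 4), (4, 1)]

for k in range(1, 5):
    # A correct reduction preserves the yes/no answer for every instance.
    assert has_independent_set(vertices, edges, k) == \
        has_clique(*reduce_independent_set_to_clique(vertices, edges, k))
```

Note the direction: the known hard problem is the input to the reduction, and the problem we want to prove hard is the output. The reduction itself never solves either problem; it only rewrites the instance.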

Exercises

Exercise. Show that the \(\texttt{Longest-Path}\) problem is NP-Complete by reducing the \(\texttt{Hamiltonian-Cycle}\) problem to it.

\(\texttt{Longest-Path}\): Given a graph \(G = (V, E)\) and a positive integer \(k\), does \(G\) have a simple path of at least \(k\) edges? (A simple path has no repeating vertices.)
\(\texttt{Hamiltonian-Cycle}\): Given a graph \(G = (V, E)\) and a vertex \(u \in V\), does \(G\) contain a Hamiltonian cycle that starts and ends at \(u\)? (A Hamiltonian cycle visits every vertex exactly once before returning to its starting vertex.)

Exercise. Show that the \(\texttt{Independent-Set}\) problem is NP-Complete by reducing the \(\texttt{3-SAT}\) problem to it.

\(\texttt{Independent-Set}\): Given a graph \(G = (V, E)\) and a positive integer \(k\), does \(G\) have an independent set of \(k\) vertices? (An independent set is a subset \(V' \subseteq V\) such that no pair of vertices in \(V'\) is connected by an edge.)

Exercise. Show that the \(\texttt{Independent-Set}\) problem is NP-Complete by reducing the \(\texttt{Vertex-Cover}\) problem to it.

\(\texttt{Vertex-Cover}\): Given a graph \(G = (V, E)\) and a positive integer \(k\), does \(G\) have a vertex cover of \(k\) vertices? (A vertex cover is a subset \(V' \subseteq V\) such that every edge of \(G\) has at least one endpoint in \(V'\).)

Exercise. Show that the \(\texttt{Clique}\) problem is NP-Complete by reducing the \(\texttt{Vertex-Cover}\) problem to it.

\(\texttt{Clique}\): Given a graph \(G = (V, E)\) and a positive integer \(k\), does \(G\) have a clique of \(k\) vertices? (A clique is a subset \(V' \subseteq V\) such that every pair of vertices in \(V'\) is connected by an edge.)
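Each exercise has two halves: the reduction (left to you) and the usually easier step of showing that the problem is in NP. For the three subset problems above, the second half amounts to checking a guessed subset \(V'\) in polynomial time. The Python sketch below shows one way such checkers might look, assuming the graph is given as a vertex list and an edge list. The function names and the example graph are hypothetical.

```python
from itertools import combinations


def is_independent_set(vertices, edges, chosen, k):
    """k distinct vertices of G, and no edge has both endpoints among them."""
    chosen = set(chosen)
    if len(chosen) != k or not chosen <= set(vertices):
        return False
    return not any(u in chosen and v in chosen for u, v in edges)


def is_vertex_cover(vertices, edges, chosen, k):
    """k distinct vertices of G, and every edge has at least one endpoint among them."""
    chosen = set(chosen)
    if len(chosen) != k or not chosen <= set(vertices):
        return False
    return all(u in chosen or v in chosen for u, v in edges)


def is_clique(vertices, edges, chosen, k):
    """k distinct vertices of G, and every pair of them is joined by an edge."""
    chosen = set(chosen)
    if len(chosen) != k or not chosen <= set(vertices):
        return False
    present = {frozenset(e) for e in edges}
    return all(frozenset(pair) in present for pair in combinations(chosen, 2))


# Hypothetical example graph: a triangle 1 - 2 - 3 with an extra vertex 4 attached to 3.
vertices = [1, 2, 3, 4]
edges = [(1, 2), (2, 3), (1, 3), (3, 4)]

print(is_clique(vertices, edges, [1, 2, 3], 3))
print(is_independent_set(vertices, edges, [1, 4], 2))
print(is_vertex_cover(vertices, edges, [1, 3], 2))
```

Each checker does work polynomial in \(|V| + |E|\), which is what membership in NP requires of the verify step.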

¹ Guess-and-verify seems like pure magic because it kind of is. Unless P is indeed equal to NP, there is no general way to convert the guess-and-verify feature into a deterministic and implementable procedure which remains polynomial in time.
² This is my best attempt at calling a problem NP-Complete without actually using that term.