CNF Translations¶

As mentioned earlier, each formula can be translated into a logically equivalent formula in the conjunctive normal form. We next show three methods to do this. The first two only use the variables occurring in the original formula and are, in the worst case, of exponential size in the size of the original formula. The third method introduces auxiliary variables and is of linear size.

Method 1: Applying local rewriting rules¶

One can transform an arbitrary formula into an equivalent CNF formula by applying a sequence of local transformations:

Eliminate other binary connectives but conjunction and disjunction by applying the following rules:
- \(\LHS \Xor \RHS\) \(\Trans\) \((\LHS \land \neg \RHS) \lor (\neg \LHS \land \RHS)\)
- \(\LHS \Implies \RHS\) \(\Trans\) \((\neg \LHS \lor \RHS)\)
- \(\LHS \Iff \RHS\) \(\Trans\) \((\neg \LHS \land \neg \RHS) \lor (\LHS \land \RHS)\)
Move negations next to variables
- \(\neg(\neg \LHS) \Trans \LHS\)
- \(\neg(\LHS \land \RHS) \Trans {(\neg \LHS) \lor (\neg \RHS)}\)
- \(\neg(\LHS \lor \RHS) \Trans {(\neg \LHS) \land (\neg \RHS)}\)
Move conjunctions outside of disjunctions:
- \({\LHS \lor (\RHS_1 \land ... \land \RHS_k)} \Trans {(\LHS \lor \RHS_1) \land ... \land (\LHS \lor \RHS_k)}\)

Note that simplification could, and should, be performed whenever possible by applying, for instance, the following rules:

\(x \lor \neg x \Trans \TF\)
\(\TF \lor \RHS \Trans \TF\)
\(\TF \land \RHS \Trans \RHS\)

Example

Translating the formula \((a \land b) \Xor c\) to CNF by the procedure above gives us

first \((a \land b \land \neg c) \lor (\neg(a \land b) \land c)\),
then \((a \land b \land \neg c) \lor ((\neg a \lor \neg b) \land c)\), and
and finally
- \(((a \land b \land \neg c) \lor (\neg a \lor \neg b))\ \ \land\ \ ((a \land b \land \neg c) \lor c)\),
- \((a \lor \neg a \lor \neg b) \land (b \lor \neg a \lor \neg b) \land (\neg c \lor \neg a \lor \neg b) \land (a \lor c) \land (b \lor c) \land (\neg c \lor c)\), and
- \((\neg c \lor \neg a \lor \neg b) \land (a \lor c) \land (b \lor c)\)

In the worst case, the above translation results in exponentially larger CNF formulas. For instance, for the formula

\[(x_1 \land y_1) \lor (x_2 \land y_2) \lor ... \lor (x_n \land y_n)\]

the equivalent CNF formula

\[\bigwedge_{z_1 \in \Set{x_1,y_1},...,z_n \in \Set{x_n,y_n}} (z_1 \lor z_2 \lor ... \lor z_n)\]

has \(2^n\) clauses. As a concrete example with \(n = 3\), the CNF of \((x_1 \land y_1) \lor (x_2 \land y_2) \lor (x_3 \land y_3)\) is

\begin{array}{l} (x_1 \lor x_2 \lor x_3) \land (x_1 \lor x_2 \lor y_3) \land\\ (x_1 \lor y_2 \lor x_3) \land (x_1 \lor y_2 \lor y_3) \land\\ (y_1 \lor x_2 \lor x_3) \land (y_1 \lor x_2 \lor y_3) \land\\ (y_1 \lor y_2 \lor x_3) \land (y_1 \lor y_2 \lor y_3) \end{array}

Method 2: Deriving CNF from truth tables¶

For small formulas, CNF can be directly read from the truth table for the formula. This is based on the observation that

a clause \((l_1 \lor .. \lor l_k)\) is equivalent to \(\neg(\neg l_1 \land ... \land \neg l_k)\) and
thus, in CNF, “excludes” the rows in the truth table in which \(l_1,...,l_k\) are all false.

Example

The truth table for the formula \((a \land b) \Xor c\) is shown below.

\begin{array}{ccc|c|c@{}} a & b & c & (a \land b) & (a \land b) \Xor c \\ \hline \False & \False & \False & \False & \False \\ \False & \False & \True & \False & \True \\ \False & \True & \False & \False & \False \\ \False & \True & \True & \False & \True \\ \True & \False & \False & \False & \False \\ \True & \False & \True & \False & \True \\ \True & \True & \False & \True & \True \\ \True & \True & \True & \True & \False \\ \end{array}

Now the clause \((a \lor b \lor c)\), which is equivalent to \(\neg(\neg a \land \neg b \land \neg c)\), forbids the all-false truth assignment in the first row. Excluding each row in which \((a \land b) \Xor c\) is \(\False\) by adding a clause, we get the CNF formula

\begin{eqnarray*} (a \lor b \lor c) \land (a \lor \neg b \lor c) \land \\ (\neg a \lor b \lor c) \land (\neg a \lor \neg b \lor \neg c). \end{eqnarray*}

This is equivalent to the formula \((\neg c \lor \neg a \lor \neg b) \land (a \lor c) \land (b \lor c)\) obtained in the example of the previous method above.

Method 3: The Tseitin-translation¶

This methods uses auxiliary, “fresh” variables that do not occur in the original formula to encode the semantics of the sub-formulas. The resulting CNF formula is of linear size in the size of the original formula.

Take a formula \(\phi\). For each non-variable sub-formula \(\EHS\), introduce a fresh auxiliary variable \(\TVar{\EHS}\). For each variable \(x\), let \(\TVar{x} = x\). We then encode the semantics of each sub-formula \(\EHS\) by the following rules:

\begin{array}{@{}c|l@{}} \EHS & \text{Encoding clauses} \\ \hline \neg \LHS & (\neg\TVar{\EHS} \lor \neg\TVar{\LHS}) \land (\TVar{\EHS} \lor \TVar{\LHS}) \\ \LHS \land \RHS & (\neg\TVar{\EHS} \lor \TVar{\LHS}) \land (\neg\TVar{\EHS} \lor \TVar{\RHS}) \land (\TVar{\EHS} \lor \neg\TVar{\LHS} \lor \neg\TVar{\RHS}) \\ \LHS \lor \RHS & (\TVar{\EHS} \lor \neg\TVar{\LHS}) \land (\TVar{\EHS} \lor \neg\TVar{\RHS}) \land (\neg\TVar{\EHS} \lor \TVar{\LHS} \lor \TVar{\RHS}) \\ \LHS \Xor \RHS & (\neg\TVar{\EHS} \lor \neg\TVar{\LHS} \lor \neg\TVar{\RHS}) \land (\neg\TVar{\EHS} \lor \TVar{\LHS} \lor \TVar{\RHS}) \land (\TVar{\EHS} \lor \neg\TVar{\LHS} \lor \TVar{\RHS}) \land (\TVar{\EHS} \lor \TVar{\LHS} \lor \neg\TVar{\RHS}) \\ \LHS \Implies \RHS & (\TVar{\EHS} \lor \TVar{\LHS}) \land (\TVar{\EHS} \lor \neg\TVar{\RHS}) \land (\neg\TVar{\EHS} \lor \neg\TVar{\LHS} \lor \TVar{\RHS}) \\ \LHS \Iff \RHS & (\neg\TVar{\EHS} \lor \neg\TVar{\LHS} \lor \TVar{\RHS}) \land (\neg\TVar{\EHS} \lor \TVar{\LHS} \lor \neg\TVar{\RHS}) \land (\TVar{\EHS} \lor \neg\TVar{\LHS} \lor \neg\TVar{\RHS}) \land (\TVar{\EHS} \lor \TVar{\LHS} \lor \TVar{\RHS}) \end{array}

For intuition, consider the case \(\EHS = \LHS \land \RHS\). The clauses correspond to the implications:

\[(\TVar{\EHS} \Implies \TVar{\LHS}) \land (\TVar{\EHS} \Implies \TVar{\RHS}) \land (\neg \TVar{\EHS} \Implies (\neg\TVar{\LHS} \lor \neg\TVar{\RHS}))\]

or, equivalently,

\[(\neg \TVar{\LHS} \Implies \neg\TVar{\EHS}) \land (\neg\TVar{\RHS} \Implies \neg\TVar{\EHS}) \land (\TVar{\LHS} \land \TVar{\RHS} \Implies \TVar{\EHS})\]

From these it is easier to see that the clauses do encode the semantics of the sub-formulas correctly.

Finally, to force that the whole formula \(\phi\) should evaluate to true, we add the unary clause \((\TVar{\phi})\) to the CNF formula.

Example

Consider again the formula

\[(a \land b) \Xor c\]

For notational convenience, let \(\VA = \TVar{(a \land b) \Xor c}\) and \(\VB = \TVar{(a \land b)}\). The CNF translation is:

\begin{eqnarray*} && (\neg\VB \lor a) \land (\neg\VB \lor b) \land (\VB \lor \neg a \lor \neg b) \land {} \\ && (\neg\VA \lor \neg\VB \lor \neg c) \land (\neg\VA \lor \VB \lor c) \land (\VA \lor \neg\VB \lor c) \land (\VA \lor \VB \lor \neg c) \land {} \\ && (\VA) \end{eqnarray*}

The satisfying truth assignments of \(\phi\) and its Tseitin-translated formula are in one-to-one correspondence.

Example

Again, consider the formula \(\phi\) and its Tseitin translation \(\phi'\):

\begin{eqnarray*} \phi &=& (a \land b) \Xor c\\ \phi' &=& (\neg\VB \lor a) \land (\neg\VB \lor b) \land (\VB \lor \neg a \lor \neg b) \land {} \\ && (\neg\VA \lor \neg\VB \lor \neg c) \land (\neg\VA \lor \VB \lor c) \land (\VA \lor \neg\VB \lor c) \land (\VA \lor \VB \lor \neg c) \land {} \\ && (\VA) \end{eqnarray*}

The assignment \(\TA = \Set{a\mapsto\True,b\mapsto\False,c\mapsto\True}\) satisfies \(\phi\).

Its extension \(\TA' = \Set{a\mapsto\True,b\mapsto\False,c\mapsto\True,\VA\mapsto\True,\VB\mapsto\False}\) satisfies \(\phi'\).
The assignment \(\TA = \Set{a\mapsto\True,b\mapsto\False,c\mapsto\False}\) does not satisfy \(\phi\).

There is no extension of \(\Set{a\mapsto\True,b\mapsto\False,c\mapsto\False}\) that satisfies \(\phi'\).