1 Introduction

Recently, there has been a surge of interest in applying machine learning to combinatorial optimization [1,2,3]. SAT, the most fundamental NP-complete problem in computer science [4], has a wide range of applications, from verification [5] to planning [6] and scheduling [7]. Benedikt Bünz et al. [8] proposed using graph neural networks (GNNs) to learn to solve SAT problems, encoding SAT instances as undirected graphs that preserve their rich invariances. Current applications of GNNs to SAT problems fall into two categories. The first builds an end-to-end model that takes a problem instance as input and directly predicts satisfiability or produces a solution. A seminal work is NeuroSAT [9], which uses a message-passing model to learn the satisfiability of random SAT instances and decodes each literal's vote through an activation function to arrive at a final set of solutions. Minghao Liu et al. [10] further investigated the quality of the solutions predicted by GNNs for maximum satisfiability (MaxSAT) problems from both theoretical and practical perspectives. The second category combines neural networks with traditional search frameworks in an attempt to improve key heuristics. For example, GNNs have been used to replace the variable selection function in WalkSAT solvers [11], to generate initial assignments for local search [12], and to guide search by predicting variables in the unsat core [13]. These works show that GNN models can learn invariant structural features from SAT instances and have the potential to improve SAT solving techniques in the future.

Graph attention networks [14] are spatial-domain graph neural networks equipped with an attention mechanism. Unlike earlier spectral-domain graph convolutional networks, they require no sophisticated computations with Laplacian matrices and update node features simply by aggregating first-order neighbor nodes. Through the attention mechanism, graph attention networks can learn different weights for different neighbors.

Graph attention networks have recently achieved good results in natural language processing tasks such as text classification, where the graph attention mechanism can identify word nodes that are discriminative for the class of a text [15]. We investigate whether the graph attention mechanism can be applied to propositional logic problems in the same way as to natural language. Propositional logic and natural language are two different kinds of languages [8]. Propositional logic problems are rich in invariant features that cannot be encoded by traditional RNNs and CNNs, yet, like natural language, they are highly structured. In natural language, words arranged in different orders express different meanings; for a news article, a single word in its title may determine the category to which it belongs. In propositional logic, literals and clauses take on different logical meanings depending on the conjunctions and disjunctions that connect them, and for a pair of SAT instances, the difference between satisfiable and unsatisfiable may be whether a single literal in one clause is inverted. As in natural language processing tasks, the prediction for a propositional logic problem hinges on a small number of key nodes, so a model's ability to predict the satisfiability of SAT problems depends heavily on whether it can capture the invariant structural features of those key nodes.

In this paper, we make the first attempt to solve SAT problems using graph attention networks and present GAT-SAT, a new framework for predicting the satisfiability of SAT problems. Traditional graph attention networks have a significant disadvantage: because they only aggregate information from first-order neighbor nodes, they handle higher-order neighbors poorly. To tackle this difficulty, we use message passing to ensure that each node carries some information about its higher-order neighbors. The attention scores of literal nodes for clause nodes and of clause nodes for literal nodes are then computed separately using the graph attention mechanism, which in turn yields weighted representations of the literal and clause vectors. Finally, the prediction is obtained by a multilayer perceptron. We compare against NeuroSAT on randomly generated SAT problems and on random 3-SAT problems. On the one hand, the input of our model, like that of NeuroSAT, is just the adjacency matrix of an arbitrary number of literals and clauses and contains no hand-crafted features. On the other hand, we regard NeuroSAT as a strong representative of graph neural network approaches based on the message-passing model. Experiments demonstrate that our model outperforms NeuroSAT in prediction accuracy on both kinds of randomly generated SAT instances. In particular, on the random SAT problems, we need only half the epochs of NeuroSAT to achieve the same or even higher accuracy. We attempt to explain the role of the graph attention mechanism in the model at the end of this paper. The main contributions and originality of our work are summarized as follows:

  • We use graph attention networks for the first time to solve stochastic SAT problems.

  • To address the drawbacks of traditional graph attention networks in solving SAT problems, we propose a new framework for graph attention networks incorporating message passing, GAT-SAT.

  • We have tried to explain the role played by the graph attention mechanism in solving SAT problems.

2 Background and Related Work

2.1 Background

A SAT problem involves finding a variable assignment that satisfies a propositional logic formula or showing that no such assignment exists. A formula of propositional logic is a Boolean expression built from the constants true (1) and false (0), variables, negations, conjunctions, and disjunctions. The term literal refers to a variable or its negation. It is convenient to represent Boolean formulas in conjunctive normal form (CNF), i.e., conjunctions of clauses, where a clause is a disjunction of literals. An example of a CNF formula is (\({x}_{1}\)∨¬\({x}_{2}\)) ∧ (\({x}_{2}\)∨¬\({x}_{3}\)), where ∧, ∨, ¬ denote conjunction, disjunction, and negation, respectively. This formula has two clauses: (\({x}_{1}\)∨¬\({x}_{2}\)) and (\({x}_{2}\)∨¬\({x}_{3}\)). Each conjunct of a formula in CNF is called a clause, and each (possibly negated) variable within a clause is called a literal. A formula in CNF is satisfiable if and only if there is an assignment such that every clause has at least one literal mapped to 1. A SAT problem is a formula in CNF, where the goal is to determine whether the formula is satisfiable and, if so, to produce a satisfying assignment of truth values to variables.
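For concreteness, the following short Python sketch (our own illustration, not part of the formal development) represents the example CNF (\({x}_{1}\)∨¬\({x}_{2}\)) ∧ (\({x}_{2}\)∨¬\({x}_{3}\)) as a list of integer literals and checks whether a given assignment satisfies it:

```python
# A clause is a list of non-zero integers: k stands for x_k and -k for its negation.
cnf = [[1, -2], [2, -3]]        # (x1 v ~x2) ^ (x2 v ~x3)

def is_satisfying(cnf, assignment):
    """The formula is satisfied iff every clause has at least one literal mapped to 1."""
    return all(any(assignment[abs(lit)] == (lit > 0) for lit in clause)
               for clause in cnf)

print(is_satisfying(cnf, {1: True, 2: True, 3: True}))    # True
print(is_satisfying(cnf, {1: False, 2: True, 3: True}))   # False: first clause is falsified
```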

2.2 Related Work

Over the past two decades, many researchers have used machine learning to predict specific properties of SAT instances, such as how long a solver will need to run or which algorithm to choose [16,17,18]. These methods rely on carefully crafted features that encode aspects of the input SAT instances. Nudelman et al. [19] produced a set of 84 features, derived from known heuristics (e.g., the ratio of positive to negative occurrences of each variable in clauses), tractable subclasses, and other measures of problem complexity, that can be used to predict solver performance. These features were subsequently combined with various machine learning models [20], including the random forest model investigated by Xu et al. [21]. Hutter et al. [18] used hand-designed features to predict the runtime of each instance; such features have also been used to build algorithm portfolios that improve performance by choosing a different solver on a per-instance basis [22, 23]. SAT instances, however, are rich in both general and particular structural elements that are typically lost by most feature extraction approaches.

Thanks to recent advances in representation learning, especially geometric deep learning [24, 25], good progress has been made in predicting SAT satisfiability with graph neural networks. Benedikt Bünz et al. [8] defined a graph representation for Boolean formulas in conjunctive normal form and trained neural classifiers over general graph structures, called graph neural networks, to identify features of satisfiability; their experiments showed that learning over graph structure can classify SAT instances. Selsam et al. [9] proposed a message-passing neural network, NeuroSAT, which learns to solve SAT problems when trained only as a classifier for predicting satisfiability. In subsequent work, the authors showed that NeuroSAT can provide effective guidance for high-performance SAT solvers on real-world problems. Chris Cameron et al. [26] encoded SAT problems as sparse matrices and predicted the satisfiability of stochastic 3-SAT problems, which remain challenging for solvers, by learning their structural features end to end. The authors demonstrated that on stochastic 3-SAT tasks, both exchangeable deep learning models and neural message-passing models deliver superior prediction performance, consistently beating models based on complicated hand-engineered features.

3 Model

The overview of our model, GAT-SAT, is presented in Fig. 1. We encode the SAT problem as an undirected graph G, represented by an adjacency matrix. For message passing, we create an embedding for each literal node \({l}_{i}\in \mathrm{G}\) and each clause node \({c}_{i}\in \mathrm{G}\). At each time step, each clause receives messages from its adjacent literals and updates its embedding accordingly, and each literal receives messages from its adjacent clauses and from its complement and updates its embedding accordingly, so the embedding of every node is refined iteratively. When message passing ends, GAT-SAT learns the attention weights \({a}_{ij}\) between nodes and then computes a weighted average of the feature vectors of literal nodes and clause nodes based on these weights, yielding new representations \(l_{i}^{^{\prime}}\) and \(c_{i}^{^{\prime}}\). Finally, these representations are passed into an MLP, the vote of each node is obtained through an activation function, and the average of the literal votes gives the predicted satisfiability of the problem, which is considered satisfiable if it is greater than 0.5 and unsatisfiable otherwise.

Fig. 1

The overview of our model, GAT-SAT

3.1 Formula Graph

As mentioned earlier, the ability of a model to predict the satisfiability of a Boolean formula depends on whether the input formula contains variable conflicts. If a weakly supervised algorithm is to learn to recognize these conflicting patterns, its input must include information about the relationships between the variables in the formula. To retain the structural information of the input formula, we first convert it into a formula graph. A general Boolean formula can be any expression consisting of variables, constants, conjunctions, disjunctions, and negations, and every Boolean formula can be reduced in linear time to an equisatisfiable formula in conjunctive normal form of linear length [27]. In conjunctive normal form (CNF), a SAT instance is the conjunction of clauses \({\mathrm{C}}_{1}\) ∧ \({\mathrm{C}}_{2}\) ∧ … ∧ \({\mathrm{C}}_{n}\), where each clause \({\mathrm{C}}_{i}\) = \({l}_{i1}\) ∨ \({l}_{i2}\) ∨ … ∨ \({l}_{in}\) is a disjunction of literals, with \({l}_{ij}\) = \({x}_{k}\) or \({l}_{ij}\) = ¬\({x}_{k}\). In this paper, we encode the SAT problem as an undirected graph in which each literal corresponds to a node and each clause corresponds to a node; there is an edge between each literal and each clause in which it appears, and each pair of complementary literals (e.g., \({x}_{i}\) and ¬\({x}_{i}\)) is connected by an edge of a different type. The element \({M}_{ij}\) of the adjacency matrix equals 1 when there is an edge between clause node i and literal node j, and 0 otherwise. For example, the Boolean formula (\({x}_{1}\)∨¬\({x}_{2}\)∨\({x}_{3}\)) ∧ ¬(\({x}_{1}\)∧¬\({x}_{2}\)∧¬\({x}_{4}\)) can be converted to (\({x}_{1}\)∨¬\({x}_{2}\)∨\({x}_{3}\)) ∧ (¬\({x}_{1}\)∨\({x}_{2}\)∨\({x}_{4}\)). The undirected graph of this problem is shown in Fig. 2.

Fig. 2

Representation of the Boolean formula as an undirected graph

The adjacency matrix of this graph, with rows corresponding to the two clauses and columns ordered \({x}_{1}\), ¬\({x}_{1}\), \({x}_{2}\), ¬\({x}_{2}\), \({x}_{3}\), ¬\({x}_{3}\), \({x}_{4}\), ¬\({x}_{4}\), is:

$$ M = \begin{pmatrix} 1 & 0 & 0 & 1 & 1 & 0 & 0 & 0 \\ 0 & 1 & 1 & 0 & 0 & 0 & 1 & 0 \end{pmatrix} $$
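To make the encoding explicit, the following sketch (our own illustration, assuming the column ordering \({x}_{1}\), ¬\({x}_{1}\), \({x}_{2}\), ¬\({x}_{2}\), … used for the matrix above) builds the clause-literal adjacency matrix from a CNF given as integer-literal clauses and reproduces the two rows shown above for the formula of Fig. 2:

```python
import numpy as np

def literal_index(lit):
    """Column index under the ordering x1, ~x1, x2, ~x2, ..."""
    return 2 * (abs(lit) - 1) + (0 if lit > 0 else 1)

def adjacency_matrix(cnf, n_vars):
    """M[i, j] = 1 iff literal j occurs in clause i (clauses as rows)."""
    M = np.zeros((len(cnf), 2 * n_vars), dtype=np.float32)
    for i, clause in enumerate(cnf):
        for lit in clause:
            M[i, literal_index(lit)] = 1.0
    return M

# (x1 v ~x2 v x3) ^ (~x1 v x2 v x4), the formula of Fig. 2
print(adjacency_matrix([[1, -2, 3], [-1, 2, 4]], n_vars=4))
```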

3.2 Graph Neural Network

Our network consists of three multilayer perceptrons (\({\text{L}}_{msg}\), \({\text{C}}_{msg}\), \({\text{L}}_{vote}\)), two LSTMs (\({\mathrm{L}}_{lstm}\), \({\mathrm{C}}_{lstm}\)), and two multi-headed graph attention network layers (\({L}_{layer}\), \({C}_{layer}\)). An iteration consists of three phases: in the first, a message-passing step, each clause receives messages from its neighboring literals and updates its embedding; in the second, each literal receives messages from its neighboring clauses and from its complementary literal and updates its embedding accordingly; after T rounds of passing (T = 26 in this paper), each node contains information about its multi-order neighbors, and in the third phase it is fed to the attention network to compute the attention scores \({a}_{ij}\). These node vectors are updated in each GAT-SAT iteration. The input to our model is just the adjacency matrix M over any number of literals and clauses, so it can be trained and evaluated on problems of any size.

3.2.1 Message Passing

In a SAT instance, satisfiability is independent of the names of clauses and literals. To exploit this property, we initialize each clause \({\mathrm{C}}_{i}\) as a vector \({C}_{init}\in {R}^{d}\), with all clause feature vectors collected in the matrix \({C}^{(t)}\), and each literal \({l}_{i}\) as another vector \({L}_{init}\in {R}^{d}\), with all literal feature vectors collected in the matrix \({L}^{(t)}\in {R}^{2n\times d}\). For \({\mathrm{L}}_{lstm}\) and \({\mathrm{C}}_{lstm}\), we also maintain hidden states \({L}_{h}^{\left(t\right)}\in {R}^{2n\times d}\) and \({C}_{h}^{\left(t\right)}\in {R}^{m\times d}\), where m is the number of clauses and d is the embedding size (set to 128 in this paper). \({S}_{wap}\) is a function that exchanges each literal with its negation. Formally, the message-passing computation in each iteration is given by the following equations:

$$ C^{{\left( {t + 1} \right)}} ,C_{h}^{{\left( {t + 1} \right)}} \leftarrow C_{lstm} \left( {\left[ {C_{h}^{\left( t \right)} ,M^{T} L_{msg} \left( {L^{t} } \right)} \right]} \right), $$
(1)
$$ L^{{\left( {t + 1} \right)}} ,L_{h}^{{\left( {t + 1} \right)}} \leftarrow L_{lstm} \left( {\left[ {L_{h}^{\left( t \right)} ,S_{wap} \left( {L^{\left( t \right)} } \right),MC_{msg} \left( {C^{t + 1} } \right)} \right]} \right). $$
(2)
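A minimal PyTorch sketch of one round of Eqs. (1)-(2) is given below. It is our own illustration, not the authors' implementation: M is assumed to be the (2n × m) literal-clause matrix (the transpose of the clause-row matrix of Sect. 3.1, matching the orientation used in the equations), and the LSTM hidden state is carried as recurrent state rather than concatenated into the input.

```python
import torch
import torch.nn as nn

def mlp(d):
    """Small stand-in for the L_msg / C_msg MLPs."""
    return nn.Sequential(nn.Linear(d, d), nn.ReLU(), nn.Linear(d, d))

class MessagePassing(nn.Module):
    """One round of literal/clause message passing, a sketch of Eqs. (1)-(2)."""

    def __init__(self, d=128):
        super().__init__()
        self.L_msg, self.C_msg = mlp(d), mlp(d)
        self.C_lstm = nn.LSTMCell(d, d)        # input: M^T @ L_msg(L)
        self.L_lstm = nn.LSTMCell(2 * d, d)    # input: [swap(L), M @ C_msg(C)]

    @staticmethod
    def swap(L):
        """Exchange each literal with its complement, assuming the interleaved
        ordering x1, ~x1, x2, ~x2, ... used for the adjacency matrix."""
        return L.reshape(-1, 2, L.shape[-1]).flip(1).reshape(L.shape)

    def forward(self, M, L, C, L_h, C_h):
        # Eq. (1): clauses aggregate messages from their literals.
        C, C_h = self.C_lstm(M.t() @ self.L_msg(L), (C, C_h))
        # Eq. (2): literals aggregate messages from clauses and complements.
        L_in = torch.cat([self.swap(L), M @ self.C_msg(C)], dim=1)
        L, L_h = self.L_lstm(L_in, (L, L_h))
        return L, C, L_h, C_h
```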

3.2.2 Graph Attention Mechanism

After message passing, each node contains not only its own feature information but also that of its multi-order neighbor nodes. We compute the attention scores of literal nodes for clause nodes and of clause nodes for literal nodes, respectively, then use these scores to obtain the attention weights of clause nodes for literal nodes and of literal nodes for clause nodes, as well as a weighted representation of each node. Note that we compute the attention scores of each node with respect to all nodes, but only update the attention weights of each node for its neighbor nodes. Formally, at the end of message passing, the single-layer graph attention computation proceeds as follows:

$$ e_{ij} = \mathrm{LeakyReLU}\left( \vec{a}^{T} \left[ W\vec{h}_{i} \, \| \, W\vec{h}_{j} \right] \right), $$
(3)
$$ a_{ij} = soft\max \left( {e_{ij} } \right) = \frac{{\exp \left( {e_{ij} } \right)}}{{\mathop \sum \nolimits_{{k \in N_{i} }} \exp \left( {\left( {e_{ik} } \right)} \right)}}, $$
(4)
$$ \vec{h}^{\prime}_{i} = \sigma \left( \sum\nolimits_{j \in N_{i}} a_{ij} W \vec{h}_{j} \right), $$
(5)

where \({\overrightarrow{h}}_{i}\) and \({\overrightarrow{h}}_{j}\) are the representations of literal nodes and clause nodes, respectively, \({\overrightarrow{a}}^{T}\) is the attention weight vector, \({\varvec{W}}\) is a trainable projection matrix, \(\upsigma \) is the ELU activation function, and \(||\) denotes concatenation. To make the coefficients of different nodes comparable, we normalize them with the softmax function to obtain the attention weight \({a}_{ij}\) of node \({\overrightarrow{h}}_{j}\) for node \({\overrightarrow{h}}_{i}\), which is then used to update the feature representation of node \({\overrightarrow{h}}_{i}\). \({\overrightarrow{h}}_{i}^{\mathrm{^{\prime}}}\) is the output of a single attention layer.

The input to our layer is a set of node features: \({\overrightarrow{h}}_{i}\in {R}^{2n\times d}\) and \({\overrightarrow{h}}_{j}\in {R}^{m\times d}\) are the representations of literal nodes and clause nodes, where n is the number of variables (so there are 2n literal nodes), m is the number of clause nodes, and d is the number of features per node. \({e}_{ij}\) indicates the importance of node j's features to node i. To obtain sufficient expressive power to transform the input features into higher-level features, at least one learnable linear transformation is required; as an initial step, a shared linear transformation, parametrized by a weight matrix W ∈ \({R}^{d\times d}\), is applied to every node. \(||\) denotes concatenation, so \({\varvec{W}}{\overrightarrow{h}}_{i}|| {\varvec{W}}{\overrightarrow{h}}_{j}\in {\mathrm{R}}^{2n\times m\times 2d}\). We then perform self-attention on the nodes: a shared attentional mechanism, parametrized by \(\overrightarrow{a}\in {R}^{2d\times 1}\), maps \({\mathrm{R}}^{2n\times m\times 2d}\to {R}^{2n\times m}\) and computes the attention coefficients, as shown in Fig. 3.

Fig. 3

Single-layer attention score calculation process
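The following PyTorch sketch (our own illustration, using the same (2n × m) adjacency mask as in the message-passing sketch) implements the single-head computation of Eqs. (3)-(5): scores are computed for all literal-clause pairs, but the softmax is restricted to neighbors.

```python
import torch
import torch.nn as nn
import torch.nn.functional as F

class BipartiteAttention(nn.Module):
    """Single attention head over the literal-clause graph, a sketch of Eqs. (3)-(5)."""

    def __init__(self, d=128):
        super().__init__()
        self.W = nn.Linear(d, d, bias=False)      # shared projection W
        self.a = nn.Linear(2 * d, 1, bias=False)  # attention vector a^T

    def forward(self, h_i, h_j, adj):
        # h_i: (2n, d) literal features, h_j: (m, d) clause features,
        # adj: (2n, m) mask with 1 where literal i occurs in clause j.
        Wh_i, Wh_j = self.W(h_i), self.W(h_j)
        # Eq. (3): e_ij = LeakyReLU(a^T [W h_i || W h_j]) for every pair (i, j).
        pair = torch.cat([Wh_i.unsqueeze(1).expand(-1, Wh_j.size(0), -1),
                          Wh_j.unsqueeze(0).expand(Wh_i.size(0), -1, -1)], dim=-1)
        e = F.leaky_relu(self.a(pair).squeeze(-1))          # (2n, m)
        # Eq. (4): softmax only over the neighbours N_i (assumes every
        # literal appears in at least one clause).
        alpha = torch.softmax(e.masked_fill(adj == 0, float("-inf")), dim=-1)
        # Eq. (5): weighted aggregation followed by the ELU nonlinearity.
        return F.elu(alpha @ Wh_j)                          # (2n, d)
```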

It is difficult to capture all the feature information of neighboring nodes with a single attention computation. To stabilize the learning process of the attention mechanism, we use multi-headed attention: a single-layer attention network is executed K times and the outputs are concatenated as the input of the final output layer, as shown in Eq. (6). To obtain the final representation \({\overrightarrow{h}}_{i}^{\boldsymbol{^{\prime}}\boldsymbol{^{\prime}}}\) of a literal node, the final output layer of the graph attention network computes attention scores \({a}_{ij}^{^{\prime}}\) over the concatenated feature vectors. The computation of the multi-head graph attention network is as follows.

$$ \overrightarrow {h}_{i}^{^{\prime}} = \mathop{\Big\Vert}\limits_{k = 1}^{K} \sigma \left( \sum\nolimits_{j \in N_{i}} a_{ij}^{k} W^{k} \overrightarrow {h}_{j} \right), $$
(6)
$$ \overrightarrow {h}_{i}^{^{\prime\prime}} = \sigma \left( {\sum\nolimits_{j \in Ni} {\left( {a_{ij}^{^{\prime}} W^{k} \overrightarrow {h}_{j}^{^{\prime}} } \right)} } \right). $$
(7)
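A corresponding multi-head sketch is shown below. It is again only an illustration: in the paper the final output layer is itself an attention pass with scores \({a}_{ij}^{^{\prime}}\) (Eq. 7), whereas this sketch replaces that pass with a linear map that merely preserves the 1024 → 128 shape reported in Sect. 5.1. BipartiteAttention is the single-head sketch given above.

```python
import torch
import torch.nn as nn

class MultiHeadAttention(nn.Module):
    """K independent heads whose outputs are concatenated as in Eq. (6),
    followed by a simplified stand-in for the output layer of Eq. (7)."""

    def __init__(self, d=128, K=8):
        super().__init__()
        self.heads = nn.ModuleList([BipartiteAttention(d) for _ in range(K)])
        self.out = nn.Linear(K * d, d, bias=False)   # 1024 -> 128 with d=128, K=8

    def forward(self, h_i, h_j, adj):
        h_cat = torch.cat([head(h_i, h_j, adj) for head in self.heads], dim=-1)
        return self.out(h_cat)                       # (2n, d)
```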

After the attention scores have been learned, the node vectors contain weighted structural information. We apply a multilayer perceptron \({L}_{Vote}\) with hidden size 128 to the feature vector of each literal to extract categorical information and obtain a vote for each literal, and we then compute the mean y of the literal votes to obtain the final prediction. Our model is trained by minimizing the cross-entropy loss between y and the true label φ(P).
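A sketch of this readout, under the same assumptions as the previous snippets, might look as follows; the hypothetical training-step comments at the end show how the mean vote y would be trained with binary cross-entropy and thresholded at 0.5.

```python
import torch
import torch.nn as nn

class LiteralVote(nn.Module):
    """Readout sketch: an MLP (hidden size 128) scores each literal and the
    scores are averaged into a single logit y for the whole instance."""

    def __init__(self, d=128):
        super().__init__()
        self.L_vote = nn.Sequential(nn.Linear(d, d), nn.ReLU(),
                                    nn.Linear(d, d), nn.ReLU(),
                                    nn.Linear(d, 1))

    def forward(self, h_lit):
        return self.L_vote(h_lit).mean()     # mean of the 2n literal votes

# Hypothetical training step: label is 1.0 for a satisfiable instance.
# loss = nn.functional.binary_cross_entropy_with_logits(y, label)
# prediction: satisfiable iff torch.sigmoid(y) > 0.5
```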

4 Datasets

To predict problem satisfiability, we generated two different kinds of random SAT problems. Uniformly distributed random 3-SAT problems are simple to generate but challenging to solve in practice: MiniSAT [28] takes only milliseconds for instances with 100 variables, sometimes a minute for instances with 300 variables, and several hours for harder instances with 600 variables. The running time for solving random 3-SAT problems increases exponentially with problem size [29]. With the number of variables fixed, the probability that a randomly generated formula is satisfiable approaches 100% as the number of clauses decreases (most problems are under-constrained) and approaches 0% as the number of clauses increases (most problems are over-constrained). For intermediate numbers of clauses, this probability does not change gradually but undergoes a sharp phase transition at a critical point (where the ratio of clauses to variables is about 4.26) at which the probability that the formula is satisfiable is exactly 50%. To obtain the required datasets quickly, we created five datasets at this critical point, each with a fixed number of variables: 100, 150, 200, 250, and 300. Each dataset contains 10,000 instances, which we randomly divided into training, validation, and test sets in the ratio 8:1:1, with satisfiable and unsatisfiable instances split evenly.
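A generator for such instances can be sketched as follows (our own illustration; the satisfiability labels would still have to be obtained with a complete solver such as MiniSAT):

```python
import random

def random_3sat(n_vars, ratio=4.26, seed=None):
    """Uniform random 3-SAT instance near the phase transition: about
    ratio * n_vars clauses, each over 3 distinct variables, with each
    variable negated independently with probability 1/2."""
    rng = random.Random(seed)
    cnf = []
    for _ in range(round(ratio * n_vars)):
        variables = rng.sample(range(1, n_vars + 1), 3)
        cnf.append([v if rng.random() < 0.5 else -v for v in variables])
    return cnf

cnf = random_3sat(100, seed=0)   # ~426 clauses over 100 variables
```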

Second, we generated small random SAT problems that do not follow the 3-SAT pattern: the number of literals per clause is not fixed, ranging from as few as two to as many as seven, and the number of clauses per problem is likewise not fixed, ranging from a few dozen to more than 200. Since predicting satisfiability requires a large number of training instances, we created two datasets, SR(3, 10) and SR(10, 40), using the same method as Selsam et al. [9], each with a training set of 300,000 instances, a validation set of 30,000, and a test set of 30,000, in which instances appear in pairs, one satisfiable and one unsatisfiable. The only difference between the satisfiable and unsatisfiable member of a pair is that a single literal in one clause is inverted. SR(3, 10) contains instances with between 3 and 10 variables, while SR(10, 40) contains instances with between 10 and 40 variables. These problems are trivial for modern solvers, but they are not simple as machine learning classification problems, because SAT problems are highly structured and changing a single variable can flip a formula from satisfiable to unsatisfiable, so our models must correctly identify and capture the invariant structural features. We also built the dataset \({SR}_{5000}\)(3, 10) to evaluate the model's capacity to learn satisfiability features from extremely few samples, using just 5000 instances for training, 3000 for validation, and 3000 for testing.

We now detail how the distribution SR(N) is created. To generate a random clause over n variables, SR(N) first samples a small integer k (with a mean value slightly above 4), then samples k variables uniformly at random, and finally negates each of them independently with probability 50%. It generates clauses \({C}_{i}\) in this way, adds them to the SAT instance, and queries a traditional SAT solver (we use MiniSAT [28]) after each addition, stopping as soon as the clause \({C}_{m}\) makes the problem unsatisfiable. At this point \(\left\{{C}_{1},\dots ,{C}_{m-1}\right\}\) is satisfiable, so we invert a single literal of \({C}_{m}\) to obtain a satisfiable problem \(\left\{{C}_{1},\dots ,{C}_{m-1},{C}_{m}^{^{\prime}}\right\}\). The pair \(\left\{{C}_{1},\dots ,{C}_{m-1},{C}_{m}^{^{\prime}}\right\}\) and \(\left\{{C}_{1},\dots ,{C}_{m-1},{C}_{m}\right\}\) constitutes the paired random SAT problems in the distribution SR(N).
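The following sketch mirrors this procedure. It is an illustration only: it assumes the PySAT package for the incremental MiniSAT queries, and the distribution of k is a placeholder rather than the exact one used by Selsam et al. [9].

```python
import random
from pysat.solvers import Minisat22   # assumes the PySAT package, not the authors' code

def generate_sr_pair(n_vars, sample_k, seed=None):
    """Sketch of the SR(N) pair construction described above: add random clauses
    until the formula turns unsatisfiable, then flip one literal of the last
    clause to obtain the satisfiable twin."""
    rng = random.Random(seed)
    clauses = []
    with Minisat22() as solver:
        while True:
            k = min(sample_k(rng), n_vars)
            variables = rng.sample(range(1, n_vars + 1), k)
            clause = [v if rng.random() < 0.5 else -v for v in variables]
            solver.add_clause(clause)
            clauses.append(clause)
            if not solver.solve():               # C_m made the problem UNSAT
                break
    unsat = clauses
    sat = clauses[:-1] + [[-clauses[-1][0]] + clauses[-1][1:]]  # invert one literal of C_m
    return sat, unsat

# Placeholder distribution for k (mean a little above 4).
sat, unsat = generate_sr_pair(n_vars=10,
                              sample_k=lambda rng: 2 + rng.randint(1, 4),
                              seed=0)
```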

5 Experiment and Results

5.1 Experimental Setup

For the message-passing part, we use the hyperparameter settings of Selsam et al. [9]: the hidden units and the literal and clause embeddings have dimension 128, each MLP has three hidden layers and a linear output layer, and we regularize the parameters with the \({\mathcal{l}}_{2}\) norm using a scaling factor of \({10}^{-10}\). For each problem, we perform 26 message-passing iterations. At the end of message passing, we use two multi-headed attention layers with K = 8, one to compute the attention scores of literal nodes for clause nodes and the other to compute the attention scores of clause nodes for literal nodes. Every single attention layer has input and output features of dimension 128; the last output layer has an input feature dimension of 1024 (8 × 128) and an output feature dimension of 128. We trained our model with the ADAM optimizer [31] and use binary cross-entropy (BCE) as the loss function.
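For reference, these settings can be summarized in a single (hypothetical) configuration dictionary; the names below are ours and do not correspond to any released code.

```python
# Hypothetical configuration collecting the hyperparameters of Sect. 5.1.
config = dict(
    embedding_dim=128,        # literal/clause embeddings and hidden units
    mlp_hidden_layers=3,      # each MLP: 3 hidden layers + linear output
    l2_scale=1e-10,           # scaling factor of the l2 regularization term
    message_passing_rounds=26,
    attention_heads=8,        # two K = 8 multi-head attention layers
    final_layer_in=8 * 128,   # 1024 -> 128 output layer
    final_layer_out=128,
    optimizer="Adam",
    loss="binary cross-entropy",
)
```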

5.2 Experimental Results

5.2.1 SR(N)

We compared GAT-SAT with the advanced neural embedding framework NeuroSAT [9], as shown in Table 1, where bold values indicate the best performance. We first trained and tested on SR(3, 10); our model and NeuroSAT used the same dataset rather than regenerating it, and both ran for only one epoch over all the data. NeuroSAT and GAT-SAT both achieve good accuracy after a single epoch; for the SR(10, 40) problem, we report the final result after 50 epochs. On SR(3, 10), NeuroSAT achieved a training accuracy of 0.786 and a test accuracy of 0.830, while GAT-SAT achieved a training accuracy of 0.879 and a test accuracy of 0.891. We then perform experiments on the more difficult SR(10, 40) to demonstrate that our model is not limited to simple instances. In a previous study, NeuroSAT was trained on nearly 10 million problems from this distribution and achieved a prediction accuracy of 0.85 on SR(40) [9], so a training set of only 300,000 is not an easy task for NeuroSAT. As shown in Fig. 4, the accuracy of GAT-SAT on SR(10, 40) reached 0.802 after one epoch and finally converged to 0.897 after 29 epochs, while NeuroSAT showed no inflection point on this dataset even though we trained it for more than 50 epochs over more than a week. For the simple but much smaller \({\mathrm{SR}}_{5000}\) (3, 10) dataset, as shown in Fig. 5, we ran several experiments, each time taking the validation results of the first 50 epochs, and found that the performance of NeuroSAT is not stable. We show the performance of GAT-SAT when NeuroSAT performs better (right panel) and worse (left panel) as a comparison. NeuroSAT's accuracy improves rapidly only after sufficient training and then converges to a higher level, while GAT-SAT almost always completes this process within just a few epochs. Our model almost always starts to converge within single-digit epochs during training and quickly reaches a satisfactory accuracy, whereas NeuroSAT needs a dozen epochs in the best case and about 30 epochs in the worst case before it starts to converge.

Table 1 Experimental results on the SR(3, 10) dataset
Fig. 4

Comparison of results per epoch on SR(10, 40)

Fig. 5

Performance of GAT-SAT compared with NeuroSAT when NeuroSAT performs better (right panel) and worse (left panel)

5.2.2 Random 3-SAT

On the five random 3-SAT datasets, we evaluated the prediction accuracy of GAT-SAT and NeuroSAT and observed that prediction accuracy does not decrease as problem size increases. We report the training, validation, and test accuracies of NeuroSAT and GAT-SAT on the five datasets, compare the test accuracies of the two models, and bold the better results. As shown in Table 2, on the dataset with 100 variables, NeuroSAT obtained a test accuracy of 0.745 and GAT-SAT a test accuracy of 0.784; on the dataset with 300 variables, NeuroSAT reached 0.799 and GAT-SAT outperformed it by 2%; across all test sets, GAT-SAT improved prediction accuracy by 1–4% over NeuroSAT. We also note that the difference between NeuroSAT and GAT-SAT on the validation sets is not large, but on the test sets NeuroSAT falls below its validation accuracy almost every time, while GAT-SAT shows results comparable to its validation accuracy. Figure 6 shows the validation accuracy of GAT-SAT and NeuroSAT over 50 epochs on the datasets with 100, 150, 200, 250, and 300 variables. In the first few epochs on these datasets, GAT-SAT significantly outperforms NeuroSAT; as training continues, the gap narrows considerably, and on the datasets with 200 and 300 variables NeuroSAT gains a slight advantage over GAT-SAT. To verify that the model captures general structural features of SAT instances rather than simply learning features of a fixed size, we used the model with the highest validation accuracy on the 100-variable dataset to predict the satisfiability of the larger problems. The results are shown in Table 3: even when trained only on the 100-variable dataset, NeuroSAT and GAT-SAT still perform well on the other datasets, and GAT-SAT retains a prediction accuracy 1–4% higher than NeuroSAT.

Table 2 Experimental results on the random 3-SAT problem dataset
Fig. 6

Validation accuracy of the models on the datasets with 100, 150, 200, 250, and 300 variables

Table 3 Experimental results when trained on 100 variables and tested on other scales

5.3 Impact of Attention Layer

The traditional graph attention mechanism has an obvious drawback for the SAT problem: it works poorly for multi-order neighbor nodes. We tackle this problem by performing a message-passing step before computing the attention, so that each node contains information about its multi-order neighbors. To show more clearly what our attention layer does after message passing, we visualize the attention scores of literal nodes for neighboring clause nodes on a very small problem with only 5 variables and 35 clauses. As shown in Fig. 7, the horizontal coordinates of the heat map are the literal nodes: for convenience, the first five columns represent the positive literals of the five variables and the last five columns their negations, while the vertical coordinates represent the clause nodes. The color of an intersection point represents the importance of the adjacent clause to the literal itself: the darker the color, the more important it is. After learning the attention scores, every SAT instance we generate assigns a different weight to each node, and these weights guide the final voting of the nodes. In fact, for any SAT problem, large or small, the variables whose assignments can be inferred from the assignments of other variables during Boolean constraint propagation (BCP) are less important; only a few variables affect the satisfiability of the whole problem. If they can all be assigned correctly, the SAT problem is satisfiable and such a solution is found; otherwise, the problem is unsatisfiable. We speculate that GAT-SAT achieves faster and more accurate satisfiability prediction precisely by learning the attention scores among nodes and assigning higher weights to the essential literal nodes.

Fig. 7

Visualization of the attention scores of literal nodes for neighboring clause nodes on a small problem
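A heat map of this kind can be produced with a few lines of matplotlib; the snippet below uses random placeholder values in place of the learned attention weights and is only meant to illustrate the layout of Fig. 7.

```python
import numpy as np
import matplotlib.pyplot as plt

# Hypothetical data standing in for the learned weights a_ij of Fig. 7:
# 35 clause nodes (rows) x 10 literal nodes (columns, x1..x5 then ~x1..~x5).
attn = np.random.rand(35, 10)

plt.figure(figsize=(4, 8))
plt.imshow(attn, cmap="viridis", aspect="auto")
plt.xticks(range(10), [f"x{i}" for i in range(1, 6)] + [f"~x{i}" for i in range(1, 6)])
plt.xlabel("literal node")
plt.ylabel("clause node")
plt.colorbar(label="attention weight")
plt.tight_layout()
plt.show()
```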

5.4 Ablation Study

In the previous subsections, we validated the effectiveness of our proposed model, in which the graph attention network acts as a feature extractor. To verify its effectiveness in extracting features, we set up three groups of experiments, changing only the feature extraction part of the model; the results are shown in Table 4. The first group uses the traditional graph attention network as the feature extractor, the second group uses only the message-passing network, and the third group is our proposed GAT-SAT. We train for 50 epochs on two datasets, SR(3, 10) and random 3-SAT with 100 variables, and report the results on the test set. As expected, the traditional graph attention network, which aggregates information only from first-order neighbor nodes, ultimately fails to find a suitable way to solve the problem. The message-passing model learns to solve the problems within the dataset and achieves good accuracy. The highest accuracy is achieved by our GAT-SAT method, which improves on the message-passing model by 5.9–9.2%. These experiments demonstrate that our improved graph attention network plays an important role in improving the prediction accuracy of our model, especially on difficult problems.

Table 4 Results of the ablation study

6 Conclusions and Future Work

Graph attention networks have performed well on various tasks in recent years. In this paper, we use graph attention networks for the first time to solve propositional logic problems and propose GAT-SAT, a new framework for predicting the satisfiability of random SAT problems. Our model consistently outperforms NeuroSAT in prediction accuracy on both random SAT and random 3-SAT datasets. We visualize how GAT-SAT achieves this: after the graph attention network learns the attention scores, it assigns different weights to the embeddings of literal and clause nodes, concentrating on the feature embeddings of important nodes when predicting satisfiability. Our work shows that the graph attention network can learn and capture the invariant structural features of the key nodes of propositional logic problems. With the same absence of hand-crafted features and only comparable supervision, GAT-SAT achieves higher accuracy with only half the epochs of the message-passing model NeuroSAT. In future work, we will try to use graph attention networks to solve more practical problems.