Abstract
Classifiers learnt from data are increasingly being used as components in systems where safety is a critical concern. In this work, we present a formal notion of safety for classifiers via constraints called safe-ordering constraints. These constraints relate requirements on the order of the classes output by a classifier to conditions on its input, and are expressive enough to encode various interesting examples of classifier safety specifications from the literature. For classifiers implemented using neural networks, we also present a run-time mechanism for the enforcement of safe-ordering constraints. Our approach is based on a self-correcting layer, which provably yields safe outputs regardless of the characteristics of the classifier input. We compose this layer with an existing neural network classifier to construct a self-correcting network (SC-Net), and show that in addition to providing safe outputs, the SC-Net is guaranteed to preserve the classification accuracy of the original network whenever possible. Our approach is independent of the size and architecture of the neural network used for classification, depending only on the specified property and the dimension of the network’s output; thus it is scalable to large state-of-the-art networks. We show that our approach can be optimized for a GPU, introducing run-time overhead of less than 1 ms on current hardware—even on large, widely-used networks containing hundreds of thousands of neurons and millions of parameters. Code available at github.com/cmu-transparency/self-correcting-networks.
Notes
1. While [20] used the convention that the index of the minimal element of ACAS Xu networks is the top predicted advisory, in this paper we use the more common convention that the index of the maximal value is the top prediction.
2. \([m] := \{0,\ldots ,m-1\}\).
References
Abadi, M., et al.: TensorFlow: a system for large-scale machine learning. In: 12th USENIX Symposium on Operating Systems Design and Implementation (OSDI 2016), pp. 265–283 (2016)
Alshiekh, M., Bloem, R., Ehlers, R., Könighofer, B., Niekum, S., Topcu, U.: Safe reinforcement learning via shielding. In: Thirty-Second AAAI Conference on Artificial Intelligence (2018)
Anderson, G., Pailoor, S., Dillig, I., Chaudhuri, S.: Optimization and abstraction: a synergistic approach for analyzing neural network robustness. In: Proceedings of the 40th ACM SIGPLAN Conference on Programming Language Design and Implementation, PLDI 2019, pp. 731–744. Association for Computing Machinery, New York (2019). https://doi.org/10.1145/3314221.3314614
Anil, C., Lucas, J., Grosse, R.: Sorting out Lipschitz function approximation. In: International Conference on Machine Learning, pp. 291–301. PMLR (2019)
Berger, E.D., Zorn, B.G.: DieHard: probabilistic memory safety for unsafe languages. In: Proceedings of the 27th ACM SIGPLAN Conference on Programming Language Design and Implementation, PLDI 2006, pp. 158–168. Association for Computing Machinery, New York (2006)
Bloem, R., Könighofer, B., Könighofer, R., Wang, C.: Shield synthesis. In: Baier, C., Tinelli, C. (eds.) TACAS 2015. LNCS, vol. 9035, pp. 533–548. Springer, Heidelberg (2015). https://doi.org/10.1007/978-3-662-46681-0_51
Brown, T.B., et al.: Language models are few-shot learners. arXiv preprint arXiv:2005.14165 (2020)
Clarkson, M.R., Schneider, F.B.: Hyperproperties. In: 2008 21st IEEE Computer Security Foundations Symposium, pp. 51–65 (2008). https://doi.org/10.1109/CSF.2008.7
Dekel, E., Nassimi, D., Sahni, S.: Parallel matrix and graph algorithms. SIAM J. Comput. 10, 657–675 (1981)
Donti, P.L., Roderick, M., Fazlyab, M., Kolter, J.Z.: Enforcing robust control guarantees within neural network policies. In: International Conference on Learning Representations (2021). https://openreview.net/forum?id=5lhWG3Hj2By
Dvijotham, K., Stanforth, R., Gowal, S., Mann, T., Kohli, P.: A dual approach to scalable verification of deep networks. In: Proceedings of the Thirty-Fourth Annual Conference on Uncertainty in Artificial Intelligence (UAI 2018), Corvallis, Oregon, pp. 162–171. AUAI Press (2018)
Ehlers, R.: Formal verification of piece-wise linear feed-forward neural networks. In: D’Souza, D., Narayan Kumar, K. (eds.) ATVA 2017. LNCS, vol. 10482, pp. 269–286. Springer, Cham (2017). https://doi.org/10.1007/978-3-319-68167-2_19
Findler, R.B., Felleisen, M.: Contracts for higher-order functions. In: Proceedings of the Seventh ACM SIGPLAN International Conference on Functional Programming, ICFP 2002 (2002)
Fischer, M., Balunovic, M., Drachsler-Cohen, D., Gehr, T., Zhang, C., Vechev, M.: DL2: training and querying neural networks with logic. In: International Conference on Machine Learning, pp. 1931–1941. PMLR (2019)
Gehr, T., Mirman, M., Drachsler-Cohen, D., Tsankov, P., Chaudhuri, S., Vechev, M.: AI2: Safety and robustness certification of neural networks with abstract interpretation. In: 2018 IEEE Symposium on Security and Privacy (SP), pp. 3–18 (2018)
Guttmann, W., Maucher, M.: Variations on an ordering theme with constraints. In: Navarro, G., Bertossi, L., Kohayakawa, Y. (eds.) TCS 2006. IIFIP, vol. 209, pp. 77–90. Springer, Boston, MA (2006). https://doi.org/10.1007/978-0-387-34735-6_10
Havelund, K., Rosu, G.: Monitoring programs using rewriting. In: Proceedings 16th Annual International Conference on Automated Software Engineering (ASE 2001), pp. 135–143 (2001)
He, K., Zhang, X., Ren, S., Sun, J.: Deep residual learning for image recognition. In: IEEE Conference on Computer Vision and Pattern Recognition (CVPR) (2016)
Huang, X., Kwiatkowska, M., Wang, S., Wu, M.: Safety verification of deep neural networks. In: Majumdar, R., Kunčak, V. (eds.) CAV 2017. LNCS, vol. 10426, pp. 3–29. Springer, Cham (2017). https://doi.org/10.1007/978-3-319-63387-9_1
Julian, K.D., Kochenderfer, M.J., Owen, M.P.: Deep neural network compression for aircraft collision avoidance systems. J. Guid. Control Dyn. 42(3), 598–608 (2019)
Katz, G., Barrett, C., Dill, D.L., Julian, K., Kochenderfer, M.J.: Reluplex: an efficient SMT solver for verifying deep neural networks. In: Majumdar, R., Kunčak, V. (eds.) CAV 2017. LNCS, vol. 10426, pp. 97–117. Springer, Cham (2017). https://doi.org/10.1007/978-3-319-63387-9_5
Kling, M., Misailovic, S., Carbin, M., Rinard, M.: Bolt: on-demand infinite loop escape in unmodified binaries. In: Proceedings of the ACM International Conference on Object Oriented Programming Systems Languages and Applications, OOPSLA 2012, pp. 431–450. Association for Computing Machinery, New York (2012)
Kochenderfer, M.J., et al.: Optimized airborne collision avoidance, pp. 249–276 (2015)
Krizhevsky, A., Hinton, G.: Learning multiple layers of features from tiny images. Technical report 0, University of Toronto, Toronto, Ontario (2009)
Leino, K., Fredrikson, M.: Relaxing local robustness. In: Advances in Neural Information Processing Systems (NeurIPS) (2021)
Leino, K., Wang, Z., Fredrikson, M.: Globally-robust neural networks. In: International Conference on Machine Learning (ICML) (2021)
Li, Q., Haque, S., Anil, C., Lucas, J., Grosse, R.B., Jacobsen, J.H.: Preventing gradient attenuation in Lipschitz constrained convolutional networks. In: Advances in Neural Information Processing Systems 32, pp. 15390–15402 (2019)
Li, T., Gupta, V., Mehta, M., Srikumar, V.: A logic-driven framework for consistency of neural models. In: Inui, K., Jiang, J., Ng, V., Wan, X. (eds.) Proceedings of the 2019 Conference on Empirical Methods in Natural Language Processing and the 9th International Joint Conference on Natural Language Processing, EMNLP-IJCNLP 2019, Hong Kong, China, 3–7 November 2019, pp. 3922–3933. Association for Computational Linguistics (2019). https://doi.org/10.18653/v1/D19-1405
Lin, X., Zhu, H., Samanta, R., Jagannathan, S.: ART: abstraction refinement-guided training for provably correct neural networks. In: 2020 Formal Methods in Computer Aided Design (FMCAD), pp. 148–157 (2020)
Long, F., Sidiroglou-Douskos, S., Rinard, M.: Automatic runtime error repair and containment via recovery shepherding. In: Proceedings of the 35th ACM SIGPLAN Conference on Programming Language Design and Implementation, PLDI 2014, pp. 227–238. Association for Computing Machinery, New York (2014)
Madry, A., Makelov, A., Schmidt, L., Tsipras, D., Vladu, A.: Towards deep learning models resistant to adversarial attacks. In: International Conference on Learning Representations (2018). https://openreview.net/forum?id=rJzIBfZAb
Meyer, B.: Eiffel: The Language (1992)
Mirman, M., Gehr, T., Vechev, M.: Differentiable abstract interpretation for provably robust neural networks. In: International Conference on Machine Learning, pp. 3578–3586. PMLR (2018)
Müller, C., Serre, F., Singh, G., Püschel, M., Vechev, M.: Scaling polyhedral neural network verification on GPUs. In: Proceedings of Machine Learning and Systems 3 (2021)
Nieuwenhuis, R., Rivero, J.M.: Practical algorithms for deciding path ordering constraint satisfaction. Inf. Comput. 178(2), 422–440 (2002). https://doi.org/10.1006/inco.2002.3146
Perkins, J.H., et al.: Automatically patching errors in deployed software. In: Proceedings of the ACM SIGOPS 22nd Symposium on Operating Systems Principles, SOSP 2009, pp. 87–102. Association for Computing Machinery, New York (2009)
Qin, F., Tucek, J., Sundaresan, J., Zhou, Y.: Rx: treating bugs as allergies–a safe method to survive software failures. In: Proceedings of the Twentieth ACM Symposium on Operating Systems Principles, SOSP 2005, pp. 235–248. Association for Computing Machinery, New York (2005)
Rinard, M., Cadar, C., Dumitran, D., Roy, D.M., Leu, T., Beebee, W.S.: Enhancing server availability and security through failure-oblivious computing. In: Proceedings of the 6th Conference on Symposium on Operating Systems Design and Implementation - Volume 6, OSDI 2004, p. 21. USENIX Association, Berkeley (2004)
Singh, G., Gehr, T., Püschel, M., Vechev, M.: An abstract domain for certifying neural networks. Proc. ACM Program. Lang. 3(POPL) (2019)
Sotoudeh, M., Thakur, A.V.: Provable repair of deep neural networks. In: Proceedings of the 42nd ACM SIGPLAN International Conference on Programming Language Design and Implementation, pp. 588–603 (2021)
Tan, M., Le, Q.: EfficientNet: rethinking model scaling for convolutional neural networks. In: International Conference on Machine Learning, pp. 6105–6114. PMLR (2019)
Tan, M., Le, Q.V.: EfficientNetV2: smaller models and faster training. arXiv preprint arXiv:2104.00298 (2021)
Trockman, A., Kolter, J.Z.: Orthogonalizing convolutional layers with the Cayley transform. In: International Conference on Learning Representations (2021)
Urban, C., Christakis, M., Wüstholz, V., Zhang, F.: Perfectly parallel fairness certification of neural networks. Proc. ACM Program. Lang. 4(OOPSLA) (2020). https://doi.org/10.1145/3428253
Wu, H., et al.: Parallelization techniques for verifying neural networks. In: 2020 Formal Methods in Computer Aided Design, FMCAD 2020, Haifa, Israel, 21–24 September 2020, pp. 128–137. IEEE (2020). https://doi.org/10.34727/2020/isbn.978-3-85448-042-6_20
Zhu, H., Xiong, Z., Magill, S., Jagannathan, S.: An inductive synthesis framework for verifiable reinforcement learning. In: Proceedings of the 40th ACM SIGPLAN Conference on Programming Language Design and Implementation, PLDI 2019, pp. 686–701. Association for Computing Machinery, New York (2019). https://doi.org/10.1145/3314221.3314638
Acknowledgment
This material is based upon work supported by the Software Engineering Institute under its FFRDC Contract No. FA8702-15-D-0002 with the U.S. Department of Defense, the National Science Foundation under Grant No. CNS-1943016, DARPA GARD Contract HR00112020006, and the Alfred P. Sloan Foundation.
Appendices
A Appendix 1: Proofs
Theorem 2
(Accuracy Preservation). Given a neural network, \(f : \mathbb {R}^n \rightarrow \mathbb {R}^m\), and set of constraints, \(\varPhi \), let \(f^\varPhi := SC^\varPhi (f)\) and let \(F^O: \mathbb {R}^n \rightarrow [m]\) be the oracle classifier. Assume that SC satisfies transparency. Further, assume that accuracy is consistent with safety, i.e., that
\(\forall x~.~\exists y~.~\varPhi (x, y) \wedge \mathop {\textrm{argmax}}\limits _i\{y_i\} = F^O(x)\).
Then, for all \(x\), \(F(x) = F^O(x) \implies F^\varPhi (x) = F^O(x)\).
Proof
Let \(x \in \mathbb {R}^n\) such that \(F(x) = F^O(x)\). By hypothesis, we have that \(\exists y~.~\varPhi (x, y) \wedge \mathop {\textrm{argmax}}\limits _i\{y_i\} = F^O(x)\), hence we can apply Property 1 to conclude that \(F^\varPhi (x) = F(x) = F^O(x)\).
We now prove that the transformer presented in Algorithm 3.1, \(SC\), is indeed self-correcting; i.e., that it satisfies Properties 2(i) and 2(ii). Recall that this means that \(f^\varPhi \) will either return safe output vectors, or, in the event that \(\varPhi \) is inconsistent at a point, and only in that event, return \(\bot \).
Let \(x : \mathbb {R}^n\) be an arbitrary vector. If \(\varPhi (x, f(x))\) is initially satisfied, the SC-Layer does not modify the original output \(y = f(x)\), and Properties 2(i) and 2(ii) are trivially satisfied. If \(\varPhi (x, f(x))\) does not hold, we rely on two key properties of the constraint-search subroutine (Algorithm 3.2) and the ordering-correction subroutine (Algorithm 3.3) to establish that \(SC\) is self-correcting. The first, Property 7, requires that the constraint search either return \(\bot \), or else return an ordering constraint that is sufficient to establish \(\varPhi \).
Property 7
(Constraint search, Algorithm 3.2). Let \(\varPhi \) be a set of safe-ordering constraints, \(x : \mathbb {R}^n\) and \(y : \mathbb {R}^m\) two vectors, and let q be the result of the constraint search on \(\varPhi \), x, and y. Then q satisfies the following properties:
- (i) \(q = \bot \Longleftrightarrow \forall y'~.~\lnot \varPhi (x, y')\)
- (ii) \(q \ne \bot \implies (~\forall y'~.~q(y') \implies \varPhi (x, y')~)\)
Proof
The first observation is that the list of ordering constraints \(Q_p\) built by Algorithm 3.2 accurately models the initial set of safety constraints \(\varPhi \), i.e.,
\(\forall y'~.~\varPhi (x, y') \iff \exists q \in Q_p~.~q(y') \qquad (5)\)
This stems from the definition of the disjunctive normal form, and from the fact that the prioritization step (Sect. B.3) only performs a permutation of the disjuncts.
We also rely on the following loop invariant, stating that all disjuncts considered so far, when iterating over \(Q_p\), were unsatisfiable:
\(\forall q' \in Q_p~.~\texttt{idx} (q', Q_p) < \texttt{idx} (q, Q_p) \implies \forall y'~.~\lnot q'(y') \qquad (6)\)
Here, \(\texttt{idx} (q,Q_p)\) returns the index of constraint q in the list \(Q_p\). This invariant is trivially true when entering the loop, since the current constraint \(q_i\) is the first element of the list. Its preservation relies on correctly determining whether q is satisfiable, i.e., whether the corresponding ordering constraint admits a solution [35].
Combining these two facts, we can now establish that Algorithm 3.2 satisfies 7(i) and 7(ii). By definition, the constraint search outputs \(\bot \) if and only if it traverses the entire list \(Q_p\) without returning a \(q_i\). From loop invariant 6, this is equivalent to \(\forall q \in Q_p.~\forall y'.~\lnot q(y')\), which yields Property 7(i) from Eq. 5. Conversely, if the search outputs \(q \ne \bot \), then \(q \in Q_p\). We directly obtain Property 7(ii) since, for any \(y' : \mathbb {R}^m\), \(q(y')\) implies \(\varPhi (x, y')\) by application of Eq. 5.
Next, Property 8 states that the ordering-correction subroutine (Algorithm 3.3) correctly permutes the output of the network to satisfy the constraint that it is given. Combined with Property 7, this is sufficient to show that \(SC\) is a self-correcting transformer (Theorem 5).
Property 8
(Ordering correction, Algorithm 3.3). Let q be a satisfiable ordering constraint, and \(y : \mathbb {R}^m\) a vector. Then the output \(y'\) produced by Algorithm 3.3 on q and y satisfies q.
Proof
Let \(y_i < y_j\) be an atom in q. Reusing notation from Algorithm 3.3, let \(G_q\) be the dependency graph of q, \(y^s\) the result of sorting y in descending order, and \(\pi \) the permutation returned by the stable topological sort. We have that (j, i) is an edge in \(G_q\), which implies that \(\pi (j) < \pi (i)\) by Eq. 1. Because the elements of y are sorted in descending order, and assumed to be distinct (Definition 1), we obtain that \(y^s_{\pi (i)} < y^s_{\pi (j)}\), i.e., that \(y'_i < y'_j\).
Theorem 5
(\(SC\) is a self-correcting transformer). \(SC\) (Algorithm 3.1) satisfies conditions (i) and (ii) of Definition 2.
Proof
By definition of Algorithm 3.1, \(f^\varPhi (x) = \bot \) if and only if the constraint search (Algorithm 3.2) outputs \(\bot \). We derive from Property 7(i) that this is equivalent to \(\forall y'.~\lnot \varPhi (x, y')\), which corresponds exactly to Property 2(ii). Conversely, if \(\varPhi \) is satisfiable for input x, i.e., \(\exists y'.~\varPhi (x, y')\), then the search outputs \(q \ne \bot \). By definition, \(f^\varPhi (x)\) is then the output of Algorithm 3.3 on q and f(x), which satisfies q by application of Property 8, which in turn implies that \(\varPhi (x, f^\varPhi (x))\) by application of Property 7(ii).
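To make the case split used in this proof concrete, the following sketch illustrates the control flow; it is our own illustration, and the names sc_layer, find_constraint, and correct are hypothetical stand-ins for the subroutines of Algorithms 3.2 and 3.3, not the paper's implementation.

```python
def sc_layer(y, Q_p, find_constraint, correct):
    # Sketch of Algorithm 3.1's case split (hypothetical helper names):
    #  - find_constraint(Q_p, y) returns the first satisfiable disjunct q
    #    from the prioritized list Q_p, or None (standing in for ⊥);
    #  - correct(q, y) permutes the values of y so that q holds (Property 8).
    q = find_constraint(Q_p, y)
    if q is None:
        # No disjunct of the active postconditions is satisfiable, so Φ is
        # inconsistent at this input (Property 2(ii)).
        return None
    # The corrected output satisfies q, and hence Φ (Property 7(ii)).
    return correct(q, y)
```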
Now that we have demonstrated that our approach produces safe-by-construction networks, we next prove that it also preserves the top predicted class when possible, i.e., that \(SC\) satisfies transparency, as formalized in Property 1.
Let \(x : \mathbb {R}^n\) be an arbitrary vector. As in the previous section, if \(\varPhi (x, f(x))\) is initially satisfied, transparency trivially holds, as the correction layer does not modify the original output f(x). When \(\varPhi (x, f(x))\) does not hold, we rely on several additional properties of Algorithms 3.2 and 3.3. The first, Property 9, states that whenever the index of the network's top prediction is a root of the dependency graph of q, there exists an output which satisfies q and preserves that top prediction.
Property 9
(Dependency-graph roots). Let q be a satisfiable, disjunction-free ordering constraint, and \(y : \mathbb {R}^m\) a vector. Then,
\(\mathop {\textrm{argmax}}\limits _i\{y_i\} \in \text {Roots} (G_q) \iff \exists y'~.~q(y') \wedge \mathop {\textrm{argmax}}\limits _i\{y'_i\} = \mathop {\textrm{argmax}}\limits _i\{y_i\}\)
The intuition behind this property is that \(i^* := \mathop {\textrm{argmax}}\limits _i\{y_i\}\) belongs to the roots of \(G_q\) if and only if there is no \(y_{i^*} < y_j\) constraint in q; hence, since q is satisfiable, we can always permute indices in a solution \(y'\) to have \(\mathop {\textrm{argmax}}\limits _i\{y'_i\} = i^*\). Formally, Lemma 1 in Sect. B.1 entails this property, as it shows that the permutation returned by Algorithm B.1 satisfies it.
Next, Property 10 formalizes the requirement that whenever the constraint search (Algorithm 3.2) returns a constraint (rather than \(\bot \)), that constraint does not eliminate any top-prediction-preserving solutions that would otherwise have been compatible with the full set of safe-ordering constraints \(\varPhi \).
Property 10
(Constraint search, Algorithm 3.2). Let \(\varPhi \) be a set of safe-ordering constraints, \(x : \mathbb {R}^n\) and \(y : \mathbb {R}^m\) two vectors, and q the result of the constraint search on \(\varPhi \), x, and y. Then,
\(q \ne \bot \wedge (~\exists y'~.~\varPhi (x, y') \wedge \mathop {\textrm{argmax}}\limits _i\{y_i\} = \mathop {\textrm{argmax}}\limits _i\{y'_i\}~) \implies \exists y''~.~q(y'') \wedge \mathop {\textrm{argmax}}\limits _i\{y_i\} = \mathop {\textrm{argmax}}\limits _i\{y''_i\}\)
Proof
Let us assume that \(q \ne \bot \), and that \(\exists y'.~\varPhi (x, y') \wedge \mathop {\textrm{argmax}}\limits _i\{y_i\} = \mathop {\textrm{argmax}}\limits _i\{y'_i\}\). We proceed by contradiction, assuming that there does not exist \(y''\) such that \(q(y'')\) and \(\mathop {\textrm{argmax}}\limits _i\{y_i\} = \mathop {\textrm{argmax}}\limits _i\{y''_i\}\), which entails that \(\mathop {\textrm{argmax}}\limits _i\{y_i\} \notin \text {Roots} (G_q)\) by application of Property 9. In combination with the specification of the prioritization step (Property 3), this implies that any \(q' \in Q_p\) such that \(\exists y'.~q'(y') \wedge \mathop {\textrm{argmax}}\limits _i\{y_i\} = \mathop {\textrm{argmax}}\limits _i\{y'_i\}\) occurs before q in \(Q_p\), i.e., \(\texttt{idx} (q', Q_p) < \texttt{idx} (q, Q_p)\). From loop invariant 6, we therefore conclude that there does not exist such a \(q' \in Q_p\), which contradicts the hypothesis \(\varPhi (x, y')\) by application of Eq. 5.
Lastly, Property 11 states that the ordering correction (Algorithm 3.3) will always find an output that preserves the original top prediction whenever the constraint returned by the search allows it. This is the final piece needed to prove Theorem 6, the desired result about the self-correcting transformer.
Property 11
(Ordering correction, Algorithm 3.3). Let q be a satisfiable term, \(y : \mathbb {R}^m\) a vector, and \(y''\) the output of Algorithm 3.3 on q and y. Then,
\((~\exists y'~.~q(y') \wedge \mathop {\textrm{argmax}}\limits _i\{y'_i\} = \mathop {\textrm{argmax}}\limits _i\{y_i\}~) \implies \mathop {\textrm{argmax}}\limits _i\{y''_i\} = \mathop {\textrm{argmax}}\limits _i\{y_i\}\)
Proof
Assume that there exists \(y'\) such that \(q(y')\) and \(\mathop {\textrm{argmax}}\limits _i\{y_i\} = \mathop {\textrm{argmax}}\limits _i\{y'_i\}\). This entails that \(\mathop {\textrm{argmax}}\limits _i\{y_i\} \in \text {Roots} (G_q)\) (Property 9), which in turn implies that \(\pi (\mathop {\textrm{argmax}}\limits _i\{y_i\})\) is 0 (Property 4). By definition of a descending sort, \(\mathop {\textrm{argmax}}\limits _j\{y''_j\}\) is the index j such that \(\pi (j) = 0\), hence we conclude that \(j = \mathop {\textrm{argmax}}\limits _i\{y_i\}\) by injectivity of \(\pi \).
Theorem 6
(Transparency of \(SC\)). \(SC\), the self-correcting transformer described in Algorithm 3.1, satisfies Property 1.
Proof
That the transformer satisfies transparency is straightforward given Properties 9–11. Assume that there exists \(y'\) such that \(\varPhi (x, y')\) and \(\mathop {\textrm{argmax}}\limits _i\{y'_i\} = F(x)\). By application of Property 7(i), this implies that the constraint search outputs \(q \ne \bot \); and since F(x) is defined as \(\mathop {\textrm{argmax}}\limits _i\{f_i(x)\}\), Property 10 then implies that there exists \(y'\) such that \(q(y')\) and \(\mathop {\textrm{argmax}}\limits _i\{y'_i\} = F(x)\). Composing this fact with Property 11, we obtain that \(F^\varPhi (x) = F(x)\), since \(F^\varPhi (x) = \mathop {\textrm{argmax}}\limits _i\{f^\varPhi _i(x)\}\) by definition.
B Appendix 2: Vectorizing Self-Correction
Several of the subroutines of Algorithms 3.2 and 3.3 (presented in Sect. 3) operate on an ordering graph, which represents a conjunction of ordering literals, q. An ordering graph contains a vertex set, V, and edge set, E, where V contains a vertex, i, for each class in \(\{0, \ldots , m-1\}\), and E contains an edge, (i, j), from vertex i to vertex j if the literal \(y_j < y_i\) is in q. We represent an ordering graph as an \(m\times m\) adjacency matrix, M, defined according to Eq. 7: \(M_{ij} = 1\) if \((i, j) \in E\), and \(M_{ij} = 0\) otherwise.
Section B.1 describes the matrix-based algorithm that we use to conduct the stable topological sort that Algorithm 3.3 depends on. It is based on a classic parallel algorithm due to [9], which we modify to ensure that the SC-Layer satisfies transparency (Property 1). Section B.2 describes our approach to cycle detection, which is able to share much of its work with the topological sort. Finally, Sect. B.3 discusses efficiently prioritizing ordering constraints, needed to ensure that the SC-Layer satisfies transparency.
B.1 Stable Topological Sort
Our approach builds on a parallel topological sort algorithm given by [9], which is based on constructing an all pairs longest paths (APLP) matrix. However, this algorithm is not stable, in the sense that the resulting order depends only on the graph and not on the original order of the sequence, even when multiple valid orderings are possible. While this is sufficient for ensuring safety, it is not sufficient for transparency. We begin with background on constructing the APLP matrix, showing that it is compatible with a vectorized implementation, and then describe how it is used to perform a stable topological sort.
All Pairs Longest Paths. The primary foundation underpinning many of the graph algorithms in this section is the all pairs longest paths (APLP) matrix, which we denote by P. On acyclic graphs, \(P_{ij}\) for \(i,j \in [m]\) is defined to be the length of the longest path from vertex i to vertex j. In the absence of cycles, the distance from a vertex to itself, \(P_{ii}\), is defined to be 0. For vertices i and j for which there is no path from i to j, we let \(P_{ij} = -\infty \).
We compute P from M using a matrix-based algorithm from [9], which requires taking \(O(\log {m})\) matrix max-distance products, where the max-distance product is equivalent to a matrix multiplication in which element-wise multiplications have been replaced by additions and element-wise additions by the pairwise maximum. That is, a matrix product can be abstractly written with respect to operations \(\otimes \) and \(\oplus \) according to Eq. 8, i.e., \((A \cdot B)_{ij} = \bigoplus _k A_{ik} \otimes B_{kj}\), and the max-distance product corresponds to the case where \(x \otimes y := x + y\) and \(x \oplus y := \max \{x, y\}\).
Using this matrix product, \(P = M^{2^{\lceil \log _2(m)\rceil }}\) can be computed recursively from M by fast matrix exponentiation, as described in Eq. 9, i.e., by repeatedly squaring \(\lceil \log _2(m)\rceil \) times.
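As an illustration of this construction (our own sketch, not the paper's released implementation), the following NumPy code computes P from a 0/1 adjacency matrix M using max-distance products and repeated squaring; the function names are ours.

```python
import numpy as np

def max_distance_product(A, B):
    # (A . B)_ij = max_k (A_ik + B_kj): a matrix product where element-wise
    # multiplication is replaced by addition, and addition by maximum.
    return np.max(A[:, :, None] + B[None, :, :], axis=1)

def all_pairs_longest_paths(M):
    # Initialize path lengths: 1 for each edge, 0 on the diagonal, and
    # -inf where no path has been found yet.
    m = M.shape[0]
    P = np.where(M > 0, 1.0, -np.inf)
    np.fill_diagonal(P, 0.0)
    # Repeated squaring: after t rounds, P accounts for all paths of at
    # most 2^t edges, so ceil(log2(m)) rounds suffice.
    for _ in range(int(np.ceil(np.log2(max(m, 2))))):
        P = max_distance_product(P, P)
    return P
```

The cubic-memory broadcast in max_distance_product is only for clarity; a GPU implementation would use a batched max-plus kernel instead.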

Fig. 2. Example trace of Algorithm B.1. (a): The dependency graph and original logit values, y. The non-bracketed number indicates the logit index and the number in brackets is the logit value, e.g., \(y_0 = 2\). Arrows indicate a directed edge in the dependency graph; e.g., we require \(y_4 < y_0\). (b): updated values passed into argsort as a tuple. For example, \(y_4\) is assigned (2, 1), as its smallest ancestor (\(y_0\)) has logit value 2 in (a) and its depth is 1; and \(y_2\) is assigned (1, 2) because its logit value in (a), 1, is already smaller than that of any of its parents, and its depth is 2. The values are sorted by decreasing value and increasing depth, so the final order is \(\langle y_1, y_3, y_0, y_4, y_2 \rangle \), corresponding to the permutation \(\pi \), where \(\pi (0)=2\), \(\pi (1)=0\), \(\pi (2)=4\), \(\pi (3)=1\), and \(\pi (4)=3\).
Stable Sort. We propose a stable variant of the [9] topological sort, shown in Algorithm B.1. Crucially, this variant satisfies Property 4 (Lemma 1), which Sect. 3.2 identifies as sufficient for ensuring transparency. Essentially, the value of each logit \(y_j\) is adjusted so that it is at least as small as the smallest logit value corresponding to vertices that are parents of vertex j, including j itself. A vertex, i, is a parent of vertex j if \(P_{ij} \ge 0\), meaning that there is some path from vertex i to vertex j, or \(i = j\). The logits are then sorted in descending order, with ties broken in favor of minimum depth in the dependency graph. The depth of vertex j is the maximum of the \(j^\text {th}\) column of P, i.e., the length of the longest path from any vertex to j. An example trace of Algorithm B.1 is given in Fig. 2. By adjusting \(y_j\) into \(v_j\) such that \(v_i \ge v_j\) for all ancestors, i, of j, we ensure that each child vertex appears after each of its parents in the returned ordering (once ties have been broken by depth), as the child's depth will always be strictly larger than that of any of its parents: a path of length d to an immediate parent of vertex j implies the existence of a path of length \(d+1\) to vertex j.
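A minimal NumPy rendering of the procedure just described, assuming the P matrix from the APLP sketch above; the function names and tuple-based tie-breaking are our paraphrase of Algorithm B.1, not the paper's code.

```python
import numpy as np

def stable_topological_sort(P, y):
    # v_j: smallest original logit among the parents of j (vertices i with
    # P[i, j] >= 0, which includes j itself since P[j, j] = 0).
    m = len(y)
    parents = P >= 0
    v = np.where(parents, y[:, None], np.inf).min(axis=0)
    # depth_j: length of the longest path into j (max of the j-th column of P).
    depth = P.max(axis=0)
    # Sort by decreasing adjusted value, breaking ties by increasing depth.
    order = sorted(range(m), key=lambda j: (-v[j], depth[j]))
    # pi[i] is the position of vertex i in the resulting order.
    pi = np.empty(m, dtype=int)
    pi[order] = np.arange(m)
    return pi

def correct_ordering(P, y):
    # Reassign the descending-sorted logit values according to pi, so that
    # vertex i receives the pi(i)-th largest value (cf. proof of Property 8).
    pi = stable_topological_sort(P, y)
    return np.sort(y)[::-1][pi]
```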
Lemma 1
Algorithm B.1 satisfies Property 4.
Proof
Note that the adjusted logit values, v, are chosen according to Eq. 10, i.e., \(v_j = \min _{i\,:\,P_{ij} \ge 0}\{y_i\}\).
We observe that (i) for all root vertices, i, \(v_i = y_i\), and (ii) the root vertex with the highest original logit value will appear first in the topological ordering. The former follows from the fact that root vertices have no ancestors. The latter subsequently follows from the fact that the first element in a valid topological ordering must correspond to a root vertex. Thus, if \(\mathop {\textrm{argmax}}\limits _i\{y_i\} = i^* \in \text {Roots} (g)\), then \(i^*\) is the root vertex with the highest logit value, and so by (ii), it will appear first in the topological ordering produced by Algorithm B.1, establishing Property 4.
B.2 Cycle Detection
The satisfiability check, a subroutine of the constraint search (Algorithm 3.2), determines whether an ordering constraint, q, is satisfiable by looking for cycles in the corresponding dependency graph. Here we observe that the existence of a cycle can easily be decided from P, by checking whether \(P_{ii} > 0\) for some \(i\in [m]\); i.e., whether there exists a non-zero-length path from some vertex to itself. Since \(P_{ii} \ge 0\), this is equivalent to \(\text {Trace}(P) > 0\). While, strictly speaking, \(P_{ij}\) as constructed by [9] only reflects the longest path from i to j in acyclic graphs, it can nonetheless be used to detect cycles in this way: for any \(k \le m\), \(P_{ij}\) is guaranteed to be at least k if there exists a path of length k from i to j, and any cycle has length at most m.
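Continuing the sketch above (our own illustration), the check amounts to a single reduction over the diagonal of P:

```python
import numpy as np

def has_cycle(P):
    # P_ii >= 0 always holds, so a strictly positive trace is equivalent to
    # the existence of a non-zero-length path from some vertex to itself.
    return np.trace(P) > 0
```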
B.3 Prioritizing Root Vertices
As specified in Property 3, in order to satisfy transparency, the search for a satisfiable ordering constraint performed by Algorithm 3.2 must prioritize constraints, q, in which the original predicted class, F(x), is a root vertex in q's corresponding dependency graph. We observe that root vertices can be easily identified using the dependency matrix M. The in-degree, \(d^\textit{in}_j\), of vertex j is simply the sum of the \(j^\text {th}\) column of M, i.e., \(d^\textit{in}_j = \sum _{i \in [m]} M_{ij}\) (Eq. 11). Meanwhile, the root vertices are precisely those vertices with no ancestors, that is, those vertices j satisfying \(d^\textit{in}_j = 0\).
In the context of Algorithm 3.2, the prioritization subroutine lists ordering constraints q for which \(d^\textit{in}_{F(x)} = 0\) before any other ordering constraints. To save memory, we do not explicitly list and sort all the disjuncts of \(Q_x\) (the DNF form of the active postconditions for x); rather, we iterate through them one at a time. This can be done by, e.g., iterating through each disjunct twice, initially skipping any disjunct in which F(x) is not a root vertex, and subsequently skipping those in which F(x) is a root vertex, as sketched below.
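A sketch of this two-pass iteration (our own rendering; here M ranges over the adjacency matrices of the disjuncts of \(Q_x\)):

```python
import numpy as np

def is_root(M, j):
    # Vertex j is a root iff its in-degree, the sum of the j-th column of M,
    # is zero (Eq. 11).
    return M[:, j].sum() == 0

def prioritized_disjuncts(disjuncts, predicted_class):
    # First pass: disjuncts in which the originally predicted class is a
    # root; second pass: the remaining disjuncts. No sorted copy is built.
    for M in disjuncts:
        if is_root(M, predicted_class):
            yield M
    for M in disjuncts:
        if not is_root(M, predicted_class):
            yield M
```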
C Appendix 3: Generation of Synthetic Data
In Sect. 5.4, we utilize a family of synthetic datasets with associated safe-ordering constraints that are randomly generated according to several specified parameters, allowing us to assess how aspects such as the number of constraints (\(\alpha \)), the number of disjunctions per constraint (\(\beta \)), and the dimension of the output vector (m) impact the run-time overhead. In our experiments, we fix the input dimension, n, to be 10. The synthetic data, which we denote by \(\mathcal D(\alpha , \beta , m)\), are generated according to the following procedure (a sketch of steps (i) and (ii) follows the list).
- (i) First, we generate \(\alpha \) random safe-ordering constraints. The preconditions take the form \(b_\ell \le x \le b_u\), where \(b_\ell \) is drawn uniformly at random from \([0.0,1.0 - \epsilon ]\) and \(b_u := b_\ell + \epsilon \). We choose \(\epsilon = 0.4\) in our experiments; as a result, the probability that any two preconditions overlap is approximately 30%. The ordering constraints are disjunctions of \(\beta \) randomly-generated cycle-free ordering graphs of m vertices, i.e., \(\beta \) disjuncts. Specifically, in each graph, we include each edge, (i, j), for \(i \ne j\) with equal probability, requiring that at least one edge is included and that the expected number of edges is \(\gamma \) (we use \(\gamma = 3\) in all of our experiments). Graphs with cycles are resampled until a graph with no cycles is drawn.
- (ii) Next, for each safe-ordering constraint, \(\phi \), we sample \(\nicefrac {N}{\alpha }\) random inputs, x, uniformly from the range specified by the precondition of \(\phi \). In all of our experiments we let \(N = \text {2,000}\). For each x, we select a random disjunct from the postcondition of \(\phi \), and find the roots of the corresponding ordering graph. We select a label, \(y^*\), for x uniformly at random from this set of roots, i.e., we pick a random label for each point that is consistent with the property for that point.
- (iii) Finally, we generate N random points that do not satisfy any of the preconditions of the \(\alpha \) safe-ordering constraints. We label these points via a classifier trained on the N labeled points already generated in (ii). This results in a dataset of 2N labeled points, where 50% of the points are captured by at least one safe-ordering constraint.
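The following sketch covers steps (i) and (ii) under our own reading of the parameters above (\(\alpha \), \(\beta \), \(\gamma \), \(\epsilon \), m, and N as described); step (iii) is omitted since it requires training an auxiliary classifier, and all function names are ours.

```python
import numpy as np
from graphlib import TopologicalSorter, CycleError

rng = np.random.default_rng(0)

def random_ordering_graph(m, gamma=3):
    # Cycle-free ordering graph over m vertices: each edge (i, j), i != j, is
    # included with equal probability (expected number of edges = gamma), at
    # least one edge is required, and cyclic graphs are resampled.
    p = gamma / (m * (m - 1))
    while True:
        A = (rng.random((m, m)) < p).astype(int)
        np.fill_diagonal(A, 0)
        if A.sum() == 0:
            continue
        preds = {j: {int(i) for i in np.flatnonzero(A[:, j])} for j in range(m)}
        try:
            list(TopologicalSorter(preds).static_order())
            return A
        except CycleError:
            continue

def random_constraint(n=10, m=5, beta=2, eps=0.4):
    # Precondition b_l <= x <= b_u with b_u = b_l + eps (per input dimension),
    # and a postcondition that is a disjunction of beta ordering graphs.
    b_l = rng.uniform(0.0, 1.0 - eps, size=n)
    return (b_l, b_l + eps), [random_ordering_graph(m) for _ in range(beta)]

def sample_constrained_point(constraint):
    # Draw x from the precondition's box, and a label uniformly from the
    # roots of a randomly chosen disjunct of the postcondition.
    (b_l, b_u), disjuncts = constraint
    x = rng.uniform(b_l, b_u)
    A = disjuncts[rng.integers(len(disjuncts))]
    roots = np.flatnonzero(A.sum(axis=0) == 0)
    return x, int(rng.choice(roots))

# Steps (i) and (ii): alpha constraints, then N / alpha labeled points each.
alpha, N = 4, 2000
constraints = [random_constraint() for _ in range(alpha)]
data = [sample_constrained_point(phi) for phi in constraints for _ in range(N // alpha)]
```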
Copyright information
© 2022 The Author(s), under exclusive license to Springer Nature Switzerland AG
About this paper
Cite this paper
Leino, K., Fromherz, A., Mangal, R., Fredrikson, M., Parno, B., Păsăreanu, C. (2022). Self-correcting Neural Networks for Safe Classification. In: Isac, O., Ivanov, R., Katz, G., Narodytska, N., Nenzi, L. (eds.) Software Verification and Formal Methods for ML-Enabled Autonomous Systems. NSV FoMLAS 2022. Lecture Notes in Computer Science, vol. 13466. Springer, Cham. https://doi.org/10.1007/978-3-031-21222-2_7