Towards Better Out-of-Distribution Generalization of Neural Algorithmic Reasoning Tasks

Mahdavi, Sadegh; Swersky, Kevin; Kipf, Thomas; Hashemi, Milad; Thrampoulidis, Christos; Liao, Renjie

arXiv:2211.00692 (cs)

[Submitted on 1 Nov 2022 (v1), last revised 18 Mar 2023 (this version, v2)]

Abstract: In this paper, we study the out-of-distribution (OOD) generalization of neural algorithmic reasoning tasks, where the goal is to learn an algorithm (e.g., sorting, breadth-first search, and depth-first search) from input-output pairs using deep neural networks. First, we argue that OOD generalization in this setting is significantly different from common OOD settings. For example, some phenomena in OOD generalization of image classification, such as \emph{accuracy on the line}, are not observed here, and techniques such as data augmentation do not help because the assumptions underlying many augmentation techniques are often violated. Second, we analyze the main challenges (e.g., input distribution shift, non-representative data generation, and uninformative validation metrics) of the current leading benchmark, i.e., CLRS \citep{deepmind2021clrs}, which contains 30 algorithmic reasoning tasks. We propose several solutions, including a simple-yet-effective fix to the input distribution shift and improved data generation. Finally, we propose an attention-based 2WL-graph neural network (GNN) processor which complements message-passing GNNs so that their combination outperforms the state-of-the-art model by a 3% margin averaged over all algorithms. Our code is available at: \url{this https URL}.

Comments:	Transactions on Machine Learning Research (TMLR), 2023
Subjects:	Machine Learning (cs.LG)
Cite as:	arXiv:2211.00692 [cs.LG]
	(or arXiv:2211.00692v2 [cs.LG] for this version)
	https://doi.org/10.48550/arXiv.2211.00692

Submission history

From: Sadegh Mahdavi
[v1] Tue, 1 Nov 2022 18:33:20 UTC (130 KB)
[v2] Sat, 18 Mar 2023 08:23:33 UTC (198 KB)
