PyGTED: Python Application for Computing Graph Traversal Edit Distance
- PMID: 32160033
- PMCID: PMC7207050
- DOI: 10.1089/cmb.2019.0510
PyGTED: Python Application for Computing Graph Traversal Edit Distance
Abstract
Graph Traversal Edit Distance (GTED) is a measure of distance (or dissimilarity) between two graphs introduced. This measure is based on the minimum edit distance between two strings formed by the edge labels of respective Eulerian traversals of the two graphs. GTED was motivated by and provides the first mathematical formalism for sequence coassembly and de novo variation detection in bioinformatics. Many problems in applied machine learning deal with graphs (also called networks), including social networks, security, web data mining, protein function prediction, and genome informatics. The kernel paradigm beautifully decouples the learning algorithm from the underlying geometric space, which renders graph kernels important for the aforementioned applications. In this article, we introduce a tool, PyGTED to compute GTED. It implements the algorithm based on the polynomial time algorithm devised for it by the authors. Informally, the GTED is the minimum edit distance between two strings formed by the edge labels of respective Eulerian traversals of the two graphs.
Keywords: clustering genera; coassembly; de novo variation detaction; graph comparison; graph kernel; linear programming.
Conflict of interest statement
The authors declare they have no conflicting financial interests.
Similar articles
-
Graph Traversal Edit Distance and Extensions.J Comput Biol. 2020 Mar;27(3):317-329. doi: 10.1089/cmb.2019.0511. Epub 2020 Feb 13. J Comput Biol. 2020. PMID: 32058803 Free PMC article.
-
Revisiting the complexity of and algorithms for the graph traversal edit distance and its variants.Algorithms Mol Biol. 2024 Apr 29;19(1):17. doi: 10.1186/s13015-024-00262-6. Algorithms Mol Biol. 2024. PMID: 38679703 Free PMC article.
-
Revisiting the Complexity of and Algorithms for the Graph Traversal Edit Distance and Its Variants.ArXiv [Preprint]. 2023 Nov 8:arXiv:2305.10577v2. ArXiv. 2023. Update in: Algorithms Mol Biol. 2024 Apr 29;19(1):17. doi: 10.1186/s13015-024-00262-6 PMID: 37292475 Free PMC article. Updated. Preprint.
-
Graph embedding and geometric deep learning relevance to network biology and structural chemistry.Front Artif Intell. 2023 Nov 16;6:1256352. doi: 10.3389/frai.2023.1256352. eCollection 2023. Front Artif Intell. 2023. PMID: 38035201 Free PMC article. Review.
-
On comparing neuronal morphologies with the constrained tree-edit-distance.Neuroinformatics. 2009 Sep;7(3):191-4. doi: 10.1007/s12021-009-9053-2. Epub 2009 Jul 28. Neuroinformatics. 2009. PMID: 19636974 Review.
Cited by
-
Graph Traversal Edit Distance and Extensions.J Comput Biol. 2020 Mar;27(3):317-329. doi: 10.1089/cmb.2019.0511. Epub 2020 Feb 13. J Comput Biol. 2020. PMID: 32058803 Free PMC article.
References
-
- Boroojeny A.E., Shrestha A., Sharifi-Zarchi A., et al. . 2018. Gted: Graph traversal edit distance, 37–53. In International Conference on Research in Computational Molecular Biology. Springer, Paris, France
-
- Ebrahimpour Boroojeny A. 2019. Theory of Graph Traversal Edit Distance, Extensions, and Applications. Ph.D. Thesis. Available from ProQuest Dissertations & Theses Global database. (Accession Order No. AAT 13812734)
-
- Smith T.F., and Waterman M.S.. 1981. Identification of common molecular subsequences. J Mol Biol 147, 195–197 - PubMed
MeSH terms
LinkOut - more resources
Full Text Sources