Symbols in Engineering Drawings (SiED): An Imbalanced Dataset Benchmarked by Convolutional Neural Networks

Elyan, Eyad; Moreno-García, Carlos Francisco; Johnston, Pamela

doi:10.1007/978-3-030-48791-1_16

Eyad Elyan⁷,
Carlos Francisco Moreno-García⁷ &
Pamela Johnston⁷

Part of the book series: Proceedings of the International Neural Networks Society ((INNS,volume 2))

Included in the following conference series:

International Conference on Engineering Applications of Neural Networks

1404 Accesses
11 Citations

Abstract

Engineering drawings are common across different domains such as Oil & Gas, construction, mechanical and other domains. Automatic processing and analysis of these drawings is a challenging task. This is partly due to the complexity of these documents and also due to the lack of dataset availability in the public domain that can help push the research in this area. In this paper, we present a multiclass imbalanced dataset for the research community made of 2432 instances of engineering symbols. These symbols were extracted from a collection of complex engineering drawings known as Piping and Instrumentation Diagram (P&ID). By providing such dataset to the research community, we anticipate that this will help attract more attention to an important, yet overlooked industrial problem, and will also advance the research in such important and timely topics. We discuss the datasets characteristics in details, and we also show how Convolutional Neural Networks (CNNs) perform on such extremely imbalanced datasets. Finally, conclusions and future directions are discussed.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Subscribe and save

Springer+ Basic

¥17,985 /Month

Get 10 units per month
Download Article/Chapter or eBook
1 Unit = 1 Article or 1 Chapter
Cancel anytime

Buy Now

Chapter: JPY 3498; Price includes VAT (Japan)

eBook: JPY 22879; Price includes VAT (Japan)

Softcover Book: JPY 28599; Price includes VAT (Japan)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

A Multiclass Imbalanced Dataset Classification of Symbols from Piping and Instrumentation Diagrams

New trends on digitisation of complex engineering drawings

Article Open access 13 June 2018

A review of deep learning methods for digitisation of complex documents and engineering diagrams

Article Open access 09 May 2024

References

Ablameyko, S.V., Uchida, S.: Recognition of engineering drawing entities: review of approaches. Int. J. Image Graph. 07(04), 709–733 (2007)
Article Google Scholar
Ahmed, S., Liwicki, M., Weber, M., Dengel, A.: Automatic room detection and room labeling from architectural floor plans. In: 2012 10th IAPR International Workshop on Document Analysis Systems, pp. 339–343, March 2012
Google Scholar
Ali-Gombe, A., Elyan, E., Jayne, C.: Multiple fake classes GAN for data augmentation in face image dataset. In: 2019 International Joint Conference on Neural Networks (IJCNN), pp. 1–8, July 2019
Google Scholar
Ali-Gombe, A., Elyan, E.: Mfc-gan: class-imbalanced dataset classification using multiple fake class generative adversarial network. Neurocomputing 361, 212–221 (2019)
Article Google Scholar
Arroyo, E., Fay, A., Chioua, M., Hoernicke, M.: Integrating plant and process information as a basis for automated plant diagnosis tasks. In: Proceedings of the 2014 IEEE Emerging Technology and Factory Automation (ETFA), pp. 1–8, September 2014
Google Scholar
Blostein, D.: General Diagram-Recognition Methodologies. In: Proceedings of the 1st International Conference on Graphics Recognition (GREC 1995), pp. 200–212 (1995)
Google Scholar
Moreno-García, C.F., Elyan, E., Jayne, C.: New trends on digitisation of complex engineering drawings. Neural Computing and Applications, June 2018
Google Scholar
Chhabra, A.K.: Graphics Recognition Algorithms and Systems. In: Proceedings of the 2nd International Conference on Graphics Recognition (GREC 1997 ), pp. 244–252 (1997)
Google Scholar
Elyan, E., Moreno-Garcia, C.F., Jayne, C.: Symbols classification in engineering drawings. In: 2018 International Joint Conference on Neural Networks (IJCNN), pp. 1–8, July 2018
Google Scholar
Eyad Elyan and Mohamed Medhat Gaber: A fine-grained random forests using class decomposition: an application to medical diagnosis. Neural Comput. Appl. 27(8), 2279–2288 (2016)
Article Google Scholar
Eyad Elyan and Mohamed Medhat Gaber: A genetic algorithm approach to optimising random forests applied to class engineered data. Inf. Sci. 384, 220–234 (2017)
Article Google Scholar
Esteva, A., Robicquet, A., Ramsundar, B., Kuleshov, V., DePristo, M., Chou, K., Cui, C., Corrado, G., Thrun, S., Dean, J.: A guide to deep learning in healthcare. Nat. Med. 25(1), 24–29 (2019)
Article Google Scholar
Goh, K.N., Mohd. Shukri, S.R., Manao, R.B.H.: Automatic assessment for engineering drawing. In: Zaman, H.B., Robinson, P., Olivier, P., Shih, T.K., Velastin, S. (eds.) Advances in Visual Informatics, pp. 497–507. Springer, Cham (2013)
Chapter Google Scholar
Gonzalez, R.C., Woods, R.E.: Digital Image Processing. Prentice Hall, Upper Saddle River (2008)
Google Scholar
Goodfellow, I., Bengio, Y., Courville, A.: Deep Learning. MIT Press, 2016. http://www.deeplearningbook.org
Jiuxiang, G., Wang, Z., Kuen, J., Ma, L., Shahroudy, A., Shuai, B., Liu, T., Wang, X., Wang, G., Cai, J., Chen, T.: Recent advances in convolutional neural networks. Pattern Recogn. 77, 354–377 (2018)
Article Google Scholar
Holcomb, S.D., Porter, W.K., Ault, S.V., Mao, G., Wang, J.: Overview on deepmind and its alphago zero AI. In: Proceedings of the 2018 International Conference on Big Data and Education, ICBDE 2018, pp. 67–71. ACM, New York (2018)
Google Scholar
Howie, C., Kunz, J., Binford, T., Chen, T., Law, K.H.: Computer interpretation of process and instrumentation drawings. Adv. Eng. Softw. 29(7), 563–570 (1998)
Article Google Scholar
Kanungo, T., Haralick, R.M., Dori, D.: Understanding engineering drawings: a survey. In: Proceedings of the 1st International Conference on Graphics Recognition (GREC 1995), pp. 119–130 (1995)
Google Scholar
Krizhevsky, A., Sutskever, I., Hinton, G.E.: Imagenet classification with deep convolutional neural networks. Commun. ACM 60(6), 84–90 (2017)
Article Google Scholar
Kulkarni, C.R., Barbadekar, A.B.: Text detection and recognition: a review. Int. Res. J. Eng. Technol. (IRJET) 4(6), 179–185 (2017)
Google Scholar
Lecun, Y., Bottou, L., Bengio, Y., Haffner, P.: Gradient-based learning applied to document recognition. Proc. IEEE 86(11), 2278–2324 (1998)
Article Google Scholar
Lu, Y.: Machine printed character segmentation - an overview. Pattern Recogn. 28(1), 67–80 (1995)
Article MathSciNet Google Scholar
Moreno-Garcia, C.F., Elyan, E.: Digitisation of assets from the oil gas industry: challenges and opportunities. In: 2019 International Conference on Document Analysis and Recognition Workshops (ICDARW), vol. 7, pp. 2–5, September 2019
Google Scholar
Moreno-García, C.F., Elyan, E., Jayne, C.: Heuristics-based detection to improve text / graphics segmentation in complex engineering drawings. Eng. Appl. Neural Netw., volume CCIS 744, 87–98 (2017)
Article Google Scholar
Mori, S., Suen, C.Y., Yamamoto, K.: Historical review of ocr research and development. Proc. IEEE 80(7), 1029–1058 (1992)
Article Google Scholar
Park, U., Jain, A.K.: Face matching and retrieval using soft biometrics. IEEE Trans. Inf. Forens. Secur. 5(3), 406–415 (2010)
Article Google Scholar
Shone, N., Ngoc, T.N., Phai, V.D., Shi, Q.: A deep learning approach to network intrusion detection. IEEE Trans. Emerg. Topics Comput. Intell. 2(1), 41–50 (2018)
Article Google Scholar
Srivastava, N., Hinton, G., Krizhevsky, A., Sutskever, I., Salakhutdinov, R.: Dropout: a simple way to prevent neural networks from overfitting. J. Mach. Learn. Res. 15, 1929–1958 (2014)
MathSciNet MATH Google Scholar
Szegedy, C., Liu, W., Jia, Y., Sermanet, P., Reed, S., Anguelov, D., Erhan, D., Vanhoucke, V., Rabinovich, A.: Going deeper with convolutions. In: 2015 IEEE Conference on Computer Vision and Pattern Recognition (CVPR), pp. 1–9, June 2015
Google Scholar
Taigman, Y., Yang, M., Ranzato, M., Wolf, L.: Deepface: closing the gap to human-level performance in face verification. In: 2014 IEEE Conference on Computer Vision and Pattern Recognition, pp. 1701–1708, June 2014
Google Scholar
Tombre, K., Tabbone, S., Lamiroy, B., Dosch, P.: Text/Graphics separation revisited. Document Anal. Syst. 2423, 200–211 (2002)
Article Google Scholar
Vaxiviere, P., Tombre, K.: Celesstin: CAD conversion of mechanical drawings. Computer 25(7), 46–54 (1992)
Article Google Scholar
Vuttipittayamongkol, P., Elyan, E.: Neighbourhood-based undersampling approach for handling imbalanced and overlapped data. Inf. Sci. 509, 47–70 (2020)
Article Google Scholar
Vuttipittayamongkol, P., Elyan, E., Petrovski, A., Jayne, C.: Overlap-based undersampling for improving imbalanced data classification. In: Yin, H., Camacho, D., Novais, P., Tallon-Ballesteros, A. (eds.) Intelligent Data Engineering and Automated Learning, pp. 689–697. Springer, Cham (2018)
Chapter Google Scholar
Yang, Z., Yang, D., Dyer, C., He, X., Smola, A., Hovy, E.: Hierarchical attention networks for document classification. In: Proceedings of the 2016 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, pp. 1480–1489. Association for Computational Linguistics, San Diego, June 2016
Google Scholar
Zhang, D., Lu, G.: Review of shape representation and description techniques. Pattern Recogn. 37(1), 1–19 (2004)
Article Google Scholar

Download references

Author information

Authors and Affiliations

The Robert Gordon University, Garthdee Road, Aberdeen, UK
Eyad Elyan, Carlos Francisco Moreno-García & Pamela Johnston

Authors

Eyad Elyan
View author publications
You can also search for this author in PubMed Google Scholar
Carlos Francisco Moreno-García
View author publications
You can also search for this author in PubMed Google Scholar
Pamela Johnston
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Eyad Elyan .

Editor information

Editors and Affiliations

School of Engineering, Department of Civil Engineering, Democritus University of Thrace, Xanthi, Greece
Lazaros Iliadis
Lancaster University, Lancaster, UK
Plamen Parvanov Angelov
School of Computing and Digital Technologies, Teesside University, Middlesbrough, UK
Chrisina Jayne
University of the West of England, Bristol, UK
Elias Pimenidis

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Elyan, E., Moreno-García, C.F., Johnston, P. (2020). Symbols in Engineering Drawings (SiED): An Imbalanced Dataset Benchmarked by Convolutional Neural Networks. In: Iliadis, L., Angelov, P., Jayne, C., Pimenidis, E. (eds) Proceedings of the 21st EANN (Engineering Applications of Neural Networks) 2020 Conference. EANN 2020. Proceedings of the International Neural Networks Society, vol 2. Springer, Cham. https://doi.org/10.1007/978-3-030-48791-1_16

Download citation

DOI: https://doi.org/10.1007/978-3-030-48791-1_16
Published: 28 May 2020
Publisher Name: Springer, Cham
Print ISBN: 978-3-030-48790-4
Online ISBN: 978-3-030-48791-1
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics