Abstract
In order to identify variations between two or several versions of Clinical Practice Guidelines, we propose a method based on the detection of noun phrases. Currently, we are developing a comparison approach to extract similar and different elements between medical documents in French in order to identify any significant changes such as new medical terms or concepts, new treatments etc. In this paper, we describe a basic initial step for this comparison approach i.e. detecting noun phrases. This step is based on patterns constructed from six main medical terminologies used in document indexing. The patterns are constructed by using a Tree Tagger. To avoid a great number of generated patterns, the most relevant ones are selected that are able identify more than 80% of the six terminologies used in this study. These steps allowed us to obtain a manageable list of 262 patterns which have been evaluated. Using this list of patterns, 708 maximal noun phrases were found, with, 364 correct phrases which represent a 51.41% precision. However by detecting these phrases manually, 602 maximal noun phrases were found which represent a 60.47% recall and therefore a 55.57% F-measure. We attempted to improve these results by increasing the number of patterns from 262 to 493. A total of 729 maximal noun phrases were obtained, with 365 which were correct, and corresponded to a 50.07% precision, 60.63% recall and 54.85% F-measure.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
Similar content being viewed by others
References
Grimshaw, J.M., Russell, I.T.: Effect of clinical guidelines on medical practice: a systematic review of rigorous evaluations. The Lancet. 342(8883), 1317–1322 (1993)
Bouaud, J., Séroussi, B., Brizon, A., Culty, T., Mentré, F., Ravery, V.: How updating uextual Clinical Practice Guidelines impacts Clinical Decision Support Systems: a case study with Bladder Cancer Management. Med. Info., 829–833 (2007)
Schmid, H.: Probabilistic part-of-speech tagging using decision trees. In: Proceedings of International Conference on New Methods in Language Processing, Manchester, UK, vol. 12, pp. 44–49 (1994)
Cheah, T.S.: The impact of clinical guidelines and clinical pathways on medical practice: effectiveness and medico-legal aspects. Annals-Academy of Medicine Singapore 27, 533–539 (1998)
Roche, C.: Terminologie et ontologie. Armand. Colin. 48–62(2005) (in French)
Lefèvre, P.: La recherche d’informations: du texte intégral au thésaurus. Hermes Science (2000) (in French)
Nelson, S.J., Johnston, W.D., Humphreys, B.L.: Relationships in medical subject headings (MeSH). In: Relationships in the Organization of Knowledge, pp. 171–184. Springer (2001)
Cornet, R., de Keizer, N.: Forty years of SNOMED: a literature review. BMC Medical Informatics and Decision Making 8( suppl.1), 1–6 (2008)
Brown, E.G., Wood, L., Wood, S.: The medical dictionary for regulatory activities (MedDRA). Drug Saf. 20, 109–117 (1999)
Skrbo, A., Begović, B., Skrbo, S.: Classification of drugs using the ATC system (Anatomic, Therapeutic, Chemical Classification) and the latest changes. Medicinski. Arhiv. 58(1 suppl. 2), 138 (2004)
Rosse, C., Mejino Jr., José, L.V.: A reference ontology for biomedical informatics: the Foundational Model of Anatomy. Journal of Biomedical Informatics 36(6), 478–500 (2003)
Merabti, T., Soualmia, L., Grosjean, J., Palombi, O., Müller, JM., Darmoni, S.: Translating the Foundational Model of Anatomy into French using knowledge-based and lexical methods. BMC Medical Informatics and Decision Making 11(1), 65 (2011)
Dice, L.R.: Measures of the amount of ecologic association between species. Ecology 26(3), 297–302 (1945)
Levenshtein, V.: Binary codes capable of correcting deletions, insertions and reversals. Soviet Physics-Doklady 10 (1965)
Stoilos, G., Stamou, G., Kollias, S.D.: A string metric for ontology alignment. In: Gil, Y., Motta, E., Benjamins, V.R., Musen, M.A. (eds.) ISWC 2005. LNCS, vol. 3729, pp. 624–637. Springer, Heidelberg (2005)
Yujian, L., Bo, L.: A normalized Levenshtein distance metric. IEEE Transactions on Pattern Analysis and Machine Intelligence 29(6), 1091–1095 (2007)
Winkler, W.E.: The state of record linkage and current research problems. Technical report: Statistics of Income Division, Internal Revenue Service Publication (1999)
Daille, B.: Conceptual structuring through term variations. In: Proceedings of the ACL 2003 Workshop on Multiword Expressions: Analysis, Acquisition and Treatment, vol. 18, pp. 9–16. Association for Computational Linguistics (2003)
Aubin, S., Hamon, T.: Improving term extraction with terminological resources. In: Salakoski, T., Ginter, F., Pyysalo, S., Pahikkala, T. (eds.) FinTAL 2006. LNCS (LNAI), vol. 4139, pp. 380–387. Springer, Heidelberg (2006)
Cunningham, H., Maynard, D., Bontcheva, K., Tablan, V., Aswani, N., Roberts, I., Gorrell, G., Funk, A., Roberts, A., Damljanovic, D., Heitz, T., Greenwood, M.A., Saggion, H., Petrak, J., Li, Y., Peters, W.: Text Processing with GATE, Version 6 (2011) 978-0956599315
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2014 Springer International Publishing Switzerland
About this paper
Cite this paper
Merabti, A., Soualmia, L.F., Darmoni, S.J. (2014). Detecting Noun Phrases in Biomedical Terminologies: The First Step in Managing the Evolution of Knowledge. In: Zhang, Y., Yao, G., He, J., Wang, L., Smalheiser, N.R., Yin, X. (eds) Health Information Science. HIS 2014. Lecture Notes in Computer Science, vol 8423. Springer, Cham. https://doi.org/10.1007/978-3-319-06269-3_12
Download citation
DOI: https://doi.org/10.1007/978-3-319-06269-3_12
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-06268-6
Online ISBN: 978-3-319-06269-3
eBook Packages: Computer ScienceComputer Science (R0)