{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2025,2,21]],"date-time":"2025-02-21T03:15:58Z","timestamp":1740107758176,"version":"3.37.3"},"reference-count":56,"publisher":"Springer Science and Business Media LLC","license":[{"start":{"date-parts":[[2022,2,9]],"date-time":"2022-02-09T00:00:00Z","timestamp":1644364800000},"content-version":"tdm","delay-in-days":0,"URL":"https:\/\/creativecommons.org\/licenses\/by\/4.0"},{"start":{"date-parts":[[2022,2,9]],"date-time":"2022-02-09T00:00:00Z","timestamp":1644364800000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/creativecommons.org\/licenses\/by\/4.0"}],"funder":[{"DOI":"10.13039\/501100006359","name":"Blekinge Institute of Technology","doi-asserted-by":"crossref","id":[{"id":"10.13039\/501100006359","id-type":"DOI","asserted-by":"crossref"}]}],"content-domain":{"domain":["link.springer.com"],"crossmark-restriction":false},"short-container-title":["Requirements Eng"],"abstract":"Abstract<\/jats:title>Causal relations in natural language (NL) requirements convey strong, semantic information. Automatically extracting such causal information enables multiple use cases, such as test case generation, but it also requires to reliably detect causal relations in the first place. Currently, this is still a cumbersome task as causality in NL requirements is still barely understood and, thus, barely detectable. In our empirically informed research, we aim at better understanding the notion of causality and supporting the automatic extraction of causal relations in NL requirements. In a first case study, we investigate 14.983 sentences from 53 requirements documents to understand the extent and form in which causality occurs. Second, we present and evaluate a tool-supported approach, called CiRA, for causality detection. We conclude with a second case study where we demonstrate the applicability of our tool and investigate the impact of causality on NL requirements. The first case study shows that causality constitutes around 28 % of all NL requirements sentences. We then demonstrate that our detection tool achieves a macro-$$\\hbox {F}_{1}$$<\/jats:tex-math>F<\/mml:mtext>1<\/mml:mn><\/mml:msub><\/mml:math><\/jats:alternatives><\/jats:inline-formula>score of 82 % on real-world data and that it outperforms related approaches with an average gain of 11.06 % in macro-Recall and 11.43 % in macro-Precision. Finally, our second case study corroborates the positive correlations of causality with features of NL requirements. The results strengthen our confidence in the eligibility of causal relations for downstream reuse, while our tool and publicly available data constitute a first step in the ongoing endeavors of utilizing causality in RE and beyond.<\/jats:p>","DOI":"10.1007\/s00766-022-00371-x","type":"journal-article","created":{"date-parts":[[2022,2,9]],"date-time":"2022-02-09T05:02:43Z","timestamp":1644382963000},"update-policy":"https:\/\/doi.org\/10.1007\/springer_crossmark_policy","source":"Crossref","is-referenced-by-count":3,"title":["Causality in requirements artifacts: prevalence, detection, and impact"],"prefix":"10.1007","author":[{"ORCID":"https:\/\/orcid.org\/0000-0003-3995-6125","authenticated-orcid":false,"given":"Julian","family":"Frattini","sequence":"first","affiliation":[]},{"given":"Jannik","family":"Fischbach","sequence":"additional","affiliation":[]},{"given":"Daniel","family":"Mendez","sequence":"additional","affiliation":[]},{"given":"Michael","family":"Unterkalmsteiner","sequence":"additional","affiliation":[]},{"given":"Andreas","family":"Vogelsang","sequence":"additional","affiliation":[]},{"given":"Krzysztof","family":"Wnuk","sequence":"additional","affiliation":[]}],"member":"297","published-online":{"date-parts":[[2022,2,9]]},"reference":[{"key":"371_CR1","unstructured":"Asghar N (2016) Automatic extraction of causal relations from natural language texts: a comprehensive survey. arXiv:1605.07895"},{"issue":"1","key":"371_CR2","doi-asserted-by":"crossref","first-page":"289","DOI":"10.1111\/j.2517-6161.1995.tb02031.x","volume":"57","author":"Y Benjamini","year":"1995","unstructured":"Benjamini Y, Hochberg Y (1995) Controlling the false discovery rate: a practical and powerful approach to multiple testing. J Royal Stat Soc Series B (Methodol) 57(1):289\u2013300","journal-title":"J Royal Stat Soc Series B (Methodol)"},{"key":"371_CR3","unstructured":"Blanco E, Castell N, Moldovan D (2008) Causal relation extraction. In: LREC\u201908"},{"issue":"3","key":"371_CR4","doi-asserted-by":"publisher","first-page":"327","DOI":"10.1002\/int.20069","volume":"20","author":"K Chan","year":"2005","unstructured":"Chan K, Lam W (2005) Extracting causation knowledge from natural language texts. Int J Intell Syst 20(3):327\u2013358","journal-title":"Int J Intell Syst"},{"key":"371_CR5","doi-asserted-by":"crossref","unstructured":"Chang DS, Choi KS (2004) Causal relation extraction using cue phrase and lexical pair probabilities. In: International Conference on Natural Language Processing, pp. 61\u201370. Springer","DOI":"10.1007\/978-3-540-30211-7_7"},{"key":"371_CR6","doi-asserted-by":"publisher","first-page":"37","DOI":"10.1177\/001316446002000104","volume":"20","author":"J Cohen","year":"1960","unstructured":"Cohen J (1960) A coefficient of agreement for nominal scales. Educ Psychol Meas 20:37\u201346","journal-title":"Educ Psychol Meas"},{"issue":"1","key":"371_CR7","doi-asserted-by":"publisher","first-page":"107","DOI":"10.1177\/001316447303300111","volume":"33","author":"J Cohen","year":"1973","unstructured":"Cohen J (1973) Eta-squared and partial eta-squared in fixed factor anova designs. Educ Psychol Measurement 33(1):107\u2013112","journal-title":"Educ Psychol Measurement"},{"key":"371_CR8","unstructured":"Devlin J, Chang MW, Lee K, Toutanova K (2018) Bert: pre-training of deep bidirectional transformers for language understanding. arXiv preprint arXiv:1810.04805"},{"issue":"3","key":"371_CR9","first-page":"71","volume":"19","author":"S Doan","year":"2019","unstructured":"Doan S, Yang EW, Tilak SS, Li PW, Zisook DS, Torii M (2019) Extracting health-related causality from twitter messages using natural language processing. BMC Med Inf Decis Mak 19(3):71\u201377","journal-title":"BMC Med Inf Decis Mak"},{"key":"371_CR10","unstructured":"Fares M, Kutuzov A, Oepen S, Velldal E (2017) Word vectors, reuse, and replicability: Towards a community repository of large-text resources. In: NoDaLiDa\u201917"},{"key":"371_CR11","doi-asserted-by":"publisher","first-page":"543","DOI":"10.1016\/0895-4356(90)90158-L","volume":"43","author":"AR Feinstein","year":"1990","unstructured":"Feinstein AR, Cicchetti DV (1990) High agreement but low kappa: I. the problems of two paradoxes. J Clin Epidemiol 43:543\u2013549","journal-title":"J Clin Epidemiol"},{"key":"371_CR12","doi-asserted-by":"publisher","first-page":"190","DOI":"10.1016\/j.jss.2016.02.047","volume":"123","author":"H Femmer","year":"2017","unstructured":"Femmer H, Mendez D, Wagner S, Eder S (2017) Rapid quality assurance with requirements smells. J Syst Softw 123:190\u2013213","journal-title":"J Syst Softw"},{"key":"371_CR13","doi-asserted-by":"crossref","unstructured":"Fischbach J, Frattini J, Mendez D, Unterkalmsteiner M, Femmer H, Vogelsang A (2021) How do practitioners interpret conditionals in requirements? arXiv preprint arXiv:2109.02063","DOI":"10.1007\/978-3-030-91452-3_6"},{"key":"371_CR14","doi-asserted-by":"crossref","unstructured":"Fischbach J, Frattini J, Spaans A, Kummeth M, Vogelsang A, Mendez D, Unterkalmsteiner M (2021) Automatic detection of causality in requirement artifacts: the cira approach. In: International Working Conference on Requirements Engineering: Foundation for Software Quality, pp. 19\u201336. Springer","DOI":"10.1007\/978-3-030-73128-1_2"},{"key":"371_CR15","doi-asserted-by":"crossref","unstructured":"Fischbach J, Hauptmann B, Konwitschny L, Spies D, Vogelsang A (2020) Towards causality extraction from requirements. In: RE\u201920","DOI":"10.1109\/RE48521.2020.00053"},{"key":"371_CR16","doi-asserted-by":"crossref","unstructured":"Fischbach J, Vogelsang A, Spies D, Wehrle A, Junker M, Freudenstein D (2020) Specmate: Automated creation of test cases from acceptance criteria. In: ICST\u201920","DOI":"10.1109\/ICST46399.2020.00040"},{"key":"371_CR17","doi-asserted-by":"crossref","unstructured":"Frattini J, Junker M, Unterkalmsteiner M, Mendez D (2020) Automatic extraction of cause-effect-relations from requirements artifacts. In: ASE\u201920","DOI":"10.1145\/3324884.3416549"},{"issue":"2","key":"371_CR18","doi-asserted-by":"publisher","first-page":"72","DOI":"10.1109\/MS.2010.44","volume":"27","author":"S Fricker","year":"2010","unstructured":"Fricker S, Gorschek T, Byman C, Schmidle A (2010) Handshaking with implementation proposals: negotiating requirements understanding. IEEE Softw 27(2):72\u201380","journal-title":"IEEE Softw"},{"key":"371_CR19","volume-title":"The elements of statistical learning: data mining, inference, and prediction","author":"JH Friedman","year":"2017","unstructured":"Friedman JH (2017) The elements of statistical learning: data mining, inference, and prediction. Springer, Berlin"},{"key":"371_CR20","unstructured":"Fuchs NE, Schwitter R (1995) Controlled natural language for requirements specifications"},{"key":"371_CR21","doi-asserted-by":"crossref","unstructured":"Girju R (2003) Automatic detection of causal relations for question answering. In: Proceedings of the ACL 2003 workshop on Multilingual summarization and question answering-Volume 12, pp. 76\u201383. Association for Computational Linguistics","DOI":"10.3115\/1119312.1119322"},{"key":"371_CR22","unstructured":"Girju R, Moldovan DI, et al (2002) Text mining for causal relations. In: FLAIRS conference, pp. 360\u2013364"},{"key":"371_CR23","doi-asserted-by":"crossref","unstructured":"Gordon M, Harel D (2009) Generating executable scenarios from natural language. In: International Conference on Intelligent Text Processing and Computational Linguistics, pp. 456\u2013467. Springer","DOI":"10.1007\/978-3-642-00382-0_37"},{"key":"371_CR24","unstructured":"Gwet K (2020) AgreeStat Analytics (Cloud-based version (AgreeStat360) was used in Sep\u2019). https:\/\/www.agreestat.com\/"},{"key":"371_CR25","unstructured":"Gwet KL (2014) Handbook of inter-rater reliability: The definitive guide to measuring the extent of agreement among raters. Advanced Analytics, LLC"},{"key":"371_CR26","doi-asserted-by":"crossref","unstructured":"Hashimoto C, Torisawa K, Kloetzer J, Oh JH (2015) Generating event causality hypotheses through semantic relations. In: Twenty-Ninth AAAI Conference on Artificial Intelligence","DOI":"10.1609\/aaai.v29i1.9533"},{"key":"371_CR27","unstructured":"Honnibal M, Montani I (2020) spaCy NLP library (We use the newest version of the en_core_web_sm model in Sep\u2019). https:\/\/spacy.io\/"},{"issue":"4","key":"371_CR28","doi-asserted-by":"publisher","first-page":"435","DOI":"10.1145\/1113308.1113313","volume":"4","author":"T Inui","year":"2005","unstructured":"Inui T, Inui K, Matsumoto Y (2005) Acquiring causal knowledge from text using the connective marker tame. ACM Trans Asian Language Inf Process (TALIP) 4(4):435\u2013474","journal-title":"ACM Trans Asian Language Inf Process (TALIP)"},{"key":"371_CR29","doi-asserted-by":"publisher","DOI":"10.1007\/978-1-4614-7138-7","volume-title":"An introduction to statistical learning","author":"G James","year":"2013","unstructured":"James G, Witten D, Hastie T, Tibshirani RE (2013) An introduction to statistical learning. Springer, Berlin"},{"key":"371_CR30","doi-asserted-by":"crossref","unstructured":"Khoo CSG, Chan S, Niu Y (2000) Extracting causal knowledge from a medical database using graphical patterns. In: ACL\u201900","DOI":"10.3115\/1075218.1075261"},{"key":"371_CR31","doi-asserted-by":"crossref","unstructured":"Kyriakakis M, Androutsopoulos I, i Ametll\u00e9 JG, Saudabayev A (2019) Transfer learning for causal sentence detection. arXiv:1906.07544","DOI":"10.18653\/v1\/W19-5031"},{"key":"371_CR32","doi-asserted-by":"crossref","unstructured":"Landis JR, Koch GG (1977) The measurement of observer agreement for categorical data. Biometrics","DOI":"10.2307\/2529310"},{"issue":"3","key":"371_CR33","doi-asserted-by":"publisher","first-page":"482","DOI":"10.1111\/j.1523-1739.2012.01828.x","volume":"26","author":"SC Latta","year":"2012","unstructured":"Latta SC, Howell CA, Dettling MD, Cormier RL (2012) Use of data on avian demographics and site persistence during overwintering to assess quality of restored riparian habitat. Conserv Biol 26(3):482\u2013492","journal-title":"Conserv Biol"},{"key":"371_CR34","volume-title":"Counterfactuals","author":"D Lewis","year":"2013","unstructured":"Lewis D (2013) Counterfactuals. Wiley, Hoboken"},{"key":"371_CR35","doi-asserted-by":"crossref","unstructured":"Mavin A, Wilkinson P, Harwood A, Novak M (2009) Easy approach to requirements syntax (ears). In: 2009 17th IEEE International Requirements Engineering Conference, pp. 317\u2013322. IEEE","DOI":"10.1109\/RE.2009.9"},{"key":"371_CR36","doi-asserted-by":"crossref","unstructured":"McHugh ML (2012) Interrater reliability: the kappa statistic. Biochemia Medica","DOI":"10.11613\/BM.2012.031"},{"issue":"2","key":"371_CR37","doi-asserted-by":"publisher","first-page":"143","DOI":"10.11613\/BM.2013.018","volume":"23","author":"ML McHugh","year":"2013","unstructured":"McHugh ML (2013) The chi-square test of independence. Biochemia Medica 23(2):143\u2013149","journal-title":"Biochemia Medica"},{"key":"371_CR38","volume-title":"Contemporary Empirical Methods in Software Engineering","author":"D Mendez","year":"2020","unstructured":"Mendez D, Graziotin D, Wagner S, Seibold H (2020) Open science in software engineering. In: Felderer M, Travassos GH (eds) Contemporary Empirical Methods in Software Engineering. Springer, Berlin"},{"key":"371_CR39","doi-asserted-by":"crossref","unstructured":"Mostafazadeh N, Grealish A, Chambers N, Allen J, Vanderwende L (2016) CaTeRS: Causal and temporal relation scheme for semantic annotation of event structures. In: EVENTS\u201916","DOI":"10.18653\/v1\/W16-1007"},{"key":"371_CR40","doi-asserted-by":"crossref","unstructured":"Neves M, \u0160eva J (2019) An extensive review of tools for manual annotation of documents. Briefings in Bioinformatics","DOI":"10.1093\/bib\/bbz130"},{"key":"371_CR41","doi-asserted-by":"publisher","first-page":"217","DOI":"10.1016\/j.jss.2018.12.002","volume":"149","author":"T Olsson","year":"2019","unstructured":"Olsson T, Wnuk K, Gorschek T (2019) An empirical study on decision making for quality requirements. J Syst Softw 149:217\u2013233","journal-title":"J Syst Softw"},{"issue":"10","key":"371_CR42","doi-asserted-by":"publisher","first-page":"1523","DOI":"10.1093\/ietisy\/e90-d.10.1523","volume":"90","author":"C Pechsiri","year":"2007","unstructured":"Pechsiri C, Kawtrakul A (2007) Mining causality from texts for question answering system. IEICE TRANS Inf Syst 90(10):1523\u20131533","journal-title":"IEICE TRANS Inf Syst"},{"key":"371_CR43","volume-title":"Natural Language Annotation for Machine Learning\u2014a Guide to Corpus-Building for Applications","author":"J Pustejovsky","year":"2012","unstructured":"Pustejovsky J, Stubbs A (2012) Natural Language Annotation for Machine Learning\u2014a Guide to Corpus-Building for Applications. O\u2018Reilly Media Inc, Sebastopol"},{"key":"371_CR44","volume-title":"Real World Research\u2014A Resource for Social Scientists and Practitioner-Researchers","author":"C Robson","year":"2002","unstructured":"Robson C (2002) Real World Research\u2014A Resource for Social Scientists and Practitioner-Researchers. Wiley-Blackwell, Hoboken"},{"key":"371_CR45","doi-asserted-by":"publisher","first-page":"131","DOI":"10.1007\/s10664-008-9102-8","volume":"14","author":"P Runeson","year":"2009","unstructured":"Runeson P, H\u00f6st M (2009) Guidelines for conducting and reporting case study research in software engineering. Empirical Softw Eng 14:131\u2013134","journal-title":"Empirical Softw Eng"},{"key":"371_CR46","doi-asserted-by":"publisher","first-page":"191","DOI":"10.1016\/j.is.2015.04.003","volume":"54","author":"M Selway","year":"2015","unstructured":"Selway M, Grossmann G, Mayer W, Stumptner M (2015) Formalising natural language specifications using a cognitive linguistic\/configuration based approach. Inf Syst 54:191\u2013208","journal-title":"Inf Syst"},{"issue":"3","key":"371_CR47","doi-asserted-by":"publisher","first-page":"279","DOI":"10.4300\/JGME-D-12-00156.1","volume":"4","author":"GM Sullivan","year":"2012","unstructured":"Sullivan GM, Feinn R (2012) Using effect size\u2014or why the p value is not enough. J Graduate Med Educ 4(3):279\u2013282","journal-title":"J Graduate Med Educ"},{"key":"371_CR48","unstructured":"Sundararaman D, Subramanian V, Wang G, Si S, Shen D, Wang D, Carin L (2019) Syntax-infused transformer and bert models for machine translation and natural language understanding"},{"key":"371_CR49","first-page":"360","volume":"37","author":"A Viera","year":"2005","unstructured":"Viera A, Garrett J (2005) Understanding interobserver agreement: the kappa statistic. Family Med 37:360\u2013363","journal-title":"Family Med"},{"issue":"2","key":"371_CR50","doi-asserted-by":"publisher","first-page":"1","DOI":"10.1145\/3306607","volume":"28","author":"S Wagner","year":"2019","unstructured":"Wagner S, Mendez D, Felderer M, Vetr\u00f2 A, Kalinowski M, Wieringa R, Pfahl D, Conte T, Christiansson MT, Greer D et al (2019) Status quo in requirements engineering: a theory and a global family of surveys. ACM Trans Softw Eng Methodol (TOSEM) 28(2):1\u201348","journal-title":"ACM Trans Softw Eng Methodol (TOSEM)"},{"issue":"1","key":"371_CR51","doi-asserted-by":"publisher","first-page":"47","DOI":"10.1109\/TSE.2015.2445347","volume":"42","author":"K Wnuk","year":"2015","unstructured":"Wnuk K, Gorschek T, Callele D, Karlsson EA, \u00c5hlin E, Regnell B (2015) Supporting scope tracking and visualization for very large-scale requirements engineering-utilizing fsc+, decision patterns, and atomic decision visualizations. IEEE Trans Softw Eng 42(1):47\u201374","journal-title":"IEEE Trans Softw Eng"},{"key":"371_CR52","doi-asserted-by":"publisher","first-page":"82","DOI":"10.1037\/0096-3445.136.1.82","volume":"136","author":"P Wolff","year":"2007","unstructured":"Wolff P (2007) Representing causation. J Exp Psychol General 136:82","journal-title":"J Exp Psychol General"},{"key":"371_CR53","doi-asserted-by":"publisher","first-page":"276","DOI":"10.1016\/S0010-0285(03)00036-7","volume":"47","author":"P Wolff","year":"2003","unstructured":"Wolff P, Song G (2003) Models of causation and the semantics of causal verbs. Cognitive Psychol 47:276\u2013332","journal-title":"Cognitive Psychol"},{"key":"371_CR54","doi-asserted-by":"crossref","unstructured":"Wongpakaran N, Wongpakaran T, Wedding D, Gwet K (2013) A comparison of cohen\u2019s kappa and gwet\u2019s ac1 when calculating inter-rater reliability coefficients: A study conducted with personality disorder samples. BMC Medical Research Methodology","DOI":"10.1186\/1471-2288-13-61"},{"key":"371_CR55","first-page":"50","volume":"20","author":"CH Wu","year":"2005","unstructured":"Wu CH, Yu LC, Jang FL (2005) Using semantic dependencies to mine depressive symptoms from consultation records. IEEE Intell Syst 20:50\u201358","journal-title":"IEEE Intell Syst"},{"key":"371_CR56","doi-asserted-by":"crossref","unstructured":"Xu J, Zuo W, Liang S, Zuo X (2020) A review of dataset and labeling methods for causality extraction. In: Proceedings of the 28th International Conference on Computational Linguistics, pp. 1519\u20131531","DOI":"10.18653\/v1\/2020.coling-main.133"}],"container-title":["Requirements Engineering"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/link.springer.com\/content\/pdf\/10.1007\/s00766-022-00371-x.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/link.springer.com\/article\/10.1007\/s00766-022-00371-x\/fulltext.html","content-type":"text\/html","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/link.springer.com\/content\/pdf\/10.1007\/s00766-022-00371-x.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2024,9,18]],"date-time":"2024-09-18T03:55:19Z","timestamp":1726631719000},"score":1,"resource":{"primary":{"URL":"https:\/\/link.springer.com\/10.1007\/s00766-022-00371-x"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2022,2,9]]},"references-count":56,"alternative-id":["371"],"URL":"https:\/\/doi.org\/10.1007\/s00766-022-00371-x","relation":{},"ISSN":["0947-3602","1432-010X"],"issn-type":[{"type":"print","value":"0947-3602"},{"type":"electronic","value":"1432-010X"}],"subject":[],"published":{"date-parts":[[2022,2,9]]},"assertion":[{"value":"23 June 2021","order":1,"name":"received","label":"Received","group":{"name":"ArticleHistory","label":"Article History"}},{"value":"13 December 2021","order":2,"name":"accepted","label":"Accepted","group":{"name":"ArticleHistory","label":"Article History"}},{"value":"9 February 2022","order":3,"name":"first_online","label":"First Online","group":{"name":"ArticleHistory","label":"Article History"}}]}}