Abstract
[Context & Motivation] Once research questions and initial theories have taken shape, empirical research typically requires selecting cases in which to study the underlying ideas. Issue trackers of today's open source systems (OSS) are a gold mine for empirical research, not least for studying trace links among the issue artifacts they contain. [Question / problem] The huge number of available OSS projects complicates the process of finding cases suitable for the research goals. Further, simply picking a large number of projects at random does not imply generalizability. Therefore, the selection process should be carefully designed. [Principal ideas / results] In this paper, we propose a method for choosing OSS projects to study trace links found in issue tracking systems. Building upon purposive sampling and cluster analysis, the method identifies relevant project characteristics and filters out irrelevant information. Every step of the method is demonstrated on a live example. [Contributions] The proposed strategy selects an information-rich, representative, and diverse sample of OSS projects for a traceability case study. Our work may be used as a practical guide by other researchers performing project selection tasks.
Copyright information
© 2019 Springer Nature Switzerland AG
About this paper
Cite this paper
Rath, M., Tomova, M.T., Mäder, P. (2019). Selecting Open Source Projects for Traceability Case Studies. In: Knauss, E., Goedicke, M. (eds) Requirements Engineering: Foundation for Software Quality. REFSQ 2019. Lecture Notes in Computer Science, vol 11412. Springer, Cham. https://doi.org/10.1007/978-3-030-15538-4_17
DOI: https://doi.org/10.1007/978-3-030-15538-4_17
Publisher Name: Springer, Cham
Print ISBN: 978-3-030-15537-7
Online ISBN: 978-3-030-15538-4
eBook Packages: Computer Science, Computer Science (R0)