MSR 2020: Seoul, Korea
- Sunghun Kim, Georgios Gousios, Sarah Nadi, Joseph Hejderup:
MSR '20: 17th International Conference on Mining Software Repositories, Seoul, Republic of Korea, 29-30 June, 2020. ACM 2020, ISBN 978-1-4503-7517-7
Mining Challenge
- Antoine Pietri, Diomidis Spinellis, Stefano Zacchiroli:
The Software Heritage Graph Dataset: Large-scale Analysis of Public Software Development History. 1-5
- Rao Hamza Ali, Chelsea Parlett-Pelleriti, Erik Linstead:
Cheating Death: A Statistical Survival Analysis of Publicly Available Python Projects. 6-10
- Avijit Bhattacharjee, Sristy Sumana Nath, Shurui Zhou, Debasish Chakroborti, Banani Roy, Chanchal K. Roy, Kevin A. Schneider:
An Exploratory Study to Find Motives Behind Cross-platform Forks from Software Heritage Dataset. 11-15
- Gábor Antal, Márton Keleti, Péter Hegedüs:
Exploring the Security Awareness of the Python and JavaScript Open Source Communities. 16-20
Technical Papers
- Shayan A. Akbar, Avinash C. Kak:
A Large-Scale Comparative Evaluation of IR-Based Tools for Bug Localization. 21-31
- Yang Chen, Andrew E. Santosa, Ming Yi Ang, Abhishek Sharma, Asankhaya Sharma, David Lo:
A Machine Learning Approach for Vulnerability Curation. 32-42
- Irving Muller Rodrigues, Daniel Aloise, Eraldo Rezende Fernandes, Michel R. Dagenais:
A Soft Alignment Model for Bug Deduplication. 43-53
- Yaroslav Golubev, Maria Eliseeva, Nikita Povarov, Timofey Bryksin:
A Study of Potential Code Borrowing and License Violations in Java Projects on GitHub. 54-64
- Abdulkarim Khormi, Mohammad Alahmadi, Sonia Haiduc:
A Study on the Accuracy of OCR Engines for Source Code Transcription from Programming Screencasts. 65-75
- Yiwen Wu, Yang Zhang, Tao Wang, Huaimin Wang:
An Empirical Study of Build Failures in the Docker Context. 76-80
- Jason Tsay, Alan Braz, Martin Hirzel, Avraham Shinnar, Todd W. Mummert:
AIMMX: Artificial Intelligence Model Metadata Extractor. 81-92
- Tomoki Nakamaru, Tomomasa Matsunaga, Tetsuro Yamazaki, Soramichi Akiyama, Shigeru Chiba:
An Empirical Study of Method Chaining in Java. 93-102
- Peipei Wang, Chris Brown, Jamie A. Jennings, Kathryn T. Stolee:
An Empirical Study on Regular Expression Bugs. 103-113
- Paolo Calciati, Konstantin Kuznetsov, Alessandra Gorla, Andreas Zeller:
Automatically Granted Permissions in Android apps: An Empirical Study on their Prevalence and on the Potential Threats for Privacy. 114-124
- Matheus Paixão, Anderson G. Uchôa, Ana Carla Bibiano, Daniel Oliveira, Alessandro Garcia, Jens Krinke, Emilio Arvonio:
Behind the Intents: An In-depth Empirical Study on Software Refactoring in Modern Code Review. 125-136
- Laerte Xavier, Fabio Ferreira, Rodrigo Brito, Marco Túlio Valente:
Beyond the Code: Mining Self-Admitted Technical Debt in Issue Tracker Systems. 137-146
- Che Shian Hung, Robert Dyer:
Boa Views: Easy Modularization and Sharing of MSR Analyses. 147-157
- Nicole Novielli, Fabio Calefato, Davide Dongiovanni, Daniela Girardi, Filippo Lanubile:
Can We Use SE-specific Sentiment Analysis Tools in a Cross-Platform Setting? 158-168
- Jens Meinicke, Juan Hoyos, Bogdan Vasilescu, Christian Kästner:
Capture the Feature Flag: Detecting Feature Flags in Open-Source. 169-173
- Ahmad Abdellatif, Diego Costa, Khaled Badran, Rabe Abdalkareem, Emad Shihab:
Challenges in Chatbot Development: A Study of Stack Overflow Posts. 174-185
- Leonardo da Silva Sousa, Diego Cedrim, Alessandro Garcia, Willian Nalepa Oizumi, Ana Carla Bibiano, Daniel Oliveira, Miryung Kim, Anderson Oliveira:
Characterizing and Identifying Composite Refactorings: Concepts, Heuristics and Patterns. 186-197
- Antonio Borrelli, Vittoria Nardone, Giuseppe A. Di Lucca, Gerardo Canfora, Massimiliano Di Penta:
Detecting Video Game-Specific Bad Smells in Unity Projects. 198-208
- Tapajit Dey, Sara Mousavi, Eduardo Ponce, Tanner Fry, Bogdan Vasilescu, Anna Filippova, Audris Mockus:
Detecting and Characterizing Bots that Commit Code. 209-219
- Fabiano Pecorelli, Fabio Palomba, Foutse Khomh, Andrea De Lucia:
Developer-Driven Code Smell Prioritization. 220-231
- Danielle Gonzalez, Michael Rath, Mehdi Mirakhorli:
Did You Remember To Test Your Tokens? 232-242
- Rhys Compton, Eibe Frank, Panos Patros, Abigail M. Y. Koay:
Embedding Java Classes with code2vec: Improvements from Variable Obfuscation. 243-253
- Thomas Durieux, Claire Le Goues, Michael Hilton, Rui Abreu:
Empirical Study of Restarted and Flaky Builds on Travis CI. 254-264
- Nicolas E. Gold, Jens Krinke:
Ethical Mining: A Case Study on MSR Mining Challenges. 265-276
- Antoine Pietri, Guillaume Rousseau, Stefano Zacchiroli:
Forking Without Clicking: on How to Identify Software Repository Forks. 277-287
- Ang Jia, Ming Fan, Xi Xu, Di Cui, Wenying Wei, Zijiang Yang, Kai Ye, Ting Liu:
From Innovations to Prospects: What Is Hidden Behind Cryptocurrencies? 288-299
- Sakib Haque, Alexander LeClair, Lingfei Wu, Collin McMillan:
Improved Automatic Summarization of Subroutines via Attention to File Context. 300-310
- Davide Spadini, Martin Schvarcbacher, Ana-Maria Oprescu, Magiel Bruntink, Alberto Bacchelli:
Investigating Severity Thresholds for Test Smells. 311-321
- Hongbo Fang, Daniel Klug, Hemank Lamba, James D. Herbsleb, Bogdan Vasilescu:
Need for Tweet: How Open Source Developers Talk About Their GitHub Work on Twitter. 322-326
- Biruk Asmare Muse, Mohammad Masudur Rahman, Csaba Nagy, Anthony Cleve, Foutse Khomh, Giuliano Antoniol:
On the Prevalence, Impact, and Evolution of SQL Code Smells in Data-Intensive Systems. 327-338
- Omar El Zarif, Daniel Alencar da Costa, Safwat Hassan, Ying Zou:
On the Relationship between User Churn and Software Issues. 339-349
- Triet Huynh Minh Le, David Hin, Roland Croft, Muhammad Ali Babar:
PUMiner: Mining Security Posts from Developer Question and Answer Websites with PU Learning. 350-361
- Nan Yang, Pieter J. L. Cuijpers, Ramon R. H. Schiffelers, Johan Lukkien, Alexander Serebrenik:
Painting Flowers: Reasons for Using Single-State State Machines in Model-Driven Engineering. 362-373
- Konstantinos Barmpis, Patrick Neubauer, Jonathan Co, Dimitris S. Kolovos, Nicholas Matragkas, Richard F. Paige:
Polyglot and Distributed Software Repository Mining with Crossflow. 374-384
- Toni Mattis, Patrick Rein, Falco Dürsch, Robert Hirschfeld:
RTPTorrent: An Open-source Dataset for Evaluating Regression Test Prioritization. 385-396
- Shubhankar Suman Singh, Smruti R. Sarangi:
SoftMon: A Tool to Compare Similar Open-source Software from a Performance Perspective. 397-408
- James Walden:
The Impact of a Major Security Event on an Open Source Project: The Case of OpenSSL. 409-419
- Hadhemi Jebnoun, Houssem Ben Braiek, Mohammad Masudur Rahman, Foutse Khomh:
The Scent of Deep Learning Code: An Empirical Study. 420-430
- Danielle Gonzalez, Thomas Zimmermann, Nachiappan Nagappan:
The State of the ML-universe: 10 Years of Artificial Intelligence & Machine Learning Software Development on GitHub. 431-442
- Yalin Liu, Jinfeng Lin, Jane Cleland-Huang:
Traceability Support for Multi-Lingual Software Projects. 443-454
- Timofey Bryksin, Victor Petukhov, Ilya Alexin, Stanislav Prikhodko, Alexey Shpilman, Vladimir Kovalenko, Nikita Povarov:
Using Large-Scale Anomaly Detection on Code to Improve Kotlin Compiler. 455-465
- Suhaib Mujahid, Rabe Abdalkareem, Emad Shihab, Shane McIntosh:
Using Others' Tests to Identify Breaking Updates. 466-476
- Sergey Svitkov, Timofey Bryksin:
Visualization of Methods Changeability Based on VCS Data. 477-480
- Rolf-Helge Pfeiffer:
What constitutes Software?: An Empirical, Descriptive Study of Artifacts. 481-491
- Gustavo Pinto, Breno Miranda, Supun Dissanayake, Marcelo d'Amorim, Christoph Treude, Antonia Bertolino:
What is the Vocabulary of Flaky Tests? 492-502
Data Showcase
- Maëlick Claes, Mika V. Mäntylä:
20-MAD: 20 Years of Issues and Commits of Mozilla and Apache Development. 503-507
- Jiahao Fan, Yi Li, Shaohua Wang, Tien N. Nguyen:
A C/C++ Code Vulnerability Dataset with Code Changes and CVE Summaries. 508-512
- Audris Mockus, Diomidis Spinellis, Zoe Kotti, Gabriel John Dusing:
A Complete Set of Related Git Repositories Identified via Community Detection Approaches Based on Shared Commits. 513-517
- Tanner Fry, Tapajit Dey, Andrey Karnauch, Audris Mockus:
A Dataset and an Approach for Identity Resolution of 38 Million Author IDs extracted from 2B Git Commits. 518-522
- Diomidis Spinellis, Zoe Kotti, Audris Mockus:
A Dataset for GitHub Repository Deduplication. 523-527
- Jordan Henkel, Christian Bird, Shuvendu K. Lahiri, Thomas W. Reps:
A Dataset of Dockerfiles. 528-532
- Diomidis Spinellis, Zoe Kotti, Konstantinos Kravvaritis, Georgios Theodorou, Panos Louridas:
A Dataset of Enterprise-Driven Open Source Software. 533-537
- Usman Ashraf, Christoph Mayr-Dorn, Alexander Egyed, Sebastiano Panichella:
A Mixed Graph-Relational Dataset of Socio-technical Interactions in Open Source Systems. 538-542
- Xunhui Zhang, Ayushi Rastogi, Yue Yu:
On the Shoulders of Giants: A New Dataset for Pull-based Development Research. 543-547
- Pei Liu, Li Li, Yanjie Zhao, Xiaoyu Sun, John Grundy:
AndroZooOpen: Collecting Large-scale Open Source Android Apps for the Research Community. 548-552
- Cristiano Politowski, Fábio Petrillo, Gabriel Cavalheiro Ullmann, Josias de Andrade Werly, Yann-Gaël Guéhéneuc:
Dataset of Video Game Development Problems. 553-557
- Themistoklis Diamantopoulos, Michail D. Papamichail, Thomas Karanikiotis, Kyriakos C. Chatzidimitriou, Andreas L. Symeonidis:
Employing Contribution and Quality Metrics for Quantifying the Software Development Process. 558-562
- Esteban Parra, Ashley Ellis, Sonia Haiduc:
GitterCom: A Dataset of Open Source Developer Communications in Gitter. 563-567
- Laura Bello-Jiménez, Camilo Escobar-Velásquez, Anamaria Mojica-Hanke, Santiago Cortés-Fernández, Mario Linares-Vásquez:
Hall-of-Apps: The Top Android Apps Metadata Archive. 568-572
- Rafael-Michael Karampatsis, Charles Sutton:
How Often Do Single-Statement Bugs Occur?: The ManySStuBs4J Dataset. 573-577
- Federico Corò, Roberto Verdecchia, Emilio Cruciani, Breno Miranda, Antonia Bertolino:
JTeC: A Large Collection of Java Test Classes for Test Code Analysis and Processing. 578-582
- Carolin E. Brandt, Annibale Panichella, Andy Zaidman, Moritz Beller:
LogChunks: A Data Set for Build Log Analysis. 583-587
- Preetha Chatterjee, Kostadin Damevski, Nicholas A. Kraft, Lori L. Pollock:
Software-related Slack Chats with Disentangled Conversations. 588-592
- András Kicsi, László Vidács, Tibor Gyimóthy:
TestRoutes: A Manually Curated Method Level Dataset for Test-to-Code Traceability. 593-597
Registered Reports
- Jürgen Cito, Jiasi Shen, Martin C. Rinard:
An Empirical Study on the Impact of Deimplicitization on Comprehension in Programs Using Application Frameworks. 598-601
- Antoine Pietri, Guillaume Rousseau, Stefano Zacchiroli:
Determining the Intrinsic Structure of Public Software Development History. 602-605
- Pavlína Wurzel Gonçalves, Enrico Fregnan, Tobias Baum, Kurt Schneider, Alberto Bacchelli:
Do Explicit Review Strategies Improve Code Review Performance? 606-610
- Steffen Herbold, Alexander Trautsch, Benjamin Ledel:
Large-Scale Manual Validation of Bugfixing Changes. 611-614
- Mouna Abidi, Moses Openja, Foutse Khomh:
Multi-language Design Smells: A Backstage Perspective. 615-618
- Ingrid Nunes, Christoph Treude, Fabio Calefato:
The Impact of Dynamics of Collaborative Software Engineering on Introverts: A Study Protocol. 619-622