Impact of Selected Java Idioms on Source Code Maintainability – Empirical Study

A substantial effort, in general, is required for understanding APIs of application frameworks. High-quality API documentation may alleviate the effort, but the production of such documentation still poses a major challenge for modern frameworks. To facilitate the production of framework instantiation documentation, we hypothesize that the framework code itself and the code of existing instantiations provide useful information. However, given the size and complexity of existent code, automated approaches are required to assist the documentation production. Our goal is to assess an automated approach for constructing relevant documentation for framework instantiation based on source code analysis of the framework itself and of existing instantiations. The criterion for defining whether documentation is relevant would be to compare the documentation with an traditional framework documentation, considering the time spent and correctness during instantiation activities, information usefulness, complexity of the activity, navigation, satisfaction, information localization and clarity. We propose an automated approach for constructing relevant documentation for framework instantiation based on source code analysis of the framework itself and of existing instantiations. The proposed approach generates documentation in a cookbook style, where the recipes are programming activities using the necessary API elements driven by the framework features. We performed an empirical study, consisting of three experiments with 44 human subjects executing real framework instantiations aimed at comparing the use of the proposed cookbooks to traditional manual framework documentation (baseline). Our empirical assessment shows that the generated cookbooks performed better or, at least, with non-significant difference when compared to the traditional documentation, evidencing the effectiveness of the approach.

Download Full-text

An Empirical Study Assessing Source Code Readability in Comprehension

2019 IEEE International Conference on Software Maintenance and Evolution (ICSME) ◽

10.1109/icsme.2019.00085 ◽

2019 ◽

Author(s):

John Johnson ◽

Sergio Lubo ◽

Nishitha Yedla ◽

Jairo Aponte ◽

Bonita Sharif

Keyword(s):

Empirical Study ◽

Source Code

Download Full-text

An Empirical Study of Content Understanding in Conversational Question Answering

Proceedings of the AAAI Conference on Artificial Intelligence ◽

10.1609/aaai.v34i05.6257 ◽

2020 ◽

Vol 34 (05) ◽

pp. 7578-7585

Author(s):

Ting-Rui Chiang ◽

Hao-Tong Ye ◽

Yun-Nung Chen

Keyword(s):

Natural Language Processing ◽

Empirical Study ◽

Language Processing ◽

Question Answering ◽

Source Code ◽

Content Understanding ◽

Question Answering Systems ◽

Benchmark Datasets ◽

Context Free ◽

Answering Questions

With a lot of work about context-free question answering systems, there is an emerging trend of conversational question answering models in the natural language processing field. Thanks to the recently collected datasets, including QuAC and CoQA, there has been more work on conversational question answering, and recent work has achieved competitive performance on both datasets. However, to best of our knowledge, two important questions for conversational comprehension research have not been well studied: 1) How well can the benchmark dataset reflect models' content understanding? 2) Do the models well utilize the conversation content when answering questions? To investigate these questions, we design different training settings, testing settings, as well as an attack to verify the models' capability of content understanding on QuAC and CoQA. The experimental results indicate some potential hazards in the benchmark datasets, QuAC and CoQA, for conversational comprehension research. Our analysis also sheds light on both what models may learn and how datasets may bias the models. With deep investigation of the task, it is believed that this work can benefit the future progress of conversation comprehension. The source code is available at https://github.com/MiuLab/CQA-Study.

Download Full-text

An empirical study of the relationship between the concepts expressed in source code and dependence

Journal of Systems and Software ◽

10.1016/j.jss.2008.04.007 ◽

2008 ◽

Vol 81 (12) ◽

pp. 2287-2298 ◽

Cited By ~ 8

Author(s):

David Binkley ◽

Nicolas Gold ◽

Mark Harman ◽

Zheng Li ◽

Kiarash Mahdavi

Keyword(s):

Empirical Study ◽

Source Code ◽

The Relationship

Download Full-text

Understanding the Causes of Architecture Changes Using OSS Mailing Lists

International Journal of Software Engineering and Knowledge Engineering ◽

10.1142/s0218194015400367 ◽

2015 ◽

Vol 25 (09n10) ◽

pp. 1633-1651 ◽

Cited By ~ 2

Author(s):

Wei Ding ◽

Peng Liang ◽

Antony Tang ◽

Hans van Vliet

Keyword(s):

Grounded Theory ◽

Empirical Study ◽

Open Source ◽

Open Source Software ◽

Source Code ◽

Internal Quality ◽

Functional Requirement ◽

Quality Requirement ◽

External Quality ◽

Mailing Lists

The causes of architecture changes can tell about why architecture changes, and this knowledge can be captured to prevent architecture knowledge vaporization and architecture degeneration. But the causes are not always known, especially in open source software (OSS) development. This makes it very hard to understand the underlying reasons for the architecture changes and design appropriate modifications. Architecture information is communicated in development mailing lists of OSS projects. To explore the possibility of identifying and understanding the causes of architecture changes, we conducted an empirical study to analyze architecture information (i.e. architectural threads) communicated in the development mailing lists of two popular OSS projects: Hibernate and ArgoUML, verified architecture changes with source code, and identified the causes of architecture changes from the communicated architecture information. The main findings of this study are: (1) architecture information communicated in OSS mailing lists does lead to architecture changes in code; (2) the major cause for architecture changes in both Hibernate and ArgoUML is preventative changes, and the causes of architecture changes are further classified to functional requirement, external quality requirement, and internal quality requirement using the coding techniques of grounded theory; (3) more than 45% of architecture changes in both projects happened before the first stable version was released.

Download Full-text

Do Missing Link Community Smell Affect Developers Productivity: An Empirical Study

Knowledge Engineering and Data Science ◽

10.17977/um018v4i12021p29-37 ◽

2021 ◽

Vol 4 (1) ◽

pp. 29

Author(s):

Toukir Ahammed ◽

Sumon Ahmed ◽

Mohammed Shafiul Alam Khan

Keyword(s):

Empirical Study ◽

Open Source ◽

Source Code ◽

Missing Link ◽

Code Smell ◽

Relationship Of ◽

The Relationship

Missing link smell occurs when developers contribute to the same source code without communicating with each other. Existing studies have analyzed the relationship of missing link smells with code smell and developer contribution. However, the productivity of developers involved in missing link smell has not been explored yet. This study investigates how productivity differs between smelly and non-smelly developers. For this purpose, the productivity of smelly and non-smelly developers of seven open-source projects are analyzed. The result shows that the developers not involved in missing link smell have more productivity than the developers involved in smells. The observed difference is also found statistically significant.

Download Full-text

Understanding Source Code Variability in Cloned Android Families: An Empirical Study on 75 Families

2019 26th Asia-Pacific Software Engineering Conference (APSEC) ◽

10.1109/apsec48747.2019.00047 ◽

2019 ◽

Author(s):

Anas Shatnawi ◽

Tewfik Ziadi ◽

Mohamed Yassin Mohamadi

Keyword(s):

Empirical Study ◽

Source Code

Download Full-text

Tracking Concerns in Evolving Source Code: An Empirical Study

2006 22nd IEEE International Conference on Software Maintenance ◽

10.1109/icsm.2006.70 ◽

2006 ◽

Cited By ~ 4

Author(s):

Martin P. Robillard

Keyword(s):

Empirical Study ◽

Source Code

Download Full-text

Statistical Unigram Analysis for Source Code Repository

International Journal of Semantic Computing ◽

10.1142/s1793351x18400123 ◽

2018 ◽

Vol 12 (02) ◽

pp. 237-260

Author(s):

Weifeng Xu ◽

Dianxiang Xu ◽

Abdulrahman Alatawi ◽

Omar El Ariss ◽

Yunkai Liu

Keyword(s):

Natural Language Processing ◽

Empirical Study ◽

Natural Language ◽

Programming Languages ◽

Language Processing ◽

Probabilistic Model ◽

Source Code ◽

Code Analysis ◽

Domain Specific ◽

Language Corpus

Unigram is a fundamental element of [Formula: see text]-gram in natural language processing. However, unigrams collected from a natural language corpus are unsuitable for solving problems in the domain of computer programming languages. In this paper, we analyze the properties of unigrams collected from an ultra-large source code repository. Specifically, we have collected 1.01 billion unigrams from 0.7 million open source projects hosted at GitHub.com. By analyzing these unigrams, we have discovered statistical properties regarding (1) how developers name variables, methods, and classes, and (2) how developers choose abbreviations. We describe a probabilistic model which relies on these properties for solving a well-known problem in source code analysis: how to expand a given abbreviation to its original indented word. Our empirical study shows that using the unigrams extracted from source code repository outperforms the using of the natural language corpus by 21% when solving the domain specific problems.

Download Full-text

Impact of Selected Java Idioms on Source Code Maintainability – Empirical Study

On the Co-evolution of ML Pipelines and Source Code - Empirical Study of DVC Projects

An Automated Approach for Constructing Framework Instantiation Documentation

An Empirical Study Assessing Source Code Readability in Comprehension

An Empirical Study of Content Understanding in Conversational Question Answering

An empirical study of the relationship between the concepts expressed in source code and dependence

Understanding the Causes of Architecture Changes Using OSS Mailing Lists

Do Missing Link Community Smell Affect Developers Productivity: An Empirical Study

Understanding Source Code Variability in Cloned Android Families: An Empirical Study on 75 Families

Tracking Concerns in Evolving Source Code: An Empirical Study

Statistical Unigram Analysis for Source Code Repository

Export Citation Format