An Empirical Study on Bash Language Usage in Github

dc.contributor.authorLi, Zheyang
dc.date.accessioned2021-05-27T19:00:42Z
dc.date.available2021-05-27T19:00:42Z
dc.date.issued2021-05-27
dc.date.submitted2021-05-20
dc.description.abstractThe Bourne-again shell (Bash) is a prevalent scripting language for orchestrating shell commands and managing resources in Unix-like environments. At the time of writing, it is one of the mainstream shell dialects that is available on most GNU Linux systems. However, the unique syntax and semantic of shell languages could easily lead to unintended behaviors if carelessly used. Prior studies primarily focused on replacing Bash with different languages and there is not much empirical evidence studying the usage of the Bash itself in practice. In this study, we perform a wide-ranging empirical study of Bash usage, based on an analysis over one million open-source Bash scripts found in Github repositories. We identify and discuss which features and utilities of Bash are most often used. Using static analysis, we find that Bash scripts are often error prone, and the error-proneness has a moderately positive correlation with the size of the script. We also find that the most common problem areas concern quoting, resource management, command options, permissions, and error handling. We envision that the findings of this study can be beneficial for learning Bash and future studies that aim to improve shell and command-line productivity and reliability. In addition, we provide a large dataset of Bash script source code, parse trees and code smell reports of each collected Bash script to facilitate future research in Bash language.en
dc.identifier.urihttp://hdl.handle.net/10012/17036
dc.language.isoenen
dc.pendingfalse
dc.publisherUniversity of Waterlooen
dc.relation.urihttp://bit.ly/37PLx3nen
dc.subjectsoftware engineeringen
dc.subjectempirical studyen
dc.titleAn Empirical Study on Bash Language Usage in Githuben
dc.typeMaster Thesisen
uws-etd.degreeMaster of Mathematicsen
uws-etd.degree.departmentDavid R. Cheriton School of Computer Scienceen
uws-etd.degree.disciplineComputer Scienceen
uws-etd.degree.grantorUniversity of Waterlooen
uws-etd.embargo.terms0en
uws.contributor.advisorSun, Chengnian
uws.contributor.affiliation1Faculty of Mathematicsen
uws.peerReviewStatusUnrevieweden
uws.published.cityWaterlooen
uws.published.countryCanadaen
uws.published.provinceOntarioen
uws.scholarLevelGraduateen
uws.typeOfResourceTexten

Files

Original bundle
Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
Li_Zheyang.pdf
Size:
1021.82 KB
Format:
Adobe Portable Document Format
Description:
License bundle
Now showing 1 - 1 of 1
No Thumbnail Available
Name:
license.txt
Size:
6.4 KB
Format:
Item-specific license agreed upon to submission
Description: