Papers
Topics
Authors
Recent
Gemini 2.5 Flash
Gemini 2.5 Flash
102 tokens/sec
GPT-4o
59 tokens/sec
Gemini 2.5 Pro Pro
43 tokens/sec
o3 Pro
6 tokens/sec
GPT-4.1 Pro
50 tokens/sec
DeepSeek R1 via Azure Pro
28 tokens/sec
2000 character limit reached

Open Source Software Development Challenges: A Systematic Literature Review on GitHub (2003.10750v3)

Published 24 Mar 2020 in cs.SE and cs.SI

Abstract: Git is used as the distributed version control system for many open-source software projects. One Git-based service, GitHub, is the most common code hosting and repository service for open-source software projects. For researchers that study software engineering, the content that is hosted on these platforms provides much valuable data. There are some alternatives to get GitHub data such as GitHub Archive, GitHub API or GHTorrent. Among these options, GHTorrent is the most widely known and used GitHub dataset in the literature. Although there are some review studies about software engineering challenges across the GitHub platform, no review of GHTorrent dataset-specific research is available. In this study, the 172 studies that use GHTorrent as a data source were categorized within the scope of open source software development challenges and a systematic literature review was carried out. Moreover, the pros and cons of the dataset have been indicated and the focused issues of the literature on and the open challenges have been noted.

User Edit Pencil Streamline Icon: https://streamlinehq.com
Authors (4)
  1. Abdulkadir Şeker (4 papers)
  2. Banu Diri (7 papers)
  3. Halil Arslan (3 papers)
  4. Mehmet Fatih Amasyalı (2 papers)
Citations (1)

Summary

We haven't generated a summary for this paper yet.