GitHub Project Analysis
Ever wonder how many different people contribute to a project on average, or how many commits the average contributor is responsible for? What about the size of the average patch…
Ever wonder how many different people contribute to a project on average, or how many commits the average contributor is responsible for? What about the size of the average patch and how many files it touches?
In The impact of language choice on github projects, Aldo Cortesi attempts to answer these questions and more by analyzing some 1.5 million commits made by 20 thousand contributors across 30 thousand active GitHub repositories and presents his findings in a series of graphs with commentary:
Interesting stuff.
Aldo has made the entire dataset available as a PostgreSQL dump file and is taking suggestions for refinements or expansions to the data. So get analyzin’!
Written by
Related posts
![](https://github.blog/wp-content/uploads/2023/09/screencapture-innovationgraph-github-2023-09-20-15_44_54-1.png?resize=400%2C212)
How researchers are using GitHub Innovation Graph data to estimate the impact of ChatGPT
An interview with economic researchers who are applying causal inference techniques to analyze the effect of generative AI tools on software development activity.
![](https://github.blog/wp-content/uploads/2024/01/Enterprise-DarkMode-1.png?resize=400%2C212)
GitHub Availability Report: June 2024
In June, we experienced two incidents that resulted in degraded performance across GitHub services.
![](https://github.blog/wp-content/uploads/2024/06/AI-DarkMode-4.png?resize=400%2C212)
Advancing responsible practices for open source AI
Outcomes from the Partnership on AI and GitHub workshop.