GitHub Project Analysis
Ever wonder how many different people contribute to a project on average, or how many commits the average contributor is responsible for? What about the size of the average patch and how many files it touches?
In "The impact of language choice on github projects", Aldo Cortesi attempts to answer these questions and more by analyzing some 1.5 million commits made by 20 thousand contributors across 30 thousand active GitHub repositories. He presents his findings in a series of graphs with commentary.
Interesting stuff.
Aldo has made the entire dataset available as a PostgreSQL dump file and is taking suggestions for refinements or expansions to the data. So get analyzin’!
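If you want a feel for the questions above before loading the dump, here is a minimal sketch in Python. The record layout and the sample rows are made up for illustration; the dataset's actual schema isn't described here, so treat every field name as an assumption.

```python
from collections import defaultdict
from statistics import mean

# Hypothetical commit records: (repository, author, files_touched).
# These rows are invented for illustration and are not taken from the dataset.
COMMITS = [
    ("repo-a", "alice", 3),
    ("repo-a", "bob", 1),
    ("repo-a", "alice", 2),
    ("repo-b", "carol", 5),
    ("repo-b", "carol", 1),
]


def summarize(commits):
    """Compute the three averages the post asks about."""
    authors_by_repo = defaultdict(set)    # repo -> distinct authors
    commits_by_author = defaultdict(int)  # (repo, author) -> commit count
    for repo, author, _files in commits:
        authors_by_repo[repo].add(author)
        commits_by_author[(repo, author)] += 1
    return {
        "contributors_per_project": mean(len(a) for a in authors_by_repo.values()),
        "commits_per_contributor": mean(commits_by_author.values()),
        "files_per_commit": mean(files for _, _, files in commits),
    }


SUMMARY = summarize(COMMITS)
print(SUMMARY)
```

Note that this counts a contributor once per project, so someone active in two repositories shows up twice in the commits-per-contributor average. Whether that matches the method behind Aldo's graphs is not something this sketch can tell you.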