Data Challenge II Results
In April we announced the second annual GitHub data challenge. Since last year, GitHub’s public timeline data on Google BigQuery has grown by over 80 million events, including 3.8 million…
In April we announced the second annual GitHub data challenge.
Since last year, GitHub’s public timeline data on
Google BigQuery has grown by over 80 million events, including
3.8 million new repositories, 38 million pushes, and 8 million comments on issues, pull requests, and commits.
After receiving some amazing entries in the previous
challenge, we were excited to see what people would discover with another year of data. The results blew us away:
we saw many more entrants and novel applications of our data. GitHubbers ranked their favorite entries, and after
tallying the votes, we’re happy to announce the top 3 entries for the 2013 GitHub data challenge.
First Place
The Open Source Report Card, by Dan Foreman-Mackey, analyzes a GitHub
user’s contributions to produce a “report card” with statistics and automatically generated prose.
Second Place
How often do people use tabs over spaces in Java? How many commits have lines wrapped to 80 characters?
Popular Convention by Outsider
uses GitHub data to analyze conventions in selected programming languages.
Third Place
David Fischer’s visualization of
open source contributions by location shows the geographic
distribution of contributors behind the 200 most active GitHub repositories.
Thanks
Congratulations to the winning entries, and huge thanks to everyone who submitted an entry!
Our top
3 winners will receive gift certificates to the GitHub Shop for $200, $100, and $50,
respectively.
We can’t wait to see what the next data challenge will bring!
Written by
Related posts
![](https://github.blog/wp-content/uploads/2023/09/screencapture-innovationgraph-github-2023-09-20-15_44_54-1.png?resize=400%2C212)
How researchers are using GitHub Innovation Graph data to estimate the impact of ChatGPT
An interview with economic researchers who are applying causal inference techniques to analyze the effect of generative AI tools on software development activity.
![](https://github.blog/wp-content/uploads/2024/01/Enterprise-DarkMode-1.png?resize=400%2C212)
GitHub Availability Report: June 2024
In June, we experienced two incidents that resulted in degraded performance across GitHub services.
![](https://github.blog/wp-content/uploads/2024/06/AI-DarkMode-4.png?resize=400%2C212)
Advancing responsible practices for open source AI
Outcomes from the Partnership on AI and GitHub workshop.