
Announcing the GitHub Innovation Graph
Explore a universe of data about how the world is building software together on GitHub.
On 02/02/2020 we took a snapshot of every active public repository on GitHub to be archived for a thousand years in the Arctic Code Vault. Learn about what’s included, how you can help us improve it, and more.
At GitHub Universe 2019, we introduced the GitHub Archive Program along with the GitHub Arctic Code Vault. We set out to preserve open source software for future generations by storing your code in an archive built to last a thousand years, and now is the time. The GitHub Arctic Code Vault is now in production.
On 02/02/2020, we took a snapshot of all active public repositories on GitHub to archive in the vault. The snapshot includes repositories with:
Learn more about the snapshot criteria
With archives around the world and an arctic vault full of code, we wanted to provide context and direction with a guide, that’s included in every archive. The human-readable index and guide itemizes the location of each repository and explains how to recover the data. The guide provides an overview of what software is, an explanation of open source and its ethos, and a technical overview of how to unpack the archive’s contents.
On January 23, we open sourced draft 0.1 of the guide, but we need your help to improve it. Take a look and submit a pull request in the GitHub Archive Program repository by midnight on February 29, 2020.
We gathered an advisory board of experts in anthropology, archaeology, archiving, history, linguistics, science, and long-term projects to help us maximize the archive’s value for future generations.
We held our first Advisory Summit on January 16-17. After examining the archive program, the advisory board identified three significant themes:
Today we begin production of the Arctic Code Vault which takes about two months to complete. In the spring we’ll return to Svalbard to make the official deposit of the Arctic Code Vault in the Arctic World Archive.
Join us at our booth at Satellite in May 2020, where we’ll share more about the Archive Program and the importance of preserving the software we collectively create today for future generations.
Thank you to the open source community for all your contributions.