Database Troubles
This morning we had some database problems. With help from Engine Yard we were able to identify the slower queries and eliminate them. Here you can see our database exploding:…
This morning we had some database problems. With help from Engine Yard we were able to identify the slower queries and eliminate them.
Here you can see our database exploding:
img http://img.skitch.com/20090130-x5dfywbure73aqfjt3bqspckf3.png
And the sweet release when we deployed the fix:
img http://img.skitch.com/20090130-rkcn1b4d566f2x226f46deuq3.png
Basically, our events table has swelled to over 30m rows taking up 18gb of disk space. We have some solid indexes in place, but saving new events and querying are both becoming problems. As we plan to do more fancy things with the dashboard in the future this needs to be addressed.
Currently the plan is twofold:
-
Split
eventstable intoeventsandevents_archive. Older events will go intoevents_archive, archiving will happen nightly on a cron. This will dramatically reduce the size of theeventstable. -
Re-evaluate every query that touches
events, make sure it’s using the best possible index.
We’ll update you as we make progress. Thanks!
Written by
Related posts
Pick your agent: Use Claude and Codex on Agent HQ
Claude by Anthropic and OpenAI Codex are now available in public preview on GitHub and VS Code with a Copilot Pro+ or Copilot Enterprise subscription. Here’s what you need to know and how to get started today.
What the fastest-growing tools reveal about how software is being built
What languages are growing fastest, and why? What about the projects that people are interested in the most? Where are new developers cutting their teeth? Let’s take a look at Octoverse data to find out.
Year recap and future goals for the GitHub Innovation Graph
Discover the latest trends and insights on public software development activity on GitHub with data from the Innovation Graph through Q3 2025.