The latest version of the Stack Exchange Creative Commons Data Dump is now available. This reflects all public data in …

… up to Apr 2011.

Download from ClearBits

This month’s Stack Exchange data dump, as always, is hosted at ClearBits! You can subscribe via RSS to be notified every time a new dump is available.

Please read, this is not the usual yadda yadda! Three things:

  1. Because the dumps are quite a bit of work for us, we’re moving to a bi-monthly schedule instead of monthly. Meaning, you can expect dumps every two months instead of every month. If you have an urgent need for more timely data than this, contact us directly, or use the Stack Exchange Data Explorer, which will continue to be updated monthly.
  2. The attribution rules have changed to forbid JavaScript generated attribution links.
  3. As of November 2010, we enhanced the format of the data dump to include more requested fields, full revision history, and many other pending meta requests tagged [data-dump]. That’s why the dump is so much larger, but we did break it out in individual files per site within the torrent, so you can download just the files you need.

If you’d prefer not to download the torrent and would rather play with the most recent public data in your web browser right now, check out our open source Stack Exchange Data Explorer.

Have fun remixing and reusing; all we ask is for proper attribution.

«
»
  1. Neal McBurnett says:

    Thanks! But why are the beta sites not included?
    Is there any other way to get a full set of data for a beta site like IT Security so we can e.g. do tag usage analysis and help get the site off to a great start?

  2. Tyler Breisacher says:

    Is there a list somewhere of all the publicly-available cool things that people have done with this data?

  3. Jeff Atwood says:

    @tyler there are a few at http://stackapps.com and this is in scope there, officially

    @neal public beta sites can be cancelled (like atheism, gadgets, ai so far); we don’t like to make them part of the dump until they graduate.

  4. Oscar Mederos says:

    +1 for .7z ;)

  5. hyperslug says:

    “attribution rules have changed to forbid JavaScript generated attribution links”

    Does this make the contest-winning StackTack illegal?

    http://stackapps.com/questions/518/stacktack-a-javascript-widget-you-can-stick-anywhere

  6. antony says:

    @jeff , Oscar i try this database to try my hands on SQL database because i use mainly MySQL. can anyone explain how i can play this data in my system. means further information to restore this database

    Thanks
    Antony

  7. Matthew Flaschen says:

    @hyperslug, no, that uses the API (e.g. api.stackoverflow.com), not the dumps.

  8. Jeff Atwood says:

    @antony see http://meta.stackoverflow.com/questions/3223/importing-creative-commons-data-dump-into-mysql

Leave a Reply