Ecosyste.ms: Awesome
An open API service indexing awesome lists of open source software.
https://github.com/brentozarultd/soddi
StackOverflow Data Dump Importer. Forked from https://bitbucket.org/bitpusher/soddi/ after the original author passed away.
https://github.com/brentozarultd/soddi
microsoft-sql-server soddi sql-server stack-overflow stackoverflow
Last synced: 2 months ago
JSON representation
StackOverflow Data Dump Importer. Forked from https://bitbucket.org/bitpusher/soddi/ after the original author passed away.
- Host: GitHub
- URL: https://github.com/brentozarultd/soddi
- Owner: BrentOzarULTD
- License: other
- Created: 2014-01-16T19:43:40.000Z (almost 11 years ago)
- Default Branch: master
- Last Pushed: 2018-06-18T11:57:03.000Z (over 6 years ago)
- Last Synced: 2024-10-12T07:22:31.108Z (2 months ago)
- Topics: microsoft-sql-server, soddi, sql-server, stack-overflow, stackoverflow
- Language: C#
- Homepage:
- Size: 668 KB
- Stars: 81
- Watchers: 17
- Forks: 27
- Open Issues: 2
-
Metadata Files:
- Readme: README.md
- License: LICENSE.md
Awesome Lists containing this project
README
soddi
=====StackOverflow Data Dump Importer. Forked from https://bitbucket.org/bitpusher/soddi/ after the original author passed away.
This app takes the Stack Exchange Data Dump XML files after you've downloaded them from here: https://archive.org/details/stackexchange
And loads them into a database. If you don't want to hassle with this, just grab the Stack Overflow database in Microsoft SQL Server format from here: https://www.brentozar.com/archive/2015/10/how-to-download-the-stack-overflow-database-via-bittorrent/
Using It
--------- Compile the latest dev branch yourself from source, or download the most recent official release: https://github.com/BrentOzarULTD/soddi/releases
- Create a folder to hold the extracted XML files (e.g. `C:\TEMP`)
- Extract files into a separate folder per StackExchange site (e.g. `122017 Stack Overflow`). The MMYYYY format is important.
- Fire up the `soddi.exe`
- Select the extracted XML folder in the **Source** text box.
- Select the Sites you want to import
- Set up your connection string
- Adjust batch size as you see fit
- Click **Import**
- Depending on the size of the import, you may want to grab a sandwich