Ecosyste.ms: Awesome

An open API service indexing awesome lists of open source software.

Awesome Lists | Featured Topics | Projects

https://github.com/sidhantpanda/wiki-scraper

A wikipedia category scraper
https://github.com/sidhantpanda/wiki-scraper

Last synced: 23 days ago
JSON representation

A wikipedia category scraper

Awesome Lists containing this project

README

        

#Wiki Scraper

This is a implementation of the wiki tools to get articles from a specific category from wikipedia.

The edit the categories list in poc.py to add/delete categories of your choice.

##How to run

In the terminal, navigate to the repo and run

$python poc.py

This will create directory called "data" (ignored in .gitignore) and create directories of the categories and save the articles in a text file in the respective directories.