https://github.com/languagemachines/bioport
Scrape pages about persons ('biographies') from Wikipedia.
https://github.com/languagemachines/bioport
Last synced: about 1 year ago
JSON representation
Scrape pages about persons ('biographies') from Wikipedia.
- Host: GitHub
- URL: https://github.com/languagemachines/bioport
- Owner: LanguageMachines
- Created: 2020-09-23T11:37:54.000Z (over 5 years ago)
- Default Branch: master
- Last Pushed: 2020-11-03T13:54:10.000Z (over 5 years ago)
- Last Synced: 2025-01-31T11:32:44.243Z (over 1 year ago)
- Language: Python
- Size: 50.8 KB
- Stars: 0
- Watchers: 6
- Forks: 1
- Open Issues: 0
-
Metadata Files:
- Readme: README
Awesome Lists containing this project
README
Scripts to collect biographical information about people from their wikipedia pages.
Created June-Oct 2020 by Merijn Beeksma
extra filtering was done by Iris Hendrickx by explicitly removed those json files that contained one of these categories:
Cidades
Municípios
Concelhos
Álbuns
Fundações