https://github.com/aastopher/intelligent_asynchronous_web_scraper
Intelligent Asynchronous Web-Scraper which will collect completed UFC events and related match data from ufcstats.com
Last synced: 6 months ago
- Host: GitHub
- URL: https://github.com/aastopher/intelligent_asynchronous_web_scraper
- Owner: aastopher
- Created: 2022-09-15T01:00:10.000Z (over 3 years ago)
- Default Branch: master
- Last Pushed: 2022-10-17T22:00:25.000Z (over 3 years ago)
- Last Synced: 2025-06-01T01:14:49.615Z (9 months ago)
- Language: Jupyter Notebook
- Homepage:
- Size: 332 KB
- Stars: 0
- Watchers: 1
- Forks: 0
- Open Issues: 0
Metadata Files:
- Readme: README.md
README
# Intelligent Asynchronous Web-Scraper - UFC Completed Events
An intelligent asynchronous web scraper that collects completed UFC events and related match data from [ufcstats.com](http://www.ufcstats.com/statistics/events/completed?page=all). Clearing either output file, or both, causes the cleared file(s) to be re-scraped in full. Otherwise, the scraper compares the events listed on the site against the stored events, scrapes only the new events, and updates each file with the new entries. NOTE: manually removing events from `completed_events.csv` and then re-running currently results in duplicate entries in `completed_event_matches.csv`. In all other cases, re-running the notebook either updates or refreshes each file, depending on whether that file exists.
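The update-or-refresh behavior described above can be illustrated with a minimal sketch. This is not the notebook's actual code: the function names and the `event_url` key column are hypothetical, chosen only to show how stored events might be compared against freshly listed ones.

```python
import csv
from pathlib import Path


def load_stored_keys(path, key="event_url"):
    """Return the set of key values already stored in the CSV at `path`.

    A missing or empty file yields an empty set, so every scraped
    event is treated as new (equivalent to a full re-scrape).
    `event_url` is an assumed column name, not taken from the repo.
    """
    path = Path(path)
    if not path.exists() or path.stat().st_size == 0:
        return set()
    with path.open(newline="", encoding="utf-8") as f:
        return {row[key] for row in csv.DictReader(f)}


def select_new_events(scraped, stored_keys, key="event_url"):
    """Keep only scraped events whose key is not stored yet, preserving order."""
    return [event for event in scraped if event[key] not in stored_keys]
```

Under these assumptions, only the events returned by `select_new_events` would need their match pages fetched. Applying a similar check to `completed_event_matches.csv` before appending might be one way to avoid the duplicate entries mentioned in the NOTE.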
**TO-DO:**
* Set the notebook to run on a schedule with GitHub Actions