https://github.com/abhishekshree/gsoc-scraper
Scraping the results.
https://github.com/abhishekshree/gsoc-scraper
Last synced: 9 months ago
JSON representation
Scraping the results.
- Host: GitHub
- URL: https://github.com/abhishekshree/gsoc-scraper
- Owner: abhishekshree
- Created: 2021-05-17T17:45:26.000Z (over 4 years ago)
- Default Branch: master
- Last Pushed: 2021-05-17T21:06:50.000Z (over 4 years ago)
- Last Synced: 2025-01-21T09:48:57.011Z (11 months ago)
- Language: Python
- Size: 51.8 KB
- Stars: 1
- Watchers: 0
- Forks: 0
- Open Issues: 0
-
Metadata Files:
- Readme: README.md
Awesome Lists containing this project
README
# gsoc-scraper
For scraping the results.
### Usage:
- This scrapper uses, the ```requests``` library.
- There was an API endpoint which I found through the Developer console. This endpoint will always fetch the current projects so can be reused.
To run this locally, the steps are:
- Run the command ```pip install -r requirements.txt```
- Run ```python scraper.py```
- This will create a ```gsoc2021.csv``` file with the desired columns.
---
[Link to the endpoint](https://summerofcode.withgoogle.com/api/program/current/project/?page=1&page_size=20)
This took some time so let it run for about 5 minutes peacefully please.