https://github.com/imsahil007/gsoc-scraper
Script for creating a simple HTML page that filters orgs on the basis of tech stack
beautifulsoup gsoc gsoc-2020 payload payload-scraping python3 rich websc webscraper
Last synced: about 1 year ago
- Host: GitHub
- URL: https://github.com/imsahil007/gsoc-scraper
- Owner: imsahil007
- License: bsl-1.0
- Created: 2020-08-18T07:18:56.000Z (almost 6 years ago)
- Default Branch: master
- Last Pushed: 2020-11-26T09:43:31.000Z (over 5 years ago)
- Last Synced: 2025-04-13T09:15:37.743Z (about 1 year ago)
- Topics: beautifulsoup, gsoc, gsoc-2020, payload, payload-scraping, python3, rich, websc, webscraper
- Language: Python
- Homepage:
- Size: 349 KB
- Stars: 2
- Watchers: 1
- Forks: 0
- Open Issues: 0
Metadata Files:
- Readme: README.md
- License: LICENSE
README
# GSoC Organisation Scraper
Scripts for lazy people like me to scrape the GSoC organization list and keep only the organizations that use a particular language.
Also shows the number of times each organization was selected for GSoC in the period 2016-2020.
> Note: The user can input only one language at a time. I could add functionality for more, but I don't want to xD
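
The two ideas above (filter by one language, count how many years an organization was selected) can be sketched without any scraping at all. This is only an illustration under assumptions: `ORGS_BY_YEAR` and `filter_by_language` are hypothetical names with made-up sample data, not the structures used in `year_wise.py`, and the sketch assumes an organization matches if it lists the language in any year.

```python
from collections import Counter

# Hypothetical sample data: year -> list of (organization, technologies).
# The real scripts scrape this information; these entries are invented.
ORGS_BY_YEAR = {
    2019: [("Org A", ["python", "c++"]), ("Org B", ["java"])],
    2020: [("Org A", ["python"]), ("Org C", ["python", "rust"])],
}


def filter_by_language(orgs_by_year, language):
    """Return {org_name: years_selected} for orgs that list `language`.

    Assumption: an org matches if it lists the language in any year,
    and the count covers every year the org appears in the data.
    """
    wanted = language.strip().lower()
    selections = Counter()  # org name -> number of years it appears in
    matching = set()        # org names that list the wanted language
    for orgs in orgs_by_year.values():
        for name, technologies in orgs:
            selections[name] += 1
            if wanted in (tech.lower() for tech in technologies):
                matching.add(name)
    return {name: selections[name] for name in sorted(matching)}
```

Calling `filter_by_language(ORGS_BY_YEAR, "Python")` on the sample data returns `{"Org A": 2, "Org C": 1}`. The match is case-insensitive, so `python` and `Python` behave the same.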
# Libraries used:
+ [Requests](http://docs.python-requests.org/en/master/)
+ [BeautifulSoup](https://www.crummy.com/software/BeautifulSoup/bs4/doc/)
+ [Rich](https://github.com/willmcgugan/rich)
+ [yaspin](https://github.com/pavdmyt/yaspin/)
### Requirements :
+ BeautifulSoup
+ Requests
+ Rich
+ yaspin
### Instructions :
```bash
# Clone this repository
git clone https://github.com/imsahil007/GSoC-Scraper.git
# Go into the repository
cd GSoC-Scraper
# Install dependencies (activate your virtual environment for python3 first)
pip3 install -r requirements.txt
# Run the script
python3 year_wise.py
# python3 script.py
# "script.py" is not working because the 2021 list has not been updated yet. It will work in Jan/Feb.
# Enter the language of your choice when asked.
# Example:
Enter the language you want to filter out: python
```
### Screenshots :


## Todos:
+ [x] Links aren't working in table
> Note: Added a sentinel-controlled loop for printing index-wise details
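
For anyone unfamiliar with the term, a sentinel-controlled loop keeps running until a special "stop" value is entered. Below is a minimal sketch of the idea, not the code from this repo: `SENTINEL`, `browse_details`, and the `(name, link)` row format are all hypothetical, and `read`/`write` are injected only so the loop can be exercised without a terminal.

```python
SENTINEL = "q"  # hypothetical stop value; the actual script may use another


def browse_details(orgs, read=input, write=print):
    """Ask for a row index and print that org's details until SENTINEL is entered.

    `orgs` is assumed to be a list of (name, link) tuples.
    """
    while True:
        choice = read(f"Enter an index (or '{SENTINEL}' to quit): ").strip()
        if choice == SENTINEL:
            break  # sentinel seen: leave the loop
        try:
            index = int(choice)
        except ValueError:
            index = -1  # non-numeric input is treated as out of range
        if not 0 <= index < len(orgs):
            write("Invalid index, try again.")
            continue
        name, link = orgs[index]
        write(f"{name}: {link}")
```

Running `browse_details([("Org A", "https://example.org/a")])` in a terminal prompts repeatedly and prints the plain link for each valid index, which sidesteps the links not being clickable inside the table.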
### Contributing:
+ Fork the repo.
+ Create a new branch
+ Commit changes and make a PR.
+ PRs are welcome :)
Contact Me: [Sahil](https://www.linkedin.com/in/imsahil007)