https://github.com/ptyadana/web-scraping-and-api-in-python
Web Scraping and API in Python using beautifulsoup, requests, requests-xml, etc for processing multiple APIs and scraping multple sites such as youtube, soundcloud and many more.
https://github.com/ptyadana/web-scraping-and-api-in-python
365datascience api beautifulsoup exchangeratesapi-io github-api itunes-api jokes-api jupyter-notebook juypter lxml python3 requests requests-html soundcloud steam urllib webscraping youtube
Last synced: about 1 month ago
JSON representation
Web Scraping and API in Python using beautifulsoup, requests, requests-xml, etc for processing multiple APIs and scraping multple sites such as youtube, soundcloud and many more.
- Host: GitHub
- URL: https://github.com/ptyadana/web-scraping-and-api-in-python
- Owner: ptyadana
- License: mit
- Created: 2020-04-14T04:20:14.000Z (over 5 years ago)
- Default Branch: master
- Last Pushed: 2020-04-25T14:17:21.000Z (over 5 years ago)
- Last Synced: 2025-04-19T21:00:41.530Z (7 months ago)
- Topics: 365datascience, api, beautifulsoup, exchangeratesapi-io, github-api, itunes-api, jokes-api, jupyter-notebook, juypter, lxml, python3, requests, requests-html, soundcloud, steam, urllib, webscraping, youtube
- Language: Jupyter Notebook
- Homepage:
- Size: 4.41 MB
- Stars: 20
- Watchers: 3
- Forks: 12
- Open Issues: 0
-
Metadata Files:
- Readme: readme.md
- License: LICENSE
Awesome Lists containing this project
README
# Web Scraping and API in Python
Python project for integrations with different API and web scraping with BeautifulSoup and HTML-requests libraries for multiple scraping projects such as Youtube, dynamically generated Javascript for SounCloud and many more.
## Built With
* [python 3](https://www.python.org/)
* [requests](https://requests.readthedocs.io/en/master/) - Requests is an elegant and simple HTTP library for Python.
* [pandas](https://pandas.pydata.org/) - fast, powerful, flexible and easy to use open source data analysis and manipulation tool
* [Beautiful Soup](https://www.crummy.com/software/BeautifulSoup/) - for screen scraping library
* [requests-HTML](https://requests.readthedocs.io/projects/requests-html/en/latest/) - make parsing HTML as simple and intuitive as possible with Full JavaScript Support
* [python html.parser](https://docs.python.org/3/library/html.parser.html) - html parser
* [lxml parser](https://lxml.de/parsing.html) - asd
* [html5lib parser](https://github.com/html5lib/html5lib-python) - simple and powerful API for parsing XML and HTML
* [urllib](https://docs.python.org/3/library/urllib.parse.html#module-urllib.parse) - URL handling module
## API Projects
* [Currency Exchange Rate API](https://exchangeratesapi.io/)
* [iTune API](https://developer.apple.com/library/archive/documentation/AudioVideo/Conceptual/iTuneSearchAPI/Searching.html#//apple_ref/doc/uid/TP40017632-CH5-SW1)
* [GitHub Jobs API](https://jobs.github.com/api)
* [Official Joke API](https://github.com/15Dkatz/official_joke_api)
* [Joke API](https://sv443.net/jokeapi)
## Web Scraping Projects
* [Rotten Tomatoes](https://www.rottentomatoes.com/)
* [Steam](https://store.steampowered.com/games/)
* [Youtube](https://www.youtube.com/)
* [Sound Cloud](https://soundcloud.com/)
## License
This project is licensed under the MIT License - see the [LICENSE.md](LICENSE.md) file for details
## References
* The challenges are part of [Web Scraping and API in Python course](https://365datascience.com/courses/web-scraping-and-api-fundamentals-in-python/) by 365 Data Science.