Ecosyste.ms: Awesome
An open API service indexing awesome lists of open source software.
https://github.com/akash1070/web-scraping
scrape data from websites using great Python Libraries like: BeautifulSoup: prettify, Requests, Selenium, Scrapy
amazon web-scraping wikipedia
Last synced: about 2 months ago
- Host: GitHub
- URL: https://github.com/akash1070/web-scraping
- Owner: Akash1070
- Created: 2022-09-07T05:43:19.000Z (over 2 years ago)
- Default Branch: main
- Last Pushed: 2022-09-07T11:29:00.000Z (over 2 years ago)
- Last Synced: 2024-04-05T11:45:28.599Z (10 months ago)
- Topics: amazon, web-scraping, wikipedia
- Language: Jupyter Notebook
- Homepage:
- Size: 4.59 MB
- Stars: 0
- Watchers: 2
- Forks: 0
- Open Issues: 0
Metadata Files:
- Readme: README.md
README
# **Web Scraping**
Web scraping is the process of extracting and processing large amounts of data from the web. It is a useful skill for anyone who processes and analyzes large datasets. When you find data on the web and there is no direct way to download it, web scraping fills the gap, and Python makes it easier to extract that data into a form that can be processed and analyzed.
**WEB SCRAPING** of the following pages:
1) Amazon Webpage
2) Wikipedia - Artificial Intelligence Page
3) ISB Data

Scrape data from websites using great Python libraries like:
1) BeautifulSoup: prettify
2) Requests
3) Selenium
4) Scrapy

## Authors
- [@Akash Kumar Jha](https://github.com/Akash1070)
## Installation
To install the libraries used in this project, follow the steps below:
```python
# Run in a Jupyter notebook cell; the leading "!" passes the command to the shell
!pip install requests beautifulsoup4 pandas
import requests      # to make requests to the required page
import bs4           # contains BeautifulSoup for pulling data out of HTML and XML files
import pandas as pd  # to convert our data to a DataFrame
```
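As a rough illustration of how the three imports above fit together, the sketch below fetches a page, parses a table with BeautifulSoup, and loads it into a DataFrame. It is not taken from the project's notebooks: the helper names `fetch_html` and `table_to_dataframe` and the `SAMPLE_HTML` markup are hypothetical, chosen so the example runs without network access.

```python
import bs4
import pandas as pd
import requests


def fetch_html(url: str, timeout: float = 10.0) -> str:
    """Download a page and return its HTML, raising on HTTP errors."""
    response = requests.get(url, timeout=timeout)
    response.raise_for_status()
    return response.text


def table_to_dataframe(html: str) -> pd.DataFrame:
    """Parse the first <table> in `html` into a DataFrame (first row = header)."""
    soup = bs4.BeautifulSoup(html, "html.parser")
    table = soup.find("table")
    if table is None:
        raise ValueError("no <table> element found")
    # Collect the stripped text of every header/data cell, row by row.
    rows = [
        [cell.get_text(strip=True) for cell in tr.find_all(["th", "td"])]
        for tr in table.find_all("tr")
    ]
    header, *body = rows
    return pd.DataFrame(body, columns=header)


# Hypothetical sample markup, used so the sketch runs offline.
SAMPLE_HTML = """
<table>
  <tr><th>Library</th><th>Purpose</th></tr>
  <tr><td>Requests</td><td>HTTP client</td></tr>
  <tr><td>BeautifulSoup</td><td>HTML parsing</td></tr>
</table>
"""

df = table_to_dataframe(SAMPLE_HTML)
print(df)
```

For a live page, the same parser could be fed with `table_to_dataframe(fetch_html(url))`, assuming the page serves its table in static HTML. Calling `soup.prettify()` on a parsed document returns indented markup, which helps when working out which tags to select.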
## Running Flask API
To run the Flask API, run the following command:
```bash
python app.py
```
## About Me
Data Science Enthusiast | Petroleum Engineering Graduate | Solving Problems Using Data
# Hi, I'm Akash!
## Links
[![github](https://img.shields.io/badge/github-000?style=for-the-badge&logo=ko-fi&logoColor=white)](https://github.com/Akash1070)
[![linkedin](https://img.shields.io/badge/linkedin-0A66C2?style=for-the-badge&logo=linkedin&logoColor=white)](https://www.linkedin.com/in/akashkumar107/)
## Other Common GitHub Profile Sections
I'm interested in Petroleum Engineering
I'm currently learning Data Science | Data Analytics | Business Analytics
I'm looking to collaborate on Ideas & Data
## Skills
1. Data Scientist
2. Data Analyst
3. Business Analyst
4. Machine Learning