https://github.com/shubhamgoyal575/topcompaineslist_webscraping
- Host: GitHub
- URL: https://github.com/shubhamgoyal575/topcompaineslist_webscraping
- Owner: shubhamgoyal575
- Created: 2024-06-10T06:55:08.000Z (over 1 year ago)
- Default Branch: main
- Last Pushed: 2024-06-10T07:01:17.000Z (over 1 year ago)
- Last Synced: 2024-06-10T08:54:40.313Z (over 1 year ago)
- Language: Jupyter Notebook
- Size: 70.3 KB
- Stars: 0
- Watchers: 1
- Forks: 0
- Open Issues: 0
Metadata Files:
- Readme: README.md
README
# Web Scraping Top Companies using Python
This repository contains a Python project designed to scrape information about top companies from a specified website.
The project uses BeautifulSoup and Requests for web scraping, and Pandas for data manipulation and storage. It extracts data about top companies, such as their names, rankings, revenue, and other relevant details, from a given website.
The scraped data is then cleaned and stored in a CSV file for further analysis.

**Features**
- Scrape company names, rankings, revenue, and more.
- Clean and preprocess the scraped data.
- Save the data in a structured CSV file.
- Error handling and logging.
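
The features above can be sketched roughly as follows. This is a minimal illustration, not the notebook's actual code: the README does not name the target website or its page layout, so the table structure, the column names (`rank`, `name`, `revenue`), the sample companies, the output filename, and the helper functions `fetch_html`, `parse_companies`, and `clean_companies` are all assumptions made for the example.

```python
import logging

import pandas as pd
import requests
from bs4 import BeautifulSoup

logging.basicConfig(level=logging.INFO)
logger = logging.getLogger(__name__)

COLUMNS = ["rank", "name", "revenue"]  # assumed column layout


def fetch_html(url, timeout=10):
    """Download a page; return its HTML, or None if the request fails."""
    try:
        response = requests.get(url, timeout=timeout)
        response.raise_for_status()
    except requests.RequestException as exc:
        logger.error("Request to %s failed: %s", url, exc)
        return None
    return response.text


def parse_companies(html):
    """Turn the rows of the first <table> into a DataFrame of raw strings."""
    soup = BeautifulSoup(html, "html.parser")
    table = soup.find("table")
    if table is None:
        logger.warning("No <table> found in the page")
        return pd.DataFrame(columns=COLUMNS)
    records = []
    for row in table.find_all("tr"):
        cells = [td.get_text(strip=True) for td in row.find_all("td")]
        if len(cells) >= 3:  # header rows use <th>, so they yield no cells
            records.append(dict(zip(COLUMNS, cells[:3])))
    return pd.DataFrame(records, columns=COLUMNS)


def clean_companies(df):
    """Convert rank and revenue to numbers and drop rows that fail to parse."""
    out = df.copy()
    out["rank"] = pd.to_numeric(out["rank"], errors="coerce")
    # Strip currency symbols and thousands separators before converting.
    out["revenue"] = pd.to_numeric(
        out["revenue"].str.replace(r"[^0-9.]", "", regex=True), errors="coerce"
    )
    out = out.dropna(subset=["rank", "revenue"])
    return out.astype({"rank": int}).reset_index(drop=True)


# Hypothetical sample page standing in for the real website.
SAMPLE_HTML = """
<table>
  <tr><th>Rank</th><th>Company</th><th>Revenue</th></tr>
  <tr><td>1</td><td>Acme Corp</td><td>$1,234.5</td></tr>
  <tr><td>2</td><td>Globex</td><td>980</td></tr>
  <tr><td>3</td><td>Initech</td><td>N/A</td></tr>
</table>
"""

companies = clean_companies(parse_companies(SAMPLE_HTML))

if __name__ == "__main__":
    # Save the cleaned data as a structured CSV file (filename is an assumption).
    companies.to_csv("top_companies.csv", index=False)
```

For a live page, the sample string would be replaced by the result of `fetch_html(url)`, checking for `None` before parsing. Rows whose rank or revenue cannot be converted to a number are dropped here; a real project might prefer to keep them and log a warning instead.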