https://github.com/hackerajofficial/web-scraping-application
Web scraping to extract data from a website and stores it in a structured format. This project aims to assess your proficiency in web scraping techniques, data handling, error management.
https://github.com/hackerajofficial/web-scraping-application
hackeraj hackeraj-official hackerajofficial web-scraping-application
Last synced: 3 months ago
JSON representation
Web scraping to extract data from a website and stores it in a structured format. This project aims to assess your proficiency in web scraping techniques, data handling, error management.
- Host: GitHub
- URL: https://github.com/hackerajofficial/web-scraping-application
- Owner: HackerajOfficial
- Created: 2024-04-23T04:44:51.000Z (about 1 year ago)
- Default Branch: master
- Last Pushed: 2024-04-23T10:31:28.000Z (about 1 year ago)
- Last Synced: 2025-01-08T01:58:55.570Z (4 months ago)
- Topics: hackeraj, hackeraj-official, hackerajofficial, web-scraping-application
- Language: Python
- Homepage:
- Size: 3.91 KB
- Stars: 0
- Watchers: 1
- Forks: 0
- Open Issues: 0
-
Metadata Files:
- Readme: README.md
Awesome Lists containing this project
README
# Web Scraping Application
## Description:
Web scraping to extract data from a website and stores it in a structured
format. This project aims to assess your proficiency in web scraping techniques, data
handling, error management.## Technologies Used:
- **bs4 / BeautifulSoup:** Beautiful Soup is a Python library used for web scraping purposes. It provides tools for parsing HTML and XML documents and extracting useful information from them.- **requests:** The requests library in Python is a powerful and user-friendly tool for making HTTP requests.
## Usage:
1. Clone the repository to your local machine.
```
git clone https://github.com/HackerajOfficial/Web-Scraping-Application.git
```
2. Active environment
```
venv\Scripts\activate
```
3. Install the necessary dependencies
```
pip install -r requirements.txt
```
4. Start the Django development server:
```
python scraper.py
```## Limitations:
- **Website Changes:** Web scraping relies on the structure of the website being consistent over time. If the website's structure changes, such as class names or HTML layout, the scraping code may break.