https://github.com/rjohnpaul/web_scrapper_py
This is a simple web scraping app built with Python, Flask, and BeautifulSoup. The app allows you to scrape data from a website and display the results on a web page.
https://github.com/rjohnpaul/web_scrapper_py
Last synced: 3 months ago
JSON representation
This is a simple web scraping app built with Python, Flask, and BeautifulSoup. The app allows you to scrape data from a website and display the results on a web page.
- Host: GitHub
- URL: https://github.com/rjohnpaul/web_scrapper_py
- Owner: RJohnPaul
- License: mit
- Created: 2024-01-10T16:40:34.000Z (over 1 year ago)
- Default Branch: main
- Last Pushed: 2024-01-11T05:06:26.000Z (over 1 year ago)
- Last Synced: 2025-01-20T07:44:02.692Z (5 months ago)
- Size: 3.19 MB
- Stars: 0
- Watchers: 1
- Forks: 0
- Open Issues: 0
-
Metadata Files:
- Readme: README.md
- License: LICENSE
Awesome Lists containing this project
README
# Web Scraping App
![]()
![]()
This is a simple web scraping app built with Python, Flask, and BeautifulSoup. The app allows you to scrape data from a website and display the results on a web page.
## Getting Started
Follow the instructions below to set up and run the web scraping app on your local machine.
### Prerequisites
Make sure you have the following installed:
- [Python](https://www.python.org/downloads/)
- [pip](https://pip.pypa.io/en/stable/installation/)
- [Flask](https://flask.palletsprojects.com/en/2.1.x/installation/)### Installation
1. Clone the repository:
```bash
git clone https://github.com/your-username/web-scraping-app.git
```2. Navigate to the project directory:
```bash
cd web-scraping-app
```3. Install dependencies:
```bash
pip install -r requirements.txt
```This will install Flask along with other necessary packages.
### File Structure
```plaintext
web-scraping-app/
│
├── templates/
│ ├── index.html
│ └── results.html
│
├── app.py
├── requirements.txt
├── LICENSE
├── README.md
└── .gitignore
```### Usage
1. Run the Flask app:
```bash
python app.py
```2. Open your browser and go to [http://localhost:5000](http://localhost:5000).
3. Click the "Scrape Website" button to initiate the scraping process.## Features
- Random User-Agent rotation for scraping
- Extracts title, paragraphs, images, and links from a specified website
- Displays the scraped data on a web page using Flask## Contributing
Contributions are welcome! If you'd like to contribute to this project, please follow these steps:
1. Fork the repository
2. Create your feature branch: `git checkout -b feature/new-feature`
3. Commit your changes: `git commit -m 'Add a new feature'`
4. Push to the branch: `git push origin feature/new-feature`
5. Submit a pull request## License
This project is licensed under the MIT License - see the [LICENSE](LICENSE) file for details.
![]()
---