https://github.com/rjohnpaul/web_scrapper_py

This is a simple web scraping app built with Python, Flask, and BeautifulSoup. The app allows you to scrape data from a website and display the results on a web page.
https://github.com/rjohnpaul/web_scrapper_py

Last synced: 3 months ago
JSON representation

This is a simple web scraping app built with Python, Flask, and BeautifulSoup. The app allows you to scrape data from a website and display the results on a web page.

Host: GitHub
URL: https://github.com/rjohnpaul/web_scrapper_py
Owner: RJohnPaul
License: mit
Created: 2024-01-10T16:40:34.000Z (over 1 year ago)
Default Branch: main
Last Pushed: 2024-01-11T05:06:26.000Z (over 1 year ago)
Last Synced: 2025-01-20T07:44:02.692Z (5 months ago)
Size: 3.19 MB
Stars: 0
Watchers: 1
Forks: 0
Open Issues: 0
Metadata Files:
- Readme: README.md
- License: LICENSE

Awesome Lists containing this project

README

# Web Scraping App

This is a simple web scraping app built with Python, Flask, and BeautifulSoup. The app allows you to scrape data from a website and display the results on a web page.

## Getting Started

Follow the instructions below to set up and run the web scraping app on your local machine.

### Prerequisites

Make sure you have the following installed:

- [Python](https://www.python.org/downloads/)
- [pip](https://pip.pypa.io/en/stable/installation/)
- [Flask](https://flask.palletsprojects.com/en/2.1.x/installation/)

### Installation

1. Clone the repository:

```bash
git clone https://github.com/your-username/web-scraping-app.git
```

2. Navigate to the project directory:

```bash
cd web-scraping-app
```

3. Install dependencies:

```bash
pip install -r requirements.txt
```

This will install Flask along with other necessary packages.

### File Structure

```plaintext
web-scraping-app/
│
├── templates/
│ ├── index.html
│ └── results.html
│
├── app.py
├── requirements.txt
├── LICENSE
├── README.md
└── .gitignore
```

### Usage

1. Run the Flask app:

```bash
python app.py
```

2. Open your browser and go to [http://localhost:5000](http://localhost:5000).

3. Click the "Scrape Website" button to initiate the scraping process.

## Features

- Random User-Agent rotation for scraping
- Extracts title, paragraphs, images, and links from a specified website
- Displays the scraped data on a web page using Flask

## Contributing

Contributions are welcome! If you'd like to contribute to this project, please follow these steps:

1. Fork the repository
2. Create your feature branch: `git checkout -b feature/new-feature`
3. Commit your changes: `git commit -m 'Add a new feature'`
4. Push to the branch: `git push origin feature/new-feature`
5. Submit a pull request

## License

This project is licensed under the MIT License - see the [LICENSE](LICENSE) file for details.

---

ecosyste.ms

Data

Tools

Indexes

Applications

Experiments

Awesome

https://github.com/rjohnpaul/web_scrapper_py

Awesome Lists containing this project

README