https://github.com/emirhansilsupur/web-scraper-news
Web Scraper for News Related to the Israel-Palestine Conflict
beautifulsoup python selenium-webdriver webscraping
- Host: GitHub
- URL: https://github.com/emirhansilsupur/web-scraper-news
- Owner: emirhansilsupur
- License: mit
- Created: 2023-12-22T15:47:56.000Z (about 1 year ago)
- Default Branch: main
- Last Pushed: 2024-01-02T18:45:25.000Z (about 1 year ago)
- Last Synced: 2024-11-11T21:18:55.997Z (3 months ago)
- Topics: beautifulsoup, python, selenium-webdriver, webscraping
- Language: Python
- Homepage:
- Size: 214 KB
- Stars: 0
- Watchers: 1
- Forks: 0
- Open Issues: 0
Metadata Files:
- Readme: README.md
- Changelog: news_data.csv
- License: LICENSE
README
# Web Scraper for News Related to the Israel-Palestine Conflict
This Python web scraper collects news articles related to the Israel-Palestine conflict from the [Al Jazeera](https://www.aljazeera.com/tag/israel-palestine-conflict/) website. It utilizes Selenium for dynamic page loading and BeautifulSoup for HTML parsing.
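The Selenium-then-BeautifulSoup flow described above can be sketched roughly as follows. This is a minimal illustration, not the repository's actual `israel_palestine_news_scraper.py`: the function names, the `driver_path` parameter, the `title`/`url` columns, and the assumption that each story sits in an `<article>` element whose first link carries the headline are all hypothetical and would need checking against the live page.

```python
from typing import Dict, List

import pandas as pd
from bs4 import BeautifulSoup

# URL taken from the README; everything else here is an illustrative assumption.
TAG_URL = "https://www.aljazeera.com/tag/israel-palestine-conflict/"


def fetch_page_source(url: str, driver_path: str) -> str:
    """Load the page in headless Chrome and return the rendered HTML."""
    # Imported lazily so the parsing helpers can be used without a browser.
    from selenium import webdriver
    from selenium.webdriver.chrome.service import Service

    options = webdriver.ChromeOptions()
    options.add_argument("--headless")
    driver = webdriver.Chrome(
        service=Service(executable_path=driver_path), options=options
    )
    try:
        driver.get(url)
        return driver.page_source
    finally:
        driver.quit()  # always release the browser, even on errors


def parse_articles(html: str) -> List[Dict[str, str]]:
    """Collect a title and link from each <article> (assumed markup)."""
    soup = BeautifulSoup(html, "html.parser")
    rows = []
    for article in soup.find_all("article"):
        link = article.find("a", href=True)
        if link is None:
            continue  # skip articles without a usable link
        rows.append({"title": link.get_text(strip=True), "url": link["href"]})
    return rows


def save_articles(rows: List[Dict[str, str]], path: str = "news_data.csv") -> pd.DataFrame:
    """Write the parsed rows to a CSV file and return the DataFrame."""
    df = pd.DataFrame(rows, columns=["title", "url"])
    df.to_csv(path, index=False)
    return df


def main(driver_path: str) -> None:
    """Run the full pipeline: fetch, parse, save."""
    html = fetch_page_source(TAG_URL, driver_path)
    save_articles(parse_articles(html))
```

Calling `main()` with the location of your ChromeDriver binary would run the whole pipeline. Keeping fetching, parsing, and saving in separate functions means the parsing step can be exercised on a saved HTML snippet without launching a browser.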
## Dependencies
- [Pandas](https://pandas.pydata.org/): Data manipulation and analysis library
- [BeautifulSoup](https://www.crummy.com/software/BeautifulSoup/bs4/doc/): HTML parsing library
- [Selenium](https://www.selenium.dev/documentation/en/): Web testing library for browser automation
## Installation
1. Install the required Python libraries:
```bash
pip install -r requirements.txt
```
2. Download ChromeDriver from the official website: https://sites.google.com/chromium.org/driver/ and place it in the specified path.
## Usage
1. Run the `israel_palestine_news_scraper.py` script to collect news data.
2. The script will navigate to Al Jazeera's Israel-Palestine conflict page, scrape news articles, and save the data to a CSV file (`news_data.csv`).
## License
This project is licensed under the MIT License - see the LICENSE file for details.
### Dataset
The accompanying dataset is available on [Kaggle](https://www.kaggle.com/datasets/emirslspr/israel-hamas-conflict-news-dataset/data).