https://github.com/dharmendradiwaker/village-data

This project involves scraping data from the Delhi Election Commision website to gather information about the locations within each Assembly Constituency (AC).
https://github.com/dharmendradiwaker/village-data

selenium webscraping

Last synced: about 2 months ago
JSON representation

This project involves scraping data from the Delhi Election Commision website to gather information about the locations within each Assembly Constituency (AC).

Host: GitHub
URL: https://github.com/dharmendradiwaker/village-data
Owner: dharmendradiwaker
Created: 2024-05-02T17:02:44.000Z (about 2 years ago)
Default Branch: main
Last Pushed: 2024-11-20T15:28:55.000Z (over 1 year ago)
Last Synced: 2025-02-02T01:28:49.933Z (over 1 year ago)
Topics: selenium, webscraping
Language: Python
Homepage:
Size: 982 KB
Stars: 1
Watchers: 1
Forks: 0
Open Issues: 0
Metadata Files:
- Readme: README.md

Awesome Lists containing this project

README

# Data Collection Project

States project involves scraping data from 'https://vlist.in/' to gather information about villages, tehsils, districts, and states in India.

## Requirements

- Python (version 3.x)
- Selenium WebDriver
- BeautifulSoup
- Chrome WebDriver (or WebDriver compatible with your browser)

Install the required libraries using pip:

```bash
pip install selenium beautifulsoup4
```

Make sure you have Chrome WebDriver installed and added to your system PATH.

## Usage

1. Clone this repository:

```bash
git clone https://github.com/dharmendradiwaker/village-data.git
```

2. Navigate to the project directory:

```bash
cd
```

3. Update the `url` variable in the script with the desired URL:

```python
url = 'https://vlist.in/'
```

4. Run the script to scrape the data:

```bash
python all_states.py
```

The script will extract information about states, districts, tehsils, and villages from the provided URL and store the data in JSON format.

## Project Structure

- `all_states.py`: Contains the main scraping functions (`get_state`, `get_one_state`, `get_district`, `get_tehsil`, `get_village`).
- `states_data.json`: Output JSON file containing data for some states only. You can run the code and can get all states data.
- `delhi.json`: Output JSON file containing data for a specific state.

## Note

- The scraping process may take some time due to delays introduced between requests (`time.sleep(delay)`).
- Ensure you comply with the website's terms of service and use responsible scraping practices.

---

# Delhi Assembly Constituency Location Data Scraper

This project involves scraping data from the Delhi Election Commision website to gather information about the locations within each Assembly Constituency (AC).

## Requirements

- Python (version 3.x)
- Selenium WebDriver
- BeautifulSoup
- Chrome WebDriver (or WebDriver compatible with your browser)

Install the required libraries using pip:

```bash
pip install selenium beautifulsoup4
```

Make sure you have Chrome WebDriver installed and added to your system PATH.

## Usage

1. Clone this repository:

```bash
git clone
```

2. Navigate to the project directory:

```bash
cd
```

3. Update the `url` variable in the script (`scraper.py`) with the desired URL:

```python
url_ac = 'https://www.ceodelhi.gov.in/AcListEng.aspx'
```

4. Run the script to scrape the data:

```bash
python delhi_name.py
```

The script will extract information about locations within each Assembly Constituency (AC) in Delhi from the provided URL and store the data in a JSON file (`ac_data4.json`).

## Project Structure

- `delhi_name.py`: Contains the main scraping functions (`get_vill`, `get_ac`) for extracting AC location data.
- `ac_data4.json`: Output JSON file containing Assembly Constituency location data.

## Note

ecosyste.ms

Data

Tools

Indexes

Applications

Experiments

Awesome

https://github.com/dharmendradiwaker/village-data

Awesome Lists containing this project

README