{"id":22505551,"url":"https://github.com/dharmendradiwaker/village-data","last_synced_at":"2026-05-02T05:43:09.979Z","repository":{"id":237733863,"uuid":"795141895","full_name":"dharmendradiwaker/village-data","owner":"dharmendradiwaker","description":"This project involves scraping data from the Delhi Election Commision website to gather information about the locations within each Assembly Constituency (AC).","archived":false,"fork":false,"pushed_at":"2024-11-20T15:28:55.000Z","size":1006,"stargazers_count":1,"open_issues_count":0,"forks_count":0,"subscribers_count":1,"default_branch":"main","last_synced_at":"2025-02-02T01:28:49.933Z","etag":null,"topics":["selenium","webscraping"],"latest_commit_sha":null,"homepage":"","language":"Python","has_issues":true,"has_wiki":null,"has_pages":null,"mirror_url":null,"source_name":null,"license":null,"status":null,"scm":"git","pull_requests_enabled":true,"icon_url":"https://github.com/dharmendradiwaker.png","metadata":{"files":{"readme":"README.md","changelog":null,"contributing":null,"funding":null,"license":null,"code_of_conduct":null,"threat_model":null,"audit":null,"citation":null,"codeowners":null,"security":null,"support":null,"governance":null,"roadmap":null,"authors":null,"dei":null,"publiccode":null,"codemeta":null}},"created_at":"2024-05-02T17:02:44.000Z","updated_at":"2024-11-20T15:28:59.000Z","dependencies_parsed_at":"2024-05-04T15:31:07.757Z","dependency_job_id":"d02cd0c7-ba1c-4bfe-8bb4-95243a7a4971","html_url":"https://github.com/dharmendradiwaker/village-data","commit_stats":null,"previous_names":["dharmendradiwaker/village-data"],"tags_count":0,"template":false,"template_full_name":null,"repository_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/dharmendradiwaker%2Fvillage-data","tags_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/dharmendradiwaker%2Fvillage-data/tags","releases_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/dharmendradiwaker%2Fvillage-data
/releases","manifests_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/dharmendradiwaker%2Fvillage-data/manifests","owner_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/owners/dharmendradiwaker","download_url":"https://codeload.github.com/dharmendradiwaker/village-data/tar.gz/refs/heads/main","host":{"name":"GitHub","url":"https://github.com","kind":"github","repositories_count":245939355,"owners_count":20697215,"icon_url":"https://github.com/github.png","version":null,"created_at":"2022-05-30T11:31:42.601Z","updated_at":"2022-07-04T15:15:14.044Z","host_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub","repositories_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories","repository_names_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repository_names","owners_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/owners"}},"keywords":["selenium","webscraping"],"created_at":"2024-12-07T00:20:50.856Z","updated_at":"2026-05-02T05:43:09.941Z","avatar_url":"https://github.com/dharmendradiwaker.png","language":"Python","funding_links":[],"categories":[],"sub_categories":[],"readme":"# Data Collection Project\n\nThis project involves scraping data from 'https://vlist.in/' to gather information about villages, tehsils, districts, and states in India.\n\n## Requirements\n\n- Python (version 3.x)\n- Selenium WebDriver\n- BeautifulSoup\n- Chrome WebDriver (or WebDriver compatible with your browser)\n\nInstall the required libraries using pip:\n\n```bash\npip install selenium beautifulsoup4\n```\n\nMake sure you have Chrome WebDriver installed and added to your system PATH.\n\n## Usage\n\n1. Clone this repository:\n\n```bash\ngit clone https://github.com/dharmendradiwaker/village-data.git\n```\n\n2. Navigate to the project directory:\n\n```bash\ncd \u003cproject_directory\u003e\n```\n\n3. Update the `url` variable in the script with the desired URL:\n\n```python\nurl = 'https://vlist.in/'\n```\n\n4. 
Run the script to scrape the data:\n\n```bash\npython all_states.py\n```\n\nThe script will extract information about states, districts, tehsils, and villages from the provided URL and store the data in JSON format.\n\n## Project Structure\n\n- `all_states.py`: Contains the main scraping functions (`get_state`, `get_one_state`, `get_district`, `get_tehsil`, `get_village`).\n- `states_data.json`: Output JSON file containing data for only some states. Run the script yourself to collect data for all states.\n- `delhi.json`: Output JSON file containing data for a specific state.\n\n## Note\n\n- The scraping process may take some time due to delays introduced between requests (`time.sleep(delay)`).\n- Ensure you comply with the website's terms of service and use responsible scraping practices.\n\n---\n\n\n# Delhi Assembly Constituency Location Data Scraper\n\nThis project involves scraping data from the Delhi Election Commission website to gather information about the locations within each Assembly Constituency (AC).\n\n## Requirements\n\n- Python (version 3.x)\n- Selenium WebDriver\n- BeautifulSoup\n- Chrome WebDriver (or WebDriver compatible with your browser)\n\nInstall the required libraries using pip:\n\n```bash\npip install selenium beautifulsoup4\n```\n\nMake sure you have Chrome WebDriver installed and added to your system PATH.\n\n## Usage\n\n1. Clone this repository:\n\n```bash\ngit clone \u003crepository_url\u003e\n```\n\n2. Navigate to the project directory:\n\n```bash\ncd \u003cproject_directory\u003e\n```\n\n3. Update the `url_ac` variable in the script (`delhi_name.py`) with the desired URL:\n\n```python\nurl_ac = 'https://www.ceodelhi.gov.in/AcListEng.aspx'\n```\n\n4. 
Run the script to scrape the data:\n\n```bash\npython delhi_name.py\n```\n\nThe script will extract information about locations within each Assembly Constituency (AC) in Delhi from the provided URL and store the data in a JSON file (`ac_data4.json`).\n\n## Project Structure\n\n- `delhi_name.py`: Contains the main scraping functions (`get_vill`, `get_ac`) for extracting AC location data.\n- `ac_data4.json`: Output JSON file containing Assembly Constituency location data.\n\n## Note\n\n- The scraping process may take some time due to delays introduced between requests (`time.sleep(delay)`).\n- Ensure you comply with the website's terms of service and use responsible scraping practices.\n","project_url":"https://awesome.ecosyste.ms/api/v1/projects/github.com%2Fdharmendradiwaker%2Fvillage-data","html_url":"https://awesome.ecosyste.ms/projects/github.com%2Fdharmendradiwaker%2Fvillage-data","lists_url":"https://awesome.ecosyste.ms/api/v1/projects/github.com%2Fdharmendradiwaker%2Fvillage-data/lists"}