{"id":25236764,"url":"https://github.com/relostar-devil/web-scraping-using-python","last_synced_at":"2026-04-29T16:35:11.802Z","repository":{"id":276930661,"uuid":"930765700","full_name":"Relostar-Devil/Web-Scraping-Using-Python","owner":"Relostar-Devil","description":"Wikipedia Web Scraper: Python script to extract and display Asian countries by area. Demonstrates web scraping techniques using Beautiful Soup and Pandas.","archived":false,"fork":false,"pushed_at":"2025-02-11T07:15:04.000Z","size":131,"stargazers_count":0,"open_issues_count":0,"forks_count":0,"subscribers_count":1,"default_branch":"main","last_synced_at":"2025-04-05T18:13:17.629Z","etag":null,"topics":["beautifulsoup4","pandas","python"],"latest_commit_sha":null,"homepage":"","language":"Jupyter Notebook","has_issues":true,"has_wiki":null,"has_pages":null,"mirror_url":null,"source_name":null,"license":null,"status":null,"scm":"git","pull_requests_enabled":true,"icon_url":"https://github.com/Relostar-Devil.png","metadata":{"files":{"readme":"README.md","changelog":null,"contributing":null,"funding":null,"license":null,"code_of_conduct":null,"threat_model":null,"audit":null,"citation":null,"codeowners":null,"security":null,"support":null,"governance":null,"roadmap":null,"authors":null,"dei":null,"publiccode":null,"codemeta":null}},"created_at":"2025-02-11T07:09:26.000Z","updated_at":"2025-02-11T07:17:37.000Z","dependencies_parsed_at":"2025-02-11T08:26:46.150Z","dependency_job_id":"3c831fd8-cb7f-45ff-b87c-066efd03c3d1","html_url":"https://github.com/Relostar-Devil/Web-Scraping-Using-Python","commit_stats":null,"previous_names":["relostar-devil/web-scraping-using-python"],"tags_count":0,"template":false,"template_full_name":null,"repository_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/Relostar-Devil%2FWeb-Scraping-Using-Python","tags_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/Relostar-Devil%2FWeb-Scraping-Using-Python/tags","releases_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/Relostar-Devil%2FWeb-Scraping-Using-Python/releases","manifests_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/Relostar-Devil%2FWeb-Scraping-Using-Python/manifests","owner_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/owners/Relostar-Devil","download_url":"https://codeload.github.com/Relostar-Devil/Web-Scraping-Using-Python/tar.gz/refs/heads/main","host":{"name":"GitHub","url":"https://github.com","kind":"github","repositories_count":247378146,"owners_count":20929297,"icon_url":"https://github.com/github.png","version":null,"created_at":"2022-05-30T11:31:42.601Z","updated_at":"2022-07-04T15:15:14.044Z","host_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub","repositories_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories","repository_names_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repository_names","owners_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/owners"}},"keywords":["beautifulsoup4","pandas","python"],"created_at":"2025-02-11T15:31:49.395Z","updated_at":"2026-04-29T16:35:11.759Z","avatar_url":"https://github.com/Relostar-Devil.png","language":"Jupyter Notebook","funding_links":[],"categories":[],"sub_categories":[],"readme":"# Web Scraping Top Asian Countries by Area from Wikipedia\n\n## Overview\n\nThis repository contains a Python script that performs web scraping to extract a list of Asian countries by area from a Wikipedia page. The script retrieves the data, cleans and formats it, and presents the top countries in a user-friendly format.\n\n## Project Goals\n\n*   Scrape data of Asian countries area from the Wikipedia page.\n*   Clean and process the extracted data.\n*   Present the data in a clear and organized manner.\n\n## Code Description\n\nThe Python script `Web-Scraping-Using-Python.ipynb` uses the following libraries:\n\n*   **requests:** To send HTTP requests to the Wikipedia page.\n*   **Beautiful Soup:** To parse the HTML content and extract the table data.\n*   **Pandas:** To create and manipulate dataframes for cleaning and analysis.\n\n## Methodology\n\n1.  **Send Request:** The script sends an HTTP request to the specified Wikipedia page URL.\n2.  **Parse HTML:** Beautiful Soup is used to parse the HTML content of the page.\n3.  **Extract Table:** The script locates the table containing the list of Asian countries by area.\n4.  **Create DataFrame:** The extracted table data is converted into a Pandas DataFrame.\n5.  **Data Cleaning:** The data is cleaned by removing unwanted characters, handling missing values, and converting data types.\n6.  **Display Results:** The cleaned and processed data is displayed in a tabular format.\n\n## Usage\n\n1.  Make sure you have Python installed.\n2.  Install the required libraries:\n\n    ```\n    pip install requests beautifulsoup4 pandas\n    ```\n\n3.  Run the script:\n\n    ```\n    jupyter notebook Web-Scraping-Using-Python.ipynb\n    ```\n\n4.  Follow the instructions in the Jupyter Notebook to execute the code and view the results.\n\n## Sample Output\n\nThe script outputs a list of country names by area present in asia\n\n\n## Skills Demonstrated\n\n*   Web Scraping\n*   Data Extraction\n*   HTML Parsing\n*   Data Cleaning\n*   Pandas\n*   Python Programming\n","project_url":"https://awesome.ecosyste.ms/api/v1/projects/github.com%2Frelostar-devil%2Fweb-scraping-using-python","html_url":"https://awesome.ecosyste.ms/projects/github.com%2Frelostar-devil%2Fweb-scraping-using-python","lists_url":"https://awesome.ecosyste.ms/api/v1/projects/github.com%2Frelostar-devil%2Fweb-scraping-using-python/lists"}