{"id":24864761,"url":"https://github.com/emmanuel10701/selenium_scraping","last_synced_at":"2025-08-22T10:43:31.639Z","repository":{"id":273047412,"uuid":"918265968","full_name":"Emmanuel10701/Selenium_scraping","owner":"Emmanuel10701","description":"Data-Scraping","archived":false,"fork":false,"pushed_at":"2025-01-20T10:24:35.000Z","size":88,"stargazers_count":5,"open_issues_count":0,"forks_count":0,"subscribers_count":1,"default_branch":"main","last_synced_at":"2025-04-13T00:53:06.677Z","etag":null,"topics":["data-mining","data-science","python","selenium","selenium-webdriver","web-scraping"],"latest_commit_sha":null,"homepage":"","language":"HTML","has_issues":true,"has_wiki":null,"has_pages":null,"mirror_url":null,"source_name":null,"license":null,"status":null,"scm":"git","pull_requests_enabled":true,"icon_url":"https://github.com/Emmanuel10701.png","metadata":{"files":{"readme":"README.md","changelog":null,"contributing":null,"funding":null,"license":null,"code_of_conduct":null,"threat_model":null,"audit":null,"citation":null,"codeowners":null,"security":null,"support":null,"governance":null,"roadmap":null,"authors":null,"dei":null,"publiccode":null,"codemeta":null}},"created_at":"2025-01-17T15:26:08.000Z","updated_at":"2025-03-15T17:15:31.000Z","dependencies_parsed_at":"2025-03-26T18:43:24.893Z","dependency_job_id":"3b77023b-e582-4d72-bb9e-41236c28b579","html_url":"https://github.com/Emmanuel10701/Selenium_scraping","commit_stats":null,"previous_names":["emmanuel10701/selenium_scraping"],"tags_count":0,"template":false,"template_full_name":null,"repository_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/Emmanuel10701%2FSelenium_scraping","tags_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/Emmanuel10701%2FSelenium_scraping/tags","releases_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/Emmanuel10701%2FSelenium_scraping/releases","manifests_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/Emmanuel10701%2FSelenium_scraping/manifests","owner_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/owners/Emmanuel10701","download_url":"https://codeload.github.com/Emmanuel10701/Selenium_scraping/tar.gz/refs/heads/main","host":{"name":"GitHub","url":"https://github.com","kind":"github","repositories_count":248650439,"owners_count":21139672,"icon_url":"https://github.com/github.png","version":null,"created_at":"2022-05-30T11:31:42.601Z","updated_at":"2022-07-04T15:15:14.044Z","host_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub","repositories_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories","repository_names_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repository_names","owners_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/owners"}},"keywords":["data-mining","data-science","python","selenium","selenium-webdriver","web-scraping"],"created_at":"2025-01-31T23:55:37.375Z","updated_at":"2025-08-22T10:43:31.606Z","avatar_url":"https://github.com/Emmanuel10701.png","language":"HTML","funding_links":[],"categories":[],"sub_categories":[],"readme":"# E-commerce Reviews Scraping\n\nThis Python project scrapes customer reviews from an e-commerce website (or a local HTML file) and saves the extracted data into both **CSV** and **Excel** formats. It uses libraries like **Selenium**, **Pandas**, and **openpyxl** to achieve this.\n\n## Features\n- Scrapes reviews, including customer ratings, comments, and review dates.\n- Saves the scraped data in both **CSV** and **Excel** formats.\n- Automates browser interactions to dynamically load pages for scraping.\n- Option to scrape multiple pages of reviews.\n- Visualizes the ratings of reviews in an area plot.\n- Added the collab file in jupyter format to classify grades of students.\n\n## Requirements\n\nBefore using this project, ensure that **Python** is installed on your machine, and the necessary libraries are set up.\n\n### 1. Python Installation:\n   - Make sure you have Python installed on your system. You can download it from [python.org](https://www.python.org/downloads/).\n   - After installation, verify by running the following command in your terminal:\n     ```bash\n     python --version\n     ```\n     or\n     ```bash\n     python3 --version\n     ```\n     This should print the Python version (e.g., `Python 3.x.x`).\n\n### 2. Library Installation:\n   The following Python libraries are required to run this project:\n   - **selenium**: For browser automation and dynamically interacting with web pages.\n   - **pandas**: For handling and saving the scraped data.\n   - **openpyxl**: For saving data to an Excel file.\n   - **numpy**: For data handling and numeric operations.\n   - **matplotlib**: For visualizing the ratings in an area plot.\n   - **webdriver-manager**: For managing the WebDriver for Selenium.\n\n   To install the required libraries, open your terminal and run the following command:\n   ```bash\n   pip install -r requirements.txt\n","project_url":"https://awesome.ecosyste.ms/api/v1/projects/github.com%2Femmanuel10701%2Fselenium_scraping","html_url":"https://awesome.ecosyste.ms/projects/github.com%2Femmanuel10701%2Fselenium_scraping","lists_url":"https://awesome.ecosyste.ms/api/v1/projects/github.com%2Femmanuel10701%2Fselenium_scraping/lists"}