{"id":15198878,"url":"https://github.com/pb319/scrap_with_selenium","last_synced_at":"2026-03-08T12:34:57.556Z","repository":{"id":254517748,"uuid":"846785590","full_name":"pb319/Scrap_with_Selenium","owner":"pb319","description":"Let's dive deeper into the domain of web scraping using Selenium.","archived":false,"fork":false,"pushed_at":"2024-08-25T07:23:40.000Z","size":598,"stargazers_count":0,"open_issues_count":0,"forks_count":0,"subscribers_count":1,"default_branch":"main","last_synced_at":"2025-02-01T15:03:50.452Z","etag":null,"topics":["beautifulsoup","pandas","pandas-dataframe","python","python-script","selenium"],"latest_commit_sha":null,"homepage":"","language":"HTML","has_issues":true,"has_wiki":null,"has_pages":null,"mirror_url":null,"source_name":null,"license":null,"status":null,"scm":"git","pull_requests_enabled":true,"icon_url":"https://github.com/pb319.png","metadata":{"files":{"readme":"README.md","changelog":null,"contributing":null,"funding":null,"license":null,"code_of_conduct":null,"threat_model":null,"audit":null,"citation":null,"codeowners":null,"security":null,"support":null,"governance":null,"roadmap":null,"authors":null,"dei":null,"publiccode":null,"codemeta":null}},"created_at":"2024-08-24T00:40:44.000Z","updated_at":"2024-08-25T07:23:43.000Z","dependencies_parsed_at":"2024-08-24T01:44:21.086Z","dependency_job_id":null,"html_url":"https://github.com/pb319/Scrap_with_Selenium","commit_stats":null,"previous_names":["pb319/scrap_with_selenium"],"tags_count":0,"template":false,"template_full_name":null,"repository_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/pb319%2FScrap_with_Selenium","tags_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/pb319%2FScrap_with_Selenium/tags","releases_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/pb319%2FScrap_with_Selenium/releases","manifests_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/pb319%2FScrap_with_Selenium/manifests","owner_ur
l":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/owners/pb319","download_url":"https://codeload.github.com/pb319/Scrap_with_Selenium/tar.gz/refs/heads/main","host":{"name":"GitHub","url":"https://github.com","kind":"github","repositories_count":238645926,"owners_count":19506926,"icon_url":"https://github.com/github.png","version":null,"created_at":"2022-05-30T11:31:42.601Z","updated_at":"2022-07-04T15:15:14.044Z","host_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub","repositories_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories","repository_names_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repository_names","owners_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/owners"}},"keywords":["beautifulsoup","pandas","pandas-dataframe","python","python-script","selenium"],"created_at":"2024-09-28T01:43:01.847Z","updated_at":"2025-10-28T11:31:56.681Z","avatar_url":"https://github.com/pb319.png","language":"HTML","funding_links":[],"categories":[],"sub_categories":[],"readme":"# Scrap_with_Selenium\nLet's dive deeper into the domain of web scraping using Selenium. 
This repository uses the browser-automation tool Selenium to scrape web pages, then uses Beautiful Soup to parse the saved HTML files and extract specific elements of interest.\n\n### Table of Contents\n-  [Resources](https://github.com/pb319/Scrap_with_Selenium#resource) \n-  [Objective](https://github.com/pb319/Scrap_with_Selenium#objective)\n-  [Approach](https://github.com/pb319/Scrap_with_Selenium#approach)\n-  [Output Files](https://github.com/pb319/Scrap_with_Selenium#output-files)\n\n#### Resource:\n- YouTube Video Link: [Click Here](https://www.youtube.com/watch?v=XI5_nsClCYI\u0026t=197s)\n- Tech Stack: `Selenium`, `Beautiful Soup`, `Pandas`\n- Selenium Getting Started: [Selenium](https://selenium-python.readthedocs.io/getting-started.html)\n- Beautiful Soup: [Beautiful Soup](https://beautiful-soup-4.readthedocs.io/en/latest/#quick-start)\n\n#### Objective:\n- Create a database of laptops available on `amazon.in`.\n\n#### Approach:\n- Save the HTML of each search-results page to the local machine, one page at a time, for all available pages.\n- Extract multiple elements (`title, price, link`) from the saved HTML files.\n- Finally, export the extracted data as a CSV file.\n\n#### Output Files:\n-  [Python Script](https://github.com/pb319/Scrap_with_Selenium/blob/main/collect.py)\n-  [HTML Files](https://github.com/pb319/Scrap_with_Selenium/tree/main/Data)\n-  [CSV File](https://github.com/pb319/Scrap_with_Selenium/blob/main/data.csv)\n","project_url":"https://awesome.ecosyste.ms/api/v1/projects/github.com%2Fpb319%2Fscrap_with_selenium","html_url":"https://awesome.ecosyste.ms/projects/github.com%2Fpb319%2Fscrap_with_selenium","lists_url":"https://awesome.ecosyste.ms/api/v1/projects/github.com%2Fpb319%2Fscrap_with_selenium/lists"}