{"id":24837260,"url":"https://github.com/mr-mudgal/amazon-scrapper","last_synced_at":"2025-03-26T04:14:04.126Z","repository":{"id":183365644,"uuid":"670013903","full_name":"mr-mudgal/Amazon-Scrapper","owner":"mr-mudgal","description":"This Python-based Amazon Scraper is designed to efficiently extract detailed product data from Amazon's product pages. The tool leverages powerful libraries like BeautifulSoup4 and csv, along with the Scrapingant API to simulate browser behavior and bypass Amazon’s anti-scraping algorithms.","archived":false,"fork":false,"pushed_at":"2025-01-15T19:03:54.000Z","size":18,"stargazers_count":0,"open_issues_count":0,"forks_count":0,"subscribers_count":1,"default_branch":"main","last_synced_at":"2025-01-31T05:53:38.546Z","etag":null,"topics":["amazon","csv","csv-export","python3","scraper","scraping","scraping-framework","scraping-websites","web-scraper","web-scraping","web-scraping-python","web-scraping-software","web-scrapping"],"latest_commit_sha":null,"homepage":"","language":"Python","has_issues":true,"has_wiki":null,"has_pages":null,"mirror_url":null,"source_name":null,"license":null,"status":null,"scm":"git","pull_requests_enabled":true,"icon_url":"https://github.com/mr-mudgal.png","metadata":{"files":{"readme":"README.md","changelog":null,"contributing":null,"funding":null,"license":null,"code_of_conduct":null,"threat_model":null,"audit":null,"citation":null,"codeowners":null,"security":null,"support":null,"governance":null,"roadmap":null,"authors":null,"dei":null,"publiccode":null,"codemeta":null}},"created_at":"2023-07-24T05:30:56.000Z","updated_at":"2025-01-15T19:03:55.000Z","dependencies_parsed_at":"2025-01-15T20:29:57.235Z","dependency_job_id":"11536caa-eb1b-45c8-bc96-2fa51f74baee","html_url":"https://github.com/mr-mudgal/Amazon-Scrapper","commit_stats":null,"previous_names":["mr-mudgal/amazon-scrapper"],"tags_count":0,"template":false,"template_full_name":null,"repository_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/mr-mudgal%2FAmazon-Scrapper","tags_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/mr-mudgal%2FAmazon-Scrapper/tags","releases_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/mr-mudgal%2FAmazon-Scrapper/releases","manifests_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/mr-mudgal%2FAmazon-Scrapper/manifests","owner_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/owners/mr-mudgal","download_url":"https://codeload.github.com/mr-mudgal/Amazon-Scrapper/tar.gz/refs/heads/main","host":{"name":"GitHub","url":"https://github.com","kind":"github","repositories_count":245585813,"owners_count":20639671,"icon_url":"https://github.com/github.png","version":null,"created_at":"2022-05-30T11:31:42.601Z","updated_at":"2022-07-04T15:15:14.044Z","host_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub","repositories_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories","repository_names_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repository_names","owners_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/owners"}},"keywords":["amazon","csv","csv-export","python3","scraper","scraping","scraping-framework","scraping-websites","web-scraper","web-scraping","web-scraping-python","web-scraping-software","web-scrapping"],"created_at":"2025-01-31T05:53:41.121Z","updated_at":"2025-03-26T04:14:04.107Z","avatar_url":"https://github.com/mr-mudgal.png","language":"Python","funding_links":[],"categories":[],"sub_categories":[],"readme":"# Amazon-Scrapper\n\nThis is an Amazon Scrapper using Python, using scrapingant_client, and BeautifulSoup4 and csv modules in order to request the product pages, and extract the data from them respectively.\nData extracted are:\n* Product URL\n* Product Name\n* Product Price\n* Product Description\n* Ratings\n* Number of Reviews\n* ASIN\n* Manufacturer\n\nIt extract 20 product listing pages. It then extract each single product page.\n\nInitailly the data is stored in a dictionary variable, and then it is converted, and written into a file in CSV format.\n\nIt uses Scrapingant API as amazon have algorithm to detect the scrapping script, and makes its service unavailable in response to a scrapping script. API help us to simulate a real browser behaviour, hence bypassing the amazon algorithms, and allowing us to extract the data.\n\nThe whole software is divided into two parts, the first part extract products from product listing page, with their url, name, price, rating and number of review detail.\nIn part 2, we extract every single product page, to obtain its description, asin and manufacturer.\n\nOutput File:\n\u003cimg width=\"1280\" src=\"https://github.com/user-attachments/assets/36be71a8-606e-4c85-ac25-aacd4eeb1f1e\" /\u003e\n","project_url":"https://awesome.ecosyste.ms/api/v1/projects/github.com%2Fmr-mudgal%2Famazon-scrapper","html_url":"https://awesome.ecosyste.ms/projects/github.com%2Fmr-mudgal%2Famazon-scrapper","lists_url":"https://awesome.ecosyste.ms/api/v1/projects/github.com%2Fmr-mudgal%2Famazon-scrapper/lists"}