{"id":30770384,"url":"https://github.com/myselfabk5/commodity_price_extraction","last_synced_at":"2026-05-14T20:33:08.479Z","repository":{"id":309798247,"uuid":"1032951676","full_name":"myselfabk5/commodity_price_extraction","owner":"myselfabk5","description":"Extracting Commodity Prices","archived":false,"fork":false,"pushed_at":"2025-08-13T20:27:59.000Z","size":10,"stargazers_count":0,"open_issues_count":0,"forks_count":0,"subscribers_count":0,"default_branch":"main","last_synced_at":"2025-09-04T23:15:20.828Z","etag":null,"topics":["commodityprices","selenium","selenium-python","selenium-webdriver","webscraping"],"latest_commit_sha":null,"homepage":"","language":"Jupyter Notebook","has_issues":true,"has_wiki":null,"has_pages":null,"mirror_url":null,"source_name":null,"license":null,"status":null,"scm":"git","pull_requests_enabled":true,"icon_url":"https://github.com/myselfabk5.png","metadata":{"files":{"readme":"README.md","changelog":null,"contributing":null,"funding":null,"license":null,"code_of_conduct":null,"threat_model":null,"audit":null,"citation":null,"codeowners":null,"security":null,"support":null,"governance":null,"roadmap":null,"authors":null,"dei":null,"publiccode":null,"codemeta":null,"zenodo":null}},"created_at":"2025-08-06T04:50:10.000Z","updated_at":"2025-08-13T20:48:10.000Z","dependencies_parsed_at":"2025-08-13T22:21:27.331Z","dependency_job_id":"5c428a71-fe84-46dc-9834-240a2cf891ea","html_url":"https://github.com/myselfabk5/commodity_price_extraction","commit_stats":null,"previous_names":["myselfabk5/commodity_price_extraction"],"tags_count":0,"template":false,"template_full_name":null,"purl":"pkg:github/myselfabk5/commodity_price_extraction","repository_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/myselfabk5%2Fcommodity_price_extraction","tags_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/myselfabk5%2Fcommodity_price_extraction/tags","releases_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories
/myselfabk5%2Fcommodity_price_extraction/releases","manifests_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/myselfabk5%2Fcommodity_price_extraction/manifests","owner_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/owners/myselfabk5","download_url":"https://codeload.github.com/myselfabk5/commodity_price_extraction/tar.gz/refs/heads/main","sbom_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/myselfabk5%2Fcommodity_price_extraction/sbom","scorecard":null,"host":{"name":"GitHub","url":"https://github.com","kind":"github","repositories_count":286080680,"owners_count":33042194,"icon_url":"https://github.com/github.png","version":null,"created_at":"2022-05-30T11:31:42.601Z","updated_at":"2026-05-13T13:14:54.681Z","status":"online","status_checked_at":"2026-05-14T02:00:06.663Z","response_time":57,"last_error":null,"robots_txt_status":"success","robots_txt_updated_at":"2025-07-24T06:49:26.215Z","robots_txt_url":"https://github.com/robots.txt","online":true,"can_crawl_api":true,"host_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub","repositories_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories","repository_names_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repository_names","owners_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/owners"}},"keywords":["commodityprices","selenium","selenium-python","selenium-webdriver","webscraping"],"created_at":"2025-09-04T23:04:21.730Z","updated_at":"2026-05-14T20:33:08.462Z","avatar_url":"https://github.com/myselfabk5.png","language":"Jupyter Notebook","funding_links":[],"categories":[],"sub_categories":[],"readme":"## \u003cb\u003e Description \u003c/b\u003e\n\n**Purpose** – The function web_price_data_scrapping() scrapes daily commodity price data for multiple dates from the official FCA Info Web portal.\n\n**Inputs** – Accepts a list of dates (list_of_dates) in the format DD/MM/YYYY for which price reports need to be extracted.\n\n**Selenium Setup** – Uses selenium 
with Chrome WebDriver in headless mode to automate browser actions without opening a visible browser window.\n\n**Website Navigation** – Programmatically selects the “Price Report” option, chooses “Daily Prices” report type, enters the given date, and triggers the Get Data button.\n\n**Dynamic Content Handling** – Uses WebDriverWait and expected_conditions to ensure that page elements (radio buttons, dropdowns, input fields, tables) are fully loaded before interacting.\n\n**Table Extraction** – Locates the HTML table (id=\"gv0\") containing price data, retrieves its HTML, and parses it with BeautifulSoup.\n\n**Data Cleaning** – Extracts table headers (\u003cth\u003e) and row data (\u003ctr\u003e + \u003ctd\u003e), creates a Pandas DataFrame, and filters only relevant commodity columns.\n\n**Date Annotation** – Adds a Date column to tag each row with the corresponding report date for easier analysis later.\n\n**Multiple Dates** – Loops through all given dates, scraping and appending each day’s data into a single output DataFrame.\n\n**Return Value** – Returns the combined DataFrame containing state/UT-wise daily prices for commodities such as Rice, Wheat, Pulses, Edible Oils, Sugar, Vegetables, etc.\n","project_url":"https://awesome.ecosyste.ms/api/v1/projects/github.com%2Fmyselfabk5%2Fcommodity_price_extraction","html_url":"https://awesome.ecosyste.ms/projects/github.com%2Fmyselfabk5%2Fcommodity_price_extraction","lists_url":"https://awesome.ecosyste.ms/api/v1/projects/github.com%2Fmyselfabk5%2Fcommodity_price_extraction/lists"}