{"id":23611129,"url":"https://github.com/lazycatcoder/autoriadataparser","last_synced_at":"2026-04-29T15:03:11.270Z","repository":{"id":269626422,"uuid":"908028109","full_name":"lazycatcoder/AutoRiaDataParser","owner":"lazycatcoder","description":"Python parser extracting data about cars and sellers from the AutoRia website","archived":false,"fork":false,"pushed_at":"2025-04-01T12:26:30.000Z","size":23,"stargazers_count":1,"open_issues_count":1,"forks_count":0,"subscribers_count":1,"default_branch":"main","last_synced_at":"2025-06-10T00:42:52.316Z","etag":null,"topics":["autoria","autoriaparser","autoriatool","beautifulsoup","carscraper","parsing","python","pythonscraper","selenium","webscraping"],"latest_commit_sha":null,"homepage":"","language":"Python","has_issues":true,"has_wiki":null,"has_pages":null,"mirror_url":null,"source_name":null,"license":"gpl-3.0","status":null,"scm":"git","pull_requests_enabled":true,"icon_url":"https://github.com/lazycatcoder.png","metadata":{"files":{"readme":"README.md","changelog":null,"contributing":null,"funding":null,"license":"LICENSE","code_of_conduct":null,"threat_model":null,"audit":null,"citation":null,"codeowners":null,"security":null,"support":null,"governance":null,"roadmap":null,"authors":null,"dei":null,"publiccode":null,"codemeta":null,"zenodo":null}},"created_at":"2024-12-24T23:11:36.000Z","updated_at":"2025-04-01T12:26:34.000Z","dependencies_parsed_at":"2024-12-25T00:24:43.957Z","dependency_job_id":"7cc9a201-3e33-4ad5-a31e-d55e690974c7","html_url":"https://github.com/lazycatcoder/AutoRiaDataParser","commit_stats":null,"previous_names":["lazycatcoder/autoriadataparser"],"tags_count":0,"template":false,"template_full_name":null,"purl":"pkg:github/lazycatcoder/AutoRiaDataParser","repository_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/lazycatcoder%2FAutoRiaDataParser","tags_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/lazycatcoder%2FAutoRiaDataParser/tags","releases_url":"https:
//repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/lazycatcoder%2FAutoRiaDataParser/releases","manifests_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/lazycatcoder%2FAutoRiaDataParser/manifests","owner_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/owners/lazycatcoder","download_url":"https://codeload.github.com/lazycatcoder/AutoRiaDataParser/tar.gz/refs/heads/main","sbom_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/lazycatcoder%2FAutoRiaDataParser/sbom","scorecard":null,"host":{"name":"GitHub","url":"https://github.com","kind":"github","repositories_count":286080680,"owners_count":32430803,"icon_url":"https://github.com/github.png","version":null,"created_at":"2022-05-30T11:31:42.601Z","updated_at":"2026-04-29T13:34:34.882Z","status":"ssl_error","status_checked_at":"2026-04-29T13:34:29.830Z","response_time":110,"last_error":"SSL_read: unexpected eof while reading","robots_txt_status":"success","robots_txt_updated_at":"2025-07-24T06:49:26.215Z","robots_txt_url":"https://github.com/robots.txt","online":false,"can_crawl_api":true,"host_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub","repositories_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories","repository_names_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repository_names","owners_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/owners"}},"keywords":["autoria","autoriaparser","autoriatool","beautifulsoup","carscraper","parsing","python","pythonscraper","selenium","webscraping"],"created_at":"2024-12-27T16:13:21.870Z","updated_at":"2026-04-29T15:03:11.262Z","avatar_url":"https://github.com/lazycatcoder.png","language":"Python","funding_links":[],"categories":[],"sub_categories":[],"readme":"\u003cdiv align=\"center\"\u003e\n  \u003ch1\u003eAutoRiaDataParser\u003c/h1\u003e\n\n\u003cdiv align=\"justify\"\u003e\nThe script automatically collects and processes data necessary for analyzing data about cars and sellers posted on the website 
\u003ca href=\"https://auto.ria.com/\"\u003e AutoRia\u003c/a\u003e.\n\n\u003cbr\u003e\n\n📌 **Operating principle:**\n\nPage analysis and data collection are performed using *Selenium* and *BeautifulSoup*. Links are collected and checked for uniqueness relative to the current date, after which they are saved to the database, excluding duplicates. Each unique URL is processed to extract information about cars and sellers. The data is saved to the database and updated as needed.\n\n📌 **Advantages of use:**\n\n• *Automation:* Eliminates the need for manual analysis of a large number of pages and ads.  \n• *Time saving:* Fast data collection using Selenium and BeautifulSoup.  \n• *Flexibility:* The script can be configured to work with any section of the AutoRia website and any search parameters by substituting the appropriate link.  \n• *Data relevance:* Information can be updated daily, and old records can be compared against new ones.  \n• *Analytics:* The collected data allows for an in-depth analysis of the car market.  \n\n\u003c/div\u003e\n\n\u003cdiv align=\"center\"\u003e\n\n   ## 💡Practical use\n\n\u003c/div\u003e\n\n\u003cdiv align=\"justify\"\u003e\n\n• *Market Analysis:* Identifying market prices, popular models, average car specifications.  \n• *Marketing:* Helping sellers set competitive prices.  \n• *Research:* Using data to study market trends.  \n• *App Development:* Integrating data into car comparison apps.  \n\n\u003c/div\u003e\n\n\u003cbr\u003e\n\n\u003cdiv align=\"center\"\u003e\n\n   # Settings\n\n\u003c/div\u003e\n\n\u003cdiv align=\"left\"\u003e\n\n1. Clone this repository:\n\n   ```\n      git clone https://github.com/lazycatcoder/AutoRiaDataParser.git\n   ```\n\n\n2. Install dependencies:\n   \n   ```\n      pip install -r requirements.txt\n   ```\n\n\n3. 
Download *ChromeDriver* for your version of the Chrome browser from the \u003ca href=\"https://developer.chrome.com/docs/chromedriver/downloads\"\u003e official website\u003c/a\u003e. Specify the path to the driver in the **driver** variable:\n\n   ```\n     driver = webdriver.Chrome(executable_path=r\"D:\\chromedriver.exe\", options=options)\n   ```\n\n\n4. Set the URL to parse in the **base_url** variable:   \n\n   ```\n     base_url = \"https://auto.ria.com/search/...\"\n   ```\n\n\n5. In the **DB_PATH** variable, specify the path to the database file: \n   ```\n      DB_PATH = r\"D:\\AR_data.db\"\n   ```\n\n\u003c/div\u003e\n\n\u003cbr\u003e\n\n### 🔴 Additional Information\n\u003cdiv align=\"justify\"\u003e\n\nThis repository also contains the script *\"CSV_Converter.py\"*, which converts the database contents to a CSV file. Before running it, make sure that the **DB_PATH** variable points to the location of your database. Upon successful execution, the CSV file is created in the same directory as the database.\n\n\u003c/div\u003e\n","project_url":"https://awesome.ecosyste.ms/api/v1/projects/github.com%2Flazycatcoder%2Fautoriadataparser","html_url":"https://awesome.ecosyste.ms/projects/github.com%2Flazycatcoder%2Fautoriadataparser","lists_url":"https://awesome.ecosyste.ms/api/v1/projects/github.com%2Flazycatcoder%2Fautoriadataparser/lists"}