{"id":18611900,"url":"https://github.com/houarizegai/web-scraping","last_synced_at":"2025-04-10T23:31:02.478Z","repository":{"id":47563488,"uuid":"259777585","full_name":"HouariZegai/web-scraping","owner":"HouariZegai","description":"Code samples of web scraping using Java.","archived":false,"fork":false,"pushed_at":"2022-09-29T11:15:11.000Z","size":11,"stargazers_count":15,"open_issues_count":0,"forks_count":8,"subscribers_count":1,"default_branch":"master","last_synced_at":"2025-03-25T06:09:50.654Z","etag":null,"topics":["java","jsoup","jsoup-example","jsoup-library","scraping","web-scraping","web-scraping-java","webscraping"],"latest_commit_sha":null,"homepage":"","language":"Java","has_issues":true,"has_wiki":null,"has_pages":null,"mirror_url":null,"source_name":null,"license":"mit","status":null,"scm":"git","pull_requests_enabled":true,"icon_url":"https://github.com/HouariZegai.png","metadata":{"files":{"readme":"README.md","changelog":null,"contributing":null,"funding":null,"license":"LICENSE","code_of_conduct":null,"threat_model":null,"audit":null,"citation":null,"codeowners":null,"security":null,"support":null}},"created_at":"2020-04-28T23:44:54.000Z","updated_at":"2024-07-05T19:51:30.000Z","dependencies_parsed_at":"2023-01-18T17:35:01.175Z","dependency_job_id":null,"html_url":"https://github.com/HouariZegai/web-scraping","commit_stats":null,"previous_names":[],"tags_count":0,"template":false,"template_full_name":null,"repository_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/HouariZegai%2Fweb-scraping","tags_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/HouariZegai%2Fweb-scraping/tags","releases_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/HouariZegai%2Fweb-scraping/releases","manifests_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/HouariZegai%2Fweb-scraping/manifests","owner_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/owners/HouariZegai","download_url":"https://codeload.github.com/HouariZegai/web-scraping/tar.gz/refs/heads/master","host":{"name":"GitHub","url":"https://github.com","kind":"github","repositories_count":248315969,"owners_count":21083359,"icon_url":"https://github.com/github.png","version":null,"created_at":"2022-05-30T11:31:42.601Z","updated_at":"2022-07-04T15:15:14.044Z","host_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub","repositories_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories","repository_names_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repository_names","owners_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/owners"}},"keywords":["java","jsoup","jsoup-example","jsoup-library","scraping","web-scraping","web-scraping-java","webscraping"],"created_at":"2024-11-07T03:15:17.782Z","updated_at":"2025-04-10T23:31:02.049Z","avatar_url":"https://github.com/HouariZegai.png","language":"Java","funding_links":[],"categories":[],"sub_categories":[],"readme":"# Web Scraping :mag: :bar_chart:\nCode samples of scraping data from web pages using **Java** \u0026 **JSoup** Library\n\n[![License MIT](https://img.shields.io/badge/license-MIT-blue.svg)](https://raw.githubusercontent.com/HouariZegai/PrayerTimes/master/LICENSE)\n\n## What?\nWeb Scraping is a web data extraction, is the process of retrieving or “scraping” data from a website. Uses intelligent automation to retrieve millions of data points from the internet.  \nWe can use the extracted data in Machine Learning, Data Science, Data Analysis, ...ect).\n\n## Samples\n* [Amazon (Best Sellers Kindle Books)](src/main/java/com/houarizegai/webscraping/amazon)\n* [IMDB (Top 250)](src/main/java/com/houarizegai/webscraping/imdb)\n\n**Note:** I will add more examples in the few next days\n\n## Installation :electric_plug:\n1. Download the repository files (project) from the download section or clone this project by typing in the bash the following command:\n\n       git clone https://github.com/HouariZegai/WebScraping.git\n2. Import it in Intellij IDEA or any other Java IDE and let Maven download the required dependencies for you.\n3. Run the application :D\n\n## Contributing 💡\nIf you want to contribute to this project and make it better with new ideas, your pull request is very welcomed.\nIf you find any issue just put it in the repository issue section, thank you.\n","project_url":"https://awesome.ecosyste.ms/api/v1/projects/github.com%2Fhouarizegai%2Fweb-scraping","html_url":"https://awesome.ecosyste.ms/projects/github.com%2Fhouarizegai%2Fweb-scraping","lists_url":"https://awesome.ecosyste.ms/api/v1/projects/github.com%2Fhouarizegai%2Fweb-scraping/lists"}