Ecosyste.ms: Awesome
An open API service indexing awesome lists of open source software.
https://github.com/houarizegai/web-scraping
Code samples of web scraping using Java.
https://github.com/houarizegai/web-scraping
java jsoup jsoup-example jsoup-library scraping web-scraping web-scraping-java webscraping
Last synced: about 2 months ago
JSON representation
Code samples of web scraping using Java.
- Host: GitHub
- URL: https://github.com/houarizegai/web-scraping
- Owner: HouariZegai
- License: mit
- Created: 2020-04-28T23:44:54.000Z (over 4 years ago)
- Default Branch: master
- Last Pushed: 2022-09-29T11:15:11.000Z (over 2 years ago)
- Last Synced: 2023-03-04T21:07:18.460Z (almost 2 years ago)
- Topics: java, jsoup, jsoup-example, jsoup-library, scraping, web-scraping, web-scraping-java, webscraping
- Language: Java
- Homepage:
- Size: 10.7 KB
- Stars: 10
- Watchers: 1
- Forks: 6
- Open Issues: 0
-
Metadata Files:
- Readme: README.md
- License: LICENSE
Awesome Lists containing this project
README
# Web Scraping :mag: :bar_chart:
Code samples of scraping data from web pages using **Java** & **JSoup** Library[![License MIT](https://img.shields.io/badge/license-MIT-blue.svg)](https://raw.githubusercontent.com/HouariZegai/PrayerTimes/master/LICENSE)
## What?
Web Scraping is a web data extraction, is the process of retrieving or βscrapingβ data from a website. Uses intelligent automation to retrieve millions of data points from the internet.
We can use the extracted data in Machine Learning, Data Science, Data Analysis, ...ect).## Samples
* [Amazon (Best Sellers Kindle Books)](src/main/java/com/houarizegai/webscraping/amazon)
* [IMDB (Top 250)](src/main/java/com/houarizegai/webscraping/imdb)**Note:** I will add more examples in the few next days
## Installation :electric_plug:
1. Download the repository files (project) from the download section or clone this project by typing in the bash the following command:git clone https://github.com/HouariZegai/WebScraping.git
2. Import it in Intellij IDEA or any other Java IDE and let Maven download the required dependencies for you.
3. Run the application :D## Contributing π‘
If you want to contribute to this project and make it better with new ideas, your pull request is very welcomed.
If you find any issue just put it in the repository issue section, thank you.