An open API service indexing awesome lists of open source software.

https://github.com/kumaranand05/imdb-data-scraper

Java Selenium based scraper to collect all media details from IMDb website.
https://github.com/kumaranand05/imdb-data-scraper

imdb java scraping selenium

Last synced: 2 months ago
JSON representation

Java Selenium based scraper to collect all media details from IMDb website.

Awesome Lists containing this project

README

          

# IMDb Data Scraper

By Anand Kumar

## Features
* **Entire Database Collection** : Extracts all the media data stored on IMDb website.
* **CSV Output** : Extracts the data in CSV format.

# Instructions
After you have downloaded the project files. Follow the instructions below to setup your machine to make code functional.
## Downloading/Installing dependencies
Of course you need [JDK](https://www.oracle.com/in/java/technologies/downloads/) and an IDE like [VSCode](https://code.visualstudio.com), [IntelliJ](https://www.jetbrains.com/idea/) etc. installed on your machine.

> Download Dependencies

Open the project in your IDE and connect to internet. Using the pom.xml file present in the project directory, download the dependencies by performing specific actions for the IDE that you are using.

## Dos and Don'ts
> Do's

+ You can use your machine during the process.
+ You can keep the browser and IDE in background.

> Don'ts

+ Do not click on any element of the webpage as it can lead to termination of the code.
+ Do not use console during the process.
+ Do not turn off internet or close the automated browser session.