https://github.com/allenjue/webscraper

A WebScraper that grabs stock data and exports it to a CSV file

csv-export htmlunit webscraping


# webScraper

## Introduction
This webScraper is an introductory web scraping project that gathers share information for
[S&P 500 companies](https://markets.businessinsider.com/index/components/s&p_500?p=1) and exports the data to a folder of CSV files.

## Features
The webScraper is intended as a precursor to machine learning models that learn from CSV datasets. It can currently scrape data only from [this site](https://markets.businessinsider.com/index/components/s&p_500?p=1).
CSV files are not recreated if they already exist, and they are stored in the user's //User/Documents/Shared_Data directory.
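
The "do not recreate existing files" behavior could be sketched roughly as follows. This is a minimal illustration and not the project's actual code: the class name `CsvExportSketch`, the method `writeIfAbsent`, the file name, and the placeholder rows are all hypothetical, and plain `java.nio` string joining stands in for OpenCSV so the example has no dependencies.

```java
import java.io.IOException;
import java.nio.charset.StandardCharsets;
import java.nio.file.Files;
import java.nio.file.Path;
import java.util.Arrays;
import java.util.List;

// Hypothetical helper, not taken from the repository.
class CsvExportSketch {

    /**
     * Writes the rows to dir/fileName unless that file already exists.
     * Returns true if a new file was written, false if the write was skipped.
     */
    static boolean writeIfAbsent(Path dir, String fileName, List<String[]> rows)
            throws IOException {
        Files.createDirectories(dir);          // no-op if the directory already exists
        Path target = dir.resolve(fileName);
        if (Files.exists(target)) {
            return false;                      // keep the existing CSV untouched
        }
        StringBuilder csv = new StringBuilder();
        for (String[] row : rows) {
            // Naive join: assumes no commas or quotes inside fields.
            // A CSV library such as OpenCSV would handle escaping.
            csv.append(String.join(",", row)).append('\n');
        }
        Files.write(target, csv.toString().getBytes(StandardCharsets.UTF_8));
        return true;
    }

    public static void main(String[] args) throws IOException {
        // A temp directory stands in for the Shared_Data directory.
        Path dir = Files.createTempDirectory("shared_data_demo");
        // Placeholder rows, not real market data.
        List<String[]> rows = Arrays.asList(
                new String[] {"name", "price"},
                new String[] {"EXAMPLE", "0.00"});

        System.out.println(writeIfAbsent(dir, "example.csv", rows)); // first call writes
        System.out.println(writeIfAbsent(dir, "example.csv", rows)); // second call skips
    }
}
```

Returning a boolean lets the caller log or count skipped files without treating an existing file as an error.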

## Technologies
* Java 8 or higher
* HtmlUnit
* OpenCSV
* JUnit