https://github.com/allenjue/webscraper
A WebScraper that grabs stock data and exports it to a CSV file
https://github.com/allenjue/webscraper
csv-export htmlunit webscraping
Last synced: 2 months ago
JSON representation
A WebScraper that grabs stock data and exports it to a CSV file
- Host: GitHub
- URL: https://github.com/allenjue/webscraper
- Owner: AllenJue
- Created: 2022-08-18T02:40:40.000Z (almost 3 years ago)
- Default Branch: master
- Last Pushed: 2022-08-21T22:36:45.000Z (almost 3 years ago)
- Last Synced: 2025-01-21T11:33:58.328Z (4 months ago)
- Topics: csv-export, htmlunit, webscraping
- Language: Java
- Homepage:
- Size: 69.3 KB
- Stars: 0
- Watchers: 1
- Forks: 0
- Open Issues: 0
-
Metadata Files:
- Readme: README.md
Awesome Lists containing this project
README
# webScraper
## Introduction
This webScraper is an introductory webScraper project that gathers share information from
[Fortune 500 companies](https://markets.businessinsider.com/index/components/s&p_500?p=1) and exports the data into a folder of csv files.## Features
The webScraper is currently a precursor to machine learning models that learn from csv datasets. It can currently only scrape info from [this site](https://markets.businessinsider.com/index/components/s&p_500?p=1).
Csv files will not be recreated if they already exist, and the files are stored in the users //User/Documents/Shared_Data directory.## Technologies
* Java 8 or higher
* HTMLUnit
* OpenCSV
* JUnit