Ecosyste.ms: Awesome
An open API service indexing awesome lists of open source software.
https://github.com/wirthual/mldatacollection
Repository for data collection used in Machine Learning Course at UPC in spring semester 2017
https://github.com/wirthual/mldatacollection
delay flight machine-learning spring-semester upc weather-data
Last synced: about 2 months ago
JSON representation
Repository for data collection used in Machine Learning Course at UPC in spring semester 2017
- Host: GitHub
- URL: https://github.com/wirthual/mldatacollection
- Owner: wirthual
- Created: 2017-05-02T14:53:45.000Z (over 7 years ago)
- Default Branch: master
- Last Pushed: 2017-05-13T16:06:49.000Z (over 7 years ago)
- Last Synced: 2024-10-15T03:52:02.240Z (3 months ago)
- Topics: delay, flight, machine-learning, spring-semester, upc, weather-data
- Language: Python
- Size: 27.8 MB
- Stars: 0
- Watchers: 2
- Forks: 0
- Open Issues: 0
-
Metadata Files:
- Readme: README.md
Awesome Lists containing this project
README
# MLDataCollection
Repository for data collection used in Machine Learning Course at UPC in spring semester 2017# Description
As machine learning project in the spring semester at UPC we need to implement a project and write a report about it.
The basis for our project: https://github.com/limcheekin/r-flight-delay-predictionWe want to improve the existing project with weather data, similar to what is described in this paper:
http://ieeexplore.ieee.org/document/7777956/?reload=trueThe data for the delay of the airplanes we get from here:
https://www.transtats.bts.gov/DL_SelectFields.asp?Table_ID=236&DB_Short_Name=On-TimeThis python script is intendet to enrich the data about delayed planes by weather data which we want to collect from here:
https://developer.worldweatheronline.com/api/historical-weather-api.aspx (If they give us more then 500 api calls per day ;))If you want to use it you need to add a file "weatherAPI.key" in the data folder with your API-key from worldweatheronline.
In data you find the original data from bts, the full dataset as well as one with one line and 20 lines for test purposes.
Output is generated in data folder named raw_with_weather.csv.
Here you should see the following additional fields:
TEMPC,
HUMIDITY,
WINDSPEEDKMH,
WINDDIRDEGREE,
WEATHERCODE,
VISIBILITY,
CLOUDCOVER,
PRESSUREDescription over the fields can be found here: https://developer.worldweatheronline.com/api/docs/historical-weather-api.aspx