Ecosyste.ms: Awesome

An open API service indexing awesome lists of open source software.

Awesome Lists | Featured Topics | Projects

https://github.com/anish202020/big-data-analysis-mini-project

The primary purpose of this code is to predict wind speed using an LSTM model. The model is trained on historical wind speed data to forecast future values.
https://github.com/anish202020/big-data-analysis-mini-project

keras matplotlib-pyplot numpy pandas sklearn

Last synced: about 2 months ago
JSON representation

The primary purpose of this code is to predict wind speed using an LSTM model. The model is trained on historical wind speed data to forecast future values.

Awesome Lists containing this project

README

        

# LSTM Wind Speed Prediction Documentation

[![Google Colab](https://img.shields.io/badge/Google%20Colab-F9AB00?style=for-the-badge&logo=googlecolab&logoColor=white)](https://colab.research.google.com/)
[![NumPy](https://img.shields.io/badge/NumPy-013243?style=for-the-badge&logo=numpy&logoColor=white)](https://numpy.org/)
[![Pandas](https://img.shields.io/badge/Pandas-150458?style=for-the-badge&logo=pandas&logoColor=white)](https://pandas.pydata.org/)
[![Matplotlib](https://img.shields.io/badge/Matplotlib-003B57?style=for-the-badge&logo=matplotlib&logoColor=white)](https://matplotlib.org/)
[![Keras](https://img.shields.io/badge/Keras-D00000?style=for-the-badge&logo=keras&logoColor=white)](https://keras.io/)
[![TensorFlow](https://img.shields.io/badge/TensorFlow-FF6F20?style=for-the-badge&logo=tensorflow&logoColor=white)](https://www.tensorflow.org/)
[![Scikit-learn](https://img.shields.io/badge/Scikit--learn-F7931E?style=for-the-badge&logo=scikit-learn&logoColor=white)](https://scikit-learn.org/)

## Overview

This document provides an overview and explanation of the code designed to train an LSTM (Long Short-Term Memory) model to predict wind speed based on historical data.

## My Project
This project includes a detailed report available in PDF format and Google Colab Execution.

[![Download PDF](https://img.shields.io/badge/Download-PDF-blue?style=flat)](https://github.com/Anish202020/Big-Data-Analysis-Mini-Project/blob/main/Big%20Data%20Mini%20Project-2.0.pdf)
[![Open in Google Colab](https://img.shields.io/badge/Open%20in-Google%20Colab-blue?style=flat&logo=googlecolab)](https://colab.research.google.com/drive/1zSBZl_BOLwNUjKmZMQ2twBDiA8qImwV8?usp=sharing)

## Table of Contents
1. [Introduction](https://#introduction)
2. [Data Requirements](https://#data-requirements)
3. [Code Explanation](https://#code-explanation)
- [Data Loading and Preparation](https://#data-loading-and-preparation)
- [Feature Creation](https://#feature-creation)
- [Data Scaling](https://#data-scaling)
- [Model Training](https://#model-training)
- [Prediction and Visualization](https://#prediction-and-visualization)
4. [Output](https://#output)
5. [Conclusion](https://#conclusion)
## Introduction
The primary purpose of this code is to predict wind speed using an LSTM model. The model is trained on historical wind speed data to forecast future values.

## Data Requirements
- **Input CSV File Structure**:
- **Station ID**: Unique identifier for the weather station.
- **Location**: Geographical location of the station.
- **Date**: Date of the recorded wind speed.
- **Wind Speed**: Recorded wind speed (in km/h or mph).
## Code Explanation
### Data Loading and Preparation
- The code reads the CSV file using Pandas.
- It extracts the wind speed values from the 4th column.
- A plot is generated to visualize wind speed variations over time.
### Feature Creation
- Three input features (X1, X2, X3) are created by shifting the wind speed data.
- Each prediction is based on the wind speeds of the previous three days.
### Data Scaling
- Both input features and target values are scaled to a range between 0 and 1 using MinMaxScaler.
### Model Training
- An LSTM model is defined using the Sequential API.
- The model is trained on 80% of the data for 25 epochs.
- The loss for each epoch is displayed during training.
### Prediction and Visualization
- The model makes predictions on the test set (20% of data).
- Two plots are generated:
- **Scatter Plot**: Compares actual vs. predicted wind speed values.
- **Line Plot**: Shows actual and predicted wind speed over time.
## Output
- **Scatter Plot**: Visualizes the accuracy of predictions against actual values.
- **Line Plot**: Illustrates how well the model captures trends and fluctuations in wind speed.
## Conclusion
The final output allows for a visual assessment of the LSTM model's performance in predicting wind speeds based on historical data. A successful model will have predictions closely following actual values in the line plot and points clustering around a diagonal line in the scatter plot.