Ecosyste.ms: Awesome
An open API service indexing awesome lists of open source software.
https://github.com/syedsajjadaskari/end-to-end-chicago-taxi-tip-prediction-with-bigquery-and-vertex-ai
An end-to-end example of Chicago taxi on Google Cloud using TensorFlow, TFX, and Vertex AI
https://github.com/syedsajjadaskari/end-to-end-chicago-taxi-tip-prediction-with-bigquery-and-vertex-ai
bigquery gcp tensorflow tfx vertex-ai
Last synced: 1 day ago
JSON representation
An end-to-end example of Chicago taxi on Google Cloud using TensorFlow, TFX, and Vertex AI
- Host: GitHub
- URL: https://github.com/syedsajjadaskari/end-to-end-chicago-taxi-tip-prediction-with-bigquery-and-vertex-ai
- Owner: syedsajjadaskari
- License: mit
- Created: 2023-09-29T10:05:28.000Z (about 1 year ago)
- Default Branch: main
- Last Pushed: 2023-10-17T05:12:50.000Z (about 1 year ago)
- Last Synced: 2024-04-17T22:09:20.003Z (7 months ago)
- Topics: bigquery, gcp, tensorflow, tfx, vertex-ai
- Language: Shell
- Homepage:
- Size: 4.88 KB
- Stars: 0
- Watchers: 1
- Forks: 0
- Open Issues: 0
-
Metadata Files:
- Readme: README.md
- License: LICENSE
Awesome Lists containing this project
README
# End-To-End-Chicago-Taxi-Tip-Prediction-with-BigQuery-and-Vertex-AI
This repository contains a complete end-to-end MLOps pipeline for predicting taxi tips in Chicago using Vertex AI and Smart Analytics. The pipeline uses Keras to implement the machine learning model, TFX to implement the training pipeline, and Model Builder SDK to interact with Vertex AI. The Chicago Taxi Trips dataset is a public dataset hosted on BigQuery that includes taxi trips from 2013 to the present.
The pipeline consists of the following steps:
1. **Data preparation:** The pipeline first loads the Chicago Taxi Trips dataset from BigQuery and cleans and preprocesses the data.
2. **Feature engineering:** The pipeline then creates new features from the existing data that may be useful for predicting taxi tips.
3. **Model training:** The pipeline then trains a machine learning model using the preprocessed data and the engineered features.
4. **Model evaluation:** The pipeline then evaluates the trained model on a held-out test set to assess its performance.
5. **Model deployment:** Once the model is trained and evaluated, the pipeline deploys it to Vertex AI as an endpoint.
6. **Model monitoring:** The pipeline then monitors the deployed model to ensure that it is performing as expected.The repository includes all of the code and configuration files necessary to run the pipeline. To get started, simply clone the repository and follow the instructions in the README file.
## Dataset description
https://pantheon.corp.google.com/marketplace/details/city-of-chicago-public-data/chicago-taxi-tripsThe Chicago Taxi Trips dataset is one of public datasets hosted with BigQuery, which includes taxi trips from 2013 to the present, reported to the City of Chicago in its role as a regulatory agency. The task is to predict whether a given trip will result in a tip > 20%.
## Getting started
1. Setup your MLOps environment on Google Cloud.2. Start your AI Notebook instance.
3. Open the JupyterLab then open a new Terminal
4. Clone the repository to your AI Notebook instance:
```
cd https://github.com/syedsajjadaskari/End-To-End-Chicago-Taxi-Tip-Prediction-with-BigQuery-and-Vertex-AI
cd End-To-End-Chicago-Taxi-Tip-Prediction-with-BigQuery-and-Vertex-AI
```6. Install the required Python packages:
```
pip install -r requirements.txt
```