https://github.com/djdhairya/uber-data-analytics


# Uber-Data-Analytics
## Introduction

This project performs data analytics on Uber data using Google Cloud Storage, Python, a Compute Engine instance, the Mage data pipeline tool, BigQuery, and Looker Studio.

## Architecture
![architecture](https://github.com/djdhairya/Uber-Data-Analytics/assets/99894946/50b08eef-e198-4211-8a63-ddb91eef101c)

## Technology Used
- Programming Language - Python

Google Cloud Platform
1. Google Storage
2. Compute Instance
3. BigQuery
4. Looker Studio

Modern Data Pipeline Tool - https://www.mage.ai/

Contribute to this open source project - https://github.com/mage-ai/mage-ai

## Dataset Used
TLC Trip Record Data
Yellow and green taxi trip records include fields capturing pick-up and drop-off dates/times, pick-up and drop-off locations, trip distances, itemized fares, rate types, payment types, and driver-reported passenger counts.

More information about the dataset can be found here:
1. Website - https://www.nyc.gov/site/tlc/about/tlc-trip-record-data.page
2. Data Dictionary - https://www.nyc.gov/assets/tlc/downloads/pdf/data_dictionary_trip_records_yellow.pdf
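
To make the record layout concrete, here is a minimal sketch of reading a few trip records with the Python standard library and deriving the trip duration. The column names are taken from the yellow taxi data dictionary linked above; the two sample rows are made up for illustration, and `parse_trip` and `load_trips` are hypothetical helper names, not code from this repository.

```python
import csv
import io
from datetime import datetime

# Made-up sample rows; column names follow the yellow taxi data dictionary.
SAMPLE_CSV = """VendorID,tpep_pickup_datetime,tpep_dropoff_datetime,passenger_count,trip_distance,RatecodeID,payment_type,fare_amount,total_amount
1,2016-03-01 00:00:00,2016-03-01 00:07:55,1,2.50,1,1,9.0,12.35
2,2016-03-01 00:10:00,2016-03-01 00:40:30,2,10.00,1,2,30.0,36.30
"""

TIME_FORMAT = "%Y-%m-%d %H:%M:%S"


def parse_trip(row):
    """Convert one CSV row (all strings) into typed Python values."""
    pickup = datetime.strptime(row["tpep_pickup_datetime"], TIME_FORMAT)
    dropoff = datetime.strptime(row["tpep_dropoff_datetime"], TIME_FORMAT)
    return {
        "pickup": pickup,
        "dropoff": dropoff,
        # Derived field: trip length in minutes.
        "duration_minutes": (dropoff - pickup).total_seconds() / 60,
        "passenger_count": int(row["passenger_count"]),
        "trip_distance": float(row["trip_distance"]),
        "payment_type": int(row["payment_type"]),
        "fare_amount": float(row["fare_amount"]),
    }


def load_trips(text):
    """Parse CSV text into a list of typed trip dictionaries."""
    return [parse_trip(row) for row in csv.DictReader(io.StringIO(text))]


trips = load_trips(SAMPLE_CSV)
```

With real data, the same functions could be pointed at a downloaded file by passing its contents to `load_trips`; for large monthly files a dataframe library would likely be a better fit.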

## Data Model
![data_model](https://github.com/djdhairya/Uber-Data-Analytics/assets/99894946/e3825473-91ec-4ad4-a528-d3b1c80fc903)
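
The diagram above is the authoritative data model. As a rough illustration only, the sketch below assumes a fact/dimension layout in which trips reference a datetime dimension through a surrogate key. The table and column names (`datetime_id`, `pickup_hour`, `fact_table`, and so on) and the sample trips are assumptions made for this example and may not match the diagram.

```python
from datetime import datetime


def build_datetime_dim(pickups):
    """Return (dimension rows, lookup from timestamp to surrogate key)."""
    dim, key_of = [], {}
    for ts in pickups:
        if ts in key_of:
            continue  # one dimension row per distinct timestamp
        key_of[ts] = len(dim)
        dim.append({
            "datetime_id": key_of[ts],
            "pickup_datetime": ts,
            "pickup_hour": ts.hour,
            "pickup_day": ts.day,
            "pickup_month": ts.month,
            "pickup_year": ts.year,
            "pickup_weekday": ts.weekday(),  # Monday is 0
        })
    return dim, key_of


def build_fact_rows(trips, key_of):
    """Replace each trip's timestamp with a reference to the dimension."""
    return [
        {
            "trip_id": i,
            "datetime_id": key_of[t["pickup"]],
            "fare_amount": t["fare_amount"],
        }
        for i, t in enumerate(trips)
    ]


# Made-up trips; the first and third share a pick-up timestamp.
trips = [
    {"pickup": datetime(2016, 3, 1, 0, 0), "fare_amount": 9.0},
    {"pickup": datetime(2016, 3, 1, 0, 10), "fare_amount": 30.0},
    {"pickup": datetime(2016, 3, 1, 0, 0), "fare_amount": 5.5},
]
datetime_dim, key_of = build_datetime_dim(t["pickup"] for t in trips)
fact_table = build_fact_rows(trips, key_of)
```

Keeping measures such as the fare in the fact rows and descriptive attributes in the dimension lets a query group fares by hour or weekday with a single join on `datetime_id`.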

## Complete Video Tutorial
Video Link - https://youtu.be/WpQECq5Hx9g