Ecosyste.ms: Awesome

An open API service indexing awesome lists of open source software.

Awesome Lists | Featured Topics | Projects

https://github.com/nagpritam/identification-of-trucks-and-potential-risky-driver-using-databricks-spark-api-

The project intended to identify trucks based on their model, fuel consumption, driving behaviors and past records of violations/accidents
https://github.com/nagpritam/identification-of-trucks-and-potential-risky-driver-using-databricks-spark-api-

databricks hadoop hive powerbi python3 spark

Last synced: 4 months ago
JSON representation

The project intended to identify trucks based on their model, fuel consumption, driving behaviors and past records of violations/accidents

Awesome Lists containing this project

README

        

# Project Title

## ANT SAFE WHEELS

### Ensuring Safety and Compliance at ANT

#### Project Introduction

Accidents caused by large trucks are a significant concern in the United States. Az National Trucking (ANT) aims to mitigate risks associated with trucking operations by identifying drivers exhibiting high-risk behaviors. The organization seeks to ensure compliance with regulations, enhance safety measures, and reduce insurance risks.

#### Objective

Identify drivers with risk factors of 7.0 or higher to trigger alerts to management and insurance and analyze trucking data to visualize driver risk levels for informed decision-making.
Strengthen adherence to FMCSA regulations and internal policies to elevate safety standards across the fleet.
### Project Architectural Diagram

![Project Architectural Diagram](https://github.com/Nagpritam/Big-Data-Project/blob/master/architecture.png)

* This Project utilises Spark Via Databricks Notebooks, Hadoop file format, Hive Tables and Power BI for Data Visualization

### Spark Transformations

![Spark Transformations](https://github.com/Nagpritam/Big-Data-Project/blob/master/spark-transformation.png)

### Analysis

![Count of Models](https://github.com/Nagpritam/Big-Data-Project/blob/master/countofmodels.png)

#### 1. Count of Models (Bubble Chart)
* Displays the number of vehicles for each model.
* Larger bubbles represent a higher count of vehicles.
* Ford has the largest bubble, indicating it has the most

![MPG by Model](https://github.com/Nagpritam/Big-Data-Project/blob/master/MPGbymodel.png)

#### 2. MPG by Model (Heat Map)
* Shows the miles per gallon for each vehicle model.
* Darker shades represent higher MPG values.
* Crane appears to have the highest MPG, as indicated by its dark shade.

![Risk Factor By Model](https://github.com/Nagpritam/Big-Data-Project/blob/master/Riskfactorbymodel.png)

### 3. Risk Factor By Model (Bar Graph):
* Depicts the risk factor associated with each vehicle model.
* Oshkosh has the highest risk factor while Navistar has the least
* The risk factors are relatively similar across models, ranging around 7 to just under 10.

![Violation Analysis](https://github.com/Nagpritam/Big-Data-Project/blob/master/violations.png)

### 4. Violation Analysis (Pie Chart):
* Lane Departure: The most frequent violation with 152 occurrences, indicating potential issues with driver attention or fatigue.
* Unsafe Following Distance: Nearly as common, with 150 instances, pointing to risks of rear-end collisions due to inadequate spacing between vehicles.
* Overspeed and Unsafe Tail Distance: Speeding is noted in 90 instances, a widespread issue, while unsafe tail distance, though less frequent at 65 cases, still poses a significant risk in poor driving conditions.

![Risky drivers](https://github.com/Nagpritam/Big-Data-Project/blob/master/riskydrivers.png)

### 5. Top 10 Risky drivers
Namely A97 Driver is the most unsafe driver

![Location of Violations](https://github.com/Nagpritam/Big-Data-Project/blob/master/Maps_violations.png)

### 6. Location of Violations
Geographical Spread of Violations: Violations are widespread across California, with concentrations in urban areas like San Francisco and San Diego.
* Types of Violations: The map indicates two main types of violations: overspeed and unsafe following distance, with unsafe following distance being less common