https://github.com/zaman-hamza/citadel-datathon
My submission to the 2022 East Coast Datathon. The event started on the 21st of March and ended on the 28th, lasting about a whole week. I was in a team of two where we analyzed the non-conventional indicators and instigators of traffic.
https://github.com/zaman-hamza/citadel-datathon
citadel data-science data-visualization datathon
Last synced: 7 months ago
JSON representation
My submission to the 2022 East Coast Datathon. The event started on the 21st of March and ended on the 28th, lasting about a whole week. I was in a team of two where we analyzed the non-conventional indicators and instigators of traffic.
- Host: GitHub
- URL: https://github.com/zaman-hamza/citadel-datathon
- Owner: zaman-hamza
- Created: 2022-03-28T23:35:34.000Z (over 3 years ago)
- Default Branch: main
- Last Pushed: 2025-03-30T22:48:54.000Z (7 months ago)
- Last Synced: 2025-04-10T22:58:27.652Z (7 months ago)
- Topics: citadel, data-science, data-visualization, datathon
- Language: Jupyter Notebook
- Homepage:
- Size: 1.79 MB
- Stars: 5
- Watchers: 1
- Forks: 1
- Open Issues: 0
-
Metadata Files:
- Readme: README.md
Awesome Lists containing this project
README
[
](https://muhammadzaman.tech/Certificates/Datathon%20Certificate%20of%20Achievement.pdf)
# Citadel Datathon
The East-Coast Dataopen hosted by Citadel and Correlation One is an invite-only Datathon featuring university students across the east coast. The event started on the 21st of March and ended on the 28th, lasting about a whole week. I was in a team of two where we analyzed the non-conventional indicators and instigators of traffic. We posed the following question: "How would investments in businesses and education affect traffic and road safety in major American cities?"
In this report, we analyzed datasets and later created recommendations for which non-conventional areas municipal governments should invest in to reduce traffic congestion. The three cities of focus were New York, NY, Austin, TX, and Washington, DC. We analyzed publicly available data on 311 calls, building permits, business licenses, population education level and traffic congestion statistics.
## Getting Started
Our final submission is in the corresponding directory, but the project has been split up to allow for better readability. The data directory includes all the datasets that were used in this report. The Jupyter Notebooks can be downloaded and run if the datasets are put in the proper directories as indicated on the first cell of each notebook, where the data is loaded.
### Prerequisites
- Download Anaconda to run the file
- Download pandas to manipulate and analyze the data
- Download plotly and seaborn to visualize the graphs for the data### Installing
Anaconda can be downloaded off the following link: [https://docs.anaconda.com/anaconda/install/windows/](https://docs.anaconda.com/anaconda/install/windows/)
To install seaborn, run any of the following commands in the command prompt:
```
> pip install seaborn> conda install seaborn
```To install plotly, run any of the following commands in the command prompt:
```
> conda install -c plotly plotly> conda install -c plotly/label/test plotly
```Pandas already comes installed with Anaconda.
## Deployment
1. Open Anaconda, then Jupyter Notebook
2. Open the file and run the code whilst ensuring that the datafiles are in the correct directory## Built With
* [Pandas](https://pandas.pydata.org/) - Tool used for data analysis
* [Plotly](https://plotly.com/) - One of the visualization frameworks used
* [Seaborn](https://seaborn.pydata.org/) - The other visualization framework used## Authors
* **Hamza Zaman** - *Co-Author* - [zaman-hamza](https://github.com/zaman-hamza)
* **William Harkless** - *Co-Author*## License
This project is licensed under the MIT License - see the [LICENSE.md](LICENSE.md) file for details