https://github.com/arv-anshul/notebooks
My Jupyter notebooks in which I practice data science.
https://github.com/arv-anshul/notebooks
data-analysis data-science jupyter-notebook llm machine-learning marimo matplotlib regression transformers
Last synced: 2 months ago
JSON representation
My Jupyter notebooks in which I practice data science.
- Host: GitHub
- URL: https://github.com/arv-anshul/notebooks
- Owner: arv-anshul
- License: apache-2.0
- Created: 2023-08-15T06:49:37.000Z (almost 3 years ago)
- Default Branch: main
- Last Pushed: 2026-02-17T07:20:37.000Z (4 months ago)
- Last Synced: 2026-02-17T12:47:52.684Z (4 months ago)
- Topics: data-analysis, data-science, jupyter-notebook, llm, machine-learning, marimo, matplotlib, regression, transformers
- Language: Jupyter Notebook
- Homepage: https://arv-anshul.github.io/projects
- Size: 29.1 MB
- Stars: 0
- Watchers: 1
- Forks: 0
- Open Issues: 0
-
Metadata Files:
- Readme: README.md
- License: LICENSE
Awesome Lists containing this project
README
# Notebooks
Contains all my 📓 Notebooks where I have performed Data Analysis on unique datasets.
> \[!IMPORTANT\]
>
> See [`data/README.md`](data/README.md) to know about datasets I have used.
## Directory Info
### [`yt-watch-history`](yt-watch-history/)
Perform analysis on **YouTube Watch History** data _(exported via Google Takeout)_.
- > Previously using Pandas but switched to Polars as I started exploring it.
- Used `polars`'s amazing syntax to handle data, preprocess the text data and handle datetime data.
- Plot many graphs to show some amazing insights present in data.
- Build ML model to predict videos **"Content Type"** from its title.
- Build a **Channel Recommender System** which recommends similar channels from channel's videos' title and tags.
### [`spotify-analysis`](spotify-analysis/)
Perform analysis on **Spotify Streaming History** data _(exported via Spotify website)_.
- Analysed data from the perspective of Track, Artist, Album, Playlist and Time.
- Used `polars` builtin `plot` namespace (which uses `hvplot` library internally) to plot analysis graphs.
### [`credit-modeling`](credit-modeling/)
A project from **CampusX's free course on Credit Risk Modeling by Rohan Azad**.
- Collaborated with [@sambhavm22].
- Perform data analysis, build ML model using diffrent ML algorithms.
- Contains notebooks of mine and [@sambhavm22] both.
- Got many insights about banking sector.
- Created diagrams to explain the project workflows.
- [Credit Risk Modeling project documentation] in PDF format.
[Credit Risk Modeling project documentation]: credit-modeling/docs/DOCUMENTATION.pdf
[@sambhavm22]: https://github.com/sambhavm22
### [`election-2024`](election-2024/)
Created a dashboard using Streamlit which fetches data from ECI official website.
- Used `httpx` to fetch data asynchronously.
- Used `polars.LazyFrame` to manipulate data efficiently.
- Used `streamlit` to create dashboard.
Where is Notebooks?
There are no notebooks present in this project because I've converted those into `.py` scripts because I have to create
a dashboard using it and converted notebook's non-`async` codes into `async` code.
### [`extra`](extra/)
This directory contains extra notebooks which are independent of each others. Created these notebooks just for learning
or fun purpose.