https://github.com/nikolatechie/spotify-playlist
Data pipeline that fetches recently played songs in the past 24 hours using Spotify API and saves the data in the SQLite database. Scheduled to run daily using Apache Airflow.
https://github.com/nikolatechie/spotify-playlist
apache-airflow api data data-engineering python spotify sql sqlite
Last synced: 10 months ago
JSON representation
Data pipeline that fetches recently played songs in the past 24 hours using Spotify API and saves the data in the SQLite database. Scheduled to run daily using Apache Airflow.
- Host: GitHub
- URL: https://github.com/nikolatechie/spotify-playlist
- Owner: nikolatechie
- Created: 2021-07-26T16:20:09.000Z (over 4 years ago)
- Default Branch: main
- Last Pushed: 2021-07-26T18:12:50.000Z (over 4 years ago)
- Last Synced: 2025-02-09T08:40:29.809Z (12 months ago)
- Topics: apache-airflow, api, data, data-engineering, python, spotify, sql, sqlite
- Language: Python
- Homepage:
- Size: 4.88 KB
- Stars: 0
- Watchers: 1
- Forks: 0
- Open Issues: 0
-
Metadata Files:
- Readme: README.md
Awesome Lists containing this project
README
# Spotify-Playlist
Covered the ETL process by using Python to create a data pipeline that daily downloads songs (used Apache Airflow) played in the past 24 hours (accessing to the playlist done by using Spotify API), transforms the data into appropriate format and saves the playlist in the SQLite database.
Useful for finding out things such as the most/least played song, favorite band, favorite genre etc.