Ecosyste.ms: Awesome

An open API service indexing awesome lists of open source software.

Awesome Lists | Featured Topics | Projects

https://github.com/pmbaumgartner/syntax-speaker-prediction

The tastiest machine learning project. Can we predict who is speaking for how long during an episode of the syntax.fm podcast?
https://github.com/pmbaumgartner/syntax-speaker-prediction

Last synced: about 2 months ago
JSON representation

The tastiest machine learning project. Can we predict who is speaking for how long during an episode of the syntax.fm podcast?

Awesome Lists containing this project

README

        

# Syntax Speaker Prediction

This repository contains notebooks used to train a model to predict the speaker (Scott or Wes) in one second cilps of their podcast, syntax.fm. There are accompanying files for using Prodigy to label the data required to build the model. There is a brief description of the purpose of each notebook as the first cell.

## The Results

**Total Podcast Time**: 3 days, 15 hours, 56 minutes, 39 seconds

**Wes**: 2 days, 1 hours, 51 minutes, 11 seconds

**Scott**: 1 days, 11 hours, 51 minutes, 45 seconds

**Non-speaking time (crosstalk, laughing, intros)**: 0 days, 2 hours, 13 minutes, 43 seconds

## Cumulative Speaking Time

![Cumulative Speaking Time](https://raw.githubusercontent.com/pmbaumgartner/syntax-speaker-prediction/master/speaking_time.png)

Each vertical line is the start of a new episode.

##

Helpful tools:

- [Prodigy](https://prodi.gy/) for data labeling
- [Pydub](http://pydub.com/) for slicing audio
- [Librosa](https://librosa.github.io/librosa/) for audio analysis
- [pandas](https://pandas.pydata.org/) for data manipulation
- [scikit-learn](https://scikit-learn.org/stable/index.html) for machine learning
- [chartify](https://github.com/spotify/chartify/) for plots