Ecosyste.ms: Awesome
An open API service indexing awesome lists of open source software.
https://github.com/algosup/2022-project-artificial-intelligence-group-e
https://github.com/algosup/2022-project-artificial-intelligence-group-e
Last synced: about 9 hours ago
JSON representation
- Host: GitHub
- URL: https://github.com/algosup/2022-project-artificial-intelligence-group-e
- Owner: algosup
- Created: 2022-04-28T11:47:29.000Z (over 2 years ago)
- Default Branch: main
- Last Pushed: 2022-06-27T10:17:05.000Z (over 2 years ago)
- Last Synced: 2024-11-13T19:11:54.190Z (3 days ago)
- Language: Jupyter Notebook
- Size: 1.18 MB
- Stars: 0
- Watchers: 0
- Forks: 1
- Open Issues: 0
-
Metadata Files:
- Readme: README.md
Awesome Lists containing this project
README
# 2022-Project-Artificial-Intelligence-Group-E
# The goal of the project
The goal of the project is to build a device that can listen to conversations (in project rooms for instance) and detect when English is spoken or when French is spoken. The device could emit a sound or display different colors based on the language.
# What we used
To perform the project we used a Raspberry Pi 4 as well as a microphone connected to it to listen to the conversations in order to predict the language spoken thanks to the AI stored on it.
In order to create the [Machine Learning](https://en.wikipedia.org/wiki/Machine_learning), we used [Google Colab](https://colab.research.google.com/?utm_source=scs-index) to be able to retrieve and process the audio data and turn it into spectrograms so that we could train the AI. Here is the link to our [Google Colab](https://colab.research.google.com/drive/106yJAkhgMO-FBhq3uEyNG9jtAzCzUVf6?usp=sharing) notebook.
> Our model is a Binary classification model, which means that it will predict whether the language spoken is English or French by the way of 0 or 1.
> 0 is for English and 1 is for French.
To be able to get the audio data, we use the Mozilla Common Voice Dataset by the way of [Kaggle](https://www.kaggle.com/datasets/mozillaorg/common-voice) for the English dataset as well as the
Common Voice Corpus 9.0 still from the [Mozilla Common Voice Dataset](https://commonvoice.mozilla.org/fr/datasets) for the French dataset.To be able to store the datasets and the spectrograms, we used [Google Drive](https://drive.google.com/drive/u/2/my-drive) to store the files.
# Link to the other Google Colab notebooks
We used other Google Collab notebooks to perform the data preprocessing:
Notebook of the EN data preprocessing [here](https://colab.research.google.com/drive/1PfovKMU0Hb0WhUldsybmvaXD1vCEkBz9?usp=sharing)
Notebook of the FR data preprocessing [here](https://colab.research.google.com/drive/19v3hLpmHs5z-1vMLICxcxRSMHJg1fGxD?usp=sharing)
# Link to the documents
[Functional Specification](https://github.com/algosup/2022-Project-Artificial-Intelligence-Group-E/blob/documents/Documents/Functional_Specifications.md)
[Technical Specification](https://github.com/algosup/2022-Project-Artificial-Intelligence-Group-E/blob/documents/Documents/Technical_Specifications.md)