Ecosyste.ms: Awesome
An open API service indexing awesome lists of open source software.
https://github.com/veb-101/Data-Science-Projects
Collection of data science projects in Python
- Host: GitHub
- URL: https://github.com/veb-101/Data-Science-Projects
- Owner: veb-101
- Created: 2020-05-07T20:17:47.000Z (over 4 years ago)
- Default Branch: master
- Last Pushed: 2023-11-11T19:46:32.000Z (12 months ago)
- Last Synced: 2024-11-05T21:05:43.269Z (3 days ago)
- Language: Jupyter Notebook
- Size: 25.6 MB
- Stars: 1,716
- Watchers: 36
- Forks: 440
- Open Issues: 6
Metadata Files:
- Readme: README.md
Awesome Lists containing this project
- awesome-data-science-resources - Different project topics for learning purposes
- awesome-ai-data-github-repos - Data Science End-to-End Projects
README
# Data Science Projects
---
- This is a compiled list of different project topics for learning purposes
- The main purpose of this list is to get hands-on experience with different topics.

| Sr. No. | Project | Resource link | Completed |
| ------- | ------- | ------------- | --------- |
| | **Basic** | | |
| 1 | Sentiment Analysis | [Amazon reviews dataset](https://www.kaggle.com/anshulrai/cudnnlstm-implementation-93-7-accuracy)<br>[Amazon reviews dataset](https://www.kaggle.com/muonneutrino/sentiment-analysis-with-amazon-reviews)<br>[Twitter Sentiment analysis - Medium](https://towardsdatascience.com/creating-the-twitter-sentiment-analysis-program-in-python-with-naive-bayes-classification-672e5589a7ed)<br>[Twitter Sentiment analysis - analytics vidhya](https://www.analyticsvidhya.com/blog/2018/07/hands-on-sentiment-analysis-dataset-python/) | ☐<br>☐<br>☐<br>☐ |
| 2 | Fake News detection | [Detecting Fake News](https://data-flair.training/blogs/advanced-python-project-detecting-fake-news/) | ☐ |
| 3 | Detecting Parkinson's Disease | [Disease detection using XGBoost](https://data-flair.training/blogs/python-machine-learning-project-detecting-parkinson-disease/)<br>[pyimagesearch - Detecting Parkinson's Disease](https://www.pyimagesearch.com/2019/04/29/detecting-parkinsons-disease-with-opencv-computer-vision-and-the-spiral-wave-test/) | ☐<br>☐ |
| 4 | Color Detection | [OpenCV Project](https://data-flair.training/blogs/project-in-python-colour-detection/) | ☐ |
| 5 | Iris Data Set - Predict the class of the flower | [many - analytics vidhya](https://repl.it/@LakshayArora1/Iris-Dataset-Logistic-Regression) | ☑ |
| 6 | Loan Prediction - Predict if a loan will get approved or not. | [many - analytics vidhya](https://repl.it/@LakshayArora1/Logistic-Regression-Loan-Dataset) | ☐ |
| 7 | BigMart Sales Dataset - Predict the sales of a store. | [many - analytics vidhya](https://repl.it/@LakshayArora1/Linear-Regression) | ☐ |
| 8 | House Price Regression | [kaggle](https://www.kaggle.com/c/house-prices-advanced-regression-techniques/notebooks) | ☑ |
| 9 | Wine quality - Predict the quality of the wine. | [Kaggle kernel](https://www.kaggle.com/uciml/red-wine-quality-cortez-et-al-2009/kernels) | ☐ |
| 10 | Heights and Weights Dataset - Predict the height or weight of a person | [Study of height versus weight](https://www3.nd.edu/~steve/computing_with_data/2_Motivation/motivate_ht_wt.html) | ☐ |
| 11 | Email Classification | [youtube](https://www.youtube.com/watch?v=exHwwy9kVcg) | ☐ |
| 12 | Titanic dataset | [Comprehensive data exploration with Python - Kaggle](https://www.kaggle.com/pmarcelino/comprehensive-data-exploration-with-python)<br>[Titanic Data Science Solutions - Kaggle](https://www.kaggle.com/startupsci/titanic-data-science-solutions)<br>[Data ScienceTutorial for Beginners - Kaggle](https://www.kaggle.com/kanncaa1/data-sciencetutorial-for-beginners)<br>[Introduction to Ensembling/Stacking in Python - Kaggle](https://www.kaggle.com/arthurtok/introduction-to-ensembling-stacking-in-python)<br>[A Data Science Framework: To Achieve 99% Accuracy - Kaggle](https://www.kaggle.com/ldfreeman3/a-data-science-framework-to-achieve-99-accuracy)<br>[Stacked Regressions : Top 4% on LeaderBoard - Kaggle](https://www.kaggle.com/serigne/stacked-regressions-top-4-on-leaderboard)<br>[An Interactive Data Science Tutorial - Kaggle](https://www.kaggle.com/helgejo/an-interactive-data-science-tutorial)<br>[EDA To Prediction(DieTanic) - Kaggle](https://www.kaggle.com/ash316/eda-to-prediction-dietanic)<br>[Titanic: Machine Learning from Disaster - Kaggle](https://www.kaggle.com/c/titanic) | ☐<br>☐<br>☐<br>☐<br>☐<br>☐<br>☐<br>☐<br>☐ |
| | **Intermediate** | | |
| 1 | Speech Emotion Recognition | [Speech Emotion Recognition with librosa](https://data-flair.training/blogs/python-mini-project-speech-emotion-recognition/) | ☐ |
| 2 | Gender and Age Detection | [pyimagesearch - Age Detection with deep learning](https://www.pyimagesearch.com/2020/04/13/opencv-age-detection-with-deep-learning/)<br>[learnopencv - Gender & Age Classification using OpenCV Deep Learning](https://www.learnopencv.com/age-gender-classification-using-opencv-deep-learning-c-python/)<br>[DataFlair - Gender and Age Detection with OpenCV](https://data-flair.training/blogs/python-project-gender-age-detection/)<br>[analytics vidhya - Age Detection](https://www.analyticsvidhya.com/blog/2017/06/hands-on-with-deep-learning-solution-for-age-detection-practice-problem/) | ☐<br>☐<br>☐ |
| 3 | Driver Drowsiness detection | [Real-time facial landmark detection](https://www.pyimagesearch.com/2017/04/17/real-time-facial-landmark-detection-opencv-python-dlib/)<br>[Eye blink detection](https://www.pyimagesearch.com/2017/04/24/eye-blink-detection-opencv-python-dlib/)<br>[Drowsiness detection with OpenCV](https://www.pyimagesearch.com/2017/05/08/drowsiness-detection-opencv/)<br>[DataFlair - System with OpenCV & Keras](https://data-flair.training/blogs/python-project-driver-drowsiness-detection-system/) | ☐<br>☐<br>☐<br>☐ |
| 4 | Basic Chatbot | [chatbot using NLTK & Keras](https://data-flair.training/blogs/python-chatbot-project/) | ☐ |
| 5 | Handwritten Digit Recognition | [Handwritten Digit Recognition](https://data-flair.training/blogs/python-deep-learning-project-handwritten-digit-recognition/)<br>[kaggle - Digit Recognition-Tutorial (CNN)](https://www.kaggle.com/tarunkr/digit-recognition-tutorial-cnn-99-67-accuracy) | ☐<br>☐ |
| 6 | Black Friday Dataset - Predict purchase amount. | [github](https://github.com/rouseguy/BlackFridayDataHack) | ☐ |
| 7 | Trip History Dataset - Predict the class of user. | [analytics vidhya](https://www.analyticsvidhya.com/blog/2015/06/solution-kaggle-competition-bike-sharing-demand/) | ☐ |
| 8 | Song recommendation | [Medium - Simple song recommender system](https://towardsdatascience.com/how-to-build-a-simple-song-recommender-296fcbc8c85)<br>[Medium - A Simple Song Recommender System in Python](https://towardsdatascience.com/a-simple-song-recommender-system-in-python-tutorial-3e4c111198d6)<br>[Analytics India Magazine - Guide to song recommendation system](https://analyticsindiamag.com/beginners-guide-to-building-a-song-recommender-in-python/)<br>[Youtube - Building Recommender Systems Using Python](https://www.youtube.com/watch?v=39vJRxIPSxw)<br>[Youtube - Music Search and Recommendation from Millions of Songs](https://www.youtube.com/watch?v=RIW7jjurpPI) | ☐<br>☐<br>☐<br>☐<br>☐ |
| 9 | Handwritten Text Recognition | [Build a Handwritten Text Recognition System using TensorFlow](https://towardsdatascience.com/build-a-handwritten-text-recognition-system-using-tensorflow-2326a3487cd5) | ☐ |
| 10 | Sentiment analysis - IMDB movie review dataset | [tensorflow](https://www.tensorflow.org/tutorials/text/text_classification_rnn) | ☐ |
| 11 | Text generation - Shakespeare | [tensorflow](https://www.tensorflow.org/tutorials/text/text_generation) | ☐ |
| 12 | Sign language MNIST classification | [kaggle - Deep learning using sign language](https://www.kaggle.com/ranjeetjain3/deep-learning-using-sign-langugage#CNN-Model)<br>[kaggle - CNN using Keras](https://www.kaggle.com/madz2000/cnn-using-keras-99-7-accuracy) | ☐<br>☐ |
| | **Advanced** | | |
| 1 | Image Captioning | [Image Captioning with visual Attention](https://www.tensorflow.org/tutorials/text/image_captioning) | ☑ |
| 2 | Credit Card Fraud Detection | [geeksforgeeks - Credit Card Fraud Detection](https://www.geeksforgeeks.org/ml-credit-card-fraud-detection/)<br>[Kaggle kernels](https://www.kaggle.com/mlg-ulb/creditcardfraud/kernels)<br>[Google search results](https://www.google.com/search?q=Credit+Card+Fraud+Detection&rlz=1C1CHBF_enIN820IN820&oq=Credit+Card+Fraud+Detection&aqs=chrome..69i57j69i60&sourceid=chrome&ie=UTF-8) | ☐<br>☐<br>☐ |
| 3 | Recommendation System | [Recommendation Systems](https://engmrk.com/module-18-machine-learning-based-recommendation-systems/?utm_campaign=News&utm_medium=Community&utm_source=DataCamp.com)<br>[Recommendation System](https://technotipsondatascience.wordpress.com/2018/10/22/recommendation-system/)<br>[Medium - Movie recommendation](https://towardsdatascience.com/movie-recommender-system-part-1-7f126d2f90e2)<br>[kaggle - Movies Recommender System](https://www.kaggle.com/rounakbanik/movie-recommender-systems)<br>[Quick Guide to Build a Recommendation Engine in Python & R](https://www.analyticsvidhya.com/blog/2016/06/quick-guide-build-recommendation-engine-python/) | ☐<br>☐<br>☐<br>☐<br>☐ |
| 4 | Customer Segmentation | [Customer Segmentation by RFM clustering](https://towardsdatascience.com/data-driven-growth-with-python-part-2-customer-segmentation-5c019d150444)<br>[kaggle - Customer Segmentation](https://www.kaggle.com/fabiendaniel/customer-segmentation)<br>[Customer Segmentation by KMeans](https://towardsdatascience.com/customer-segmentation-with-machine-learning-a0ac8c3d4d84)<br>[KDnuggets - Beginner’s Guide to Customer Segmentation](https://www.kdnuggets.com/2017/03/yhat-beginner-guide-customer-segmentation.html)<br>[KDnuggets - Customer Segmentation Using K Means Clustering](https://www.kdnuggets.com/2019/11/customer-segmentation-using-k-means-clustering.html)<br>[Customer Segmentation: A Technical Guide](https://www.mktr.ai/applications-and-methods-in-data-science-customer-segmentation/) | ☐<br>☐<br>☐<br>☐<br>☐<br>☐ |
| 5 | Breast Cancer Classification | [pyimagesearch - Breast cancer classification with Keras and Deep Learning](https://www.pyimagesearch.com/2019/02/18/breast-cancer-classification-with-keras-and-deep-learning/)<br>[Dataflair - Breast Cancer Classification](https://data-flair.training/blogs/project-in-python-breast-cancer-classification/) | ☐<br>☐ |
| 6 | Traffic Signs Recognition | [pyimagesearch - Traffic Sign Classification with Keras and Deep Learning](https://www.pyimagesearch.com/2019/11/04/traffic-sign-classification-with-keras-and-deep-learning/) | ☑ |
| 7 | Urban Sound Classification | [Audio Data Analysis using Deep Learning](https://www.analyticsvidhya.com/blog/2017/08/audio-voice-processing-deep-learning/) | ☐ |
| 8 | Human Activity Recognition | [MLM - Deep Learning Models for Human Activity Recognition](https://machinelearningmastery.com/deep-learning-models-for-human-activity-recognition/)<br>[Human Activity Recognition with OpenCV and Deep Learning](https://www.pyimagesearch.com/2019/11/25/human-activity-recognition-with-opencv-and-deep-learning/) | ☐<br>☐ |
| 9 | Covid-19 | [pyimagesearch - Covid-19 in X-ray images](https://www.pyimagesearch.com/2020/03/16/detecting-covid-19-in-x-ray-images-with-keras-tensorflow-and-deep-learning/)<br>[pyimagesearch - Mask detection](https://www.pyimagesearch.com/2020/05/04/covid-19-face-mask-detector-with-opencv-keras-tensorflow-and-deep-learning/)<br>[rubiks-code - Detection of COVID-19 in chest X-Rays with Deep Learning](https://rubikscode.net/2020/03/23/detection-of-covid-19-in-chest-x-rays-with-deep-learning/) | ☐<br>☐<br>☐ |
| 10 | Video classification | [pyimagesearch](https://www.pyimagesearch.com/2019/07/15/video-classification-with-keras-and-deep-learning/) | ☐ |
| 11 | Fire and smoke detection | [pyimagesearch](https://www.pyimagesearch.com/2019/11/18/fire-and-smoke-detection-with-keras-and-deep-learning/) | ☐ |
| 12 | Detecting Natural Disasters | [pyimagesearch](https://www.pyimagesearch.com/2019/11/11/detecting-natural-disasters-with-keras-and-deep-learning/) | ☐ |
| 13 | Anonymizing faces | [pyimagesearch](https://www.pyimagesearch.com/2020/04/06/blur-and-anonymize-faces-with-opencv-and-python/) | ☐ |
| 14 | Text Summarization | [list of links](https://www.one-tab.com/page/at7XZn6iRsKSgpgNAqK0dw) | ☐ |
| 15 | Deep Dream and Style Transfer | [list of links](https://www.one-tab.com/page/RYc4BqXSRWOE_A1GIUAPYA) | ☐ |

## Additional resources
- [A curated list of applied machine learning and data science notebooks and libraries across different industries.](https://github.com/firmai/industry-machine-learning)
- [edyoda/data-science-complete-tutorial: For extensive instructor-led learning](https://github.com/edyoda/data-science-complete-tutorial)
- [Advanced Data Science - YouTube](https://www.youtube.com/playlist?list=PLegWUnz91Wftp1CsVFQaCgZAILUslEVhF)
- [Data ScienceTutorial for Beginners | Kaggle](https://www.kaggle.com/kanncaa1/data-sciencetutorial-for-beginners)
- [donnemartin/data-science-ipython-notebooks](https://github.com/donnemartin/data-science-ipython-notebooks)