Ecosyste.ms: Awesome

An open API service indexing awesome lists of open source software.

Awesome Lists | Featured Topics | Projects

scikit-learn

scikit-learn is a widely-used Python module for classic machine learning. It is built on top of SciPy.

https://github.com/troublem1/mle

MultiLabel-Transformer(MLE) is an extended version of a LabelEncoder, such that, it encodes multiple categorical columns to numeric in any workflow or pipeline

packages python3 scikit-learn sklearn

Last synced: 29 Jan 2025

https://github.com/ishanoshada/matplot3dex

A Matplotlib 3D Extension package for enhanced data visualization

data data-science matplotlib python-packages scikit-learn

Last synced: 29 Jan 2025

https://github.com/nafisalawalidris/logistic-regression-model-for-breast-cancer-recurrence-prediction

Predicting Breast Cancer Recurrence - A logistic regression model using patient attributes to classify recurrence risk. Dataset analysis and model evaluation. Contributions welcome.

breast-cancer classification-model data-analysis data-science healthcare logistic-regression machine-learning python recurrence-prediction scikit-learn

Last synced: 23 Jan 2025

https://github.com/gauravsingh9356/machine_learning

All my practical learning work involved in MACHINE LEARNING (Data Processing to Deep Learning)

deep-learning jupyter-notebook machine-learning machine-learning-algorithms nlp-machine-learning python scikit-learn

Last synced: 02 Jan 2025

https://github.com/joshi-jyoti/heart-disease-prediction

This repository contains a Python-based project for predicting the likelihood of heart disease using a Logistic Regression machine learning model. It leverages a dataset of patient medical information to train and evaluate the model, providing insights into potential diagnoses.🩺

heart-disease-prediction heart-disease-predictor kaggle-dataset machine-learning numpy pandas python scikit-learn

Last synced: 04 Feb 2025

https://github.com/aurelienmorgan/french_text_sentiment

Sentiment Analysis in texts written in French language using Tensorflow/Keras (and using XGBoost for hyperparameters optimization)

beautifulsoup dask fasttext french gru hyperparameters-optimization jupyter-notebook keras multiprocessing nlp python rnn scikit-learn sentiment-analysis tensorflow transfer-learning web-scraping xgboost

Last synced: 15 Dec 2024

https://github.com/nemeslaszlo/sale-price-of-bulldozers

The goal of predicting the sale price of bulldozers. How well can we predict the future sale price of a bulldozer, given its characteristics previous examples of how much similar bulldozers have been sold for? (Archive kaggle competition)

matplotlib numpy pandas random-forest-regressor regression scikit-learn seaborn

Last synced: 29 Jan 2025

https://github.com/nemeslaszlo/heart-disease

Heart disease classification project with different models (LogisticRegression, KNeighboursClassifier, RandomForestClassifier) and detailed reports.

classification knearest-neighbor-classifier logistic-regression mathplotlib numpy pandas randomforest-classification scikit-learn seaborn

Last synced: 29 Jan 2025

https://github.com/fohlen/stats-experiment

A tiny stats experiment with GENESIS data

matplotlib python3 scikit-learn

Last synced: 23 Jan 2025

https://github.com/canayter/unsupervised-machine-learning

Utilizing Python and unsupervised learning to predict if cryptocurrencies are affected by 24-hour or 7-day price changes.

k-means-clustering python scikit-learn unsupervised-machine-learning

Last synced: 09 Jan 2025

https://github.com/bala-1409/foreign-exchange-rate-time-series-data-science-project

This project will use time series analysis to forecast the exchange rate between the euro and the US dollar. The project will use a variety of statistical techniques, such as ARIMA to model the data and forecast the exchange rate.

data-analysis data-science data-visualization datapreprocessing eda exploratory-data-analysis forecasting machine-learning-algorithms model modelfitting predictive-modeling python3 scikit-learn statsmodels time-series time-series-analysis

Last synced: 27 Jan 2025

https://github.com/shubhranpara/heart-disease-predictor

I have created this project as my Python term assignment. In this project I have trained a ML model to predict the heart disease using Scikit-learn library in python.

google-colab jupyter-notebook machine-learning medical prediction-model python scikit-learn

Last synced: 02 Jan 2025

https://github.com/abz4375/recommendersystem

A sophisticated recommender system that leverages web mining techniques to help users find hotels that match their preferences.

cosine-similarity css html javascript pandas python scikit-learn selenium selenium-webdriver

Last synced: 06 Feb 2025

https://github.com/anoopgeorge418/my-learnings

Learning Topics Related Data-Science and Web-Development From Complete Scratch.

bokeh bootstrap css folium html matplotlib numpy pandas plotly python scikit-learn seaborn

Last synced: 07 Jan 2025

https://github.com/hayatoy/gcpml-notebook

Dockerfile with Jupyter Machine Learning environment plus Google Cloud SDK

dockerfile google-cloud-platform jupyter scikit-learn tensorflow

Last synced: 02 Jan 2025

https://github.com/kr1shnasomani/greendetect

Detection of plant diseases using CNN (ResNet50 architecture)

computer-vision deep-learning keras neural-network numpy opencv pandas scikit-learn seaborn tensorflow

Last synced: 19 Dec 2024

https://github.com/vidhi1290/text-classification-model-with-attention-mechanism-nlp

This Python project utilizes PyTorch to perform text classification with an attention mechanism. Pre-trained GloVe embeddings are processed for word representation, and a custom attention model is trained on consumer complaint data to categorize complaints into product categories.🎯

attention-mechanism deeplearning machine-learning nlp nltk numpy pandas python pytorch scikit-learn text-classification tqdm

Last synced: 02 Feb 2025

https://github.com/g-eoj/cv-tl-keras

Use the cross validation functions from scikit-learn to evaluate image classification transfer learning with Keras models.

cross-validation keras numpy scikit-learn transfer-learning

Last synced: 21 Jan 2025

https://github.com/docsallover/spam-detection

Building a Spam Filter with Python: Using Machine Learning to Combat Spam

datascience flask jinja2 machine-learning numpy numpy-library pandas pandas-python python python3 scikit-learn

Last synced: 29 Jan 2025

https://github.com/chrislemke/scikit-tabtrans

TabTransformer ready for Scikit learn 🧑‍🔬

deep-learning machine-learning python scikit-learn transformer

Last synced: 09 Nov 2024

https://github.com/idaraabasiudoh/knn-customer-classification

Labels telecommunication customer base to respective groups to determine service type required for each customer.

data-analysis jupyter-notebook machine-learning pyhton3 scikit-learn

Last synced: 14 Jan 2025

https://github.com/ansh-info/industrial-scale-penicillin-simulation

Optimizing industrial-scale penicillin production using machine learning and data analysis.

jupyter-notebook machine-learning matplotlib numpy pandas python scikit-learn

Last synced: 28 Jan 2025

https://github.com/dinhanhx/determination

Scripts to set global random for some machine learning framework

determination deterministic keras pytorch randomness scikit-learn tensorflow2

Last synced: 28 Jan 2025

https://github.com/m-rishab/research-paper-recommendation

This project aims to build a research paper recommendation system. Given a paper title as input, the system provides the top 5 recommended research papers.

deep-learning flask llm nlp numpy pandas python scikit-learn sentence-transformers tensorflow torch transformers

Last synced: 22 Jan 2025

https://github.com/george-gca/ai_papers_search_tool

Automatic paper clustering and search tool by fastext from Facebook Research

fasttext fasttext-embeddings fasttext-python nlp python scikit-learn

Last synced: 14 Jan 2025

https://github.com/kostasereksonas/ids_test

Code for intrusion detection system based on "Intrusion Detection System Using Machine Learning Algorithms" tutorial on Geeksforgeeks and Intrusion Detection on NSL KDD Github repository.

ids intrusion-detection intrusion-detection-system nsl-kdd-dataset numpy pandas python scikit-learn tensorflow

Last synced: 16 Jan 2025

https://github.com/priyanshulathi/url-based-web-phishing-detection-model

A machine learning project for detecting phishing URLs by extracting key features from web addresses and applying various classification algorithms to identify potential threats effectively.

decision-tree knn logistic-regression machine-learning python random-forest scikit-learn

Last synced: 22 Jan 2025

https://github.com/flysirin/adstextclassification

Classification of advertisements by topic

docker excel flask pandas python pytorch scikit-learn

Last synced: 22 Jan 2025

https://github.com/jordandeklerk/pygridge

A scikit-learn compatible Python package for data-driven group regularized ridge regression

python regression regularized-regression scikit-learn

Last synced: 19 Dec 2024

https://github.com/haloapping/ml-with-me

Kalo dengar istilah ML, biasanya rada ambigu. Soalnya punya beberapa kepanjangan, seperti Mobile Legend, Makan Lontong, dan lain-lain. Tapi pada repo ini membahas Machine Learning :)

ml pusing python3 scikit-learn stress tau-ah-gelap

Last synced: 06 Jan 2025

https://github.com/haloapping/ml-workflow

Template alur kerja machine learning.

mahine-learning numpy pandas python3 scikit-learn

Last synced: 06 Jan 2025

https://github.com/jesly-joji/house-price-prediction

House Price Prediction using Linear Regression with Scikit-learn and Flask

flask regression scikit-learn

Last synced: 22 Jan 2025

https://github.com/7irelo/regressor-model

Video Game Sales Prediction using Decision Tree Regression

joblib matplotlib numpy pandas scikit-learn

Last synced: 13 Jan 2025

https://github.com/akashshnkr/multi-disease-prediction

Developed and integrated three machine learning models for predicting diabetes, Parkinson's, and heart disease into a Streamlit-based web application. The interface allows users to input data and receive accurate health predictions, enhancing early detection and healthcare outcomes.

logistic-regression machine-learning-algorithms numpy pandas python scikit-learn streamlit-webapp svm

Last synced: 22 Jan 2025

https://github.com/lechemi/machine-learning-vademecum

Un notebook contenente nozioni di base ed esempi pratici in python sul machine learning.

machine-learning python scikit-learn

Last synced: 19 Dec 2024

https://github.com/grachale/predict_pass_exam

Creating AdaBoost classifier with decision trees for predicting whether a student will pass or fail an exam (classification) based on the number of study hours and their scores in the previous exam.

adaboost cross-validation decision-tree jupyter-notebook matplotlib python scikit-learn seaborn

Last synced: 13 Jan 2025

https://github.com/udityamerit/all_mlpackages

The `all_MLpackage` is a Python tool for breast cancer detection using scikit-learn's machine learning algorithms. It automates model training, generates accuracy comparisons, and helps identify the best-performing algorithm for diagnosis.

classification knn-algorithm logistic-regression machine-learning numpy pandas scikit-learn svm-model

Last synced: 31 Oct 2024

https://github.com/mpolinowski/isometric-mapping

Non-linear dimensionality reduction through Isometric Mapping

isomap matplotlib-pyplot python scikit-learn

Last synced: 28 Jan 2025

https://github.com/kohlerhector/trex-tree-reward-exploration

Using Tree estimators of the MDP models to then count leaves grouping similar transitions and do count-based exploration.

decision-trees drl exploration rl scikit-learn stable-baselines3

Last synced: 30 Dec 2024

https://github.com/grachale/predict_titanik

Predicting the survival of Titanic passengers (binary classification) with usage of decision tree and KNN from scikit-learn.

classification decision-tree-classifier knn-classifier matplotlib pandas python scikit-learn titanic-survival-prediction

Last synced: 13 Jan 2025

https://github.com/kookmin-sw/capstone-2023-29

자리있어? - 경기도 광역버스 좌석예측 시스템

fastapi lstm postgresql python3 pytorch react scikit-learn sqlalchemy

Last synced: 13 Nov 2024

https://github.com/pockerman/tech3python

Collection of Python based algorithms on numerics, statistics, control etc

algorithms control estimation kalman-filter machine-learning numerical-methods particle-filter python3 scikit-learn statistics

Last synced: 06 Jan 2025

https://github.com/colinwu0403/heartbpmusic

Music discovery platform that recommends you a song based on your heart's BPM and your mood using Machine Learning.

django neurokit2 scikit-learn spotify-web-api vuejs

Last synced: 22 Jan 2025

https://github.com/grachale/predict_life_expect

Predicting life expectancy (regression) with usage of custom random forest, linear regression and decision tree regressor from scikit-learn.

decision-tree-regression jupyter-notebook linear-regression pandas python random-forest regression scikit-learn

Last synced: 13 Jan 2025

https://github.com/ayushshahh/fespn

A neural network made to predict final exam scores of students

mlp mlp-regressor multilayer-perceptron neural-network prediction-model scikit-learn

Last synced: 02 Feb 2025

https://github.com/elcorto/gp_playground

Explore selected topics related to Gaussian processes

gaussian-processes gpy gpytorch kernel-ridge-regression machine-learning scikit-learn tinygp

Last synced: 26 Jan 2025

https://github.com/bhimrazy/iris-species-prediction-using-decision-tree-algorithm-grip

Iris Species Intelligence: Classifying Iris Species with Confidence using Decision Trees | The Sparks Foundation: GRIP

decision-tree-classifier fastapi gripjan23 machine-learning python scikit-learn sparkfoundation

Last synced: 17 Jan 2025

https://github.com/yvesemmanuel/machine_learning

Implements data problems solved with machine learning algorithms.

data-science keras keras-tensorflow linear-algebra machine-learning neural-network python scikit-learn

Last synced: 16 Jan 2025

https://github.com/anans9/ham-spam-analysis

A significant aspect of this application is its advanced ham and spam analysis system. This feature is crucial in maintaining the integrity and quality of the email content.

flask matplotlib pandas python scikit-learn

Last synced: 28 Dec 2024

https://github.com/somjit101/nlp-casestudy-quora-question-similarity

An application of NLP and classical ML algorithms to an interesting real-world use case of predicting similarity between two questions on Quora. This allows the platform to combine similar questions into one and combine their answers to avoid duplication and unnecessary confusion.

cross-validation feature-engineering feature-extraction gradient-boosting kaggle logistic-regression machine-learning model-calibration natural-language-processing nlp quora-question-pairs scikit-learn svm text-mining xgboost

Last synced: 16 Jan 2025

https://github.com/prashver/titanic-survival-prediction

This project tackles the Titanic challenge on Kaggle, predicting passenger survival based on variables like age, sex, and passenger class. The Jupyter notebook covers essential steps of a data science pipeline, including exploratory data analysis, data cleaning, feature engineering, and modeling. The dataset used is the Titanic dataset.

classification-algorithm machine-learning-algorithms matplotlib numpy pandas scikit-learn seaborn

Last synced: 14 Jan 2025

https://github.com/karimosman89/legal-document-nlp

Create a tool that uses NLP to extract key information from legal documents, contracts, or agreements.Use NLP techniques for named entity recognition and text classification.Streamline the review process for legal teams by automating information extraction.

nltk python scikit-learn spacy

Last synced: 28 Dec 2024

https://github.com/jasper-koops/easy-gscv

This library allows you to quickly train machine learning classifiers by automatically splitting the data set and using both grid search and cross validation in the training process.

classification machine-learning python3 scikit-learn

Last synced: 20 Dec 2024

https://github.com/rakibhhridoy/supportvectormachinein-medical

Support vector machine in medical disease detection. Both linear and non-linear data can be fitted in svm through its kernel specialization In medical we focus on precision or recall rather than accuracy.

diabetes-prediction machine-learning medical precision-medicine recall-precision scikit-learn support-vector-machines svm

Last synced: 25 Dec 2024

https://github.com/the-developer-306/house-price-predictor

House Price Predictor: Harnessing machine learning algorithms to forecast housing prices in Boston, empowering buyers and sellers with accurate predictions based on key factors like location, crime rate, rooms, accessibility, and more.

csv ipynb-jupyter-notebook joblib matplotlib numpy pandas python scikit-learn

Last synced: 12 Jan 2025

https://github.com/noahtigner/discoverdaily

A Spotify Recommender System. Trains a Classifier on your musical tastes and recommends songs daily. Uses the Spotify API and scikit-learn for machine learning.

machine-learning recommender-system scikit-learn spotify spotify-api

Last synced: 22 Dec 2024

https://github.com/skekre98/picture-compressor

A tool for compressing images using unsupervised machine learning

kmeans-clustering scikit-learn

Last synced: 22 Jan 2025

https://github.com/bestmahdi2/uni__decisiontreesinpythonpredictingdiabetes

A university project designed in Python decision tree for diabetes diagnosis

decision-tree diabetes-prediction graphviz pandas prediction pydotplus python scikit-learn

Last synced: 17 Jan 2025

https://github.com/lfenzo/ml-solar-sao-paulo

Implementation of scientific project regarding the use of Machine Learning in Solar Radiation Prediction

forecasting machine-learning python scikit-learn

Last synced: 18 Jan 2025

https://github.com/somenath203/titanic-survival-project-backend

Click the link below to check the swagger documentation of the website live

fastapi pandas python render scikit-learn seaborn titanic-survival-predictor

Last synced: 20 Jan 2025

https://github.com/guoshijiang/scikit-learn

带你一起学习scikit-learn

nlp-machine-learning scikit-learn

Last synced: 24 Jan 2025

https://github.com/sarthak-1408/rain-fall-prediction

This repository represents the End to End Machine Learning Project (Rain Fall Prediction in Australia).

heroku heroku-deployment machine-learning numpy pandas rain-fall rain-fall-prediction scikit-learn xgboost-algorithm

Last synced: 16 Jan 2025

https://github.com/hvignolo87/marketing-campaign-classification

Real case of classification with machine learning. Analysis of real data from telemarketing campaigns of a Portuguese bank.

binary-classification data-science pandas python scikit-learn xgbclassifier xgboost

Last synced: 02 Jan 2025

https://github.com/arizdn234/spotify-api-with-colab

Crawling, Analyzing, Clustering music data from Spotify API

machile-learning scikit-learn spotify-api spotipy-library

Last synced: 30 Dec 2024

https://github.com/shreeparab1890/duplicate-question-predictor

The ipython notebook is working to build a model which will detect duplicate questions if two questions pair are given.

bag-of-words nlp nlp-machine-learning nltk numpy pandas python random-forest scikit-learn sklearn streamlit

Last synced: 01 Jan 2025

https://github.com/miguellopezvirues/azure_keyword_cpc

Development and deployment of simple regression model in Azure Machine Learning.

azureml deplyment machine-learning mlflow pandas scikit-learn

Last synced: 24 Dec 2024

https://github.com/mnj-tothetop/english-handwritten-characters-recognizer

A handwritten english character recognizer [0-9, A-Z, a-z] made by using a Dataset of 3409 images. Tensorflow, Keras, Scikit-learn, and OpenCV was used to implement the Convolution Neural Network (CNN). Matplotlib and Seaborn were used to visualize the data.

artificial-intelligence convolutional-neural-networks keras matplotlib opencv-python scikit-learn seaborn tensorflow

Last synced: 21 Jan 2025