Ecosyste.ms: Awesome

An open API service indexing awesome lists of open source software.

Awesome Lists | Featured Topics | Projects

scikit-learn

scikit-learn is a widely-used Python module for classic machine learning. It is built on top of SciPy.

https://github.com/bkamapantula/discover-workshop

Code search utility to assist developer workflows via code discovery. Currently uses tf-idf estimator.

developer-tools pycon python scikit-learn tf-idf

Last synced: 16 Oct 2024

https://github.com/tma15/bunruija

A text classification toolkit

neural-networks pytorch scikit-learn text-classification

Last synced: 11 Oct 2024

https://github.com/vuthanhhai2302/apply-machine-learning-on-data-analytics

My project of applied machine learning on data analytics, using pandas, numpy and scikit-learn to analyze data

data-analysis numpy pandas scikit-learn

Last synced: 11 Nov 2024

https://github.com/rjlovespy/house-price-predictor

A Tkinter GUI whose predictions are based on an ML model that is trained by Random Forest Regressor

cx-freeze gui-development jupyter-notebook machine-learning-models numpy pandas py-to-exe python random-forest-regression scikit-learn tkinter-gui

Last synced: 08 Nov 2024

https://github.com/fork123aniket/contrastive-learning-for-sentence-embeddings

Implementation of Simple Contrastive Learning-based Unsupervised approach to generate sentence embeddings and to perform text similarity in Tensorflow

bert-model contrastive-learning natural-language-processing scikit-learn tensorflow-tutorials tensorflow2 transformers

Last synced: 10 Oct 2024

https://github.com/smola/fastcountvectorizer

FastCountVectorizer is a faster alternative to scikit-learn CountVectorizer.

natural-language-processing python scikit-learn

Last synced: 10 Oct 2024

https://github.com/eonu/daze

Better multi-class confusion matrix plots for Scikit-Learn, incorporating per-class and overall evaluation measures.

accuracy classification confusion-matrix confusion-matrix-heatmap evaluation evaluation-metrics f1-score measures multi-class plot precision recall scikit-learn

Last synced: 10 Oct 2024

https://github.com/aianytime/machine-learning-models-implementation

Implementation of several ML models on real-world datasets with detailed explanation in notebooks.

eda machine-learning machine-learning-algorithms ml numpy pandas pycaret python scikit-learn scikitlearn-machine-learning sklearn

Last synced: 07 Nov 2024

https://github.com/chinaskidev/ml-prediccion-lluvia-brazil

MLOps, usando Docker,Airflow,tensorflow,streamlit

python3 scikit-learn streamlit tensorflow

Last synced: 10 Nov 2024

https://github.com/tanaybhadula/phishing-website-detection

A web application to predicted whether a URL/Website is phishing or not by extracting its lexical features.

classification descision-tree flask machine-learning pandas phishing-detection python random-forest scikit-learn stacking-classifier svm-classifier xgboost

Last synced: 11 Oct 2024

https://github.com/nguyenanht/john-toolbox

This is my own toolbox to explore data science

data-science machine-learning pipeline python pytorch scikit-learn

Last synced: 13 Oct 2024

https://github.com/jameschapman19/scikit-prox

A package for fitting regularized models from scikit-learn via proximal gradient descent

proximal-gradient-descent regularization scikit-learn scikit-learn-api

Last synced: 13 Oct 2024

https://github.com/aianytime/early-stage-diabetes-risk-prediction

Early stage diabetes risk prediction using several supervised machine learning algorithms.

github machine-learning machine-learning-algorithms python scikit-learn

Last synced: 07 Nov 2024

https://github.com/tikam02/wine-shop

Wine Reviews and Recommendation Engine - Web-Application [Django]

django machine-learning numpy pandas python recommender-system scikit-learn wine

Last synced: 01 Nov 2024

https://github.com/juliandavidmr/machinelearningscikit

Clasificador de flores mediante aprendizaje supervisado

neural-network python scikit-learn sklearn

Last synced: 09 Nov 2024

https://github.com/houseofai/french-realestate-price-prediction

Machine Learning appliqué aux Valeurs Foncières Française

google-colab machine-learning opendata scikit-learn

Last synced: 09 Nov 2024

https://github.com/datpham0412/covid19-prediction-model

Machine learning project aimed at predicting new COVID-19 cases using historical COVID-19 and mobility data. The project involves data fetching, migration, preprocessing, exploratory data analysis (EDA), feature engineering, data splitting, model training, and evaluation.

cmake cplusplus-17 dill googletest jupyter-notebook matplotlib pandas python3 scikit-learn scikitlearn-machine-learning seaborn-python sqlite

Last synced: 09 Nov 2024

https://github.com/csinva/trees-to-networks

Bridging random forests and deep neural networks. Partial implementation of "Neural Random Forests" https://arxiv.org/abs/1604.07143

artificial-intelligence classification decision-tree decision-tree-classifier deep-learning machine-learning machinelearning neural-network neural-networks paper-implementations python pytorch random-forest scikit-learn statistics

Last synced: 28 Oct 2024

https://github.com/victormotogna/irislogisticregression

Iris Dataset Logistic Regression - scikit learn version & from scratch

data-science iris-dataset logistic-regression python scikit-learn

Last synced: 21 Oct 2024

https://github.com/alfredfrancis/jarvis2.0

An intelligent Home automation system using Internet of Things and Machine learning

flask internet-of-things machine-learning php python raspberry-pi scikit-learn

Last synced: 28 Oct 2024

https://github.com/klane/springboard

Springboard Data Science Career Track assignments

data-science jupyter-notebook pyspark python scikit-learn springboard sql

Last synced: 31 Oct 2024

https://github.com/shreyamalogi/credit-card-fraud-detection-system

A Credit Card Fraud Detection System using Adaboost and Majority Voting, designed to identify fraudulent credit card transactions by combining the strength of multiple classifiers.

adaboost ensemble-learning majority-voting python scikit-learn

Last synced: 21 Oct 2024

https://github.com/vartikaraj2512/dsml-internship-devtown-notebooks-

🌟 Data Science & Machine Learning Internship Projects 📊 Explore a curated collection of DS & ML notebooks covering topics like regression models, clustering, NLP, and deep learning. Dive into real-world projects such as price prediction, sentiment analysis, and customer segmentation. This repository reflects modern data-driven industry solutions

data-science filehandling googlecolab json kaggle keras machine-learning matplotlib numpy pandas python scikit-learn seaborn sql tensorflow

Last synced: 31 Oct 2024

https://github.com/sayakpaul/floydhub-k_means-blog

Contains the Jupyter Notebook made for a FloydHub article on K-Means

numpy pandas scikit-learn yellowbricks

Last synced: 28 Oct 2024

https://github.com/kamomille/titanic

Auriez vous survécu au naufrage du Titanic ?

data-science jupiter-notebook machine-learning scikit-learn titanic

Last synced: 28 Oct 2024

https://github.com/lugq1990/automl-engine

3 lines of code for automate machine learning for classification and regression

auto-ml automl machine-learning random-forest scikit-learn xgboost

Last synced: 27 Oct 2024

https://github.com/azrdev/sklearn-seco

Implementation of the *Separate and Conquer* / *Covering*-Algorithm for scikit-learn

covering machine-learning scikit-learn sklearn

Last synced: 27 Oct 2024

https://github.com/sayakpaul/bentoml-explorations

Contains my experiments made with the mighty library BentoML

bentoml python rest-api scikit-learn tensorflow zomato

Last synced: 28 Oct 2024

https://github.com/34j/sklearn-utilities

Utilities for scikit-learn. Append prediction to x, append prediction to x single, append x prediction to x, compose var estimator, data frame wrapper, drop by noise prediction, drop missing rows y, dummy regressor var, estimator wrapper base, excluded column transformer pandas, feature union pandas, id transformer, included column transformer pand

catboost feature-engine feature-engineering multioutput pandas pca python pytorch regression scikit-learn sklearn sklearn-compatible skorch torch tqdm

Last synced: 22 Oct 2024

https://github.com/shreyamalogi/personalized-travel-planning-system

"Personalized Travel Planning System," uses a graphical user interface (GUI) to help users plan personalized travel experiences. It recommends tourist destinations based on user preferences and provides information about nearby places.

knn-algorithm machine-learning-algorithms matplotlib numpy pandas python scikit-learn tkinter-gui

Last synced: 21 Oct 2024

https://github.com/janasunrise/ml-guide-and-implementation

This repository contains the predictions, and plots for the datasets included in the scikit learn library by default and also some other datasets from kaggle or other sources.

machine-learning ml python3 scikit scikit-learn scikitlearn-machine-learning sklearn

Last synced: 15 Oct 2024

https://github.com/vopaaz/learning-utility

Assist small-scale machine learning.

data-science machine-learning pandas python3 scikit-learn

Last synced: 13 Oct 2024

https://github.com/vhnegrisoli/materiais-pos-graduacao

Repositório com scripts e notebooks utilizando Python 3 e bancos de dados relacionais e não-relacionais (Oracle, MongoDB, Redis, Neo4J) como estudo para a Pós-Graduação em Data Science & Big Data pela Pontifícia Universidade Católica de Minas Gerais (PUC-MG)

business-intelligence data-science dataviz jupyter-notebook matplotlib mongodb pandas powerbi python scikit-learn

Last synced: 11 Nov 2024

https://github.com/christoph/robics

Automatic detection of robust parametrizations for LDA and NMF. Compatible with scikit-learn and gensim.

gensim lda natural-language-processing nmf robust-parametrizations scikit-learn topic-modeling topic-models

Last synced: 14 Oct 2024

https://github.com/yinleon/s3

s3 helpers for reading files to/from pandas dataframes, moving files between buckets, and persisting scikit-learn classifiers.. all in s3.

pandas-dataframe s3 scikit-learn

Last synced: 11 Oct 2024

https://github.com/34j/lightgbm-callbacks

A collection of LightGBM callbacks. (DART early stopping, tqdm progress bar)

dart early-stopping hacktoberfest lgbm lightgbm lightgbm-dart scikit-learn sklearn sklearn-compatible tqdm

Last synced: 22 Oct 2024

https://github.com/jbeno/datawaza

Data science tools for exploration, visualization, and model iteration.

data-science dataviz machine-learning matplotlib pandas scikit-learn seaborn

Last synced: 10 Oct 2024

https://github.com/caiocarneloz/scyred

Automatic sklearn parameter tuning with bio-inspired algorithms

bio-inspired library parameter-tuning scikit-learn

Last synced: 11 Nov 2024

https://github.com/manena/sp-sentiment-analysis

Sentiment Analysis in Python trained with Amazon Spain reviews in Spanish

jupyter-notebook machine-learning nltk nltk-library python-3-5 pyton scikit-learn sentiment-analysis

Last synced: 12 Oct 2024

https://github.com/nazchanel/fake-news-detection-webapp

A Flask webapp that detects fake news with a given text input using the power of Natural Language Processing. Deployment on Heroku failed due to the program's large memory consumption.

data-science dataset keras keras-tensorflow machine-learning natural-language-processing nlp nlp-machine-learning python scikit-learn tensorflow

Last synced: 09 Nov 2024

https://github.com/rvandewater/recipys

🥧ReciPys: easily define and execute preprocessing and feature engineering steps on Pandas dataframes.

data-science pandas python scikit-learn tidymodels

Last synced: 10 Oct 2024

https://github.com/oneapi-src/customer-segmentation

AI Starter Kit for Customer Segmentation for Online Retail using Intel® Extension for Scikit-learn*

machine-learning scikit-learn

Last synced: 05 Nov 2024

https://github.com/kingabzpro/fastapi-for-ml

Building a simple FastAPI application for model inference.

fastapi jinja2-templates machine-learning scikit-learn

Last synced: 10 Oct 2024

https://github.com/oneapi-src/powerline-fault-detection

AI Starter Kit for detect faulty signals in power line voltage using Intel® Extension for Scikit-learn*

machine-learning scikit-learn

Last synced: 05 Nov 2024

https://github.com/imsanjoykb/automated-spam-mail-detection-and-flask-deployment

This is an simple NLP project in which the model is able to predict the incoming mail whether it is spam or not spam(ham). As we seen in gmail automatically the mail is classified and stored in spam or inbox so this project is prototype.

flask machine-learning naive-bayes-classifier nlp python scikit-learn

Last synced: 12 Oct 2024

https://github.com/oneapi-src/purchase-prediction

AI Starter Kit for Purchase Prediction model using Intel® Extension for Scikit-learn*

machine-learning scikit-learn

Last synced: 05 Nov 2024

https://github.com/luca-parisi/m_arcsinh

m-arcsinh: A Reliable and Efficient Function for Supervised Machine Learning (scikit-learn, TensorFlow, and Keras) and Feature Extraction (scikit-learn)

activation activation-function activation-functions activations arcsinh classification dimensionality-reduction feature-extraction keras keras-tensorflow machine-learning machinelearning mlp mlp-classifier neural-network python scikit-learn svm svm-classifier tensorflow

Last synced: 30 Oct 2024

https://github.com/pateash/kisanmitra-python

Python Machine learning Utility for Kisanmitra Web App

jupyter-notebook machine-learning python scikit-learn

Last synced: 13 Oct 2024

https://github.com/stewartpark/scikit-small-ensemble

scikit-small-ensemble is a library to make your ensemble models(Random Forest Classifier, etc) have a small memory footprint/usage.

compression ensemble-learning lz4 mmap random-forest-classifier scikit-learn

Last synced: 13 Oct 2024

https://github.com/msjahid/machine-learning-projects

A collection of machine learning projects featuring models and algorithms for supervised and unsupervised learning, model evaluation, and optimization.

jupyter matplotlib numpy pandas python scikit-learn seaborn

Last synced: 11 Oct 2024

https://github.com/ranjan2104/diabetes-prediction-application

It is a Model that Predict the Diabetes Status of any person by just giving the some observations so that take the decision on that with the accuracy of 93%+. Due to Trained Model with Data sets so that able to predict very carefully on the previous decision it is supervised learning model using an algorithm is Linear regression and Sklearn for testing and Training the Model and Flask is Uses in Backend and in Frontend HTML, CSS, Js is Using.

flask gunicorn itsdangerous jinja2 markupsafe matplotlib numpy pandas scikit-learn scipy werkzeug

Last synced: 11 Nov 2024

https://github.com/mjahmadee/machinelearning2023

Welcome to the official GitHub repository for the "Machine Learning" course 2023! In this course, we explore the fascinating world of machine learning, diving deep into the algorithms, techniques, and tools that enable computers to learn from data and make intelligent decisions.

machine-learning python scikit-learn

Last synced: 12 Nov 2024

https://github.com/devamoghs/pos-tagger-nltk-scikit-learn

Part-Of-Speech Tagger using custom trained models, implemented with Scikit-Learn and NLTK

machine-learning natural-language-understanding nltk-library part-of-speech-tagger pos-tagger scikit-learn

Last synced: 28 Oct 2024

https://github.com/mekhyw/facial-emotion-classification

Very fast classification model for facial expressions using Mediapipe facial landmarks, Scikit-learn and OpenCV, as part of the DYNAMO project

keras-tensorflow mediapipe opencv scikit-learn spektral

Last synced: 12 Nov 2024

https://github.com/thomasthaddeus/dataanalysistoolkit

DataAnalysisToolkit is a Python-based data analysis tool designed to streamline various data analysis tasks. It provides the ability to load data from CSV files, perform statistical calculations, detect outliers, clean data, and visualize data.

data-science matplotlib python python-script python3 scikit-learn

Last synced: 30 Oct 2024

https://github.com/pr38/numbadecisiontrees

novel 'numba' based recreation of scikit-learn's decision tree algorithm

decision-tree decision-trees machine-learning numba python scikit-learn

Last synced: 09 Nov 2024

https://github.com/pr38/dask_backward_feature_selection

Backward step-wise feature selection using Dask, scikit-learn compatible

dask feature-selection machine-learning python scikit-learn

Last synced: 09 Nov 2024

https://github.com/k5924/stockpredictor

A stock price predictor built using python and tensforflow

python scikit-learn stock-price-prediction tensorflow

Last synced: 09 Nov 2024

https://github.com/engageintellect/bitcoin-price-predictor

This Python project predicts whether the price of Bitcoin will increase or decrease on the next day, using historical price data and machine learning. Additionally, the project visualizes Bitcoin's price movements using candlestick charts along with moving averages for different timeframes.

bitcoin machine-learning matplotlib mplfinance numpy pandas python scikit-learn visualization yfinance

Last synced: 09 Oct 2024

https://github.com/spags093/spotify_song_data

Part 1: Analysis of Spotify song data that uses Machine Learning to determine what features make a "hit" song on Spotify.

machine-learning matplotlib music pandas python scikit-learn seaborn shap spotify spotify-api tensorflow

Last synced: 03 Nov 2024

https://github.com/joseabrantesjr/previsai

O PrevisAI é uma aplicação que utiliza tecnica avançada de deep-learning para prever os preços de fechamento de ações, ETFs, Fundos Imobiliários, Criptomoedas, etc.

acoes criptomoedas deep-learning etf fii keras mercado-financeiro numpy pandas previsao python scikit-learn tensorflow trade trading yfinance

Last synced: 31 Oct 2024

https://github.com/gappeah/ethereum-prediction-ml

A machine learning project that predicts the future price of Ethereum (ETH) using the price data gathered from coincodex.com.

crypto cryptocurrency ethereum jupyternotebook lstm lstm-neural-networks machine-learning machine-learning-algorithms python scikit-learn scikitlearn-machine-learning sklean svm tensorflow

Last synced: 10 Nov 2024

https://github.com/joshua-dias-barreto/diabetes-predictor

The objective of this project is to accurately predict whether a person has diabetes or not

colab-notebook deep-learning diabetes-prediction diabetic-retinopathy-detection logistic-regression machine-learning scikit-learn svm

Last synced: 12 Nov 2024

https://github.com/timzatko/fifa-19-dataset-machine-learning

Player's value prediction and game position classification on FIFA 19 dataset.

data-analysis fifa19 machine-learning scikit-learn

Last synced: 09 Nov 2024

https://github.com/sayakpaul/patients-conversation-detector

Contains my experiments for ZS's hiring hackathon (II).

data-science keras machine-learning nlp python scikit-learn text-classification

Last synced: 28 Oct 2024

https://github.com/plantaest/feverfew

Comprehensive link checker tool for Wikipedia

aws-lambda caddy java mantine onnx python quarkus react scikit-learn typescript

Last synced: 14 Oct 2024

https://github.com/nazchanel/fake-news-detection-algorithm

A fake news detection algorithm. This repository contains the various variations of my original project. WIP.

dataset deep-learning fake-news-detection machine-learning-algorithms natural-language-processing scikit-learn work-in-progress

Last synced: 09 Nov 2024

https://github.com/pr38/dask_tfidf

A Dask native implementation of 'Term Frequency Inverse Document Frequency' for dask-ml and scikit-learn

dask dask-ml distributed-computing machine-learning python scikit-learn

Last synced: 27 Oct 2024

https://github.com/bhavik-jikadara/ai-ml-roadmap

Welcome to the ultimate guide for starting your journey in Artificial Intelligence and Machine Learning in 2024! This roadmap provides a step-by-step approach to mastering AI and ML, from fundamentals to advanced topics.

artificial-intelligence computer-vision deep-learning deployment fundamentals-of-programming keras libraries machine-learning mathematics mlops natural-language-processing production-code pytorch reinforcement-learning roadmap scikit-learn tensorflow tools

Last synced: 09 Nov 2024

https://github.com/metriccoders/metriccoders_notebooks

This is the Metric Coders repository containing all the notebooks for machine learning.

artificial-intelligence genai keras llm machine-learning natural-language-processing pytorch scikit-learn tensorflow

Last synced: 27 Oct 2024

https://github.com/owenodriscoll/automl

Python package for automated hyperparameter-optimization of common machine-learning algorithms

automl catboost classification hyperparameter-optimization lightgbm machine-learning optuna regression scikit-learn xgboost

Last synced: 27 Oct 2024

https://github.com/g0r0kh/clustering

k-means & hierarchical clustering

conda matplot numpy pandas scikit-learn scipy sklearn

Last synced: 14 Oct 2024

https://github.com/kohlerhector/dpdt-py

Implementation of Dynamic Programming Decision Tree algorithm (Kohler et. al. 2024).

decision-tree-classifier decision-trees dynamic-programming scikit-learn scikitlearn-machine-learning sklearn sklearn-classifier

Last synced: 08 Nov 2024

https://github.com/kohlerhector/tree-mbpo

Study Model-Based Policy Optimization by varying the model estimator classes (e.g Decision Trees vs MLP)

decision-tree mbpo mbrl mlp rl sac scikit-learn stable-baselines3

Last synced: 08 Nov 2024

https://github.com/cyberfantics/bitcoin-price-prediction

A deep learning-based web app for predicting future Bitcoin prices using historical data. Users can interactively select prediction days and view recent price data in real-time.

artificial-intelligence artificial-neural-networks bitcoin deep-learning machine-learning neural-network prediction-model scikit-learn tensorflow

Last synced: 02 Nov 2024

https://github.com/facultyai/faculty-xval

Cross-validation of Keras and scikit-learn models with the Faculty platform

cross-validation faculty-platform keras machine-learning python scikit-learn

Last synced: 08 Nov 2024

https://github.com/oscarqjh/ntu_sc1015_project

A mini project for NTU's data science and artificial intelligence mod - Analysis on League of Legends competitive matches

data-science machine-learning pandas python scikit-learn

Last synced: 09 Nov 2024

https://github.com/shreyansh055/time-series-forecasting_055

The Time Series Forecasting Project predicts future trends using historical data with Python, Pandas, and models like ARIMA, LSTM, and Prophet, focusing on scalable, accurate forecasting for business and finance.

lstm matplotlib numpy pandas python scikit-learn seaborn

Last synced: 31 Oct 2024