Ecosyste.ms: Awesome

An open API service indexing awesome lists of open source software.

Awesome Lists | Featured Topics | Projects

scikit-learn

scikit-learn is a widely-used Python module for classic machine learning. It is built on top of SciPy.

https://github.com/chalmerlowe/machine_learning

A gentle introduction to machine learning: data handling, linear regression, naive bayes, clustering

data data-science linear-regression machine-learning nearest-neighbors python scikit-learn

Last synced: 12 Oct 2024

https://github.com/mind-the-pineapple/sklearn-rvm

An sklearn style implementation of the Relevance Vector Machine (RVM).

machine-learning relevance-vector-machine scikit-learn sklearn

Last synced: 10 Oct 2024

https://github.com/ksachdeva/scikit-nni

AutoML - Hyper parameters search for scikit-learn pipelines using Microsoft NNI

automl hyperparameter-search hyperparameters neural-network-intelligence nni scikit-learn scikit-learn-api sklearn sklearn-library tool

Last synced: 10 Oct 2024

https://github.com/godfanmiao/ml-kaggle-github-2022

《 Python机器学习及实践:从零开始通往Kaggle竞赛之路(2022年度版)》全书数据和开源代码

paddlepaddle pandas pyspark python3 pytorch scikit-learn tensorflow2

Last synced: 08 Nov 2024

https://github.com/mfpierre/coreml-scikit-example

Apple CoreML example with scikit-learn

apple-coreml coreml scikit-learn

Last synced: 07 Aug 2024

https://github.com/davisidarta/fastlapmap

Fast Laplacian Eigenmaps: lightweight multicore LE for non-linear dimensional reduction with minimal memory usage. Outperforms sklearn's implementation and escalates linearly beyond 10e6 samples.

denoising dimensionality-reduction embedding feature-engineering laplacian-eigenmaps machine-learning multithreading python scikit-learn

Last synced: 13 Oct 2024

https://github.com/392781/scikit-ntk

Neural Tangent Kernel (NTK) module for the scikit-learn library

kernel-methods machine-learning neural-tangent-kernel ntk scikit scikit-learn

Last synced: 13 Oct 2024

https://github.com/boniolp/theseus

[VLDB 2022] Dash application for "Navigating the Labyrinth of Time Series Anomaly Detection"

anomaly-detection dash dashboard pandas plotly plotly-dash python scikit-learn subsequence time-series time-series-analysis webapp

Last synced: 09 Nov 2024

https://github.com/howardyclo/kmeans-dbscan-tutorial

A clustering tutorial with scikit-learn for beginners.

clustering-algorithm dbscan ipython-notebook kmeans scikit-learn tutorial

Last synced: 11 Oct 2024

https://github.com/boniolp/Theseus

[VLDB 2022] Dash application for "Navigating the Labyrinth of Time Series Anomaly Detection"

anomaly-detection dash dashboard pandas plotly plotly-dash python scikit-learn subsequence time-series time-series-analysis webapp

Last synced: 30 Oct 2024

https://github.com/smarie/python-m5p

An implementation of M5 and model trees in python, compliant with scikit-learn.

m5 machine-learning model prime regression scikit-learn tree

Last synced: 11 Oct 2024

https://github.com/ragibhasan894/phishing_website_detection

This project is based on detecting phishing/fraud/malicious website using Random Forest Classification formula. Implemented using Python programming language and Django framework.

cyber-security data-mining data-science django django-framework machine-learning phsihing python random-forest scikit-learn security

Last synced: 11 Oct 2024

https://github.com/gauravpandeylab/eipy

Ensemble Integration: a customizable pipeline for generating multi-modal, heterogeneous ensembles

classification ensemble interpretation machine-learning multimodal nested-cross-validation predictive-modeling scikit-learn

Last synced: 10 Oct 2024

https://github.com/mapbox/gabbar

Guarding OpenStreetMap from harmful edits using machine learning

banished jupyter-notebook machine-learning openstreetmap scikit-learn vandalism

Last synced: 14 Oct 2024

https://github.com/orestis-z/facial-beauty-predictor

Deep learning model to predict a beauty score for faces in images. Outperforms the state-of-the-art by up to 18% (2019).

computer-vision deep-learning deep-neural-networks facial-beauty-prediction scikit-learn tensorflow

Last synced: 28 Oct 2024

https://github.com/pateash/kisanmitra

Crop Yield Prediction Web App Built using Sklearn and Laravel Web Framework

crop-yeild-prediction farmers laravel machinelearning scikit-learn

Last synced: 13 Oct 2024

https://github.com/scikit-multilearn-ng/scikit-multilearn-ng

A new maintained "successor" to scikit-multilearn, a scikit-learn based module for multi-label et. al. classification

classification clustering label-prediction machine-learning multi-label multi-label-classification partitioning scikit scikit-learn scikit-multilearn

Last synced: 13 Oct 2024

https://github.com/dunnkers/fseval

Benchmarking framework for Feature Selection and Feature Ranking algorithms 🚀

automl benchmarking benchmarking-framework benchmarks feature-rankers feature-ranking feature-selection hydra machine-learning python scikit-learn wandb

Last synced: 07 Nov 2024

https://github.com/mauroluzzatto/explainy

explainy is a Python library for generating machine learning model explanations for humans

data-science explanation machine-learning machine-learning-explainability python scikit-learn

Last synced: 11 Nov 2024

https://github.com/jakoch/jupyter-devbox

A Docker DevBox for Jupyter Notebook's with a focus on Computer Vision, Machine Learning, Finance, Statistics and Visualization.

debian docker imutils jupyter-notebook keras opencv pandas python3 scikit-learn scipy seaborn tensorflow

Last synced: 28 Oct 2024

https://github.com/curiousily/Reproducible-ML-with-DVC

Tutorial on experiment tracking and reproducibility for Machine Learning projects with DVC

deep-learning dvc experiment-tracking linear-regression machine-learning metrics python random-forest reproducibility scikit-learn tracking

Last synced: 13 Aug 2024

https://github.com/fbruzzesi/sklearn-smithy

Toolkit to forge scikit-learn compatible estimators

cli data-science machine-learning python scikit-learn webui

Last synced: 10 Oct 2024

https://github.com/curiousily/reproducible-ml-with-dvc

Tutorial on experiment tracking and reproducibility for Machine Learning projects with DVC

deep-learning dvc experiment-tracking linear-regression machine-learning metrics python random-forest reproducibility scikit-learn tracking

Last synced: 11 Nov 2024

https://github.com/lucaangioloni/fit-covid19

Easy model to fit logistic curve to COVID19 data from Italy. Demo: https://fit-covid19.herokuapp.com

contagi-giornalieri coronavirus covid-19 covid-virus covid19 demo forecasting italy logistic prediction python regression scikit-learn totale-contagi

Last synced: 20 Oct 2024

https://github.com/bgu-cs-vil/pdc-dp-means

"Revisiting DP-Means: Fast Scalable Algorithms via Parallelism and Delayed Cluster Creation" [Dinari and Freifeld, UAI 2022]

clustering dpmeans kmeans machine-learning minibatch scikit-learn

Last synced: 13 Oct 2024

https://github.com/balins/fuzzytree

A Fuzzy Decision Tree implementation for Python.

classification decision-trees fuzzy-logic machine-learning scikit-learn

Last synced: 08 Nov 2024

https://github.com/karelze/tclf

A scikit-learn compatible classifier to perform trade classification in Python.

empirical finance microstructure python rule-based-classifier scikit-learn trade-classification

Last synced: 11 Oct 2024

https://github.com/fabianp/mash_2016_sklearn_intro

Material for the MASH course on introduction to scikit-learn

machine-learning notebooks scikit-learn tutorial

Last synced: 16 Oct 2024

https://github.com/KarelZe/tclf

A scikit-learn compatible classifier to perform trade classification in Python.

empirical finance microstructure python rule-based-classifier scikit-learn trade-classification

Last synced: 17 Aug 2024

https://github.com/scikit-learn-contrib/sklearn-ann

Integration with (approximate) nearest neighbors libraries for scikit-learn + clustering based on with kNN-graphs.

approximate-nearest-neighbor-search clustering knn knn-graphs scikit-learn

Last synced: 06 Nov 2024

https://github.com/rclement/datasette-ml

A Datasette plugin providing an MLOps platform to train, eval and predict machine learning models

ai datasette datasette-plugin machine-learning mlops python scikit-learn sql sqlite

Last synced: 11 Oct 2024

https://github.com/manasvigoyal/gmail-classification

Extract Emails from Gmail account, convert to Excel file and classify using various classification algorithms.

beautifulsoup classification email-classification excel gmail jupyter-notebooks machine-learning matplotlib numpy pandas python scikit-learn seaborn

Last synced: 11 Oct 2024

https://github.com/godfanmiao/pyai-github-2024

《 Python人工智能编程实践(2024年度版)》全书数据和开源代码【已出版】

git paddlepaddle pandas pyspark python3 pytorch scikit-learn tensorflow

Last synced: 08 Nov 2024

https://github.com/soda-inria/sklearn-numba-dpex

Experimental plugin for scikit-learn to be able to run (some estimators) on Intel GPUs via numba-dpex.

gpu intel numba-dpex scikit-learn

Last synced: 31 Oct 2024

https://github.com/ashishpatel26/MLflow_End_to_End_Example

MLflow is Open source platform for the machine learning lifecycle so here you can learn MLflow End to End Example with Prediction.

mlflow mlflow-example mlflow-tracking scikit-learn xgboost

Last synced: 03 Aug 2024

https://github.com/younader/dnnr

The Python package of differential nearest neighbors regression (DNNR): Raising KNN-regression to levels of gradient boosting method. Build on-top of Numpy, Scikit-Learn, and Annoy.

annoy knn machine-learning machine-learning-algorithms numpy python3 scikit-learn tabular-data

Last synced: 08 Nov 2024

https://github.com/ashishpatel26/sklearn-ranking

Sklearn-ranking is ranking algorithm used for recommendation system algorithm. RANKSVM, RANKBOOST, RANKNET is included in this package

ranking ranking-algorithm ranking-methods scikit-learn sklearn sklearn-ranking

Last synced: 14 Oct 2024

https://github.com/aleximb/automl-streams

AutoML framework for implementing automated machine learning on data streams

automl data-streams scikit-learn

Last synced: 27 Oct 2024

https://github.com/san089/big_data_project

Fake News Detection - Feature Extraction using Vectorization such as Count Vectorizer, TFIDF Vectorizer, Hash Vectorizer,. Then used an Ensemble model to classify whether the news is fake or not.

classifiers ensemble-model fakenewsdetection machine-learning news-classification scikit-learn text-mining textclassification vectorization vectorizers

Last synced: 12 Oct 2024

https://github.com/zeke/github-avatars

A machine learning model to detect whether a GitHub user has a custom or default avatar

cog machine-learning replicate scikit-learn

Last synced: 09 Nov 2024

https://github.com/huangcongqing/sklearn

scikit-learn (sklearn) 常用的机器学习,包括回归(Regression)、降维(Dimensionality Reduction)、分类(Classfication)、聚类(Clustering)等方法

scikit-learn sklearn

Last synced: 09 Nov 2024

https://github.com/sayakpaul/generating-categories-from-arxiv-paper-titles

This project takes the arXiv dataset and builds an automatic tag classifier from the arXiv article/paper titles

arxiv deep-learning gcp natural-language-processing scikit-learn tensorflow wandb

Last synced: 23 Oct 2024

https://github.com/snehankekre/streamlit-yellowbrick

Streamlit component for the Yellowbrick visualization and model diagnostics library

data-visualization machine-learning matplotlib python scikit-learn streamlit visualization yellowbrick

Last synced: 10 Oct 2024

https://github.com/genfifth/cvopt

Machine learning's parameter search and feature selection module which is integrated log management and visualization.

bayesian-optimization deep-learning feature-selection hyperopt hyperparameter-optimization integrated-visualization keras logmanagement machine-learning python scikit-learn

Last synced: 13 Oct 2024

https://github.com/hbldh/skboost

MILBoost and other boosting algorithms, compatible with scikit-learn

boosting boosting-algorithms machine-learning scikit-learn

Last synced: 15 Oct 2024

https://github.com/rohankishore/graphyte

A simple, and responsive math graphing app powered by PyQt6, NumPy and Matplotlib. Create and analyze functions with ease.

equation-solver geogebra graph graph-algorithms matplotlib numpy pyqt6 python scikit-learn visualization

Last synced: 27 Oct 2024

https://github.com/chandraprakash-bathula/apparel-recommendations

This project implements a personalized apparel recommendation engine using content-based search with the Amazon API, NLTK, and Keras libraries.

boxplot cnn-keras data-analysis data-science deep-learning linear-regression machine-learning numpy pandas scatter-plot scikit-learn svm tensorflow xgboost

Last synced: 28 Oct 2024

https://github.com/snoop2head/instagram_hashtag_analysis

📷 Crawl and Analyze Instagram Hashtag Data: KoNLPY to gensim word2Vec & scikit-learn TF-IDF

adjective gensim gensim-word2vec instagram-hashtag-analysis konlpy natural-language-processing noun scikit-learn scikitlearn tf-idf word2vec

Last synced: 04 Nov 2024

https://github.com/kenlimmj/fightin-words

A scikit-learn compliant implementation of Monroe et al.'s Fightin' Words analysis method.

bayesian-methods evaluation-metrics nlp scikit-learn

Last synced: 12 Oct 2024

https://github.com/esvs2202/concrete-compressive-strength-prediction

The aim of this project is to develop a solution using Data science and machine learning to predict the compressive strength of a concrete with respect to the its age and the quantity of ingredients used.

anaconda data-visualization flask gunicorn-web-server heroku-deployment html5 joblib jupyter-notebook machine-learning-algorithms matplotlib-pyplot numpy pandas pycharm-ide python3 randomizedsearchcv scikit-learn seaborn statsmodels xgboost-regression

Last synced: 09 Oct 2024

https://github.com/supercowpowers/scp-labs

SCP Labs (Open Source Team for SuperCowPowers)

data-analysis data-science pandas python scikit-learn security

Last synced: 09 Nov 2024

https://github.com/axegon/sklite

Transpile scikit-learn models to Flutter

fluttter python36 scikit-learn

Last synced: 11 Oct 2024

https://github.com/ozguraslank/flexml

Easy-to-use and flexible AutoML library for Python

automl data-science machine-learning python scikit-learn

Last synced: 10 Oct 2024

https://github.com/mohd-faizy/career-track-data-scientist-with-python

This Repo contains tools that allow us to import, clean, manipulate, and visualize data —Includes Python libraries, like pandas, NumPy, Matplotlib, and many more to work with real-world datasets to learn the statistical and machine learning techniques.

data-science data-visualization datascience-machinelearning datasciencecoursera datascientist datascientisttraining decision-trees hypothesis hypothesis-testing machine-learning machine-learning-algorithms nlp-machine-learning numpy pandas python scikit-learn seaborn statistical

Last synced: 28 Oct 2024

https://github.com/brackendev/scikit-learn-hy

An introduction to scikit-learn (machine learning in Python) and Hy (a Lisp dialect embedded in Python)

hy hylang machine-learning python scikit-learn tutorial

Last synced: 10 Nov 2024

https://github.com/starkblaze01/sentiment-analyzer

Model for Sentiment Analysis using Naive Bayes and CNN, and implementation of Model on Tweets and Web Application using React

flask-server keras-tensorflow python reactjs scikit-learn sentiment-analysis sklearn tweepy typescript

Last synced: 27 Oct 2024

https://github.com/yu9824/kennard_stone

This is an algorithm for evenly partitioning.

kfold-cross-validation python scikit-learn train-test-split

Last synced: 27 Oct 2024

https://github.com/jeffzi/pandas-select

Supercharged pandas indexing

pandas pandera python scikit-learn

Last synced: 08 Nov 2024

https://github.com/akoury/ml-helper

Python library with helpers to speed up and structure machine learning projects.

data data-visualization machine-learning ml python scikit-learn sklearn

Last synced: 10 Oct 2024

https://github.com/centre-for-humanities-computing/stormtrooper

Zero/few shot learning components for scikit-learn pipelines with LLMs and transformers.

chatgpt few-shot-learning gpt-4 large-language-models llm scikit-learn transformer transformers zero-shot-learning

Last synced: 10 Oct 2024

https://github.com/r-m-n/sklearn-deltatfidf

DeltaTfidfVectorizer for scikit-learn

delta-tf-idf python scikit-learn sentiment-analysis sklearn tf-idf

Last synced: 14 Oct 2024

https://github.com/alexioannides/lime-interpretable-ml

An example of how the LIME algorithm can be used to provide real-world insight into the decision processes of a 'black-box' machine learning algorithm - in this case a Radom Forest regressor.

data-science interpretability lime machine-learning numpy pandas pydata python scikit-learn

Last synced: 12 Oct 2024

https://github.com/contextlab/data-wrangler

Wrangle messy numerical, image, and text data into consistent well-organized formats

data data-analysis data-science data-wrangling hugging-face image-data machine-learning nlp numpy pandas python scikit-learn

Last synced: 12 Oct 2024

https://github.com/isala404/r2d2

Line Following Robot Powered By OpenCV and Machine Learing

ai linefollower machine-learing numpy opencv python3 rasberrypi scikit-learn tensorflow

Last synced: 15 Oct 2024

https://github.com/sayakpaul/dockerml

Contains my explorations of using Docker to automate ML workflows.

ci-cd docker scikit-learn tensorflow wandb

Last synced: 23 Oct 2024

https://github.com/bnediction/scboolseq

scBoolSeq: scRNA-Seq data binarisation and synthetic generation from Boolean dynamics

bioinformatics boolean-networks computational-biology machine-learning pandas python3 scikit-learn scrna-seq single-cell-rna-seq

Last synced: 12 Oct 2024

https://github.com/ccrvlh/autoclass

Script that automatically classifies your bank statement into pre-determined labels.

banking banking-applications machine-learning personal-finance python scikit-learn

Last synced: 13 Oct 2024