Ecosyste.ms: Awesome

An open API service indexing awesome lists of open source software.

Awesome Lists | Featured Topics | Projects

scikit-learn

scikit-learn is a widely-used Python module for classic machine learning. It is built on top of SciPy.

https://github.com/pateash/kisanmitra

Crop Yield Prediction Web App Built using Sklearn and Laravel Web Framework

crop-yeild-prediction farmers laravel machinelearning scikit-learn

Last synced: 13 Oct 2024

https://github.com/scikit-multilearn-ng/scikit-multilearn-ng

A new maintained "successor" to scikit-multilearn, a scikit-learn based module for multi-label et. al. classification

classification clustering label-prediction machine-learning multi-label multi-label-classification partitioning scikit scikit-learn scikit-multilearn

Last synced: 13 Oct 2024

https://github.com/dunnkers/fseval

Benchmarking framework for Feature Selection and Feature Ranking algorithms 🚀

automl benchmarking benchmarking-framework benchmarks feature-rankers feature-ranking feature-selection hydra machine-learning python scikit-learn wandb

Last synced: 07 Nov 2024

https://github.com/curiousily/Reproducible-ML-with-DVC

Tutorial on experiment tracking and reproducibility for Machine Learning projects with DVC

deep-learning dvc experiment-tracking linear-regression machine-learning metrics python random-forest reproducibility scikit-learn tracking

Last synced: 13 Aug 2024

https://github.com/fbruzzesi/sklearn-smithy

Toolkit to forge scikit-learn compatible estimators

cli data-science machine-learning python scikit-learn webui

Last synced: 10 Oct 2024

https://github.com/jakoch/jupyter-devbox

A Docker DevBox for Jupyter Notebook's with a focus on Computer Vision, Machine Learning, Finance, Statistics and Visualization.

debian docker imutils jupyter-notebook keras opencv pandas python3 scikit-learn scipy seaborn tensorflow

Last synced: 28 Oct 2024

https://github.com/balins/fuzzytree

A Fuzzy Decision Tree implementation for Python.

classification decision-trees fuzzy-logic machine-learning scikit-learn

Last synced: 08 Nov 2024

https://github.com/bgu-cs-vil/pdc-dp-means

"Revisiting DP-Means: Fast Scalable Algorithms via Parallelism and Delayed Cluster Creation" [Dinari and Freifeld, UAI 2022]

clustering dpmeans kmeans machine-learning minibatch scikit-learn

Last synced: 13 Oct 2024

https://github.com/karelze/tclf

A scikit-learn compatible classifier to perform trade classification in Python.

empirical finance microstructure python rule-based-classifier scikit-learn trade-classification

Last synced: 11 Oct 2024

https://github.com/fabianp/mash_2016_sklearn_intro

Material for the MASH course on introduction to scikit-learn

machine-learning notebooks scikit-learn tutorial

Last synced: 16 Oct 2024

https://github.com/lucaangioloni/fit-covid19

Easy model to fit logistic curve to COVID19 data from Italy. Demo: https://fit-covid19.herokuapp.com

contagi-giornalieri coronavirus covid-19 covid-virus covid19 demo forecasting italy logistic prediction python regression scikit-learn totale-contagi

Last synced: 20 Oct 2024

https://github.com/KarelZe/tclf

A scikit-learn compatible classifier to perform trade classification in Python.

empirical finance microstructure python rule-based-classifier scikit-learn trade-classification

Last synced: 17 Aug 2024

https://github.com/rclement/datasette-ml

A Datasette plugin providing an MLOps platform to train, eval and predict machine learning models

ai datasette datasette-plugin machine-learning mlops python scikit-learn sql sqlite

Last synced: 11 Oct 2024

https://github.com/scikit-learn-contrib/sklearn-ann

Integration with (approximate) nearest neighbors libraries for scikit-learn + clustering based on with kNN-graphs.

approximate-nearest-neighbor-search clustering knn knn-graphs scikit-learn

Last synced: 06 Nov 2024

https://github.com/manasvigoyal/gmail-classification

Extract Emails from Gmail account, convert to Excel file and classify using various classification algorithms.

beautifulsoup classification email-classification excel gmail jupyter-notebooks machine-learning matplotlib numpy pandas python scikit-learn seaborn

Last synced: 11 Oct 2024

https://github.com/godfanmiao/pyai-github-2024

《 Python人工智能编程实践(2024年度版)》全书数据和开源代码【已出版】

git paddlepaddle pandas pyspark python3 pytorch scikit-learn tensorflow

Last synced: 08 Nov 2024

https://github.com/soda-inria/sklearn-numba-dpex

Experimental plugin for scikit-learn to be able to run (some estimators) on Intel GPUs via numba-dpex.

gpu intel numba-dpex scikit-learn

Last synced: 31 Oct 2024

https://github.com/ashishpatel26/MLflow_End_to_End_Example

MLflow is Open source platform for the machine learning lifecycle so here you can learn MLflow End to End Example with Prediction.

mlflow mlflow-example mlflow-tracking scikit-learn xgboost

Last synced: 03 Aug 2024

https://github.com/younader/dnnr

The Python package of differential nearest neighbors regression (DNNR): Raising KNN-regression to levels of gradient boosting method. Build on-top of Numpy, Scikit-Learn, and Annoy.

annoy knn machine-learning machine-learning-algorithms numpy python3 scikit-learn tabular-data

Last synced: 08 Nov 2024

https://github.com/ashishpatel26/sklearn-ranking

Sklearn-ranking is ranking algorithm used for recommendation system algorithm. RANKSVM, RANKBOOST, RANKNET is included in this package

ranking ranking-algorithm ranking-methods scikit-learn sklearn sklearn-ranking

Last synced: 14 Oct 2024

https://github.com/aleximb/automl-streams

AutoML framework for implementing automated machine learning on data streams

automl data-streams scikit-learn

Last synced: 27 Oct 2024

https://github.com/huangcongqing/sklearn

scikit-learn (sklearn) 常用的机器学习,包括回归(Regression)、降维(Dimensionality Reduction)、分类(Classfication)、聚类(Clustering)等方法

scikit-learn sklearn

Last synced: 24 Oct 2024

https://github.com/sayakpaul/generating-categories-from-arxiv-paper-titles

This project takes the arXiv dataset and builds an automatic tag classifier from the arXiv article/paper titles

arxiv deep-learning gcp natural-language-processing scikit-learn tensorflow wandb

Last synced: 23 Oct 2024

https://github.com/zeke/github-avatars

A machine learning model to detect whether a GitHub user has a custom or default avatar

cog machine-learning replicate scikit-learn

Last synced: 21 Oct 2024

https://github.com/snehankekre/streamlit-yellowbrick

Streamlit component for the Yellowbrick visualization and model diagnostics library

data-visualization machine-learning matplotlib python scikit-learn streamlit visualization yellowbrick

Last synced: 10 Oct 2024

https://github.com/san089/big_data_project

Fake News Detection - Feature Extraction using Vectorization such as Count Vectorizer, TFIDF Vectorizer, Hash Vectorizer,. Then used an Ensemble model to classify whether the news is fake or not.

classifiers ensemble-model fakenewsdetection machine-learning news-classification scikit-learn text-mining textclassification vectorization vectorizers

Last synced: 12 Oct 2024

https://github.com/genfifth/cvopt

Machine learning's parameter search and feature selection module which is integrated log management and visualization.

bayesian-optimization deep-learning feature-selection hyperopt hyperparameter-optimization integrated-visualization keras logmanagement machine-learning python scikit-learn

Last synced: 13 Oct 2024

https://github.com/hbldh/skboost

MILBoost and other boosting algorithms, compatible with scikit-learn

boosting boosting-algorithms machine-learning scikit-learn

Last synced: 15 Oct 2024

https://github.com/rohankishore/graphyte

A simple, and responsive math graphing app powered by PyQt6, NumPy and Matplotlib. Create and analyze functions with ease.

equation-solver geogebra graph graph-algorithms matplotlib numpy pyqt6 python scikit-learn visualization

Last synced: 27 Oct 2024

https://github.com/chandraprakash-bathula/apparel-recommendations

This project implements a personalized apparel recommendation engine using content-based search with the Amazon API, NLTK, and Keras libraries.

boxplot cnn-keras data-analysis data-science deep-learning linear-regression machine-learning numpy pandas scatter-plot scikit-learn svm tensorflow xgboost

Last synced: 28 Oct 2024

https://github.com/snoop2head/instagram_hashtag_analysis

📷 Crawl and Analyze Instagram Hashtag Data: KoNLPY to gensim word2Vec & scikit-learn TF-IDF

adjective gensim gensim-word2vec instagram-hashtag-analysis konlpy natural-language-processing noun scikit-learn scikitlearn tf-idf word2vec

Last synced: 04 Nov 2024

https://github.com/kenlimmj/fightin-words

A scikit-learn compliant implementation of Monroe et al.'s Fightin' Words analysis method.

bayesian-methods evaluation-metrics nlp scikit-learn

Last synced: 12 Oct 2024

https://github.com/esvs2202/concrete-compressive-strength-prediction

The aim of this project is to develop a solution using Data science and machine learning to predict the compressive strength of a concrete with respect to the its age and the quantity of ingredients used.

anaconda data-visualization flask gunicorn-web-server heroku-deployment html5 joblib jupyter-notebook machine-learning-algorithms matplotlib-pyplot numpy pandas pycharm-ide python3 randomizedsearchcv scikit-learn seaborn statsmodels xgboost-regression

Last synced: 09 Oct 2024

https://github.com/axegon/sklite

Transpile scikit-learn models to Flutter

fluttter python36 scikit-learn

Last synced: 11 Oct 2024

https://github.com/ozguraslank/flexml

Easy-to-use and flexible AutoML library for Python

automl data-science machine-learning python scikit-learn

Last synced: 10 Oct 2024

https://github.com/mohd-faizy/career-track-data-scientist-with-python

This Repo contains tools that allow us to import, clean, manipulate, and visualize data —Includes Python libraries, like pandas, NumPy, Matplotlib, and many more to work with real-world datasets to learn the statistical and machine learning techniques.

data-science data-visualization datascience-machinelearning datasciencecoursera datascientist datascientisttraining decision-trees hypothesis hypothesis-testing machine-learning machine-learning-algorithms nlp-machine-learning numpy pandas python scikit-learn seaborn statistical

Last synced: 28 Oct 2024

https://github.com/starkblaze01/sentiment-analyzer

Model for Sentiment Analysis using Naive Bayes and CNN, and implementation of Model on Tweets and Web Application using React

flask-server keras-tensorflow python reactjs scikit-learn sentiment-analysis sklearn tweepy typescript

Last synced: 27 Oct 2024

https://github.com/yu9824/kennard_stone

This is an algorithm for evenly partitioning.

kfold-cross-validation python scikit-learn train-test-split

Last synced: 27 Oct 2024

https://github.com/akoury/ml-helper

Python library with helpers to speed up and structure machine learning projects.

data data-visualization machine-learning ml python scikit-learn sklearn

Last synced: 10 Oct 2024

https://github.com/r-m-n/sklearn-deltatfidf

DeltaTfidfVectorizer for scikit-learn

delta-tf-idf python scikit-learn sentiment-analysis sklearn tf-idf

Last synced: 14 Oct 2024

https://github.com/centre-for-humanities-computing/stormtrooper

Zero/few shot learning components for scikit-learn pipelines with LLMs and transformers.

chatgpt few-shot-learning gpt-4 large-language-models llm scikit-learn transformer transformers zero-shot-learning

Last synced: 10 Oct 2024

https://github.com/alexioannides/lime-interpretable-ml

An example of how the LIME algorithm can be used to provide real-world insight into the decision processes of a 'black-box' machine learning algorithm - in this case a Radom Forest regressor.

data-science interpretability lime machine-learning numpy pandas pydata python scikit-learn

Last synced: 12 Oct 2024

https://github.com/contextlab/data-wrangler

Wrangle messy numerical, image, and text data into consistent well-organized formats

data data-analysis data-science data-wrangling hugging-face image-data machine-learning nlp numpy pandas python scikit-learn

Last synced: 12 Oct 2024

https://github.com/bnediction/scboolseq

scBoolSeq: scRNA-Seq data binarisation and synthetic generation from Boolean dynamics

bioinformatics boolean-networks computational-biology machine-learning pandas python3 scikit-learn scrna-seq single-cell-rna-seq

Last synced: 12 Oct 2024

https://github.com/isala404/r2d2

Line Following Robot Powered By OpenCV and Machine Learing

ai linefollower machine-learing numpy opencv python3 rasberrypi scikit-learn tensorflow

Last synced: 15 Oct 2024

https://github.com/sayakpaul/dockerml

Contains my explorations of using Docker to automate ML workflows.

ci-cd docker scikit-learn tensorflow wandb

Last synced: 23 Oct 2024

https://github.com/ccrvlh/autoclass

Script that automatically classifies your bank statement into pre-determined labels.

banking banking-applications machine-learning personal-finance python scikit-learn

Last synced: 13 Oct 2024

https://github.com/jeffzi/pandas-select

Supercharged pandas indexing

pandas pandera python scikit-learn

Last synced: 19 Oct 2024

https://github.com/qiancao/hskl

A library for hyperspectral image analysis using scikit-learn.

hyperspectral image-analysis-toolbox machine-learning scikit-learn

Last synced: 14 Oct 2024

https://github.com/sshh12/csgo-market-analysis

Some interesting stats from the CS:GO Steam community market.

csgo csgo-skins scikit-learn steam-market

Last synced: 27 Oct 2024

https://github.com/alexioannides/bodywork-mlops-demo

Demonstrating how Bodywork can be used to deploy a simulation of the lifecycle of a train-and-serve ML pipeline, responding to new data undergoing concept drift.

aws data-science docker kubernetes machine-learning mlops numpy python scikit-learn

Last synced: 12 Oct 2024

https://github.com/chrislemke/sk-transformers

A collection of pandas & scikit-learn compatible transformers for preprocessing and feature engineering 🛠

data-science feature-engineering feature-selection machine-learning pandas preprocessing python scikit-learn scikit-learn-pipelines scikit-learn-transformer

Last synced: 13 Oct 2024

https://github.com/gbolmier/sklearn-neighbors-benchmark

:bar_chart: Scikit-learn nearest neighbors algorithms benchmark

benchmark nearest-neighbors-algorithms scikit-learn

Last synced: 13 Oct 2024

https://github.com/tlapusan/woodpecker

A python library used for tree structure interpretation.

decision-trees machine-learning random-forest scikit-learn sklearn visualization

Last synced: 24 Oct 2024

https://github.com/octo-technology/ddui

Airflow's plugin for Data Science pipeline visualisation

airflow airflow-plugin datadriver datascience ml pandas-python scikit-learn

Last synced: 10 Oct 2024

https://github.com/lai-bluejay/diego

Diego: Data in, IntElliGence Out. A fast framework that supports the rapid construction of automated learning tasks. Simply create an automated learning study (Study) and generate correlated trials (Trial). Then run the code and get a machine learning model. Implemented using Scikit-learn API glossary, using Bayesian optimization and genetic algorithms for automated machine learning. Inspired by [Fast.ai](https://github.com/fastai/fastai).

automl autosklearn bayesian-optimization generation-algorithms hyperparameter-optimization machine-learning scikit-learn

Last synced: 14 Oct 2024

https://github.com/nossbigg/fyp_py

Source code for A Study on Rumour Detection on Online Social Networks final year research project

machine-learning nltk python scikit-learn sentiment-analysis twitter

Last synced: 24 Oct 2024

https://github.com/mit-lcp/shakespeare-method

The Shakespeare-Method repository contains the code we used to develop a new method to identify attributed and unattributed potential adverse events using the unstructured notes portion of electronic health records.

adverse-events ehr ehr-notes epidemiology feature-selection latent-dirichlet-allocation medical mimic-iii postgres python scikit-learn topic-modeling

Last synced: 05 Nov 2024

https://github.com/manojkarthick/pcregression

Python package to build principal components regression model using the scikit-learn library.

machine-learning python scikit-learn

Last synced: 13 Oct 2024

https://github.com/burhanuday/ml-dl-algorithms

Repository containing notes, cheatsheets, datasets and usage of different ML and DL algorithms and libraries. These files can be used as base templates for your next project

deep-learning deeplearning keras machine-learning machinelearning neural-networks scikit-learn

Last synced: 29 Oct 2024

https://github.com/veb-101/machine-learning-algorithms

One notebook to learn it all - Algorithms from scratch

matplotlib numpy pandas python-3-7 scikit-learn scipy seaborn

Last synced: 05 Nov 2024

https://github.com/prakharchoudhary/mlworld

A collection of simple machine learning projects, that got me started in this wonderful domain!

classification clustering iris-dataset keras-neural-networks knn machine-learning neural-networks numpy pandas python scikit-learn

Last synced: 12 Oct 2024

https://github.com/integeralex/netflix-recommendation-system

This Netflix Recommendation System is a web application developed using Node.js and Express. It utilizes a recommendation engine written in Python

ai collaborate docker express netflix nodejs pandas recommendation-system scikit-learn

Last synced: 18 Oct 2024

https://github.com/dayyass/extended-naive-bayes

[WIP] Extension of sklearn Naive Bayes models that allows sampling and more feature distributions.

data-science distributions generative-model machine-learning naive-bayes python sampling scikit-learn

Last synced: 14 Oct 2024

https://github.com/omegaml/dashserve

develop and serve Plotly Dash apps in Jupyter Notebook or JupyterLab

data-science plotly plotly-dash scikit-learn

Last synced: 10 Oct 2024