Ecosyste.ms: Awesome

An open API service indexing awesome lists of open source software.

Awesome Lists | Featured Topics | Projects

scikit-learn

scikit-learn is a widely-used Python module for classic machine learning. It is built on top of SciPy.

https://github.com/gauravpandeylab/eipy

Ensemble Integration: a customizable pipeline for generating multi-modal, heterogeneous ensembles

classification ensemble interpretation machine-learning multimodal nested-cross-validation predictive-modeling scikit-learn

Last synced: 10 Oct 2024

https://github.com/mapbox/gabbar

Guarding OpenStreetMap from harmful edits using machine learning

banished jupyter-notebook machine-learning openstreetmap scikit-learn vandalism

Last synced: 14 Oct 2024

https://github.com/orestis-z/facial-beauty-predictor

Deep learning model to predict a beauty score for faces in images. Outperforms the state-of-the-art by up to 18% (2019).

computer-vision deep-learning deep-neural-networks facial-beauty-prediction scikit-learn tensorflow

Last synced: 28 Oct 2024

https://github.com/pateash/kisanmitra

Crop Yield Prediction Web App Built using Sklearn and Laravel Web Framework

crop-yeild-prediction farmers laravel machinelearning scikit-learn

Last synced: 13 Oct 2024

https://github.com/scikit-multilearn-ng/scikit-multilearn-ng

A new maintained "successor" to scikit-multilearn, a scikit-learn based module for multi-label et. al. classification

classification clustering label-prediction machine-learning multi-label multi-label-classification partitioning scikit scikit-learn scikit-multilearn

Last synced: 13 Oct 2024

https://github.com/dunnkers/fseval

Benchmarking framework for Feature Selection and Feature Ranking algorithms 🚀

automl benchmarking benchmarking-framework benchmarks feature-rankers feature-ranking feature-selection hydra machine-learning python scikit-learn wandb

Last synced: 07 Nov 2024

https://github.com/jakoch/jupyter-devbox

A Docker DevBox for Jupyter Notebook's with a focus on Computer Vision, Machine Learning, Finance, Statistics and Visualization.

debian docker imutils jupyter-notebook keras opencv pandas python3 scikit-learn scipy seaborn tensorflow

Last synced: 28 Oct 2024

https://github.com/curiousily/Reproducible-ML-with-DVC

Tutorial on experiment tracking and reproducibility for Machine Learning projects with DVC

deep-learning dvc experiment-tracking linear-regression machine-learning metrics python random-forest reproducibility scikit-learn tracking

Last synced: 13 Aug 2024

https://github.com/fbruzzesi/sklearn-smithy

Toolkit to forge scikit-learn compatible estimators

cli data-science machine-learning python scikit-learn webui

Last synced: 10 Oct 2024

https://github.com/lucaangioloni/fit-covid19

Easy model to fit logistic curve to COVID19 data from Italy. Demo: https://fit-covid19.herokuapp.com

contagi-giornalieri coronavirus covid-19 covid-virus covid19 demo forecasting italy logistic prediction python regression scikit-learn totale-contagi

Last synced: 20 Oct 2024

https://github.com/bgu-cs-vil/pdc-dp-means

"Revisiting DP-Means: Fast Scalable Algorithms via Parallelism and Delayed Cluster Creation" [Dinari and Freifeld, UAI 2022]

clustering dpmeans kmeans machine-learning minibatch scikit-learn

Last synced: 13 Oct 2024

https://github.com/karelze/tclf

A scikit-learn compatible classifier to perform trade classification in Python.

empirical finance microstructure python rule-based-classifier scikit-learn trade-classification

Last synced: 11 Oct 2024

https://github.com/fabianp/mash_2016_sklearn_intro

Material for the MASH course on introduction to scikit-learn

machine-learning notebooks scikit-learn tutorial

Last synced: 16 Oct 2024

https://github.com/KarelZe/tclf

A scikit-learn compatible classifier to perform trade classification in Python.

empirical finance microstructure python rule-based-classifier scikit-learn trade-classification

Last synced: 17 Aug 2024

https://github.com/balins/fuzzytree

A Fuzzy Decision Tree implementation for Python.

classification decision-trees fuzzy-logic machine-learning scikit-learn

Last synced: 08 Nov 2024

https://github.com/scikit-learn-contrib/sklearn-ann

Integration with (approximate) nearest neighbors libraries for scikit-learn + clustering based on with kNN-graphs.

approximate-nearest-neighbor-search clustering knn knn-graphs scikit-learn

Last synced: 06 Nov 2024

https://github.com/rclement/datasette-ml

A Datasette plugin providing an MLOps platform to train, eval and predict machine learning models

ai datasette datasette-plugin machine-learning mlops python scikit-learn sql sqlite

Last synced: 11 Oct 2024

https://github.com/manasvigoyal/gmail-classification

Extract Emails from Gmail account, convert to Excel file and classify using various classification algorithms.

beautifulsoup classification email-classification excel gmail jupyter-notebooks machine-learning matplotlib numpy pandas python scikit-learn seaborn

Last synced: 11 Oct 2024

https://github.com/soda-inria/sklearn-numba-dpex

Experimental plugin for scikit-learn to be able to run (some estimators) on Intel GPUs via numba-dpex.

gpu intel numba-dpex scikit-learn

Last synced: 31 Oct 2024

https://github.com/godfanmiao/pyai-github-2024

《 Python人工智能编程实践(2024年度版)》全书数据和开源代码【已出版】

git paddlepaddle pandas pyspark python3 pytorch scikit-learn tensorflow

Last synced: 08 Nov 2024

https://github.com/ashishpatel26/MLflow_End_to_End_Example

MLflow is Open source platform for the machine learning lifecycle so here you can learn MLflow End to End Example with Prediction.

mlflow mlflow-example mlflow-tracking scikit-learn xgboost

Last synced: 03 Aug 2024

https://github.com/younader/dnnr

The Python package of differential nearest neighbors regression (DNNR): Raising KNN-regression to levels of gradient boosting method. Build on-top of Numpy, Scikit-Learn, and Annoy.

annoy knn machine-learning machine-learning-algorithms numpy python3 scikit-learn tabular-data

Last synced: 08 Nov 2024

https://github.com/ashishpatel26/sklearn-ranking

Sklearn-ranking is ranking algorithm used for recommendation system algorithm. RANKSVM, RANKBOOST, RANKNET is included in this package

ranking ranking-algorithm ranking-methods scikit-learn sklearn sklearn-ranking

Last synced: 14 Oct 2024

https://github.com/aleximb/automl-streams

AutoML framework for implementing automated machine learning on data streams

automl data-streams scikit-learn

Last synced: 27 Oct 2024

https://github.com/zeke/github-avatars

A machine learning model to detect whether a GitHub user has a custom or default avatar

cog machine-learning replicate scikit-learn

Last synced: 09 Nov 2024

https://github.com/huangcongqing/sklearn

scikit-learn (sklearn) 常用的机器学习,包括回归(Regression)、降维(Dimensionality Reduction)、分类(Classfication)、聚类(Clustering)等方法

scikit-learn sklearn

Last synced: 24 Oct 2024

https://github.com/sayakpaul/generating-categories-from-arxiv-paper-titles

This project takes the arXiv dataset and builds an automatic tag classifier from the arXiv article/paper titles

arxiv deep-learning gcp natural-language-processing scikit-learn tensorflow wandb

Last synced: 23 Oct 2024

https://github.com/snehankekre/streamlit-yellowbrick

Streamlit component for the Yellowbrick visualization and model diagnostics library

data-visualization machine-learning matplotlib python scikit-learn streamlit visualization yellowbrick

Last synced: 10 Oct 2024

https://github.com/san089/big_data_project

Fake News Detection - Feature Extraction using Vectorization such as Count Vectorizer, TFIDF Vectorizer, Hash Vectorizer,. Then used an Ensemble model to classify whether the news is fake or not.

classifiers ensemble-model fakenewsdetection machine-learning news-classification scikit-learn text-mining textclassification vectorization vectorizers

Last synced: 12 Oct 2024

https://github.com/genfifth/cvopt

Machine learning's parameter search and feature selection module which is integrated log management and visualization.

bayesian-optimization deep-learning feature-selection hyperopt hyperparameter-optimization integrated-visualization keras logmanagement machine-learning python scikit-learn

Last synced: 13 Oct 2024

https://github.com/hbldh/skboost

MILBoost and other boosting algorithms, compatible with scikit-learn

boosting boosting-algorithms machine-learning scikit-learn

Last synced: 15 Oct 2024

https://github.com/rohankishore/graphyte

A simple, and responsive math graphing app powered by PyQt6, NumPy and Matplotlib. Create and analyze functions with ease.

equation-solver geogebra graph graph-algorithms matplotlib numpy pyqt6 python scikit-learn visualization

Last synced: 27 Oct 2024

https://github.com/chandraprakash-bathula/apparel-recommendations

This project implements a personalized apparel recommendation engine using content-based search with the Amazon API, NLTK, and Keras libraries.

boxplot cnn-keras data-analysis data-science deep-learning linear-regression machine-learning numpy pandas scatter-plot scikit-learn svm tensorflow xgboost

Last synced: 28 Oct 2024

https://github.com/snoop2head/instagram_hashtag_analysis

📷 Crawl and Analyze Instagram Hashtag Data: KoNLPY to gensim word2Vec & scikit-learn TF-IDF

adjective gensim gensim-word2vec instagram-hashtag-analysis konlpy natural-language-processing noun scikit-learn scikitlearn tf-idf word2vec

Last synced: 04 Nov 2024

https://github.com/kenlimmj/fightin-words

A scikit-learn compliant implementation of Monroe et al.'s Fightin' Words analysis method.

bayesian-methods evaluation-metrics nlp scikit-learn

Last synced: 12 Oct 2024

https://github.com/esvs2202/concrete-compressive-strength-prediction

The aim of this project is to develop a solution using Data science and machine learning to predict the compressive strength of a concrete with respect to the its age and the quantity of ingredients used.

anaconda data-visualization flask gunicorn-web-server heroku-deployment html5 joblib jupyter-notebook machine-learning-algorithms matplotlib-pyplot numpy pandas pycharm-ide python3 randomizedsearchcv scikit-learn seaborn statsmodels xgboost-regression

Last synced: 09 Oct 2024

https://github.com/axegon/sklite

Transpile scikit-learn models to Flutter

fluttter python36 scikit-learn

Last synced: 11 Oct 2024

https://github.com/ozguraslank/flexml

Easy-to-use and flexible AutoML library for Python

automl data-science machine-learning python scikit-learn

Last synced: 10 Oct 2024

https://github.com/mohd-faizy/career-track-data-scientist-with-python

This Repo contains tools that allow us to import, clean, manipulate, and visualize data —Includes Python libraries, like pandas, NumPy, Matplotlib, and many more to work with real-world datasets to learn the statistical and machine learning techniques.

data-science data-visualization datascience-machinelearning datasciencecoursera datascientist datascientisttraining decision-trees hypothesis hypothesis-testing machine-learning machine-learning-algorithms nlp-machine-learning numpy pandas python scikit-learn seaborn statistical

Last synced: 28 Oct 2024

https://github.com/yu9824/kennard_stone

This is an algorithm for evenly partitioning.

kfold-cross-validation python scikit-learn train-test-split

Last synced: 27 Oct 2024

https://github.com/starkblaze01/sentiment-analyzer

Model for Sentiment Analysis using Naive Bayes and CNN, and implementation of Model on Tweets and Web Application using React

flask-server keras-tensorflow python reactjs scikit-learn sentiment-analysis sklearn tweepy typescript

Last synced: 27 Oct 2024

https://github.com/akoury/ml-helper

Python library with helpers to speed up and structure machine learning projects.

data data-visualization machine-learning ml python scikit-learn sklearn

Last synced: 10 Oct 2024

https://github.com/r-m-n/sklearn-deltatfidf

DeltaTfidfVectorizer for scikit-learn

delta-tf-idf python scikit-learn sentiment-analysis sklearn tf-idf

Last synced: 14 Oct 2024

https://github.com/centre-for-humanities-computing/stormtrooper

Zero/few shot learning components for scikit-learn pipelines with LLMs and transformers.

chatgpt few-shot-learning gpt-4 large-language-models llm scikit-learn transformer transformers zero-shot-learning

Last synced: 10 Oct 2024

https://github.com/jeffzi/pandas-select

Supercharged pandas indexing

pandas pandera python scikit-learn

Last synced: 08 Nov 2024

https://github.com/alexioannides/lime-interpretable-ml

An example of how the LIME algorithm can be used to provide real-world insight into the decision processes of a 'black-box' machine learning algorithm - in this case a Radom Forest regressor.

data-science interpretability lime machine-learning numpy pandas pydata python scikit-learn

Last synced: 12 Oct 2024

https://github.com/bnediction/scboolseq

scBoolSeq: scRNA-Seq data binarisation and synthetic generation from Boolean dynamics

bioinformatics boolean-networks computational-biology machine-learning pandas python3 scikit-learn scrna-seq single-cell-rna-seq

Last synced: 12 Oct 2024

https://github.com/isala404/r2d2

Line Following Robot Powered By OpenCV and Machine Learing

ai linefollower machine-learing numpy opencv python3 rasberrypi scikit-learn tensorflow

Last synced: 15 Oct 2024

https://github.com/sayakpaul/dockerml

Contains my explorations of using Docker to automate ML workflows.

ci-cd docker scikit-learn tensorflow wandb

Last synced: 23 Oct 2024

https://github.com/ccrvlh/autoclass

Script that automatically classifies your bank statement into pre-determined labels.

banking banking-applications machine-learning personal-finance python scikit-learn

Last synced: 13 Oct 2024

https://github.com/contextlab/data-wrangler

Wrangle messy numerical, image, and text data into consistent well-organized formats

data data-analysis data-science data-wrangling hugging-face image-data machine-learning nlp numpy pandas python scikit-learn

Last synced: 12 Oct 2024

https://github.com/sshh12/csgo-market-analysis

Some interesting stats from the CS:GO Steam community market.

csgo csgo-skins scikit-learn steam-market

Last synced: 27 Oct 2024

https://github.com/qiancao/hskl

A library for hyperspectral image analysis using scikit-learn.

hyperspectral image-analysis-toolbox machine-learning scikit-learn

Last synced: 14 Oct 2024

https://github.com/alexioannides/bodywork-mlops-demo

Demonstrating how Bodywork can be used to deploy a simulation of the lifecycle of a train-and-serve ML pipeline, responding to new data undergoing concept drift.

aws data-science docker kubernetes machine-learning mlops numpy python scikit-learn

Last synced: 12 Oct 2024

https://github.com/chrislemke/sk-transformers

A collection of pandas & scikit-learn compatible transformers for preprocessing and feature engineering 🛠

data-science feature-engineering feature-selection machine-learning pandas preprocessing python scikit-learn scikit-learn-pipelines scikit-learn-transformer

Last synced: 13 Oct 2024

https://github.com/gbolmier/sklearn-neighbors-benchmark

:bar_chart: Scikit-learn nearest neighbors algorithms benchmark

benchmark nearest-neighbors-algorithms scikit-learn

Last synced: 13 Oct 2024

https://github.com/tlapusan/woodpecker

A python library used for tree structure interpretation.

decision-trees machine-learning random-forest scikit-learn sklearn visualization

Last synced: 24 Oct 2024

https://github.com/octo-technology/ddui

Airflow's plugin for Data Science pipeline visualisation

airflow airflow-plugin datadriver datascience ml pandas-python scikit-learn

Last synced: 10 Oct 2024

https://github.com/lai-bluejay/diego

Diego: Data in, IntElliGence Out. A fast framework that supports the rapid construction of automated learning tasks. Simply create an automated learning study (Study) and generate correlated trials (Trial). Then run the code and get a machine learning model. Implemented using Scikit-learn API glossary, using Bayesian optimization and genetic algorithms for automated machine learning. Inspired by [Fast.ai](https://github.com/fastai/fastai).

automl autosklearn bayesian-optimization generation-algorithms hyperparameter-optimization machine-learning scikit-learn

Last synced: 14 Oct 2024

https://github.com/nossbigg/fyp_py

Source code for A Study on Rumour Detection on Online Social Networks final year research project

machine-learning nltk python scikit-learn sentiment-analysis twitter

Last synced: 24 Oct 2024

https://github.com/mit-lcp/shakespeare-method

The Shakespeare-Method repository contains the code we used to develop a new method to identify attributed and unattributed potential adverse events using the unstructured notes portion of electronic health records.

adverse-events ehr ehr-notes epidemiology feature-selection latent-dirichlet-allocation medical mimic-iii postgres python scikit-learn topic-modeling

Last synced: 05 Nov 2024

https://github.com/manojkarthick/pcregression

Python package to build principal components regression model using the scikit-learn library.

machine-learning python scikit-learn

Last synced: 13 Oct 2024

https://github.com/herrfeder/ai_cybersecurity_ids_poc

Winning Contribution of Michael Schwabe and David Lassig to BWI Data Analytics Hackathon 2020 in the Category Cyber Security. Proof of Concept Intrusion Detection using Zeek with selfmade MachineLearning in a nice WebApp.

circleci cloudformation cyber-security dash docker-container intrusion-detection keras kubernetes machine-learning plotly python scikit-learn tensorflow zeek

Last synced: 14 Oct 2024

https://github.com/burhanuday/ml-dl-algorithms

Repository containing notes, cheatsheets, datasets and usage of different ML and DL algorithms and libraries. These files can be used as base templates for your next project

deep-learning deeplearning keras machine-learning machinelearning neural-networks scikit-learn

Last synced: 29 Oct 2024

https://github.com/veb-101/machine-learning-algorithms

One notebook to learn it all - Algorithms from scratch

matplotlib numpy pandas python-3-7 scikit-learn scipy seaborn

Last synced: 05 Nov 2024

https://github.com/prakharchoudhary/mlworld

A collection of simple machine learning projects, that got me started in this wonderful domain!

classification clustering iris-dataset keras-neural-networks knn machine-learning neural-networks numpy pandas python scikit-learn

Last synced: 12 Oct 2024