Ecosyste.ms: Awesome

An open API service indexing awesome lists of open source software.

Awesome Lists | Featured Topics | Projects

scikit-learn

scikit-learn is a widely-used Python module for classic machine learning. It is built on top of SciPy.

https://github.com/qiancao/hskl

A library for hyperspectral image analysis using scikit-learn.

hyperspectral image-analysis-toolbox machine-learning scikit-learn

Last synced: 14 Oct 2024

https://github.com/sayakpaul/dockerml

Contains my explorations of using Docker to automate ML workflows.

ci-cd docker scikit-learn tensorflow wandb

Last synced: 23 Oct 2024

https://github.com/ccrvlh/autoclass

Script that automatically classifies your bank statement into pre-determined labels.

banking banking-applications machine-learning personal-finance python scikit-learn

Last synced: 13 Oct 2024

https://github.com/alexioannides/bodywork-mlops-demo

Demonstrating how Bodywork can be used to deploy a simulation of the lifecycle of a train-and-serve ML pipeline, responding to new data undergoing concept drift.

aws data-science docker kubernetes machine-learning mlops numpy python scikit-learn

Last synced: 12 Oct 2024

https://github.com/bnediction/scboolseq

scBoolSeq: scRNA-Seq data binarisation and synthetic generation from Boolean dynamics

bioinformatics boolean-networks computational-biology machine-learning pandas python3 scikit-learn scrna-seq single-cell-rna-seq

Last synced: 12 Oct 2024

https://github.com/contextlab/data-wrangler

Wrangle messy numerical, image, and text data into consistent well-organized formats

data data-analysis data-science data-wrangling hugging-face image-data machine-learning nlp numpy pandas python scikit-learn

Last synced: 12 Oct 2024

https://github.com/sshh12/csgo-market-analysis

Some interesting stats from the CS:GO Steam community market.

csgo csgo-skins scikit-learn steam-market

Last synced: 27 Oct 2024

https://github.com/isala404/r2d2

Line Following Robot Powered By OpenCV and Machine Learing

ai linefollower machine-learing numpy opencv python3 rasberrypi scikit-learn tensorflow

Last synced: 15 Oct 2024

https://github.com/lai-bluejay/diego

Diego: Data in, IntElliGence Out. A fast framework that supports the rapid construction of automated learning tasks. Simply create an automated learning study (Study) and generate correlated trials (Trial). Then run the code and get a machine learning model. Implemented using Scikit-learn API glossary, using Bayesian optimization and genetic algorithms for automated machine learning. Inspired by [Fast.ai](https://github.com/fastai/fastai).

automl autosklearn bayesian-optimization generation-algorithms hyperparameter-optimization machine-learning scikit-learn

Last synced: 14 Oct 2024

https://github.com/tlapusan/woodpecker

A python library used for tree structure interpretation.

decision-trees machine-learning random-forest scikit-learn sklearn visualization

Last synced: 24 Oct 2024

https://github.com/gbolmier/sklearn-neighbors-benchmark

:bar_chart: Scikit-learn nearest neighbors algorithms benchmark

benchmark nearest-neighbors-algorithms scikit-learn

Last synced: 13 Oct 2024

https://github.com/octo-technology/ddui

Airflow's plugin for Data Science pipeline visualisation

airflow airflow-plugin datadriver datascience ml pandas-python scikit-learn

Last synced: 10 Oct 2024

https://github.com/chrislemke/sk-transformers

A collection of pandas & scikit-learn compatible transformers for preprocessing and feature engineering 🛠

data-science feature-engineering feature-selection machine-learning pandas preprocessing python scikit-learn scikit-learn-pipelines scikit-learn-transformer

Last synced: 13 Oct 2024

https://github.com/nossbigg/fyp_py

Source code for A Study on Rumour Detection on Online Social Networks final year research project

machine-learning nltk python scikit-learn sentiment-analysis twitter

Last synced: 24 Oct 2024

https://github.com/mit-lcp/shakespeare-method

The Shakespeare-Method repository contains the code we used to develop a new method to identify attributed and unattributed potential adverse events using the unstructured notes portion of electronic health records.

adverse-events ehr ehr-notes epidemiology feature-selection latent-dirichlet-allocation medical mimic-iii postgres python scikit-learn topic-modeling

Last synced: 05 Nov 2024

https://github.com/herrfeder/ai_cybersecurity_ids_poc

Winning Contribution of Michael Schwabe and David Lassig to BWI Data Analytics Hackathon 2020 in the Category Cyber Security. Proof of Concept Intrusion Detection using Zeek with selfmade MachineLearning in a nice WebApp.

circleci cloudformation cyber-security dash docker-container intrusion-detection keras kubernetes machine-learning plotly python scikit-learn tensorflow zeek

Last synced: 14 Oct 2024

https://github.com/dhaitz/python-sklearn-firefly-docker-heroku

Build a hate speech detection system with scikit-learn and deploy it via Docker on Heroku.

deployment docker firefly heroku machine-learning python scikit-learn

Last synced: 11 Nov 2024

https://github.com/techytushar/would-you-have-survived

Predict if you would have survived if you were on the Titanic Ship

flask machine-learning python scikit-learn titanic-kaggle

Last synced: 10 Nov 2024

https://github.com/veb-101/machine-learning-algorithms

One notebook to learn it all - Algorithms from scratch

matplotlib numpy pandas python-3-7 scikit-learn scipy seaborn

Last synced: 05 Nov 2024

https://github.com/prakharchoudhary/mlworld

A collection of simple machine learning projects, that got me started in this wonderful domain!

classification clustering iris-dataset keras-neural-networks knn machine-learning neural-networks numpy pandas python scikit-learn

Last synced: 12 Oct 2024

https://github.com/dayyass/extended-naive-bayes

[WIP] Extension of sklearn Naive Bayes models that allows sampling and more feature distributions.

data-science distributions generative-model machine-learning naive-bayes python sampling scikit-learn

Last synced: 14 Oct 2024

https://github.com/burhanuday/ml-dl-algorithms

Repository containing notes, cheatsheets, datasets and usage of different ML and DL algorithms and libraries. These files can be used as base templates for your next project

deep-learning deeplearning keras machine-learning machinelearning neural-networks scikit-learn

Last synced: 29 Oct 2024

https://github.com/gentaiscool/pmf

Probabilistic Matrix Factorization on MovieLens 100K

matrix-factorization movielens pmf scikit-learn sklearn sparse

Last synced: 08 Nov 2024

https://github.com/manojkarthick/pcregression

Python package to build principal components regression model using the scikit-learn library.

machine-learning python scikit-learn

Last synced: 13 Oct 2024

https://github.com/omegaml/dashserve

develop and serve Plotly Dash apps in Jupyter Notebook or JupyterLab

data-science plotly plotly-dash scikit-learn

Last synced: 10 Oct 2024

https://github.com/integeralex/netflix-recommendation-system

This Netflix Recommendation System is a web application developed using Node.js and Express. It utilizes a recommendation engine written in Python

ai collaborate docker express netflix nodejs pandas recommendation-system scikit-learn

Last synced: 18 Oct 2024

https://github.com/oneapi-src/network-intrusion-detection

AI Starter Kit for Network Intrusion Detection using Intel® Extension for Scikit-learn*

machine-learning scikit-learn

Last synced: 05 Nov 2024

https://github.com/x-tabdeveloping/rvfln

A Python implementation of random vector functional networks and broad learning systems using Sklearn's Regressor and classifier APIs

broad-learning data-science deep-learning machine-learning scikit-learn sklearn sklearn-compatible

Last synced: 28 Oct 2024

https://github.com/learnables/torchml

Scikit-learn implemented with PyTorch

machine-learning pytorch scikit-learn

Last synced: 15 Oct 2024

https://github.com/deadly-panda/clinicapp

Django web application to help a medical clinic manage their appointments and assits doctors in their diagnosisis

django keras python scikit-learn

Last synced: 12 Nov 2024

https://github.com/malykhin/anomaly_detection

Anomaly detection in NAB dataset using mxnet and scikit-learn

anomaly-detection mxnet nab scikit-learn timeseries

Last synced: 24 Oct 2024

https://github.com/miolab/jupyterlab_poetry

JupyterLab runtime environment with Poetry and Docker management.

docker jupyterlab nltk poetry python3 pytorch scikit-learn scipy shap xgboost

Last synced: 30 Oct 2024

https://github.com/gentaiscool/scikit-learn-examples

Exploration on Logistic Regression, MLP, and SVM using Scikit-learn

logistic-regression mlp scikit-learn sklearn svm

Last synced: 08 Nov 2024

https://github.com/feedzai/feedzai-openml-python

Python-based Feedzai OpenML Providers

feedzai openml python scikit scikit-learn

Last synced: 10 Oct 2024

https://github.com/shobrook/neuropipe

Easy scaffolding for machine learning pipelines in Scikit-Learn

framework machine-learning microframework pipeline scaffolding scikit-learn template

Last synced: 11 Oct 2024

https://github.com/coelhosilva/flight-ad

flight-ad is a Python package for anomaly detection in the aviation domain built on top of scikit-learn.

anomaly-detection data-science fdm flight-data flight-data-analysis flight-data-monitoring machine-learning python scikit-learn

Last synced: 12 Oct 2024

https://github.com/lostintangent/intro-to-ml

🎓 Simple to use notebooks, based on the scikit-Learn and matplotlib tutorials

machine-learning scikit-learn tutorial

Last synced: 16 Oct 2024

https://github.com/codelionx/pyonlinesvr

Python-Wrapper for Francesco Parrella's OnlineSVR C++ implementation with scikit-learn-compatible interface.

cpp machine-learning numpy online onlinesvr python python3 scikit-learn streaming-algorithms svr

Last synced: 11 Oct 2024

https://github.com/neptune-ai/neptune-sklearn

Experiment tracking for scikit-learn. 🧩 Log, organize, visualize and compare model metrics, parameters, dataset versions, and more.

dashboard logger management metadata mlops optimization organize scikit-learn sklearn store tracker training versioning

Last synced: 11 Oct 2024

https://github.com/syamkakarla98/kmeans-clustering

Introduction and implementation of K-Means clustering.

matplotlib pyhton3 scikit-learn

Last synced: 13 Nov 2024

https://github.com/kevinarce/scikit-learn

Machine learning project based on sklearn library.

educative github gitkraken learn machine-learning python scikit-learn student-vscode

Last synced: 05 Nov 2024

https://github.com/sayakpaul/manning-phishing-websites-detection

Starter repository for Manning LP: Use Machine Learning to Detect Phishing Websites

beginners machine-learning phishing-sites scikit-learn wandb

Last synced: 24 Oct 2024

https://github.com/yash22222/ibm-csrbox-internship-project

The objective of the Data Analytics internship at CSRBOX is to provide interns with hands-on experience in applying data analytics techniques to real-world projects in the field of corporate social responsibility (CSR). Interns will gain practical skills in data collection, cleaning, analysis, visualization, and reporting, while working on projects

data-mining data-preprocessing data-science exploratory-data-analysis feature-engineering lemmatization machine-learning pandas pos-tagging random-forest random-forest-classifier scikit-learn sentiment-analysis web-scraping wordcloud

Last synced: 09 Nov 2024

https://github.com/khaledsharif/sklearn2docker

Convert your trained scikit-learn classifier to a Docker container with a pre-configured API

api classification docker docker-container machine-learning scikit-learn

Last synced: 15 Oct 2024

https://github.com/crflynn/sklearn-instrumentation

Generalized scikit-learn machine learning model instrumentation library

instrumentation machine-learning scikit-learn

Last synced: 08 Nov 2024

https://github.com/joaopfonseca/ml-research

A Python library with utilities for Machine Learning research and algorithm implementations

active-learning data-science machine-learning python scikit-learn

Last synced: 11 Oct 2024

https://github.com/yaricom/english-article-correction

The experiment with applying NLP to correction of definite/indefinite articles in English text corpus

data-analysis glove-vectors nlp nlp-machine-learning numpy pandas scikit-learn umbc-webbase-corpus

Last synced: 05 Nov 2024

https://github.com/ngupta23/more

This is a helper package for pandas, visualizations and scikit-learn

data-science helpers pandas python scikit-learn visualization visualizations

Last synced: 10 Oct 2024

https://github.com/kulia/optical-character-recognition

Approach using k-Nearest Neighbors and Support Vector Machines

machine-learning matlab ntnu ocr python scikit-learn

Last synced: 05 Nov 2024

https://github.com/melikaas/lda_classification

This repository focuses on implementing Linear Discriminant Analysis (LDA) as a classifier on embedded scikit-learn datasets. The goal is to improve the model's performance on these datasets.

classification machine-learning-algorithms scatterplot scikit-learn

Last synced: 10 Oct 2024

https://github.com/hmiladhia/piskle

A serialization package optimized for scikit-learn

data-science machine-learning python scikit-learn serialization

Last synced: 11 Oct 2024

https://github.com/vuthanhhai2302/apply-machine-learning-on-data-analytics

My project of applied machine learning on data analytics, using pandas, numpy and scikit-learn to analyze data

data-analysis numpy pandas scikit-learn

Last synced: 11 Nov 2024

https://github.com/g0bel1n/tinyautoml

TinyAutoML is a comprehensive Pipeline Classifier Project thought as a Scikit-learn plugin

automl-pipeline machine-learning scikit-learn

Last synced: 13 Oct 2024

https://github.com/manasvigoyal/enron-classification

Exploratory Analysis of Enron Dataset and Classification using multiple algorithms

classification email-classifier enron-dataset jupyter-notebook machine-learning numpy pandas python scikit-learn

Last synced: 13 Nov 2024

https://github.com/alexeyev/hse-spb-bigdata-python-fall2016

Материалы к курсу по программированию и инструментам анализа данных, прочитанному в петербургском филиале НИУ ВШЭ осенью 2016 года

course-materials data-analysis numpy pandas python scikit-learn sklearn

Last synced: 11 Nov 2024

https://github.com/metriccoders/ai-studio

AI Studio by Metric Coders: A No-Code Software to train, download and deploy Large Language Models.

ai-studio artificial-intelligence generative-ai keras langchain large-language-models llm machine-learning natural-language-processing nlp python python3 scikit-learn tensorflow

Last synced: 27 Oct 2024

https://github.com/chris-santiago/steps

A SciKit-Learn style feature selector using best subsets and stepwise regression.

best-subset-selection data-science python scikit-learn stepwise-selection

Last synced: 27 Oct 2024

https://github.com/rubydamodar/loan-approval-prediction-

Loan approval prediction is a popular machine learning project, especially in the banking and finance industry. The goal of this project is to build a predictive model that can determine whether a loan application will be approved or not based on the applicant's information such as income, credit history, and loan amount.

ai-in-finance banking classification classification-internal credit-risk data-science exploratory-data-analysis feature-engineering financial-analytics loan-approval machine-learning matplotlib pandas predictive-modeling python scikit-learn seaborn visualization

Last synced: 31 Oct 2024

https://github.com/learninghouseservice/learninghouse

The LearningHouse Service provides machine learning algorithms based on the scikit-learn python library as a RESTful API. Its purpose is to offer smart home enthusiasts an easy way to teach their homes.

angular fastapi rest-api scikit-learn smarthome smarthome-api smarthome-app

Last synced: 12 Nov 2024

https://github.com/srinivasrm/mutual-funds-analysis-and-prediction

In this project I have performed analysis and prediction on 1,3,and 5 year returns on 1064 mutual funds in India. I have scraped data from a website which is the most visited website for mutual fund investments.I have tested regression models linear model,SGD Regressor , Random Forest Regressor,Decision Tree Regressor,Ridge,MLP Regressor and linear model (Lasso).After which I have selected the best perorming model and performed Hyper parameter tuning and then deployed an interactive application which can generate the visualization and send an email with the visualization to the users email address.

beautifulsoup data-analysis data-base data-cleaning data-science deployment etl finanace frontend funds machine-learning mutual mutual-funds pgsql python scikit-learn sql streamlit web webapplication

Last synced: 11 Oct 2024

https://github.com/aianytime/early-stage-diabetes-risk-prediction

Early stage diabetes risk prediction using several supervised machine learning algorithms.

github machine-learning machine-learning-algorithms python scikit-learn

Last synced: 07 Nov 2024

https://github.com/aianytime/machine-learning-models-implementation

Implementation of several ML models on real-world datasets with detailed explanation in notebooks.

eda machine-learning machine-learning-algorithms ml numpy pandas pycaret python scikit-learn scikitlearn-machine-learning sklearn

Last synced: 07 Nov 2024

https://github.com/adityashrm21/adult-income-prediction

A end-to-end data analysis pipeline including model deployment

data-science eda flask heroku logistic-regression r scikit-learn tidyverse

Last synced: 11 Nov 2024

https://github.com/evilpegasus/apstats

A statistical analysis of WA state teacher salaries and SAT scores

data data-science excel matplotlib numpy pandas python scikit-learn statistics

Last synced: 11 Nov 2024

https://github.com/deepmancer/tweet-disaster-detection

fine-tuned BERT and scikit-learn models for real-time classification of disaster-related tweets, using TensorFlow, Keras, and Transformers. .

bert bert-fine-tuning classification fine-tuning huggingface-transformers keras keras-tensorflow natural-language-processing nlp scikit-learn tensorflow tensorflow2 tokenizer transformers

Last synced: 02 Nov 2024

https://github.com/katharineshapcott/rank-similarity

Rank Similarity is a set of nonlinear classification and transform tools for large datasets.

classification machine-learning nonlinear scikit-learn transformer

Last synced: 10 Oct 2024

https://github.com/bkamapantula/discover-workshop

Code search utility to assist developer workflows via code discovery. Currently uses tf-idf estimator.

developer-tools pycon python scikit-learn tf-idf

Last synced: 16 Oct 2024

https://github.com/bjpcjp/python-data-science-handbook

Sourced from O'Reilly ebook of the same name.

matplotlib numpy pandas python scikit-learn seaborn

Last synced: 09 Nov 2024

https://github.com/ashwin-rajeev/face-recognition

A simple face recognition system using scikit-learn and OpenCV

face-detection face-recognition machine-learning python scikit-learn

Last synced: 09 Nov 2024

https://github.com/eonu/daze

Better multi-class confusion matrix plots for Scikit-Learn, incorporating per-class and overall evaluation measures.

accuracy classification confusion-matrix confusion-matrix-heatmap evaluation evaluation-metrics f1-score measures multi-class plot precision recall scikit-learn

Last synced: 10 Oct 2024

https://github.com/jrbourbeau/madpy-ml-sklearn-2018

MadPy supervised machine learning talk

machine-learning python scikit-learn

Last synced: 28 Oct 2024

https://github.com/chinaskidev/ml-prediccion-lluvia-brazil

MLOps, usando Docker,Airflow,tensorflow,streamlit

python3 scikit-learn streamlit tensorflow

Last synced: 10 Nov 2024

https://github.com/alexandregazagnes/scikit-transformers

Very usefull package to enable and provide custom transformers such as LogColumnTransformer, BoolColumnTransformers and others fancy transformers.

data data-science log python scikit-learn transformer

Last synced: 11 Oct 2024