Ecosyste.ms: Awesome

An open API service indexing awesome lists of open source software.

Awesome Lists | Featured Topics | Projects

scikit-learn

scikit-learn is a widely-used Python module for classic machine learning. It is built on top of SciPy.

https://github.com/oneapi-src/customer-segmentation

AI Starter Kit for Customer Segmentation for Online Retail using Intel® Extension for Scikit-learn*

machine-learning scikit-learn

Last synced: 05 Nov 2024

https://github.com/34j/sklearn-utilities

Utilities for scikit-learn. Append prediction to x, append prediction to x single, append x prediction to x, compose var estimator, data frame wrapper, drop by noise prediction, drop missing rows y, dummy regressor var, estimator wrapper base, excluded column transformer pandas, feature union pandas, id transformer, included column transformer pand

catboost feature-engine feature-engineering multioutput pandas pca python pytorch regression scikit-learn sklearn sklearn-compatible skorch torch tqdm

Last synced: 22 Oct 2024

https://github.com/oneapi-src/powerline-fault-detection

AI Starter Kit for detect faulty signals in power line voltage using Intel® Extension for Scikit-learn*

machine-learning scikit-learn

Last synced: 05 Nov 2024

https://github.com/oneapi-src/purchase-prediction

AI Starter Kit for Purchase Prediction model using Intel® Extension for Scikit-learn*

machine-learning scikit-learn

Last synced: 05 Nov 2024

https://github.com/manena/sp-sentiment-analysis

Sentiment Analysis in Python trained with Amazon Spain reviews in Spanish

jupyter-notebook machine-learning nltk nltk-library python-3-5 pyton scikit-learn sentiment-analysis

Last synced: 12 Oct 2024

https://github.com/manasvigoyal/enron-classification

Exploratory Analysis of Enron Dataset and Classification using multiple algorithms

classification email-classifier enron-dataset jupyter-notebook machine-learning numpy pandas python scikit-learn

Last synced: 11 Oct 2024

https://github.com/aianytime/recommendation_system_implementation

Complete concepts behind implementing a Recommendation System using Association Rules, Collaborative Filtering, and Matrix Factorization.

collaborative-filtering data-science machine-learning matrix-factorization python recommender-system scikit-learn

Last synced: 07 Nov 2024

https://github.com/rutujar/scikit-learn

Scikit learn is applied for Androsensor (available in google play store) datasets and iris data sets.

jupyter-notebook linear-regression machine-learning python scikit-learn svm

Last synced: 13 Oct 2024

https://github.com/owenodriscoll/automl

Python package for automated hyperparameter-optimization of common machine-learning algorithms

automl catboost classification hyperparameter-optimization lightgbm machine-learning optuna regression scikit-learn xgboost

Last synced: 27 Oct 2024

https://github.com/magnusax/automl

The project aims to develop a customized ML framework on top of existing libraries

data-science machine-learning machine-learning-algorithms machine-learning-library python scikit-learn

Last synced: 14 Oct 2024

https://github.com/gxuravkumar911/tubedigest

Demonstrating expertise in Python and Django, TubeDigest is a robust web application that leverages NLTK and YouTube API for AI-powered video summarization.

artificial-intelligence django machine-learning natural-language-processing nltk python scikit-learn text-analysis tf-idf web-development

Last synced: 10 Oct 2024

https://github.com/lorey/data-intensive-latex-documents

Python framwork for data-intensive LaTeX documents.

automation latex machine-learning scikit-learn sklearn

Last synced: 14 Oct 2024

https://github.com/reekrajroy/selflearning_chatbot

Self learning chatbot using python.

python scikit-learn

Last synced: 06 Nov 2024

https://github.com/spags093/spotify_song_data

Part 1: Analysis of Spotify song data that uses Machine Learning to determine what features make a "hit" song on Spotify.

machine-learning matplotlib music pandas python scikit-learn seaborn shap spotify spotify-api tensorflow

Last synced: 03 Nov 2024

https://github.com/metriccoders/metriccoders_notebooks

This is the Metric Coders repository containing all the notebooks for machine learning.

artificial-intelligence genai keras llm machine-learning natural-language-processing pytorch scikit-learn tensorflow

Last synced: 27 Oct 2024

https://github.com/cyberfantics/bitcoin-price-prediction

A deep learning-based web app for predicting future Bitcoin prices using historical data. Users can interactively select prediction days and view recent price data in real-time.

artificial-intelligence artificial-neural-networks bitcoin deep-learning machine-learning neural-network prediction-model scikit-learn tensorflow

Last synced: 02 Nov 2024

https://github.com/plantaest/feverfew

Comprehensive link checker tool for Wikipedia

aws-lambda caddy java mantine onnx python quarkus react scikit-learn typescript

Last synced: 14 Oct 2024

https://github.com/andreasgrv/tictacs

Machine learning pipeline configuration on steroids (on tictacs actually)

config pipeline reflection scikit-learn

Last synced: 11 Oct 2024

https://github.com/pr38/dask_tfidf

A Dask native implementation of 'Term Frequency Inverse Document Frequency' for dask-ml and scikit-learn

dask dask-ml distributed-computing machine-learning python scikit-learn

Last synced: 27 Oct 2024

https://github.com/thomasthaddeus/dataanalysistoolkit

DataAnalysisToolkit is a Python-based data analysis tool designed to streamline various data analysis tasks. It provides the ability to load data from CSV files, perform statistical calculations, detect outliers, clean data, and visualize data.

data-science matplotlib python python-script python3 scikit-learn

Last synced: 30 Oct 2024

https://github.com/g0r0kh/clustering

k-means & hierarchical clustering

conda matplot numpy pandas scikit-learn scipy sklearn

Last synced: 14 Oct 2024

https://github.com/krzjoa/scikit-gbm

scikit-learn compatible tools to work with GBM models

data-science feature-engineering gbm gradient-boosting machine-learning scikit-learn

Last synced: 07 Nov 2024

https://github.com/alejoduarte23/ssi-cov

The SSICOV is a system identification method for structural health monitoring (SHM) using Stochastic Subspace Identification (SSI). It processes acceleration data to extract modal parameters of a structure.

matplotlib modal-analysis numpy scikit-learn scipy signal-processing system-identification

Last synced: 14 Oct 2024

https://github.com/devamoghs/pos-tagger-nltk-scikit-learn

Part-Of-Speech Tagger using custom trained models, implemented with Scikit-Learn and NLTK

machine-learning natural-language-understanding nltk-library part-of-speech-tagger pos-tagger scikit-learn

Last synced: 28 Oct 2024

https://github.com/sayakpaul/patients-conversation-detector

Contains my experiments for ZS's hiring hackathon (II).

data-science keras machine-learning nlp python scikit-learn text-classification

Last synced: 28 Oct 2024

https://github.com/aadityatamrakar/captcha_ai

ML program to decode text from captcha images.

matplotlib python scikit-learn sklearn

Last synced: 14 Oct 2024

https://github.com/superbderrick/creditratingprediction

A simple data prediction system that evaluates creditRating with a little bit data

scikit-learn tensorflow

Last synced: 06 Nov 2024

https://github.com/zacheddy/issuelabeler

Automatically label issues on GitHub repositories with machine learnin' (demo available)

classifier machine-learning python scikit-learn

Last synced: 14 Oct 2024

https://github.com/andreped/nlp-mtl

Training neural networks to solve multiple tasks simultaneously from free text through multi-task learning

bert-embeddings keras multi-task-learning natural-language-processing neural-networks nlp scikit-learn

Last synced: 07 Nov 2024

https://github.com/alexandregazagnes/scikit-res

Very Basic package to store results of ML models Grid search results are hard to exploit. This package aims to store them in a more convenient way.

data machine-learning mlops mlops-workflow results scikit-learn

Last synced: 28 Oct 2024

https://github.com/oneapi-src/intelligent-indexing

AI Starter Kit for Intelligent Indexing of Incoming Correspondence using Intel® Extension for Scikit-learn*

machine-learning scikit-learn

Last synced: 05 Nov 2024

https://github.com/marella/evaluate

A tool to evaluate the performance of various machine learning algorithms and preprocessing steps to find a good baseline for a given task.

lightgbm machine-learning python scikit-learn xgboost

Last synced: 25 Oct 2024

https://github.com/msjahid/machine-learning-projects

A collection of machine learning projects featuring models and algorithms for supervised and unsupervised learning, model evaluation, and optimization.

jupyter matplotlib numpy pandas python scikit-learn seaborn

Last synced: 11 Oct 2024

https://github.com/engageintellect/bitcoin-price-predictor

This Python project predicts whether the price of Bitcoin will increase or decrease on the next day, using historical price data and machine learning. Additionally, the project visualizes Bitcoin's price movements using candlestick charts along with moving averages for different timeframes.

bitcoin machine-learning matplotlib mplfinance numpy pandas python scikit-learn visualization yfinance

Last synced: 09 Oct 2024

https://github.com/wwunlp/sner

𒈬 Sumerian Named Entity Recognition

machine-learning ner nlp python scikit-learn

Last synced: 13 Oct 2024

https://github.com/gitstar-oc/machine-learning

This are the Machine Learning notes by leading AI website named Deeplearning.AI. This notes will help you to be a machine learner from beginner to advanced level. Welcome Everyone!!

deep-learning deep-neural-networks jupyter-notebook machine-learning machine-learning-algorithms matplotlib numpy pandas python scikit-learn supervised-learning tensorflow unsupervised-learning

Last synced: 13 Oct 2024

https://github.com/shreyansh055/time-series-forecasting_055

The Time Series Forecasting Project predicts future trends using historical data with Python, Pandas, and models like ARIMA, LSTM, and Prophet, focusing on scalable, accurate forecasting for business and finance.

lstm matplotlib numpy pandas python scikit-learn seaborn

Last synced: 31 Oct 2024

https://github.com/thenorthkun/twitter-gender-text-analysis

This Project contains a vivid analysis of texts, words (using NLP) typed by Male & Female users active on Twitter. 🐦👨📝

analysis data-visualization natural-language-processing scikit-learn

Last synced: 14 Oct 2024

https://github.com/udityamerit/python-librearies-for-data-science

Python libraries for data science enable efficient data manipulation, analysis, and modeling. Key libraries include NumPy for numerical computing, pandas for data handling, Matplotlib for visualization, Scikit-learn for machine learning, TensorFlow for deep learning, and BeautifulSoup/requests for web scraping. These libraries simplify complex data

beautifulsoup data data-science data-science-libraries machine-learning matplotlib numpy pandas requests scikit-learn scikitlearn-machine-learning tensorflow

Last synced: 08 Nov 2024

https://github.com/kohlerhector/tree-mbpo

Study Model-Based Policy Optimization by varying the model estimator classes (e.g Decision Trees vs MLP)

decision-tree mbpo mbrl mlp rl sac scikit-learn stable-baselines3

Last synced: 08 Nov 2024

https://github.com/kohlerhector/dpdt-py

Implementation of Dynamic Programming Decision Tree algorithm (Kohler et. al. 2024).

decision-tree-classifier decision-trees dynamic-programming scikit-learn scikitlearn-machine-learning sklearn sklearn-classifier

Last synced: 08 Nov 2024

https://github.com/joseabrantesjr/previsai

O PrevisAI é uma aplicação que utiliza tecnica avançada de deep-learning para prever os preços de fechamento de ações, ETFs, Fundos Imobiliários, Criptomoedas, etc.

acoes criptomoedas deep-learning etf fii keras mercado-financeiro numpy pandas previsao python scikit-learn tensorflow trade trading yfinance

Last synced: 31 Oct 2024

https://github.com/spockoo/pylegend

A fusion between Python and legend, a name that suggests that the code is both modern and mythical. 3 months of work, with tons of errors to establish the calculations necessary for the superposition, I want to publish my work and improve it and share it under Apache 2.0 License. Designed to work with NBminer!

crypto crypto-tools how-to-farm-crypto kerastuner matplotlib matrix-multiplication mining nbminer numpy performance-optimization pickle project quantum quantumcircuits quantumcomputing qubits scikit-learn tensorflow

Last synced: 10 Oct 2024

https://github.com/facultyai/faculty-xval

Cross-validation of Keras and scikit-learn models with the Faculty platform

cross-validation faculty-platform keras machine-learning python scikit-learn

Last synced: 08 Nov 2024

https://github.com/markdouthwaite/lingo

A package for quickly deploying Scikit-Learn Linear Models in Go.

golang linear-models machine-learning scikit-learn

Last synced: 13 Oct 2024

https://github.com/jonperk318/machine-learning-analysis-of-hyperspectral-data

Using Non-negative Matrix Factorization (NMF) and Variational Autoencoder (VAE) machine learning architectures to analyze spatial and spectral features of hyperspectral cathodoluminescence (CL) spectroscopy images taken from hybrid inorganic-organic perovskite material

data-analysis data-science deep-neural-networks explained-variance hybrid-perovskite hyperspectral-image-classification machine-learning matplotlib nmf non-negative-matrix-factorization python pytorch scikit-learn semi-supervised-learning signal-processing solar-energy spectroscopy unsupervised-learning vae variational-autoencoder

Last synced: 06 Nov 2024

https://github.com/farahibrar/programming-in-python

Explore a comprehensive collection of Python programming for diverse data analysis and data science projects. This repository covers data exploration, visualization, statistical analysis, machine learning, NLP, and model deployment. Perfect for enthusiasts looking to delve into practical examples and advanced techniques.

beautifulsoup dataanalysis docker flask folium jupyter-notebook machine-learning matplotlib nltk numpy pandas python pytorch scikit-learn scikitlearn scipy seaborn spacy statsmodels tensorflow

Last synced: 15 Oct 2024

https://github.com/md-emon-hasan/ml-project-car-price-prediction

🚗 End-to-end ML project for predicting car prices based on various features. Includes data preprocessing, model training, and a Flask web for predictions.

car-price-prediction car-price-predictor data-science feature-engineering ml predictive-modeling scikit-learn

Last synced: 10 Oct 2024

https://github.com/ayushshahh/fespn

A neural network made to predict final exam scores of students

mlp mlp-regressor multilayer-perceptron neural-network prediction-model scikit-learn

Last synced: 18 Oct 2024

https://github.com/sd338/fractureai

This tool helps people upload X-rays to find broken bones. It uses a machine to mark where the breaks are and gives users marked pictures to download. A smart computer also helps people understand their broken bones and gives them advice.

css cv2 flask gorq html javascript matplotlib npm numpy pandas pydantic python react scikit-learn torch torchvision ultralytics

Last synced: 31 Oct 2024

https://github.com/marty1885/scirknn

Convert and run scikit-learn models into RKNN2 format to run on Rockchip NPUs.

npu rknn rknpu2 rockchip scikit-learn

Last synced: 15 Oct 2024

https://github.com/edikedik/eboruta

Flexible and transparent Python Boruta implementation

ensemble-models feature-selection machine-learning python scikit-learn

Last synced: 13 Oct 2024

https://github.com/nirmalyabag20/breast-cancer-prediction-using-machine-learning

This project leverages machine learning to classify breast cancer as malignant or benign based on tumor characteristics. By applying and evaluating multiple algorithms, the model achieves high accuracy, demonstrating the practical application of data-driven solutions in medical diagnostics.

logistic-regression matplotlib numpy pandas python scikit-learn seaborn

Last synced: 31 Oct 2024

https://github.com/chanioxaris/german-credit-data

Experimental classification algorithms on german credit data implemented using scikit-learn library

classification classifier cross-validation dataset information-entropy information-gain naive-bayes prediction random-forest scikit-learn support-vector-machines

Last synced: 02 Nov 2024

https://github.com/aahnik/gdsc-ml-ds-bootcamp-2023

This repo contains files given by my seniors as well as assignments and final project done by me during the bootcamp.

data-science machine-learning ml numpy pandas python3 scikit-learn

Last synced: 11 Oct 2024

https://github.com/vishal-038/attendance_by_face_recogination

This project is a face recognition-based attendance system that uses Python, OpenCV, Scikit-learn, Streamlit, and various other libraries like Pandas, Numpy, Datetime, and OS for different functionalities. It enables adding faces to the database, taking attendance based on face recognition, and showing live attendance through a web interface built

opencv python scikit-learn

Last synced: 09 Oct 2024

https://github.com/raihan4520/ml

A collection of machine learning projects showcasing various algorithms and techniques, including a final project for the Machine Learning course at AIUB.

data-preprocessing jupyter-notebook machine-learning model-evaluation numpy pandas python scikit-learn

Last synced: 31 Oct 2024

https://github.com/skekre98/picture-compressor

A tool for compressing images using unsupervised machine learning

kmeans-clustering scikit-learn

Last synced: 13 Oct 2024

https://github.com/vaibhavs10/learn-ml

Modified notebooks (single) from kaggle.com/learn with added nuances

decision-trees machine-learning pandas random-forest scikit-learn

Last synced: 25 Oct 2024

https://github.com/ivanyu/kaggle-digit-recognizer

Kaggle's "Digit Recognizer" competition

kaggle keras machine-learning scikit-learn

Last synced: 15 Oct 2024

https://github.com/jesly-joji/house-price-prediction

House Price Prediction using Linear Regression with Scikit-learn and Flask

flask scikit-learn

Last synced: 13 Oct 2024

https://github.com/metriccoders/metriccoders_datasets

This is the Metric Coders repository containing all the datasets for machine learning.

data datasets machine-learning natural-language-processing scikit-learn

Last synced: 05 Nov 2024

https://github.com/mindful-ai-assistants/credit-card-prediction

💳 This repository focuses on building a predictive model to assess the likelihood of credit card defaults. The project includes data analysis, feature engineering, and machine learning to provide accurate default predictions.

jupyter logistic-regression machine-learning python3 scikit-learn

Last synced: 21 Oct 2024

https://github.com/corentinth/ml-gender_classification

[Machine Learning] The Hello Wolrd of Machine Learning using sklearn

body-metrics gender-classification machine-learning scikit-learn

Last synced: 02 Nov 2024

https://github.com/mg380/ibm-applied-data-science-capstone

This Capstone is the 10th (final) course in IBM Data Science Professional Certificate specialization, and it actually summarises in the form of project all materials that have been learned during this specialization

capstone data data-analysis data-science datascience ibm machine-learning plotly python scikit-learn sql

Last synced: 10 Oct 2024

https://github.com/udityamerit/all_mlpackages

The `all_MLpackage` is a Python tool for breast cancer detection using scikit-learn's machine learning algorithms. It automates model training, generates accuracy comparisons, and helps identify the best-performing algorithm for diagnosis.

classification knn-algorithm logistic-regression machine-learning numpy pandas scikit-learn svm-model

Last synced: 31 Oct 2024

https://github.com/gappeah/income-prediction-ml

This is a machine learning project aimed at predicting whether an individual's annual income exceeds $50,000 based on their demographic and personal information.

data data-science machine-learning ml numpy pandas python random-forest scikit-learn

Last synced: 10 Oct 2024

https://github.com/kaleharshavardhan07/spam_mail-_detector_ai_model

This project implements a spam detection system for SMS messages using machine learning techniques.

mathplotlib nltk numpy panda python scikit-learn seaborn

Last synced: 31 Oct 2024

https://github.com/swimshahriar/heart-attack-prediction

Heart attack prediction from 13 features.

jupyter-notebook pandas python3 scikit-learn

Last synced: 02 Nov 2024

https://github.com/flysirin/adstextclassification

Classification of advertisements by topic

docker excel flask pandas python pytorch scikit-learn

Last synced: 13 Oct 2024

https://github.com/akashshnkr/multi-disease-prediction

Developed and integrated three machine learning models for predicting diabetes, Parkinson's, and heart disease into a Streamlit-based web application. The interface allows users to input data and receive accurate health predictions, enhancing early detection and healthcare outcomes.

logistic-regression machine-learning-algorithms numpy pandas python scikit-learn streamlit-webapp svm

Last synced: 13 Oct 2024

https://github.com/rajikaimal/emma

:santa: Intelligent mention bot for GitHub organizations

bot emma machine-learning python scikit-learn

Last synced: 27 Oct 2024

https://github.com/priyanshulathi/url-based-web-phishing-detection-model

A machine learning project for detecting phishing URLs by extracting key features from web addresses and applying various classification algorithms to identify potential threats effectively.

decision-tree knn logistic-regression machine-learning python random-forest scikit-learn

Last synced: 13 Oct 2024

https://github.com/vectominist/mednlp

Mandarin Medical Dialogue Analysis with Pytorch.

dialog huggingface mandarin medical pytorch scikit-learn transformers

Last synced: 14 Oct 2024

https://github.com/jasper-koops/easy-gscv

This library allows you to quickly train machine learning classifiers by automatically splitting the data set and using both grid search and cross validation in the training process.

classification machine-learning python3 scikit-learn

Last synced: 23 Oct 2024

https://github.com/dustinmichels/bayesian-values-guesser

Uses some user input, data from the World Values Survey <www.worldvaluessurvey.org>, and Bayes Rule to guess a number of beliefs the user might have. STATUS: In progress.

bayes-rule bayesian-values-guesser naive-bayes-classifier pandas python scikit-learn values-survey

Last synced: 27 Oct 2024

https://github.com/bhuvaneshwarguttula/student-performance-indicator

To understand and predict how the student's performance (test scores) is affected by the other variables (Gender, Ethnicity, Parental level of education, Lunch, Test preparation course).

exploratory-data-analysis machine-learning pandas python scikit-learn student-performance-analysis

Last synced: 10 Oct 2024

https://github.com/offchan42/thai-thesis-classification

Classify each document inside the corpus using Python machine learning module: scikit-learn

nlp python python2 scikit-learn segment thai thai-language thai-thesis-classification

Last synced: 29 Oct 2024