An open API service indexing awesome lists of open source software.

NumPy

NumPy is an open source library for the Python programming language, adding support for large, multidimensional arrays, and matrices, along with a large collection of high-level mathematical functions to operate on these arrays.

https://github.com/wasifsohail5/amusic-ai_powered_musicrecommendationsystem

AMUSIC is an AI-driven music recommendation system that helps users discover personalized songs. Using Python, Streamlit, and Scikit-learn, it offers smart recommendations, advanced search, and interactive music insights. Users can save favorites, create playlists, and export data for a seamless music discovery experience.

joblib k-nearest-neighbours matplotlib minmaxscaler numpy pandas pickle plotly python scikit-learn seaborn streamlit

Last synced: 14 Oct 2025

https://github.com/dastogirrudro/machine-learning-and-deep-learning

This is my thesis project which i have done in varsity.Here i used machine learning and deep learning i used LSTM as deep learning.This can identify aggresive spam message. Here i used pandas scikit-learn and many more framework i used python as a programming language.I used many algorithm for highering the accuracy of my project.

deep-learning lstm machine-learning numpy pandas python scikit-learn

Last synced: 11 Apr 2026

https://github.com/hansalemaos/cirdhighspeedcoverter

High speed conversion of IP addresses represented in CIDR notation into their corresponding start and end IPs, along with their respective subnet masks.

cidr data-manipulation ip-addresses ip-conversion network network-management network-security numpy pandas subnet-mask

Last synced: 27 Jan 2026

https://github.com/hoyirul/flask-starter-fs

Flask-Starter-FS is a concise full-stack starter kit for Flask web development, offering quick setup and pre-configured settings for seamless integration of front-end and back-end components. Accelerate your full-stack web application development with Flask-Starter-FS.

flask machine-learning numpy pandas python sklearn

Last synced: 14 Apr 2026

https://github.com/aqueeqazam/student-placements-using-machine-learning-and-flask

Through the use of Flask and machine learning, I show how to implement the student placement prediction model.

flask machine-learning numpy pandas

Last synced: 14 Apr 2026

https://github.com/egorumaev/2023-cirrhosis-outcomes

Прогнозирование исхода лечения пациентов с циррозом печени

catboost imblearn iqr lda matplotlib numpy pandas pca phik pipeline sklearn t-sne xgboost

Last synced: 08 May 2026

https://github.com/amanda-ucc/market-tracker

Automated portfolio creator that generates a portoflio which mimics a bench mark index by minimizing the tracking error.

beta covariance numpy pandas scipy sharpe-ratio tracking-error variance yfinance

Last synced: 11 Apr 2026

https://github.com/matkorussovich/student-performance-analysis

Este repositorio contiene un análisis del desempeño académico de estudiantes, realizado como parte del módulo "Introducción al Data Science" en el Máster en Data Science de la Universidad Europea de Madrid.

jupyter-notebook matplotlib-pyplot numpy pandas python

Last synced: 14 Apr 2026

https://github.com/badranalyst/student-tests-data-analysis-application

Python-based analysis of student test scores in math, reading, and writing, examining correlations with parental education, lunch type, and test preparation. Includes data cleaning, visualization, and statistical insights into factors influencing academic performance.

data-analysis data-visualization dataset matplotlib numpy pandas python sklearn

Last synced: 05 May 2026

https://github.com/ljubogdan/solar-cycle-lstm

This project predicts sunspot activity using an LSTM model for time series data. Built with TensorFlow and Keras, it uses Huber loss for outlier handling and MAE for performance evaluation. The dataset, sourced from Kaggle or SIDC, spans over 270 years of monthly sunspot data.

conv1d huber-loss-regression kaggle keras lstm machine matplotlib numpy pandas scikit-learn seaborn solar sunspots tensorflow time-ser

Last synced: 13 Apr 2026

https://github.com/dawoudtormos/rain_predicition_ml

This repository contains a machine learning project for rain prediction using historical meteorological data. The goal is to predict whether it will rain the next day based on weather measurements like atmospheric water vapor, cloud formation, and past rainfall records.

ai machine-learning machine-learning-algorithms matplotlib numpy seaborn sklearn

Last synced: 10 May 2026

https://github.com/sralter/potential_talents

Using NLP techniques (word and sentence embedding tools like SBERT and Learning-to-Rank systems like RankNet and LambdaRank) to rank candidates.

lambdarank learning-to-rank lightgbm matplotlib nlp numpy pandas python pytorch ranknet

Last synced: 09 Apr 2026

https://github.com/karthik9273/revolutionizing-gold-rate-forecasting-for-small-businesses-with-machine-learning

The "Gold Price Prediction" project focuses on predicting the prices of gold using machine learning techniques. By leveraging popular Python libraries such as NumPy, Pandas, Scikit-learn (sklearn), Matplotlib, Seaborn, Random Forest Regressor, and others, this project provides a comprehensive solution for accurate price estimation.

data-science google-colab-notebook jupyter-notebook machine-learning matplotlib numpy pandas-dataframe python seaborn sklearn

Last synced: 06 May 2026

https://github.com/eduardoprofe666/mn-api

🐍📦 Paquete de Python con implementaciones de métodos numéricos

mn-api numerical-methods numpy pandas python scipy simpy tabulate

Last synced: 04 Jan 2026

https://github.com/rebeccamorolong/market--basket--analysis

Market Basket Analysis using the Apriori algorithm to predict association rules and frequent itemsets from transactional data. The goal was to uncover product combinations that frequently co-occur in transactions, which can be used for recommendations, promotions, and inventory management.

apriori-algorithm numpy pandas seaborn

Last synced: 10 May 2026

https://github.com/winterwind/ecg_signal_classification

Two-part project that involves detecting the R-peaks in an ECG signal to extract the individual ECG beats and making a machine learning model to classify them

csv csv-files data-science decision-trees ecg ecg-classification ecg-signal jupyter jupyter-notebook knearest-neighbors knn machine-learning matplotlib matplotlib-pyplot numpy pandas pyplot python random-forest scipy

Last synced: 11 Apr 2026

https://github.com/swarnabhaghosh/house-price-prediction-model

Built an end-to-end regression pipeline to predict house prices using Linear Regression with automated preprocessing (PowerTransform, StandardScaling) via Scikit-learn's Pipeline and ColumnTransformer.

column-transformer linear-regression matplotlib-pyplot numpy pandas pipeline python scikit-learn seaborn

Last synced: 11 Apr 2026

https://github.com/deepcloudlabs/dcl702-2021-jul-12

DCL-702: Data Analytics using Python

data-analytics numpy pandas python3

Last synced: 05 May 2026

https://github.com/arthurdsant/dataanalysis-agricultural_raw_material

This Python project performs analysis and visualization of agricultural raw material price data using a Kaggle dataset. Based on Jupiter Notebook and Python.

jupyter-notebook matplotlib numpy pandas python seaborn

Last synced: 26 Jan 2026

https://github.com/architj6/gesture-volume-control

Easily control your system volume with simple hand gestures! This project leverages OpenCV, MediaPipe, and Pycaw to track hand movements and adjust the system volume dynamically. Say goodbye to keyboard shortcuts and volume buttons—just move your hand to set the volume!

handgesture mediapipe numpy opencv os pycaw python python3 volume-control

Last synced: 03 May 2026

https://github.com/farhad-here/predict_student_performance

Predict Student Performance, is a data analysis and machine learning project aimed at predicting students' final performance (g3) based on demographic, family, and academic features. The project supports both Regression (predicting exact grades) and classification (Pass/Fail categories).

classification data-analysis data-visualization linear-regression machine-learning numpy pandas postgresql powerbi scikit-learn streamlit

Last synced: 14 Apr 2026

https://github.com/yulia-momotyuk/dla-data-analysis-practice

This repository contains my homework assignments completed during the "Data Analyst in IT" course at Data Loves Academy.

analytics data-analysis data-visualization excel mysql numpy pandas postgres powerbi python seaborn sql tableau

Last synced: 14 Apr 2026

https://github.com/saivarun2611/potato_disease_classifier

This project is a Potato Disease Classifier that uses a Convolutional Neural Network (CNN) to classify potato leaves as healthy, suffering from early blight, or late blight. The model helps in early detection of diseases, which can be crucial in maintaining crop health and yield.

cnn keras matplotlib numpy pandas python3 tensorflow

Last synced: 14 Apr 2026

https://github.com/amanyadav-07/customer-churn-prediction

Machine Learning project to predict customer churn using Logistic Regression, Random Forest, and XGBoost. Includes data preprocessing, feature engineering, SMOTE balancing, model training, evaluation, and business insights.

accuracy-metrics data-analysis data-visualization logistic-regression machine-learning matplotlib numpy pandas python3 random-forest-classifier seaborn sklearn xgboost-classifier

Last synced: 11 Apr 2026

https://github.com/audy21/datacamp

Learning portfolio documenting my progress, while taking Data Analyst & Data Science certifications from DataCamp.

data-analysis data-science machine-learning matplotlib numpy pandas python scikit-learn seaborn

Last synced: 11 Apr 2026

https://github.com/talapanenivarshithchowdary/asteroid-detection-ml

This project uses Machine Learning to detect and classify asteroids based on trajectory and size, aiding in Near-Earth Object detection and planetary defense.

classification data-science decision-trees jupyter-notebook knn logistic-regression machine-lea matplotlib numpy pandas pillow prediction python3 random-forest scikit-learn

Last synced: 11 Apr 2026

https://github.com/harmanveer-2546/reducing-data-entries

Way to delete data entries from csv/excel file using. For excel file, use excel instead of csv in the code.

csv data data-entry delete-data excel numpy pandas python

Last synced: 05 May 2026

https://github.com/dzakarias/waves

Fun with the sine function inspired by Chladni plates

numpy streamlit

Last synced: 10 May 2026

https://github.com/satyacoder29/crm-analytics-power-bi

CRM Analytics Dashboard – An interactive dashboard using Tableau, SQL, and Salesforce CRM Analytics (CRMA) to analyze sales performance, customer segmentation, and churn prediction. Features automated ETL pipelines, predictive analytics, and real-time insights for data-driven decision-making. 🚀📊

advanced-excel data-analysis data-cleaning data-collection data-transformation data-visualization matplotlib numpy pandas powerbi python seaborn sql tableau

Last synced: 14 Apr 2026

https://github.com/miasanromen/math-modeling

Math modeling with Python. Statistics, math, graphics

math matplotlib numpy probality-theory scipy-stats statistics

Last synced: 14 Apr 2026

https://github.com/tugot17/neural-network-from-scratch

Simple neural network implemented from scratch using numpy package

from-scratch neural-network numpy python

Last synced: 26 Oct 2025

https://github.com/ngangawairimu/data-validation-using-python

Agricultural dataset validated using python code for usage. Building a data pipeline that will ingest and clean data with the press of a button.

jupyter-notebook numpy pandas pytest python

Last synced: 13 Apr 2026

https://github.com/jupfu/beck-view-movie

Assemble mp4 video from list of images

multithreading numpy opencv python3 threadpool tqdm

Last synced: 10 May 2026

https://github.com/harmanveer2546/heart

Predicting the presence of heart disease based on several health-related factors and Performing - i.) Data Cleaning ii.) Data Pre-Processing iii.) EDA iv.) Compare 5 different classification algorithms (Logistic Regression, Decision Tree, Random Forest, KNN and SVC)

data-preprocessing decision-tree eda knn logistic-regression machine-learning numpy pandas random-forest roc-auc-curve svc

Last synced: 03 May 2026

https://github.com/alejandro945/insurance-risk

This project aims to predict the risk of insurance claims using a dataset from Kaggle. The dataset consists of 26 columns and 205 rows, providing various features related to insurance risk. By analyzing this data, we seek to build predictive models that can help insurers assess the risk of claims.

data-analytics ipython-notebook numpy pandas python

Last synced: 06 Feb 2026

https://github.com/prakashjha1/email-sms-spam-classifier

Many email services today provide spam filters that are able to classify emails into spam and non-spam email with high accuracy.

machine-learning natural-language-processing numpy pandas python

Last synced: 10 May 2026

https://github.com/ambrish8/bell-s-palsy

BELL’S PALSY SEVERITY DETECTION USING DEEP LEARNING

cnn-architecture imagedatagenerator keras numpy opencv python pytorch resnet-50

Last synced: 15 Apr 2026

https://github.com/guy1998/synthetic-data-generator-for-event-cameras

Automated workflow for the generation of synthetic data that can be used as an input for event cameras based on previously taken videos

computer-vision event-cameras numpy opencv synthetic-data

Last synced: 15 Apr 2026

https://github.com/gregoritsch3/ml_eda_clustering_aidassessment

An EDA and Machine Learning Clustering exercise on the Country Aid Assessment dataset demonstrating the use of PCA, KMeans and DBSCAN clustering, Elbow Methods, etc. The clustering algorithm successfully demarcates countries that are in most dire need of aid based on their GDPP and Child Mortality rate.

anova dbscan kmeans machine-learning matplotlib numpy pandas pca scikit-learn seaborn statistics

Last synced: 16 Apr 2026

https://github.com/jjerphan/joml

A minimalist numpy-baked Neural Network API

deep-learning fun minimalist neural-network neural-networks numpy python wheel-implementing

Last synced: 10 May 2026

https://github.com/anas436/python

This is Python Programming Language. Here, the basic problems of Python language have been solved. If you can solve these problems then your basic skills will be developed in this language and you will be suitable for doing good projects.

jupyter-notebook jupyterlab numpy pandas pycharm python3 watson-studio

Last synced: 11 Apr 2026

https://github.com/deepankkartikey/ml-projects

A repository of projects done while learning and exploring ML

classification csv machine-learning numpy pandas regression sklearn

Last synced: 15 Apr 2026

https://github.com/jaypanchal9/fraud-detection-case-study

A comprehensive case study applying machine learning techniques to detect fraudulent transactions effectively.

machine-learning matplotlib numpy pandas python3 scikit-learn seaborn xgboost

Last synced: 15 Apr 2026

https://github.com/muhkartal/fr-photo

face-recognition system works on images first of all you need to register valid image as recognition

face-recognition numpy opencv pillow python streamlit

Last synced: 15 Apr 2026

https://github.com/ishinzoo/songrecommendation

This project is a machine learning-based system that recommends songs based on the user's detected emotions. The application uses facial expression recognition to determine the user's current emotional state and suggests songs that align with that emotion. This system can be particularly useful for personalized music streaming services, helping use

machine-learning mediapipe numpy opencv os python tenserflow

Last synced: 25 Feb 2026

https://github.com/charlenry/python_data_science

Mes notebooks de travaux pratiques sur Python pour la Data Science

analysis data dataframe jupyter kaggle matplotlib notebook numpy pandas pyplot python science seaborn visualisation

Last synced: 25 Jun 2026

https://github.com/amruta33/credit_card_analysis

The loan providing companies find it hard to give loans to the people due to their insufficient or non-existent credit history. Because of that, some consumers use it as their advantage by becoming a defaulter.

numpy pandas python3

Last synced: 15 Apr 2026

https://github.com/hansalemaos/npzigloc

Zig for Numpy

numpy python zig

Last synced: 31 Jan 2026

https://github.com/dipeshgoyal013/salary-data-analysis

Salary Analysis according department and agency.

analysis matplotlib numpy pandas salary sklearn-library

Last synced: 15 Apr 2026

https://github.com/dmitruzik/python_projects

All python projects are here

matplotlib numpy pandas python seaborn

Last synced: 15 Apr 2026

https://github.com/audeering/audmath

General math functions

math numpy

Last synced: 07 Feb 2026

https://github.com/rissh/titanicsurvivalpredictionusingml

Predicting Titanic passenger survival through machine learning. This project includes data preprocessing, exploratory data analysis, feature engineering, and model training using Python. 🚢

data data-analysis data-science data-visualization dataanalysis jupiter-notebook machine-learning machine-learning-algorithms machinelearning matplotlib numpy pandas prediction prediction-model python python3 seaborn tenserflow tflearn titanic

Last synced: 01 Feb 2026

https://github.com/mrktsm/spam-email-recognizer

Long Short-Term Memory (LSTM) network trained to classify emails as spam or non-spam. It processes email content to make accurate predictions and can be integrated into projects for efficient spam detection and email management.

data-preprocessing keras lstm-neural-network model-architecture nltk numpy pandas performance-evaluation scikit-learn spam-classification-model tenserflow training-the-model

Last synced: 09 Apr 2026

https://github.com/bursasha/pandas-numpy-matplotlib-cavies-analysis

Complete statistical analysis of cavy lifetime dataset using Python, Pandas, NumPy, Matplotlib, and SciPy to explore, visualize, and infer the impact of bacilli infection on cavy lifetimes 🦫

cavy dataset-analysis distribution-fitting hypothesis-testing jupyter-notebook matplotlib numpy pandas python-analysis scipy statistical-analysis statistical-methods

Last synced: 09 Feb 2026

https://github.com/ryancswallace/npdb

Parallel NumPy-like interface for large n-dimensional arrays on disk.

mapreduce-designpatterns numpy numpy-arrays parallel-computing python

Last synced: 09 Feb 2026

https://github.com/mahmud-kandawala/simulating-covid-19-python-

This repository contains a Python program that simulates the spread of COVID-19 using Numpy and Matplotlib for calculations and visualization. The simulation offers an interactive way to understand virus dynamics and the impact of epidemiological parameters.

matplotlib numpy python

Last synced: 16 Apr 2026

https://github.com/puureya2/catalogue-price-scraper

Scrape & store product prices with Selenium & BeautifulSoup4. Visually represent data with MatPlotLib,Personal Project

beatifulsoup4 chrome-webdriver matplotlib numpy pandas python python-csv selenium webscraping

Last synced: 16 Apr 2026

https://github.com/mustafaozvardar/image-classification-webapp

This project features an AI model for diagnosing pneumonia from chest X-ray images. Users can upload a chest X-ray image through the app, and the model will classify it.

artificial-intelligence computer-vision keras numpy python streamlit streamlit-webapp tensorflow

Last synced: 27 Feb 2026

https://github.com/karthikarajagopal44/numpy-beginner-to-advanced

This repository is designed to help you master NumPy, the fundamental package for scientific computing with Python. Whether you are just getting started or looking to refine your skills with advanced techniques, this repo covers a comprehensive range of topics and functionalities. It covers the range of topics from beginner to advance.

data-science numpy numpy-library numpy-tutorial python

Last synced: 16 Apr 2026

https://github.com/navindafernando/feature-extraction

Heart Risk Level Predicting Regression Model & Web using Feature Engineering and Data Preprocessing :baby_chick:

categorical-encoding feature-engineering flask handling-outlier html5 joblib label-encoding machine-learning numpy pandas polynomial-features quantile-transformer scaling

Last synced: 10 Apr 2026

https://github.com/walkerdustin/python-linear-regression

simple linear regression in python based on coursera andrew ng course

ai linear-regression machine-learning ml numpy python

Last synced: 05 May 2026

https://github.com/ywatanabe1989/scitex-io

Universal scientific data I/O with plugin registry — save/load 30+ formats with one API. Part of SciTeX.

cli csv data-io hdf5 mcp numpy openscience pandas plugin-registry python research scientific-computing scitex

Last synced: 07 Jun 2026

https://github.com/kartmaan/plotly-intervals

Group values from a Pandas Series according to given intervals and represent them graphically with Plotly

bar-chart grouping grouping-plots intervals intervals-chart numpy numpy-arrays orca pandas pandas-series percentile pie-chart plot plotly plotting values

Last synced: 10 May 2026

https://github.com/yalai92/alfalfa_imp_exp_analysis

This repository covers data cleaning, analysis, and visualization of global alfalfa and pellet imports, focusing on trends from 2003 to 2023. It also includes a predictive analysis of global alfalfa demand for 2024-2029, using data science techniques to provide insights for stakeholders in the alfalfa industry.

data-analysis data-cleaning data-visualization matplotlib numpy pandas python sckiit-learn tableau

Last synced: 12 Feb 2026

https://github.com/rahulsm20/storedata

A data analysis project aimed at analyzing the sales data of the super store and providing useful insight into customer preferences.

data-analysis matplotlib numpy pandas python streamlit

Last synced: 16 Apr 2026

https://github.com/m1-elmasry/quickmark

Efficient OMR software designed for quick and precise grading of bubble sheet exams.

bubble-sheet image-processing numpy opencv python

Last synced: 10 May 2026

https://github.com/anshpg/exploring-ipl-rivalries-an-analysis-of-matches-from-2008-to-2022

In this project, I delved into an extensive analysis of IPL matches spanning from 2008 to 2022, utilizing a dataset sourced from Kaggle.com. My aim for this IPL season was to explore something novel, steering away from conventional analyses.

csv-files ipl kaggle-dataset matplotlib numpy pandas

Last synced: 10 May 2026

https://github.com/lexxai/goit_python_ds_hw_09

Модуль 9. Підбір гіперпараметрів НМ. Глибоке навчання. Tensorflow. Keras.

adam-optimizer data-science google-colab keras keras-tensorflow numpy python

Last synced: 16 Apr 2026

https://github.com/fedesgh/parkinson_volatility_spread_on_cedears

Creating a function that returns a graph with the difference between Parkinson's volatility and regular volatility given a certain bounds

numpy pandas pickle seaborn

Last synced: 10 Apr 2026

https://github.com/sahil210695/convolution-evolution

This is to see how a kernel will convolve over an image and what will be its output after convolution

cnn convolutional-neural-networks data-science image-kernel image-processing numpy numpy-neural-network python streamlit

Last synced: 10 May 2026

https://github.com/mchenryspagg/economic-variable-correlation-and-productivity-model

This report provides an in-depth analysis of Nigeria's economic performance over the past decade, spanning from 2014 to 2024.

correlation exchangerates inflation moneysupply numpy productivity python regression regression-analysis regression-models visualization

Last synced: 15 Feb 2026