An open API service indexing awesome lists of open source software.

NumPy

NumPy is an open source library for the Python programming language, adding support for large, multidimensional arrays, and matrices, along with a large collection of high-level mathematical functions to operate on these arrays.

https://github.com/maximlevchenko/pandas-numpy-hippocampus-analysis

This project aims to analyze the differences in hippocampus size between two groups: "Unaffect" and "Affected" using pandas, numpy and matplotlib

jypyternotebook matplotlib numpy pandas pyreadr seaborn statsmodels

Last synced: 16 Apr 2026

https://github.com/paulo-santos-ds/previsao_total_corridas_taxi

Foi desenvolvido um modelo preditivo específico para atrair mais motoristas durante o horário de pico. Prever a quantidade de pedidos de táxi para a próxima hora.

matplotlib-pyplot numpy pandas python seaborn sklearn statsmodels

Last synced: 10 Apr 2026

https://github.com/pramodyasahan/car-safe-predictor

This repository contains a machine learning project that applies the K-Nearest Neighbors (KNN) classification algorithm to predict car safety ratings. The project uses a dataset of cars, with features such as buying price, maintenance cost, number of doors, persons, lug boot size, and safety.

classification k-nearest-neighbours machine-learning numpy pandas scikit-learn

Last synced: 10 Apr 2026

https://github.com/amiridk/machinelearning

machine-learning-notes

matplotlib-pyplot numpy pandas pyhton

Last synced: 04 May 2026

https://github.com/alyssonmach/machine-learning-com-python

Aplicações de Machine Learning usando a linguagem de programação Python.

ia keras-tensorflow machine-learning matplotlib numpy pandas programming python scikit-learn scipy

Last synced: 10 Apr 2026

https://github.com/the-developer-306/neural-style-transfer

Neural Style Transfer project leveraging TensorFlow and VGG19 to blend artistic styles with content images. Features include style and content initialization, multi-layer feature extraction, total variation loss, and post-processing enhancements like sharpening, upscaling, and brightening.

deep-learning image-processing neural-style-transfer numpy opencv-python post-processing tensorflow transfer-learning vgg19

Last synced: 01 May 2026

https://github.com/lorenzorottigni/ml-ecommerce

Machine Learning python bootcamp: linear regression on ecommerce dataset

ipynb linear-regression machine-learning numpy pandas python scikit-learn seaborn

Last synced: 07 Apr 2026

https://github.com/sahilk12nayak/hyperspectral-corn-don-prediction-project

This project contains a machine learning pipeline for predicting DON (vomitoxin) concentration in corn samples using hyperspectral imaging data.

matplotlib numpy pandas python scikit-learn seaborn tensorflow

Last synced: 10 Apr 2026

https://github.com/rokmark0/auto-feature-engineering

🐙 Automated feature engineering toolkit that creates, selects, and transforms features for machine learning on tabular data, boosting model accuracy and speeding workflows.

auto-feature-engineering automl aws-ec2 dask data-science explainable-ml feature-engineering featuretools machine-learning mahinelearning numpy pandas pandas-library python r sagemaker supervised-learning unsupervised-learning

Last synced: 10 Apr 2026

https://github.com/ai-lenok/moex-analyzer

A simple engine for analyzing Moscow Exchange securities

docker matplotlib numpy pandas python3

Last synced: 09 May 2026

https://github.com/rooneyrulz/cat-dog-classifier

Building a model using a convolutional neural network that well-classifying custom cat vs dog image data. including visualizations, data-preprocessing, future predictions, model-evaluation, identifying overfitting, and more.

artificial-intelligence convolutional-neural-networks data-preprocessing data-visualizations deep-learning keras matplotlib numpy pandas predictive-modeling python3 seaborn tensorflow2

Last synced: 10 Apr 2026

https://github.com/sgb31/csv-retrival-based-rag

Created a simple RAG system that lets users upload any CSV file and ask questions about it. It reads the data, finds the most relevant info, and uses an LLM to answer the query. Super handy for quickly exploring and understanding data through natural language.

generative-ai llm numpy pandas pydantic python3 sentence-transformers sklearn streamlit transformers

Last synced: 19 Jun 2026

https://github.com/sarmad426/python

Python mastery. OOP | Numpy | Pandas | Jupyter Notebook & more.

fastapi file-handling-in-python jupyter numpy oop pandas pytest python regex unit-tests

Last synced: 10 Apr 2026

https://github.com/cartabinaria/calcolo-numerico

Una raccolta di risorse per l'insegnamento di Calcolo numerico (02023) del Corso di Laurea in Informatica.

course maths numerical-methods numpy scipy uni

Last synced: 20 Jan 2026

https://github.com/amruta33/housing_price_prediction

Consider a real estate company that has a dataset containing the prices of properties in the Delhi region. It wishes to use the data to optimise the sale prices of the properties based on important factors such as area, bedrooms, parking, etc.

linear-regression numpy pandas python rfe vif

Last synced: 28 Apr 2026

https://github.com/vijaykumarr1452/startup_success_predictor

This project demonstrates the use of Multiple Linear Regression to predict the profits of startups based on investment in R&D, Administration, and Marketing of dataset (50_Startups.csv)

machine-learning multi-linear-regression numpy pandas python regression rsquare-values scikit-learn

Last synced: 10 Apr 2026

https://github.com/shivammg/coursera-ml-numpy

Coursera Machine Learning course done using Python, NumPy, and Seaborn

machine-learning numpy python seaborn

Last synced: 09 May 2026

https://github.com/farhad-here/median-performance-comparison

Benchmarking the performance of median calculation using vanilla Python vs NumPy.

data-analysis matplotlib numpy python

Last synced: 18 Apr 2026

https://github.com/anshpg/linearluminary

Greetings! I've developed a straightforward linear regression model from scratch to predict house prices in Bangalore. But before delving into coding, let me walk you through the algorithm's conceptualization. I considered various factors such as location, ocean proximity, plot size, finished state, and flat type.

algo linea mathematics matplotlib numpy pandas pyth

Last synced: 13 May 2026

https://github.com/jkosla/neural_network_from_scratch_numpy

Neural Network From Scratch in Python | Build a simple neural network from scratch using pure Python and NumPy. Learn about forward propagation, backpropagation, and training with gradient descent. Accompanies my Medium article.

ai aritificial-intelligence medium nerual-networks numpy python3 tutorial

Last synced: 10 Apr 2026

https://github.com/ryan-bendelson/2024-summer-research

This is Python code that I worked with during my summer 2024 research project involving quantum physics.

density-matrices kronecker-product linear-algebra miniconda3 numpy numpy-arrays partial-trace python quantum-information

Last synced: 16 Apr 2026

https://github.com/aneeshmurali-n/project-ml-data-preprocessing

The main objective of this project is to design and implement a robust data preprocessing system that addresses common challenges such as missing values, outliers, inconsistent formatting, and noise. By performing effective data preprocessing, the project aims to enhance the quality, reliability, and usefulness of the data for machine learning.

data-analysis data-cleaning data-encoding data-exploration feature-scaling label-encoding matplotlib minmaxscaler numpy one-hot-encoding outlier-detection pandas standardscaler

Last synced: 02 May 2026

https://github.com/mogankumar/virtual-mouse

A Python-based Virtual Mouse that uses hand gestures for cursor control, clicking, scrolling, and taking screenshots. Powered by OpenCV, PyAutoGUI, and a custom hand tracking module for a touch-free experience.

autopy numpy opencv pyautogui python3

Last synced: 10 Apr 2026

https://github.com/anubhavkumar31/simple-heart_disease_prediction-using-logisticregression

Its a simple yet good model which predicts if a person have heart disease or not. This is a binary classification model i.e its output is either 0(dont have heart disease) or 1 (have heart disease).

logistic-regression machine-learning numpy python sklearn sklearn-linear-model sklearn-metrics

Last synced: 10 Apr 2026

https://github.com/babagata/racunalna_fizika

Math and physics solved with python

matplotlib numpy random scipy sympy

Last synced: 10 Apr 2026

https://github.com/niteshchawla/logistics-nn-regression

The case study is about India's Largest Marketplace for Intra-City Logistics. This dataset has the required data to train a regression model that will do the delivery time estimation, based on all those features.

adam-optimizer data-visualization encoding exploratory-data-analysis feature-engineering hidden-layers hyperparameter-tuning keras-tensorflow kerastuner metrics neural-network numpy pandas regression relu scaling sequential-models

Last synced: 10 Apr 2026

https://github.com/azaz9026/data_cleaning

Welcome to the Data Cleaning repository! This collection is dedicated to showcasing techniques and methods for cleaning and preparing datasets for analysis.

data-analysis data-engineering data-structures data-visualization eda feature-engineering machine-learning numpy outliers pandas python seaborn

Last synced: 13 Apr 2026

https://github.com/ahmedabdalkreem/speech-recognition

In this project we deal with audio dataset to make classification between ten audio and make preprocssing in it using Librosa and Scipy then be input of Neural Network Model.

librosa matplotlib neural-network numpy pandas python3 scipy sklearn tensorflow

Last synced: 11 Apr 2026

https://github.com/matx64/birads-classifier

Breast Imaging Report and Data System (BIRADS) with Graphical User Interface.

numpy pillow skimage sklearn tkinter

Last synced: 20 Apr 2026

https://github.com/asghar-rizvi/youtube-statistics-project

This project analyzes a dataset of global YouTube statistics to uncover insights about YouTube channels, their ranks, and other attributes. The dataset used for this analysis was obtained from Kaggle.

data-analysis data-analysis-python data-science data-science-projects matplotlib numpy pandas pycharm-ide python seaborn

Last synced: 13 Jun 2026

https://github.com/navindafernando/feature-extraction

Heart Risk Level Predicting Regression Model & Web using Feature Engineering and Data Preprocessing :baby_chick:

categorical-encoding feature-engineering flask handling-outlier html5 joblib label-encoding machine-learning numpy pandas polynomial-features quantile-transformer scaling

Last synced: 10 Apr 2026

https://github.com/crvargasm/moore-penrose-matrix

A small implementation of an algorithm that is capable of computing the Moore-Penrose inverse of any matrix.

numerical-methods numpy python

Last synced: 10 Apr 2026

https://github.com/omarsaad21/rfm-clustering-

A full Data science and deployment project focusing on Data analysis and ML ( create a customer segmentation model to recommend the best merchants for each user as targetted offers)

business-solutions data-science eda numpy pandas plotly python sickit-learn streamlit

Last synced: 11 Apr 2026

https://github.com/soumyapro/wine-quality-prediction

This project is about the prediction of wine quality using machine learning algorithms

boxplot matplotlib numpy pandas random-forest smote

Last synced: 10 Apr 2026

https://github.com/mnitin-reddy/collaborative-filtering-based-recommendation-system

This project is a Book Recommendation System that uses two main approaches: Popularity-Based and Collaborative Filtering. It recommends top books based on their rating frequency and average ratings, and also provides personalized book suggestions by analyzing user interactions.

collaborative-filtering numpy pandas popularity-based-recommendation python recommendation-system scikit-learn

Last synced: 11 Apr 2026

https://github.com/farhad-here/data-visualization-analysis-dva

This is my data analysis project. Users can use this project to clean and preprocessing the date or data visualization. Individuals can impute or ecnode ther dataset.

altair bokeh data-analysis data-analysis-python io matplotlib numpy pandas plotly python sklearn streamlit

Last synced: 11 Apr 2026

https://github.com/hansalemaos/hexarray2decimal

Converts a numpy string array with hex values to int

convert hex int numpy python

Last synced: 05 May 2026

https://github.com/pardhuu66/college-id-validator

FastAPI-based offline College ID Validator with Docker support

base64 dnn docker easyocr fastapi mobilenetv2 numpy onnx onnxruntime opencv pillow pydantic python tensorflow uvicorn

Last synced: 11 Apr 2026

https://github.com/ahmed-maher77/diabetes-prediction-app-using-machine-learning

Diabetes Prediction: Using machine learning to classify individuals as diabetic or non-diabetic based on health data, enabling early intervention and improved healthcare outcomes.

ai css data-science gradientboostinclassifier javascript logisticregression machine-learning matplotlib numpy pandas python randomforestclassifier seaborn streamlit supportvectormachine webdevelopment

Last synced: 11 Apr 2026

https://github.com/dmarks84/coursework_project_apache-airflow-kafka-on-toll-booth-data

Project for IBM Data Engineering & Python course on ETL & Big Data -- Read in live toll booth data, wrangles and transformed, and wrote into a SQL database

apache-airflow apache-kafka automation dags data-modeling databases eda elt etl mysql numpy pandas pipelines python sql

Last synced: 11 Apr 2026

https://github.com/kirtipratihar/python_libraries_for_ds

This repository serves as a comprehensive guide to Python programming for Data Science. It covers essential topics like data manipulation, data visualization, machine learning, and statistical analysis using popular libraries such as Pandas, NumPy, Matplotlib, Seaborn, and Scikit-Learn.

artificial-intelligence machine-learning numpy pandas python scikit-learn tensorflow

Last synced: 11 Apr 2026

https://github.com/alexixrugis/perceptronvisualization

Visualization of training and operation of a perceptron written from scratch in numpy

ai machine-learning numpy python

Last synced: 11 Feb 2026

https://github.com/vidushibhadana/eda-on-nyc-taxi-data

About Conducting an Exploratory Data Analysis (EDA) on New York City taxi data and visualizing it through countplots, distribution plots (displot), and histograms using Python and it's libraries.

data data-visualization jupyter-notebook matplotlib numpy pandas python seaborn

Last synced: 11 Apr 2026

https://github.com/varkenvarken/blempy

small, safe utilities to efficiently transfer Blender property-collection attributes (e.g. vertex coordinates) to/from NumPy arrays and perform vectorized operations with minimal Python overhead.

blender numpy

Last synced: 13 Jan 2026

https://github.com/dhanish03/credit_card_fraud_detection

Developed and implemented an advanced CCFDS using ML algorithms and pattern recognition techniques. Integrated real-time monitoring and adaptive learning capabilities into the system to dynamically adjust fraud detection parameters, ensuring effectiveness in identifying emerging fraud patterns.

kaggle-dataset numpy pandas-dataframe python3 sklearn

Last synced: 16 Apr 2026

https://github.com/mrktsm/spam-email-recognizer

Long Short-Term Memory (LSTM) network trained to classify emails as spam or non-spam. It processes email content to make accurate predictions and can be integrated into projects for efficient spam detection and email management.

data-preprocessing keras lstm-neural-network model-architecture nltk numpy pandas performance-evaluation scikit-learn spam-classification-model tenserflow training-the-model

Last synced: 09 Apr 2026

https://github.com/lmizner/grokking_data_science

Coding practice for basic data science interview questions in Python

data-science numpy pandas python scikit-learn

Last synced: 11 Apr 2026

https://github.com/ksharma67/anomaly-detection-on-temperature-device-failure

A typical anomaly detection task and performing KMeans, PCA, Gaussian distribution, and Isolation Forest.

eda ellipticenvelope feature-engineering gaussian-distribution isolation-forest kmeans-clustering numpy pca python sklearn

Last synced: 11 Apr 2026

https://github.com/gregoritsch3/ml_eda_clustering_aidassessment

An EDA and Machine Learning Clustering exercise on the Country Aid Assessment dataset demonstrating the use of PCA, KMeans and DBSCAN clustering, Elbow Methods, etc. The clustering algorithm successfully demarcates countries that are in most dire need of aid based on their GDPP and Child Mortality rate.

anova dbscan kmeans machine-learning matplotlib numpy pandas pca scikit-learn seaborn statistics

Last synced: 16 Apr 2026

https://github.com/ngangawairimu/data-validation-using-python

Agricultural dataset validated using python code for usage. Building a data pipeline that will ingest and clean data with the press of a button.

jupyter-notebook numpy pandas pytest python

Last synced: 13 Apr 2026

https://github.com/eljandoubi/genre_classification

Create an ML pipeline for Genre Classification using MLflow.

hydra machine-learning mlflow numpy pandas pandas-profiling pytest scikit-learn scipy wandb

Last synced: 11 Apr 2026

https://github.com/audy21/datacamp

Learning portfolio documenting my progress, while taking Data Analyst & Data Science certifications from DataCamp.

data-analysis data-science machine-learning matplotlib numpy pandas python scikit-learn seaborn

Last synced: 11 Apr 2026

https://github.com/nishi1612/knight-tour-problem

IT485 Logic of Inference Project on Knight's Tour. A hamiltonian path problem to determine path of knight to travel entire chessboard with every cell visited only once.

backtracking-algorithm bootstrap flask html knights-tour localhost numpy pygame python tkinter warnsdorff

Last synced: 11 Apr 2026

https://github.com/arthurdsant/dataanalysis-agricultural_raw_material

This Python project performs analysis and visualization of agricultural raw material price data using a Kaggle dataset. Based on Jupiter Notebook and Python.

jupyter-notebook matplotlib numpy pandas python seaborn

Last synced: 26 Jan 2026

https://github.com/mgitrov/lifespan-x-population-status

A machine learning project aiming to predict animals' lifespan and population status.

bs4 data-science machine-learning matplotlib numpy pandas python regular-expressions requests seaborn sklearn

Last synced: 11 Apr 2026

https://github.com/winterwind/ecg_signal_classification

Two-part project that involves detecting the R-peaks in an ECG signal to extract the individual ECG beats and making a machine learning model to classify them

csv csv-files data-science decision-trees ecg ecg-classification ecg-signal jupyter jupyter-notebook knearest-neighbors knn machine-learning matplotlib matplotlib-pyplot numpy pandas pyplot python random-forest scipy

Last synced: 11 Apr 2026

https://github.com/amanda-ucc/market-tracker

Automated portfolio creator that generates a portoflio which mimics a bench mark index by minimizing the tracking error.

beta covariance numpy pandas scipy sharpe-ratio tracking-error variance yfinance

Last synced: 11 Apr 2026

https://github.com/egorumaev/2023-cirrhosis-outcomes

Прогнозирование исхода лечения пациентов с циррозом печени

catboost imblearn iqr lda matplotlib numpy pandas pca phik pipeline sklearn t-sne xgboost

Last synced: 08 May 2026

https://github.com/dastogirrudro/machine-learning-and-deep-learning

This is my thesis project which i have done in varsity.Here i used machine learning and deep learning i used LSTM as deep learning.This can identify aggresive spam message. Here i used pandas scikit-learn and many more framework i used python as a programming language.I used many algorithm for highering the accuracy of my project.

deep-learning lstm machine-learning numpy pandas python scikit-learn

Last synced: 11 Apr 2026

https://github.com/erikbrinkman/hilbert-bytes

A python library for converting between d-dimensional points and indices on a hilbert curve

hilbert-curve numba numpy python

Last synced: 08 May 2025

https://github.com/udhaya2823/dataspark-illuminating-insights-for-global-electronics

✨DataSpark✨ is a powerful analytics project transforming raw retail data into actionable insights for Global Electronics. By leveraging Python, SQL, and interactive visualizations, it uncovers trends in customer behavior, sales performance, and product popularity, driving smarter business decisions and boosting growth.

data-science data-visualization database-management datacleaning exploratory-data-analysis matplotlib numpy pandas powerbi python seaborn sql version-control

Last synced: 11 Apr 2026

https://github.com/shwetapardhi/assignment-03-q1--hypothesis-testing

Q1.A F&B manager wants to determine whether there is any significant difference in the diameter of the cutlet between two units. A randomly selected sample of cutlets was collected from both units and measured? Analyze the data and draw inferences at 5% significance level. Please state the assumptions and tests that you carried out to check validit

hypothesis-testing numpy p-value pandas python scipy significance-testing stats t-test

Last synced: 11 Apr 2026

https://github.com/shreyasmehta05/sortsync

A custom sorting algorithm combining parallel merge and count sort, with detailed performance comparisons against standard sorting techniques.

c matplotlib numpy python3

Last synced: 04 Feb 2026

https://github.com/mani-prakash-n-r/stock_market_prediction_system

This project uses LSTM networks to predict stock prices based on historical data, providing insights for informed investment decisions..LSTM, NumPy, Scikit-Learn, Matplotlib, yfinance and TensorFlow

lstm matplotlib numpy python sckiit-learn tensorflow yfinance

Last synced: 11 Apr 2026

https://github.com/saliola/nonnegative_integer_matrices

code to generate and count nonnegative integer matrices with prescribe row and column sums (aka contingency tables)

cython cython-examples numpy numpy-examples python3

Last synced: 18 Apr 2026

https://github.com/alexsolov28/ml_course

Курс "Технология машинного обучения"

colab-notebooks jupyter-notebook matplotlib numpy pandas python scikit-learn seaborn

Last synced: 05 Apr 2025

https://github.com/project54321/stocksniper

"An interactive Stock Dashboard that visualizes historical prices, calculates key financial metrics, displays fundamental statements, and aggregates the latest news for any ticker — all in a sleek, easy-to-use Streamlit interface."

alphavantage finance numpy pandas plotly pypi python stock-market stocknews streamlit

Last synced: 11 Apr 2026

https://github.com/kkufieta/carnd-vehicle-detection

Vehicle Detection & Tracking - Self Driving Cars

cv2 hog-features numpy self-driving-car sklearn vehicle-detection

Last synced: 11 Apr 2026

https://github.com/cfbastarz/jupyternotebooks

A collection of several Jypyter notebooks.

dask matplotlib numpy python xarray xesmf

Last synced: 18 Jan 2026

https://github.com/anubagre/predicting-price-of-pre-owned-cars

This project is solution to data science problem based on regression model to predict the price of pre owned cars. Programming language used is Python.

data-science machine-learning numpy pandas price-prediction python seaborn sklearn streamlit

Last synced: 11 Apr 2026

https://github.com/mramshaw/intro-to-ml

Intro to Machine Learning - Pattern Recognition for Fun and Profit

machine-learning matplotlib ml numpy pandas pip pip3 python scikit-learn scipy seaborn seaborn-plots sklearn statsmodels tensorflow weka

Last synced: 11 Apr 2026