An open API service indexing awesome lists of open source software.

NumPy

NumPy is an open source library for the Python programming language, adding support for large, multidimensional arrays, and matrices, along with a large collection of high-level mathematical functions to operate on these arrays.

https://github.com/dulajkavinda/breast-cancer-detect

🧬 breast cancer classification using machine learning

added machinelearning matplotlib numpy pandas python

Last synced: 07 May 2026

https://github.com/williyam-m/sentiment_analysis

A Django-based Sentiment Analysis web app leveraging the power of the BERT transformer model to classify text sentiment accurately.

beautifulsoup bert django numpy pandas python torch transformers

Last synced: 20 Jun 2026

https://github.com/matteopolak/chatterbox

RNN-based chat message generator.

numpy pandas python rnn tensorflow text-generation

Last synced: 07 May 2026

https://github.com/ayushmankumar7/siamese-network-with-one-shot-learning

Implementation of this paper - https://www.cs.cmu.edu/~rsalakhu/papers/oneshot1.pdf

numpy tensorflow tf-keras

Last synced: 07 May 2026

https://github.com/pointer2alvee/machine-learning-algorithms-scratch

Core Machine Learning concepts , algorithms & maths implemented completely from scratch using Python & numpy only.

machine-learning-algorithms numpy pandas probabilistic-machine-learning python statistical-models

Last synced: 07 May 2026

https://github.com/salvatoreamaddio/powerbi

This project combines both Python and PowerBI technologies. It cleans a dataset and imports it into a PowerBI file.

numpy pandas pandas-python powerbi python

Last synced: 07 May 2026

https://github.com/ifimust/room_generator

Room generator service for 2D game levels

flask google-app-engine microservice numpy procedural-generation python

Last synced: 07 May 2026

https://github.com/pereira-elizabeth/aah-2d-hermitian

Minimal 2D AAH model: build, diagonalize, and plot spectrum (NumPy/Matplotlib)

condensed-matter numpy physics scipy tight-binding

Last synced: 07 May 2026

https://github.com/christabelsakyi/sentiment_analysis

This project involves analyzing customer reviews to classify them as positive or negative using Logistic Regression. The workflow includes text preprocessing, feature extraction, training a model, making predictions, and evaluating its performance. Dataset

imblearn machine-learning nltk numpy python sklearn

Last synced: 07 May 2026

https://github.com/aimin-nur/dataanalyst-cohortanalysis

Sebuah project Data Analayst untuk mengetahui bagaimana pola pembelian pelanggan dalam periode waktu tertentu (Cohort Analysis)..

cohort-analysis numpy pandas pytz

Last synced: 07 May 2026

https://github.com/4strium/predict-french-tv-audience

🎬 A beginner's machine learning project whose aim is to predict the audience for films on TV in France, based on previous data provided by the CNC and Médiamétrie.

machine-learning numpy pandas python tv xgboost

Last synced: 07 May 2026

https://github.com/navid-kalaei/numpy-neural-network

Neural network implemented in Numpy to predict bike sharing on real data

deep-learning neural-network numpy python

Last synced: 07 May 2026

https://github.com/reddyprasade/bird-classifications-problem

Birds Classification by using Scikit-learn and Scikit-image

matplotlib numpy pandas python-3 sckiit-learn sckit-image

Last synced: 08 May 2026

https://github.com/djdurga/predictive_analysis_in_diabetes

This project applies Logistic Regression to predict diabetes in patients using the Pima Indians Diabetes Dataset. It covers the full data science lifecycle — from data imputation and exploration to model training, evaluation, and insights.

matplotlib numpy pandas

Last synced: 08 May 2026

https://github.com/azaz9026/python_library_numpy

NumPy is a Python library used for working with arrays. It also has functions for working in domain of linear algebra, fourier transform, and matrices...

array jupyter-notebooks numpy python3

Last synced: 08 May 2026

https://github.com/dhruvil-26/python-projects

This repository contains Python projects showcasing data analysis and visualization. 1. IMDB Movie Analysis: Analyzing movie trends, genres, and ratings. 2. Loan Default Analysis EDA: Exploring factors contributing to loan defaults.

eda imdb-dataset loan-default-analysis matplotlib numpy pandas python seaborn visualization

Last synced: 08 May 2026

https://github.com/zuecopio/vco_2425

Computer Vision (VCO) subject practices

cv2 matplotlib numpy opencv pilllow

Last synced: 08 May 2026

https://github.com/niteshchawla/nc-walmart-casestudy

The Management team at Walmart Inc. wants to analyze the customer purchase behavior (specifically, purchase amount) against the customer’s gender and the various other factors to help the business make better decisions. They want to understand if the spending habits differ between male and female customers:.

central-limit-theorem confidence-intervals matplotlib-pyplot numpy pandas python scipy-stats seaborn stats

Last synced: 08 May 2026

https://github.com/rohansoni45/ipl-win-predictor

A machine learning model that predicts the outcome of IPL matches based on historical data and player performance. Built using Python and libraries like Pandas, NumPy, and scikit-learn. This project demonstrates the use of classification algorithms to predict match outcomes.

classification data-science ipl machine-learning model-evaluation numpy pandas python sklearn sports-analytics

Last synced: 08 May 2026

https://github.com/alz-11/ftoc

Fahrenheit to Celsius conversion analysis, comparing accurate and approximate methods with visualizations and error analysis.

approximation celsius conversion data-visualization error-analysis fahrenheit fahrenheit-to-celsius jupyter jupyter-notebook matplotlib numpy python python-3 python3

Last synced: 08 May 2026

https://github.com/moustafamohamed01/data-analysis-ecommerce-purchases

Data Analysis : Analyzing (Ecommerce Purchases) dataset from Kaggle

jupiter-notebook numpy pandas python

Last synced: 08 May 2026

https://github.com/leandro-bertoluzzi/parchis-probabilities

Small weekend project to play around with numpy and help me taking decissions while playing parchís

matplotlib numpy python

Last synced: 08 May 2026

https://github.com/sahil210695/numpy-basics

A brief overview of NumPy basics

data-science numpy numpy-tutorial python

Last synced: 08 May 2026

https://github.com/takana671/fluidcube

Particle Movement Using Curl Noise

curl-noise cython numpy panda3d python

Last synced: 22 Jun 2026

https://github.com/chdl17/credit-eda-casestudy

Credit EDA is a GitHub repository designed to provide educational materials and tools for exploring credit risk data. The repository contains notebooks, datasets, and code samples to help users learn and apply these concepts to credit data.

exploratory-data-analysis numpy pandas

Last synced: 08 May 2026

https://github.com/satvikpraveen/numpymasterpro

A hands-on, production-ready toolkit to master NumPy — from first principles to real-world applications. Includes modular Jupyter notebooks, reusable utility scripts, cheatsheets, and advanced projects like K-Means clustering from scratch.

broadcasting data-analysis data-science data-source data-visualization jupyter-notebook kmeans-clustering linear-algebra machine-learning matrix-algebra numerical-computation numpy numpy-broadcasting numpy-examples numpy-tutorial open-source python scientific-computing standardization vectorization

Last synced: 08 May 2026

https://github.com/vishnu-vamshii/heart-disease-prediction-using-ml

This project presents an end-to-end data analysis and machine learning pipeline for predicting heart disease using a publicly available dataset. The project includes data exploration, visualization, and implementation of various machine learning models to predict the likelihood of heart disease based on a set of clinical attributes.

machine-learning matplotlib numpy pandas python seaborn sklearn

Last synced: 11 Apr 2026

https://github.com/eljandoubi/genre_classification

Create an ML pipeline for Genre Classification using MLflow.

hydra machine-learning mlflow numpy pandas pandas-profiling pytest scikit-learn scipy wandb

Last synced: 11 Apr 2026

https://github.com/audy21/datacamp

Learning portfolio documenting my progress, while taking Data Analyst & Data Science certifications from DataCamp.

data-analysis data-science machine-learning matplotlib numpy pandas python scikit-learn seaborn

Last synced: 11 Apr 2026

https://github.com/nishi1612/knight-tour-problem

IT485 Logic of Inference Project on Knight's Tour. A hamiltonian path problem to determine path of knight to travel entire chessboard with every cell visited only once.

backtracking-algorithm bootstrap flask html knights-tour localhost numpy pygame python tkinter warnsdorff

Last synced: 11 Apr 2026

https://github.com/arthurdsant/dataanalysis-agricultural_raw_material

This Python project performs analysis and visualization of agricultural raw material price data using a Kaggle dataset. Based on Jupiter Notebook and Python.

jupyter-notebook matplotlib numpy pandas python seaborn

Last synced: 26 Jan 2026

https://github.com/mgitrov/lifespan-x-population-status

A machine learning project aiming to predict animals' lifespan and population status.

bs4 data-science machine-learning matplotlib numpy pandas python regular-expressions requests seaborn sklearn

Last synced: 11 Apr 2026

https://github.com/rahatmoktadir03/customer-churn-prediction

A machine learning project for predicting customer churn, enabling businesses to identify at-risk customers and develop retention strategies.

business-analytics customer-churn-prediction data-science google-colab machine-learning numpy pandas python sklearn strreamlit xception-model

Last synced: 11 Apr 2026

https://github.com/chintanboghara/rocket-simulation

A comprehensive web-based orbital mechanics simulator with advanced mission planning, real-time tracking, and educational features.

docker flask html javascript numpy plotly python

Last synced: 11 Apr 2026

https://github.com/nachtfeuer/covid19

Python script(s) for visualizing corona data

csv json matplotlib numpy pandas python requests tkinter

Last synced: 05 May 2026

https://github.com/saurabh274/aerofit-statistics-and-probability

The market research team at AeroFit wants to identify the characteristics of the target audience for each type of treadmill offered by the company, to provide a better recommendation of the treadmills to the new customers. The team decides to investigate whether there are differences across the product with respect to customer characteristics.

matplotlib-pyplot numpy pandas probability python seaborn statistics

Last synced: 08 May 2026

https://github.com/badranalyst/student-tests-data-analysis-application

Python-based analysis of student test scores in math, reading, and writing, examining correlations with parental education, lunch type, and test preparation. Includes data cleaning, visualization, and statistical insights into factors influencing academic performance.

data-analysis data-visualization dataset matplotlib numpy pandas python sklearn

Last synced: 05 May 2026

https://github.com/amanda-ucc/market-tracker

Automated portfolio creator that generates a portoflio which mimics a bench mark index by minimizing the tracking error.

beta covariance numpy pandas scipy sharpe-ratio tracking-error variance yfinance

Last synced: 11 Apr 2026

https://github.com/egorumaev/2023-cirrhosis-outcomes

Прогнозирование исхода лечения пациентов с циррозом печени

catboost imblearn iqr lda matplotlib numpy pandas pca phik pipeline sklearn t-sne xgboost

Last synced: 08 May 2026

https://github.com/dastogirrudro/machine-learning-and-deep-learning

This is my thesis project which i have done in varsity.Here i used machine learning and deep learning i used LSTM as deep learning.This can identify aggresive spam message. Here i used pandas scikit-learn and many more framework i used python as a programming language.I used many algorithm for highering the accuracy of my project.

deep-learning lstm machine-learning numpy pandas python scikit-learn

Last synced: 11 Apr 2026

https://github.com/gregoritsch3/ml_eda_clustering_aidassessment

An EDA and Machine Learning Clustering exercise on the Country Aid Assessment dataset demonstrating the use of PCA, KMeans and DBSCAN clustering, Elbow Methods, etc. The clustering algorithm successfully demarcates countries that are in most dire need of aid based on their GDPP and Child Mortality rate.

anova dbscan kmeans machine-learning matplotlib numpy pandas pca scikit-learn seaborn statistics

Last synced: 16 Apr 2026

https://github.com/bhawnamehbubani/advanced-zomato-restaurant-clustering-and-sentiment-analysis

This project leverages this data to perform clustering and sentiment analysis, providing actionable insights for customers and business growth.

matplotlib-pyplot nltk numpy pandas seaborn sklearn wordcloud

Last synced: 05 May 2026

https://github.com/udhaya2823/dataspark-illuminating-insights-for-global-electronics

✨DataSpark✨ is a powerful analytics project transforming raw retail data into actionable insights for Global Electronics. By leveraging Python, SQL, and interactive visualizations, it uncovers trends in customer behavior, sales performance, and product popularity, driving smarter business decisions and boosting growth.

data-science data-visualization database-management datacleaning exploratory-data-analysis matplotlib numpy pandas powerbi python seaborn sql version-control

Last synced: 11 Apr 2026

https://github.com/andersoncrs/prediccion_precio_vehiculos_statsmodels

Este proyecto utiliza un modelo de regresión lineal para predecir el precio de vehículos basándose en sus características principales. El análisis incluye la definición del problema, exploración y limpieza de datos, conversión de variables categóricas a numéricas, evaluación de correlaciones y entrenamiento del modelo.

analisis-de-datos analisis-exploratorio-de-datos matplotlib numpy seaborn statsmodels visualizacion-de-datos

Last synced: 26 Apr 2026

https://github.com/ksharma67/anomaly-detection-on-temperature-device-failure

A typical anomaly detection task and performing KMeans, PCA, Gaussian distribution, and Isolation Forest.

eda ellipticenvelope feature-engineering gaussian-distribution isolation-forest kmeans-clustering numpy pca python sklearn

Last synced: 11 Apr 2026

https://github.com/shwetapardhi/assignment-03-q1--hypothesis-testing

Q1.A F&B manager wants to determine whether there is any significant difference in the diameter of the cutlet between two units. A randomly selected sample of cutlets was collected from both units and measured? Analyze the data and draw inferences at 5% significance level. Please state the assumptions and tests that you carried out to check validit

hypothesis-testing numpy p-value pandas python scipy significance-testing stats t-test

Last synced: 11 Apr 2026

https://github.com/shreyasmehta05/sortsync

A custom sorting algorithm combining parallel merge and count sort, with detailed performance comparisons against standard sorting techniques.

c matplotlib numpy python3

Last synced: 04 Feb 2026

https://github.com/sufiyan1803/student-result-analysis

This project explores student exam data to find out what affects their scores. It covers everything from cleaning the data to analyzing score distributions, the effects of parental education and marital status, and comparisons by gender and lunch type. It’s a great way to see how different factors influence student performance.

matpolotlib numpy pandas python

Last synced: 03 Feb 2026

https://github.com/lijesh010/ml_project_data_preprocessing

The main objective of this project is to design and implement a robust data preprocessing system that addresses common challenges such as missing values, outliers, inconsistent formatting, and noise. By performing effective data preprocessing, the project aims to enhance the quality, reliability, and usefulness of the data for machine learning.

data-cleaning data-exploration data-preprocessing machine-learning numpy pandas-python python scikit-learn

Last synced: 11 Apr 2026

https://github.com/alexsolov28/ml_course

Курс "Технология машинного обучения"

colab-notebooks jupyter-notebook matplotlib numpy pandas python scikit-learn seaborn

Last synced: 05 Apr 2025

https://github.com/riju18/from-data-production-to-client-handover

The common tedious problem is to build a data app to demonstrate the data analysis & analytics along with Machine Learning to a client. It was an attempt to do it on small scale in the most powerful & simplest way.

machine-learning matplotlib numpy pandas plotly python seaborn streamlit

Last synced: 30 Apr 2026

https://github.com/cfbastarz/jupyternotebooks

A collection of several Jypyter notebooks.

dask matplotlib numpy python xarray xesmf

Last synced: 18 Jan 2026

https://github.com/chanmeng666/advanced-neural-network-applications

Practical implementations of perceptron and linear neuron models for classification and regression, with mathematical analysis and visualizations in Jupyter notebooks.

classification data-analysis data-science educational gradient-descent jupyter-notebook linear-neuron machine-learning matplotlib neural-network neural-networks numpy perceptron python regression

Last synced: 03 May 2026

https://github.com/yoavram/numpycharm

Some examples on using PyCharm with Scientific Python

cython numpy pycharm python scientific-computing training-materials

Last synced: 11 Apr 2026

https://github.com/anubagre/predicting-price-of-pre-owned-cars

This project is solution to data science problem based on regression model to predict the price of pre owned cars. Programming language used is Python.

data-science machine-learning numpy pandas price-prediction python seaborn sklearn streamlit

Last synced: 11 Apr 2026

https://github.com/ikbalcaus/HandSketch

Drawing on Canvas with Hand Gestures + AI for Letter Recognition

mediapipe numpy ocr-recognition opencv python pytorch tkinter

Last synced: 31 Mar 2025

https://github.com/dhanish03/credit_card_fraud_detection

Developed and implemented an advanced CCFDS using ML algorithms and pattern recognition techniques. Integrated real-time monitoring and adaptive learning capabilities into the system to dynamically adjust fraud detection parameters, ensuring effectiveness in identifying emerging fraud patterns.

kaggle-dataset numpy pandas-dataframe python3 sklearn

Last synced: 16 Apr 2026

https://github.com/lohiyah/real-estate-price-forecast

A Python-based app predicting real estate prices using machine learning. Built with Pandas, NumPy, Scikit-learn, Matplotlib, and Seaborn for data processing and visualization, and Flask for the web interface.

flask matplotlib numpy pandas python3 scikit-learn seaborn

Last synced: 11 Apr 2026

https://github.com/akhileshthite/india-population

ML (simple linear regression) model for predicting India's population.

machine-learning numpy pandas python scikit-learn

Last synced: 09 Apr 2026

https://github.com/diegorezm/spc

Analise de espectros de Infravermelho por Transformada de Fourier (FTIR)

ftir-data-analysis numpy python

Last synced: 21 Apr 2026

https://github.com/pavanreddy565/infosys_python_for_datascience

Data Science Course Tasks Repository This repository contains tasks and projects completed during the Data Science course from Infosys Springboard. It includes setup instructions for running Jupyter Notebook sessions in a virtual environment.

matplotlib numpy pandas

Last synced: 29 Apr 2026

https://github.com/faizantkhan/numpy

NumPy Practice Repository Dive into the world of numerical computing with our collection of NumPy exercises. Perfect for beginners and seasoned programmers alike, this repository offers a range of challenges to sharpen your array manipulation and data analysis skills.

data-science library machine-learning numpy numpy-arrays numpy-library numpy-python outlier outlier-detection python pythonlibrarires tutorial

Last synced: 21 May 2026

https://github.com/mborne/numpy-experiments

[DEV] Some basic examples written to get started with numpy and pytorch

autograd basics get-started least-squares numpy torch

Last synced: 05 May 2026

https://github.com/timothyjan/intro-machine-learning-classifiers

We will use the scikit-learn library, which is a higher-level machine learning library that will work with NumPy data, and Pandas, a library that makes it easier to manipulate data. We will explore a variety of classification algorithms, and compare their performance on a “real-world” dataset, which will introduce its own set of challenges.

numpy pandas python scikit-learn

Last synced: 11 Apr 2026

https://github.com/pramodyasahan/model-selection

This repository explores and compares different regression models for predicting continuous outcomes. This repository includes implementations and evaluations of five key regression models. The primary goal is to demonstrate how each model works, evaluate their performance using R-squared values, and guide users in selecting the best model.

machine-learning modelselection numpy pandas python regression scikit-learn

Last synced: 08 Mar 2025

https://github.com/alexixrugis/perceptronvisualization

Visualization of training and operation of a perceptron written from scratch in numpy

ai machine-learning numpy python

Last synced: 11 Feb 2026

https://github.com/abhi227070/wine-quality-prediction

The Wine Quality Prediction project utilizes machine learning to assess wine quality based on various parameters. It offers a user-friendly interface built with Streamlit for easy interaction. Deployed on AWS, it provides scalable access to quality predictions for wine batches.

classification-algorithm machine-learning machine-learning-algorithms machinelearning matplotlib numpy pandas python3 seaborn

Last synced: 11 Apr 2026

https://github.com/vishal-verma-96/pre-owned-car-price-prediction-using-streamlit-app

Capstone Project by skill Academy- Exploratory Analysis, Visualization and Prediction of Used Car Prices. Deploying the highest-scoring model with Streamlit web app

data-analysis data-science jupyter-notebook machine-learning machine-learning-algorithms matplotlib numpy pandas python3 regression-algorithms scikit-learn seaborn streamlit

Last synced: 11 Apr 2026

https://github.com/naveen88112/final_education

Student Performance Prediction This project examines the student performance data, pre-processes the features, and implements machine learning methods (Random Forest) for the forecasting of final grades. The data is subjected to exploratory data analysis (EDA) and feature engineering prior to model training and assessment.

data-preprocessing exploratory-data-analysis machine-learning model-evaluation numpy pandas python

Last synced: 11 Apr 2026

https://github.com/ayushtiwari134/machine_learning_models

A repo where i upload all the models which i train during my journey of learning Machine Learning from scratch

linear-regression logistic-regression machinelearning matplotlib numpy pandas python random-forest scikit-learn

Last synced: 11 Apr 2026

https://github.com/arnauog/ironhack_data_analytics

Weekly labs to practice everything taught in class during the Data Analytics bootcamp of Ironhack in Barcelona of Sept 2024 - March 2025.

machine-learning mongodb mysql numpy pandas python seaborn tableau webscraping

Last synced: 11 Apr 2026

https://github.com/adamouization/python-machine-learning-data-science-notes

:orange_book: Jupyter notebooks containing useful Python code and notes for general Machine Learning and Data Science projects.

data data-science data-visualization guide jupyter jupyter-notebook machine-learning matplotlib notes numpy pandas pandas-dataframe python seaborn

Last synced: 11 Apr 2026

https://github.com/linsanity03/football_prediction

ML project to predict goal from football events such as penalty, free kick.

jupyter-notebook numpy pandas pyspark python3 tensorflow

Last synced: 11 Apr 2026