An open API service indexing awesome lists of open source software.

NumPy

NumPy is an open source library for the Python programming language, adding support for large, multidimensional arrays, and matrices, along with a large collection of high-level mathematical functions to operate on these arrays.

https://github.com/supernovasatsangi23/modifying-biomarker-gene-identification-for-effective-cancer-categorization

A project that focuses on implementing a hybrid approach that modifies the identification of biomarker genes for better categorization of cancer. The methodology is a fusion of MRMR filter method for feature selection, steady state genetic algorithm and a MLP classifier.

dataset deep-learning deep-neural-networks feature-selection genetic-algorithm machine-learning machine-learning-algorithms mlp-classifier mrmr neural-network numpy pandas-dataframe python python3 scikit-learn scikit-learn-python tkinter-gui tkinter-python

Last synced: 05 May 2026

https://github.com/vikkiezdev/ai-global-index-analysis

This project analyzes the AI readiness of 62 countries using key indicators like government strategy, commercial activity, research, development, and infrastructure. Through data cleaning, EDA, and visualization, it identifies key drivers of AI adoption and competitiveness.

cleaning-data correlation-analysis eda matplotlib numpy pandas python3 seaborn statistical-analysis

Last synced: 06 May 2026

https://github.com/leavers/npblob

Transfer numpy array between Python and JavaScript losslessly

array javascript matrix numpy python streaming typescript

Last synced: 06 May 2026

https://github.com/abdelrahman-amen/active_learning_with_different_query_strategies

This project explores the implementation of active learning techniques, focusing on various query strategies to optimize the selection of informative data points for model training. It aims to reduce the amount of labeled data required while improving model performance, especially in scenarios with limited labeled data.

activelearning cuda entropy kldivergence margin numpy python pyto uncertainty

Last synced: 06 May 2026

https://github.com/iama02/vet-medicine-analysis

Analysis of 50,000+ veterinary medicine records with Pandas and NumPy, highlighting dosage forms, manufacturers, indications, and shelf life trends.

numpy pandas python

Last synced: 06 May 2026

https://github.com/billgewrgoulas/recommendation-systems

Algorithms for joke rating prediction using the joke data-set from Kaggle.

algorithm clustering collaborative-filtering machine-learning numpy pandas recommender-system scikit-learn scypi

Last synced: 06 May 2026

https://github.com/sankaran-s2001/us-traffic-accidents-analysis-python-eda

Exploratory data analysis of US traffic accidents from 2016-2023, analyzing patterns by time, location, weather, and severity using Python data science libraries.

data-analysis data-science data-visualization eda matplolib numpy pandas python

Last synced: 06 May 2026

https://github.com/abhinav330/customer-behavior-analysis-linear-regression

This repository explores customer behavior data for an NYC clothing company with both a mobile app and website. They want to understand which platform drives higher sales.

data-analysis data-science data-visualization eda exploratory-data-analysis jupyter jupyter-notebook linear-regression machine-learning machine-learning-algorithms machinelearning-python numpy pandas python regression-analysis

Last synced: 06 May 2026

https://github.com/mikma03/datascience_python_datacamp

DataScience with Python. Code and examples. Python libraries, including pandas, NumPy, Matplotlib, and many more.

data-analysis data-science datacamp datascience numpy pandas python

Last synced: 06 May 2026

https://github.com/sopheakchan/dull-razor-algorithm

DullRazor hair removal for dermoscopic skin images.

numpy opencv python

Last synced: 06 May 2026

https://github.com/naveen88112/genetic_algos_maze_solver

Genetic Algorithm Maze Solver This project utilizes a Genetic Algorithm (GA) to solve the optimization of a maze problem. It evolves the population of promising solutions to the best solution finding shortest path between start and end point using selection, crossover, and mutation.

deap-library genetic-algorithms matplotlib maze-solver numpy python

Last synced: 06 May 2026

https://github.com/swapnilvishwakarma/twitter_sentiment_dashboard

Using Streamlit and Python, I created interactive data dashboards and used Pandas to manipulate data in data science workflows. I also used Plotly to create interactive graphs.

dashboard machine-learning matplotlib numpy pandas plotly sentiment-analysis streamlit-webapp webapp wordcloud

Last synced: 06 May 2026

https://github.com/timeless-residents/handson-tensorflow-numpy

A practical guide to learning and using TensorFlow and NumPy for data manipulation and machine learning

data-science machine-learning numpy tensorflow tutorial

Last synced: 06 May 2026

https://github.com/kumar-o/customer-churn-prediction

πŸ” Predict customer churn using a synthetic dataset with advanced models and metrics to enhance business retention strategies and decision-making.

churn-prediction customer-churn-prediction customer-survival-analysis data-science factorization-machines flask-application gridsearchcv libsvm machine-learning model-evaluation numpy pipelines roc-auc scikit-learn seaborn smote survival-analysis xgboost4j

Last synced: 06 May 2026

https://github.com/lunanvarg/bid-data-cac22039

En este repositorio se vΓ© mi TP Integrador del programa Codo a Codo: Big Data

colab data-studio-google deepnote excel matplotlib mysql numpy pandas-library python seaborn

Last synced: 06 May 2026

https://github.com/sahilmate/gesturepainter

Virtual Painting Application

gesturepainter mediapi numpy opencv virtualpaint

Last synced: 06 May 2026

https://github.com/zotho/grav_euler

Euler gravity compute

gravity numpy python

Last synced: 06 May 2026

https://github.com/harmanveer-2546/student-performance-in-exam

Student performance analysis and prediction using datasets has become an essential component of modern education systems. With the increasing availability of data on student, schools and universities are using advanced analytics and machine learning algorithms to gain insights into student performance and predict future outcomes.

classification-report cluster dbscan dimensionality-reduction k-means-clustering labelling linear-regression logistic-regression numpy pandas pre-processing random-forest-classifier test-train-split unsupervised-machine-learning visualization

Last synced: 06 May 2026

https://github.com/abdulrahmanaymann/data-storage-and-information-retrieval

This is a Python project that performs tokenization, stop word removal, positional indexing, phrase query searching, term frequency-inverse document frequency (TF-IDF) calculation, cosine similarity computation, and document ranking.

cosine-similarity information-retrieval nltk numpy pandas postional-index python ranking stop-word-removal tf-idf tokenization

Last synced: 06 May 2026

https://github.com/kianaabrisham/svm-from-scratch

Linear SVM from scratch with hinge loss + decision boundaries

classification from-scratch fundamentals hinge-loss numpy optimization scikit-learn svm

Last synced: 07 May 2026

https://github.com/sriyaroy/coding-challenges

50 Machine Learning & Computer Vision challenges ranging from Easy to Hard to sharpen you ML engineering skills.

coding-challenges computer-vision interview-prep machine-learning numpy python pytorch

Last synced: 07 May 2026

https://github.com/jonad/titanic_survival

Basic decision tree model for Titanic dataset. Predict who will survive.

jupyter-notebook matplotlib numpy pandas python3 titanic-survival-prediction

Last synced: 07 May 2026

https://github.com/suhas-005/jovian-data-analysis-course-assignment

These are my assignments for Data Analysis : Zero to Pandas course by Jovian.ai

data-analysis data-analytics numpy pandas python

Last synced: 07 May 2026

https://github.com/ai-ahmed/bda-and-ds

Dual Course: EMC2 Data Scientist & Big Data Analytics & IBM Professional Data Science Course

analytics big-data data-science jupyter-notebook jupyterlab linear-algebra mathematics matplotlib ml numpy pandas probability seaborn sklearn statistics

Last synced: 07 May 2026

https://github.com/jonad/chicago_crime_analysis

Exploratory data analysis on Chicago Crime dataset.

jupyter-notebook matplotlib numpy pandas python3

Last synced: 07 May 2026

https://github.com/mgitrov/kulo-chatbot

A retrieval based chatbot aiming to simulate human-like conversation with the user via text messages on chat.

artificial-neural-networks bag-of-words deep-learning json keras lemmatization natural-language-processing nltk numpy tkinter tokenization

Last synced: 07 May 2026

https://github.com/alekiie/streamlit-dashboard

A dashboard that utilizes the power of streamlit charts to create intuitive and easy to understand charts for data visualization.

data-visualization matplotlib numpy pandas python3 streamlit

Last synced: 07 May 2026

https://github.com/ebadshabbir/bluff-detection-model-polynomial_regression-

This project predicts employee salaries based on position levels using Linear and Polynomial Regression models. It trains models with degrees 2, 3, and 4 on a dataset of job titles, position levels, and salaries, and predicts the salary for a position level of 6.5. The results are visualized to compare model performance.

jupyter-notebook linear-regression machine-learning matplotlib-pyplot multiple-linear-regression numpy polynomial-regression python

Last synced: 07 May 2026

https://github.com/shivangmathur1/simple-qlearning

Solving gym environments using Numpy Qtables and Q-learning

gym numpy qlearning

Last synced: 07 May 2026

https://github.com/pereira-elizabeth/aah-2d-hermitian

Minimal 2D AAH model: build, diagonalize, and plot spectrum (NumPy/Matplotlib)

condensed-matter numpy physics scipy tight-binding

Last synced: 07 May 2026

https://github.com/christabelsakyi/sentiment_analysis

This project involves analyzing customer reviews to classify them as positive or negative using Logistic Regression. The workflow includes text preprocessing, feature extraction, training a model, making predictions, and evaluating its performance. Dataset

imblearn machine-learning nltk numpy python sklearn

Last synced: 07 May 2026

https://github.com/arnie1x/massive-dataset-lab

This project involves basic data manipulation with JSON files, focusing on tasks related to data processing of the MASSive dataset and file management.

data-science numpy pandas pandas-dataframe python3

Last synced: 07 May 2026

https://github.com/prajakta1321/exoplanet-atmospheric-characterization-1

A machine learning project to classify exoplanets using light curve image data. Developed as part of the ML4SCI GSoC 2025 Test Task. Includes data processing, CNN-based model, and full report.

classification colab-notebook dbscan gsoc-2025 machine-learning-algorithms matplotlib-python ml numpy open-source pca-analysis python3 seaborn

Last synced: 07 May 2026

https://github.com/navid-kalaei/numpy-neural-network

Neural network implemented in Numpy to predict bike sharing on real data

deep-learning neural-network numpy python

Last synced: 07 May 2026

https://github.com/tinaland101/carcinoma-regression-project

The goal of this project is to analyze and visualize the results of a clinical study involving the treatment of squamous cell carcinoma (SCC) in mice. As a senior data analyst, you are tasked with preparing a technical report for Pymaceuticals, Inc.

linear-regression mathplotlib numpy pandas pyhton

Last synced: 08 May 2026

https://github.com/khushi130404/k_means

This repository showcases 2D, 3D, and custom K-Means clustering models with visualizations. It includes both Jupyter notebooks and Python scripts for ease of reproducibility.

clustering-algorithm k-means-clustering numpy plotly sklearn

Last synced: 08 May 2026

https://github.com/mttgdd/openfabmap-python3

Pybind11 and numpy array bindings to openFABMAP

cpp numpy openfabmap pybind11 python robotics vision

Last synced: 08 May 2026

https://github.com/akansharajput280799/data-exploration-and-visualization-for-streaming-platform

Data Analysis and Visualization for streaming platform to provide insights and recommendations to improve their userbase.

data-visualization excel exploratory-data-analysis matplotlib numpy python seaborn

Last synced: 08 May 2026

https://github.com/niteshchawla/nc-walmart-casestudy

The Management team at Walmart Inc. wants to analyze the customer purchase behavior (specifically, purchase amount) against the customer’s gender and the various other factors to help the business make better decisions. They want to understand if the spending habits differ between male and female customers:.

central-limit-theorem confidence-intervals matplotlib-pyplot numpy pandas python scipy-stats seaborn stats

Last synced: 08 May 2026

https://github.com/rohansoni45/ipl-win-predictor

A machine learning model that predicts the outcome of IPL matches based on historical data and player performance. Built using Python and libraries like Pandas, NumPy, and scikit-learn. This project demonstrates the use of classification algorithms to predict match outcomes.

classification data-science ipl machine-learning model-evaluation numpy pandas python sklearn sports-analytics

Last synced: 08 May 2026

https://github.com/antonio-f/simple-k-nearest-neighbors

Simple K-Nearest Neighbors using partial sort

101 k-nearest-neighbors knn numpy partial-sort python tutorial

Last synced: 08 May 2026

https://github.com/saurabh274/aerofit-statistics-and-probability

The market research team at AeroFit wants to identify the characteristics of the target audience for each type of treadmill offered by the company, to provide a better recommendation of the treadmills to the new customers. The team decides to investigate whether there are differences across the product with respect to customer characteristics.

matplotlib-pyplot numpy pandas probability python seaborn statistics

Last synced: 08 May 2026

https://github.com/shingiraibhengesa/house-price-predictor

A machine learning project that predicts house prices based on user input features such as square footage, number of bedrooms, and more.

machine-learning-models matplotlib numpy python scikit-learn seaborn

Last synced: 09 May 2026

https://github.com/raghavendranhp/predictive-analytics-in-automotive-manufacturing-testing-time-optimization

Explore machine learning for automotive testing optimization. Predictive analytics to reduce testing time and environmental impact.

elasticnet extratreesregressor fastica model-pipeline numpy pandas pca randomforestregressor svr-regression-prediction xgboost

Last synced: 09 May 2026

https://github.com/sameerjain/ai-snake-game

Implemented a Q-learning agent to play and learn the classic Snake game, leveraging PyTorch for Neural Network. Visualized using Matplotlib, showcasing agent's evolving performance over time. Built UI using Pygame to visualize Snake, including food, and current score.

anaconda conda matplotlib neural-network numpy pygame python3 pytorch q-learning

Last synced: 09 May 2026

https://github.com/rishieeee/spam-email-classifier

A simple machine learning project that classifies emails as spam or ham using TF-IDF and a Multinomial Naive Bayes model. The project covers data cleaning, text preprocessing, feature extraction, model training, and evaluation. A great beginner-friendly introduction to NLP and ML workflows.

multinomial-naive-bayes numpy pandas python sckit-learn tf-idf

Last synced: 09 May 2026

https://github.com/michaelzheng67/real_estate_ml

AI experiment developed to run against simulated real estate and investing environment

ai numpy python pytorch

Last synced: 09 May 2026

https://github.com/liannekiarra/university-dissertation

Combining Machine Learning Techniques with Multi-Objective evolutionary Algorithms to Solve Real World Engineering Problems

artificial-intelligence engineering hypervolume matplot multi-objective-optimization neuralnetwork nsga-ii numpy pareto-front pareto-optimality pymoo python3 sklearn

Last synced: 09 May 2026

https://github.com/anker661/autograd-playground

A simple Numpy & Python based auto differentiation system that supports visualization before and after back-propagation.

auto-differentiation autograd computaion-graph jupyter-notebook numpy python3

Last synced: 09 May 2026

https://github.com/arjunravi26/selenium

This repository contains code from my learning about Selenium for web scraping and automation.

numpy python selenium time undetected-chrome webdriver

Last synced: 09 May 2026

https://github.com/tim-w-james/python-data-reference-code

Python reference code for data visualization, machine learning and SQL

data-science jupyter-lab numpy python

Last synced: 10 May 2026

https://github.com/charlenry/python_data_science

Mes notebooks de travaux pratiques sur Python pour la Data Science

analysis data dataframe jupyter kaggle matplotlib notebook numpy pandas pyplot python science seaborn visualisation

Last synced: 25 Jun 2026

https://github.com/sahaavi/color-detection

Colour detection is the process of detecting the name of any color. Well, for humans this is an extremely easy task but for computers, it is not straightforward. Human eyes and brains work together to translate light into color. Light receptors that are present in our eyes transmit the signal to the brain. Our brain then recognizes the color. Since childhood, we have mapped certain lights with their color names. We will be using the somewhat same strategy to detect color names.

color-detection color-detector numpy opencv-python pandas

Last synced: 10 May 2026

https://github.com/prakashjha1/email-sms-spam-classifier

Many email services today provide spam filters that are able to classify emails into spam and non-spam email with high accuracy.

machine-learning natural-language-processing numpy pandas python

Last synced: 10 May 2026

https://github.com/florasteve/ml-foundations-day2

Day-2 ML foundations: probability/stats refresh and NumPy logistic regression; notebooks with visuals.

data-science jupyter-notebook logistic-regression machine-learning matplotlib numpy statistics

Last synced: 10 May 2026

https://github.com/jatin-s16/neuron_from_scratch

Hey! I am learning to build neural networks from scratch without the use of built-in Python libraries.

deep-learning neural-networks nnfs numpy python

Last synced: 10 May 2026

https://github.com/shriansh2002/stock-prediction

This project predicts Apple stock prices using linear regression. It's based on historical stock price data and uses Python and popular data science libraries like Pandas, NumPy, Matplotlib, and scikit-learn.

apple datascience finance linearregression machinelearning matplotlib ml numpy pandas prediction python scikitlearn stock-prediction stockmarket

Last synced: 10 May 2026

https://github.com/adityarohatgi11/regression_analysis_london_crime

This project examines how police presence impacts crime rates in London using regression analysis, panel data, and a natural experiment from the 2005 London attacks.

data-wrangling numpy pandas regression-analysis statsmodels

Last synced: 11 May 2026

https://github.com/baschin1103/machine-learning-linear-regression-tsi

The model is the multiple linear regression loaded from scikit-learn. The aim is to create a model to forecast the so called Total-Skill-Index (TSI) of a player from hattrick.org. More information on: https://wiki.hattrick.org/wiki/Total_Skill_Index. In this repository is the programm and necessary data.

csv linear-regression machine-learning numpy pandas python statistics

Last synced: 11 May 2026

https://github.com/chaganti-reddy/heart_disease_prediction

Machine Model creation for Heart Disease Prediction

data-science deep-learning machine-learning numpy python

Last synced: 11 May 2026

https://github.com/mihirh19/uber-analysis

Uber's 2016 dataset analysis offers insightful information about the company's operations and user behaviour. Patterns and trends can be discovered by looking at variables like trip time, distance travelled, and pickup/drop-off locations. Uber may use this information to detect high-demand locations, increase overall efficiency, and optimise driver

matplotlib numpy opendatasets pandas seaborn

Last synced: 11 May 2026

https://github.com/khalidelboray/image_processing

Notes about image processing

image-processing numpy python3

Last synced: 11 May 2026

https://github.com/daksh90a/wine-quality-analysis

The Wine Quality Analysis project is an AI/ML-based data analysis initiative aimed at predicting and understanding the factors that influence the quality of wine.

matplotlib-python numpy pandas seaborn

Last synced: 12 May 2026

https://github.com/tfiers/unitlib

πŸ“ Physical units for NumPy arrays \ ⏱ Fast β€’ Simple β€’ High voltage

numpy python scientific-computing unitlib units

Last synced: 13 May 2026

https://github.com/niteshchawla/aerofit-casestudy

The market research team at AeroFit wants to identify the characteristics of the target audience for each type of treadmill offered by the company, to provide a better recommendation of the treadmills to the new customers. The team decides to investigate whether there are differences across the product with respect to customer characteristics.

matplotlib-pyplot numpy pandas probability python seaborn statistics

Last synced: 09 Apr 2026

https://github.com/gmcgibbon/udacity-dlnd-project-1

Udacity Deep Learning Nanodegree - Project 1

deep-learning jupyter-notebook nn numpy udacity

Last synced: 13 May 2026

https://github.com/maroshmka/np-mlp-clf

Raw implementation of mlp classifier in python using numpy.

classification example-code learning machine-learning multilayer-perceptron neural-network numpy

Last synced: 14 May 2026

https://github.com/rokmark0/auto-feature-engineering

πŸ™ Automated feature engineering toolkit that creates, selects, and transforms features for machine learning on tabular data, boosting model accuracy and speeding workflows.

auto-feature-engineering automl aws-ec2 dask data-science explainable-ml feature-engineering featuretools machine-learning mahinelearning numpy pandas pandas-library python r sagemaker supervised-learning unsupervised-learning

Last synced: 10 Apr 2026

https://github.com/sahilk12nayak/hyperspectral-corn-don-prediction-project

This project contains a machine learning pipeline for predicting DON (vomitoxin) concentration in corn samples using hyperspectral imaging data.

matplotlib numpy pandas python scikit-learn seaborn tensorflow

Last synced: 10 Apr 2026

https://github.com/ai-lenok/moex-analyzer

A simple engine for analyzing Moscow Exchange securities

docker matplotlib numpy pandas python3

Last synced: 09 May 2026

https://github.com/linsanity03/algorithmic_trading

Trading strategy based on the intersection of short-term and long-term moving averages.

jupyter-notebook matplotlib-pyplot numpy pandas

Last synced: 04 May 2026

https://github.com/broodhoney/heart-disease-prediction

This is a machine learning project which has a trained model that classifies whether a patient has a heart-disease or not.

kaggle-dataset matplotlib numpy pandas python scikit-learn scikitlearn-machine-learning uci

Last synced: 10 Apr 2026

https://github.com/ameykasbe/credit-card-fraud-detection-on-imbalanced-dataset

Examined data preprocessing techniques and performance of six different predictive models in Python to credit card fraud detection problem on an imbalanced dataset. Algorithms implemented - Logistic Regression, K Nearest Neighbours, Support Vector Classification, NaΓ―ve Bayes Classifier, Decision Tree Classifier, and Random Forest Classifier.

classification machine-learning matplotlib numpy pandas python scikit-learn seaborn

Last synced: 10 Apr 2026

https://github.com/sahil210695/gradient-descent

A simplified explanation of gradient descent for linear regression in python using numpy

gradient-descent gradient-descent-algorithm linear-regression matplotlib mini-batch-gradient-descent numpy python stochastic-gradient-descent

Last synced: 03 May 2026