An open API service indexing awesome lists of open source software.

NumPy

NumPy is an open source library for the Python programming language, adding support for large, multidimensional arrays and matrices, along with a large collection of high-level mathematical functions to operate on these arrays.

https://github.com/psgebeline/b2_plotter

A simple Python package for use by Belle II analysts to make various plots out of pandas dataframes and numpy arrays.

jupyter-notebook matplotlib matplotlib-pyplot numpy pandas pandas-dataframe python uproot visualization-tools

Last synced: 19 Jan 2026

https://github.com/ljadhav25/false-alarm-detection

The data was first pre-processed, and analysis libraries like NumPy and Pandas were used to prepare it for a machine learning algorithm. Problems like standard scaling, categorical data, and missing values were handled with appropriate techniques. Then, we used a LogisticRegression model.

flask flask-api logestic-regression numpy pandas pkl pycharm-ide python

Last synced: 19 Jan 2026

https://github.com/salma-mamdoh/the-android-app-market-on-google-play-project

My project aims to practice Data Analysis and Data Visualization on DataCamp

data-analysis data-visualization datacamp jupyter-notebook matplotlib numpy pandas python seaborn

Last synced: 12 Apr 2026

https://github.com/allanreda/automated-k-means-clustering-engine

An interactive K-Means clustering tool built with Flask and Scikit-Learn, supporting Excel file uploads, cluster analysis, and data export, deployed on Google Cloud Run via Docker with CI/CD integration.

cicd css data-visualization deployment docker flask google-cloud-run html javascript k-means-clustering machine-learning matplotlib numpy pandas python scikit-learn

Last synced: 19 Jan 2026

https://github.com/khaymanii/credit-card-fraud-detection-model

This model was built using Python and the Logistic Regression machine learning algorithm.

matplotlib numpy pandas python sckit-learn

Last synced: 09 May 2026

https://github.com/saylie/cat-dog-classifier-

The Cat-Dog Classifier is a deep learning-based image classification project that can differentiate between images of cats and dogs. It uses a Convolutional Neural Network (CNN) model trained on labeled datasets to predict whether a given image belongs to a cat or a dog.

ai javascript keras numpy opencv python react tensorflow

Last synced: 14 Apr 2026

https://github.com/mairagalvao/final_grades

An analysis of the final grades of students using Python

matplotlib numpy pandas python3

Last synced: 09 May 2026

https://github.com/bharath-tars/deeptumor2.0

DeepTumor2.0 is a project aimed at developing a Flask web app with a robust front end and a highly trained TensorFlow model in the backend, with several features such as automatic HTML template reporting. **NOTE: DeepTumor2.0 has been moved from Azure to On-Render to save costs**

css datetime docs flask github html javascript keras numpy onrender os pandas python sheety smtp tensorflow

Last synced: 21 Jan 2026

https://github.com/sweta2501/ml

Consists of ML projects based on Python along with DataSheets

machine-learning matplotlib-pyplot numpy pandas python seaborn sklearn sklearn-model

Last synced: 14 Apr 2026

https://github.com/companyakis/numpy

Python Numpy

data-science numpy python

Last synced: 04 Sep 2025

https://github.com/ramchaik/cinebrain

CineBrain uses ML & NLP to analyze movies and recommend similar ones based on user preferences through cosine similarity.

cosine-similarity flask htmx kaggle machine-learning movie-recomendation-system nlp nltk numpy pandas sklearn tailwind tmdb tmdb-api unsupervised-learning vectorization

Last synced: 19 Jan 2026

https://github.com/omcodewizard/nasa-battery-analysis

This repository contains code for analyzing battery data from NASA's battery testing dataset. The analysis involves processing battery impedance, electrolyte resistance, and charge transfer resistance across charge/discharge cycles to track the aging and performance of various batteries.

battery-impedence nasa-battery-dataset numpy pandas plotly python

Last synced: 09 May 2026

https://github.com/lorenzorottigni/ml-breast-cancer

Machine Learning python bootcamp: Support Vector Machines using breast cancer dataset

ipynb machine-learning numpy pandas python scikit-learn seaborn support-vector-machines

Last synced: 14 Apr 2026

https://github.com/soumyapro/parkinson-disease-prediction

This project predicts Parkinson's disease using machine learning models.

logistic-regression numpy pandas scikit-learn svc xgboost

Last synced: 19 Jan 2026

https://github.com/aroojzahra908/machine-learning

This repository contains helpful guides and materials for easily understanding Pandas, NumPy, and Matplotlib libraries, which are essential tools for creating visual representations of data.

algorithms machine-learning-algorithms matplotlib numpy pandas

Last synced: 09 May 2026

https://github.com/saifalibaig/covid-19-death-rate-analysis-using-python

Analysis of Covid-19 data along with the world happiness report to identify if there is any relationship between death rate and happiness rate of countries all over the world.

data-analysis data-visualization numpy pandas python3 sns visualization

Last synced: 03 May 2026

https://github.com/sudarshanc00/smishing

This project aims to classify text messages to detect potential smishing (SMS phishing) attacks. Using machine learning, the project provides a classifier that can differentiate between legitimate messages and smishing attempts, helping to prevent scams.

nltk numpy pandas python scikit-learn scipy

Last synced: 14 Apr 2026

https://github.com/charancholaveti/imdb-sentimentanalysis

IMDB Movie Review Sentiment Analysis: reports whether the sentiment of a review is Positive or Negative.

deeplearning keras numpy rnn-tensorflow tensorflow

Last synced: 14 Apr 2026

https://github.com/NetBr3ak/Planetary-System-Simulation

Planetary System Simulation enables users to simulate the trajectories of two planets in a gravitational system. Through a simple web interface, users can predict planetary movements using technologies such as Flask, NumPy, and SciPy.

astronomy flask gravitational-dynamics numerical-analysis numpy physics planetary-systems scipy simulation web-application

Last synced: 08 Oct 2025

https://github.com/pedasoft-consult/employee-attrition-prediction

This project implements a machine learning pipeline for predicting employee attrition using HR data. The system analyzes various factors that contribute to employee turnover and provides predictions to help HR departments take preventive measures.

joblib numpy pandas python3 sklearn streamlit

Last synced: 10 Apr 2026

https://github.com/prarthana-singh/bangalore-house-price-predictor

🏡 Bangalore House Price Prediction – A Machine Learning model to predict house prices in Bangalore using real estate data. Built with Linear Regression, Python, Pandas, NumPy, and Scikit-Learn.

data-analysis eda house-price-prediction linear-regression machine-learning numpy pandas python real-estate regression scikit-learn

Last synced: 19 Apr 2026

https://github.com/ricardorobledo/malicious_server_hack_detection

Predictive model to detect malicious hacking patterns in banking servers. Utilizes advanced Machine Learning techniques such as SMOTE, Gradient Boosting, and probability calibration to predict attacks befor

anaconda cibersecurity imbalanced-data imbalanced-learning imblearn kaggle matplotlib numpy pandas pandas-library python3 sklearn

Last synced: 14 Apr 2026

https://github.com/lorenzorottigni/ml-kyphosis

Machine Learning python bootcamp: decision trees & random forest on kyphosis dataset

decision-trees ipynb machine-learning numpy pandas python random-forest seaborn

Last synced: 14 Apr 2026

https://github.com/ricardorobledo/paymentcardfrauddetection2025

Comparative analysis of probabilistic classification models for credit card fraud detection, focusing on model calibration and threshold optimization in highly imbalanced datasets.

imbalanced-learn matplotlib numpy pandas python3 scikit-learn search

Last synced: 14 Apr 2026

https://github.com/chaakshay/heartdrive

A Streamlit-based tool that analyzes cardiovascular health data, predicts population risk using ML, and suggests targeted government actions like awareness campaigns, health checkups, and policy changes.

csv mathplotlib numpy pandas pandas-dataframe python scikit-learn seaborn streamlit

Last synced: 05 Apr 2026

https://github.com/waghraj1699/pokemon-predicton

Predict which type of Pokémon it is using a CNN

data-science deep-learning keras numpy python tensorflow utils

Last synced: 05 Apr 2026

https://github.com/priyanshul28/exercise_pandas

A Pandas exercise demonstrating the loading, cleaning, and reorganization of data in .xlsx or .csv files, descriptive statistics, data visualization, statistical approximation of data with the normal distribution, etc. Libraries include Pandas, NumPy, SciPy, SymPy, and Matplotlib.

data-cleaning data-visualization matplotlib numpy pandas

Last synced: 09 May 2026

https://github.com/zolppy/recommendation-system

This project demonstrates the creation of a content-based image recommendation system. It leverages a pre-trained VGG16 deep learning model to extract meaningful feature vectors from images. These features are then compared using cosine similarity to identify and recommend visually similar images.

computer-vision deep-learning keras machine-learning numpy recommendation-system sklearn tensorflow vgg16

Last synced: 12 Apr 2026

https://github.com/nimaabediforud/heart-disease-prediction

The project highlights the end-to-end workflow of data preprocessing, model building, evaluation, and comparison for a healthcare prediction task.

biomedical-applications biomedical-data-science biomedical-engineering classification deep-learning machine-learning neural-networks numpy pandas prediction python seaborn sklearn tensorflow

Last synced: 15 Apr 2026

https://github.com/sultanazhari/game-trend

Ice, an online video game store, wants to identify new video game trends to help it stock the games that are most profitable to sell. As data analysts, we report video game trends by genre, platform, console, and other classifications.

matplotlib-pyplot numpy pandas python3 scipy-stats video-game

Last synced: 11 Apr 2026

https://github.com/dhruvrajsinhzala24/house-price-predictor-ml-model

A machine learning project to predict house prices using linear regression and random forest models. The project leverages housing data to train and evaluate models, comparing their performance using metrics like MSE and R². Visualizes actual vs. predicted prices to assess model accuracy.

linear-regression machine-learning matplotlib ml numpy pandas python random-forest-regression sklearn

Last synced: 14 Apr 2026

https://github.com/mr-chang95/udacity-starbucks-challenge

Data Science Project for Udacity's Data Scientist Program. Using Python in Jupyter Notebook.

data data-science data-visualization numpy pandas sklearn

Last synced: 14 Apr 2026

https://github.com/ricky9667/taiwanoceantrashpredict

A very simple DNN project using Keras.

dnn keras numpy python

Last synced: 14 Apr 2026

https://github.com/sabbir45ali/cam-board

This is a Python-based virtual board project: you write on the board with your finger, clear it, and change colors.

numpy pandas python3

Last synced: 14 Apr 2026

https://github.com/khushi130404/gradient_realm

Gradient_Realm is a Python project exploring regression techniques and optimization methods like Regularization, Batch, Stochastic, and Mini-batch Gradient Descent. It uses scikit-learn and custom implementations for hands-on learning.

batch-gradient-descent mini-batch-gradient-descent numpy stochastic-gradient-descent

Last synced: 14 Jun 2026

https://github.com/raveenaross/bulldozer-price-prediction

Predicting the sale price of bulldozers using machine learning

matplotlib numpy pandas sklearn time

Last synced: 07 Oct 2025

https://github.com/hansalemaos/cinpy

C/C++ in Python for Dummies

c cpp ctypes numpy python

Last synced: 13 Apr 2026

https://github.com/dwija12903/ai-lab

A collection of practical implementations from my AI Labs course

keras numpy sckit-learn tensorflow

Last synced: 19 Jan 2026

https://github.com/ashutosh65000/invisible-cloak

Have you watched the Harry Potter movie? The Invisible Cloak scene pushed me to make this small code.

image-processing numpy opencv-python

Last synced: 19 Apr 2026

https://github.com/chuvalniy/tulia

Contains self-implemented Machine Learning algorithms using only numpy.

boosting from-scratch knn learning logistic-regression machine ml numpy python random-forest regression sklearn testing xgboost

Last synced: 09 May 2026

https://github.com/abdullah2020/hamoye_stageb

This is my Hamoye Stage B project. The project focuses on predicting the energy efficiency of buildings. It implements different machine learning techniques including, but not limited to, Linear Regression, LASSO, and Ridge.

eda lasso-regression linear-regression numpy pandas predictive-modeling regression-models ridge-regression rmse rsquare-values

Last synced: 09 May 2026

https://github.com/jeffson-dsilva/smartphone-price-prediction

Smartphone Price Prediction is a machine learning project that predicts the price category of a smartphone based on its features. This project provides insights into how various specifications, such as RAM, battery capacity, and processor speed, influence the pricing of smartphones.

dataanalysis machine-learning matplotlib modelbuilding numpy pandas price-prediction python seaborn smartphone

Last synced: 14 Apr 2026

https://github.com/gayatri018/ab-testing-marketing-campaign

This project involves conducting A/B testing to evaluate the performance of two marketing campaigns: Facebook Ads and Google Adwords for an online retail business. The primary goal is to determine which campaign is more effective in driving user engagement and conversions, helping the company optimize its marketing strategies.

ab-testing linear-regression matplotlib numpy pandas python scipy seaborn sklearn-

Last synced: 10 Apr 2026

https://github.com/allanotieno254/pinecone-index-creation-vector-store-database-

This project demonstrates how to create an index in Pinecone, a vector database optimized for similarity search and machine learning applications. The notebook covers setting up a Pinecone client, creating an index, and managing vector data efficiently.

index jupyter-notebook numpy pinecone python

Last synced: 13 Apr 2026

https://github.com/guybanbo/qr-algorithm

QR algorithm for finding eigenvalues and eigenvectors of a large symmetric matrix

haskell hmatrix numpy optimization python qr-algorithm symmetric-matrix

Last synced: 19 Jan 2026

https://github.com/zahediparsa/nlp_ta_uok

This repository contains exercises designed for students in the Natural Language Processing (NLP) course at the University of Kurdistan, taught by Dr. Fatemeh Daneshfar. The course took place from February 2024 to July 2024.

cbow jupyter-notebook matplotlib nlp nltk numpy pandas python tf-idf

Last synced: 19 Jan 2026

https://github.com/michaelzheng67/real_estate_ml

AI experiment developed to run against a simulated real estate and investing environment

ai numpy python pytorch

Last synced: 09 May 2026

https://github.com/chandkund/ev-market-segmentation

This project focuses on segmenting the Electric Vehicle (EV) market in India to identify key customer groups based on their preferences, behaviors, and demographics.

machine-learning matplotlib numpy pandas seaborn sklearn visualization

Last synced: 19 Jan 2026

https://github.com/isratafrin826/linear-regression-model

Linear Regression is a statistical method used to model the relationship between a dependent variable (target) and one or more independent variables (features) by fitting a straight line (regression line) to the data. It predicts the target value based on the linear relationship and is commonly used for forecasting and trend analysis.

googlecolaboratory numpy pandas sklearn

Last synced: 03 May 2026

https://github.com/umer-farooq-cs/california-housing-regression

Linear Regression experiments on the California Housing dataset across five phases, using NumPy and scikit-learn only (no pandas). Includes EDA, polynomial features, SGD with scaling, residuals, 5-fold CV, and an LNCS-style report with figures.

california-housing cross-validation deep-learning-for-perception linear-regression numpy polynomial-features reproducible-research residual-analysis sgd sklearn

Last synced: 09 May 2026

https://github.com/umesh-01/python

Python is a programming language that lets you work more quickly and integrate your systems more effectively. Python is used for web development, AI, machine learning, operating systems, mobile application development, and video games.

ai application-development artificial-intelligence deep-learning machine-learning matpotlib ml nlp numpy open-source pandas python python-framework python-libraries python3 web-development

Last synced: 07 Apr 2026

https://github.com/christabelsakyi/product_purchase_prediction

As a newly hired AI Engineer, your task is to predict customer behavior based on various features such as age, income, and gender. This exercise involves cleaning the data, training a decision tree model, and evaluating the model's performance to understand the key factors influencing customer purchasing decisions.

machine-learning matplotlib-pyplot numpy python seaborn sklearn

Last synced: 08 May 2026

https://github.com/jonad/quora_w2v

In-domain word embedding using Quora dataset.

gensim-word2vec matplotlib numpy pandas python3

Last synced: 11 Apr 2026

https://github.com/dev-dur/student_exam_analysis

This is a beginner-friendly data analysis mini-project using NumPy, Pandas, and Matplotlib. The project generates random student exam scores, analyzes them, and visualizes the results.

matplotlib-pyplot numpy pandas python

Last synced: 16 Apr 2026

https://github.com/sanjurajveer/market_basket_analysis

Finding patterns in purchased items through market basket analysis

apriori-algorithm association-rules numpy pandas python seaborn

Last synced: 09 May 2026

https://github.com/sayidmushtaq/reportcard

Tracking and analyzing student grades. It calculates averages, identifies top performers, and generates performance reports with ease.👨🏼‍🎓👨🏼‍🎓

begginer-project numpy reporting-tool

Last synced: 27 Jan 2026

https://github.com/18mahi/tweet-sentiment-analysis

Classify tweets into happy, sad, angry, excited, and neutral with this interactive Python model. Combining TF-IDF text features with engineered numeric features like emoji sentiment, polarity, subjectivity, and punctuation counts, it demonstrates intermediate-level NLP, feature engineering, and visualization skills.

data-science data-visualization jupyter-notebook machine-learning matplotlib nlp numpy pandas sklearn string textblob

Last synced: 30 Apr 2026

https://gitlab.com/DLR-SY/boxbeam

Boxbeam is a legacy Fortran tool translated to Python. It calculates effective beam properties of composite cross sections comprised of rod-like elements.

fortran numpy python

Last synced: 12 Oct 2025

https://github.com/christabelsakyi/tsec

This project implements a time series forecasting solution for predicting electricity consumption based on historical data. It provides multiple model options, a Flask API for deployment, and visualization tools.

facebook numpy pandas python sklearn time-series-analysis

Last synced: 07 Oct 2025

https://github.com/ashishgoswami13/handgesture_recognition

Real-time hand gesture recognition using computer vision and deep learning. Collect gesture images, train a model with Google Teachable Machine, and classify gestures live for applications like drone control or human-computer interaction.

computer-vision cvzone handgesture-recognition keras numpy python teachable-machine

Last synced: 18 Apr 2026

https://github.com/angelarreola/ai_notes

Notes for the "Artificial Intelligence" course, intended for later extraction by a language model that can give personalized answers based on the information in this repository.

ai matplotlib numpy pandas phaserjs python scikit-learn

Last synced: 21 Jan 2026

https://github.com/sudharshan-bathini/chat-bot-using-cornell-movie-data-set

Developed an AI chatbot using the Cornell Movie Dialogs dataset; trained an LSTM-based model to generate context-aware, conversational responses for user inputs.

keras lstm-sentiment-analysis neural-networks nlp numpy python tensorflow tokenizer

Last synced: 08 Apr 2026

https://github.com/ejw-data/python-bitwise-operators

Difference between bitwise and logical operators when using pandas

numpy pandas

Last synced: 08 May 2026

https://github.com/sducournau/ign_lidar_hd_dataset

🏗️ Comprehensive Python library for processing IGN LiDAR HD data into machine learning-ready datasets for Building Level of Detail (LOD) classification. Features GPU/CPU processing, smart data management, and complete ML pipeline integration.

building-classification data-processing dataset france geospatial gis ign lidar lidar-hd numpy point-cloud scikit-learn

Last synced: 20 Jan 2026

https://github.com/swatisinghit/treadmill-customer-profiling-for-aerofit

Create comprehensive customer profiles for each AeroFit treadmill product through descriptive analytics. Develop two-way contingency tables and analyze conditional and marginal probabilities to discern customer characteristics, facilitating improved product recommendations and informed business decisions.

analytics conditional-probability data-analysis data-science data-visualization eda numpy pandas probability statistics

Last synced: 08 May 2026

https://github.com/minhtungonep/android-traffic-analysis

Android malware detection project analyzing network traffic patterns in a telecommunications context. Uses statistical hypothesis testing and data visualization to evaluate traffic features like DNS query times, TCP packets, and volume bytes for distinguishing between benign and malicious Android applications.

android-library bachelor-project bachelor-thesis cybersecurity cyface data-visualization hypothesis-testing malware-detection matplotlib network-traffic numpy pandas python scipy sdk statistical-analysis telecommunications voip-security

Last synced: 09 May 2026

https://github.com/chandkund/sms-spam-detection

The goal is to develop a classification model that can accurately differentiate between spam and non-spam messages. This is crucial for applications like email filtering, SMS spam detection, and improving overall user experience by reducing the influx of unwanted or malicious content.

matplotlib nlp-machine-learning numpy pandas seaborn stemming tfidf-vectorizer tokenization

Last synced: 19 Jan 2026

https://github.com/farhad-here/height-distribution-analysis

Statistical comparison of height distributions in two groups using mean, standard deviation, and boxplots.

coefficient-of-variation data-analysis interquartile-ranges matplotlib mean numpy python scipy standard-deviation variance

Last synced: 13 Apr 2026

https://github.com/gsaini/streamlit-ui-getting-started

Streamlit is an open-source Python framework for data scientists and AI/ML engineers to deliver dynamic data apps with only a few lines of code. Build and deploy powerful data apps in minutes. Let's get started!

codecov github-actions numpy pandas python streamlit unittest

Last synced: 14 Jun 2026

https://github.com/nisch-mhrzn/house_prediction

This project predicts house prices using data exploration, feature engineering, and machine learning models like Linear Regression and Random Forest. It demonstrates how to optimize models and evaluate their performance to accurately forecast house prices.

matplotlib numpy pandas python scikit-learn seaborn

Last synced: 08 Apr 2026

https://github.com/mehediehteshum/minesweeper

Building classic 'Minesweeper' game from scratch using Pygame with VS Code.

numpy pygame python python3

Last synced: 06 Oct 2025

https://github.com/jsimell/sleepanalysis

A Python data analysis project examining the factors that affect sleep quality and the temporal patterns in the sleep data of a single subject.

data-analysis matplotlib numpy pandas python scikit-learn seaborn

Last synced: 14 Apr 2026

https://github.com/achronus/data-graphing-tool

A tool for finding the perfect graph that fits your CSV data.

data-visualization matplotlib numpy pandas python3

Last synced: 13 Oct 2025

https://github.com/carol-neto/sprint-4-statistical-data-analysis

In this project I had the opportunity to test my knowledge by analyzing a phone plan and creating graphs to compare the plans and determine which ones generate the most revenue.

matplotlib-pyplot numpy pandas pytho scipy-stats seaborn statistical-analysis

Last synced: 09 May 2026

https://github.com/sweta-kaundilya/finance-data-project

This data project focuses on exploratory data analysis of bank stock prices from the financial crisis through early 2016.

financial-analysis jupyter-notebook matplotlib numpy pandas-python seaborn

Last synced: 16 May 2026

https://github.com/szymon-budziak/real_estate_house_prices_prediction

Predicting real estate house prices using various machine learning algorithms, including data exploration, preprocessing, model training, and evaluation.

data-analysis data-preprocessing data-science eda jupyter-notebook machine-learning matplotlib numpy optuna pandas predictive-modeling price-prediction python random-forest regression scikit-learn seaborn

Last synced: 21 Jan 2026

https://github.com/petcareai/dba

PetCare DBA Admin is an advanced web system developed specifically for administering and managing the databases of the PetCare platform. The tool was created to simplify and optimize the work of database administrators (DBAs) in veterinary clinics, veterinary hospitals, and pet management systems.

api dba front-end-development git numpy pandas-dataframe petcareai ploty postgresql python shell-script sql startup streamlit streamlit-webapp supabase supabase-auth supabase-db supabase-storage virtualenv

Last synced: 14 Apr 2026

https://github.com/raduldev/datasciencepython

Interview Task From CSIR-IGIB, Delhi & maps for journal

biopython colab-notebook counter jupyter-notebook matplotlib numpy pandas

Last synced: 09 May 2026

https://github.com/dibsthegreat/titantic-dataset-analytics

DASC4850 Final Project where I did EDA to determine the survivability of Titanic guests depending on Age, Gender, Wealth, etc.

data-science data-visualization matplotlib numpy pandas python random-forest-classifier

Last synced: 13 Apr 2026

https://github.com/shubham-bendre/football-clip-analysis

This project uses YOLO and Roboflow datasets to analyze football matches, detecting players, referees, and the ball while tracking movements in real-time.

deep-learning matplotlib numpy roboflow-dataset yolov5

Last synced: 17 May 2026

https://github.com/akashash01/support_vector_machine

It is a linear model for classification and regression problems. It can solve non-linear problems by using a kernel to create a hyperplane that separates the data into classes.

data-science datavisualization machine-learning numpy pandas python sklearn svm-classifier

Last synced: 13 Apr 2026