An open API service indexing awesome lists of open source software.

NumPy

NumPy is an open source library for the Python programming language, adding support for large, multidimensional arrays and matrices, along with a large collection of high-level mathematical functions to operate on these arrays.

https://github.com/saifalibaig/covid-19-death-rate-analysis-using-python

Analysis of COVID-19 data alongside the World Happiness Report to identify whether there is any relationship between the death rate and the happiness rate of countries all over the world.

data-analysis data-visualization numpy pandas python3 sns visualization

Last synced: 03 May 2026

https://github.com/sudarshanc00/smishing

This project aims to classify text messages to detect potential smishing (SMS phishing) attacks. Using machine learning, the project provides a classifier that can differentiate between legitimate messages and smishing attempts, helping to prevent scams.

nltk numpy pandas python scikit-learn scipy

Last synced: 14 Apr 2026

https://github.com/lorenzorottigni/ml-kyphosis

Machine Learning Python bootcamp: decision trees & random forest on the kyphosis dataset

decision-trees ipynb machine-learning numpy pandas python random-forest seaborn

Last synced: 14 Apr 2026

https://github.com/chaakshay/heartdrive

A Streamlit-based tool that analyzes cardiovascular health data, predicts population risk using ML, and suggests targeted government actions like awareness campaigns, health checkups, and policy changes.

csv mathplotlib numpy pandas pandas-dataframe python scikit-learn seaborn streamlit

Last synced: 05 Apr 2026

https://github.com/waghraj1699/pokemon-predicton

Predicts a Pokémon's type using a CNN

data-science deep-learning keras numpy python tensorflow utils

Last synced: 05 Apr 2026

https://github.com/sultanazhari/game-trend

Ice, an online video game store, wants to identify new video game trends to help it stock games that are more profitable to sell. As data analysts, we report video game trends by genre, platform, console, and other classifications.

matplotlib-pyplot numpy pandas python3 scipy-stats video-game

Last synced: 11 Apr 2026

https://github.com/mr-chang95/udacity-starbucks-challenge

Data Science Project for Udacity's Data Scientist Program. Using Python in Jupyter Notebook.

data data-science data-visualization numpy pandas sklearn

Last synced: 14 Apr 2026

https://github.com/sabbir45ali/cam-board

A Python-based virtual board that lets you write with your finger, clear the board, and change colors.

numpy pandas python3

Last synced: 14 Apr 2026

https://github.com/ashutosh65000/invisible-cloak

Have you watched the Harry Potter movie? The Invisible Cloak scene inspired me to write this small program.

image-processing numpy opencv-python

Last synced: 19 Apr 2026

https://github.com/jeffson-dsilva/smartphone-price-prediction

Smartphone Price Prediction is a machine learning project that predicts the price category of a smartphone based on its features. This project provides insights into how various specifications, such as RAM, battery capacity, and processor speed, influence the pricing of smartphones.

dataanalysis machine-learning matplotlib modelbuilding numpy pandas price-prediction python seaborn smartphone

Last synced: 14 Apr 2026

https://github.com/gayatri018/ab-testing-marketing-campaign

This project involves conducting A/B testing to evaluate the performance of two marketing campaigns, Facebook Ads and Google AdWords, for an online retail business. The primary goal is to determine which campaign is more effective in driving user engagement and conversions, helping the company optimize its marketing strategies.

ab-testing linear-regression matplotlib numpy pandas python scipy seaborn sklearn-

Last synced: 10 Apr 2026

https://github.com/zahediparsa/nlp_ta_uok

This repository contains exercises designed for students in the Natural Language Processing (NLP) course at the University of Kurdistan, taught by Dr. Fatemeh Daneshfar. The course took place from February 2024 to July 2024.

cbow jupyter-notebook matplotlib nlp nltk numpy pandas python tf-idf

Last synced: 19 Jan 2026

https://github.com/umesh-01/python

Python is a programming language that lets you work more quickly and integrate your systems more effectively. Python is used for web development, AI, machine learning, operating systems, mobile application development, and video games.

ai application-development artificial-intelligence deep-learning machine-learning matpotlib ml nlp numpy open-source pandas python python-framework python-libraries python3 web-development

Last synced: 07 Apr 2026

https://github.com/jonad/quora_w2v

In-domain word embedding using Quora dataset.

gensim-word2vec matplotlib numpy pandas python3

Last synced: 11 Apr 2026

https://github.com/dev-dur/student_exam_analysis

This is a beginner-friendly data analysis mini-project using NumPy, Pandas, and Matplotlib. The project generates random student exam scores, analyzes them, and visualizes the results.

matplotlib-pyplot numpy pandas python

Last synced: 16 Apr 2026

https://github.com/sayidmushtaq/reportcard

Tracks and analyzes student grades. It calculates averages, identifies top performers, and generates performance reports with ease.👨🏼‍🎓👨🏼‍🎓

begginer-project numpy reporting-tool

Last synced: 27 Jan 2026

https://gitlab.com/DLR-SY/boxbeam

Boxbeam is a legacy Fortran tool translated to Python. It calculates effective beam properties of composite cross sections composed of rod-like elements.

fortran numpy python

Last synced: 12 Oct 2025

https://github.com/angelarreola/ai_notes

Notes for the "Artificial Intelligence" course, intended for later extraction by a language model that can give personalized answers based on the information in this repository.

ai matplotlib numpy pandas phaserjs python scikit-learn

Last synced: 21 Jan 2026

https://github.com/chandkund/sms-spam-detection

The goal is to develop a classification model that can accurately differentiate between spam and non-spam messages. This is crucial for applications like email filtering, SMS spam detection, and improving overall user experience by reducing the influx of unwanted or malicious content.

matplotlib nlp-machine-learning numpy pandas seaborn stemming tfidf-vectorizer tokenization

Last synced: 19 Jan 2026

https://github.com/jsimell/sleepanalysis

A Python data analysis project examining the factors that affect sleep quality and the temporal patterns in the sleep data of a single subject.

data-analysis matplotlib numpy pandas python scikit-learn seaborn

Last synced: 14 Apr 2026

https://github.com/petcareai/dba

PetCare DBA Admin is an advanced web system developed specifically for administering and managing the databases of the PetCare platform. This tool was created to simplify and optimize the work of database administrators (DBAs) at veterinary clinics, veterinary hospitals, and pet management systems.

api dba front-end-development git numpy pandas-dataframe petcareai ploty postgresql python shell-script sql startup streamlit streamlit-webapp supabase supabase-auth supabase-db supabase-storage virtualenv

Last synced: 14 Apr 2026

https://github.com/shubham-bendre/football-clip-analysis

This project uses YOLO and Roboflow datasets to analyze football matches, detecting players, referees, and the ball while tracking movements in real time.

deep-learning matplotlib numpy roboflow-dataset yolov5

Last synced: 17 May 2026

https://github.com/abhishekyadav915/multiple_disease_prediction_using_ml

Developed a web-based application for predicting the likelihood of multiple diseases using machine learning models with accuracy up to 95%.

colab-notebook logistic-regression numpy pandas python3 sklearn spyder-python-ide streamlit-web streamlitcloud svm-classifier

Last synced: 05 May 2026

https://github.com/ayorick23/python-data-science-cheat-sheet

Quick, practical guide to essential Python syntax, commands, and functions for data science. Perfect for recalling how to use the most common libraries, such as NumPy, Pandas, Matplotlib, and Scikit-learn, in your day-to-day analyses.

cheat-sheet data-analysis data-science data-visualization deep-learning jupyter-notebook machine-learning matplotlib ml numpy pandas python scikit-learn scipy seaborn statistics sympy tensorflow

Last synced: 07 Apr 2026

https://github.com/polyee99/kaggle-titanic-data-analytics

Jupyter notebook that predicts whether passengers survived or died in the tragic Titanic disaster.

data eda jupiter-notebook matplotlib numpy pandas python regression-analysis test-train-split visualization

Last synced: 05 Feb 2026

https://github.com/gerlero/generala

🎲 Probabilistic tool for the dice game of Generala

generala numpy python

Last synced: 27 Jan 2026

https://github.com/hansalemaos/locate_pixelcolor_cython_multi_auto_compile

Detects colors in images 8x faster than NumPy. Uses Cython and returns a dict.

bot c color colors colour colours cpp cython fast find multiprocessing numpy numpy-array opencv rgb rgba search

Last synced: 24 Feb 2026

https://github.com/ananupam/eda-titanic

An Exploratory Data Analysis project on the Titanic dataset.

numpy pandas python seaborn

Last synced: 14 Apr 2026

https://github.com/aqueeqazam/student-placements-using-machine-learning-and-flask

Through the use of Flask and machine learning, I show how to implement the student placement prediction model.

flask machine-learning numpy pandas

Last synced: 14 Apr 2026

https://github.com/5hraddha/ice-video-games-sales-analysis

Analysis of video game sales data from the online store Ice, which sells video games all over the world, to identify patterns that determine whether a game succeeds or not.

data-visualization matplotlib numpy pandas requests-module scipy seaborn

Last synced: 14 Apr 2026

https://github.com/matkorussovich/student-performance-analysis

This repository contains an analysis of students' academic performance, carried out as part of the "Introduction to Data Science" module in the Master's in Data Science at the Universidad Europea de Madrid.

jupyter-notebook matplotlib-pyplot numpy pandas python

Last synced: 14 Apr 2026

https://github.com/mizcausevic-dev/embedding-drift-graph

Track how entity embeddings drift across encoder model versions. SQLite store + Strawberry GraphQL API. Cosine drift events computed automatically on every record. Reference impl for RAG/eval pipelines re-encoding their corpus.

ai-governance drift-detection embeddings graphql llm numpy python rag sqlite strawberry-graphql vector-search

Last synced: 01 Jun 2026

https://github.com/sralter/potential_talents

Using NLP techniques (word and sentence embedding tools like SBERT and Learning-to-Rank systems like RankNet and LambdaRank) to rank candidates.

lambdarank learning-to-rank lightgbm matplotlib nlp numpy pandas python pytorch ranknet

Last synced: 09 Apr 2026

https://github.com/karthik9273/revolutionizing-gold-rate-forecasting-for-small-businesses-with-machine-learning

The "Gold Price Prediction" project focuses on predicting the prices of gold using machine learning techniques. By leveraging popular Python libraries such as NumPy, Pandas, Scikit-learn (sklearn), Matplotlib, and Seaborn, along with a Random Forest Regressor model, this project provides a comprehensive solution for accurate price estimation.

data-science google-colab-notebook jupyter-notebook machine-learning matplotlib numpy pandas-dataframe python seaborn sklearn

Last synced: 06 May 2026

https://github.com/mjul/scipy-lab

Scientific computation with Python

matplotlib numpy python scipy

Last synced: 04 May 2026

https://github.com/deepcloudlabs/dcl702-2021-jul-12

DCL-702: Data Analytics using Python

data-analytics numpy pandas python3

Last synced: 05 May 2026

https://github.com/neelays/xor-xnor_neural_network

NumPy neural network to approximate XOR/XNOR

numpy

Last synced: 15 May 2026

https://github.com/juzershakir/predicting_boston_housing_prices

Built a model to predict the value of a given house in the Boston real estate market using various statistical analysis tools. Identified the best price at which a client can sell their house using machine learning.

bias-variance boston-housing-price-prediction data-exploration decision-tree-regression gridsearchcv k-fold machine-learning matplotlib mlfnd model-evaluation model-validation numpy pandas python3 r2-score sklearn supervised-learning udacity-nanodegree

Last synced: 22 Oct 2025

https://github.com/hifly81/photox

photo and album organizer for gtk3

gtk3 numpy opencv-python photography pil pillow python

Last synced: 24 Oct 2025

https://github.com/bilgenurbekar/turkishcyberbullying

Contains fine-tuned BERT models and results in the text classification category using Turkish social media data

bert-fine-tuning huggingface-transformers matplotlib numpy pandas python pytorch scikit-learn transformers

Last synced: 07 Mar 2026

https://github.com/atul-maurya-30/galaxy

Galaxy Classification is a machine learning project focused on classifying galaxies into two subclasses: 'STARFORMING' and 'STARBURST'. This project demonstrates data preprocessing, model training, and evaluation using advanced machine learning techniques and Python libraries.

flask machine-learning matplotlib numpy pandas python regression-classification seaborn sklearn

Last synced: 09 Mar 2026

https://github.com/satyacoder29/crm-analytics-power-bi

CRM Analytics Dashboard – An interactive dashboard using Tableau, SQL, and Salesforce CRM Analytics (CRMA) to analyze sales performance, customer segmentation, and churn prediction. Features automated ETL pipelines, predictive analytics, and real-time insights for data-driven decision-making. 🚀📊

advanced-excel data-analysis data-cleaning data-collection data-transformation data-visualization matplotlib numpy pandas powerbi python seaborn sql tableau

Last synced: 14 Apr 2026

https://github.com/luliatuccu/weather_analysis

This project highlights a combination of data science techniques and Python programming to explore real-world weather data.

data-preprocessing eda feature-engineering machine-learning matplotlib numpy pandas regex scikit-learn seab seaborn weather weather-patterns

Last synced: 02 Apr 2026

https://github.com/manuel-lang/numpymongo

A Python package to export NumPy data to MongoDB

mongodb numpy wrapper

Last synced: 23 Feb 2026

https://github.com/pradeep31747/smartsuggest-personalized_product_recommendations

This project implements a personalized product recommendation system using machine learning techniques to enhance user experience and drive engagement.

jupyter-notebook keras numpy pandas pyhton scikit-learn sql tensorflow vscode

Last synced: 28 Jan 2026

https://github.com/hafaio/homeotopy

A Python library for computing homeomorphisms between some common continuous spaces

homeomorphism numpy

Last synced: 24 Feb 2026

https://github.com/pushtogithub23/projectilemotion-simulation

This Jupyter notebook explores projectile motion through mathematics, visualizations, and interactive simulations.

jupyter-notebook matplotlib-pyplot numpy physics-simulation projectile-motion-simulation

Last synced: 28 Jan 2026

https://github.com/rezasharifi82/lesson_extractor

Extracts all possible "معارف" courses at Ferdowsi University of Mashhad (FUM)

beautifulsoup4 ipynb jupyter-notebook numpy pandas python3 regex

Last synced: 06 Feb 2026

https://github.com/sonaligill/olympics-analysis

The outcome of this project is an interactive Streamlit web application that visualizes the analysis of Olympic data, letting users explore different aspects of Olympic history, compare country performances, and gain insights into athlete demographics.

numpy plotly python scikit-learn scipy streamlit

Last synced: 28 Jan 2026

https://github.com/harmanveer2546/heart

Predicts the presence of heart disease from several health-related factors by performing (i) data cleaning, (ii) data pre-processing, (iii) EDA, and (iv) a comparison of 5 different classification algorithms (Logistic Regression, Decision Tree, Random Forest, KNN, and SVC).

data-preprocessing decision-tree eda knn logistic-regression machine-learning numpy pandas random-forest roc-auc-curve svc

Last synced: 03 May 2026

https://github.com/lavkalsi/creditcardfrauddetector

Credit Card Fraud Detector is a React web app that predicts if a credit card transaction is fraudulent using a Python machine learning model. Users can input transaction data, and Flask facilitates communication between the backend and frontend. Backend files are located in the res folder. This app provides a simple UI for user interaction.

fraud-detection machine-learning nodejs numpy pandas python reactjs sklearn-library

Last synced: 15 Apr 2026

https://github.com/andres29ac/website_heatmaprisk

Map built with Streamlit and Folium to show the crime distribution of each district of Trujillo using heat maps.

folium-maps heatmap-visualization numpy pandas ploty python stadistics streamlit

Last synced: 15 Apr 2026

https://github.com/guy1998/synthetic-data-generator-for-event-cameras

Automated workflow for the generation of synthetic data that can be used as an input for event cameras based on previously taken videos

computer-vision event-cameras numpy opencv synthetic-data

Last synced: 15 Apr 2026

https://github.com/hansalemaos/cv2_stack_images

Stacks 2 images (horizontal/vertical)

concat cv2 images numpy python stack

Last synced: 28 Jan 2026

https://github.com/shwetapardhi/assignment-03-q5--hypothesis-testing

Chi2 contingency independence test. Q5: Fantaloons sales managers commented that the % of males versus females walking into the store differs based on the day of the week. Analyze the data and determine whether there is evidence at the 5% significance level to support this hypothesis. Assume the null hypothesis H0: independence of categorical variables (% of

alternate-hypothesis chi2-contingency contingency-analysis hypothesis-testing null-hypothesis numpy pandas python scipy significance-testing stats

Last synced: 28 Jan 2026

https://github.com/subh888999/car-prices--analysis-projects

This repository houses projects focused on data collection, assessment, cleaning, visualization, and analysis. It includes workflows and methodologies for handling data, from initial gathering and evaluation to processing, visualizing insights, and performing in-depth analysis

jupyter-notebook matplotlib numpy panda seaborn statistics

Last synced: 03 May 2026

https://github.com/deepankkartikey/ml-projects

A repository of projects done while learning and exploring ML

classification csv machine-learning numpy pandas regression sklearn

Last synced: 15 Apr 2026

https://github.com/asherk7/house-price-prediction

House Prices - Advanced Regression Techniques - Predict sales prices and practice feature engineering, RFs, and gradient boosting

data-science numpy pandas regression scikit-learn

Last synced: 15 Apr 2026

https://github.com/jaypanchal9/fraud-detection-case-study

A comprehensive case study applying machine learning techniques to detect fraudulent transactions effectively.

machine-learning matplotlib numpy pandas python3 scikit-learn seaborn xgboost

Last synced: 15 Apr 2026

https://github.com/hercules-peters/learn-ml-with-me

Welcome to **ML-Journey**, my personal progress tracker for mastering machine learning! This repository showcases my projects, notebooks, and explorations as I advance my skills, starting with hands-on projects.

machine-learning numpy pandas python sklearn

Last synced: 15 Apr 2026

https://github.com/muhkartal/fr-photo

A face-recognition system that works on images. You first need to register a valid image to be used for recognition.

face-recognition numpy opencv pillow python streamlit

Last synced: 15 Apr 2026

https://github.com/tamk-kol/project_orbital_data_analysis

The goal of this project is to develop an automatic method to detect orbital maneuvers using machine learning.

matplotlib numpy pandas scikit-learn

Last synced: 30 Jan 2026

https://github.com/ayuub34/siesta-smart

This project is designed to be an AIaaS 2-Way-PMS optimized with AI and ML models. The goal is to enhance the efficiency of the system through advanced technology solutions.

ai data-analysis-python data-processing data-science jupyter-python machine-learning matplotlib numpy pandas powerbi-report pyqt5 scipy seaborn sklearn

Last synced: 25 Feb 2026

https://github.com/shwetapardhi/assignment-03-q4-hypothesis-testing

Chi2 contingency independence test. Q4: TeleCall uses 4 centers around the globe to process customer order forms. They audit a certain % of the customer order forms. Any error in an order form renders it defective, and the form has to be reworked before processing. The manager wants to check whether the defective % varies by center. Please analyze the data at 5

alternate-hypothesis chi2-contingency contingency-analysis hypothesis-testing norm null-hypothesis numpy numpy-arrays pandas python scipy significance-testing stats

Last synced: 31 Jan 2026

https://github.com/thkox/uni_2023-compression-and-object-removal

This project is a team assignment for the "Multimedia Systems" course, offered in the 6th semester of the 2023 academic year at the University of Piraeus, Department of Informatics.

2023 6th-semester colorama compression-algorithm department-of-informatics heapq lossless-compression-algorithm motion-compensation multimedia-systems numpy object-removal opencv-python pickle python scipy-stats university-of-piraeus

Last synced: 07 Feb 2026

https://github.com/moh-27sl/image-alignment--python-edition

Alignment of an unaligned image with a base/reference image using feature detection, feature matching, and homography in OpenCV.

ai classical-computer-vision classical-image-processing computer-vision computervision image-alignment image-processing imageprocessing neural-network numpy opencv python python3

Last synced: 15 Apr 2026

https://github.com/manu-karenite/medical-insurance-cost-predictor

Medical Insurance Cost Generator is a linear-regression-based predictor that estimates the cost a person has to pay when buying medical insurance.

kaggle-dataset linear-regression machine-learning matplotlib numpy pandas python3 reactjs scikit-learn

Last synced: 15 Apr 2026

https://github.com/hajaarh/health_state_classifier

This project aims to analyze a dataset containing various indicators of health, well-being, and daily life in order to classify individuals into three categories according to their state of health.

numpy pandas python seaborn sklearn

Last synced: 15 Apr 2026

https://github.com/sarmad426/ai

AI from basic to advanced, featuring Machine Learning, Deep Learning, and Data Science.

ai data-science deep-learning hugging-face machine-learning numpy pandas python scikit-learn

Last synced: 15 Apr 2026

https://github.com/l-gre/data_analytics_for_finance

Comprehensive course materials for the Data Analytics for Finance - Master Programme, covering data manipulation, statistical analysis, visualisation, automation, and real-world case studies using industry-standard tools.

automation data-cleaning data-manipulation data-visualization excel hypothesis-testing industry-applications matplotlib numpy pandas python real-world-case-studies regression-analysis seaborn sql statistical-analysis tableau workflow-automation

Last synced: 15 Apr 2026

https://github.com/karishmagupta05/airbnb-data-scraping-and-analysis

This project involves web scraping Airbnb listings using Python and BeautifulSoup, followed by data analysis with Pandas. The goal is to extract meaningful insights from the data, clean and preprocess it, and visualize the results using libraries like Matplotlib and Seaborn.

beautifulsoup4 matplotlib numpy pandas python scraping seaborn visualization

Last synced: 15 Apr 2026

https://github.com/amruta33/credit_card_analysis

Loan providers find it hard to lend to people with insufficient or non-existent credit histories. Some consumers take advantage of this by defaulting on their loans.

numpy pandas python3

Last synced: 15 Apr 2026

https://github.com/nmerka/practicum_rep

Projects from the Data Analyst course (Yandex Practicum)

jupyter-notebook matplotlib numpy pandas plotly python scipy seaborn sql tableau

Last synced: 31 Jan 2026

https://github.com/dipeshgoyal013/salary-data-analysis

Salary analysis by department and agency.

analysis matplotlib numpy pandas salary sklearn-library

Last synced: 15 Apr 2026

https://github.com/dmitruzik/python_projects

All python projects are here

matplotlib numpy pandas python seaborn

Last synced: 15 Apr 2026

https://github.com/samiyaalizaidi/nn-ml-homeworks

Homework solutions for CPE-4903: Neural Networks & Machine Learning at Kennesaw State University.

machine-learning machine-learning-workflow neural-networks numpy scikit-learn

Last synced: 15 Apr 2026