An open API service indexing awesome lists of open source software.

NumPy

NumPy is an open source library for the Python programming language, adding support for large, multidimensional arrays, and matrices, along with a large collection of high-level mathematical functions to operate on these arrays.

https://github.com/AsuquoAA/Energy-_GDP-_and_Scimago_Rankings_Analysis-2006-2015

Analysis of top 15 countries in energy, renewables, and GDP using three global datasets.

data-cleaning numpy pandas

Last synced: 21 Jul 2025

https://github.com/coder5omkar/LendingClubCaseStudy

Performed exploratory data analysis (EDA) on the loan dataset from the Lending Club Case Study to identify the key factors influencing loan defaults.

bivariate-analysis data-science exploratory-data-analysis matplotlib numpy pandas seaborn univariate-analysis

Last synced: 21 Jul 2025

https://github.com/aritrakar/ml-from-scratch

🤖Common ML implementations from scratch.

machine-learning numpy pandas python pytorch

Last synced: 07 Apr 2026

https://github.com/bonifatius94/backprop

Deep Learning Reference Implementation

deep-learning numpy reference-implementation

Last synced: 18 Apr 2026

https://github.com/shoaib1522/data-science-in-python

"Unlock the power of data science with my curated collection of Python projects—your gateway to insights, analysis, and hands-on learning! 🌟"

analytics confusion-matrix data-science kaggle-dataset matplotlib mlp-classifier notebook numpy pandas python regression streamlit

Last synced: 04 Apr 2026

https://github.com/anselmoo/useful-math-functions

useful-math-functions for Optimization, Benchmarking, Visualizing, and more ...

basic functions math mathematics numpy

Last synced: 17 Mar 2025

https://github.com/1adore1/face-analysis

An application for face recognition and prediction of gender, age range and emotional state.

numpy opencv pandas python pytorch

Last synced: 07 Apr 2026

https://github.com/asherk7/neural-network-from-scratch

Creating a Convolutional Neural Network from scratch, modeled after ResNet

classification cnn ml neural-network numpy python resnet

Last synced: 18 May 2026

https://github.com/alex-ber/alpine-python2

Alpine Linux based based image with Python 2.7 and necessary OS level dependencies (OpenSSL, gcc, ODBC driver, etc).

alpine alpine-linux dockerfile fortran gcc gcc-compiler numpy python python-2 python27

Last synced: 03 Oct 2025

https://github.com/dipakxettri/numpy

This Repo is dedicated on learning NumPy a first step of learning DataScience/Machine Learning!

numpy python

Last synced: 18 May 2026

https://github.com/michaeldehaney94/opencv

Python OpenCV computer vision projects

matplotlib numpy opencv-python python

Last synced: 20 Jun 2025

https://github.com/aravinda-1402/covid-detection-model-using-chest-x-ray

The objective of this project is to develop a Deep Learning Model to identify the X-Rays of healthy vs. Pneumonia (Corona) afflicted patients using the Chest X-Ray dataset, and use this model to power the AI application to test the Corona Virus in a faster phase.

classification cnn covid flask keras numpy pandas tensorflow

Last synced: 09 Apr 2026

https://github.com/lorenzorottigni/ml-interview

Machine Learning python bootcamp: K Nearest Neighbors job interview simulation dataset

ipynb k-nearest-neighbours machine-learning numpy pandas python scikit-learn seaborn

Last synced: 07 Apr 2026

https://github.com/sabdikay/telco-customer-churn-analysis-ibm-dataset

This project explores customer churn trends for a company in California using an IBM dataset. Built in a Jupyter Notebook, it employs pandas, NumPy, matplotlib, seaborn, plotly, and scipy to clean, analyze, and visualize data. Through statistical tests and interactive maps, it uncovers key drivers behind customer cancellations

business-intelligence customer-churn data-analysis data-analysis-python data-visualization exploratory-data-analysis jupyter-noteboook matplotlib numpy pandas plotly predictive-modeling python scipy seaborn statistical-analysis

Last synced: 07 Apr 2026

https://github.com/h12m54am/numpy-matplotlib

Best for people who are using it for mathematical and statistical work. Not really a tutorial, but rather a bunch of examples that can help you understand Numpy and Matplotlib

jupyter-notebook learning-by-doing numpy pandas python

Last synced: 18 Apr 2026

https://github.com/sibendud/csi_2024_dataengineering

Internship on Data Engineering where below topics are applied skills that are used to complete the given tasks through out 8 weeks including the project.

adf adls azure-pipelines databricks docker ipynb json numpy pandas-python parquet-avro pipeline pyspark python sql sql-server

Last synced: 07 Mar 2026

https://github.com/neofelis2x/gl3wxpyg

A playground and testing repo to implement OpenGL 3.3 in a WxPython interface using pyglet as an OpenGL-wrapper.

3d-graphics numpy opengl pyglet wxpython

Last synced: 11 May 2026

https://github.com/prarthana-singh/heart-attack-prediction-model

A Machine Learning model that predicts the risk of a heart attack based on health parameters like cholesterol levels, blood pressure, BMI, smoking habits, and age. Built using Classification models, Scikit-Learn, Pandas, and Python.

classification data-analysis data-science heart-attack-prediction logistic-regression machine-learning numpy pandas python scikit-learn

Last synced: 25 Jun 2025

https://github.com/docsallover/lbw-detection-in-cricket

LBW Detection in Cricket: A Deep Dive with OpenCV & NumPy

ball-detection cricket cvzone data-science lbw machine-learning numpy opencv python python3

Last synced: 08 May 2026

https://github.com/hansalemaos/colorcountcython

This module provides functionality to count unique colors in an image represented as a NumPy array and, optionally, group their coordinates. It includes a Cython-based implementation for improved performance.

colors colour colours count cv2 numpy

Last synced: 01 May 2026

https://github.com/akshaypatra/cardiovascular_disease_detection

AI-driven ECG classification model that detects cardiovascular abnormalities such as arrhythmia and atrial fibrillation using a hybrid CNN-LSTM deep learning approach.

keras matplotlib numpy pandas python3 scikit-learn seaborn tensorflow wfdb

Last synced: 14 Apr 2026

https://github.com/mayankmittal29/duplifinder-quora-clone-catcher

An advanced system for detecting semantically duplicate question pairs using cutting-edge NLP techniques. Combines traditional ML models (XGBoost, SVM, Random Forest) with deep learning architectures (BiLSTM, Siamese Networks, Transformers) and contextual embeddings (BERT, RoBERTa). Features engineered using token similarity, fuzzy matching, and em

bert bilstm cross-validation eda fastext fuzzy-matching glove numpy pandas python3 quora-question-pairs random-forest roberta seaborn stemming svm tf-idf transformers word2vec xgboost

Last synced: 15 Apr 2026

https://github.com/chandkund/titanic-survival-prediction

Welcome to the Titanic Survival Prediction project! 🌊 This project predicts passenger survival on the Titanic using machine learning models. By analyzing features like class, gender, and age, we aim to uncover the factors that influenced survival rates during this historic event. Dive in to explore the insights and models developed!

data-science machine-learning matplotlib numpy python seaborn sklearn visualization

Last synced: 07 Apr 2026

https://github.com/kanika300393/loan_prediction

This project implements a Loan Prediction system using Support Vector Machine (SVM). It includes data preprocessing, visualization of features like income and education, and model evaluation. The goal is to predict loan approval based on the dataset. Clone the repo to explore the code and improve the model.

data-science machine-learning numpy pandas python scikit-learn svm-classifier

Last synced: 09 Apr 2026

https://github.com/tigureis/data-preparation-from-kickstarter-campaigns

Kickstarter Data Prep: A hands-on guide to basic data cleaning and transformation.

data-cleaning data-construction data-integration data-science data-selection numpy pandas

Last synced: 19 Apr 2026

https://github.com/hoyirul/flask_test

This is a Python programming using flask and has been set up with the MVC concept and can connect to the database

flask mysql numpy python3 restful-api

Last synced: 07 Apr 2026

https://github.com/itzzmeakhi/ab-testing

This project was done as a part of Udacity's Data Analyst Nanodegree program in which AB-Testing is performed for the users under treatment and control groups to predict whether changes should be done for an website or it is ideal to keep the existing one, based on the performance of old and new pages. Hypothesis testing, z-test, and Logistic Regression is performed to predict results.

bootstrapping hypothesis-testing jupyter-notebook matplotlib numpy pandas python statistics z-test

Last synced: 04 May 2026

https://github.com/simon2k/stock-price-prediction-evaluation

This project is indented to present a small evaluation of different types of regression models for predicting stock prices for AAPL.

evaluation machine-learning numpy pandas predicting-stock-prices scikit-learn

Last synced: 07 Apr 2026

https://github.com/faraazarsath/handson-numpy

This respository is on various Numpy Tasks provided as academic assignment

numpy

Last synced: 16 May 2026

https://github.com/manikantasanjay/emotion_prediction_kaggle_cnn

Real time Facial Emotion Recognition System with CNN Model Architecture making use of Transfer Learning, Data Augmentation, and Regularization Techniques.

cnn convolutional-neural-networks dataset deep-learning jupyter-notebook matplotlib numpy pandas python3 pytorch transfer-learning

Last synced: 11 Apr 2026

https://github.com/satheesh-meadi/real_time_financial_risk_dashboard

Financial Risk Analysis Dashboard 🚀. An interactive Streamlit dashboard designed for analyzing and visualizing portfolio performance. Features include CAPM analysis, portfolio optimization, efficient frontier visualization, and real-time stock data to help optimize investments.

numpy pandas plotly plotly-express python3 scikit-learn streamlit yfinance

Last synced: 05 Apr 2026

https://github.com/youngermaster/coulomb-superposition-principle

Principio de Superposición de Coulomb y Ley de Biot-Savart

coulomb-law matplotlib numpy simulation

Last synced: 18 May 2026

https://github.com/dineshdhamodharan24/data_science_final_project

Customer Insights & Recommendation System: Harnessing Decision Tree, Logistic Regression, and Random Forest models for behavior analysis. Utilizing EasyOCR and Python Imaging Library for image information extraction. Employing NLTK for sentiment analysis on textual data

classification final-project guvi-projects image-processing nltk-python numpy ocr pandas recommendation-system sentiment-analysis sklearn-library streamlit text-processing worldcloud

Last synced: 02 May 2026

https://github.com/dmarks84/coursework_project_data-analysis-apache-spark

Project for IBM Data Engineering & Python course on ETL & Big Data -- Read in data, wrote to SQL database and performed queries, performed statistical analysis and issued reports

apache-sprk automation dag data-modeling eda elt etl numpy pandas pipelines python sql statistics visualization

Last synced: 11 Apr 2026

https://github.com/jcardonamde/food-sales-predictions

A project that seeks to predict the sales of a food distributor by applying machine learning algorithms and data analysis.

data-science machine-learning machine-learning-algorithms numpy pandas pipelines python seaborn sklearn

Last synced: 05 Apr 2026

https://github.com/audrbar/py_site

Website for displaying Data Science Projects build on Streamlit

matplotlib numpy pandas python3 requests streamlit

Last synced: 11 Apr 2026

https://github.com/aysh2603/credit-card-fraud-detection

The Credit Card Fraud Detection project aims to identify fraudulent transactions from a dataset of credit card transactions. The project addresses the challenge of class imbalance and employs advanced machine learning techniques to build an effective fraud detection model.

ensemble-learning hyperparameter-tuning numpy pandas python3 scikit-learn streamlit

Last synced: 04 Feb 2026

https://github.com/aqueeqazam/a-deep-dive-introduction-to-tensorflow

TensorFlow is a deep learning system that provides an interface for expressing deep learning algorithms and implementing them. It uses dataflow graphs to represent computation, shared state, and the operations that mutate that state.

deep-learning neural-network numpy tensor tensorflow

Last synced: 03 Aug 2025

https://github.com/ramyacp14/sentimentanalysis

Implements a sentiment analysis model to determine the emotional tone behind text, helping understand attitudes, opinions, and emotions in online mentions.

machine-learning natural-language-processing nltk numpy pandas python scikit-learn

Last synced: 07 Apr 2026

https://github.com/sadegh-khedry/pneumonia-detection-using-cnn

Classifying chest X-ray images into two categories: Pneumonia or Normal using a Convolutional Neural Network (CNN).

cnn convolutional-neural-networks cv2 deep-learning harris-corners joblib jupyter-notebook keras kmeans-clustering matplotlib numpy orb pandas python seaborn shutil sifit tensorflow

Last synced: 11 Apr 2026

https://github.com/jainish-prajapati/solar-flare-prediction

This repository contains code and data for predicting solar flare energy ranges using machine learning, based on NASA's RHESSI mission data. It includes preprocessing of FITS files into a unified CSV dataset and implements models like Gradient Boosting, Random Forest, and Decision Tree classifiers, achieving accuracies up to 87%.

data-visualization machine-learning numpy pandas python scikit-learn solar-flare-prediction

Last synced: 30 Dec 2025

https://github.com/myahninsi/credit_card_fraud_detection

This repository is for the Neural Networks and Deep Learning Course - Assignment 1, focusing on credit card fraud detection. The project utilizes a machine learning model to predict whether a transaction is fraudulent using a synthetic credit card dataset.

matplotlib numpy pandas pickle python scikit-learn seaborn streamlit

Last synced: 09 Apr 2026

https://github.com/maxbleu/f1analisys

A Python repository for analyzing and visualizing Formula 1 data. Includes tools for race statistics, performance comparisons, and custom visualizations.

data-science f1 f1-analisis f1api fastf1 formula1-analysis formula1-data-analysis matplotlib-python motorsport numpy pandas python seaborn

Last synced: 12 Sep 2025

https://github.com/hellenilda/ml-housing-price-prediction

Projeto simples para fazer previsão de preços de casas usando Regressão Linear.

artificial-intelligence machine-learning matplotlib numpy python sklearn streamlit

Last synced: 05 Apr 2026

https://github.com/andreazoccatelli/light_permanova

A lightweight implementation of PERMANOVA based on Euclidean distance from centroid

computervision numpy permanova pytorch statistics tabular-data

Last synced: 08 May 2026

https://github.com/philcs/electromag

Électromagnétisme, Université Laval, 2017

matplotlib numpy python

Last synced: 18 May 2026

https://github.com/subhashpolisetti/crispdm-semma-kdd-workflows

This repository demonstrates three data mining methodologies applied to various real-world datasets: CRISP-DM (Weather Analysis), KDD (Social Media Ads Analysis), and SEMMA (Spotify Recommendation System). Each project includes data exploration, preprocessing, modeling, and evaluation steps, along with comprehensive documentation, supporting files,

clustering-algorithm crisp-dm kdd latex-template machine-learning numpy pandas random-forest research-paper semma

Last synced: 07 May 2026

https://github.com/mohammedrashithkp/drlforsnakegame

This project utilises a Deep Reinforcement Learning specifically Q learning implemented using PyTorch which learns the game by interacting with the pygame environment .I have followed a tutorial on Freecodecamp YouTube Channel and made the initial model.

deep-reinforcement-learning matplotlib-pyplot numpy pygame python

Last synced: 16 May 2026

https://github.com/mah-22/room-occupancy-prediction-using-environmental-sensor-data

This project uses environmental sensor data to predict room occupancy, providing valuable insights for efficient energy management and space utilization in buildings. By analyzing factors like temperature, humidity, and light levels, the model aims to accurately forecast when rooms will be occupied, optimizing resources and enhancing overall buildi

classification data-science data-visualization exploratory-data-analysis machine-learning numpy pandas python seaborn time-series

Last synced: 07 May 2026

https://github.com/ireneflorez/sf-kindergarten-lottery

Data analysis reviewing data and findings on the public school kindergarten lottery in San Francisco

ipynb-jupyter-notebook jupyter-notebook numpy pandas python sql

Last synced: 17 Feb 2026

https://github.com/exclowd/gg

A library for creating terminal based games in python.

numpy python

Last synced: 18 May 2026

https://github.com/evernext10/coursera_capstone

This repository is created for sending the IBM Data Science Lab and will be used primarily for the Capstone project.

coursera-data-science ibm numpy pandas

Last synced: 10 May 2026

https://github.com/muzahid0408/student_marks_analysis_using_numpy

This project analyzes a student marks dataset to calculate key statistical measures such as mean, median, standard deviation, and more.

datasets jupyter-notebook numpy numpy-arrays pythoh

Last synced: 04 May 2026

https://github.com/gdg-a-coruna/inteligencia-artificial-en-la-deteccion-del-cancer

Repositorio con recursos sobre la charla Build with AI: Inteligencia Artificial en la Detección del Cáncer

ann breast-cancer breast-cancer-prediction breast-cancer-wisconsin keras lasso matplotlib ml numpy pandas python r svm tensorflow

Last synced: 17 Feb 2026

https://github.com/wbadart/simplegcn

Basic implementation of a Graph Convolutional Network

data-science gcn neural-network numpy

Last synced: 18 May 2026

https://github.com/karthikarajagopal44/data-analysis-using-python-libraries-

The COVID-19 pandemic has significantly impacted India, necessitating a detailed analysis of the virus’s spread within the country. In this project, we explore an India-specific COVID-19 dataset, leveraging Python libraries such as Pandas, NumPy, Matplotlib, and Seaborn.

data-cleaning data-visualization matplotlib numpy pandas python python3 scikit-learn seaborn

Last synced: 07 Apr 2026

https://github.com/abhisek-13/diabetes-prediction

The Diabetes Prediction Web App uses machine learning to predict if someone has diabetes based on their health data. Trained on the Kaggle diabetes dataset, it offers real-time predictions through a user-friendly Streamlit interface.

machine-learning matplotlib numpy pandas python sklearn streamlit

Last synced: 11 Apr 2026

https://github.com/fahrettinsolak/ai-map-based-geographic-clustering-project

This project focuses on clustering crime incidents in San Francisco using the K-Means algorithm. The dataset is obtained from Kaggle and contains information about crime types, geographical coordinates, and other relevant features. The goal is to identify crime hotspots through geographic clustering and visualize the clusters on an interactive map.

artificial-intelligence deep-learning elbow-method jupyter-notebook machine-learning numpy openstreetmap pandas phyton plotly scikit-learn standardscaler

Last synced: 05 Apr 2026

https://github.com/crafterkolyan/eigenvalues-speed-comparison

Сравнение скорости вычисления собственных значений положительно определённых матриц. Проект по спецкурсу "Высокопроизводительные вычисления на платформе Python". Основной результат - Python с JIT компилятором обгоняет нераспараллеленный C++ по скорости.

cpp eigenvalues numba numpy python speed university

Last synced: 10 Jul 2025

https://github.com/mr-talhailyas/histogram-equalization-and-contrast-stretching

Histogram equalization and contrast stretching without using builtin library functions.

contrast-stretching histogram-equalization image-enhancement image-processing numpy

Last synced: 29 Apr 2026

https://github.com/axelsgit/fashion-recommender

This project builds a hybrid and visual recommender system for (luxury) fashion products, simulating a real-world e-commerce scenario (e.g., Zalando, Farfetch, Fashion Nova). It integrates visual filtering, collaborative filtering, content-based filtering, and hybrid models, enriched with user metadata and interaction recency.

deepfashion fashion fastapi numpy pandas python resnet-50 sklearn streamlit tensorflow ui-design

Last synced: 07 Apr 2026

https://github.com/anufrievsy/man_women

Данный скрипт был создан с целью поиска субъективно интересных серий ТВ-шоу "Мужское Женское"

bs4 clear-output fuzzywuzzy ipython ipython-display jupyter jupyter-notebook jupyter-notebooks matplotlib matplotlib-pyplot numpy pandas pandas-dataframe python requests time youtube youtube-search

Last synced: 05 Feb 2026

https://github.com/zainulmustafa/home-neural-net-gui-using-flask

I am working on a project that I call HNN whose GUI is created using Flask framework to give multiple users the access of controlling components over the same connection.

flask flask-api flask-server html5 numpy python3

Last synced: 05 May 2026

https://github.com/shwetapardhi/assignment-1-q7--basic-statistics-level-1

Q7) Calculate Mean, Median, Mode, Variance, Standard Deviation, Range & comment about the values / draw inferences, for the given dataset For Points,Score,Weigh> Find Mean, Median, Mode, Variance, Standard Deviation, and Range and also Comment about the values/ Draw some inferences. Use Q7.csv file

mean median mode numpy pandas python range standard-deviation variance

Last synced: 05 May 2026

https://github.com/jhaabhijeet864/numpy_for_machine_learning

It's a collection of python notebooks comprising of learning materials of numpy for machine learning

notebooks-jupyter numpy numpy-arrays numpy-library python

Last synced: 18 May 2026

https://github.com/amnydv17/algorithm-visualizer

This project aims to offer a visual representation of how different algorithms work, making it easier for developers, students, and enthusiasts to understand their functionalities and behaviors through graphical demonstrations.

matplotlib numpy python3 seaborn

Last synced: 18 May 2026

https://github.com/pradyumnakr/facial-recognition-using-opencv

Uses haarcascades to recognize facial features, patterns and performs feature extraction followed by labelling the data to recognize the faces on a live webcam.

numpy opencv python3

Last synced: 09 May 2026

https://github.com/pzuehlke/hands-on-numpy

An introduction to NumPy for undergraduate students, focusing on basic linear algebra

linear-algebra numpy numpy-tutorial python scipy

Last synced: 18 Apr 2026

https://github.com/ljadhav25/django-data-analyzer

Django Data Analyzer is a web application built using the Django framework, designed to streamline data analysis tasks. Users can upload CSV files containing data for analysis. The application utilizes the powerful data manipulation capabilities of Python libraries like pandas and numpy to perform various analyses on the uploaded data.

data-analysis data-visualization django-application matplotlib numpy pandas python seaborn

Last synced: 01 Mar 2026

https://github.com/venky-1710/superhero-recruitment

Superhero Recruitment System predicts hero selection using machine learning. Users input hero attributes through a web interface. A Random Forest model analyzes abilities, strengths, weaknesses, success rates, and missions completed. The Flask app displays results, showing if a hero is selected.

css flask html numpy pandas python scikit-learn

Last synced: 07 Apr 2026

https://github.com/gehad-ahmed30/hand-gesture-volume-control

This repository features innovative projects using OpenCV, highlighting the power of computer vision. With detailed documentation and source code, developers can easily learn various techniques. Explore to enhance your understanding of image processing and object detection! 🚀

mediapipe numpy opencv pyautogui python

Last synced: 08 Apr 2026

https://github.com/akku-1206/spotify_data_analysis

Spotify Tracks Analysis Overview This project performs an exploratory data analysis (EDA) on Spotify music data to uncover insights and correlations related to song features, popularity, and genre.

matplotlib numpy pandas python seaborn

Last synced: 13 Apr 2026

https://github.com/saket-sk/data-science-lab

List of Assignments I have performed as a part of Honorary Course of Data Science and Machine Learning Lab.

data-cleaning data-science jupyter-notebook jupyter-notebook-extension numpy numpy-arrays pandas pandas-dataframe pandas-python python

Last synced: 16 Apr 2026

https://github.com/piyushamukherjee/fuxi

– A Cascade Machine Learning Forecasting System for 15-Day Global Weather Prediction

ml numpy pandas tensorflow weather-forecast

Last synced: 13 Apr 2026

https://github.com/yadavkaushal/datascience-e-commerce-shopping-details

This project analyzes customer purchase data including details such as location, company, credit card usage, browser info, job roles and purchase price. It explores patterns in payment methods, spending behavior and online transactions. Using Pandas, Matplotlib and Seaborn, we clean analyze and visualize key trends to derive actionable insights.

data datacleaning dataframe datapreprocessing dataset libraries matplotlib numpy pandas plots visulaization

Last synced: 06 May 2026

https://github.com/moritzsalla/cci-advanced-visualization

Classwork for Mick Grierson's module at the Creative Computing Institute

matplotlib numpy openframeworks python tensorflow

Last synced: 06 May 2026

https://github.com/nambukeerthi/phonepe_project

This project will create a live geo-visualization dashboard using data from the PhonePe Pulse GitHub repository.

matplotlib numpy pandas python sklearn sql streamlit

Last synced: 07 Apr 2026

https://github.com/tapas-gope/telecommunication-customer-churn

This project involves predicting customer churn in a telecommunications company using machine learning techniques, exploring various features' impact, optimizing models, and identifying key factors influencing churn.

feature-engineering matplotlib-pyplot model-evaluation-and-validation numpy pandas python scikit-learn

Last synced: 12 Sep 2025