An open API service indexing awesome lists of open source software.

NumPy

NumPy is an open source library for the Python programming language, adding support for large, multidimensional arrays, and matrices, along with a large collection of high-level mathematical functions to operate on these arrays.

https://github.com/fjohnson87/planning-and-feasability

Data-driven feasibility analysis using pandas, seaborn, and FAO food supply data

matplotlib numpy pandas

Last synced: 04 May 2026

https://github.com/pyfig/s21_data-science-bootcamp

School21 Bootcamp Data Science

data data-science numpy pandas python school21

Last synced: 26 Jun 2025

https://github.com/MuhammedMorad/Egypt-Premier-Fashion-Sales-Analysis

Egypt’s Premier Fashion Sales Analysis is a data-driven project that provides interactive sales insights using Streamlit, Power BI, and Pandas. It analyzes sales trends, customer behaviors, and product performance while offering real-time dashboards and downloadable reports. helping businesses make informed, data-driven decisions. 🚀

matplotlib numpy pandas plotly python seaborn streamlit

Last synced: 30 Dec 2025

https://github.com/atharvkadammm/suicide-prediction-system

A machine learning project predicting suicide risk based on multiple socio-economic and environmental factors using data mining techniques.

csv data-analysis data-science data-visualization datamining exploratory-data-analysis feature-engineering machine-learnin matplotlib mental-health numpy pandas riskassesment seaborn sklearn suicide-prediction supervised-

Last synced: 01 Jul 2025

https://github.com/iankitnegi/datapy

Welcome to DATAPy! This repo contains my Python coding projects for data science, featuring data manipulation, analysis, and visualization. Stay tuned for updates and new content!

basic-programming matplotlib numpy pandas python seaborn

Last synced: 30 Apr 2026

https://github.com/atharvkadammm/calmlytic

An end-to-end machine learning project that predicts anxiety severity using classification models (Naive Bayes, Decision Tree, SVM, Logistic Regression, XGBoost), based on lifestyle, health, and behavioral features.

anxiety-prediction classification csv data-analysis data-preprocessing-and-cleaning data-science data-visualization ensemble-learning logistic-regression machine-learning-algorithms matplotlib mental-health numpy pandas python sci-kit-learn seaborn supervised-learning svm xgboost

Last synced: 21 Jun 2025

https://github.com/ola-kaznowska/first-steps-in-numpy

My first steps in the NumPy library

mathematics numpy python python3

Last synced: 20 May 2026

https://github.com/ayushhhh123/diwali_sales_analysis

This project provides a comprehensive analysis of Diwali sales data using Python. The primary goal is to derive business insights from customer purchasing behavior during the Diwali festival season. This analysis can help improve marketing strategies, customer targeting, and overall sales performance.

matplotlib numpy pandas python seaborn

Last synced: 07 May 2026

https://github.com/swarnabhaghosh/house-price-prediction-model

Built an end-to-end regression pipeline to predict house prices using Linear Regression with automated preprocessing (PowerTransform, StandardScaling) via Scikit-learn's Pipeline and ColumnTransformer.

column-transformer linear-regression matplotlib-pyplot numpy pandas pipeline python scikit-learn seaborn

Last synced: 11 Apr 2026

https://github.com/elifftosunn/textdataclean

Kirli veri çekildiğinde ön işleme adımlarına gerek kalmadan model eğitimi için hazır hale getirmek amacıyla yapılan uygulamadır.

corpus deasciifier morphological-analysis ngram nltk numpy pandas sentence-embedding sentence-tokenizer stemmer stopwords string turkish turkish-sentence-tokenizer word-tokenizer

Last synced: 20 May 2026

https://github.com/kr1shnasomani/genagenet

Gender and Age recognition from face images

computer-vision deep-learning neural-network numpy opencv

Last synced: 21 Jul 2025

https://github.com/kr1shnasomani/speedvision

Vehicle speed detection using Computer Vision, Deep Learning and Neural Network

bytetrack computer-vision deep-learning neural-network numpy opencv pytorch supervision tqdm yolo

Last synced: 05 Mar 2025

https://github.com/jol79/python_exercises

Solving interesting python exercises on different topics

matplotlib-pyplot numpy pandas python3 pythonexercises scikit-learn seaborn

Last synced: 10 Apr 2026

https://github.com/trivediayush/python-bootcamp

This repository contains code, notes, and examples from my journey learning Python and its essential libraries.

flask matplotlib numpy pandas python python3 pythonbootcamp

Last synced: 01 Jul 2025

https://github.com/anshpg/linearluminary

Greetings! I've developed a straightforward linear regression model from scratch to predict house prices in Bangalore. But before delving into coding, let me walk you through the algorithm's conceptualization. I considered various factors such as location, ocean proximity, plot size, finished state, and flat type.

algo linea mathematics matplotlib numpy pandas pyth

Last synced: 13 May 2026

https://github.com/pedronjorge/average-face-python

Generate an average face from a given set of images by detecting facial landmarks, aligning features, and blending the images using Delaunay Triangulation. Libraries used: Dlib, OpenCV, NumPy

dlib image-generation jupyter-notebook numpy opencv python3

Last synced: 21 Jun 2025

https://github.com/davgiles/ML-Projects

This repository contains my projects from the Data Science & Business Analytics Post-Graduate Program through UT Austin.

eda matplotlib numpy pandas python scikit-learn scipy seaborn visualization xgboost

Last synced: 20 Jan 2026

https://github.com/makoczoro/credit-default-risk-analysis-eda

This repository contains the detailed EDA Analysis of Home Credit Group Dataset. The analysis aims to find demographic and financial factors associated with higher or lower default risks, providing actionable insights for risk mitigation and improved lending practices

bivariate-analysis correlation-analysis data-preprocessing exploratory-data-analysis exploratory-data-visualizations matplotlib numpy pandas seaborn univariate-analysis

Last synced: 20 May 2026

https://github.com/pzuehlke/ml-with-numpy

Basic ML algorithms using only NumPy

deep-learning machinelearning numpy

Last synced: 20 May 2026

https://github.com/nik-kusanagi/jupyter

Treinamento Para Análise de Dados.

big-data jupyter jupyter-notebook matplotlib numpy pandas python python3

Last synced: 30 Apr 2026

https://github.com/viivianreine/carbon-footprint-calculator-using-advanced-machine-learning-cnn-

🌍 An AI-powered Carbon Footprint Calculator built with Python, TensorFlow (CNNs), and Streamlit. Delivers real-time emission predictions, visual insights, and country-wise comparisons from user lifestyle data. Adaptive, scalable, and designed to drive sustainability through intelligent software solutions.

api-based-ai-integration cloud-ready convolutional-neural-networks data-featuring data-preprocessing ipcc keras keras-tensorflow matplotlib model-optimization numpy pandas python realtime-database seaborn streamlit streamlit-deployment streamlit-webapp supervised-machine-learning tensorflow

Last synced: 08 Apr 2026

https://github.com/csengupta1101/python-concepts

All basic to advanced Python Concepts Will be covered here in depth.

data-science django dsa-algorithm flask machine-learning numpy pandas python rest-api software-development

Last synced: 08 Apr 2026

https://github.com/udayraj04/facialemotion_musicrecommendations

A web application that detects facial emotions via webcam and recommends music tracks that match the user’s mood in real-time. Built with Flask, TensorFlow/Keras, OpenCV, and a React frontend.

cnn flask machine-learning numpy opencv pil reactjs tensorflow

Last synced: 08 Apr 2026

https://github.com/sridharyadav07/ai--powered-task-management-system

An intelligent Task Management System that integrates Sentiment Analysis, Task Optimization, and Forecasting to streamline project and task handling. This AI-powered tool is designed to assist teams and project managers in making data-driven decisions by understanding emotional context, forecasting productivity, and optimizing workload distribution

arima flask joblib jupyter-notebook naive-bayes-classifier nltk numpy pandas pickle-file python randomforestregressor scikit-learn stopwords-removal streamlit tfidf-vectorizer

Last synced: 08 Apr 2026

https://github.com/s-h-i-v-i-s/bangalore-housing-price-prediction

A Machine Learning Project to predict Bangalore House Prices.

jupyter-notebook machine-learning matplotlib numpy pandas python pythonflask sklearn

Last synced: 08 Apr 2026

https://github.com/rdmolony/nix-numpy-financial

A nix derivation for numpy-financial - https://pypi.org/project/numpy-financial/#files

nix numpy python

Last synced: 05 Mar 2025

https://github.com/sumaiyyaf/spotify-data-analysis

This project explores Spotify music data, using Python to clean, analyze, and visualize the dataset. It examines the relationship between audio features and sentiment in song lyrics, uncovering patterns and trends to provide valuable insights into music preferences.

eda numpy pandas python

Last synced: 08 May 2026

https://github.com/annygutierrez/data-preprocessing

Preprocessing deep learning template.

matplotlib numpy pandas sklearn

Last synced: 02 May 2026

https://github.com/harrisonwelch/pythondatascience

Repo of code from the linked-in lesson "Python: Data Analysis"

data data-science matplotlib notes numpy python tutorial

Last synced: 12 Apr 2026

https://github.com/vlad1343/tools

This repository demonstrates the acquisition of advanced Python programming skills through structured exercises and projects, with a strong emphasis on NumPy for array manipulation, slicing, masking, and reshaping

jupyter-notebook numpy numpy-library numpy-python pandas pandas-python python python3

Last synced: 08 Apr 2026

https://github.com/okoge-kaz/deep_learning

O'REILY ゼロから作るDeep Learning

deep-learning neural-network numpy pandas perce python3

Last synced: 20 Apr 2026

https://github.com/byte-sourcerer/array_contract

Contract programming for Pytorch, Numpy

contracts ndarray numpy python python3 pytorch tensor

Last synced: 14 Sep 2025

https://github.com/aritrakar/ml-from-scratch

🤖Common ML implementations from scratch.

machine-learning numpy pandas python pytorch

Last synced: 07 Apr 2026

https://github.com/dmarks84/coursework_capstone_spacex_predictions

Final Project for IBM Data Science Professional Certificate -- Applied all skills and methods utilized in the series of courses for this certification to predict the success of SpaceX landings; issued full report to stakeholders

api classification dash eda folium linear-algebra matplotlib mysql numpy pandas plotly probability python seaborn sql statistics supervised-ml technical-writing web-scraping

Last synced: 08 Apr 2026

https://github.com/chinmayee4/diwali_sales_analysis_using_python

Analyze Diwali Sales Data to improve customer experience and sales.

data-science exploratory-data-analysis matplotlib numpy pandas python seaborn

Last synced: 08 Apr 2026

https://github.com/themysterysolver/python_basics

"This repo includes simple Python programs for beginners, covering various concepts, object-oriented programming, and different module usages, to help with understanding and learning Python."

basic-python-syntax collections dsa matplotlib numpy oops-in-python pandas python re scipy

Last synced: 05 Apr 2026

https://github.com/1adore1/face-analysis

An application for face recognition and prediction of gender, age range and emotional state.

numpy opencv pandas python pytorch

Last synced: 07 Apr 2026

https://github.com/mikhailkhorokhorin/the-loop

Student project of team "полетики-самолетики"

3dsmax krpc ksp math matplotlib numpy python

Last synced: 20 May 2026

https://github.com/ksharma67/eda-on-ipl

In this python notebook, analysis of IPL matches from 2008 to 2020 is done using python packages like pandas, matplotlib and seaborn.

data-analysis data-science eda matplotlib numpy pandas python seaborn

Last synced: 07 May 2026

https://github.com/sahil210695/gradient-descent

A simplified explanation of gradient descent for linear regression in python using numpy

gradient-descent gradient-descent-algorithm linear-regression matplotlib mini-batch-gradient-descent numpy python stochastic-gradient-descent

Last synced: 03 May 2026

https://github.com/mogankumar/covid-19-classification

A deep learning-based project that classifies chest X-ray images into Normal, COVID-19, and Viral Pneumonia using a two-layer CNN architecture. Achieves high accuracy and demonstrates potential for aiding in early diagnosis and treatment of COVID-19.

keras-tensorflow matplotlib numpy python3

Last synced: 11 May 2026

https://github.com/redinfinitypro/redditbot

Rating: (7/10) This script collects, preprocesses, trains models, processes images, and handles files, handling data from Reddit, image processing, and file handling.

autocorrect bytesio concurrent nltk numpy openpyxl pandas pil praw random re requests secrets sklearn string tensorflow time urllib

Last synced: 08 Apr 2026

https://github.com/andriikot/smartmatch

Smart Match with python

docker faiss huggingface numpy ptyhon pythorch

Last synced: 04 Apr 2026

https://github.com/matheusotenio/oficina-python

https://oficina-python.streamlit.app/

matplotlib numpy python streamlit

Last synced: 27 Jun 2025

https://github.com/michaeldehaney94/opencv

Python OpenCV computer vision projects

matplotlib numpy opencv-python python

Last synced: 20 Jun 2025

https://github.com/dulajkavinda/diabetes-detection

💊detect diabetes using machine learing

added machinelearning numpy pandas python scikit

Last synced: 14 Sep 2025

https://github.com/lorenzorottigni/ml-interview

Machine Learning python bootcamp: K Nearest Neighbors job interview simulation dataset

ipynb k-nearest-neighbours machine-learning numpy pandas python scikit-learn seaborn

Last synced: 07 Apr 2026

https://github.com/sandeepannandi/slicecam

SliceCam is a fun, camera-controlled game powered by OpenCV and Mediapipe that lets you slash flying fruits with real-time hand tracking.

collision-detection handtracking mediapipe mediapipe-hands motiondetection numpy opencv pygame python

Last synced: 06 May 2026

https://github.com/itzzmeakhi/ab-testing

This project was done as a part of Udacity's Data Analyst Nanodegree program in which AB-Testing is performed for the users under treatment and control groups to predict whether changes should be done for an website or it is ideal to keep the existing one, based on the performance of old and new pages. Hypothesis testing, z-test, and Logistic Regression is performed to predict results.

bootstrapping hypothesis-testing jupyter-notebook matplotlib numpy pandas python statistics z-test

Last synced: 04 May 2026

https://github.com/faraazarsath/handson-numpy

This respository is on various Numpy Tasks provided as academic assignment

numpy

Last synced: 16 May 2026

https://github.com/aysh2603/credit-card-fraud-detection

The Credit Card Fraud Detection project aims to identify fraudulent transactions from a dataset of credit card transactions. The project addresses the challenge of class imbalance and employs advanced machine learning techniques to build an effective fraud detection model.

ensemble-learning hyperparameter-tuning numpy pandas python3 scikit-learn streamlit

Last synced: 04 Feb 2026

https://github.com/abynxv/automatedexamsheetgrader-ocr

This Python project automates grading of handwritten or printed answer sheets using OCR (Optical Character Recognition). It extracts student answers from scanned images, compares them to an answer key, and generates a detailed report with class statistics.

numpy ocr-recognition opencv pandas python3 tesseract

Last synced: 27 Jun 2025

https://github.com/aqueeqazam/a-deep-dive-introduction-to-tensorflow

TensorFlow is a deep learning system that provides an interface for expressing deep learning algorithms and implementing them. It uses dataflow graphs to represent computation, shared state, and the operations that mutate that state.

deep-learning neural-network numpy tensor tensorflow

Last synced: 03 Aug 2025

https://github.com/sadegh-khedry/pneumonia-detection-using-cnn

Classifying chest X-ray images into two categories: Pneumonia or Normal using a Convolutional Neural Network (CNN).

cnn convolutional-neural-networks cv2 deep-learning harris-corners joblib jupyter-notebook keras kmeans-clustering matplotlib numpy orb pandas python seaborn shutil sifit tensorflow

Last synced: 11 Apr 2026

https://github.com/subhashpolisetti/crispdm-semma-kdd-workflows

This repository demonstrates three data mining methodologies applied to various real-world datasets: CRISP-DM (Weather Analysis), KDD (Social Media Ads Analysis), and SEMMA (Spotify Recommendation System). Each project includes data exploration, preprocessing, modeling, and evaluation steps, along with comprehensive documentation, supporting files,

clustering-algorithm crisp-dm kdd latex-template machine-learning numpy pandas random-forest research-paper semma

Last synced: 07 May 2026

https://github.com/mohammedrashithkp/drlforsnakegame

This project utilises a Deep Reinforcement Learning specifically Q learning implemented using PyTorch which learns the game by interacting with the pygame environment .I have followed a tutorial on Freecodecamp YouTube Channel and made the initial model.

deep-reinforcement-learning matplotlib-pyplot numpy pygame python

Last synced: 16 May 2026

https://github.com/mah-22/room-occupancy-prediction-using-environmental-sensor-data

This project uses environmental sensor data to predict room occupancy, providing valuable insights for efficient energy management and space utilization in buildings. By analyzing factors like temperature, humidity, and light levels, the model aims to accurately forecast when rooms will be occupied, optimizing resources and enhancing overall buildi

classification data-science data-visualization exploratory-data-analysis machine-learning numpy pandas python seaborn time-series

Last synced: 07 May 2026

https://github.com/joeylemon/python-kmeans

an implementation of the K-means clustering algorithm in Python

k-means-clustering matplotlib ml numpy python

Last synced: 21 May 2026

https://github.com/rtmigo/markov_walk_py

🔢 Python module that calculates probabilities for a random walk in 1-dimensional discrete state space

absorbing-markov-chains absorbing-states markov-chain mathematics numpy probability probability-theory random-walk stochastic-matrix stochastic-models

Last synced: 21 May 2026

https://github.com/rahatmoktadir03/customer-churn-prediction

A machine learning project for predicting customer churn, enabling businesses to identify at-risk customers and develop retention strategies.

business-analytics customer-churn-prediction data-science google-colab machine-learning numpy pandas python sklearn strreamlit xception-model

Last synced: 11 Apr 2026

https://github.com/khaymanii/spam_mail_detection_model

This model was built using Python and Logistics Regression algorithm

matplotlib numpy pandas python sckiit-learn

Last synced: 10 Apr 2026

https://github.com/evernext10/coursera_capstone

This repository is created for sending the IBM Data Science Lab and will be used primarily for the Capstone project.

coursera-data-science ibm numpy pandas

Last synced: 10 May 2026

https://github.com/tushar365/titanic---machine-learning-from-disaster

"Titanic: Machine Learning from Disaster" is a classic Kaggle competition for beginners https://www.kaggle.com/competitions/titanic. The goal is to use machine learning to predict which passengers survived the sinking of the Titanic based on historical data. This teaches data analysis and model building skills in a real-world context.

data-science dicision-tree logistic-regression machine-learning matplotlib-pyplot numpy pandas python seaborn

Last synced: 19 Apr 2026

https://github.com/khaja-shaik-21/heart-disease-prediction-system

This form allows users to enter key health details like age, blood pressure, cholesterol levels, and exercise results to predict the likelihood of heart disease. The data is submitted to the backend for processing, where a machine learning model provides a prediction. The form is styled for a clean and responsive user experience.

css3 flask-application git html5 logestic-regression numpy pandas python3 scikit-learn

Last synced: 12 Apr 2026

https://github.com/anufrievsy/man_women

Данный скрипт был создан с целью поиска субъективно интересных серий ТВ-шоу "Мужское Женское"

bs4 clear-output fuzzywuzzy ipython ipython-display jupyter jupyter-notebook jupyter-notebooks matplotlib matplotlib-pyplot numpy pandas pandas-dataframe python requests time youtube youtube-search

Last synced: 05 Feb 2026

https://github.com/abidhasanrafi/pharma-sales-analytics

A Streamlit-powered web application for analyzing pharmaceutical sales performance across teams, products, and territories.

matplotlib numpy pandas plotly sales-analysis scikit-learn seaborn streamlit

Last synced: 08 Apr 2026

https://github.com/pradyumnakr/facial-recognition-using-opencv

Uses haarcascades to recognize facial features, patterns and performs feature extraction followed by labelling the data to recognize the faces on a live webcam.

numpy opencv python3

Last synced: 09 May 2026

https://github.com/anubhavkumar31/simple-heart_disease_prediction-using-logisticregression

Its a simple yet good model which predicts if a person have heart disease or not. This is a binary classification model i.e its output is either 0(dont have heart disease) or 1 (have heart disease).

logistic-regression machine-learning numpy python sklearn sklearn-linear-model sklearn-metrics

Last synced: 10 Apr 2026

https://github.com/moritzsalla/cci-advanced-visualization

Classwork for Mick Grierson's module at the Creative Computing Institute

matplotlib numpy openframeworks python tensorflow

Last synced: 06 May 2026

https://github.com/winterwind/ecg_signal_classification

Two-part project that involves detecting the R-peaks in an ECG signal to extract the individual ECG beats and making a machine learning model to classify them

csv csv-files data-science decision-trees ecg ecg-classification ecg-signal jupyter jupyter-notebook knearest-neighbors knn machine-learning matplotlib matplotlib-pyplot numpy pandas pyplot python random-forest scipy

Last synced: 11 Apr 2026

https://github.com/mohammedsaqibms/planar_data_classification_with_onehidden_layer

This repository implements a simple neural network for binary classification of 2D planar data using Python and NumPy. It compares logistic regression with neural networks and includes code for forward/backward propagation, gradient descent, and decision boundary visualization.

backpropagation binary-classification gradient-descent logistic-regression machine-learning neural-networks numpy python3

Last synced: 10 Jul 2025

https://github.com/praju-1/data_science_projects

It contains the necessary code, datasets, and documentation to understand, replicate, and build upon the project's findings and methodologies.

data-science datasets deep-learning exploratory-data-analysis machine-learning matplotlib numpy pandas predictive-analytics python seaborn sklearn statistics visualization

Last synced: 08 Apr 2026