An open API service indexing awesome lists of open source software.

NumPy

NumPy is an open source library for the Python programming language, adding support for large, multidimensional arrays, and matrices, along with a large collection of high-level mathematical functions to operate on these arrays.

https://github.com/kirtipratihar/python_libraries_for_ds

This repository serves as a comprehensive guide to Python programming for Data Science. It covers essential topics like data manipulation, data visualization, machine learning, and statistical analysis using popular libraries such as Pandas, NumPy, Matplotlib, Seaborn, and Scikit-Learn.

artificial-intelligence machine-learning numpy pandas python scikit-learn tensorflow

Last synced: 11 Apr 2026

https://github.com/alexixrugis/perceptronvisualization

Visualization of training and operation of a perceptron written from scratch in numpy

ai machine-learning numpy python

Last synced: 11 Feb 2026

https://github.com/sejalmankar1012/customer_churn

The goal of this project was to develop a machine learning model that predicts customer churn based on historical customer data.

dataset deep-learning deployment machine-learning model-building modeling numpy optimization pandas seaborn streamlit tensorflow

Last synced: 05 Apr 2026

https://github.com/t-lak/decision-tree

This project implements a basic Decision Tree classifier. It supports visualizing the tree and calculating performance metrics (accuracy, precision, F1-score, and recall).

decision-trees graphviz metrics numpy python3 scikit-learn

Last synced: 07 May 2026

https://github.com/harsha-yuvaraj/Data-Compression-and-Decompression-Tool

A specialized lossless file compression and decompression tool designed mainly for plain text, including programming files.

data-structures huffman-compression-algorithm lossless-compression-algorithm numpy python tkinter

Last synced: 02 May 2025

https://github.com/vidushibhadana/eda-on-nyc-taxi-data

About Conducting an Exploratory Data Analysis (EDA) on New York City taxi data and visualizing it through countplots, distribution plots (displot), and histograms using Python and it's libraries.

data data-visualization jupyter-notebook matplotlib numpy pandas python seaborn

Last synced: 11 Apr 2026

https://github.com/elijahondiek/shadowcoach-formtracker

A computer vision-based workout form tracking application that uses AI to monitor exercise form, count repetitions, and provide real-time feedback.

mediapipe-pose numpy opencv-python

Last synced: 18 Apr 2026

https://github.com/volf52/deep-neural-net

A simple deep neural net class written to work with Numpy and Cupy

binarized-neural-networks binary-neural-networks bnn cupy deep-learning deep-neural-networks mnist numpy python python3

Last synced: 05 May 2026

https://github.com/justinzhang17/comp-5801-queens

Reinforcement learning task of teaching an agent to play the game queens/star-battle

gymnasium numpy pygame python

Last synced: 15 May 2026

https://github.com/petridhsg/firewall-data-classification

A single implementation of a machine learning algorithm for a firewall data classification task

machine-learning matplotlib numpy python seaborn

Last synced: 01 May 2026

https://github.com/youssefali11997/100-numpy-exercises

100 exercise on NumPy library. NumPy (Numerical Python) is a package used for scientific computing with Python.

data-science numpy python3

Last synced: 18 May 2026

https://github.com/ugurcan222/a-different-approach--image-enhancement-with-imputation-and-regression-methods

This experimental work presents a different approach to increase the size and quality of an image by adding a blank pixel around each pixel in an image, enlarging the image, breaking it into parts, and generating these blank pixels by predicting them with models.

ai-image-upscaling computer-vision digital-image-processing gradient-boosting image-analysis image-enhancement image-enlargement image-interpolation image-processing imputation knn machine-learning numpy opencv pixel-prediction python randomforest regression-models super-resolution xgboost

Last synced: 17 Jan 2026

https://github.com/0xkibh/books-recommendation-system

This is the website with collections of top readed books with the functionaliy of recommendation system. It basically recommends the book on the basis of given books. It also shows data of different books available.

bootstrap css flask font-awesome html numpy pandas pickle

Last synced: 07 Apr 2026

https://github.com/harmanveer2546/house-price-prediction-using-machine-learning

We all have experienced a time when we have to look up for a new house to buy. But then the journey begins with a lot of frauds, negotiating deals, researching the local areas and so on. So to deal with this kind of issues Today, I prepared a MACHINE LEARNING Based model, trained on the House Price Prediction Dataset.

catboost-classifier linear-regression machine-learning matplotlib numpy pandas python random-forest seaborn support-vector-machine

Last synced: 07 Apr 2026

https://github.com/ashishsingh789/titanic_dataset_eda_and_visualization

This repository contains an exploratory data analysis (EDA) of the Titanic dataset. Key analyses include survival rates by gender, passenger class, age distribution, family size, and correlation heatmaps.

data data-science dataanalysis matplotlib numpy pandas pandas-dataframe python seborn visualisation

Last synced: 11 Apr 2026

https://github.com/dineshdhamodharan24/singapore_flat_resale_

This project focuses on developing a machine learning model to predict the resale values of apartments in Singapore. The goal is to create a user-friendly online application that enables users to obtain accurate predictions for the resale values of specific properties.

data-analysis flat json numpy pandas pickle project python streamlit

Last synced: 07 Apr 2026

https://github.com/bishopce16/world_weather_analysis

The project’s objective is to improve a travel app to give customers a way to decide their travel destination and ideal hotel based on weather preferences.

apis citipy datetime gmaps json-traversal jupyter-notebook matplotlib numpy pandas python requests scipy

Last synced: 13 Apr 2026

https://github.com/brenderb/python-projects

General interest and school python projects I completed. Included are lab analysis files (with uncertainty analysis, fitting, and graphing), simulation files (visualization of electric+magnetic fields, etc), specific research project files (data extraction + processing for a building science lab, graphs for a medical physics paper) and misc files

csv fitting matlab matplotlib numpy os pandas pyplot python scipy uncertainty

Last synced: 05 Apr 2025

https://github.com/vbhvsingh0/fraudulent_transactions

A few models were developed based on Decision trees and Logistic Regression to categorize fraudulent transactions

credit-card-fraud data-science logistic-regression machine-learning numpy numpy-python pandas pandas-python python3 random-forest-classifier

Last synced: 09 May 2026

https://github.com/chenna7893/enhancing-customer-retention-a-machine-learning-approach-to-churn-prediction

Churn prediction using machine learning models like Random Forest, Decision Tree, and XGBoost involves identifying customers who are likely to stop using a company's services.

decision-tree-classifier matplotlib numpy pandas random-forest-classifier xgboost

Last synced: 02 May 2026

https://github.com/technologicat/orderfix

Reorder solutions of parametric studies to make continuous curves

cython eigenvalues numpy postprocessing python python2 python27 python3 python34

Last synced: 25 Jan 2026

https://github.com/harmanveer-2546/motor-vehicle-accidents-in-india

As per the report, a total of 4,61,312 road accidents have been reported by States and Union Territories (UTs) during the calendar year 2022, which claimed 1,68,491 lives and caused injuries to 4,43,366 persons.

accidents accidents-analysis darkgrid data-analysis eda exploratory-data-analysis indian-roads inline matplotlib motor-vehicles numpy pandas review seaborn visualization

Last synced: 19 Jan 2026

https://github.com/harmanveer-2546/guide-to-regularization

Regularization is a crucial technique in machine learning that helps to prevent overfitting. Overfitting occurs when a model becomes too complex and learns the training data so well that it fails to generalize to new, unseen data.

generalization inline l1 l2 matplotlib numpy overfitting overfitting-prevention overfitting-reduced pandas regularization regularization-methods regularization-techniques regularization-to-avoid-overfitting seaborn

Last synced: 05 Apr 2025

https://github.com/chandkund/predicting-heart-disease

Welcome to the Heart Disease Prediction project! 🩺 This project focuses on developing a predictive model to assess heart disease risk based on health indicators like age, cholesterol levels, and blood pressure. By analyzing these features, we aim to create an effective tool for early diagnosis and heart disease prevention

machine-learning matplotlib numpy pandas python seaborn sklearn

Last synced: 18 Jan 2026

https://github.com/aapo-kossi/christmas_tree_spectrogram

Colab notebook to generate a spectrogram visualization on a programmable led installation

colab-notebook numpy pandas python

Last synced: 07 Apr 2026

https://github.com/hansalemaos/stridesduplicatefinder

Calculate overlapping values between two arrays and return the results as a DataFrame

duplicates fast numexpr numpy strides

Last synced: 07 Mar 2026

https://github.com/amiridk/statics-and-algebra-in-python

simple and important statics operations and Famous statistical problems , also algebra operations

algebra itertools matplotlib numpy pandas python scipy seaborn static

Last synced: 24 Feb 2025

https://github.com/kisaa-fatima/object-detection-using-yolo-v8

From this project I gained practical experience in AI, computer vision, and GUI development, while also understanding the importance of automation in retail processes.

computer-vision gui numpy opencv-python pil roboflow tkinter yolov8

Last synced: 18 May 2026

https://github.com/niteshchawla/delhivery-featureengineering

The Data team builds intelligence and capabilities using this data that helps them to widen the gap between the quality, efficiency, and profitability of their business versus their competitors.

feature-engineering hypothesis-testing matplotlib-pyplot numpy pandas-python scipy-stats seaborn

Last synced: 14 May 2026

https://github.com/orthoin3d/django-ndarrayfield

New Django field to store numpy ndarray.

django django-fields numpy numpy-arrays

Last synced: 07 Mar 2026

https://github.com/05afreen/indian-job-market-analysis

This repository focuses on 🔍 scraping, 🧹 cleaning, and 📊 visualizing job market data in India. It automates job listing collection from platforms like 🌐 Naukri, Indeed, and LinkedIn using 🐍 BeautifulSoup/Selenium. The scraped data is then 🛠️ cleaned and standardized, handling ❌ missing values, 🏷️ formatting inconsistencies, and 🔎 extracting

numpy pandas powerbi selenium

Last synced: 11 Apr 2026

https://github.com/lmizner/codecademy_auto_evaluation

Summarized categorical variables in Python using numerical summary statistics.

categorical-data frequency jupyter-notebook median numpy pandas proportion python

Last synced: 07 May 2026

https://github.com/hansalemaos/ffmpeg_stream_to_numpy

ffmpeg stream to numpy arrays

cv2 ffmpeg numpy python stream video

Last synced: 29 Apr 2026

https://github.com/dhanish03/credit_card_fraud_detection

Developed and implemented an advanced CCFDS using ML algorithms and pattern recognition techniques. Integrated real-time monitoring and adaptive learning capabilities into the system to dynamically adjust fraud detection parameters, ensuring effectiveness in identifying emerging fraud patterns.

kaggle-dataset numpy pandas-dataframe python3 sklearn

Last synced: 16 Apr 2026

https://github.com/khushi130404/outlier_exterminator

Outlier_Exterminator is a Python tool for detecting and treating outliers using IQR, Z-Score, and Percentile methods. It supports trimming, capping, and Winsorization, demonstrated in a Jupyter Notebook.

capping iqr machine-learning numpy percentile-method trimming winsorization z-score

Last synced: 18 May 2026

https://github.com/chandkund/time-series-forecasting

A time series forecasting project using the Airline Passenger dataset. This project leverages ARIMA modeling to predict future passenger numbers by analyzing historical trends and seasonality from 1949-1960. The results demonstrate effective forecasting for better business planning.

arima-forecasting arima-model data-science numpy pandas python seaborn

Last synced: 07 Apr 2026

https://github.com/asuquoaa/energy-_gdp-_and_scimago_rankings_analysis-2006-2015

Analysis of top 15 countries in energy, renewables, and GDP using three global datasets.

data-cleaning numpy pandas

Last synced: 01 May 2026

https://github.com/grandmoff100/pygrids

Dimensional Data Manipulation and Spreadsheet-like pretty grids.

arrays grids hacktoberfest numpy python spreadsheet

Last synced: 18 May 2026

https://github.com/hamishivi/denserflow

A basic numpy neural net library. This achieved high marks in a uni assignment!

deep-learning neural-network numpy

Last synced: 18 May 2026

https://github.com/johanntf/direccion_maximo_crecimiento

Dirección de máximo crecimiento en una función multivariable dado un punto de partida

customtkinter matplotlib numpy python

Last synced: 18 May 2026

https://github.com/sanjanahombal/study-on-sentiment-analysis

This project explores the optimal combination of Bag-of-Words and TF-IDF vectorization with Naive Bayes and SVM for sentiment analysis. It evaluates performance using accuracy, precision, recall, and F1-score, addressing ethical concerns like data privacy and bias to improve sentiment classification in real-world applications.

bag-of-words confusionmatrix googlecollab gridsearch-crossvalidation matplotlib-pyplot naive-bayes-classifier numpy pandas seaborn sklearn svm-classifier tfidf-vectorizer

Last synced: 07 Jan 2026

https://github.com/sultanazhari/customer-habit-analysis-model

Megaline company wants to develop a model that can analyze consumer behavior and recommend one of Megaline's two new plans: Smart or Ultra. In this classification task, we need to develop a model that is able to choose the right package

accuracy-score decision-tree-classifier logistic-regression matplotlib-pyplot numpy pandas python3 random-forest-classifier seaborn train-test-using-sklearn

Last synced: 11 Apr 2026

https://github.com/werctfourth/python-border-autocrop

A Python script that crops borders from images.

border crop image-processing numpy pillow python python3

Last synced: 18 May 2026

https://github.com/santiagoenriquega/custom_neural_net

This repository focuses on building a neural network from scratch using only base python and Numpy.

gradient-descent neural-network numpy optimization python

Last synced: 18 May 2026

https://github.com/nambukeerthi/phonepe_project

This project will create a live geo-visualization dashboard using data from the PhonePe Pulse GitHub repository.

matplotlib numpy pandas python sklearn sql streamlit

Last synced: 07 Apr 2026

https://github.com/pd2871/streamlit_automl

Auto Exploratory Data Analysis using Streamlit App

matplotlib numpy pandas python streamlit streamlit-webapp

Last synced: 11 Apr 2026

https://github.com/yesenvidnath/facerecognito-pythonfacialrecognitionapplication

this is a web application created to detect faces using a pre-given image links through Excel format, this project uses Pytorch & Flask

facent flask numpy python

Last synced: 10 Apr 2026

https://github.com/biocpy/mopsy

Matrix operations

matrix numpy scipy

Last synced: 07 Mar 2026

https://github.com/piyushamukherjee/fuxi

– A Cascade Machine Learning Forecasting System for 15-Day Global Weather Prediction

ml numpy pandas tensorflow weather-forecast

Last synced: 13 Apr 2026

https://github.com/akku-1206/spotify_data_analysis

Spotify Tracks Analysis Overview This project performs an exploratory data analysis (EDA) on Spotify music data to uncover insights and correlations related to song features, popularity, and genre.

matplotlib numpy pandas python seaborn

Last synced: 13 Apr 2026

https://github.com/kr1shnasomani/facetronix

A toolkit for face detection, annotation, and mask recognition.

computer-vision deep-learning keras neural-network numpy opencv tensorflow

Last synced: 05 Apr 2026

https://github.com/gehad-ahmed30/hand-gesture-volume-control

This repository features innovative projects using OpenCV, highlighting the power of computer vision. With detailed documentation and source code, developers can easily learn various techniques. Explore to enhance your understanding of image processing and object detection! 🚀

mediapipe numpy opencv pyautogui python

Last synced: 08 Apr 2026

https://github.com/ljadhav25/django-data-analyzer

Django Data Analyzer is a web application built using the Django framework, designed to streamline data analysis tasks. Users can upload CSV files containing data for analysis. The application utilizes the powerful data manipulation capabilities of Python libraries like pandas and numpy to perform various analyses on the uploaded data.

data-analysis data-visualization django-application matplotlib numpy pandas python seaborn

Last synced: 01 Mar 2026

https://github.com/pzuehlke/hands-on-numpy

An introduction to NumPy for undergraduate students, focusing on basic linear algebra

linear-algebra numpy numpy-tutorial python scipy

Last synced: 18 Apr 2026

https://github.com/guifernandess7/ciencia-de-dados-e-ml-notebooks

Notebooks com uso dos módulos Pandas, Numpy e Matplotlib para análise de dados e criação de gráficos e tensorFlow e seaborn para machine learning e visualização

machine-learning matplotlib numpy pandas python tensorflow

Last synced: 04 Apr 2026

https://github.com/lgope/book-recommender-system

Book recommender system a Collaborative Filtering Based Recommender System using Machine Learning, Numpy, Pandas Python, Flask, Kaggle Dataset

collaborative-filtering flask html-css jupyter-notebook kaggle-dataset machine-learning numpy pandas python

Last synced: 13 Apr 2026

https://github.com/shwetapardhi/assignment-1-q7--basic-statistics-level-1

Q7) Calculate Mean, Median, Mode, Variance, Standard Deviation, Range & comment about the values / draw inferences, for the given dataset For Points,Score,Weigh> Find Mean, Median, Mode, Variance, Standard Deviation, and Range and also Comment about the values/ Draw some inferences. Use Q7.csv file

mean median mode numpy pandas python range standard-deviation variance

Last synced: 05 May 2026

https://github.com/mathensley/asimpleneuralnetworklib

A simple library in Python for training datasets using neural network models

deep-learning neural-networks numpy python

Last synced: 19 Apr 2026

https://github.com/blinksta1ker/spam-mail-classification

Supervised ML model using Logistic Regression model to classify an email as Ham or Spam

jupyter-notebook logistic-regression machine-learning numpy pandas python sklearn supervised-machine-learning

Last synced: 09 Apr 2026

https://github.com/mnitin-reddy/image-compression-using-k-means-clustering

This project implements the K-means algorithm for clustering and image compression. It reduces the number of colors in an image using K-means, achieving compression while maintaining key visual features. The project demonstrates the process on a sample dataset and a real image.

image-processing imagecompression kmeans-clustering machine-learning matplotlib numpy

Last synced: 19 May 2026

https://github.com/zainulmustafa/home-neural-net-gui-using-flask

I am working on a project that I call HNN whose GUI is created using Flask framework to give multiple users the access of controlling components over the same connection.

flask flask-api flask-server html5 numpy python3

Last synced: 05 May 2026

https://github.com/gilevatanya/yandex-practicum-projects

Кейсы решенные на курсах Яндекс Практикума.

bert bootstrap catboost keras lightgbm matplotlib nltk numpy pandas postgresql python pytorch scikit-learn scipy seaborn sql

Last synced: 06 Jan 2026

https://github.com/ebadshabbir/logistic_regression-binomial-

Logistic Regression on Social Network Ads Dataset This project applies Logistic Regression to predict whether a user will purchase a product based on their age and estimated salary, using the Social Network Ads dataset. The data is split into training and test sets, with feature scaling applied for normalization.

classification jupyter-notebook logistic-regression machine-learning matplotlib-pyplot numpy pandas python sklearn

Last synced: 11 Apr 2026

https://github.com/mr-talhailyas/histogram-equalization-and-contrast-stretching

Histogram equalization and contrast stretching without using builtin library functions.

contrast-stretching histogram-equalization image-enhancement image-processing numpy

Last synced: 29 Apr 2026

https://github.com/memgonzales/order-brushing-shopee2020

Solution to the Week 1 Contest (Student Category) of the 2020 Shopee Code League on order brushing detection

data-analytics e-commerce fraud-detection jupyter-notebook numpy order-brushing pandas python rule-based

Last synced: 11 Apr 2026

https://github.com/pramodyasahan/custom-image-classification-model

This repository hosts a TensorFlow-based custom image classification model. The model is designed to classify images into binary classes, labeled as 'Happy' and 'Sad'.

computer-vision convolutional-neural-networks cv2 keras matplotlib numpy tensorflow

Last synced: 11 Apr 2026

https://github.com/crafterkolyan/eigenvalues-speed-comparison

Сравнение скорости вычисления собственных значений положительно определённых матриц. Проект по спецкурсу "Высокопроизводительные вычисления на платформе Python". Основной результат - Python с JIT компилятором обгоняет нераспараллеленный C++ по скорости.

cpp eigenvalues numba numpy python speed university

Last synced: 10 Jul 2025

https://github.com/sagravela/plotground

Plotting Tool with Matplotlib and Tkinter. Video Demo:

matplotlib numpy python tkinter

Last synced: 19 May 2026

https://github.com/abhisek-13/diabetes-prediction

The Diabetes Prediction Web App uses machine learning to predict if someone has diabetes based on their health data. Trained on the Kaggle diabetes dataset, it offers real-time predictions through a user-friendly Streamlit interface.

machine-learning matplotlib numpy pandas python sklearn streamlit

Last synced: 11 Apr 2026

https://github.com/martinkersner/kmeans-meetup

Presentation about k-Means for Seoul AI Meetup on July 22, 2017.

kmeans numpy python scikit-learn

Last synced: 03 May 2026

https://github.com/kaikyMoura/BackCourt-api

Repository for the Basketball Advanced Stats API

fastapi nba-api nba-stats numpy pandas python swagger uvicorn

Last synced: 08 Apr 2026

https://github.com/chandkund/recipe-recommendations

The Recipe Recommendations project leverages natural language processing (NLP) techniques to process recipe descriptions and classify them into categories like desserts, main courses, and more. The system provides personalized recommendations by analyzing the text input and suggesting relevant recipes.

embeddings matplotlib nlp numpy pandas softmax tensorflow

Last synced: 06 Apr 2026

https://github.com/ahmedabdalkreem/preprocessing

All you need to be dataset ready to work in model without any Noise.

numpy pandas python3

Last synced: 07 Apr 2026

https://github.com/ashishsingh789/customer_purchase_prediction_using_decision-tree-_classifier

Decision Tree Classifier to predict customer purchases using demographic and behavioral data. Key steps: data preprocessing, EDA, model training, evaluation, and feature importance analysis.

data datascience desiciontree eda machine-learning-algorithms matplotlib numpy pandas-dataframe python seaborn

Last synced: 11 Apr 2026

https://github.com/costopoulos/ntua-telecommunications

:radio: :telephone_receiver: NTUA ECE Telecommunications Course Code & Report

numpy nyquist python sampling scipy telecommunications

Last synced: 20 Apr 2026