An open API service indexing awesome lists of open source software.

NumPy

NumPy is an open source library for the Python programming language, adding support for large, multidimensional arrays, and matrices, along with a large collection of high-level mathematical functions to operate on these arrays.

https://github.com/subhadipsinha722133/credit-card-fraud-dection

Web application for detecting fraudulent credit card transactions using machine learning

data-visualization fraud-detection machine-learning matplotlib numpy pandas seborn sklearn streamlit

Last synced: 10 Apr 2026

https://github.com/harsha-yuvaraj/data-compression-and-decompression-tool

A specialized lossless file compression and decompression tool designed mainly for plain text, including programming files.

data-structures huffman-compression-algorithm lossless-compression-algorithm numpy python tkinter

Last synced: 15 Apr 2026

https://github.com/krook1024-uni-archive/kmeans

kmeans implementation for a university course

kmeans kmeans-clustering machine-learning numpy python

Last synced: 02 May 2026

https://github.com/bharath-tars/streamlit_diabsynth

Predictive Health Analytics for Diabetic Risk Assessment and Personalized Reporting WebApp using Streamlit

deeplearning mahine-learning numpy onrender-deploy pandas seaborn streamlit

Last synced: 02 May 2026

https://github.com/mahmood-anaam/deterministic-dynamic-programming

This repository explores deterministic dynamic programming solutions for the Shortest Path Problem (SPP) and the Traveling Salesperson Problem (TSP). It includes graphical representations, a detailed Jupyter notebook, and practical implementations for optimizing travel routes and minimizing costs in various city-based scenarios.

dynamic-programming graph-theory jupyter-notebook numpy python shortest-path-problem traveling-salesperson-problem

Last synced: 09 May 2026

https://github.com/m0saan/python-for-data-analysis

Materials and IPython notebooks for "Python for Data Analysis" by Wes McKinney,

data-analysis data-science ipython-notebook machine-learning matplotlib numpy pandas python

Last synced: 02 May 2026

https://github.com/parthds02/customer-segmentation-with-kmeans-clustering

Analyze customer behavior using Python and KMeans Clustering on transactional data. Features RFM analysis, data cleaning, clustering insights, and actionable visualizations to support business decision-making.

data-analysis data-visualization feature-engineering kmeans-clustering numpy pandas vscode

Last synced: 11 May 2026

https://github.com/ksharma67/intel-stock-predication-wiith-eda

We are trying to design a model that can predict the price of stock using different methods and algorithms.

eda linear-regression machine-learning-algorithms matplotlib numpy pandas prediction python scaler seaborn skit-learn

Last synced: 02 May 2026

https://github.com/baranovserv/h-s-diagram

Парсер данных построения H-S диаграммы для веществ

diagrams json matplotlib numpy parser requests

Last synced: 02 May 2026

https://github.com/nas-research/knowledge-model

Our knowledge system systematically ingests, processes, and indexes open-access life science publications. It supports internal research by providing precise question-answering and efficient retrieval from a continuously updated repository of scientific literature

accelerate aws boto3 dataingestion keras lifesciences llama llama3 llm numpy pymupdf pytorch researchsupport sqlalchemy tensorflow textextraction

Last synced: 10 Apr 2026

https://github.com/dhruvv1402/x-twitter-bot-python

A Python-powered Twitter bot for content generation, analytics, and automation using Twitter API v2.

huggingface-transformers matplotlib nltk numpy pandas pillow python requests schedule textblob threading tweepy twitter-api wordcloud

Last synced: 09 May 2026

https://github.com/ifte-13/digit-classification-with-mnist

MNIST Digit Classification: Comparative Analysis of Optimization Algorithms

matplotlib mnist-dataset numpy tensorflow

Last synced: 02 May 2026

https://github.com/subhrajit77/ai-ml

My personal AI/ML projects repository

jupyter numpy pandas python

Last synced: 09 Mar 2026

https://github.com/mihirh19/uber-analysis

Uber's 2016 dataset analysis offers insightful information about the company's operations and user behaviour. Patterns and trends can be discovered by looking at variables like trip time, distance travelled, and pickup/drop-off locations. Uber may use this information to detect high-demand locations, increase overall efficiency, and optimise driver

matplotlib numpy opendatasets pandas seaborn

Last synced: 11 May 2026

https://github.com/hadjuse/spotify-recommendation-knn

Here is a personal project where i use KNN project to classify songs

data-science graphics interpretation knn-classification machine-learning matplotlib music numpy plotly spotify

Last synced: 03 May 2026

https://github.com/badranalyst/movie-correlation-analysis-in-python

This project analyzes movie data correlations using Python libraries like Pandas, NumPy, Seaborn, and Matplotlib. It examines relationships between attributes such as ratings, genres, and box office performance to uncover trends that inform recommendations and enhance understanding of movie success factors.

data data-analysis dataset jupyter jupyter-notebook matplotlib matplotlib-pyplot numpy pandas python seaborn

Last synced: 03 May 2026

https://github.com/vuanhtuan1012/data-scientist-with-python

Notes on career track "Data Scientist with Python" at DataCamp

importing-data matlab numpy pandas python3 sqlalchemy

Last synced: 09 Apr 2026

https://github.com/krumyakimov/house-price-developments-in-bulgaria

I developed this project for my final exam in the Data Science course at SoftUni.

data-science jupyter-notebook matplotlib numpy pandas scipy seaborn

Last synced: 03 May 2026

https://github.com/isinghabhishek/data_analysis_with_python

Introduction to Data Analysis covering the basics of Python, Numpy, Pandas, Data Visualization, and Exploratory Data Analysis.

data-visualization exploratory-data-analysis numpy pandas python

Last synced: 03 May 2026

https://github.com/raduldev/biopython

Biopython Record Work and few extra coding excercise

bioinformatics biopython gromacs jupyter-notebook matplotlib numpy pandas python3

Last synced: 03 May 2026

https://github.com/mattkevan/data-science

Notes from learning various data science topics.

data-science matplotlib numpy python3

Last synced: 09 May 2026

https://github.com/alexbie98/bietorch

neural nets with numpy

autograd neural-nets numpy torch

Last synced: 03 May 2026

https://github.com/tonoy30/emotion-mask

A pytorch model for add face mask based on detected emotion.

emotion-detection numpy opencv-python python pytorch

Last synced: 09 May 2026

https://github.com/fandredev/ml-my-guide

my own annotations about ML/DS using pandas, matplotlib, numpy, scikit learn

anaconda matplotlib numpy pandas plotly scikit-learn seaborn

Last synced: 03 May 2026

https://github.com/layonj3000/libras-recognition

This project is a Brazilian Sign Language (LIBRAS) recognition system that utilizes computer vision and machine learning techniques.

computer-vision libras mediapipe numpy opencv python random-forest-classifier

Last synced: 03 May 2026

https://github.com/omogbolahan94/nobel-prize-winner-analysis

Data exploration, cleaning, visualization and presentation

markdown matplotlib numpy pandas plotly seaborn

Last synced: 09 May 2026

https://github.com/bhuvilol/objdetector_webcam_and_video

Object Detection from Webcam input or video input

cvzone numpy opencv python3 yolov8

Last synced: 11 May 2026

https://github.com/balajig-24/titanic_data_analysics-

Project Title: Titanic Survival Prediction Project Overview The Titanic Survival Prediction project is a classic machine learning problem that aims to predict whether a passenger survived the Titanic disaster based on various features such as age, gender, passenger class, and more. This project demonstrates my ability to clean, analyze, and model.

jupyter-notebook matplotlib numpy pandas python scikit-learn seaborn

Last synced: 09 Apr 2026

https://github.com/hansalemaos/a_pandas_ex_fastloc

Up to 25 times faster than df.loc by combining np.select and numexpr.evaluate (works with utf-8)

dfloc fast iloc loc numexpr numpy pandas python

Last synced: 03 May 2026

https://github.com/ebrahimar/ai-voice-cloner-xtts-v2

A Streamlit web app for AI-powered voice cloning using Coqui XTTS v2. Record or upload reference voices, clone speech in multiple languages, and generate natural audio outputs.

ai-voice coqui-tts deep-learning json multilingual-tts numpy pydub python speech-synthesis streamlit text-to-speech tts-model voice-cloning xtts-v2

Last synced: 03 May 2026

https://github.com/prathicashettym/machinelearningalgorithms

Contains mini projects and implementation using ML algorithms

jupyter-notebook linear-regression matplotlib numpy pandas panfas python seaborn

Last synced: 03 May 2026

https://github.com/jsnevt/decisiontreecls

Data Science - Árvores aleatórias - RandomForestClassifier

decision-tree-classifier matplotlib numpy pandas seaborn

Last synced: 03 May 2026

https://github.com/mzayles/data-preparation.py

Curso Programação em Python para Data Science | Preparando e tratando dados de uma base de vendas com Pandas e NumPy.

numpy pandas python

Last synced: 03 May 2026

https://github.com/shreenidhi7700/auto_price_prediction

We are required to use Machine Learning algorithms to predict the price of cars with the available independent variables such as Horsepower, no-of-cylinders, engine-type, city_mpg, highway_mpg etc.

decision-tree-regression ensemble knn-regression linear-regression machine-learning-algorithms matplotlib-pyplot numpy pandas python3 random-forest-regression regression seaborn xgboost-regression

Last synced: 04 May 2026

https://github.com/flexthink/matricize

A convenience library to convert between pure Python objects and their vectorized representations

data machine-learning numpy python

Last synced: 09 May 2026

https://github.com/vinicius999/icu-beds-forecast-covid-19

Predição da ocupação de leitos de UTI no Brasil devido a pandemia do COVID-19, usando Support Vector Regression (SVR)

covid-19 matplotlib numpy pandas python sklearn support-vector-regression

Last synced: 03 May 2026

https://github.com/kaustavmodak/business-aided-customer-feedback-assessment-system

A Streamlit-based sentiment analysis app that classifies customer reviews into Positive, Neutral, or Negative using a pre-trained ML mode

framework machine-learning matplotlib nlp nltk numpy pandas pickle regex scikit-learn seaborn sentiment-analysis streamlt tfidf-vectorizer

Last synced: 03 May 2026

https://github.com/pramodyasahan/binary-classifier

This repository houses the code for a machine learning model designed to predict customer churn. The model is built using Support Vector Machine (SVM) from the scikit-learn library and incorporates preprocessing, pipeline, and grid search techniques for optimal performance.

numpy pandas scikit-learn

Last synced: 03 May 2026

https://github.com/zongxr/dcic2025-riverwaterprediction

闽江,福建省最大独流入海河流,流域降水丰沛。在新型电力系统和新阶段水利高质量发展的环境下,如何缓解流域防汛压力,提升水资源利用率是水电企业高质量发展的重中之重。目前,基于传统水文学原理已实现未来9小时甲级精度的洪水预报,但存在遇见期与预见精度上仍有提升空间,因此,通过结合人工智能技术实现流域入库流量的精准预测,对保障水库安全调度、防洪及清洁能源稳定供应具有重大战略意义。

lightgbm machine-learning numpy pandas sklearn time-series xgboost

Last synced: 03 May 2026

https://github.com/khushi130404/gradient_realm

Gradient_Realm is a Python project exploring regression techniques and optimization methods like Regularization, Batch, Stochastic, and Mini-batch Gradient Descent. It uses scikit-learn and custom implementations for hands-on learning.

batch-gradient-descent mini-batch-gradient-descent numpy stochastic-gradient-descent

Last synced: 14 Jun 2026

https://github.com/muskanmi/data_analysis_python

Data analysis on students result dataset using python libraries.

boxplot countplots data-analysis numpy pandas pie-chart python3 seaborn

Last synced: 03 May 2026

https://github.com/syed-m-nofel/python-data-science-fundamentals

Python notebooks for data manipulation (Pandas/NumPy) and API workflows – from basics to practical examples.

api beginner-friendly data-analysis data-science http-requests jupyter-notebook numpy pandas pandas-dataframe python tutorial

Last synced: 03 May 2026

https://github.com/abdullah-masood-05/eye-balll-detection

This repository provides advanced eye gaze tracking using MediaPipe Face Mesh for both video files and live camera feeds.

mediapipe numpy opencv scipy

Last synced: 10 Apr 2026

https://github.com/lmizner/codecademy_product_defects

Practicing rules of probability, set theory, and distributions

cdf jupyter-notebook numpy pmf ppf python rvs scipy-stats

Last synced: 03 May 2026

https://github.com/alvii147/discretetimelib

Python library for analysis of discrete time signals and systems

numpy pandas scipy signals-and-systems sympy

Last synced: 03 May 2026

https://github.com/schukark/intervalpylib

Interval solver of systems of equations to find the workspace area of a robot

krawczyk matplotlib numpy python3 robotics solving-algorithm sympy systems-of-equations visualization

Last synced: 03 May 2026

https://github.com/feguedi/clasificador-numeros

Proyecto final de la materia de Inteligencia Artificial donde se desarrolla un software que pueda clasificar los números de la base de datos MNIST.

artificial-intelligence numpy python3

Last synced: 03 May 2026

https://github.com/nicolashaubricht/analise-dados-pib-per-capita

Este projeto tem como objetivo a análise de dados sobre as variações do PIB per capita em diversos países ao longo dos anos.

numpy pandas plotly python

Last synced: 03 May 2026

https://github.com/atchayaah/home-value-insights-kc

Data-driven project predicting King County housing prices using EDA, regression models, and ML techniques, developed as part of IBM’s Data Analysis with Python course on Coursera.

joblib matplotlib numpy pandas pickle python scikit-learn seaborn

Last synced: 03 May 2026

https://github.com/joao-inacio/numpy-essencial-para-desenvolvedores-python

O curso foi elaborado para desenvolvedores Python que desejam explorar os recursos poderosos da biblioteca NumPy. Através de aulas práticas, eu vou adquirir as habilidades necessárias para trabalhar com arrays multidimensionais, realizar cálculos científicos complexos e manipular dados com eficiência.

array array-manipulations array-methods jupyter-notebook numpy python python3

Last synced: 03 May 2026

https://github.com/sukanyadutta52/topic_modeling

What are the most pressing concerns regarding ‘Climate Change’ among tweeters according to Topic Modeling?

climate-change gensim matplotlib nltk numpy pandas pyldavis regular-expression spacy

Last synced: 09 Apr 2026

https://github.com/grachale/convolution_image

The implementation of a function that takes an image (RGB or gray one) and a filter (kernel) and applies a convolution between them.

convolution image-processing numpy python

Last synced: 03 May 2026

https://github.com/moonshallow5/food_vision_mini

A program which can detect if an image contains either a pizza, sushi or steak: Has now been deployed on Flutter :)

huggingface matplotlib numpy pytorch

Last synced: 09 May 2026

https://github.com/kingsley-ezenwaka/mean-var-std

This is a (mini) Python project, completed as part of the requirements for the Data Analysis with Python certification from freeCodeCamp.com.

numpy python

Last synced: 09 May 2026

https://github.com/venkat-0706/titanic-survival-prediction

A machine learning project predicting Titanic passenger survival using data preprocessing, feature engineering, and model optimization with Logistic Regression, Random Forest, and XGBoost.

classification-report confusion-matrix gridsearchcv matplotlib numpy onehot-encoder pandas sckit-learn seaborn train-test-split xgboost

Last synced: 09 May 2026

https://github.com/akashparley/ipl-data-analysis

The IPL Data Analysis project focuses on extracting valuable insights from IPL match data using various data analytics techniques. By analyzing historical match outcomes, player performances, team comparisons, and venue statistics, the project visualizes trends and patterns through graphs like bar charts, line graphs, and scatter plots.

analytics data-visualization excel metplotlib numpy pandas python seaborn tableau-dashboards

Last synced: 04 May 2026

https://github.com/egorumaev/2024-ab-tests-metrics

Анализ соответствия размера выборки и плановых значений метрик A/B-теста

latex monte-carlo-method numpy pandas statmodels

Last synced: 04 May 2026

https://github.com/yareva/linear-regression-predictor

Linear Regression Predictor Model

matplotlib numpy pandas python sckit-learn

Last synced: 04 May 2026

https://github.com/khalidelboray/image_processing

Notes about image processing

image-processing numpy python3

Last synced: 11 May 2026

https://github.com/aymanmomin/facebook-ego-network-analysis-for-political-outreach-optimization

Analyzing Facebook ego networks to identify influential users and communities for optimizing political advertising. Leverages network science techniques (Louvain communities, centrality metrics, threshold models) to simulate influence spread.

community-detection course-project facebook-network-analysis louvain-algorithm louvain-community-detection matplotlib network-analysis network-graphs networkx-library numpy pandas python3 pyvisgraph research-paper scipy-stats sklearn-metrics tqdm

Last synced: 04 May 2026

https://github.com/bpkaur/whats-in-a-name

Exploring dataset of first names of babies born in the US in order to uncover interesting stories

data-analysis datacamp numpy pandas python3

Last synced: 04 May 2026

https://github.com/pomxark/smartdictor_0.1_nuitka_cleer

Распознание и озвучивание голосовым движком текста с экрана.

cv2 fuzzywuzzy kivy matplotlib-pyplot nuitka numpy pil pyautogui pyqt5 pytesseract pyttsx3 scipy silero simpleaudio torch

Last synced: 04 May 2026

https://github.com/gsaini/streamlit-ui-getting-started

Streamlit is an open-source Python framework for data scientists and AI/ML engineers to deliver dynamic data apps with only a few lines of code. Build and deploy powerful data apps in minutes. Let's get started!

codecov github-actions numpy pandas python streamlit unittest

Last synced: 14 Jun 2026

https://github.com/gnikit/fileio

A template allowing to read multiple columns of tab or space delimited files. Similar functionality to numpy.loadtxt()

cpp file-io library numpy

Last synced: 04 May 2026

https://github.com/timothyjan/intro-machine-learning-polynomial-regression

We use NumPy to implement vectorized linear and polynomial regression models and compare their performance using separate training and test sets.

numpy

Last synced: 04 May 2026

https://github.com/shailikadakia/yoga-pose-accuracy-detector

Real-time yoga pose accuracy detector using MediaPipe and OpenCV. Classifies poses from webcam input with KNN and provides feedback on duration and accuracy, built with Python and scikit-learn.

computer-vision fitness health-tech knn machine-learning mediapipe numpy opencv pandas pose-estimation python real-time-detection yoga

Last synced: 09 Apr 2026

https://github.com/jakeoeding/rippy

A Python seam carving implementation

numpy python3 seam-carving

Last synced: 04 May 2026

https://github.com/jalijuhola/neurcomputing-course

Some of tasks from neurocomputing course

mypy numpy python

Last synced: 04 May 2026

https://github.com/damisparks/become_data_analyst

Are you new to Data Analysis ? Here you will find simple notebook that will help through your journey. These are personal projects I work on and still working.

data data-analysis data-visualization matplotlib numpy pandas-tutorial

Last synced: 04 May 2026

https://github.com/mchenryspagg/investigate_a_dataset

This is a data analysis project that demonstrates the student's ability to use python data analysis libraries such as pandas, numpy and pyplot in matplotlib to investigate a dataset and answer specific questions from the dataset, thus demonstrating skills in data cleaning, data wrangling, and exploratory data analysis.

data-analysis datetime descriptive-analysis descriptive-statistics exploratory-data-analysis numpy pandas pyplot python visualization

Last synced: 04 May 2026

https://github.com/abhivur/graduate-income-forecaster

Contributors: Abdussalam Raheem, Chiara Su, and Joseph Botros

matplotlib numpy pandas python scikit-learn seaborn

Last synced: 04 May 2026

https://github.com/arsalan692/python-telusko-playlist

A collection of beginner Python scripts and practice exercises from my early learning days, inspired by Telusko and other resources.

basic-programming bs4 matplotlib numpy oop pandas python requests selenium tkinter

Last synced: 04 May 2026

https://github.com/nahiyanhkhan/data-processing-and-visualization

Loan Data Processing using Python's numpy and pandas libraries. For data visualization, matplotlib and seaborn are used.

data-analysis-python data-visualization matplotlib numpy pandas seaborn

Last synced: 09 May 2026

https://github.com/cheese23456/ai-based_stock_analysis_and_portfolio_optimisation

📈 Analyze stocks and optimize portfolios with AI-driven insights, technical indicators, and clear investment signals for informed decision-making.

ai asset-allocation financial-analysis huggingface-spaces numpy pandas portfolio-optimisation python scipy-optimize streamlit

Last synced: 04 May 2026

https://github.com/glbter/data-analysis-py

data-analysis labs. 50/50 made with Oleh Tonkoshkur

numpy pandas python3 scipy-stats

Last synced: 12 Jun 2026

https://github.com/chandkund/loan-eligibility-prediction

This project is designed to predict the eligibility of loan applicants based on various factors such as income, credit history, and marital status. By analyzing historical loan application data, the model helps to determine whether a loan application should be approved or not.

data-analysis data-science data-visualization machine-learning-algorithms matplotlib numpy pandas python seaborn

Last synced: 09 Apr 2026

https://github.com/jonad/titanic_exploratory_analysis

Exploratory Data Analysis on Titanic Dataset

jupyter-notebook numpy pandas python3

Last synced: 04 May 2026

https://github.com/charlenry/machine_learning_estimer_prix_voiture

Projet de Machine Learning pour estimer le prix d'une voiture d'occasion via une application

joblib jupyter-notebook matplotlib-pyplot numpy oop-python pandas plotly-express python sklearn-linear-model sklearn-linear-regression

Last synced: 04 May 2026

https://github.com/florasteve/ml-foundations-day1

Day-1 ML foundations focused on linear algebra: vectors, dot products, norms, angles, projections, and basic matrix operations—implemented in a Jupyter notebook with NumPy/Matplotlib, clear 2D visuals, a self-quiz, and a brief reflection. Emphasizes how these math primitives map to core ML/AI ideas (similarity, least squares, geometric transforms).

ai data-science education jupyter-notebook linear-algebra machine-learning matplotlib numpy python3 terminal-mac tutorial visualization vscode

Last synced: 04 May 2026

https://github.com/sugha22/data_analysis

Actively upskilling in data science with hands-on learning during a Green Internship focused on environmental sustainability. Used Pandas and NumPy for data preprocessing and cleaning, and created visual dashboards in Excel and Tableau. Gained experience in interpreting sustainability metrics and communicating insights through data storytelling and

data-cleaning data-collection data-storytelling data-transformation excel numpy pandas preprocessing sustainability-metrics tableau-dashboards visualization

Last synced: 04 May 2026

https://github.com/sivasurya681/financial-expense-tracker

The Financial Expense Tracker is a Python-based app built with Flet, Pandas, NumPy, and Matplotlib.pyplot for seamless expense management. It enables real-time tracking, categorization, and data visualization through bar and pie charts. Users can generate summaries, export/import data, and make informed financial decisions easily.

calculator-application differntial expense-tracker flet matplotlib-pyplot numpy pandas python salary-calculator total

Last synced: 09 May 2026