An open API service indexing awesome lists of open source software.

NumPy

NumPy is an open source library for the Python programming language that adds support for large, multidimensional arrays and matrices, along with a large collection of high-level mathematical functions to operate on these arrays.

https://github.com/bestmahdi2/uni__dataminningstackoverflowproject

A university project for a data mining course, analyzing Stack Overflow website data with Python

cart csv data-mining logistic-regression matplotlib mlp naive-bayes nltk numpy pandas python scikit-learn scipy seaborn stackoverflow svc textblob tqdm xgboost

Last synced: 16 Feb 2026

https://github.com/yc1909/ai_impact_on_jobs

Analysis of the impact of artificial intelligence on jobs and the workplace

analytics exploratory-data-analysis matplotlib numpy pandas seaborn

Last synced: 06 May 2026

https://github.com/darshan12345678910/air-quality-prediction-ml-algorithms

AirPredict: AQI prediction using a classification algorithm

classification-algorithims gradio-interface numpy pandas

Last synced: 06 May 2026

https://github.com/rishraks/handsign-ai

HandSign AI is a gesture recognition project using Mediapipe to detect custom hand signs like fist, palm, and V-sign. It automates tasks such as starting/stopping video recording and capturing screenshots, showcasing intuitive, touch-free interactions for various applications like smart devices and accessibility.

mediapipe ml numpy opencv python

Last synced: 16 Apr 2026

https://github.com/chaymabghozzi/somnolence

Drowsiness detection

dlib numpy opencv pygame python

Last synced: 13 Apr 2026

https://github.com/swimshahriar/machine-learning

This repository contains the machine learning models that I worked on while learning machine learning.

numpy pandas python3 sklearn

Last synced: 16 Apr 2026

https://github.com/wrighang/shipping-data-analysis

Independent Project: Transit time trends analysis following a major shipping process change.

data-analysis matplotlib numpy pandas python

Last synced: 18 Apr 2026

https://github.com/abhishek010397/logisticregression

Using Logistic Regression Classifier to Predict Target Using Three Features

csv iloc logistic-regression machine-learning numpy pandas-dataframe python3 sigmoid-function sklearn

Last synced: 06 May 2026

https://github.com/sorna-fast/iris-classifier-pca

An interactive web application for Iris flower classification using Random Forest and PCA visualization, built with Streamlit. Features real-time predictions, interactive model training, and beautiful data visualizations.

matplotlib matplotlib-pyplot matplotlib-python numpy pandas python seaborn sklearn streamlit

Last synced: 16 Apr 2026

https://github.com/peterfication/num-peek

A CLI tool to peek into *.npy files

cli numpy python rust

Last synced: 20 Jan 2026

https://github.com/dataspieler12345/dash-by-plotly-lib-project

The Dash web application framework is used to create browser-based interactive data visualization interfaces with Python.

bootstrap dash numpy pandas plot python

Last synced: 18 Apr 2026

https://github.com/udacity-machinelearning-internship/finding_donors

First-stage project in Udacity's 'Intro to Machine Learning with TensorFlow' program, using scikit-learn in Python

csv machine-learning matplotlib numpy pandas python sckiit-learn seaborn sklearn udacity udacity-nanodegree

Last synced: 16 Apr 2026

https://github.com/mihirh19/analyzing-crimes-in-la

This GitHub repository contains a comprehensive analysis of crime data in Los Angeles. Leveraging a variety of data sources, tools, and techniques, we have explored patterns, trends, and insights related to criminal activities within the city. Our analysis includes data visualization of crime dynamics across different neighborhoods and over time.

dataset machine-learning matplotlib numpy pandas seaborn

Last synced: 07 May 2026

https://github.com/texnoforge/texnomagic

TexnoMagic library for digital Magic

gmm magic numpy python recognition scikit-learn scipy

Last synced: 03 Mar 2026

https://github.com/sudipbishwakarma/bulk-remove-onesignal-users

Bulk remove OneSignal subscribed users blazingly fast with parallel processing.

numpy pandas python python-logger python-multiprocessing python-threading python3

Last synced: 07 May 2026

https://github.com/brandoncardoso/mnk

An MNK game library.

game mnk numpy python tictactoe

Last synced: 16 Apr 2026

https://github.com/mohamedawnallah/covid19-dataanalysis-python

How to preprocess and merge datasets to calculate the needed measures and prepare them for analysis. In this project, we work with the COVID-19 dataset published by Johns Hopkins University, which contains the cumulative number of confirmed cases, per day, in each country. We also have another dataset consisting of various life factors, scored by the people living in each country around the globe. We merge these two datasets to see whether there is any relationship between the spread of the virus in a country and how happy the people living in that country are.

covid19 dataanalysis datavisualization happiness-report jupyternotebook numpy pandas python

Last synced: 08 May 2026

https://github.com/octcarp-courses/SUSTech_CS303-AI_F23_Works

(49.8/50) Course code of CS303 2023 Fall: Artificial Intelligence, SUSTech. Taught by Prof. Ke TANG.

numpy pytorch sustech

Last synced: 24 Oct 2025

https://github.com/jesussantana/ibm-python-for-data-science-py0101en

This introduction to Python will kickstart your learning of Python for data science, as well as programming in general.

data-science numpy pandas python

Last synced: 08 May 2026

https://github.com/aroojzahra908/artifical-intelligence-and-machine-learning-

Explore the basics of linear regression, gradient descent, and AI using Python. Get hands-on with NumPy, pandas, Matplotlib, and scikit-learn for practical learning.

artificial-intelligence gradient-descent linear-regression machine-learning matplotlib numpy pandas scikit-learn-python

Last synced: 08 May 2026

https://github.com/akashash01/logistic-regresion

In contrast to a linear regression model, this model is used to examine the association between independent variables (categorical or continuous) and a binary dependent variable, which is a discrete outcome.

auc-roc-curve classification confusion-matrix data-science kolmogorov-smirnov machine-learning nonlinear-regression numpy pandas python supervised-learning vif woe-and-iv

Last synced: 08 May 2026

https://github.com/drewstone/katz-centrality

Katz centrality of a wheel graph

centrality graphs networks numpy

Last synced: 16 Apr 2026

https://github.com/qua77i20/ai2

A simple single-layer neural network (perceptron) built in Python. Demonstrates backpropagation training from scratch for beginners.

ai backpropagation data-science deep-learning machine-learning neural-network numpy perceptron python

Last synced: 15 May 2026

https://github.com/607011/1dollar

Simulation of a counterintuitive distribution problem

animation go golang matplotlib numpy plot python3 simulation statistics

Last synced: 09 May 2026

https://github.com/idow09/ud810-problem-sets

Solutions to the problem sets of the Introduction to Computer Vision course at Udacity

computer-vision numpy opencv opencv-python problem-sets python solutions udacity

Last synced: 09 May 2026

https://github.com/ramamagarwal/mathematical-simulation-projects

This repository comprises my math modelling-based projects covering topics of Linear Algebra, Probability and Statistics.

matplotlib numpy python sympy

Last synced: 16 Apr 2026

https://github.com/khaymanii/medical_insurance_cost_prediction-_model

This model was built using Python and a linear regression algorithm

matplotlib numpy pandas python sckit-learn seaborn

Last synced: 09 May 2026

https://github.com/yannbouyeron/genopy

Genetic and phylogenetic analyses for SVT (life and earth sciences) classes

biopython genetique lycee numpy pandas phylogenie svt

Last synced: 04 Mar 2026

https://github.com/dina-hosny/explore-us-bike-share-data-project

Explore US Bike Share Data project - FWD Data Analysis Professional Track. In this project, I used Python to explore data related to bike share systems for three major cities in the United States and answer questions about it by computing descriptive statistics.

data-analysis data-science numpy pandas python

Last synced: 09 May 2026

https://github.com/datarohit/tps-january-2022--kaggle-competition

Tabular Playground Series - Jan 2022. This competition is posted on Kaggle.com for practicing EDA and regression skills.

kaggle-competition matplotlib numpy pandas regression-algorithms seaborn

Last synced: 09 May 2026

https://github.com/dipeshgoyal013/ipl_win_probability

A project that helps you check the win probability of the batting team in the second innings

machine-learning matplotlib numpy pandas python sklearn

Last synced: 05 Feb 2026

https://github.com/dimatura/imgutils

Miscellaneous Numpy and PIL utilities for images.

image numpy pil pillow python

Last synced: 19 Jan 2026

https://github.com/MITHILESHK11/Steganography-App-Aicte-Internship-Cybersecurity-Edunet-Foundation

Secure Image Steganography App: This Streamlit-based Steganography App lets you securely encrypt and decrypt messages in images using the Least Significant Bit (LSB) technique. Messages remain invisible unless unlocked with the correct password.

hashlib numpy opencv3 python streamlit-webapp

Last synced: 24 Oct 2025

https://github.com/gandhasiri-rahul-mohan/association-rules-q1-books

Prepare rules for all the data sets: 1) Try different values of support and confidence, and observe the change in the number of rules for different support and confidence values. 2) Change the minimum length in the apriori algorithm. 3) Visualize the obtained rules using different plots.

apriori-algorithm association-rules data-science machine-learning numpy pandas python unsupervised-machine-learning

Last synced: 04 May 2026

https://github.com/leabrodyheine/water-pump-status-prediction

This project implements machine learning models to predict the status of water pumps in Tanzania using data from DrivenData's competition. The project includes preprocessing steps, model evaluation using cross-validation, and hyperparameter optimization with Optuna.

argparse cross-validation gradient-boosting-classifier logistic-regression machine-learning multilayer-perceptron numpy optuna pandas random-forest-classifier scikit-learn

Last synced: 11 Apr 2026

https://github.com/ola-kaznowska/ai-digit-recognition

AI digit recognition. Artificial neural network for digit recognition

ai artificial-intelligence machine-learning neural-network numpy tensorflow

Last synced: 14 Apr 2026

https://github.com/datarohit/churn-prediction-for-bank

This dataset contains various information about a bank's customers. The problem is that many customers tend to leave the bank, so using the given features we have to predict whether or not a customer will leave the bank.

churn-prediction classification matplotlib-pyplot numpy pandas seaborn sklearn

Last synced: 17 Apr 2026

https://github.com/prakhar-002/pandas-numpy

This repo covers NumPy & Pandas basics! ⚡ NumPy 🧮 for arrays & math operations, 🐼 Pandas 📊 for data handling with DataFrames & Series. 🚀 Includes practical examples 📝 for efficient data processing! 🔥

basics-of-python numpy pandas pandas-dataframe python-library python3

Last synced: 11 May 2026

https://github.com/mikelane/neuralnetwork

A neural network implemented in python.

machine-learning matplotlib neural-network numpy python python3

Last synced: 17 Apr 2026

https://github.com/sun-data/ndfilters

Numba-accelerated, n-dimensional image filters.

filters image-processing n-dimensional numba numpy scientific-computing scipy

Last synced: 06 Mar 2026

https://github.com/duoan/machine-learning-notebook

A repository of notebooks for tracking machine learning study.

data-analysis decision-tree ensemble-model gbdt machine-learning numpy pandas xgboost

Last synced: 18 Jun 2026

https://github.com/dudynets/numerical-methods-labs

Labs for Numerical Methods course at Ivan Franko Lviv National University.

angular docker fastapi interpolation labs linear-algebra material-ui matplotlib ngxs numpy python typescript university

Last synced: 02 Apr 2026

https://github.com/muhammadadilnaeem/bcg-data-science-job-simulation-on-forage-august-2024

This repository contains all the tasks, code, and documentation completed during the BCG Data Science job simulation on The Forage platform. The simulation focused on analyzing customer churn, building predictive models, and presenting insights for a major utility company.

bcg customer-churn-prediction-with-machine-learning data data-science forage numpy pandas

Last synced: 01 May 2026

https://github.com/shreeparab1890/unicorns-of-india-till-sep-2022-analysis-eda

This IPython notebook is an exploratory data analysis (EDA) of the unicorns of India through Sep 2022.

analysis data-analysis eda exploratory-data-analysis matplotlib-pyplot numpy pandas plotly

Last synced: 02 May 2026

https://github.com/parham1998/linear_regression_pytorch

Implementation of a simple linear regression model with the NumPy and PyTorch libraries

backpropagation gradient-descent linear-regression loss-functions numpy python pytorch

Last synced: 02 May 2026

https://github.com/krook1024-uni-archive/kmeans

kmeans implementation for a university course

kmeans kmeans-clustering machine-learning numpy python

Last synced: 02 May 2026

https://github.com/vermenea/masters---ad-signaling-pathway-using-hsp90-inhibitor

Influence of 17-AAG, an Hsp90 inhibitor, on the signaling pathway in atopic dermatitis

matplotlib numpy pandas python seaborn statmodels

Last synced: 02 May 2026

https://github.com/helenaden/data-science-fundamentals

This project delves into fundamental data science concepts using Python libraries like NumPy and Pandas

data-analysis datascience datasets datavisualization datawrangling heatmap numpy pandas patterns python

Last synced: 03 May 2026

https://github.com/raduldev/biopython

Biopython record work and a few extra coding exercises

bioinformatics biopython gromacs jupyter-notebook matplotlib numpy pandas python3

Last synced: 03 May 2026

https://github.com/layonj3000/libras-recognition

This project is a Brazilian Sign Language (LIBRAS) recognition system that utilizes computer vision and machine learning techniques.

computer-vision libras mediapipe numpy opencv python random-forest-classifier

Last synced: 03 May 2026

https://github.com/hansalemaos/a_pandas_ex_fastloc

Up to 25 times faster than df.loc by combining np.select and numexpr.evaluate (works with utf-8)

dfloc fast iloc loc numexpr numpy pandas python

Last synced: 03 May 2026

https://github.com/prathicashettym/machinelearningalgorithms

Contains mini projects and implementation using ML algorithms

jupyter-notebook linear-regression matplotlib numpy pandas panfas python seaborn

Last synced: 03 May 2026

https://github.com/mzayles/data-preparation.py

Python Programming for Data Science course | Preparing and cleaning data from a sales database with Pandas and NumPy.

numpy pandas python

Last synced: 03 May 2026

https://github.com/kaustavmodak/business-aided-customer-feedback-assessment-system

A Streamlit-based sentiment analysis app that classifies customer reviews into Positive, Neutral, or Negative using a pre-trained ML model

framework machine-learning matplotlib nlp nltk numpy pandas pickle regex scikit-learn seaborn sentiment-analysis streamlt tfidf-vectorizer

Last synced: 03 May 2026

https://github.com/jonad/finding_donors

Predicting income with UCI Census Income Dataset using supervised machine learning algorithms

numpy pandas scikit-learn scikitlearn-machine-learning

Last synced: 03 May 2026

https://github.com/syed-m-nofel/python-data-science-fundamentals

Python notebooks for data manipulation (Pandas/NumPy) and API workflows – from basics to practical examples.

api beginner-friendly data-analysis data-science http-requests jupyter-notebook numpy pandas pandas-dataframe python tutorial

Last synced: 03 May 2026

https://github.com/asayda01/airport_operations_analytics

The project analyzes the performance of Hold Baggage Screening X-ray Machines, identifying trends in system load, bottlenecks, and operator efficiency. Using an interactive Streamlit app, users can explore data-driven insights to optimize airport screening processes and improve overall operational performance.

data-science dataanalysis datainsights datavisualization numpy pandas python streamlit

Last synced: 03 May 2026

https://github.com/hansalemaos/np_zoom_in

Zoom in on a picture

image numpy python zoom

Last synced: 03 May 2026

https://github.com/nicolashaubricht/analise-dados-pib-per-capita

This project aims to analyze data on variations in GDP per capita across various countries over the years.

numpy pandas plotly python

Last synced: 03 May 2026

https://github.com/egorumaev/2024-ab-tests-metrics

Analysis of the consistency between sample size and planned metric values in an A/B test

latex monte-carlo-method numpy pandas statmodels

Last synced: 04 May 2026

https://github.com/aymanmomin/facebook-ego-network-analysis-for-political-outreach-optimization

Analyzing Facebook ego networks to identify influential users and communities for optimizing political advertising. Leverages network science techniques (Louvain communities, centrality metrics, threshold models) to simulate influence spread.

community-detection course-project facebook-network-analysis louvain-algorithm louvain-community-detection matplotlib network-analysis network-graphs networkx-library numpy pandas python3 pyvisgraph research-paper scipy-stats sklearn-metrics tqdm

Last synced: 04 May 2026

https://github.com/bpkaur/whats-in-a-name

Exploring a dataset of first names of babies born in the US to uncover interesting stories

data-analysis datacamp numpy pandas python3

Last synced: 04 May 2026

https://github.com/ola-kaznowska/median-statistic

A program that calculates the median statistic. Program written in Python 3.13.1 with NumPy library during Machine Learning course.

mathematics numpy python python3

Last synced: 04 May 2026

https://github.com/miozilla/fescarefine

fescarefine :skier::mount_fuji::guide_dog: : Refine & Test ML Models # Features Scaling # Normalization # Datasets

gradient-descent linear-regression matplotlib model-testing mse norm numpy ols overfit pandas scatter seaborn sklearn standardization statsmodels underfit

Last synced: 04 May 2026

https://github.com/florianwoelki/salary_linear_regression

This project is for salary prediction using linear regression without any machine learning library.

jupyter linear-regression machine-learning matplotlib numpy pandas python regression

Last synced: 04 May 2026

https://github.com/cheese23456/ai-based_stock_analysis_and_portfolio_optimisation

📈 Analyze stocks and optimize portfolios with AI-driven insights, technical indicators, and clear investment signals for informed decision-making.

ai asset-allocation financial-analysis huggingface-spaces numpy pandas portfolio-optimisation python scipy-optimize streamlit

Last synced: 04 May 2026

https://github.com/sweta-kaundilya/python_for_data_analysis

Learning Python and the relevant Python libraries for the data field.

cufflinks data-analysis data-science matplotlib numpy pandas plotly python seaborn

Last synced: 04 May 2026

https://github.com/charlenry/machine_learning_estimer_prix_voiture

Machine learning project to estimate the price of a used car via an application

joblib jupyter-notebook matplotlib-pyplot numpy oop-python pandas plotly-express python sklearn-linear-model sklearn-linear-regression

Last synced: 04 May 2026

https://github.com/hyperplasma/olympic-visualization-analysis

Multidimensional analysis and visualization of Olympic medals, economy, and happiness index.

data-analysis data-visualization matplotlib numpy pandas python wordcloud

Last synced: 04 May 2026

https://github.com/saifalibaig/crop-yield-prediction

🌾 A machine learning-based crop production prediction system using historical Indian agricultural data with advanced regression models and hyperparameter tuning.

eda feature-encoding feature-selection kaggle-dataset lasso-regression linear-regression matplotlib-pyplot numpy onehot-encoding pandas python3 ridge-regression seaborn simpleimputer xgboost-regression

Last synced: 04 May 2026

https://github.com/pedasoft-consult/product_purchase_prediction

As a newly hired AI Engineer, your task is to predict customer behavior based on various features such as age, income, and gender. This exercise involves cleaning the data, training a decision tree model, and evaluating the model's performance to understand the key factors influencing customer purchasing decisions.

machine-learning matplotlib numpy pandas python seaborn sklearn

Last synced: 04 May 2026

https://github.com/abhinav330/911-emergency-calls-analysis

This Python Notebook analyzes emergency call data from the '911.csv' dataset. It uses various data visualization techniques to explore and gain insights into the emergency call data, including the types of calls, reasons for calls, and call patterns over time.

data-analysis data-science data-visualization eda exploratory-data-analysis exploratory-data-visualizations numpy pandas python

Last synced: 09 Jun 2026

https://github.com/hanzopgp/randomkagglesolving

Trying to build deep learning models to solve some interesting Kaggle datasets.

artificial-intelligence deep-learning jupyter-notebook machine-learning matplotlib numpy personal tensorflow

Last synced: 04 May 2026

https://github.com/lorenzorottigni/ml-missing-values

Machine Learning course of Piero Savastano 5: ColumnTransformer, SimpleImputer, numpy

column-transformer machine-learning missing-values numpy simple-imputer

Last synced: 05 May 2026

https://github.com/thekartikeyamishra/object-detection-for-retailer

The AI Customer Feedback Summarizer is a Python-based tool that uses extractive summarization techniques to provide concise summaries of customer feedback.

ai computer-vision cv2 numpy pillow python torch torchvision yolov5

Last synced: 05 May 2026

https://github.com/vc-nithesh944/pandas_numpy

Practice repository for NumPy & Pandas with scripts, notebooks, and notes for future revision

numpy pandas

Last synced: 05 May 2026

https://github.com/elijahondiek/data-analysis-udacity

Udacity Data Analysis Nanodegree course resources

jupyter-notebook matplotlib numpy pandas python

Last synced: 05 May 2026

https://github.com/zafir100100/cancer-stage-prediction

This code predicts cancer data using various regression models, calculates their average R-squared scores, and prints the best model.

cross-validation data-analysis data-preprocessing decision-trees gradient-boosting linear-regression machine-learning-algorithms numpy pandas random-forest regression scikit-learn

Last synced: 05 May 2026

https://github.com/lohiyah/dataset_summariser

A Python tool for generating concise summaries of datasets using Pandas and NumPy for analysis, and Matplotlib and Seaborn for insightful visualizations. Ideal for quick exploratory data analysis (EDA).

matplotlib numpy pandas seaborn

Last synced: 05 May 2026

https://github.com/mohitsai/boston-housing-data-analysis

Data analysis project for the City of Boston government, providing insights into the effect of property renovations and remodelling on housing availability in the city

data-analysis data-science matplotlib numpy pandas python

Last synced: 05 May 2026

https://github.com/theadeelahmed/mall-customers-segmention

This project segments mall customers based on income, age, and shopping score. Using clustering techniques, it identifies key customer groups for targeted marketing campaigns. Tools used: Pandas, Matplotlib, Seaborn, and Scikit-learn

matplotlib-pyplot numpy pandas python3 seaborn sklearn-clustering

Last synced: 05 May 2026

https://github.com/aathira2001/python-basics

Basic Python concepts with examples

functions numpy pandas python variables

Last synced: 05 May 2026