An open API service indexing awesome lists of open source software.

NumPy

NumPy is an open source library for the Python programming language, adding support for large, multidimensional arrays, and matrices, along with a large collection of high-level mathematical functions to operate on these arrays.

https://github.com/offthetab/ml_homework

Практические работы по анализу данных МИРЭА 3 курс.

ml numpy pandas python shap sklearn xgboost

Last synced: 02 Apr 2026

https://github.com/prashver/end-to-end-model-deployment-on-aws

Student Performance Analysis with Machine Learning analyzes factors impacting student outcomes using a robust machine learning pipeline. Achieving an impressive R2 score, it predicts student performance effectively. With extensive data preprocessing and deployment on AWS Elastic Beanstalk, it ensures scalability and high availability.

amazon-web-services aws-elastic-beanstalk end-to-end-deployment flask machine-learning-algorithms matplotlib numpy pandas scikit-learn seaborn

Last synced: 02 Apr 2026

https://github.com/christiemj/clock-drift-simulation

A Monte Carlo simulation in Python to model vacuum-induced frequency drifts in atomic clocks, with code and visuals (experimental setup, simulated vs. real-world comparison, and drift graph), as described in "Probing Vacuum-Induced Clock Drifts via Quantum Metrology: A Testable Hypothesis" (DOI: 10.5281/zenodo.15163879). Licensed under GPL 3.0.

allan-variance atomic-clocks cesium-ion clock-drift matplotlib monte-carlo-simulation numpy open-science pandas physics precision-metrology python quantum-metrology research-software scipy simulation vacuum-fluctuations vacuum-noise

Last synced: 17 Apr 2026

https://github.com/hansalemaos/npfastsortcpp

Sorts int/float 4x faster than numpy by using parallel_sort from C++ (Windows only)

cpp fast numpy python sort

Last synced: 17 Apr 2026

https://github.com/prahaladhchandrahasan/object-localization

this is a blue colour detector in keras and tensorflow

computer-vision keras numpy pkl tensorflow

Last synced: 17 Apr 2026

https://github.com/smartpottech/smartpot-datagenerator

SmartPot-DataGenerator simulates environmental and plant data (temperature, humidity, luminosity, pH, TDS, soil moisture) for hydroponic garden management. This data is used for testing, analysis, and visualization in the SmartPot web portal, helping optimize plant care without actual hardware sensors.

data-model matplotlib numpy python simulated

Last synced: 17 Apr 2026

https://github.com/gregoritsch3/project_python_jobs_data_analysis

A multi-library Data Analysis Python project that draws useful insights from existing job data. It provides useful exploratory insight about job postings in the Data sector, such as the distribution of remote/non-remote jobs, jobs with/without healthcare, top paying and most demanded data roles, and more.

matplotlib numpy pandas python scipy seaborn

Last synced: 04 Apr 2026

https://github.com/mwoss/mownit

Various tasks from numerical analysis laboratories solved using Python

jupyter-notebook mathematics matplotlib numerical-analysis numpy python scipy

Last synced: 17 Apr 2026

https://github.com/jinhx128/data_analysis

基于Matplotlib+Numpy+Pandas搭建的数据分析项目。

matpolotlib numpy pandas python

Last synced: 18 Apr 2026

https://github.com/compcode1/nfl-analysis

The objective of this analysis was to explore potential correlations between player attributes (such as average age and weight across different positions) and key team performance metrics from the 2023 NFL season.

datacleaning datanormalization numpy pandas pearson-correlation-coefficient pvalues

Last synced: 18 Apr 2026

https://github.com/yassin522/music-genre-classification

This repository is dedicated to the Music Genre Classification project, an endeavor that leverages machine learning techniques to accurately classify music into various genres

catboost matplotlib numpy pandas python sklearn

Last synced: 04 Apr 2026

https://github.com/awanraskall/retail-demand-analysis

Data analysis of retail meal orders, fulfillment centers, and product demand using Python

data-analysis data-visualization jupyter-notebook numpy pandas python

Last synced: 18 Apr 2026

https://github.com/bvsam/nnfs

Basic neural network built using Python and Numpy. Created to better understand neural networks.

artificial-intelligence deep-learning machine-learning neural-network neural-networks numpy python3

Last synced: 18 Apr 2026

https://github.com/kasraskari/python-for-data-science

Foundational concepts and libraries for data science with Python.

data-science jupyter-notebook matplotlib numpy pandas python scipy

Last synced: 04 Apr 2026

https://github.com/alainlebret/python-et-ia-1

Ressources personnelles du cours "Python & IA" en 2e année GPSE à l'ENSICAEN

artificial-intelligence image-processing machine-learning matplotlib numpy python scikit-image scikit-learn

Last synced: 04 Apr 2026

https://github.com/yasinefeee/parkspotter_pretest_enviroment

The ParkSpotter project is designed to detect the occupancy status of parking spots in a simulation environment. Using a toy model, a camera system, and a machine learning model, this system identifies whether a parking space is EMPTY or NOT EMPTY in real-time.

ai-systems classifier computer-vision cv2 numpy opencv parking-spot-detection parking-spots python sckit-image sckit-learn simulation-environment svm svm-classifier

Last synced: 18 Apr 2026

https://github.com/sultanazhari/decisive-model-with-high-profit-potential-and-low-risk

OilyGiant mining company finding the best place for 200 new well points, As an Data Scientist we're creating a model who can choose the best 200 point by profit and risk.

accuracy-score chi2-contingency confusion-matrix logistic-regression matplotlib-pyplot numpy pandas python3 random-forest roc-auc-score seaborn train-test-using-sklearn

Last synced: 18 Apr 2026

https://github.com/ricardorobledo/spamemailclassification

Spam email classification using machine learning (Random Forest, SVC, Logistic Regression, etc.) with data balancing techniques (SMOTE, BorderlineSMOTE, ADASYN). Final calibrated Random Forest model achieves ROC-AUC 0.982 and PR-AUC 0.979 on the Spam Email Classification dataset.

imbalanced-data imbalanced-learning numpy pandas python3 sklearn

Last synced: 05 Apr 2026

https://github.com/murugavl/flower-prediction

Flower Prediction is a machine learning project that uses the Iris dataset to classify iris flowers into three species: Setosa, Versicolor, and Virginica. The project includes data analysis, model training with various algorithms, and deployment via a Flask web application for user-friendly predictions.

flask machine-learning matplotlib numpy pandas python scikit-learn seaborn

Last synced: 05 Apr 2026

https://github.com/jai0212/ai-virtual-painter

Real time finger and object painting using advanced computer vision technology with custom hand gestures and customizable colours

artificial-intelligence cmake computer-vision cpp mediapipe numpy opencv python

Last synced: 05 Apr 2026

https://github.com/perpendicooler/elementary-research-for-steamboat-willie-s-store-in-poland

An elementary research for a company to opening store in a city using gurobi and pulp optimization.

christofides-algorithm gurobipy numpy pandas pulp python3 scikit-learn travelling-salesman-problem

Last synced: 05 Apr 2026

https://github.com/paulo-santos-ds/visao_computacional

O sistema utiliza inteligência artificial para analisar documentos e características físicas dos visitantes, garantindo que apenas maiores de 18 anos possam acessar as áreas de jogos

keras numpy pandas plotly python restnet50 sklearn tensorflow

Last synced: 05 Apr 2026

https://github.com/faraazarsath/guvi-task-4

This repository contains Python scripts for assessing and categorizing student performance data from two CSV files. The tasks include categorizing students based on their CodeKata scores.

data-visualization matplotlib numpy pandas

Last synced: 18 Apr 2026

https://github.com/rahuls-1106/dataspark

DataSpark is a powerful analytics project transforming raw retail data into actionable insights for Global Electronics. By leveraging Python, SQL, and interactive visualizations, it uncovers trends in customer behavior, sales performance, and product popularity, driving smarter business decisions and boosting growth.

data-visualization jupyter-notebook matplotlib numpy pandas-dataframe powerbi python seaborn sql

Last synced: 18 Apr 2026

https://github.com/md-emranhossen/deep-learning-hub

Welcome to Deep Learning Playground, a personal repository for exploring and experimenting with deep learning concepts, architectures, and techniques. This repo serves as a hands-on space for implementing and understanding deep learning models, from fundamental neural networks to advanced architectures.

cnn-model deep-learning deeplearning gan generative-adversarial-network neaural-network numpy pandas pthon tensorflow

Last synced: 18 Apr 2026

https://github.com/pedro-stanaka/dlnd-projects

Projects from Deep Learning Nanodegree from Udacity

artificial-neural-networks deep-learning machine-learning numpy python3

Last synced: 18 Apr 2026

https://github.com/rhazra-003/modelinglab

Assignment - Solution of Modeling Lab of 7th Semester (Batch of 2023) of NIT Durgapur

jupyter-notebook matplotlib-pyplot numpy pandas python3 shapely

Last synced: 19 Apr 2026

https://github.com/codetun/wie-act-challenge-ai

Our AI model, integrated with the Defend Her extension, uses advanced natural language processing (NLP) to predict and identify offensive or harmful language in real-time 🧠💬🚫. By continuously analyzing content, it ensures accurate detection of bad words, enhancing the extension's ability to protect users and promote a safer online environment 🌐

colab csv jupiter-notebook kaggle nltk numpy pandas

Last synced: 19 Apr 2026

https://github.com/simonusher/multi-layer-perceptron

My solution to an assignment on multi-layer perceptron. Implemented in python and numpy. Contains the forward and backward pass code, as well as some utility functions.

multi-layer-perceptron neural-network numpy python

Last synced: 19 Apr 2026

https://github.com/crazy-dot/zomato-data-analysis

This project analyzes 50k Bengaluru restaurants from Zomato, focusing on 17 features like location and ratings. It cleans, explores, and visualizes data to improve services. Key visualizations include delivery, booking, location, and cost. The goal is to provide insights for better customer experiences.

data-cleaning-and-preprocessing data-manipulation-with-pandas inferential-statistics kaggle-dataset numpy pandas-python python zomato-data-analysis

Last synced: 19 Apr 2026

https://github.com/emerald-geomodelling/downfile

A safe & version independent pickle alternative that integrates setuptool entry points for extensions

feather json numpy pandas python serialization

Last synced: 19 Apr 2026

https://github.com/renukadhule/airbnb_listings_and_reviews_python_analysis

Explore Airbnb listings and reviews from over 250,000 properties across 10 major cities. Dive into data on listing details, host information, pricing, and customer reviews for better insights

matplotlib-pyplot numpy pandas python seaborn

Last synced: 19 Apr 2026

https://github.com/paulo-santos-ds/previsao_preco_carros

O objetivo é realizar predições em um aplicativo de vendas de carros. A qualidade e velocidade de predições é crucial para o negócio. O projeto utiliza dados históricos, especificações técnicas, versões de acabamento e preços.

lightgbm math matplotlib numpy pandas python time

Last synced: 20 Apr 2026

https://github.com/aminnj/numpy.jl

Simple numpy snippets implemented in julia

joke julia maybenot numpy python

Last synced: 20 Apr 2026

https://github.com/marknature/codealpha-internship_python-programming

Python Programming Internship at CodeAlpha from 1st June 2025 to 30th June 2025

numpy pandas python3

Last synced: 20 Apr 2026

https://github.com/gandholisarat/mlp-backprop-pmat403

This repository contains a Jupyter Notebook developed for my course, demonstrating symbolic and numerical computation, visualization, and problem-solving in Python using libraries such as SymPy, NumPy, SciPy, and Matplotlib.

jupyter jypyternotebook matplotlib numpy

Last synced: 20 Apr 2026

https://github.com/abyss-s/data-analysis-project

[컴퓨팅사고와 데이터분석 기초] 최종 프로젝트 (2024-1)

matplotlib numpy pandas python sckiit-learn

Last synced: 20 Apr 2026

https://github.com/danpoynor/pet-shelter-data-analysis-notebook

Demonstration of skills analyzing data from a pet shelter. The CSV data contains tables detailing the incoming and outgoing animals and I use my knowledge of Pandas to gather and present the requested information.

csv data-analysis data-cleaning data-science jupyter-notebook matplotlib numpy pandas pet-shelter tabular-data

Last synced: 21 Apr 2026

https://github.com/jbayardo/aa-tp2

Q-Learning for Four in a row

matplotlib numpy python q-learning

Last synced: 24 Apr 2026

https://github.com/redinfinitypro/pygamecicruit

Rating: (6/10) The code combines a Pygame GUI application with a PySpice circuit simulation, providing a user-friendly interface with buttons, menu options, dropdown menus, input boxes, sliders, and text elements.

matplotlib numpy os pygame pyspice random sys

Last synced: 24 Apr 2026

https://github.com/takana671/texturegenerator

Procedually generate texture images from noise.

cloud cubemap cv2 cython noise noise-textures numpy python skybox

Last synced: 25 Apr 2026

https://github.com/ydon96/versions-unveiled-the-evolution-of-nsa

Detection of Alzheimer's disease through the implementation of improved NSA algorithms

nsa numpy pandas python3 sklearn

Last synced: 27 Apr 2026

https://github.com/crispengari/netflix-statistics

This is the visualisation of two categories of netflix movies which are: * movies * tv shows

data-science datafram jupyter-notebook matplotlib numpy pandas python

Last synced: 27 Apr 2026

https://github.com/filipporanza/signal_lib

personal programs for signal analysis

numpy python3 signal-analysis signal-processing

Last synced: 28 Apr 2026

https://github.com/varun-khorgade/salesense-e-commerce-analytics

Analyzed e-commerce datasets to discover customer behavior trends and business insights. Performed EDA, data cleaning, and visualizations to help stakeholders make data-driven decisions.

customer-behavior-analysis dashboards datacleaning exploratory-data-analysis feature-engineering jupyter-notebook matplotlib numpy pandas python seaborn sql time-series-trend visulaization

Last synced: 28 Apr 2026

https://github.com/udipta14/historical-olympic-games-eda-python

Exploratory Data Analysis of a Historical Olympic Games Dataset, including all the games from Athens 1896 to Rio 2016.

data-cleaning data-visualization eda matplotlib numpy pandas python3 seaborn

Last synced: 28 Apr 2026

https://github.com/ddeepanshu-997/datascience--olympics-games

in this repository i am going to perform some datascience techinques on the datasaet in order to get the valuable insights with the help of python libraries , packages etc.

data-science exploratory-data-analysis insights matplotlib-pyplot numpy pandas-dataframe problem-solving python visualization

Last synced: 28 Apr 2026

https://github.com/redinfinitypro/drawimages

Rating: (8/10) The code uses Python's Turtle module to create a canvas image, converting it to grayscale, and then placing dots at each pixel's location based on its intensity.

numpy pil random turtle

Last synced: 28 Apr 2026

https://github.com/fimbres/numpy-start

A crash project with the most important numpy functions and methods, used in data science, AI and machine learning.

data-science numpy python python-script

Last synced: 29 Apr 2026

https://github.com/iv4n-ga6l/genderdetection

Gender detection using gender classification model

genderclassification genderdetection numpy pil python resnet18 torch torchvision

Last synced: 29 Apr 2026

https://github.com/rand-asswad/controle_optimal

Une étude d'un problème de contrôle optimal avec visualisation 3D

controle-optimal numpy optimal-control python sympy

Last synced: 29 Apr 2026

https://github.com/manishkumarpatel07/facial-recognition-attendance-manager

An AI-powered face recognition attendance system using Python, OpenCV, and dlib for real-time attendance tracking.

dlib-face-recognition numpy opencv pandas python

Last synced: 29 Apr 2026

https://github.com/bilgeswe/datascience

Using Statistics and Data Science Project in Python using Google Colab tools, CVS and XLSX

box-plot colab-notebook cvs data-science data-scraping data-visualization heatmaps numpy python statistical-analysis statistics xlsx

Last synced: 29 Apr 2026

https://github.com/dulajkavinda/document-similarity-ml

📑 Check similarity between documents using td-idf and cosine similarity.

machinelearning numpy python scikit

Last synced: 29 Apr 2026

https://github.com/ujjvalrajput/data-analysis-python-practise

Practise with matplotlib, pandas, and numpy.

matplotlib numpy pandas python

Last synced: 29 Apr 2026

https://github.com/thaisgarcia/analise-exploratoria

Respondendo 10 perguntas de negócio a partir de um dataset sobre uma rede de varejo que comercializa diversos produtos em diversas cidades dos EUA.

matplotlib numpy pandas

Last synced: 29 Apr 2026

https://github.com/giordanodaloisio/demv

Debiaser for Multiple Variables, a model- and data- agnostic method to improve fairness in binary and multi-class classification tasks

bias-mitigation numpy pandas python

Last synced: 29 Apr 2026

https://github.com/unknownman/101-numpy

learning numpy by 101 short videos and jupyter notebook

numpy

Last synced: 29 Apr 2026

https://github.com/ovsidee/psm

Fundamentals of Mathematical Simulations using Python

numpy python

Last synced: 29 Apr 2026

https://github.com/amulya20010418/zomato_data_analysis

Analyze Zomato restaurant data all the world and find the insights by using Python libaries and also visualize the dataset by using Power-bi

cleaning-data-in-python dashboard numpy pand powerbi python visualization

Last synced: 29 Apr 2026

https://github.com/al-ghaly/e-commerce-a-b-testing

A Statistical Analysis project in which I Performed an A/B test to analyze the effect of changing the user interface for an E-Commerce company's Website.

data-analysis matplotlib numpy pandas python python-data-analysis seaborn statistical-analysis statistics

Last synced: 29 Apr 2026

https://github.com/matheusvazdata/retail-sales-forecast-linreg-sklearn

Minimal project for retail sales forecasting using linear regression (scikit-learn).

forecasting linear-regression machine-learning matplotlib numpy pandas scikit-learn

Last synced: 29 Apr 2026

https://github.com/chandkund/predicting-student-grades

This project predicts student grades (G1, G2, G3) based on various features such as demographic information, study habits, family background, and other academic factors. The aim is to develop a predictive model that can assist in identifying students who may need additional support.

classification-algorithm ensemble-learning exploratory-data-analysis machine-learning matplotlib numpy pandas regression-models seaborn

Last synced: 29 Apr 2026

https://github.com/armahdavi/ai_ml_assisted_breast_cancer_tumor_detection

Consolidating tutorial codes for breast cancer tumor detection, covering ML fundamentals like classification, feature engineering, training, evaluation, and key performance metrics.

bias-variance feature-engineering logistic-regression machine-learning machine-learning-algorithms medical-application numpy pandas python random-forest recall-precision sklearn xgboost-classifier

Last synced: 29 Apr 2026

https://github.com/divansh2025/python_notebook

Welcome to "Basics of Python", a repository designed to help you master the fundamentals of Python. 🚀

datavisualization matplotlib numpy pandas python seaborn

Last synced: 29 Apr 2026

https://github.com/prithviraj-2003/cognifyz-data-science-internship

🎓 Data Science Internship at Cognifyz Technologies 📅 Duration: 2 Months 🧠 Worked on real-world restaurant data 🗂️ Completed structured tasks across 3 levels 📌 Tasks focused on EDA, data preprocessing, visualization, and analysis 📎 Task descriptions provided in an attached PDF

data-analysis data-science data-visualization matplotlib numpy pandas python3

Last synced: 29 Apr 2026

https://github.com/sakshijain009/minor-project-ml-in-python

Exploratory Data analysis on data using python

matplotlib-pyplot ml numpy pandas python seaborn

Last synced: 29 Apr 2026

https://github.com/yimethan/basics-of-data-analysis

2023-2 Basics of Data Analysis

data-analysis numpy pandas python

Last synced: 29 Apr 2026

https://github.com/dmarks84/ind_project_readme-generator

Independent (personal) project in which I automatically generate README files for each of my repositories from my coursework

dataframes etl numpy pandas programming python

Last synced: 29 Apr 2026

https://github.com/alfastrek/textinsight-pro

Webscraper with Natural Language Processing

beautifulsoup jupyter-notebook nltk numpy pandas python webscraping

Last synced: 29 Apr 2026

https://github.com/atchayaah/vois_aicte_oct2025_atchayaa

Comprehensive data science analysis leveraging NYC Airbnb market data to derive actionable insights on optimal pricing models, key performance indicators (KPIs), and factors influencing host trust and guest experience optimization.

jupyter-notebook matplotlib numpy pandas python seaborn

Last synced: 29 Apr 2026

https://github.com/laipching/sprint6_module1

Exploratory Data Analysis with Python (Pandas/Matplotlib/Seaborn). Business questions, metrics and clear visualizations.

data-visualization eda matplotlib numpy pandas python seaborn

Last synced: 29 Apr 2026

https://github.com/monddavila/online-retail-data-analysis

Online Retail Exploratory Data Analysis with Python

data-analysis jupyter-notebook matplotlib numpy pandas python seaborn

Last synced: 29 Apr 2026

https://github.com/muhammadusman-khan/e-commerce-store-eda

Exploratory Data Analysis on E-commerce store data to uncover insights about sales trends, customer behavior, and product performance using Python libraries like Pandas, NumPy, and Matplotlib/Seaborn.

data-analysis data-science data-visualization e-commerce eda exploratory-data-analysis jupyter-notebook matplotlib numpy pandas python seaborn

Last synced: 29 Apr 2026

https://github.com/fantaso/data-analysis-and-manipulation-with-pandas

Project consists to analyse a publicly available movie dataset found in https://www.kaggle.com/beyjin/movies-1990-to-2017 and use Python tools like Pandas in order to get some initial insights about the dataset and finally proceeding to clean, transform and save a new version of the dataset in a better structure thinking about storing the data in a database.

matplotlib numpy pandas python3 seaborn

Last synced: 29 Apr 2026

https://github.com/shwetapardhi/assignment-04-simple-linear-regression-1

Q1) Delivery_time -> Predict delivery time using sorting time. Build a simple linear regression model by performing EDA and do necessary transformations and select the best model using R or Python. EDA and Data Visualization, Feature Engineering, Correlation Analysis, Model Building, Model Testing and Model Predictions using simple linear regressi

correlation-analysis data-visualization distplot eda feature-engineering model-building model-prediction model-testing numpy ols-regression p-value pandas python regression-plot rsquare-values seaborn simple-linear-regression smf statsmodel t-score

Last synced: 30 Apr 2026

https://github.com/amirali5/numpy

In this repo, all about NumPy which basically a Python Library used for multidimensional array. It also has functions for working in domain of linear algebra, fourier transform, and matrices. NumPy was created in 2005 by Travis Oliphant. It is an open source project and you can use it freely.

numpy

Last synced: 30 Apr 2026

https://github.com/victor-gp/tfg-h16b

The code for my bachelor's thesis, a numerical exploration of the second part of Hilbert's 16th problem

bachelor-thesis dynamical-systems matplotlib-pyplot numpy odeint

Last synced: 30 Apr 2026

https://github.com/samiksha29-patil/hr-employee-data-analysis-visualization-in-python

This project focuses on analyzing an HR Employee Dataset that contains details about employees such as demographics, job status, salaries, performance reviews, satisfaction levels, and attrition reasons.

csv-files data data-visualization dataanalysis matplotlib numpy pandas python seaborn

Last synced: 30 Apr 2026

https://github.com/sjain2580/simple-linear-regression-model

This project demonstrates a simple, yet robust, multiple linear regression model built with Python and scikit-learn to predict median house values in California.

joblib linear-regression matplotlib matplotlib-pyplot numpy python scikit-learn

Last synced: 30 Apr 2026

https://github.com/omarsaad21/it-salary-eda

A python EDA project implemented on IT department salaries data we made data exploration and made data visulization for some questions on dataset

data explotary-data-analysis juypter-notebook numpy pandas python visualization

Last synced: 30 Apr 2026

https://github.com/gill-singh-a/radial-velocity-of-star

A Program that approximates the number of exoplanets and their data (Mass, Radius of Revolution and Time Period of Revolution) present in a star system given the Radial Velocity of Star with noise

astronomical-data-analysis astronomy exoplanets fourier-transform matplotlib numpy physics python python-3 science stars

Last synced: 30 Apr 2026

https://github.com/maguids/supervised-learning---video-games

This project consists on exploratory data analysis and the application of supervised learning models for classification using a Video Games dataset. Second Semester of the First Year of the Bachelor's Degree in Artificial Intelligence and Data Science.

jupyter-notebook machine-learning matplotlib numpy pandas scikit-learn seaborn supervised-learning

Last synced: 30 Apr 2026