An open API service indexing awesome lists of open source software.

NumPy

NumPy is an open source library for the Python programming language, adding support for large, multidimensional arrays, and matrices, along with a large collection of high-level mathematical functions to operate on these arrays.

https://github.com/skypse/santander-coders-data_science-course

Curso de Data Science, proposto pelo Satander, utilizando Python!

jupyter-notebook numpy pandas-python python scikit-learn

Last synced: 29 Apr 2026

https://github.com/alz-11/ftoc

Fahrenheit to Celsius conversion analysis, comparing accurate and approximate methods with visualizations and error analysis.

approximation celsius conversion data-visualization error-analysis fahrenheit fahrenheit-to-celsius jupyter jupyter-notebook matplotlib numpy python python-3 python3

Last synced: 08 May 2026

https://github.com/moustafamohamed01/data-analysis-ecommerce-purchases

Data Analysis : Analyzing (Ecommerce Purchases) dataset from Kaggle

jupiter-notebook numpy pandas python

Last synced: 08 May 2026

https://github.com/humankernel/hands-on-ml

Practice of Machine Learning

machine-learning matplotlib numpy pandas sklearn

Last synced: 08 Jun 2026

https://github.com/tnleite/real-estate-opportunities-analysis

Este repositório apresenta uma análise de oportunidades no mercado imobiliário, combinando séries temporais, clusterização e previsões para identificar estados com maior potencial de crescimento e orientar estratégias de expansão eficientes.

catboostregressor cluster-analysis data-science kmeans-clustering lightgbm-regressor machine-learning-algorithms numpy regression-models scikit-learn xgboost-regression

Last synced: 10 May 2026

https://github.com/w13b3/ssim-py

Structural Similarity (SSIM) index, where the only core dependency is NumPy

image-comparison numpy opencv-python pillow python3 ssim tested unittests

Last synced: 29 Apr 2026

https://github.com/ujjvalrajput/data-analysis-python-practise

Practise with matplotlib, pandas, and numpy.

matplotlib numpy pandas python

Last synced: 29 Apr 2026

https://github.com/thaisgarcia/analise-exploratoria

Respondendo 10 perguntas de negócio a partir de um dataset sobre uma rede de varejo que comercializa diversos produtos em diversas cidades dos EUA.

matplotlib numpy pandas

Last synced: 29 Apr 2026

https://github.com/leandro-bertoluzzi/parchis-probabilities

Small weekend project to play around with numpy and help me taking decissions while playing parchís

matplotlib numpy python

Last synced: 08 May 2026

https://github.com/chirindaopensource/high_dimensional_matrix_variate_diffusion_index_models

End-to-end Python implementation of Ma et al.'s (2025) matrix-variate diffusion index models for macroeconomic forecasting. Features α-PCA factor extraction, supervised screening, and ILS estimation for high-dimensional forecasting with preserved structural information.

diffusion-index dimension-reduction econometrics factor-models financial-modeling high-dimensional-statistics macroeconomic-forecasting matrix-factorization monte-carlo-simulation numpy principal-component-analysis python quantitative-finance research-replication scientific-computing statistical-computing statistical-modeling supervised-learning time-series-analysis time-series-forecasting

Last synced: 29 Apr 2026

https://github.com/haleema33/cs50-final-project

This project allows the user to input a sequence of characters (a-z), and generates a unique audio file composed of piano sounds corresponding to those characters. The system reads sound files from a predefined directory, processes them, and outputs a new WAV file with the audio sequence. The audio is speeded up 3x for an accelerated playback.

numpy python3 wave

Last synced: 29 Apr 2026

https://github.com/ovsidee/psm

Fundamentals of Mathematical Simulations using Python

numpy python

Last synced: 29 Apr 2026

https://github.com/pnguyen215/pycelize

Pycelize is a Flask application designed for processing Excel and CSV files. It provides RESTful APIs for common data operations including extraction, normalization, mapping, SQL generation, and file binding.

excel lib numpy openpyxl pandas py-app pycelize python-service python3

Last synced: 17 Jun 2026

https://github.com/prithviraj-2003/cognifyz-data-science-internship

🎓 Data Science Internship at Cognifyz Technologies 📅 Duration: 2 Months 🧠 Worked on real-world restaurant data 🗂️ Completed structured tasks across 3 levels 📌 Tasks focused on EDA, data preprocessing, visualization, and analysis 📎 Task descriptions provided in an attached PDF

data-analysis data-science data-visualization matplotlib numpy pandas python3

Last synced: 29 Apr 2026

https://github.com/abhishek010397/tigerprediction-machinelearning

Machine Learning Prediction Model Using Multiple Regression to predict Tiger Population in India

keras-tensorflow matplotlib-pyplot multiple-linear-regression numpy pandas python3 r2 sklearn

Last synced: 29 Apr 2026

https://github.com/ameykasbe/credit-card-fraud-detection-on-imbalanced-dataset

Examined data preprocessing techniques and performance of six different predictive models in Python to credit card fraud detection problem on an imbalanced dataset. Algorithms implemented - Logistic Regression, K Nearest Neighbours, Support Vector Classification, Naïve Bayes Classifier, Decision Tree Classifier, and Random Forest Classifier.

classification machine-learning matplotlib numpy pandas python scikit-learn seaborn

Last synced: 10 Apr 2026

https://github.com/allanotieno254/pneumonia-xray-classification

This project is a Deep Learning-based Pneumonia classification system that allows medical staff to upload chest X-ray images and quickly determine whether a patient shows signs of pneumonia.

keras machine-learning numpy pandas pil pillow python streamlit streamlit-webapp

Last synced: 10 Apr 2026

https://github.com/monddavila/online-retail-data-analysis

Online Retail Exploratory Data Analysis with Python

data-analysis jupyter-notebook matplotlib numpy pandas python seaborn

Last synced: 29 Apr 2026

https://github.com/satvikpraveen/numpymasterpro

A hands-on, production-ready toolkit to master NumPy — from first principles to real-world applications. Includes modular Jupyter notebooks, reusable utility scripts, cheatsheets, and advanced projects like K-Means clustering from scratch.

broadcasting data-analysis data-science data-source data-visualization jupyter-notebook kmeans-clustering linear-algebra machine-learning matrix-algebra numerical-computation numpy numpy-broadcasting numpy-examples numpy-tutorial open-source python scientific-computing standardization vectorization

Last synced: 08 May 2026

https://github.com/saurabh274/aerofit-statistics-and-probability

The market research team at AeroFit wants to identify the characteristics of the target audience for each type of treadmill offered by the company, to provide a better recommendation of the treadmills to the new customers. The team decides to investigate whether there are differences across the product with respect to customer characteristics.

matplotlib-pyplot numpy pandas probability python seaborn statistics

Last synced: 08 May 2026

https://github.com/gvatsal60/ds-on-kaggle

A collection of data science projects, experiments, and insights from Kaggle competitions and datasets

data data-science data-visualization numpy pandas python3

Last synced: 29 Apr 2026

https://github.com/monarch1108/customerinsights-kmeans

understanding customers using KMeans and RFM(recency, frequency & monetary) analysis

data-analysis data-visualization kmeans-clustering machine-learning matplotlib numpy pandas scikit-learn

Last synced: 11 May 2026

https://github.com/shwetapardhi/assignment-04-simple-linear-regression-1

Q1) Delivery_time -> Predict delivery time using sorting time. Build a simple linear regression model by performing EDA and do necessary transformations and select the best model using R or Python. EDA and Data Visualization, Feature Engineering, Correlation Analysis, Model Building, Model Testing and Model Predictions using simple linear regressi

correlation-analysis data-visualization distplot eda feature-engineering model-building model-prediction model-testing numpy ols-regression p-value pandas python regression-plot rsquare-values seaborn simple-linear-regression smf statsmodel t-score

Last synced: 30 Apr 2026

https://github.com/akashash01/decision_tree

A very specific type of probability tree that enables you to make a decision on certain process(chance event outcomes, resources cost, utility).

data-science decision-tree-classifier machine-learning numpy pandas-dataframe python sklearn

Last synced: 11 May 2026

https://github.com/shaulamarquez/data-job-market

This is a Data Job Market analysis

analysis matplotlib numpy python seaborn

Last synced: 09 May 2026

https://github.com/dina-hosny/import-preprocess-and-visualize-a-dataset-project

A simple project to practice importing a dataset, data cleaning and preparation processes, and visualize the results to answer some given questions.

data-cleaning data-engineering data-science data-visualization jupyter-notebook matplotlib numpy pandas python

Last synced: 30 Apr 2026

https://github.com/nikhilsree5/walmartcasestudy

Analysis of Customer Spending Habits at Walmart Inc

clt eda numpy pandas python3 statistics visualization

Last synced: 30 Apr 2026

https://github.com/tinaland101/credit-risk-classification

The purpose of this project is to build a credit risk classification model using machine learning techniques. This model helps identify the creditworthiness of borrowers based on historical lending data. Specifically, it uses a logistic regression model to predict whether a loan is healthy (0) or high-risk (1).

numpy pandas pathlib scikit-learn

Last synced: 30 Apr 2026

https://github.com/scholarchen20/pythonlp

Python保姆级教学附有对应的demo项目(成绩管理系统),适合小白打基础。此外还包含Numpy,Tensorflow,Pytorch示例代码提供学习使用上手

image-classification image-processing numpy pands python pytorch system tensorflow

Last synced: 30 Apr 2026

https://github.com/alirezasaharkhiz9/python-for-dataanalysis

The educational repository for data manipulation and analysis with the libraries numpy, pandas, and matplotlib.

matplotlib numpy pandas

Last synced: 30 Apr 2026

https://github.com/fikri-rouzan/data-science-mathematics

Proyek implementasi perhitungan matematika untuk klasifikasi probabilistik Naive Bayes serta eksperimen statistik A/B testing pada sistem rekomendasi dan pengembangan perangkat lunak.

jupyter-notebook matplotlib numpy pandas python scipy seaborn

Last synced: 08 Jun 2026

https://github.com/redinfinitypro/python-task-manager

Rating: (7/10): The Python Task Manager, is a user-friendly interface using tkinter library for system monitoring and management. It displays detailed system process information using psutil, with interactive features.

matplotlib numpy psutil pygetwindow tkinter

Last synced: 30 Apr 2026

https://github.com/yashbhutwala/pong-ai

Deep Q-Learning Networks vs. Policy Gradient Learning in OpenAI Gym's Pong Environment

deep-q-learning numpy openai-gym policy-gradient pong python tensorflow

Last synced: 30 Apr 2026

https://github.com/favalos/myway-to-nn

Project to store my understanding of Neural Networks.

jupiter-notebook neural-network numpy

Last synced: 30 Apr 2026

https://github.com/johassel/feed_forward_neural_network

Fully-connected feed-forward neural network based on the couse "Python for Machine Learning" at TU Berlin WiSe 24/25

machine-learning neural-networks numpy python

Last synced: 30 Apr 2026

https://github.com/darinochka/course_python_iad

ДЗ\Семинары майнора "Интеллектуальный анализ данных"

machine-learning numpy pandas python3

Last synced: 30 Apr 2026

https://github.com/abhiram-epuru/ai-powered-movie-recommender

A Movie Recommendation System using the MovieLens dataset, leveraging collaborative filtering techniques such as KNN (k-Nearest Neighbors) and Singular Value Decomposition (SVD). The project demonstrates a personalized recommendation model capable of suggesting movies based on user preferences and interaction data.

matplotlib numpy pandas python sckiit-learn seaborn

Last synced: 30 Apr 2026

https://github.com/beyzabasarir/spotify-youtube-eda

Exploratory Data Analysis of global songs' performance metrics on Spotify and YouTube with Python, using Jupyter Notebook.

eda exploratory-data-analysis jupyter-notebook matplotlib numpy pandas python seaborn

Last synced: 30 Apr 2026

https://github.com/johnrobertsantos/exploratory-data-analysis-using-python

This project represents the culmination of my Python learning journey.

jupyter matplotlib numpy pandas python seaborn

Last synced: 30 Apr 2026

https://github.com/badranalyst/e-commerce-customer-analysis-data-science-foundations-case-study

This case study explores e-commerce customer data through data exploration, pre-processing, and splitting. It includes model building and training to analyze customer behavior. Python libraries like Pandas, NumPy, Matplotlib, and Seaborn are used for the analysis and model development.

data-analysis data-science dataset eda exploratory-data-analysis machine-learning matplotlib ml model-building model-training numpy pandas pre-processing python seaborn

Last synced: 01 May 2026

https://github.com/bjpcjp/optimize-mktg-spend-with-lagrange

simple demo of marketing (2 channel) spend optimization

lagrange matplotlib matplotlib-pyplot numpy python scipy

Last synced: 01 May 2026

https://github.com/vsrc/project-cardionaut

AI predictor for Heart Attack likelihood

numpy pandas pickle python streamlit

Last synced: 01 May 2026

https://github.com/splithor1zon/edu-barriermethodlp

Education material for learning/teaching of Linear Progrmming using Barrier Method. Presentation with commentary and python code available.

barrier-method convex-optimization cvxpy educational-project linear-programming numpy presentation python

Last synced: 01 May 2026

https://github.com/chaganti-reddy/heart_disease_prediction

Machine Model creation for Heart Disease Prediction

data-science deep-learning machine-learning numpy python

Last synced: 11 May 2026

https://github.com/ismaelvr1999/bioinspiredalgorithms

This repository contains implementations of various bioinspired algorithms for optimization and machine learning.

machine-learning matplotlib numpy python tensorflow

Last synced: 01 May 2026

https://github.com/mihirh19/uber-analysis

Uber's 2016 dataset analysis offers insightful information about the company's operations and user behaviour. Patterns and trends can be discovered by looking at variables like trip time, distance travelled, and pickup/drop-off locations. Uber may use this information to detect high-demand locations, increase overall efficiency, and optimise driver

matplotlib numpy opendatasets pandas seaborn

Last synced: 11 May 2026

https://github.com/abdoomohamedd/python-data-analysis-projects

A collection of data analysis projects that I have worked on. Each project involves cleaning data, performing exploratory data analysis (EDA), and creating visualizations to extract insights. The projects utilize various Python libraries, including pandas, numpy, matp

data-analysis data-analysis-python data-cleaning data-visualization matplotlib matplotlib-pyplot numpy pandas python

Last synced: 01 May 2026

https://github.com/satyamtripathi8/tools_for_data_science

Introduction to Data Science Tools(Python)

data-analysis-python matplotlib-pyplot numpy pandas

Last synced: 01 May 2026

https://github.com/dhruvv1402/spam-detection-python-

This project is a Spam Detection System built using Python. It classifies SMS messages as spam or ham (not spam) using machine learning techniques.

countvectorizer kaggle-dataset nlp-machine-learning nltk numpy pandas python scikit-learn supervised-machine-learning tf-idf

Last synced: 01 May 2026

https://github.com/jupfu/beck-view-digitalize

Digitalize 16mm films with Python & reactivex & FT232H

cython ft232h libpng multiprocessing numpy opencv python3 reactivex um232h

Last synced: 09 May 2026

https://github.com/v-goncharenko/freespace

Implementation of signal propagation in free space

numpy python signal-processing

Last synced: 02 May 2026

https://github.com/lipskiyp/trading_dashboard

Pandas framework to analyze financial time-series and calculate various performance metrics (e.g. Sharpe Ratio and Max Drawdown Duration).

financial-analysis numpy pandas

Last synced: 02 May 2026

https://github.com/khalidelboray/image_processing

Notes about image processing

image-processing numpy python3

Last synced: 11 May 2026

https://github.com/smirnovlad/data-science-notebooks

A collection of various data analysis approaches

data-science deep-learning kaggle machine-learning numpy pandas pytorch

Last synced: 10 Apr 2026

https://github.com/bharath-tars/streamlit_diabsynth

Predictive Health Analytics for Diabetic Risk Assessment and Personalized Reporting WebApp using Streamlit

deeplearning mahine-learning numpy onrender-deploy pandas seaborn streamlit

Last synced: 02 May 2026

https://github.com/m0saan/python-for-data-analysis

Materials and IPython notebooks for "Python for Data Analysis" by Wes McKinney,

data-analysis data-science ipython-notebook machine-learning matplotlib numpy pandas python

Last synced: 02 May 2026

https://github.com/dhruvv1402/x-twitter-bot-python

A Python-powered Twitter bot for content generation, analytics, and automation using Twitter API v2.

huggingface-transformers matplotlib nltk numpy pandas pillow python requests schedule textblob threading tweepy twitter-api wordcloud

Last synced: 09 May 2026

https://github.com/dineshdhamodharan24/amazon-reviews-sentiment-analysis

This is a sentiment analysis project that classifies Amazon product reviews as positive or negative using machine learning techniques.

matplotlib numpy pandas python scikit-learn

Last synced: 10 Apr 2026

https://github.com/aliyahscoding/tiny-recommender

Tiny item-item recommender in Python using cosine similarity over a toy user–item matrix. Simple CLI to print top-N recommendations.

collaborative-filtering cosine-similarity numpy pandas python recommender-system

Last synced: 09 May 2026

https://github.com/isinghabhishek/data_analysis_with_python

Introduction to Data Analysis covering the basics of Python, Numpy, Pandas, Data Visualization, and Exploratory Data Analysis.

data-visualization exploratory-data-analysis numpy pandas python

Last synced: 03 May 2026

https://github.com/kisaa-fatima/foreground-segmentation-using-kmeans-face-recognization-using-knn

Implement a basic version of the interactive image cut-out/segmentation approach called Lazy Snapping. The program uses K-Means Clustering to segment images into foreground and background based on user-provided seed pixels.

face-recognition foreground-segmentation kmeans-clustering knn-classification matplotlib numpy opencv2 pil

Last synced: 03 May 2026

https://github.com/layonj3000/libras-recognition

This project is a Brazilian Sign Language (LIBRAS) recognition system that utilizes computer vision and machine learning techniques.

computer-vision libras mediapipe numpy opencv python random-forest-classifier

Last synced: 03 May 2026

https://github.com/farhannaeem00/data-visualization-practice

Learning matplotlib fundamentals - seeking feedback and code reviews

jupyter matplotlib numpy pandas python

Last synced: 03 May 2026

https://github.com/diogoribeiro7/vortex2d

2D vortex particle method (Gaussian-core) with RK4, viscosity, plots, animations

cfd matplotlib numpy python simulation vortex-method

Last synced: 12 May 2026

https://github.com/itssahilwhat/AI-Fundamentals

A curated collection of fundamental AI concepts, algorithms, and code implementations — including Machine Learning, Deep Learning, and Computer Vision — built from scratch and with practical examples.

computer-vision deep-learning machine-learning numpy pandas python pytorch scikit-learn

Last synced: 02 Jul 2025

https://github.com/hafaio/hilbert-bytes

A python library for converting between d-dimensional points and indices on a hilbert curve

hilbert-curve numba numpy python

Last synced: 12 Feb 2026

https://github.com/zongxr/dcic2025-riverwaterprediction

闽江,福建省最大独流入海河流,流域降水丰沛。在新型电力系统和新阶段水利高质量发展的环境下,如何缓解流域防汛压力,提升水资源利用率是水电企业高质量发展的重中之重。目前,基于传统水文学原理已实现未来9小时甲级精度的洪水预报,但存在遇见期与预见精度上仍有提升空间,因此,通过结合人工智能技术实现流域入库流量的精准预测,对保障水库安全调度、防洪及清洁能源稳定供应具有重大战略意义。

lightgbm machine-learning numpy pandas sklearn time-series xgboost

Last synced: 03 May 2026

https://github.com/pthmhatre/smart-attendance-system-using-face-recognition

This project aims to create a smart attendance system using face recognition technology. The system captures and recognizes faces in real-time and marks attendance automatically, providing a seamless and efficient way to manage attendance records.

adafruit computervision css face-recognition iot iot-application javascript mysql numpy opencv php pickle python raspeberry-pi rfid

Last synced: 12 Apr 2026

https://github.com/lmizner/codecademy_product_defects

Practicing rules of probability, set theory, and distributions

cdf jupyter-notebook numpy pmf ppf python rvs scipy-stats

Last synced: 03 May 2026

https://github.com/lorenzorottigni/ml-titanic

Machine Learning python bootcamp: logistic regression on titanic dataset

ipynb logistic-regression machine-learning numpy pandas python scikit-learn seaborn

Last synced: 12 Apr 2026

https://github.com/ralolooafanxyaiml/neural-evolution-engine

A modular AI evolution simulation engine built with Python and TensorFlow. Uses Deep Learning to predict organism adaptations against environmental threats.

artificial-intelligence deep-learning evolution-simulation keras machine-learning neural-networks numpy oop pandas python scikit-learn simulation tensorflow

Last synced: 07 Jan 2026

https://github.com/saopicc/ddfacet-kernels

Modular implementation of the DDFacet Gridding Kernels

numpy radio-astronomy

Last synced: 26 Jan 2026

https://github.com/amirkh777/data-science-yandex-practicum

My projects from the Yandex Practicum Data Science course.

data-science matplotlib numpy pandas python scikit-learn scipy seaborn

Last synced: 12 Apr 2026