An open API service indexing awesome lists of open source software.

NumPy

NumPy is an open source library for the Python programming language that adds support for large, multidimensional arrays and matrices, along with a large collection of high-level mathematical functions to operate on these arrays.

https://github.com/m-k-l-s/sudoku

A trivial sudoku solver in Python, using recursion and numpy.

backtracking numpy python recursion solver sudoku

Last synced: 27 Apr 2026
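
The recursion-plus-NumPy approach named in the sudoku entry above could look roughly like the sketch below. It is not the repository's code: the function names `can_place` and `solve` are hypothetical, and the grid is assumed to be a 9x9 integer array with 0 marking empty cells.

```python
import numpy as np

def can_place(grid, r, c, v):
    """Return True if value v fits at (r, c) without a row, column, or box clash."""
    if v in grid[r, :] or v in grid[:, c]:
        return False
    br, bc = 3 * (r // 3), 3 * (c // 3)  # top-left corner of the 3x3 box
    return v not in grid[br:br + 3, bc:bc + 3]

def solve(grid):
    """Fill the zeros of a 9x9 array in place; return True if a solution is found."""
    empties = np.argwhere(grid == 0)
    if len(empties) == 0:
        return True  # no empty cells left, so the grid is solved
    r, c = empties[0]
    for v in range(1, 10):
        if can_place(grid, r, c, v):
            grid[r, c] = v
            if solve(grid):
                return True
            grid[r, c] = 0  # backtrack: undo the guess and try the next value
    return False
```

Calling `solve(grid)` mutates `grid`, so pass a copy if the original puzzle should be kept.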

https://github.com/filipporanza/signal_lib

personal programs for signal analysis

numpy python3 signal-analysis signal-processing

Last synced: 28 Apr 2026

https://github.com/binoydipu/ml-libraries

Machine Learning Libraries Repository

machine-learning numpy

Last synced: 28 Apr 2026

https://github.com/ronverse17/automate-data-cleaning

This project automates messy data cleaning tasks - like fixing column names, filling missing values, and spotting outliers, so analysts and data scientists can spend more time on insights, not preprocessing.

numpy pandas pipeline python

Last synced: 28 Apr 2026

https://github.com/jovicdev97/financial-loan-datascience-notebook

Using NumPy and pandas to analyze a synthetic loan dataset with Python.

data-analysis matlabplot numpy pandas plotting python seaborn

Last synced: 28 Apr 2026

https://github.com/yanisdib/somint

Somint - AI-powered Trading Card Grading

ai api devcontainer docker numpy opencv python yolov8

Last synced: 28 Apr 2026

https://github.com/pedronjorge/average-face

Generate an average face from a given set of images by detecting facial landmarks, aligning features, and blending the images using Delaunay Triangulation. Libraries used: Dlib, OpenCV, NumPy

dlib image-generation jupyter-notebook numpy opencv python3

Last synced: 30 Apr 2026

https://github.com/kingsley-ezenwaka/medical-data-visualizer

A data analysis project that investigates a dataset of anonymous patients' medical information, and explores the relationship between cardiac disease, body measurements, blood markers, and lifestyle choices.

analysis data matplotlib numpy pandas seaborn

Last synced: 28 Apr 2026

https://github.com/redinfinitypro/drawimages

Rating: (8/10) The code uses Python's Turtle module to draw an image on a canvas: it converts the image to grayscale and then places a dot at each pixel's location based on its intensity.

numpy pil random turtle

Last synced: 28 Apr 2026

https://github.com/keyurparalkar/knn-for-cifar-10

K-nearest neighbour algorithm for CIFAR-10 dataset

cifar10 knn numpy

Last synced: 28 Apr 2026
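
As a rough illustration of the k-nearest-neighbour technique named in the entry above, here is a minimal NumPy sketch. It is not taken from the repository: `knn_predict` is a hypothetical name, labels are assumed to be non-negative integers, and the toy 2-D points in the usage note stand in for flattened CIFAR-10 images.

```python
import numpy as np

def knn_predict(train_x, train_y, test_x, k=3):
    """Predict a label for each test row by majority vote among its k nearest training rows."""
    # Squared Euclidean distances via broadcasting, shape (n_test, n_train)
    diffs = test_x[:, None, :] - train_x[None, :, :]
    dists = (diffs ** 2).sum(axis=2)
    # Indices of the k closest training rows for every test row
    nearest = np.argsort(dists, axis=1)[:, :k]
    votes = train_y[nearest]
    # Most frequent label among the k neighbours (ties go to the smaller label)
    return np.array([np.bincount(row).argmax() for row in votes])
```

For example, with three training points near the origin labelled 0 and three near (10, 10) labelled 1, a test point at (0.5, 0.5) is assigned label 0. The broadcasted difference array grows as n_test x n_train x n_features, so a real image dataset would need batching.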

https://github.com/thekartikeyamishra/interactive-ai-model-trainer

A Google Colab notebook for interactively training simple AI/ML classification models. Features CSV upload, dummy data generation, feature/target selection, model choice, and basic evaluation. Includes a user-friendly UI. Ideal for educational purposes.

google googlecolab ipywidgets matpl numpy pandas python scikit-learn seaborn

Last synced: 29 Apr 2026

https://github.com/bilgeswe/datascience

Statistics and data science project in Python using Google Colab tools, with CSV and XLSX data

box-plot colab-notebook cvs data-science data-scraping data-visualization heatmaps numpy python statistical-analysis statistics xlsx

Last synced: 29 Apr 2026

https://github.com/chirindaopensource/high_dimensional_matrix_variate_diffusion_index_models

End-to-end Python implementation of Ma et al.'s (2025) matrix-variate diffusion index models for macroeconomic forecasting. Features α-PCA factor extraction, supervised screening, and ILS estimation for high-dimensional forecasting with preserved structural information.

diffusion-index dimension-reduction econometrics factor-models financial-modeling high-dimensional-statistics macroeconomic-forecasting matrix-factorization monte-carlo-simulation numpy principal-component-analysis python quantitative-finance research-replication scientific-computing statistical-computing statistical-modeling supervised-learning time-series-analysis time-series-forecasting

Last synced: 29 Apr 2026

https://github.com/antonio-f/naive_convnet

Naive Convolutional Neural Network with TensorFlow 2

101 convolutional-neural-networks deep-learning easy matplotlib numpy simple tensorflow2 tutorial

Last synced: 29 Apr 2026

https://github.com/ovsidee/psm

Fundamentals of Mathematical Simulations using Python

numpy python

Last synced: 29 Apr 2026

https://github.com/mukeshthenraj/fraud-detection-model

Logistic Regression, Grid Search, and ROC-PR curve evaluation on fraud detection dataset

classification fraud-detection machine-learning numpy pandas scikit-learn

Last synced: 29 Apr 2026

https://github.com/michaelzheng67/farmsa-programming-for-finance-spring2021

Source files for the programming for finance presentation (complements the material covered)

excel jupyter-notebook matplotlib numpy pandas python sql yahoo-finance-api

Last synced: 29 Apr 2026

https://github.com/alfastrek/textinsight-pro

Webscraper with Natural Language Processing

beautifulsoup jupyter-notebook nltk numpy pandas python webscraping

Last synced: 29 Apr 2026

https://github.com/laipching/sprint6_module1

Exploratory Data Analysis with Python (Pandas/Matplotlib/Seaborn). Business questions, metrics and clear visualizations.

data-visualization eda matplotlib numpy pandas python seaborn

Last synced: 29 Apr 2026

https://github.com/fantaso/data-analysis-and-manipulation-with-pandas

This project analyses a publicly available movie dataset from https://www.kaggle.com/beyjin/movies-1990-to-2017, using Python tools such as Pandas to gain initial insights, and then cleans, transforms, and saves a new version of the dataset in a structure better suited to storage in a database.

matplotlib numpy pandas python3 seaborn

Last synced: 29 Apr 2026

https://github.com/sophiaas/transform-datasets

Datasets for modeling transformations in data.

numpy pytorch torch-geometric

Last synced: 29 Apr 2026

https://github.com/priyansusahoo/face_recognition

Major Project: Attendance system project using Face Recognition

face-recognition numpy opencv-python python3

Last synced: 30 Apr 2026

https://github.com/omarsaad21/it-salary-eda

A Python EDA project on IT department salary data, covering data exploration and data visualization to answer several questions about the dataset.

data explotary-data-analysis juypter-notebook numpy pandas python visualization

Last synced: 30 Apr 2026

https://github.com/nikhilsree5/walmartcasestudy

Analysis of Customer Spending Habits at Walmart Inc

clt eda numpy pandas python3 statistics visualization

Last synced: 30 Apr 2026

https://github.com/tinaland101/credit-risk-classification

The purpose of this project is to build a credit risk classification model using machine learning techniques. This model helps identify the creditworthiness of borrowers based on historical lending data. Specifically, it uses a logistic regression model to predict whether a loan is healthy (0) or high-risk (1).

numpy pandas pathlib scikit-learn

Last synced: 30 Apr 2026

https://github.com/fikri-rouzan/student-stress-levels-classification

A machine learning modeling project for classifying university students' stress levels based on academic and psychological input parameters.

joblib jupyter-notebook matplotlib numpy pandas python scikit-learn seaborn streamlit

Last synced: 08 Jun 2026

https://github.com/professornova/ann-scratch

This repository implements a simple Artificial Neural Network (ANN) from scratch using only NumPy.

artificial-intelligence neural-network numpy scratch

Last synced: 30 Apr 2026

https://github.com/johassel/feed_forward_neural_network

Fully-connected feed-forward neural network based on the course "Python for Machine Learning" at TU Berlin WiSe 24/25

machine-learning neural-networks numpy python

Last synced: 30 Apr 2026

https://github.com/ddeepanshu-997/datascience-e-commerce-shopping-details-

This project applies data preprocessing techniques to clean the dataset, derives insights to find the most popular shopping picks, and uses data visualization libraries to surface trends and other aspects of the data, as a small contribution to the field of data science.

cleaning-data data data-science data-visualization dataframe datapreprocessing dataset libraries matplotlib-pyplot numpy pandas plots python visualization

Last synced: 30 Apr 2026

https://github.com/lmizner/codecademy_biodiversity

Using knowledge of data analysis to interpret data about endangered animals for the National Park Service

barplot jupyter-notebook matplotlib-pyplot numpy pandas python seaborn

Last synced: 30 Apr 2026

https://github.com/johnrobertsantos/exploratory-data-analysis-using-python

This project represents the culmination of my Python learning journey.

jupyter matplotlib numpy pandas python seaborn

Last synced: 30 Apr 2026

https://github.com/vsrc/project-cardionaut

AI predictor for Heart Attack likelihood

numpy pandas pickle python streamlit

Last synced: 01 May 2026

https://github.com/zsailer/np2d

Common 2-D Numpy operations

array numpy

Last synced: 01 May 2026

https://github.com/sandip2512/play-store-app-review-analysis

EDA on Play Store apps and user sentiment reviews

eda heatmaps matplotlib numpy pandas plotly seaborn sentiment-analysis

Last synced: 01 May 2026

https://github.com/hansalemaos/a_pandas_ex_image_tools

Uses pandas/numpy/numexpr for operations on pictures - very fast

colors image locate numexpr numpy opencv pictures python

Last synced: 01 May 2026

https://github.com/paulo-santos-ds/analise_dados_musicais

Dashboard for understanding listeners' musical preferences, regional trends, and the impact of specific artists.

numpy pandas powerbi python

Last synced: 01 May 2026

https://github.com/dhruvv1402/spam-detection-python-

This project is a Spam Detection System built using Python. It classifies SMS messages as spam or ham (not spam) using machine learning techniques.

countvectorizer kaggle-dataset nlp-machine-learning nltk numpy pandas python scikit-learn supervised-machine-learning tf-idf

Last synced: 01 May 2026

https://github.com/laszlokorte/canny-notebook

Numpy Canny Edge detector implementation to visualize each step

canny-edge-detection image-processing jupyter-notebook numpy

Last synced: 02 May 2026

https://github.com/parham1998/linear_regression_pytorch

Implementation of a simple Linear Regression model with Numpy & PyTorch library

backpropagation gradient-descent linear-regression loss-functions numpy python pytorch

Last synced: 02 May 2026
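
The NumPy side of the linear regression entry above (gradient descent on a mean-squared-error loss, per its topic tags) might be sketched as follows. This is an illustrative assumption, not the repository's implementation: `fit_linear`, the learning rate, and the epoch count are all hypothetical choices.

```python
import numpy as np

def fit_linear(x, y, lr=0.1, epochs=2000):
    """Fit y ~ w * x + b by batch gradient descent on the mean squared error."""
    w, b = 0.0, 0.0
    n = x.size
    for _ in range(epochs):
        err = w * x + b - y                    # residuals of the current fit
        w -= lr * (2.0 / n) * (err * x).sum()  # dL/dw for L = mean(err ** 2)
        b -= lr * (2.0 / n) * err.sum()        # dL/db
    return w, b
```

On noise-free data generated from a known line, such as `y = 3 * x + 2` for `x` in [0, 1], the returned `w` and `b` should approach the true slope and intercept. The learning rate would need retuning for inputs on a different scale.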

https://github.com/andremenezesds/machine_learning_test

Testing among various Machine Learning models and parameters, in order to further study their behaviour for Classification, Regression and Clustering analysis.

classification-models clustering-models machine-learning numpy pandas regression-models sklearn

Last synced: 02 May 2026

https://github.com/baranovserv/h-s-diagram

Data parser for constructing H-S diagrams of substances

diagrams json matplotlib numpy parser requests

Last synced: 02 May 2026

https://github.com/badranalyst/movie-correlation-analysis-in-python

This project analyzes movie data correlations using Python libraries like Pandas, NumPy, Seaborn, and Matplotlib. It examines relationships between attributes such as ratings, genres, and box office performance to uncover trends that inform recommendations and enhance understanding of movie success factors.

data data-analysis dataset jupyter jupyter-notebook matplotlib matplotlib-pyplot numpy pandas python seaborn

Last synced: 03 May 2026

https://github.com/raduldev/biopython

Biopython record work and a few extra coding exercises

bioinformatics biopython gromacs jupyter-notebook matplotlib numpy pandas python3

Last synced: 03 May 2026

https://github.com/fandredev/ml-my-guide

My own notes on ML/DS using pandas, Matplotlib, NumPy, and scikit-learn

anaconda matplotlib numpy pandas plotly scikit-learn seaborn

Last synced: 03 May 2026

https://github.com/thiagonunesbatista/deuteranopia-simulator-computer-vision-python

Live color detection with a webcam and a simulation of deuteranopia's color perception effects.

computer-vision numpy opencv python

Last synced: 03 May 2026

https://github.com/codeslash21/tmdb_data_analysis

We analysed the TMDB dataset, which contains details of around 11,000 movies, to find some interesting facts about it.

data-analysis data-visualization matplotlib nanodegree-project numpy pandas python tmdb-movie

Last synced: 03 May 2026

https://github.com/furk4nbulut/uygulamalarla-makine-ogrenmesi-ve-derin-ogrenme-atolyesi

This repository covers the training process of the BTK Akademi Applied Machine Learning and Deep Learning Workshop held in Manisa. In the workshop, participants gain theoretical and practical knowledge of advanced machine learning and deep learning techniques.

matplotlib numpy pandas scikit-learn seaborn

Last synced: 03 May 2026

https://github.com/syed-m-nofel/python-data-science-fundamentals

Python notebooks for data manipulation (Pandas/NumPy) and API workflows – from basics to practical examples.

api beginner-friendly data-analysis data-science http-requests jupyter-notebook numpy pandas pandas-dataframe python tutorial

Last synced: 03 May 2026

https://github.com/alvii147/discretetimelib

Python library for analysis of discrete time signals and systems

numpy pandas scipy signals-and-systems sympy

Last synced: 03 May 2026

https://github.com/atchayaah/home-value-insights-kc

Data-driven project predicting King County housing prices using EDA, regression models, and ML techniques, developed as part of IBM’s Data Analysis with Python course on Coursera.

joblib matplotlib numpy pandas pickle python scikit-learn seaborn

Last synced: 03 May 2026

https://github.com/egorumaev/2024-ab-tests-metrics

Analysis of the correspondence between the sample size and the planned metric values of an A/B test

latex monte-carlo-method numpy pandas statmodels

Last synced: 04 May 2026

https://github.com/aymanmomin/facebook-ego-network-analysis-for-political-outreach-optimization

Analyzing Facebook ego networks to identify influential users and communities for optimizing political advertising. Leverages network science techniques (Louvain communities, centrality metrics, threshold models) to simulate influence spread.

community-detection course-project facebook-network-analysis louvain-algorithm louvain-community-detection matplotlib network-analysis network-graphs networkx-library numpy pandas python3 pyvisgraph research-paper scipy-stats sklearn-metrics tqdm

Last synced: 04 May 2026

https://github.com/ola-kaznowska/median-statistic

A program that calculates the median statistic. Program written in Python 3.13.1 with NumPy library during Machine Learning course.

mathematics numpy python python3

Last synced: 04 May 2026
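
For the median calculation described in the entry above, a minimal NumPy sketch is shown here. The function name `median` and the sort-based approach are assumptions made for illustration; the repository may simply call `np.median`, which returns the same value for these inputs.

```python
import numpy as np

def median(values):
    """Return the median of a non-empty sequence of numbers."""
    data = np.sort(np.asarray(values, dtype=float))
    n = data.size
    if n == 0:
        raise ValueError("median of an empty sequence is undefined")
    mid = n // 2
    if n % 2 == 1:
        return float(data[mid])                      # odd count: the middle element
    return float((data[mid - 1] + data[mid]) / 2.0)  # even count: mean of the two middle elements
```

For instance, `median([3, 1, 4, 1, 5])` sorts the values to 1, 1, 3, 4, 5 and returns the middle one, 3.0.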

https://github.com/arjunravi26/deep_learning

This repo contains code and notes for learning deep learning

ann autoencoders cnn deep-learning deepseek-r1 lstm numpy rnn-tensorflow tensorflow transformer

Last synced: 04 May 2026

https://github.com/sweta-kaundilya/python_for_data_analysis

Learning Python and the relevant Python libraries for the data field.

cufflinks data-analysis data-science matplotlib numpy pandas plotly python seaborn

Last synced: 04 May 2026

https://github.com/hyperplasma/olympic-visualization-analysis

Multidimensional analysis and visualization of Olympic medals, economy, and happiness index.

data-analysis data-visualization matplotlib numpy pandas python wordcloud

Last synced: 04 May 2026

https://github.com/silbena/gibbssampler

Gibbs Sampler for a simple mortgage model.

gibbs-sampler gibbs-sampling numpy python

Last synced: 04 May 2026

https://github.com/wwwmisla/gerador-mapas-calor

Computer vision system for generating heat maps based on movement in public spaces, supporting urban planning and efficient use of space.

color computer-vision demo google-colab gradio heatmap marchine-learning matplotlib model numpy opencv people-detection python smart-city ufrn visao-computacional vision-computer yolo yolov8

Last synced: 04 May 2026

https://github.com/shamiul5201/selfie_app_features_application

OpenCV project to build skills in image processing: features include Cartoonify and Pencil Sketch effects in a Jupyter Notebook, blemish removal in a Python script, and chroma keying for videos. Perfect for exploring creative and practical computer vision techniques.

computer-vision numpy opencv python

Last synced: 04 May 2026

https://github.com/chathumiamarasinghe/nn-training-model

A comprehensive project for training neural networks to solve real-world problems. This repository includes customizable code for building, training, and evaluating neural network architectures using popular deep learning frameworks.

jupyter-notebook matplotlib numpy phyton scikit-learn

Last synced: 04 May 2026

https://github.com/lorenzorottigni/ml-missing-values

Machine Learning course of Piero Savastano 5: ColumnTransformer, SimpleImputer, numpy

column-transformer machine-learning missing-values numpy simple-imputer

Last synced: 05 May 2026

https://github.com/vladd12/machine-learning-practice

Repository for study practice and various AI-related courses

json keras-tensorflow machine-learning numpy pandas pickle tensorflow2 torch xml

Last synced: 05 May 2026

https://github.com/nabilshadman/multiprocessing-time-series-data-simulation

A simulation using multiprocessing to generate 10,000 time dependent samples from an initial dataset of 20 samples

multiprocessing numpy pandas scipy simulation statistics time-series

Last synced: 05 May 2026

https://github.com/sowmi0304/predictive-modeling-titanic-survival-prediction

Machine learning project to predict passenger survival on the Titanic using classification algorithms, data preprocessing, and model evaluation techniques.

machine-learning matplotlib numpy pandas python seaborn

Last synced: 05 May 2026

https://github.com/13anush/python-libraries-

A collection of essential Python libraries—NumPy, Pandas, Matplotlib, and Seaborn—perfect for anyone starting out in data analysis.

data-analysis matplotlib numpy pandas python seaborn

Last synced: 05 May 2026

https://github.com/ranaessam03/climate-prediction

A machine learning project to predict rainfall using Decision Tree, k-Nearest Neighbors (kNN), and Naïve Bayes, with preprocessing, model evaluation, and custom kNN implementation.

decision-trees knn-classification machine-learning naive-bayes-classifier numpy sklearn

Last synced: 05 May 2026

https://github.com/mohitsai/boston-housing-data-analysis

Data Analysis Project for the City of Boston Government for insights into the effect of property renovations and remodelling on housing availability in the city

data-analysis data-science matplotlib numpy pandas python

Last synced: 05 May 2026

https://github.com/tapas-gope/diwali-sales-analysis

Analyze Diwali Sales Data to improve customer experience and sales.

exploratory-data-analysis matplotlib-pyplot numpy pandas python seaborn

Last synced: 05 May 2026

https://github.com/sugha22/data_analysis

Actively upskilling in data science with hands-on learning during a Green Internship focused on environmental sustainability. Used Pandas and NumPy for data preprocessing and cleaning, and created visual dashboards in Excel and Tableau. Gained experience in interpreting sustainability metrics and communicating insights through data storytelling and

data-cleaning data-collection data-storytelling data-transformation excel numpy pandas preprocessing sustainability-metrics tableau-dashboards visualization

Last synced: 04 May 2026

https://github.com/secary/maths7027

Mathematical Foundations of Data Science

latex mathematics numpy pandas

Last synced: 04 May 2026

https://github.com/0nom4d/308reedpipes

Mathematical EPITECH Project - This project computes a representation of a reedpipe.

coverage epitech epitech-project matplotlib-pyplot numpy pytest python scipy

Last synced: 04 May 2026

https://github.com/rtgrt5645/numpy-lab

🧮 Explore, manipulate, and visualize data with NumPy to enhance your Python skills in scientific computing and data analysis.

array-operations data-analysis data-science jupyter-notebook machine-learning numerical-computing numpy numpy-arrays numpy-library numpy-python python python3 scientific-computing

Last synced: 04 May 2026

https://github.com/hanzopgp/randomkagglesolving

Trying to build models to resolve some interesting Kaggle datasets thanks to deep learning.

artificial-intelligence deep-learning jupyter-notebook machine-learning matplotlib numpy personal tensorflow

Last synced: 04 May 2026

https://github.com/aqueeqazam/machine-learning-using-scikit

This repository contains all of the algorithms used to train the machine learning models using the Scikit library.

numpy scikit-learn

Last synced: 04 May 2026

https://github.com/siddhantborse/atmosviz

Atmos Viz is a Python-based project designed to analyze, visualize, and predict global temperature trends across various cities and countries using time-series analysis and advanced data science techniques. Leveraging historical climate data, this project integrates machine learning models, geospatial mapping, and interactive visualizations to unco

geopandas geospatial-analysis gis matplotlib numpy pandas plotly python scikit-learn seaborn shapefiles time timeseries-analysis timeseries-data

Last synced: 05 May 2026

https://github.com/lin826/nanogpt-demo

Training and finetuning local GPTs.

gpt nanogpt numpy pytorch tqdm transformers

Last synced: 05 May 2026

https://github.com/bhaskarkulshrestha/rock-vs-mine-prediction

A machine learning project to predict whether an object detected underwater is a rock or a mine using Logistic Regression.

colab-notebook logistic-regression machine-learning numpy pandas python sckit-learn supervised-learning

Last synced: 05 May 2026

https://github.com/sahu-sakshi/covid-19

A comprehensive data analysis project for COVID-19 pandemic data, featuring automated data collection, feature engineering, visualization tools, and exploratory analysis notebooks.

jupyter numpy pandas python3

Last synced: 05 May 2026

https://github.com/farial-mahmod/nuclear-energy-generation-prediction-logistic-regression

The Nuclear Energy Generation Prediction Logistic Regression project aims to predict nuclear energy generation from production (generation) data covering 1991 to 2023, using Logistic Regression.

energy kaggle logistic-regression machine-learning matplotlib nuclear-energy nuclear-power numpy power python3 sklearn

Last synced: 05 May 2026

https://github.com/kbunwong/stock-price-prediction-using-lstm

This project is about predicting stock prices more accurately using an LSTM algorithm. For this project we fetched real-time data from the yfinance library.

google-colab jupyter-notebook keras-tensorflow lstm-model machine-learning matplotlib numpy pandas python visualization yfinance

Last synced: 05 May 2026

https://github.com/elijahondiek/data-analysis-udacity

Udacity Data Analysis Nano Degree course resources

jupyter-notebook matplotlib numpy pandas python

Last synced: 05 May 2026

https://github.com/hitthecodelabs/petalanalyticsstreamlit

Web application developed with Streamlit that predicts the Iris flower type based on its physical features

matplotlib model numpy pickle python scikit-learn sklearn streamlit

Last synced: 05 May 2026