An open API service indexing awesome lists of open source software.

NumPy

NumPy is an open source library for the Python programming language, adding support for large, multidimensional arrays and matrices, along with a large collection of high-level mathematical functions to operate on these arrays.

https://github.com/muhammadabyaz/facial-recognition-based-attendance

Facial Recognition Based Attendance System

numpy open-cv pandas pyqt python3 qml supabase

Last synced: 12 Feb 2026

https://github.com/bogdang888g/jojo_stands

Analysis of Stand parameters from JoJo

numpy pandas plotly python scipy

Last synced: 11 Apr 2026

https://github.com/sralter/sustainability_insights

A data analysis project that derived insights from an emissions dataset sourced from Climate TRACE.

duckdb eda matplotlib numpy pandas tableau

Last synced: 12 Feb 2026

https://github.com/cchandre/rg

Renormalization for the break-up of invariant tori in Hamiltonian flows

expm hamiltonian hamiltonian-dynamics invariant-tori kolmogorov-arnold-moser numpy python3 renormalization-group scipy

Last synced: 01 May 2026

https://github.com/piyush1927/flightforecast

ML model to predict flight prices based on various features like departure time, arrival time, duration, airline, source, destination, and number of stops.

machine-learning mathplotlib numpy pandas scikit-learn seaborn

Last synced: 16 Apr 2026

https://github.com/bhattbhavesh91/tf-numpy-example

Faster NumPy using TensorFlow's tf.experimental.numpy module

keras numpy tensorflow

Last synced: 01 May 2026

https://github.com/code-taweezy/thermos-ai

An AI-powered thermal optimization and anomaly detection system for energy-efficient data centers. Note: this is an MVP.

ai case-study data-visualization lovable-ai machine-learning matpplotlib mvp numpy pandas project python sckiit-learn seaborn solution tensorflow

Last synced: 02 May 2026

https://github.com/camilajaviera91/apache-beam-pipeline-first-approach

This code demonstrates how to integrate Apache Beam with scikit-learn datasets and perform simple data transformations. It loads the Linnerud dataset from scikit-learn and converts it into a pandas DataFrame for easier manipulation.

apache-beam dataframes glob kmeans-clustering matplotlib-pyplot mean-absolute-error mean-square-error numpy os pandas pipelines scipy-stats seaborn silhouette-score sklearn sklearn-datasets standardscaler

Last synced: 28 Apr 2026

https://github.com/datarohit/nlp-course-files

This repo contains the files for the online NLP course from Udemy.com, which I completed.

nlp nlp-machine-learning nltk numpy panda python sklearn spacy

Last synced: 12 Apr 2026

https://github.com/datarohit/imdb-reviews-sentiment-analysis

This is an NLP sentiment analysis project built using the Bernoulli Naive Bayes algorithm to predict whether an IMDB movie review is positive or negative.

bernoulli-naive-bayes classification count-vectorizer multinomial-naive-bayes nltk numpy pandas porter-stemmer regex sentiment-analysis sklearn

Last synced: 29 Apr 2026

https://github.com/cs-joy/numpyv1.25.2

@numpy

numpy

Last synced: 03 May 2026

https://github.com/felixsoderstrom/perceptron-for-ocr

Training an AI model to recognize handwritten digits

ai backpropagation machine-learning neural-network numpy pytorch

Last synced: 23 Apr 2025

https://github.com/realtristan/micronet

Neural network library from scratch

neural-networks numpy python pytorch

Last synced: 03 May 2026

https://github.com/saptak625/doublependulum

This repository is my derivation of the equations of motion of a double pendulum using Lagrangian Mechanics and then solving the nonlinear, coupled equations using a Runge-Kutta 45 integration scheme.

differential-equations lagrangian-mechanics matplotlib numerical-integration numpy python scipy vpython

Last synced: 03 May 2026

https://github.com/htlinh0604/invoice-data-extraction

This project demonstrates a classic OCR pipeline. This Flask app takes an image, applies an OpenCV preprocessing pipeline, and uses Tesseract OCR to digitize Vietnamese invoices (Bách Hóa Xanh).

flask numpy ocr-recognition opencv pandas python regex tesseract-ocr

Last synced: 10 Nov 2025

https://github.com/yangeorget/clustering-algorithms

K-means and C-means experiments with NumPy

clustering clustering-algorithm numpy

Last synced: 02 Apr 2026

https://github.com/nguyenanhtuan1912/computer_vision_exercises

This repository will contain the source files for Computer Vision exercises.

computer-vision numpy opencv python

Last synced: 12 Apr 2026

https://github.com/arjunravi26/image-editor

A Django-based image manipulation website

django jsonresponse numpy opencv pillow

Last synced: 17 Jun 2026

https://github.com/valmir-unicap/rastreamento-de-gestos-para-controle-video

Final project for the computer vision course

ctypes mediapipe numpy opencv python vlc

Last synced: 13 Apr 2026

https://github.com/dipto1971/statistical-analysis-data-science

Statistical Analysis in Data Science

numpy pandas tensorflow

Last synced: 15 Apr 2026

https://github.com/programmer-rd-ai/moviedatascraper

Explore the cinematic universe with our IMDb web scraping project! Dive into movie data with ease, uncovering insights from cast to critical reviews. With dynamic visualizations and reliable data, let's journey through the world of movies like never before. Lights, camera, analysis!

beautifulsoup beautifulsoup4 data data-analysis jupyter-notebook matplotlib numpy pandas programming python python3 scraping seaborn software web

Last synced: 01 Mar 2025

https://github.com/pr38/cox_ph_estimation_notebooks

Personal discovery work on estimating Cox proportional hazards coefficients for both Breslow and Efron ties, using both autograd and directly calculating the gradient and Hessian

cox-regression dask data-science machine-learning numpy pytensor statistics survival-analysis

Last synced: 15 Apr 2026

https://github.com/priyanka7411/customer-churn-prediction

Customer Churn Prediction using machine learning. This project covers the full data science lifecycle—data cleaning, exploratory analysis, feature engineering, model building, and extracting actionable business insights.

jupyter-notebook model numpy pandas python random-forest-classifier scikit-learn-matplotlib-seaborn

Last synced: 04 May 2026

https://github.com/hansalemaos/cythonsequencefinder

Module for finding sequences in arrays using Cython and NumPy.

cython finder locate numpy sequence

Last synced: 17 Jun 2026

https://github.com/pinedah/escom_programming-for-data-science

This repository contains personal notes, exercises, and programs from the Programming for Data Science course at Instituto Politécnico Nacional (IPN). The course focuses on using Python programming for data handling, statistical analysis, and machine learning techniques.

data-science escom matplotlib numpy pandas python python-algorithms python-library

Last synced: 04 May 2026

https://github.com/codewitheshayoutube/corvit_python_data_analytics_course

I teach Python & Data Analytics at Corvit Systems. This repository contains all course materials, including code, assignments, projects, and resources. ⭐ Stay updated!

matplotlib numpy pandas python python3 scikitlearn-machine-learning seaborn

Last synced: 13 Apr 2026

https://github.com/sehgalnaval/olympics-data-analysis

I will be building an Olympics Data Analysis Web Application using Streamlit. For development, I will be using Python and Pandas. For plotting, I will be using Seaborn and Plotly libraries. The project will also be deployed to Heroku.

heroku jupyter-notebook matplotlib numpy pandas pycharm python seaborn

Last synced: 11 Apr 2026

https://github.com/prabath1998/rice_leaf_disease_ditection_system

This project is a Paddy Crop Disease Detection System that uses machine learning to identify diseases in paddy (rice) leaves. Farmers can upload an image of a paddy leaf, and the system will predict whether the leaf is healthy or affected by a specific disease.

flask joblib numpy opencv pillow sklearn

Last synced: 15 Apr 2026

https://github.com/aarryasutar/hate_speech_detection

This project aims to detect hate speech on Twitter using advanced NLP and machine learning techniques, exploring feature extraction methods like TF-IDF and sentiment analysis, and evaluating models such as Logistic Regression and SVM.

confusion-matrix doc2vec gensim logistic-regression matplotlib naive-bayes nltk numpy pandas python random-forest scikit-learn seaborn stemming stopwords-removal svm tf-idf-vectorizer tokenization vader word-cloud

Last synced: 09 Apr 2026

https://github.com/balaji24092001/python

All Python-related content: notes, problems, progress archive.

matplotlib numpy opencv pandas python3 scipy-stats seaborn sklearn sqllite

Last synced: 12 Apr 2026

https://github.com/megha060199/academic-erp

Python, SQL, JavaScript

mysql numpy python

Last synced: 04 May 2026

https://github.com/h3ar7b3a7/exploringjupyterlab

This is just an exploration in using Jupyter Lab, Notebook and some useful Python libraries. Some basic commands, markups and extensions setup are documented here too.

bokeh extensions jupyterlab matplotlib notebook-jupyter numpy pandas

Last synced: 04 May 2026

https://github.com/svetlanam/data-scripts

Simple scripts for data cleaning, ETL transformations, and data reorganisations

data-cleaning data-transformation dimensions etl facts keboola numpy pandas python3 snowflake sql star-schema

Last synced: 04 May 2026

https://github.com/ayushsaksena30/cosmic-classifier

This notebook implements a structured machine learning pipeline to classify cosmic data using the CatBoost Classifier, known for its efficiency with categorical features and minimal preprocessing requirements.

catboost-classifier label-encoder machine-learning matplotlib numpy pandas robust-scaler scikit-learn seaborn simple-imputer

Last synced: 15 Apr 2026

https://github.com/aymanmomin/facebook-ego-network-analysis-for-political-outreach-optimization

Analyzing Facebook ego networks to identify influential users and communities for optimizing political advertising. Leverages network science techniques (Louvain communities, centrality metrics, threshold models) to simulate influence spread.

community-detection course-project facebook-network-analysis louvain-algorithm louvain-community-detection matplotlib network-analysis network-graphs networkx-library numpy pandas python3 pyvisgraph research-paper scipy-stats sklearn-metrics tqdm

Last synced: 04 May 2026

https://github.com/bpkaur/whats-in-a-name

Exploring a dataset of first names of babies born in the US to uncover interesting stories

data-analysis datacamp numpy pandas python3

Last synced: 04 May 2026

https://github.com/jakeoeding/rippy

A Python seam carving implementation

numpy python3 seam-carving

Last synced: 04 May 2026

https://github.com/ola-kaznowska/median-statistic

A program that calculates the median statistic. Written in Python 3.13.1 with the NumPy library during a Machine Learning course.

mathematics numpy python python3

Last synced: 04 May 2026

https://github.com/arsalan692/python-telusko-playlist

A collection of beginner Python scripts and practice exercises from my early learning days, inspired by Telusko and other resources.

basic-programming bs4 matplotlib numpy oop pandas python requests selenium tkinter

Last synced: 04 May 2026

https://github.com/charlenry/machine_learning_estimer_prix_voiture

Machine Learning project to estimate the price of a used car via an application

joblib jupyter-notebook matplotlib-pyplot numpy oop-python pandas plotly-express python sklearn-linear-model sklearn-linear-regression

Last synced: 04 May 2026

https://github.com/hyperplasma/olympic-visualization-analysis

Multidimensional analysis and visualization of Olympic medals, economy, and happiness index.

data-analysis data-visualization matplotlib numpy pandas python wordcloud

Last synced: 04 May 2026

https://github.com/silbena/gibbssampler

Gibbs Sampler for a simple mortgage model.

gibbs-sampler gibbs-sampling numpy python

Last synced: 04 May 2026

https://github.com/wwwmisla/gerador-mapas-calor

Computer vision system for generating heat maps based on movement in public spaces, supporting urban planning and efficient use of space.

color computer-vision demo google-colab gradio heatmap marchine-learning matplotlib model numpy opencv people-detection python smart-city ufrn visao-computacional vision-computer yolo yolov8

Last synced: 04 May 2026

https://github.com/abhijeet-bhambere/exploratory_data_analysis

A collection of several projects for conducting EDA on datasets from various sources

exploratory-data-analysis matplotlib numpy pandas python

Last synced: 04 May 2026

https://github.com/shamiul5201/selfie_app_features_application

OpenCV project to build skills in image processing: features include Cartoonify and Pencil Sketch effects in a Jupyter Notebook, blemish removal in a Python script, and chroma keying for videos. Perfect for exploring creative and practical computer vision techniques.

computer-vision numpy opencv python

Last synced: 04 May 2026

https://github.com/0nom4d/308reedpipes

Mathematical EPITECH Project - This project computes a representation of a reedpipe.

coverage epitech epitech-project matplotlib-pyplot numpy pytest python scipy

Last synced: 04 May 2026

https://github.com/manuethomas/credit-default-risk-analysis-eda

This repository contains a detailed EDA of the Home Credit Group dataset. The analysis aims to find demographic and financial factors associated with higher or lower default risks, providing actionable insights for risk mitigation and improved lending practices.

bivariate-analysis correlation-analysis data-preprocessing exploratory-data-analysis exploratory-data-visualizations matplotlib numpy pandas seaborn univariate-analysis

Last synced: 04 May 2026

https://github.com/dyrits/parite

OPENCLASSROOMS | PERFECTIONNEZ-VOUS EN PYTHON (Improve Your Python Skills)

matplotlib numpy pandas python

Last synced: 04 May 2026

https://github.com/aqueeqazam/machine-learning-using-scikit

This repository contains all of the algorithms used to train the machine learning models using the scikit-learn library.

numpy scikit-learn

Last synced: 04 May 2026

https://github.com/asuquoaa/applied_data_science_with_python_study

Notes, code, and practicals on the Applied Data Science with Python specialization by UM

matplotlib numpy pandas seaborn

Last synced: 04 May 2026

https://github.com/srevenant/data-science-alpine

A Docker container for data science, using Alpine Linux and Python 3

alpine data numpy pandas python3 science scipy xgboost

Last synced: 05 May 2026

https://github.com/bhaskarkulshrestha/rock-vs-mine-prediction

A machine learning project to predict whether an object detected underwater is a rock or a mine using Logistic Regression.

colab-notebook logistic-regression machine-learning numpy pandas python sckit-learn supervised-learning

Last synced: 05 May 2026

https://github.com/iamwatchdogs/retail-insights-from-superstore-data

A simple data analysis project for finding insights within the Superstore data set

conda-environment data-analytics jupyter-notebook matplotlib numpy pandas python

Last synced: 05 May 2026

https://github.com/vc-nithesh944/pandas_numpy

Practice repository for NumPy & Pandas with scripts, notebooks, and notes for future revision

numpy pandas

Last synced: 05 May 2026

https://github.com/nabilshadman/multiprocessing-time-series-data-simulation

A simulation using multiprocessing to generate 10,000 time-dependent samples from an initial dataset of 20 samples

multiprocessing numpy pandas scipy simulation statistics time-series

Last synced: 05 May 2026

https://github.com/thomasa/python-intro

This is a Jupyter notebook slide show for a 2-hour crash intro to Python, NumPy, Matplotlib, and SciPy

matplotlib numpy python scipy tutorial

Last synced: 05 May 2026

https://github.com/lruizap/tomography-with-algebraic-topology

This project aims to analyze images using algebraic topology. The goal is to extract structural information from images through mathematical concepts such as connected components and loops.

gudhi jupyter-notebook matplotlib numpy opencv python tomography

Last synced: 05 May 2026

https://github.com/cicku/en.650.672

Homework for EN.650.672

analytics data-analysis numpy pandas

Last synced: 05 May 2026

https://github.com/ahmedabdalkreem/airline_passenger_satisfaction

A machine learning project that shows customer satisfaction with an airline's service, that is, whether the service made customers like the experience or not.

decision-trees gradient-boosting knn logistic-regression machine-learning matplotlib naive-bayes numpy pandas preprocessing python random-forest svm visualization

Last synced: 05 May 2026

https://github.com/batrick-swaistan/nalaiya_thiran

Nalaiya Thiran IBM Course Training

matplotlib numpy pandas-dataframe python

Last synced: 05 May 2026

https://github.com/pyladiesams/python-oss-sprints-mar2024

Sprints to contribute to the most popular data and AI Python OSS libraries

conda dask geopandas jupyter matplotlib numpy pandas pymc sklearn

Last synced: 05 May 2026

https://github.com/muthupillai1204/diwali_sales_analysis

The Diwali sales analysis reviews past data to identify trends, peak buying times, popular products, and customer demographics. It assesses sales volume, revenue growth, and promotional effectiveness, helping businesses optimize marketing and inventory for future seasons.

data datacleaning eda excel jupyter-notebook matlplotlib numpy pandas python seaborn visualization

Last synced: 05 May 2026

https://github.com/amruta33/lead_scoring_case_study

An education company named X Education sells online courses to industry professionals. On any given day, many professionals who are interested in the courses land on their website and browse for courses.

bagging linear-regression numpy pandas python3

Last synced: 05 May 2026

https://github.com/supernovasatsangi23/modifying-biomarker-gene-identification-for-effective-cancer-categorization

A project that implements a hybrid approach to identifying biomarker genes for better cancer categorization. The methodology combines the MRMR filter method for feature selection, a steady-state genetic algorithm, and an MLP classifier.

dataset deep-learning deep-neural-networks feature-selection genetic-algorithm machine-learning machine-learning-algorithms mlp-classifier mrmr neural-network numpy pandas-dataframe python python3 scikit-learn scikit-learn-python tkinter-gui tkinter-python

Last synced: 05 May 2026

https://github.com/hms75/movie_rating_analysis

A movie rating analysis which identifies trends amongst a dataset of 5000 movies.

data-analysis data-visualization matplotlib-pyplot numpy pandas python

Last synced: 05 May 2026

https://github.com/shwetapardhi/assignment-1-q9_b--basic-statistics-level-1

Q9) Calculate skewness and kurtosis and draw inferences on the following data: SP and Weight (WT). Use Q9_b.csv.

kurtosis matplotlib-pyplot numpy pandas python seaborn skewness

Last synced: 06 May 2026

https://github.com/kashinathbiradar/bangalore-housing-price-prediction

The objective of the project is to create a machine learning model. We use supervised learning, and our aim is predictive analysis of housing prices.

css deta doc html machine-learning matplotlib numpy pandas preprocessing seaborn sklearn sql

Last synced: 06 May 2026

https://github.com/ianjure/simple-neural-network

Implementing a simple neural network using only NumPy.

deep-learning neural-network notebook numpy scratch-implementation

Last synced: 06 May 2026

https://github.com/skorupcia/winequality-white

winequality-white.csv visualization

matplotlib numpy pandas seaborn sklearn

Last synced: 06 May 2026

https://github.com/andrewsy1004/logistic-regression-spam-classifier

This project implements a spam email classifier using Logistic Regression.

numpy pandas scikit-learn

Last synced: 06 May 2026

https://github.com/coder5omkar/lendingclubcasestudy

Performed exploratory data analysis (EDA) on the loan dataset from the Lending Club Case Study to identify the key factors influencing loan defaults.

bivariate-analysis data-science exploratory-data-analysis matplotlib numpy pandas seaborn univariate-analysis

Last synced: 06 May 2026

https://github.com/deanlogan/data-analysis-course

Code created when completing the Data Analysis with Python Course on freecodecamp.org

course data-analysis numpy pandas python python3

Last synced: 06 May 2026

https://github.com/cycle-sync-ai/student-score-analysis

A data-driven student performance analysis project using UCI dataset (396 students, 33 features). Implements machine learning models (K-means, PCA, Decision Tree, Random Forest, Linear Regression) to analyze academic patterns and predict student scores based on lifestyle, health, and study habits.

clustering clustering-algorithm decision-trees feature-engineering learning-management-system linear-regression machine-learning machine-learning-algorithms matplotlib numpy pandas pca pickle prediction prediction-algorithm scikit-learn score seaborn student

Last synced: 06 May 2026

https://github.com/christiansassi/signal-image-and-video-project

Project developed by Pietro Bologna (@bolognapietro) and Christian Sassi for the Signal, Image and Video course.

image-processing mri-segmentation numpy opencv python

Last synced: 06 May 2026

https://github.com/kavayk29/text-summarization-fine-tuning-pegasus-model

This project uses the Pegasus model to generate concise, coherent text summaries. Fine-tuned on large datasets, it excels in abstractive summarization, ideal for automating tasks like summarizing articles and research papers.

datasets huggingface-transformers matplotlib nltk numpy pandas teansform tqdm

Last synced: 06 May 2026

https://github.com/timeless-residents/handson-tensorflow-numpy

A practical guide to learning and using TensorFlow and NumPy for data manipulation and machine learning

data-science machine-learning numpy tensorflow tutorial

Last synced: 06 May 2026

https://github.com/lunanvarg/bid-data-cac22039

This repository contains my integrative final project (TP Integrador) for the Codo a Codo program: Big Data

colab data-studio-google deepnote excel matplotlib mysql numpy pandas-library python seaborn

Last synced: 06 May 2026

https://github.com/vandanac20/advanced-real-estate-valuation-with-ensemble-regression-models

Advanced Real Estate Valuation with Ensemble Regression Models

machine-learning numpy pandas python sklearn

Last synced: 06 May 2026