An open API service indexing awesome lists of open source software.

NumPy

NumPy is an open source library for the Python programming language, adding support for large, multidimensional arrays, and matrices, along with a large collection of high-level mathematical functions to operate on these arrays.

https://github.com/ansh-info/industrial-scale-penicillin-simulation

Optimizing industrial-scale penicillin production using machine learning and data analysis.

jupyter-notebook machine-learning matplotlib numpy pandas python scikit-learn

Last synced: 11 Apr 2026

https://github.com/kavayk29/text-summarization-fine-tuning-pegasus-model

This project uses the Pegasus model to generate concise, coherent text summaries. Fine-tuned on large datasets, it excels in abstractive summarization, ideal for automating tasks like summarizing articles and research papers.

datasets huggingface-transformers matplotlib nltk numpy pandas teansform tqdm

Last synced: 06 May 2026

https://github.com/asherk7/house-price-prediction

House Prices - Advanced Regression Techniques - Predict sales prices and practice feature engineering, RFs, and gradient boosting

data-science numpy pandas regression scikit-learn

Last synced: 15 Apr 2026

https://github.com/jaypanchal9/fraud-detection-case-study

A comprehensive case study applying machine learning techniques to detect fraudulent transactions effectively.

machine-learning matplotlib numpy pandas python3 scikit-learn seaborn xgboost

Last synced: 15 Apr 2026

https://github.com/hansalemaos/cv2_collage

Create a collage from images with OpenCV

collage numpy opencv python

Last synced: 06 May 2026

https://github.com/ljadhav25/decision-tree-random-forest-algorithm-data-science-

This repository contains an implementation of decision tree and random forest algorithms from scratch in Python. Decision trees and random forests are popular machine learning algorithms used for classification and regression tasks. The goal of this project is to provide a clear and understandable implementation of these algorithms

data-analysis data-science decision-trees machine-learning-algorithms matplotlib numpy pandas python random-forest-classifier

Last synced: 15 Apr 2026

https://github.com/hossein-cod/practicing-deep-learning

with keras data set we load a data base and train our machine for test

deep deep-learning jupyter-notebook keras machine-learning matplotlib numpy py python sklearn tensorflow

Last synced: 25 Feb 2026

https://github.com/ayuub34/siesta-smart

This project is designed to be a AIaaS 2-Way-PMS optimized with AI and ML models. The goal is to enhance the efficiency of the system through advanced technology solutions.

ai data-analysis-python data-processing data-science jupyter-python machine-learning matplotlib numpy pandas powerbi-report pyqt5 scipy seaborn sklearn

Last synced: 25 Feb 2026

https://github.com/shwetapardhi/assignment-03-q4-hypothesis-testing

Chi2 contengency independence test Q4. TeleCall uses 4 centers around the globe to process customer order forms. They audit a certain % of the customer order forms. Any error in order form renders it defective and has to be reworked before processing. The manager wants to check whether the defective % varies by centre. Please analyze the data at 5

alternate-hypothesis chi2-contingency contingency-analysis hypothesis-testing norm null-hypothesis numpy numpy-arrays pandas python scipy significance-testing stats

Last synced: 31 Jan 2026

https://github.com/faisal-khann/python-practice

This is the section for python practice, where we engage in programming exercises to improve our Python skill.

numpy numpy-arrays pandas python

Last synced: 10 May 2026

https://github.com/1adityakadam/computer-vision-approach-for-quality-inspection-of-steel-sheets

Machine Learning and Computer Vision project improves steel defect inspection using computer vision. A segmentation technique partitions steel sheet images into regions, enabling better defect analysis. U-Net is used for segmentation, and MobileNetV2 for classification. Performance is evaluated with the Dice Coefficient.

computer-vision machine-learning numpy pandas unet-image-segmentation

Last synced: 10 May 2026

https://github.com/lunanvarg/bid-data-cac22039

En este repositorio se vé mi TP Integrador del programa Codo a Codo: Big Data

colab data-studio-google deepnote excel matplotlib mysql numpy pandas-library python seaborn

Last synced: 06 May 2026

https://github.com/nmerka/practicum_rep

Проекты курса Аналитик данных (Яндекс.Практикум)

jupyter-notebook matplotlib numpy pandas plotly python scipy seaborn sql tableau

Last synced: 31 Jan 2026

https://github.com/pnguyen215/pycelize

Pycelize is a Flask application designed for processing Excel and CSV files. It provides RESTful APIs for common data operations including extraction, normalization, mapping, SQL generation, and file binding.

excel lib numpy openpyxl pandas py-app pycelize python-service python3

Last synced: 17 Jun 2026

https://github.com/samiyaalizaidi/nn-ml-homeworks

Homework solutions for CPE-4903: Neural Networks & Machine Learning at Kennesaw State University.

machine-learning machine-learning-workflow neural-networks numpy scikit-learn

Last synced: 15 Apr 2026

https://github.com/martincastroalvarez/python-video-processing

Movie generator using MoviePy, OpenCV2 and Numpy.

moviepy numpy opencv

Last synced: 15 Apr 2026

https://github.com/vandanac20/advanced-real-estate-valuation-with-ensemble-regression-models

Advanced Real Estate Valuation with Ensemble Regression Models

machine-learning numpy pandas python sklearn

Last synced: 06 May 2026

https://github.com/pedrohenriquebr/pegasus

Finance Reporting Automation

excel finance financial numpy pandas python python3 report

Last synced: 15 Apr 2026

https://github.com/chandkund/sentiment-analysis-on-movie-reviews

Project on Sentiment Analysis of Movie Reviews using K-means clustering. This project clusters movie reviews based on their sentiment scores to understand audience reactions. Includes data preprocessing, feature selection, model development, and visualization.

data-science eda jupyter-notebook machine-learning matplotlib numpy pandas preprocessing python sklearn

Last synced: 01 Feb 2026

https://github.com/bineet-ratna-shakya/data-science-salary-analysis

analyzing a dataset containing salaries of data science professionals from 2020 to 2023.

data-analysis data-science data-visualization jupyter numpy pandas python

Last synced: 01 Feb 2026

https://github.com/nits2612/data-science-projects

Portfolio of data science projects completed by me during PGP AI/ML, self learning, and hobby purposes.

data data-science dataanalysis deep deep-learning keras machine-learning matplotlib numpy opencv pandas python scikit-learn seaborn surprise-python tensorflow transfer-learning

Last synced: 01 Feb 2026

https://github.com/redinfinitypro/python-finance

Rating: (8/10) The code performs a linear regression analysis on stock data using the yfinance library, importing necessary libraries, selecting stock symbols, downloading data, fitting a model, making predictions, and visualizing results.

datetime matplotlib numpy pandas random sklearn yfinance

Last synced: 06 May 2026

https://github.com/luizabash/wmap-tt-eda

EDA of the WMAP 5-Year TT Power Spectrum data, the analysis includes summary statistics, data visualization, polynomial regression for model fitting, and hypothesis testing

astrophysics matplotlib numpy pandas scipy

Last synced: 01 Feb 2026

https://github.com/hasanyahya101/pareto-tutorial-python

This tutorial demonstrates how to create Pareto charts using Python libraries like pandas, NumPy, and Matplotlib. All code for this is given in the Jupyter Notebook.

chart charts code-blocks ipynb jupyter-notebook license markdown matplotlib mit numpy pandas pareto py python

Last synced: 07 Feb 2026

https://github.com/sarowarahmed/advertising-sales-app

📈 Advertising Sales Predictor: A web app powered by a Machine Learning model, built with Numpy, Pandas, Scikit-learn, and Streamlit, to forecast sales based on TV, Newspaper, and Online Advertising. Deployed on Streamlit Cloud for real-time, easy-to-use predictions.

advertising app machine-learning multiple-linear-regression numpy pandas sales scikit-learn streamlit

Last synced: 07 Feb 2026

https://github.com/vladimiracunadev-create/python-data-science-program

Python Data Science Program — 197 clases en 9 partes. Pauta avanzada derivada de Géron, VanderPlas, Huyen, ISLP y Barocas/Hardt/Narayanan. Recurso personal de aprendizaje, enseñanza y mejora continua.

bootcamp data-analysis data-science education jupyter machine-learning matplotlib numpy pandas python scikit-learn

Last synced: 01 Jun 2026

https://github.com/jai0212/course-selector-uoft

A program that suggests university courses based on your program and provides a custom GUI to customize your course plan. Created to help UofT students

beautifulsoup csv graphs graphs-algorithms networkx-graph numpy os parser pickle pillow plotly python tkinter-gui tkmacosx

Last synced: 06 May 2026

https://github.com/luisall12/fraud-detection-machine-learning-python

Fraud detection system that recognises and signals strange transactions using machine learning

detection flask fruad html numpy pandas python ui

Last synced: 15 Apr 2026

https://github.com/harmanveer-2546/student-performance-in-exam

Student performance analysis and prediction using datasets has become an essential component of modern education systems. With the increasing availability of data on student, schools and universities are using advanced analytics and machine learning algorithms to gain insights into student performance and predict future outcomes.

classification-report cluster dbscan dimensionality-reduction k-means-clustering labelling linear-regression logistic-regression numpy pandas pre-processing random-forest-classifier test-train-split unsupervised-machine-learning visualization

Last synced: 06 May 2026

https://github.com/dkm/stlsplit

Tool to split STL in several objects

numpy python stl

Last synced: 17 Jun 2026

https://github.com/jonolav95/advent_of_code_2022

Using Python with Pandas and Numpy (www.adventofcode.com).

numpy pandas

Last synced: 06 May 2026

https://github.com/mdaltamashalam/uber-fare-prediction-models

Predicts the fare amount of Uber rides based on various factors such as pickup/drop-off coordinates, passenger count, and trip distance.

catboost data-analysis data-cleaning data-visualization lgbm-regressor machine-learning matplotlib numpy pandas python random-forest regression-models skit-learn xgboost-algorithm

Last synced: 26 Feb 2026

https://github.com/smahala02/materials-science-introduction

Introduction to Materials Science concepts using Python for array manipulation and visualization with NumPy and Matplotlib.

data-visualization materials-science matplotlib numpy python scientific-computing

Last synced: 09 Feb 2026

https://github.com/elam-parithi/singapore_flatprice_predicting

Flat price prediction with Machine learning tools and python.

matplotlib numpy pandas scikit-learn seaborn

Last synced: 10 Apr 2026

https://github.com/kianaabrisham/svm-from-scratch

Linear SVM from scratch with hinge loss + decision boundaries

classification from-scratch fundamentals hinge-loss numpy optimization scikit-learn svm

Last synced: 07 May 2026

https://github.com/mahmud-kandawala/simulating-covid-19-python-

This repository contains a Python program that simulates the spread of COVID-19 using Numpy and Matplotlib for calculations and visualization. The simulation offers an interactive way to understand virus dynamics and the impact of epidemiological parameters.

matplotlib numpy python

Last synced: 16 Apr 2026

https://github.com/carterbox/libimage

Provides large (2k) test images as NumPy arrays.

images numpy python

Last synced: 15 Apr 2026

https://github.com/sahaavi/color-detection

Colour detection is the process of detecting the name of any color. Well, for humans this is an extremely easy task but for computers, it is not straightforward. Human eyes and brains work together to translate light into color. Light receptors that are present in our eyes transmit the signal to the brain. Our brain then recognizes the color. Since childhood, we have mapped certain lights with their color names. We will be using the somewhat same strategy to detect color names.

color-detection color-detector numpy opencv-python pandas

Last synced: 10 May 2026

https://github.com/malintha-induwara/python-for-ds

This repository contains my personal notes, exercises, and examples for Python

numpy pandas python webscraping

Last synced: 07 May 2026

https://github.com/sriyaroy/coding-challenges

50 Machine Learning & Computer Vision challenges ranging from Easy to Hard to sharpen you ML engineering skills.

coding-challenges computer-vision interview-prep machine-learning numpy python pytorch

Last synced: 07 May 2026

https://github.com/dzakarias/waves

Fun with the sine function inspired by Chladni plates

numpy streamlit

Last synced: 10 May 2026

https://github.com/esoltys/climate_data_viz

FastAPI-based web application that fetches and visualizes climate data from the Copernicus Climate Data Store (CDS).

apache-echarts copernicus-climate-data-store fastapi numpy

Last synced: 10 May 2026

https://github.com/khinthandarkyaw98/python_for_engineers

This particular Python notebook is designed to provide Engineers with an opportunity to practice scientific computations.

engineering numpy python scientific-computing youtube

Last synced: 16 Apr 2026

https://github.com/cherukuri-thanu/conjoint-analysis-netflix

This repository contains configuration files for performing conjoint analysis on Netflix survey data.

conjoint-analysis logistic-regression matplotlib-pyplot netflix numpy pandas squantify statsmodels

Last synced: 16 Apr 2026

https://github.com/prakashjha1/email-sms-spam-classifier

Many email services today provide spam filters that are able to classify emails into spam and non-spam email with high accuracy.

machine-learning natural-language-processing numpy pandas python

Last synced: 10 May 2026

https://github.com/walkerdustin/python-linear-regression

simple linear regression in python based on coursera andrew ng course

ai linear-regression machine-learning ml numpy python

Last synced: 05 May 2026

https://github.com/ledsouza/medphys-bi

Aplicativo desenvolvido para gestão e análise de dados do Serviço de Física Médica em Medicina Nuclear do HCPA

aws boto3 cloud-computing cloud-run docker dotenv gcp mongodb numpy os pandas plotly pymongo python s3 streamlit terraform vitrinedev yaml

Last synced: 10 Apr 2026

https://github.com/yalai92/alfalfa_imp_exp_analysis

This repository covers data cleaning, analysis, and visualization of global alfalfa and pellet imports, focusing on trends from 2003 to 2023. It also includes a predictive analysis of global alfalfa demand for 2024-2029, using data science techniques to provide insights for stakeholders in the alfalfa industry.

data-analysis data-cleaning data-visualization matplotlib numpy pandas python sckiit-learn tableau

Last synced: 12 Feb 2026

https://github.com/rahulsm20/storedata

A data analysis project aimed at analyzing the sales data of the super store and providing useful insight into customer preferences.

data-analysis matplotlib numpy pandas python streamlit

Last synced: 16 Apr 2026

https://github.com/miss-mhv/retail_strategy_and_analytics

The purpose of this work is to analyze data from a supermarket to better understand the types of customers who buy chips and their buying behavior in the area.

jupyter-notebook numpy pandas python3

Last synced: 07 May 2026

https://github.com/batthulavinay/which-anime-dominates-the-world

This Kaggle notebook analyzes the global popularity of anime across different countries. Using data visualization and statistical insights, we explore which anime dominate in various regions and uncover trends in global anime consumption.

data-visualization exploratory-data-analysis matplotlib numpy pandas python seaborn

Last synced: 24 Mar 2025

https://github.com/harmonicode/signal_equalizer

Signal Equalizer is a Python-based application designed to process and analyze various types of signal data, including audio and ECG signals. It provides functionalities for reading, processing, visualizing, and enhancing signal data using techniques such as Wiener filtering.

audio-processing numpy pyqt5 scipy signal-processing wiener-filter

Last synced: 13 Feb 2026

https://github.com/separatrixxx/nm_labs_6_7_sem

👓 Laboratory work for 6 and 7 semesters of MAI on numerical methods

numerical-methods numpy python

Last synced: 16 Apr 2026

https://github.com/rakesh3596/tutedude_projects

It consist of 3 major project 1:Covid19 Analysis 2:Rainfall Prediction 3:Tumor Detection

csv-files matplotlib numpy pandas plotly python sklearn-library

Last synced: 16 Apr 2026

https://github.com/matteopolak/chatterbox

RNN-based chat message generator.

numpy pandas python rnn tensorflow text-generation

Last synced: 07 May 2026

https://github.com/andreashovaldt/simple-neural-network

A simple neural network, created with numpy, based on Stochastic Gradient Descent, used for classifying the MNIST dataset

mnist-classification neural-network numpy stochastic-gradient-descent

Last synced: 16 Apr 2026

https://github.com/kartmaan/plotly-intervals

Group values from a Pandas Series according to given intervals and represent them graphically with Plotly

bar-chart grouping grouping-plots intervals intervals-chart numpy numpy-arrays orca pandas pandas-series percentile pie-chart plot plotly plotting values

Last synced: 10 May 2026

https://github.com/mattia-hulathduwage/wine-quality-analyzer

A machine learning project that analyzes wine quality using clustering, regression, and classification techniques. The model predicts wine quality scores based on chemical properties and determines the most influential features affecting quality.

machine-learning matplotlib numpy pandas python scikit-learn seaborn

Last synced: 16 Apr 2026

https://github.com/awaisfraaz/python-libraries

Here’s a repository description for a project where youA repository dedicated to exploring and experimenting with Python’s main libraries, featuring sample code, documentation, and practical examples for learning and reference.

matplotlib numpy pandas python seaborn

Last synced: 15 Feb 2026

https://github.com/pranavsuriya-sr/e-hub_energysystems

Python based Web Application for analysis and visualization of concepts in Energy Systems.

css energy-systems-analysis html javascript matplotlib numpy python schemdraw streamlit

Last synced: 11 Apr 2026

https://github.com/pointer2alvee/machine-learning-algorithms-scratch

Core Machine Learning concepts , algorithms & maths implemented completely from scratch using Python & numpy only.

machine-learning-algorithms numpy pandas probabilistic-machine-learning python statistical-models

Last synced: 07 May 2026

https://github.com/mgitrov/kulo-chatbot

A retrieval based chatbot aiming to simulate human-like conversation with the user via text messages on chat.

artificial-neural-networks bag-of-words deep-learning json keras lemmatization natural-language-processing nltk numpy tkinter tokenization

Last synced: 07 May 2026

https://github.com/type0-1/salary-truth-predictor

A supervised machine learning regression model. Includes problem statement, approach to solution, code, images, dataset, and Jupyter Notebook for interactive analysis.

linear-regression machine-learning matplotlib-pyplot ml numpy pandas polynomial-regression projects scikitlearn-machine-learning support-vector-regression

Last synced: 16 Apr 2026

https://github.com/jessicahora/studies-on-linear-algebra

Repositorio com Estudos sobre Algebra Linear.

linalg linear-algebra matplotlib-pyplot matrix numpy python scipy

Last synced: 01 Mar 2026

https://github.com/andersoncrs/aprendizaje_no_supervisado_kmeans_customers

Este repositorio contiene un análisis de datos de clientes de un centro comercial utilizando técnicas de aprendizaje no supervisado, específicamente K Means y clustering jerárquico. El objetivo del proyecto es segmentar a los clientes en grupos homogéneos para entender mejor sus comportamientos y características.

data-analysis kmeans-clustering matplotlib numpy seaborn visualization

Last synced: 10 May 2026

https://github.com/williyam-m/movie_booking_app

A Django-based movie booking app featuring an admin dashboard with CRUD functionality for movies, theatres, screens, shows, and bookings. Includes an ML-powered movie recommendation system.

django machine-learning numpy orm pandas python sckiit-learn sqlite3

Last synced: 16 Apr 2026

https://github.com/aadityasikder/neural-network-from-scratch-

This notebook demonstrates a neural network implementation using NumPy, without TensorFlow or PyTorch. Trained on the MNIST dataset, it features an architecture with input layer (784 neurons), two hidden layers (132 and 40 neurons), and an output layer (10 neurons) with sigmoid activation.

backward-propagation evaluation forward-propagation mnist-dataset neural-network neural-networks-from-scratch numpy training

Last synced: 16 Apr 2026

https://github.com/harsha-yuvaraj/data-compression-and-decompression-tool

A specialized lossless file compression and decompression tool designed mainly for plain text, including programming files.

data-structures huffman-compression-algorithm lossless-compression-algorithm numpy python tkinter

Last synced: 15 Apr 2026

https://github.com/magnusrodseth/disaster-tweets

📚 Assignments in the course IT3212 - Data Driven Software at NTNU. Our task is to classify whether a tweet is related to a disaster or not.

adaboost jupyter-notebook logistic-regression numpy pandas python python3 random-forest support-vector-machines xgboost

Last synced: 16 Apr 2026

https://github.com/susanketsarkar/nn-from-scratch

A pure NumPy implementation of a deep neural network, built for educational purposes and deep learning understanding. This project implements a multi-layer neural network with various features commonly found in modern deep learning frameworks.

deep-learning implementation-of-algorithms neural-network numpy python

Last synced: 10 May 2026