An open API service indexing awesome lists of open source software.

NumPy

NumPy is an open source library for the Python programming language, adding support for large, multidimensional arrays, and matrices, along with a large collection of high-level mathematical functions to operate on these arrays.

https://github.com/jsnevt/series_temporais

Previsão de eventos que ocorrem sazonalmente

datetime matplotlib numpy pandas

Last synced: 09 May 2026

https://github.com/rishieeee/spam-email-classifier

A simple machine learning project that classifies emails as spam or ham using TF-IDF and a Multinomial Naive Bayes model. The project covers data cleaning, text preprocessing, feature extraction, model training, and evaluation. A great beginner-friendly introduction to NLP and ML workflows.

multinomial-naive-bayes numpy pandas python sckit-learn tf-idf

Last synced: 09 May 2026

https://github.com/omogbolahan94/nobel-prize-winner-analysis

Data exploration, cleaning, visualization and presentation

markdown matplotlib numpy pandas plotly seaborn

Last synced: 09 May 2026

https://github.com/flexthink/matricize

A convenience library to convert between pure Python objects and their vectorized representations

data machine-learning numpy python

Last synced: 09 May 2026

https://github.com/moonshallow5/food_vision_mini

A program which can detect if an image contains either a pizza, sushi or steak: Has now been deployed on Flutter :)

huggingface matplotlib numpy pytorch

Last synced: 09 May 2026

https://github.com/venkat-0706/titanic-survival-prediction

A machine learning project predicting Titanic passenger survival using data preprocessing, feature engineering, and model optimization with Logistic Regression, Random Forest, and XGBoost.

classification-report confusion-matrix gridsearchcv matplotlib numpy onehot-encoder pandas sckit-learn seaborn train-test-split xgboost

Last synced: 09 May 2026

https://github.com/shaikats/python-assignment

Simulation and Modeling Lab assignments using Python, NumPy, Matplotlib, and Jupyter Notebook.

jupyter-notebook matplotlib numpy python simulation-modeling

Last synced: 09 May 2026

https://github.com/sivasurya681/financial-expense-tracker

The Financial Expense Tracker is a Python-based app built with Flet, Pandas, NumPy, and Matplotlib.pyplot for seamless expense management. It enables real-time tracking, categorization, and data visualization through bar and pie charts. Users can generate summaries, export/import data, and make informed financial decisions easily.

calculator-application differntial expense-tracker flet matplotlib-pyplot numpy pandas python salary-calculator total

Last synced: 09 May 2026

https://github.com/mairagalvao/final_grades

An analysis of the final grades of students using Python

matplotlib numpy pandas python3

Last synced: 09 May 2026

https://github.com/abdullah2020/hamoye_stageb

This is my Hamoye Stage B project. The project focuses on Predicting Energy Efficiency of Buildings. It implemented different Machine Learning algorithm technique that are not limited to Linear Regression, LASSO, Ridge etc.

eda lasso-regression linear-regression numpy pandas predictive-modeling regression-models ridge-regression rmse rsquare-values

Last synced: 09 May 2026

https://github.com/sanjurajveer/market_basket_analysis

Finding pattern of bought objects through market basket analysis

apriori-algorithm association-rules numpy pandas python seaborn

Last synced: 09 May 2026

https://github.com/carol-neto/sprint-4-statistical-data-analysis

In this project I had the opportunity to test my knowledge by analyzing a phone plan and creating graphs to compare the plans and determine which ones generate the most revenue.

matplotlib-pyplot numpy pandas pytho scipy-stats seaborn statistical-analysis

Last synced: 09 May 2026

https://github.com/anker661/autograd-playground

A simple Numpy & Python based auto differentiation system that supports visualization before and after back-propagation.

auto-differentiation autograd computaion-graph jupyter-notebook numpy python3

Last synced: 09 May 2026

https://github.com/betiniakarandut/portfolio-project

Sukkar and Cornell Integral Approach to Estimate Static Bottom Hole Pressure(SBHP)

numpy pandas python3

Last synced: 09 May 2026

https://github.com/alphacrypto246/air-quality-prediction

The Air Pollution Prediction project forecasts pollution levels using the Air Quality UCI dataset, including variables like CO, NOx, temperature, and humidity. It analyzes temporal and environmental factors to predict air quality trends.

machine-learning numpy pandas scikitlearn-machine-learning sckiit-learn

Last synced: 10 May 2026

https://github.com/sahaavi/color-detection

Colour detection is the process of detecting the name of any color. Well, for humans this is an extremely easy task but for computers, it is not straightforward. Human eyes and brains work together to translate light into color. Light receptors that are present in our eyes transmit the signal to the brain. Our brain then recognizes the color. Since childhood, we have mapped certain lights with their color names. We will be using the somewhat same strategy to detect color names.

color-detection color-detector numpy opencv-python pandas

Last synced: 10 May 2026

https://github.com/esoltys/climate_data_viz

FastAPI-based web application that fetches and visualizes climate data from the Copernicus Climate Data Store (CDS).

apache-echarts copernicus-climate-data-store fastapi numpy

Last synced: 10 May 2026

https://github.com/florasteve/ml-foundations-day2

Day-2 ML foundations: probability/stats refresh and NumPy logistic regression; notebooks with visuals.

data-science jupyter-notebook logistic-regression machine-learning matplotlib numpy statistics

Last synced: 10 May 2026

https://github.com/niteshchawla/loantap-logisticregression

Given a set of attributes for an Individual, determine if a credit line should be extended to them. If so, what should the repayment terms be in business recommendations?

confusion-matrix encoding feature-engineering featureimportance hyperparameter-tuning logistic-regression machine-learning numpy pandas-python precision-recall roc-auc-curve sklearn-metrics smote-sampling

Last synced: 10 May 2026

https://github.com/sahil210695/convolution-evolution

This is to see how a kernel will convolve over an image and what will be its output after convolution

cnn convolutional-neural-networks data-science image-kernel image-processing numpy numpy-neural-network python streamlit

Last synced: 10 May 2026

https://github.com/susanketsarkar/nn-from-scratch

A pure NumPy implementation of a deep neural network, built for educational purposes and deep learning understanding. This project implements a multi-layer neural network with various features commonly found in modern deep learning frameworks.

deep-learning implementation-of-algorithms neural-network numpy python

Last synced: 10 May 2026

https://github.com/adityarohatgi11/regression_analysis_london_crime

This project examines how police presence impacts crime rates in London using regression analysis, panel data, and a natural experiment from the 2005 London attacks.

data-wrangling numpy pandas regression-analysis statsmodels

Last synced: 11 May 2026

https://github.com/baschin1103/machine-learning-linear-regression-tsi

The model is the multiple linear regression loaded from scikit-learn. The aim is to create a model to forecast the so called Total-Skill-Index (TSI) of a player from hattrick.org. More information on: https://wiki.hattrick.org/wiki/Total_Skill_Index. In this repository is the programm and necessary data.

csv linear-regression machine-learning numpy pandas python statistics

Last synced: 11 May 2026

https://github.com/akashash01/decision_tree

A very specific type of probability tree that enables you to make a decision on certain process(chance event outcomes, resources cost, utility).

data-science decision-tree-classifier machine-learning numpy pandas-dataframe python sklearn

Last synced: 11 May 2026

https://github.com/mihirh19/uber-analysis

Uber's 2016 dataset analysis offers insightful information about the company's operations and user behaviour. Patterns and trends can be discovered by looking at variables like trip time, distance travelled, and pickup/drop-off locations. Uber may use this information to detect high-demand locations, increase overall efficiency, and optimise driver

matplotlib numpy opendatasets pandas seaborn

Last synced: 11 May 2026

https://github.com/daksh90a/wine-quality-analysis

The Wine Quality Analysis project is an AI/ML-based data analysis initiative aimed at predicting and understanding the factors that influence the quality of wine.

matplotlib-python numpy pandas seaborn

Last synced: 12 May 2026

https://github.com/jsnevt/knn_neighborhood

ML - classificação por vizinhança

kneighborsclassifier matplotlib numpy pandas seaborn

Last synced: 13 May 2026

https://github.com/gsaini/streamlit-ui-getting-started

Streamlit is an open-source Python framework for data scientists and AI/ML engineers to deliver dynamic data apps with only a few lines of code. Build and deploy powerful data apps in minutes. Let's get started!

codecov github-actions numpy pandas python streamlit unittest

Last synced: 14 Jun 2026

https://github.com/pnguyen215/pycelize

Pycelize is a Flask application designed for processing Excel and CSV files. It provides RESTful APIs for common data operations including extraction, normalization, mapping, SQL generation, and file binding.

excel lib numpy openpyxl pandas py-app pycelize python-service python3

Last synced: 17 Jun 2026

https://github.com/harshavardhanbommalata/knn_tutorial

K-Nearest Neighbors (KNN) is a simple yet powerful machine learning algorithm. Unlike models that learn parameters during training, KNN uses lazy learning—it stores the dataset and predicts by finding the closest neighbors, making decisions through majority voting.

machine-learning-algorithms mathematical-modelling matplotlib numpy pandas

Last synced: 20 Jun 2026

https://github.com/huzaifa12466/neuralnetscratch

A simple neural network implemented from scratch in Python using NumPy, trained on the Moons dataset

artificial-intelligence artificial-neural-networks backpropagation deep-learning forwardpropagation machine-learning matplotlib neural-network neural-networks numpy

Last synced: 23 Jun 2026

https://github.com/jim-by/scrape_analysis_books

Scraping book data from the website books.toscrape.com and performing analysis on the collected data.

beautifulsoup json matplotlib numpy pandas python scipy scraping seaborn

Last synced: 09 Apr 2026

https://github.com/ashkan-khd/recommender-systems

A series of recommender systems projects for DATA.ML.360-2024-2025-1 Recommender Systems at Tampere University

collaborative-filtering group-recommendation-system group-recommendations jupyter jupyter-notebooks numpy pandas python recommender-system

Last synced: 09 Apr 2026

https://github.com/RedInfinityPro/DrawImages

Rating: (7/10) The code uses Python's Turtle module to create a canvas image, converting it to grayscale, and then placing dots at each pixel's location based on its intensity.

numpy pil random turtle

Last synced: 30 Sep 2025

https://github.com/subhas-pramanik-09/mediscan-ai

A smart and scalable ML-powered health prediction system that can help detect the risk of three major diseases: Diabetes + Heart Disease + Parkinsons Disease

jupyter-notebook logistic-regression machine-learning numpy pandas scikit-learn streamlit svm-classifier

Last synced: 09 Apr 2026

https://github.com/sebastianofazzino/ibm-data-science-professional-certificate

In this repository I've stored exercises and projects I've been working on while attending IBM Data Science Professional Certificate, using Python and its libraries.

data-analysis data-mining data-science data-structures data-visualization database machine-learning matplotlib numpy pandas python regression seaborn sql

Last synced: 09 Apr 2026

https://github.com/raghulrajn/machine-learning-d-r-y

This repository contains quick python scripts that are repeatedly used in EDA on dataset

data-science numpy pandas python

Last synced: 09 Apr 2026

https://github.com/jcm-ai/Robust-MLOps-Elevating-Machine-Learning-to-Production-Scale

This project aims to address the challenge faced by the Office of Foreign Labor Certification in managing a high volume of job certification applications for employers seeking to hire foreign workers in the United States. By leveraging ML models, this project seeks to streamline the process of shortlisting visa applicants based on historical data.

aws-ec2 aws-ecr aws-s3 cicd-pipeline data-science docker-image end-to-end exploratory-data-analysis git github-actions html machine-learning-models mlops mlops-project model-deployment mongodb numpy pandas python yaml

Last synced: 09 Aug 2025

https://github.com/jalijuhola/amazon-textual-reviews-recommender-

predicting score and recommending using amazon textual reviews

numpy pandas python scikit-learn typescript

Last synced: 09 Apr 2026

https://github.com/kasraskari/learn-python-codes

A beginner-friendly repository to learn Python programming through structured examples and scripts, covering topics from basics to advanced concepts.

jupyter-notebook matplotlib matplotlib-tutorial numpy numpy-tutorial pandas pandas-tutorial python python-learning python-tutorial python-tutorials scipy scipy-tutorial tutorials

Last synced: 09 Apr 2026

https://github.com/mdarshad1000/ml-libraries

Code for the scientific and visualization libraries in Python for Machine Learning.

matplotlib numpy pandas seaborn

Last synced: 09 Apr 2026

https://github.com/hansalemaos/group_by_continuous_sequence

Groups lists/NumPy arrays by continuous sequence

group list numpy python

Last synced: 19 Feb 2026

https://github.com/mani-prakash-n-r/book_recommendation_system_using_hybrid_technique

The Book Recommendation System uses collaborative and content filtering techniques to suggest books based on user preferences and content similarity.

csv html numpy pandas python sckiit-learn

Last synced: 09 Apr 2026

https://github.com/adi3042/diamond-price-prediction

🔮💎 Explore the Diamond Price Oracle! Predicting diamond values based on your preferences and market trends. Your ultimate tool for informed decisions in the diamond world. Join us for the future of pricing! 💎✨🚀 #DiamondPredictor

anaconda datetime diamond flask functools html ipykernel jupytrnotebooks matplotlib numpy pandas price-prediction readme regression sckiit-learn setuptools venv

Last synced: 09 Apr 2026

https://github.com/hajdu-patrik/nlp_music-mood-classifier_bme_hf

This project is a Python-based NLP system that analyzes song lyrics to determine their emotional profile and recommends similar songs.

nlp nlp-machine-learning numpy pandas python

Last synced: 10 Nov 2025

https://github.com/lexxai/goit_python_ds_hw_03

Модуль 3. Класичне машинне навчання. Перенавчання. Лінійна регресія. LaTeX формули.

latex linear-regression matplotlib numpy pandas python scikit-learn

Last synced: 09 Apr 2026

https://gitlab.com/DLR-SY/beos

Beos is a legacy Fortran tool used to calculate the static and dynamic buckling behavior of flat/curved fiber composite structures.

fortran numpy python

Last synced: 16 Aug 2025

https://github.com/abdullah-masood-05/eye-balll-detection

This repository provides advanced eye gaze tracking using MediaPipe Face Mesh for both video files and live camera feeds.

mediapipe numpy opencv scipy

Last synced: 10 Apr 2026

https://github.com/harsha-yuvaraj/data-compression-and-decompression-tool

A specialized lossless file compression and decompression tool designed mainly for plain text, including programming files.

data-structures huffman-compression-algorithm lossless-compression-algorithm numpy python tkinter

Last synced: 15 Apr 2026

https://github.com/ankitsharma-tech/classification-of-arrhythmia-using-ecg-data

A machine learning project to detect and classify arrhythmias from ECG signals using Python, scikit-learn, and TensorFlow. Includes data preprocessing, model training, and evaluation.

arrhythmia biomedical-signal-processing cardiology classification deep-learning ecg ecg-classification healthcare machine-learning mit-bih-dataset numpy python scikit-learn scipy signal-processing tensorflow time-series-analysis

Last synced: 07 Apr 2026

https://github.com/izhaan0/predict-marks-based-on-study-hours

Student Marks Predictor is a machine learning project that predicts a student’s exam scores based on the number of study hours. It uses Linear Regression to learn the relationship between study hours and marks, and provides both command-line and interactive Streamlit web interfaces for prediction and visualization.

data-visualization joblib jupyter-notebook machine-learning machine-learning-algorithms matplotlib-pyplot numpy pandas pandas-dataframe pickle python scikit-learn seaborn

Last synced: 05 Apr 2026

https://github.com/hanannazri/predictive-customer-churn-modelling-with-price-sensitivity-insights-

As a part of BCG Data Science Project, I developed a predictive churn-risk model for XYZ energy utility by engineering price, consumption and contract features and training XGBoost and Random Forest models to identify customers most likely to churn at current prices.

bcgx data-visualization exploratory-data-analysis feature-engineering jupyter jupyter-notebook model-training-and-evaluation numpy pandas random-forest scikit-learn xgboost

Last synced: 19 Aug 2025

https://github.com/harmanveer-2546/movie-industry

Investigate the film industry to gain sufficient understanding of what attributes to success and in turn utilize this analysis to create actionable recommendations for companies to enter the industry.

business business-analytics data-analysis datatime film-industry graphs matplotlib movie-database numpy pandas python scraping-websites seaborn visualization web-scraping-python

Last synced: 10 Apr 2026

https://github.com/mjshubham21/ny_yellow_taxi_python_da_project

A data analysis project of New York Yellow Taxi (Feb of 2025) using Python and its libraries for analytics like : NumPy, MatPlotLib, Pandas and Seaborn.

data-analysis jupyter-notebook matplotlib numpy pandas python seaborn

Last synced: 04 May 2026

https://github.com/liblaf/array-typing

🧮 Type-safe array operations for NumPy, JAX, and PyTorch

array code-quality documentation jax numpy python static-analysis torch type-hints typing

Last synced: 19 Aug 2025

https://github.com/rtmigo/skifts_py

Search for the most relevant documents containing words from a query. Uses Scikit-learn and Numpy

cosine-similarity information-retrieval numpy python scikit-learn text-mining tf-idf

Last synced: 19 Aug 2025

https://github.com/lorenzorottigni/ml-houses

Machine Learning python bootcamp: linear regression on houses model

ipynb linear-regression machine-learning numpy pandas python scikit-learn seaborn

Last synced: 10 Apr 2026

https://github.com/chandkund/customer-segmentation-using-k-means-clustering

Implemented K-Means Clustering to segment customers based on purchasing behavior, enabling targeted marketing strategies. Analyzed data, optimized clusters using the Elbow Method, and derived insights to enhance customer engagement and retention.

data-science kmeans-clustering machine-learning matplotlib numpy pandas python seaborn sklearn

Last synced: 10 Apr 2026

https://github.com/hariprasath-v/hackerearth_transunion-data-science-analytics-hiring-challenge_2022

Machine learning model to classify the credit score based on people bank details and credit related information.

catboost exploratory-data-analysis klib machine-learning matplotlib numpy optuna pandas python seaborn sklearn

Last synced: 10 Apr 2026

https://github.com/ledsouza/machine-learning-introducao-a-classificacao-com-sklearn

Explorar classificação por machine learning, analisando algoritmos em exemplos cotidianos, comparando técnicas e entendendo o aprendizado de máquina por trás, com abordagem científica.

graphviz matplotlib numpy pandas python seaborn sklearn

Last synced: 10 Apr 2026

https://github.com/beyzabasarir/northwind-traders-analysis

Northwind dataset analysis using PostgreSQL, Python, and Power BI. Focused on sales, customers, shipping, and performance insights.

dashboard data-analysis data-visualization jupyter-notebook matplotlib numpy pandas postgresql powerbi python seaborn

Last synced: 10 Apr 2026

https://github.com/trivediayush/Accident-and-Drowsiness-Detection

Our college minor project is a comprehensive solution aimed at enhancing road safety. The project comprises two main components: an Accident Detection Model and a Real-Time Driver Drowsiness System.

accident-detection accidents-analysis college-project cvzone dlib-face-detection dlib-face-recognition face-detection face-recognition imutils minor-project numpy opencv pandas python python3 yolov8

Last synced: 22 Aug 2025

https://github.com/oyebamiji-micheal/data-analysis-with-python-zero-to-pandas

This repository contains all assignments and project completed when I took a course, "Data Analysis with Python: Zero to Pandas", on Jovian

data-analysis numpy pandas python

Last synced: 10 Apr 2026

https://github.com/kunalshelke90/wine-quality-testing

This project is about creating a machine learning algorithm that can predict the quality of wine based on the given dataset. Different machine learning algorithms such as logistic regression, decision tree and random forest are used in this project.

eda feature-engineering flask machine-learning numpy pandas python

Last synced: 10 Apr 2026

https://github.com/marknature/machine-learning-intern

Machine Learning tasks involving the Titanic Dataset and Breast Cancer Wisconsin (Diagnostic) dataset

data-analysis github jupiter-notebook machine-learning matplotlib numpy pandas python scikit-learn sklearn

Last synced: 10 Apr 2026

https://github.com/rotantos/ml-projects

Includes various ML projects created for educational purposes.

classification-model jupyter-notebook machine-learning numpy pandas python sklearn-library

Last synced: 10 Apr 2026

https://github.com/aymen016/cosmic-mystery-challenge-2912

"Explore the depths of space and unravel cosmic mysteries in the year 2912 with our Cosmic Mystery Challenge repository. Dive into data science adventures as you predict the fate of passengers aboard the Spaceship Titanic after a collision with a spacetime anomaly. Join us in reshaping history and saving lives across the universe!"

kaggle matplotlib matplotlib-pyplot numpy pandas pandas-dataframe python scikit-learn scikitlearn-machine-learning seaborn

Last synced: 10 Apr 2026

https://github.com/kwasi-dankwa/game-sales

This project investigates why Nintendo’s global market share fluctuated between 2010–2016, identifies top-performing genres and regions, and builds predictive models to forecast sales and regional performance. It culminates in specific, actionable recommendations to grow market share.

business-intelligence gaming juptyernotebook-language matplotlib numpy pandas postgresql python3

Last synced: 09 Apr 2026

https://github.com/yousefmostafa125/image-processing

An interactive image processing web app built with Streamlit and OpenCV. The app allows users to upload an image, apply various processing steps, and download the results individually or as a ZIP file.

numpy opencv-python python streamlit

Last synced: 03 May 2026

https://github.com/sc0v0ne/blueflix

Simple Application Recommend Movies and Tv Shows

csv k-means k-means-clustering kaggle movies numpy pandas python recommends-movies sklearn streamlit

Last synced: 10 Apr 2026

https://github.com/bdurga26/fake-news-detection

This project aims to find whether the given news is real or fake by using Machine learning Algorithms.

ds ml numpy pandas python sklearn

Last synced: 14 Apr 2026

https://github.com/gregoritsch3/project_spring_pendulum

A solution to the chaotic equations of motion of a spring pendulum using Lagrangean mechanics integrated through symbolic and numeric Python libraries (Sympy, Numpy, Scipy, Matplotlib, etc.).

lagrangian-mechanics maptlotlib numpy python scipy spring-pendulum sympy

Last synced: 12 Apr 2026

https://github.com/sandyherho/buku_metnum_python

Repositori ini memuat kode dan gambar yang digunakan pada buku "Pengantar Metode Numerik Terapan Menggunakan Python".

belajar-pemrograman belajar-python indonesia indonesian-language matematika-terapan matplotlib metode-numerik numerical-methods numpy python scipy

Last synced: 16 Jan 2026

https://github.com/yeonjaee/statistics

통계기초 with using python libs

matplotlib numpy pandas statistics

Last synced: 06 May 2026

https://github.com/quickheaven/scs-3251-statistics-for-data-science

Repository dedicated to Term Project of UofT Statistics for Data Science Course

data-science jupyter-notebook logistic-regression numpy pandas python

Last synced: 10 Apr 2026

https://github.com/yessasvini23/pytorch-bootcamp-on-ai-by-opencv

This OpenCV course is highly recommended for individuals seeking an introductory experience in Computer Vision and AI, and who wish to establish a strong foundation for their career development.

cnn-classification computer-vision cv2-library deeplearning-ai numpy opencv pytorch rnn-tensorflow

Last synced: 13 Apr 2026

https://github.com/thiagohrcosta/machinelearning-temperature

A Small Machine Learning application leveraging Scikit-Learn and statistical learning to extract knowledge from data without explicit programming.

machine-learning numpy pandas python3 scikit-learn

Last synced: 08 Apr 2026

https://github.com/vrush9840/emotion-detection-from-tweets-using-dl

Emotion detection from tweets using Deep Learning (LSTM & BERT) to classify emotions such as Positive, Negative, Neutral, and Irrelevant with emoji recommendations.

glove-embeddings lstm matplotlib nltk numpy pandas python seaborn shap tensorflow

Last synced: 10 Apr 2026

https://github.com/teslaproduuction/data_analiz_al

Прогнозирование курса алюминия с использованием модели ARIMAX

arimax csv django html-css-javascript numpy pandas python rss scipy sklearn sqlite yfinance-library

Last synced: 10 Apr 2026

https://github.com/sujal2004/stock-analysis-project

Stock Market Analysis and Prediction is the project on technical analysis, visualization and prediction using data provided by Google Finance.

jupyter-notebook matplotlib numpy pandas python stock-market

Last synced: 10 Apr 2026