An open API service indexing awesome lists of open source software.

NumPy

NumPy is an open source library for the Python programming language, adding support for large, multidimensional arrays, and matrices, along with a large collection of high-level mathematical functions to operate on these arrays.

https://github.com/armahdavi/mlops

Productionizing ML Models using a variety of tools including FastAPI, Flask, Doocker, AWS, GCP, TensorFlow Extended (TFX), and TF.js.

apache-beam docker fastapi logistic-regression mlops mlops-workflow numpy random-forest sklearn tensorflow-extended unicorn xgboost

Last synced: 12 Apr 2026

https://github.com/filip-kustura/python-covid-19-behaviors-analysis

Using Jupyter Notebook, this university project analyzes attitudes and behaviors related to the COVID-19 pandemic using a two-year survey from Imperial College London and YouGov research company. Utilizing Pandas, NumPy and Matplotlib, the data analysis focuses on three countries, exploring trends and insights throughout the pandemic.

covid-19 data-analysis data-visualization jupyter-notebook matplotlib numpy pandas python university-project

Last synced: 12 Apr 2026

https://github.com/lijesh010/employeeanalysis-

This is a simple data analysis project developed as part of my Entri Elevate Data Science and Machine Learning course. It serves as a practical demonstration of my skills in data analysis.

jupyter-notebook matplotlib numpy pandas python seaborn

Last synced: 12 Apr 2026

https://github.com/fabricioarendtorres/streamauc

Light-weight package for classification metrics computed on streams or minibatches of data. Mainly for area under the curve (AUC) of precision-recall (PR) or receiver operating characteristic (ROC) curves. Supports multi-class setting with either macro- or micro aggregation..

classification-model machine-learning metrics numpy precision-recall-curve receiver-operating-characteristic

Last synced: 06 Jan 2026

https://github.com/lorenzorottigni/ml-random

Machine Learning course of Piero Savastano 7: matplotlib, accuracy_score, precision_score, recall_score, f1_score, classification_report

machine-learning matplotlib metrics numpy

Last synced: 04 May 2026

https://github.com/rizkyrajitha/ann_101

A really simple neural network for binary classification written from scratch using python

jupyter-notebook numpy pandas

Last synced: 08 May 2026

https://github.com/bhushan148/ipl-stats-analysis

This project explores IPL data to show team and player performances. 📈 Key highlights include analyzing match results, highest scores, and performance at different venues. 🏟️ It uses charts to reveal trends like win margins and average scores by venue. 📉 Discover important insights into what affects match outcomes and team success.

jupyter-notebook markdown matplotlib-pyplot numpy pandas python seaborn

Last synced: 12 Apr 2026

https://github.com/constraintautomaton/client_pgu1

The client part of the PgU1 project it. It command the robot and do the video processing

numpy opencv paramiko pyqt5 python3 raspberry-pi-3

Last synced: 12 Apr 2026

https://github.com/22p31a0512/sentimental-analysis

Build a model to classify text as positive, negative, or neutral. Apply NLP techniques for preprocessing and machine learning for classification. Aim for accurate sentiment prediction on various text formats.

feature-engineering machine-learning natural-language-processing numpy pandas python supervised-learning text-classification text-processing tokenization

Last synced: 12 Apr 2026

https://github.com/akashkg03/facial-expression-image-classification

This notebook involves to build a facial expression image classifier which categorizes facial expressions into one of seven emotions: anger, disgust, fear, happiness, sadness, surprise, and neutral.

jupiter-notebook numpy pandas python

Last synced: 28 Apr 2026

https://github.com/armahdavi/data_pipeline_analytics_statistics_ml_pm_psd_residential_qff

Sharing all the data pipelines and processing codes, statistical modellings, descriptive statistics, plot visualizations, and machine learning from Mahdavi & Siegel (2021) (Indoor Air) Project Miestone: 2017 - 2020 Full-length article: https://onlinelibrary.wiley.com/doi/abs/10.1111/ina.12782

data-science data-visualization dust hvac indoor-air-quality jupyter-notebook machine-learning matplotlib-pyplot numpy pandas python scikit-learn scipy-stats spyder spyder-python-ide statistics

Last synced: 11 Apr 2026

https://github.com/sonatipek/matris_transactions_py

My "Python Programlama-I" lesson's homework. Some matris transactions with numpy library in Python

matrix numpy numpy-arrays python sonatipek

Last synced: 24 Mar 2025

https://github.com/sothulthorn/stock-prediction-portal

The Stock Prediction Portal is a full-stack web application that demonstrates how to use machine learning models to analyze stock market data and forecast future trends.

axios django django-rest-framework javascript keras matplotlib numpy pandas python react tensorflow yfinance

Last synced: 02 Apr 2026

https://github.com/hariprasath-v/av-job-a-thon-november-2022

Build a machine learning/deep learning approach to forecast the total energy demand on an hourly basis for the next 3 years based on past trends.

exploratory-data-analysis kaggle lightgbm-regressor matplotlib numpy pandas python rmse-score seaborn sklearn statsmodels timeseries-forecasting

Last synced: 12 Apr 2026

https://github.com/Sadegh-Khedry/Pneumonia-Detection-Using-CNN

Classifying chest X-ray images into two categories: Pneumonia or Normal using a Convolutional Neural Network (CNN).

cnn convolutional-neural-networks cv2 deep-learning harris-corners joblib jupyter-notebook keras kmeans-clustering matplotlib numpy orb pandas python seaborn shutil sifit tensorflow

Last synced: 12 Apr 2026

https://github.com/otuemre/housepricingml

A machine learning project predicting house prices using regression models. Covers data preprocessing, feature engineering, and model comparison to achieve accurate results. Developed for a Kaggle competition, focusing on effective ML workflows and model interpretability.

eda encoding evaluation-metrics kaggle-competition lightgbm-regressor machine-learning matplotlib-pyplot neural-networks numpy pandas preprocessing python ridge-regression scikit-learn seaborn tensorflow xgboost-regression

Last synced: 13 Apr 2026

https://github.com/manikantasanjay/data_analysis_using_python_libraries_series

This Series contains Data Analysis projects performed on different Kaggle datasets and providing valuable insights into the data by making use of Python libraries.

exploratory-data-analysis kaggle-datasets machine-learning matplotlib numpy opendatasets pandas python visualization

Last synced: 12 Apr 2026

https://github.com/andrewzhoumq123/sci_graph_api

Generates different types of detailed plots for scientific data

fastapi h5py json matplotlib numpy pandas scipy

Last synced: 12 Apr 2026

https://github.com/chandkund/sentiment-analysis-using-nlp

This project focuses on Sentiment Analysis using the textual content from product reviews. The goal is to analyze user sentiments based on their written feedback, particularly focusing on the "reviewText" column in the dataset.

lemmatization matplotlib nlp nltk numpy pandas seaborn tokenization

Last synced: 31 Jan 2026

https://github.com/m-rishab/job-recruitment-prediction-and-hr-dashboard-using-plotly

This project features make it ideal for dynamic HR dashboards, offering insights into candidate profiles and recruitment processes.

correlation-analysis flask kmeans-clustering numpy pandas plotly python scikit-learn seaborn standardscaler

Last synced: 12 Apr 2026

https://github.com/soumyapro/heart-disease-prediction

This project is aimed at predicting the likelihood of coronary heart disease (CHD) in individuals over the next ten years using Logistic Regression.

logistic-regression numpy pandas simpleimputer sklearn

Last synced: 21 Jan 2026

https://github.com/nakshjainsonigara/football-eda

This EDA offers a comprehensive exploration of football analytics, leveraging Python libraries to analyze players, club games, and various other aspects of the sport. Through statistical analysis and visualization, we uncover insights into player performance, club strategies, and league dynamics providing valuable insights for football enthusiasts.

dash exploratory-data-analysis matplotlib numpy pandas plotly python scipy seaborn statistics

Last synced: 07 Jan 2026

https://github.com/abideen-olawuwo/logistic-regression-house-prediction

Using logistic Regression Model to predict house price

logistic-regression matplotlib numpy pandas python seaborn sklearn

Last synced: 12 Apr 2026

https://github.com/linggarm/priority-task-selection-using-evolutionary-programming

A web app that utilizes Evolutionary Programming to determine and prioritize tasks for efficient scheduling.

artificial-intelligence css evolutionary-algorithms evolutionary-programming flask html javascript machine-learning numpy python

Last synced: 12 Apr 2026

https://github.com/nilayhangarge/data-analysis-with-python

This repository provides a practical introduction to data acquisition and analysis using Pandas. It covers loading datasets, exploring data, manipulating data, and gaining insights through statistical summaries. Ideal for beginners, it offers code examples and explanations to enhance your data manipulation skills using Pandas for Python.

data-acquisition data-analysis data-analytics data-binning data-cleaning data-engineering data-fundamentals data-insights data-integration data-preprocessing data-science data-wrangling numpy pandas python

Last synced: 12 Apr 2026

https://github.com/d1ogocs/afinador-de-instrumentos

Desenvolvimento de um afinador que se ajusta automaticamente ao instrumento musical escolhido pelo utilizador

butterworth-filter instrument-tuner matplotlib numpy pyaudio python scipy threading tkinter

Last synced: 07 Jan 2026

https://github.com/konynour/facedectection

This project demonstrates real-time face detection using OpenCV and a pre-trained SSD (Single Shot MultiBox Detector) model. The system captures video from a camera, processes the frames, detects faces, and displays the results with bounding boxes and confidence scores. This application showcases the use of deep learning models for object detection

cv2 jupyter-notebook lambda-functions matplotlib numpy opencv python3

Last synced: 12 Apr 2026

https://github.com/wisdom-osborn/data-analytics-course-online-

🔍 Data Analytics with Python — Hands-on Course Materials Jupyter notebooks, projects, and datasets based on the freeCodeCamp Data Analysis with Python certification. Learn NumPy, Pandas, data cleaning, and visualization through real-world examples

data data-analysis data-science data-visualization freecodecamp numpy pandas pandas-dataframe project python

Last synced: 19 Apr 2026

https://github.com/mairagalvao/steak_risk_data

A data analysis based on correlations and comparisons among the samples.

database matplotlib numpy pandas python3 sql

Last synced: 12 Apr 2026

https://github.com/supriya811106/twitter-sentiment-analysis

Analyzing the mood of tweets! We sort tweets on popular topics into positive, negative, or neutral categories to gauge public opinion. See what Twitter really thinks!

bernoulli-naive-bayes jupyter-notebook matplotlib nlp-machine-learning nltk numpy pandas python scikit-learn seaborn sentiment-analysis text-classification tfidf-vectorizer wordcloud

Last synced: 05 Apr 2026

https://github.com/AsuquoAA/CDC_2017_Immunization_Data_Analysis

This project analyzes data from the CDC's 2017 National Immunization Survey Public Use File (NISPUF17) to answer research questions about immunization, child health, and vaccine-related behaviors.

numpy

Last synced: 21 Jul 2025

https://github.com/rubada/machine-learning-with-ruba-dabbas

Advance your skills and start your career here, by taking the online courses on Intuidemy.

course learning machine machine-learning matplotlib matplotlib-pyplot models numpy pandas python scikit-learn

Last synced: 12 Apr 2026

https://github.com/trissim/polystore

Framework-agnostic multi-backend storage abstraction for ML and scientific computing

backend data io jax ml multi-framework numpy pytorch scientific-computing storage tensorflow zarr

Last synced: 12 Apr 2026

https://github.com/ahmedabdalkreem/skin-cancer

In this project we work to extraction features from Images using CNN and build the Neural Network to arrive the patient have skin-cancer Malignant or Benign.

cnn deep-learning matplotlib neural-network numpy pandas python3 sklearn-library

Last synced: 12 Apr 2026

https://github.com/riya2624/heart-disease-diagnostic-analysis

Analyzed heart disease diagnostic data through an ETL process, ensuring data accuracy and readiness for analysis using Python (Pandas, NumPy, Matplotlib, Seaborn). Conducted comprehensive exploratory data analysis (EDA) to uncover trends related to heart disease rates by gender and age. Developed interactive dashboards with Power BI and Tableau

dashboard dax-query eda machine-learning matplotlib numpy pandas python seaborn

Last synced: 12 Apr 2026

https://github.com/ankitmishralive/machinelearning

Continuously deep diving in understanding & advancing my expertise in Machine Learning through ongoing education and hands on experience with practical learning.

artificial-intelligence data-analysis data-cleaning data-gathering machine-learning machinel-learning-algorithms matplotlib numpy pandas python seaborn

Last synced: 22 Mar 2025

https://github.com/liviuxyz-ctrl/financialportofoliosimulator

FinancialPortfolio is your financial planning ally, helping you manage portfolios, calculate future values, and visualize performance, all with user-friendly ease🚀

matplotlib numpy python simulation

Last synced: 12 Apr 2026

https://github.com/ishtufinquant/portfolio-optimizer

Optimizes stock portfolios using simulation and real-world constraints (SLSQP).

backtesting finance numpy pandas portfolio-optimization python quant scipy yfinance

Last synced: 12 Apr 2026

https://github.com/subhamsarangi/pdnp

Data Analysis using Python

matplotlib numpy pandas python3

Last synced: 12 Apr 2026

https://github.com/ben854719/wasde-revenue-in-2024

The project will use AI to analyze the revenue from the dataset of the World Agricultural Supply and Demands Estimates.

gemini langchain large-language-models matplotlib numpy pandas-python

Last synced: 12 Apr 2026

https://github.com/purcellcjp/credit-risk-classification

This project utilized Python and scikit-learn libraries to train and evalute a Machinge Learning model based on loan risk.

machine-learning numpy pandas-dataframe python scikit-learn

Last synced: 12 Apr 2026

https://github.com/nidomika/digit-recognition

Digit recognition project in Python

digit-recognition numpy opencv python tensorflow

Last synced: 12 Apr 2026

https://github.com/llwusill/firepredictioncnn

CNN kullanarak Orman Yangını Tahmin/Tespit Sistemi

matplotlib numpy opencv-python pandas sckiit-learn seaborn tensorflow

Last synced: 12 Apr 2026

https://github.com/renukadhule/crop_production_python_analysis

This project analyzes crop production data in India from 1997 to 2015, identifying trends, seasonality, and top-performing states and crops. The insights aim to guide improvements in agricultural productivity and policy decisions.

matplotlib numpy pandas python

Last synced: 12 Apr 2026

https://github.com/samkazan/structural_discovery_of_macromolecules_data_analysis

This research project uses machine learning techniques and neural network to uncover key factors that contribute to successful protein structure discovery using Python and R

classification clustering ipython-notebook jupyter-notebook keras-neural-networks keras-tensorflow machine-learning neural-network numpy python r rmarkdown scikit-learn scipy tensorflow

Last synced: 02 Feb 2026

https://github.com/frolov-andrey2405/prime-numbers-and-polar-coordinate-system

Prime numbers and polar coordinate system

matplotlib numba numpy python

Last synced: 01 May 2026

https://github.com/dan-niles/neural-networks-from-scratch

A neural network implemented using only NumPy and Pandas for CS3631: Deep Neural Networks

neural-networks numpy pandas

Last synced: 05 May 2026

https://github.com/testetson22/abelian_sandpiles

Simple App that implements, builds and tests an approximation of the Abelian sandpile model

github-actions numpy pytest

Last synced: 12 Apr 2026

https://github.com/chandkund/iris-classification

Iris Classification project using machine learning to classify iris flowers into Setosa, Versicolour, and Virginica species. Includes data exploration, model training with SVM, Logistic Regression, and Decision Tree, and performance evaluation. Contributions welcome! Python, Pandas, Scikit-learn, Seaborn, Matplotlib.

machine-learning matplotlib-pyplot numpy pandas python sklearn

Last synced: 21 Jan 2026

https://github.com/nambukeerthi/youtube_dataproject

Build a Streamlit App – A user-friendly interface. Use Google API – Fetch YouTube channel data. Store in SQL Database – Save channel details for easy access. Search & Join Tables – Allow users to query and view data

api numpy pandas python sql streamlit

Last synced: 09 Apr 2026

https://github.com/yosrak5/predictive_maintenance

End-To-End Predictive Analytics to predict Hardware Maintenance Before it occurs using data preprocessing and Machine Learning classification modeling (XGBOOST , Random Forest )

data-factory data-perspective machine-learning-workbench matplotlib numpy pandas python random-forest seaborn xgboost

Last synced: 25 Feb 2026

https://github.com/daksh777/expressando

Expressando is a rock, paper, scissors game made using Python (TensorFlow, Keras and OpenCV)

keras keras-tensorflow machine-learning numpy opencv opencv-python tensorflow

Last synced: 13 Apr 2026

https://github.com/murugavl/loan_approval_prediction

This project is a Loan Approval Prediction System that uses Machine Learning to determine whether a loan application should be approved or rejected based on various factors. It is deployed using Streamlit for an interactive user experience.

jupyter-notebook machine-learning numpy pandas python random-forest-classifier scikit-learn stremlit

Last synced: 13 Apr 2026

https://github.com/pedasoft-consult/house-price-prediction

This project is a machine learning-powered web application for predicting house prices based on various features such as area, bedrooms, bathrooms, and more. It uses Flask for the backend and HTML/CSS/JavaScript for the frontend.

flask joblib matplotlib numpy pandas r2 random-forest regression seaborn sklearn

Last synced: 13 Apr 2026

https://github.com/barkerbg001/framefusion

Python script that creates a video from a set of images and adds a random audio clip to each video

glob moviepy numpy opencv pygt5 python python-3 random video video-editor

Last synced: 16 Apr 2026

https://github.com/hansalemaos/cinpy

C/C++ in Python for Dummies

c cpp ctypes numpy python

Last synced: 13 Apr 2026

https://github.com/companyakis/numpy

Python Numpy

data-science numpy python

Last synced: 04 Sep 2025

https://github.com/jim-by/tweet-sentiment-analysis

Sentiment analysis of tweets using TextBlob for labeling and RandomForest for classification.

nltk nltk-tokenizer numpy pandas python random-forest-classifier sklearn textblob-sentiment-analysis

Last synced: 13 Apr 2026

https://github.com/karanyeole/movie-rating-analysis

analyzing movie ratings from various sources to gain insights into audience preferences, trends, and patterns in movie ratings. The project utilizes data analysis and visualization techniques to explore and interpret movie rating data, providing valuable information for movie enthusiasts, filmmakers, and industry professionals.

matplotlib numpy pandas python seaborn

Last synced: 13 Apr 2026

https://github.com/shahiakhilesh1304/fitbitcasestudy

This is a case study based on data retrieved from a Fitbit band, and we are making predictions about human behavior based on their mood.

case-study data-visualization fitbit jupyter-notebook numpy python3

Last synced: 13 Apr 2026

https://github.com/preethiphilip/prosper-loan-data-analysis

This project explores the Prosper loan dataset to uncover trends, borrower behaviors, and factors influencing loan performance. Using Python and data visualization techniques, the analysis examines key variables such as credit scores, loan status, income ranges, and debt-to-income ratios to gain insights into credit risk and lending patterns.

jupyter-notebook matplotlib numpy pandas python3 seaborn

Last synced: 13 Apr 2026

https://github.com/miss-mhv/udacityproject_dataviz

Data analysis of a bicycle rental system. The goal is to understand how trips are made according to age, time of year, day of the week, time of day

jupyter-notebook matplotlib numpy pandas python3 seaborn-plots

Last synced: 13 Apr 2026

https://github.com/sunilsm7/numpy-starter

IPython Notebook(s) demonstrating NumPy functionality.

numpy python

Last synced: 13 Apr 2026

https://github.com/rajsinha7/hand-gesture-recognition

This project serves as a foundation for integrating hand gesture controls into interactive applications like games, presentations, or robotic controls. It features: Real-time hand tracking with MediaPipe Hands and Live visualization of hand landmarks and gesture labels

mediapipe numpy opencv python

Last synced: 13 Apr 2026

https://github.com/alinenog/desenvolve_gb_2022

Formação Desenvolve 2022 do Grupo Boticário na área de dados

data-analysis data-science googlesheet machine-learning numpy pandas python

Last synced: 13 Apr 2026

https://github.com/allanotieno254/pinecone-index-creation-vector-store-database-

This project demonstrates how to create an index in Pinecone, a vector database optimized for similarity search and machine learning applications. The notebook covers setting up a Pinecone client, creating an index, and managing vector data efficiently.

index jupyter-notebook numpy pinecone python

Last synced: 13 Apr 2026

https://github.com/mike014/soundbot

This Repository will contain the prototype of a ChatBot created for the playback of audio, or everything related to audio in the world of video games and not.

chatbot dsp nlp nlp-library nlp-machine-learning numpy python scipy sound-processing

Last synced: 13 Apr 2026

https://github.com/mahdibehoftadeh/polynomial-regression-co2-emissions

A simple machine learning polynomial regression using a large dataset to learn and predict CO2 emission of a car by its built features like engine size and cylinders

machine-learning matplotlib numpy nural-network pandas polynomial-regression python scikit-learn

Last synced: 22 Feb 2026

https://github.com/goessl/adjugate

A package for calculating submatricies, minors, adjugate- and cofactor matrices.

adjugate cofactor determinant inverse linear-algebra matrix minor numpy python submatrix

Last synced: 07 Jan 2026

https://github.com/siddhi-c19/tourism_sid_tournaut

A comprehensive analysis of tourism data of various countries

data-science dataset mathplot mathplotlib numpy plotly statistics streamlit

Last synced: 06 May 2026

https://github.com/mehwishferoz/data-analysis-with-python-zero-to-pandas

This repository contains the Python code and projects I created while learning the Data Analysis with Python: Zero to Pandas course. The course covers essential topics such as data cleaning, analysis, and visualization using powerful Python libraries like Pandas, NumPy, Matplotlib, and Seaborn.

exploratory-data-analysis jovian matplotlib numpy pandas seaborn

Last synced: 10 May 2026

https://github.com/prateekrajsrivastav/financial-transition-classification

This project focuses on classifying financial transactions using machine learning techniques. By leveraging labeled data, the model aims to categorize transactions into predefined categories (e.g., "Food," "Transport," "Shopping," etc.).

matplotlib numpy pandas-python scikitlearn-machine-learning seaborn

Last synced: 07 Sep 2025

https://github.com/murugavl/customer_churn_prediction

Customer churn prediction is a crucial task in business analytics, helping companies identify customers who are likely to leave and take preventive actions. This project utilizes Machine Learning (ML) techniques to analyze customer behavior and predict churn.

jupyter-notebook machine-learning matplotlib numpy pandas pickle python scikit-learn streamlit

Last synced: 13 Apr 2026

https://github.com/ljadhav25/data-engineering-poc

This repository contains a beginner-level Data Engineering Proof of Concept (POC) project designed for practice. The objective is to provide hands-on experience with data engineering concepts, including data extraction, transformation, loading (ETL), and basic data analysis. This project is ideal for those looking to build foundational skills in da

data-analysis etl matplotlib numpy pandas python

Last synced: 13 Apr 2026

https://github.com/paulo-santos-ds/guia_de_precos_de_veiculos_com_machine_learn

Sistema de predição de preços de carros usados desenvolvido para a Empresa Rusty Bargain

catboost lgbm numpy pandas pyplot python seaborn sklearn time

Last synced: 13 Apr 2026

https://github.com/lexiortiz/ibm-data-engineering-fundamentals

Notes, exercises, and projects from the IBM Data Engineering Fundamentals path via Verizon Skill Forward.

data-engineering numpy pandas postegresql python sql

Last synced: 13 Apr 2026

https://github.com/oleksiilatypov/titanic_ml

Titanic Survival Prediction using ML and Flask Deployment.

data-science docker flask machine-learning numpy pandas skit-learn

Last synced: 13 Apr 2026

https://github.com/ksprugevics/magic-square

Some simple algorithm implementations for solving (ordinary) magic squares using NumPy

algorithms mathematics numpy python

Last synced: 21 Apr 2026