An open API service indexing awesome lists of open source software.

NumPy

NumPy is an open source library for the Python programming language, adding support for large, multidimensional arrays, and matrices, along with a large collection of high-level mathematical functions to operate on these arrays.

https://github.com/paulo-santos-ds/analise_dados_musicais

Dashboard para compreender as preferências musicais dos ouvintes, tendências regionais e o impacto de artistas específicos.

numpy pandas powerbi python

Last synced: 17 Oct 2025

https://github.com/paulo-santos-ds/guia_de_precos_de_veiculos_com_machine_learn

Sistema de predição de preços de carros usados desenvolvido para a Empresa Rusty Bargain

catboost lgbm numpy pandas pyplot python seaborn sklearn time

Last synced: 17 Oct 2025

https://github.com/sunilvarma-l/liverdiseaseprediction

"Streamlit app to predict liver disease risk using a machine learning model based on patient input data."

machine-learning matplotlib numpy pandas pickle python scikit-learn seaborn streamlit

Last synced: 18 Oct 2025

https://github.com/architj6/gesture-volume-control

Easily control your system volume with simple hand gestures! This project leverages OpenCV, MediaPipe, and Pycaw to track hand movements and adjust the system volume dynamically. Say goodbye to keyboard shortcuts and volume buttons—just move your hand to set the volume!

handgesture mediapipe numpy opencv os pycaw python python3 volume-control

Last synced: 19 Oct 2025

https://github.com/omarsaad21/it-salary-eda

A python EDA project implemented on IT department salaries data we made data exploration and made data visulization for some questions on dataset

data explotary-data-analysis juypter-notebook numpy pandas python visualization

Last synced: 19 Oct 2025

https://github.com/satyamtripathi8/tools_for_data_science

Introduction to Data Science Tools(Python)

data-analysis-python matplotlib-pyplot numpy pandas

Last synced: 19 Oct 2025

https://github.com/gurramankit/censusproject_ml_randomforest

The objective of this project is to build a classification model using the Census Income dataset from the UCI Machine Learning Repository. The model predicts whether an individual's income exceeds $50,000 per year, based on their demographic and employment-related attributes.

matplotlib numpy pandas python random-forest scikit-learn seaborn

Last synced: 20 Oct 2025

https://github.com/sun-data/ndoptimize

Numba-accelerated minimization of objective functions.

astropy minimization numba numpy optimization scientific-computing

Last synced: 27 Jan 2026

https://github.com/filipporanza/rc4

A simple RC4 implementation in Python3. Useful for testing and studing RC4 properties

matplotlib numpy python3 rc4 university-project

Last synced: 22 Oct 2025

https://github.com/muhammadusman-khan/e-commerce-store-eda

Exploratory Data Analysis on E-commerce store data to uncover insights about sales trends, customer behavior, and product performance using Python libraries like Pandas, NumPy, and Matplotlib/Seaborn.

data-analysis data-science data-visualization e-commerce eda exploratory-data-analysis jupyter-notebook matplotlib numpy pandas python seaborn

Last synced: 23 Oct 2025

https://github.com/akansharajput280799/covid19-impact-analysis-usa

Data Analysis and Predictive Modeling to study COVID-19 impact across age groups, regions, and seasons in the USA.

classification-algorithm clustering-algorithm data-preprocessing data-visualization descriptive-statistics exploratory-data-analysis matplotlib numpy pandas seaborn

Last synced: 24 Oct 2025

https://github.com/yahiazakaria445/image-classification-using-vgg-16

computer vision model for 10 classes classification

matplotlib numpy opencv pandas sklearn tensorflow

Last synced: 24 Oct 2025

https://github.com/harmanveer-2546/reducing-data-entries

Way to delete data entries from csv/excel file using. For excel file, use excel instead of csv in the code.

csv data data-entry delete-data excel numpy pandas python

Last synced: 24 Oct 2025

https://github.com/atul-maurya-30/galaxy

Galaxy Classification is a machine learning project focused on classifying galaxies into two subclasses: 'STARFORMING' and 'STARBURST'. This project demonstrates data preprocessing, model training, and evaluation using advanced machine learning techniques and Python libraries.

flask machine-learning matplotlib numpy pandas python regression-classification seaborn sklearn

Last synced: 24 Oct 2025

https://github.com/manuel-lang/numpymongo

A python package to export NumPy data to MongoDB

mongodb numpy wrapper

Last synced: 25 Oct 2025

https://github.com/mayankfreelancer/advanced-sales-analytics-dashboard-power-bi-

This interactive Power BI dashboard provides a comprehensive analysis of sales data across regions, categories, and time periods. The project aims to uncover key trends in total sales, profit, quantity sold, and product performance, using advanced visualizations and forecasting techniques. 🛠 Tools & Techniques Used: Power BI

dashboard data-science data-visualization excel numpy pandas powerbi python sales-analysis sql

Last synced: 25 Oct 2025

https://github.com/pushtogithub23/projectilemotion-simulation

This Jupyter notebook explores projectile motion through mathematics, visualizations, and interactive simulations.

jupyter-notebook matplotlib-pyplot numpy physics-simulation projectile-motion-simulation

Last synced: 28 Jan 2026

https://github.com/mondalbidisha/landmark-recognition

A solution for identifying and recognizing landmarks from images, addressing key challenges and leveraging both algorithmic and human expertise to achieve high accuracy and reliability.

jupyter-notebook kaggle kaggle-datasets numpy pandas python3

Last synced: 26 Oct 2025

https://github.com/shohail-dev/python-sales-analysis

Python analysis project :- Analyze Diwali sales data to improve customer experience and sales .

matplotlib numpy pandas python-lambda python3 seaborn

Last synced: 26 Oct 2025

https://github.com/lorenzorottigni/ml-lending-club

Machine Learning python bootcamp: random forest classifier on LendingClub dataset

ipynb machine-learning numpy pandas python random-forest-classifier scikit-learn seaborn

Last synced: 27 Oct 2025

https://github.com/dmarks84/ind_project_obesity-multi-class-classification--kaggle

Independent Project - Kaggle Competition -- I worked on the obesity classification data set as part of a Kaggle Competition of the same name, scoring (for accuracy) above 0.9

classification correlation-analysis cross-validation data-modeling data-visualization dataframes eda gridsearchcv matplotlib multiclass-classification numpy pandas python seaborn sklearn statistics supervised-ml

Last synced: 28 Oct 2025

https://github.com/shwetapardhi/assignment-03-q5--hypothesis-testing

Chi2 contengency independence test Q5. Fantaloons Sales managers commented that % of males versus females walking in to the store differ based on day of the week. Analyze the data and determine whether there is evidence at 5 % significance level to support this hypothesis. Assume Null Hypothesis as Ho: Independence of categorical variables (% of

alternate-hypothesis chi2-contingency contingency-analysis hypothesis-testing null-hypothesis numpy pandas python scipy significance-testing stats

Last synced: 28 Jan 2026

https://github.com/hansalemaos/npzigloc

Zig for Numpy

numpy python zig

Last synced: 31 Jan 2026

https://github.com/chandkund/sentiment-analysis-on-movie-reviews

Project on Sentiment Analysis of Movie Reviews using K-means clustering. This project clusters movie reviews based on their sentiment scores to understand audience reactions. Includes data preprocessing, feature selection, model development, and visualization.

data-science eda jupyter-notebook machine-learning matplotlib numpy pandas preprocessing python sklearn

Last synced: 01 Feb 2026

https://github.com/bineet-ratna-shakya/data-science-salary-analysis

analyzing a dataset containing salaries of data science professionals from 2020 to 2023.

data-analysis data-science data-visualization jupyter numpy pandas python

Last synced: 01 Feb 2026

https://github.com/zainulmustafa/home-neural-net-gui-using-flask

I am working on a project that I call HNN whose GUI is created using Flask framework to give multiple users the access of controlling components over the same connection.

flask flask-api flask-server html5 numpy python3

Last synced: 12 Sep 2025

https://github.com/rissh/titanicsurvivalpredictionusingml

Predicting Titanic passenger survival through machine learning. This project includes data preprocessing, exploratory data analysis, feature engineering, and model training using Python. 🚢

data data-analysis data-science data-visualization dataanalysis jupiter-notebook machine-learning machine-learning-algorithms machinelearning matplotlib numpy pandas prediction prediction-model python python3 seaborn tenserflow tflearn titanic

Last synced: 01 Feb 2026

https://github.com/pzuehlke/hands-on-numpy

An introduction to NumPy for undergraduate students, focusing on basic linear algebra

linear-algebra numpy numpy-tutorial python scipy

Last synced: 07 May 2025

https://github.com/mgitrov/coce

A deep learning-based project aiming to classify images out of 10 classes.

computer-vision convolutional-neural-networks deep-learning docker fastapi keras matplotlib numpy pillow regularization

Last synced: 30 Dec 2025

https://github.com/rahulsm20/storedata

A data analysis project aimed at analyzing the sales data of the super store and providing useful insight into customer preferences.

data-analysis matplotlib numpy pandas python streamlit

Last synced: 24 Feb 2025

https://github.com/yahiazakaria445/sentiment-analysis-using-lstm

A Sentiment analysis NLP project using LSTM model

matplotlib nltk numpy pandas scikit-learn seaborn tensorflow

Last synced: 30 Dec 2025

https://github.com/ifte-13/digit-classification-with-mnist

MNIST Digit Classification: Comparative Analysis of Optimization Algorithms

matplotlib mnist-dataset numpy tensorflow

Last synced: 27 Oct 2025

https://github.com/jonathanp-code/upland.me-property-price-predictor

I wanted to use a neural network built with TensorFlow and Keras to predict future real estate prices based on property attributes like size, location, and market trends. Users can input details interactively, and the model provides a price prediction along with a 12-month forecast visualization.

matplotlib neural-network nft numpy prediction prediction-model prognostic-models property python sklearn tensorflow

Last synced: 29 Mar 2025

https://github.com/lorenzorottigni/ml-yelp-reviews

Machine Learning python bootcamp: natural language processing on Yelp reviews

machine-learning nlp numpy pandas python scikit-learn seaborn

Last synced: 30 Dec 2025

https://github.com/lordy5/ml-algorithms

Implementing ml algorithms from scratch to learn how they work.

from-scratch machine-learning ml numpy

Last synced: 05 Apr 2025

https://github.com/lmizner/codecademy_auto_evaluation

Summarized categorical variables in Python using numerical summary statistics.

categorical-data frequency jupyter-notebook median numpy pandas proportion python

Last synced: 25 Jun 2025

https://github.com/tigureis/aula_deployment

This project not only demonstrates a machine learning pipeline but also serves as a template for deploying models into production. By leveraging the project's structure and code, you can streamline the process of deploying your own machine learning models.

numpy pandas pickle sklearn

Last synced: 30 Dec 2025

https://github.com/nagipragalathan/python_tutorial_for_data-science

This repository is a comprehensive guide for learning data science using Python. It covers various essential libraries and tools commonly used in the field of data science, including Jupyter Notebook, Matplotlib, NumPy, Pandas, Scikit-learn, and PyTorch.

datascience datavisualization deeplearning jupyter jupyter-notebook learning-by-doing learningresources machinelearning matplotlib numpy opensource pandas python python-script python3 pytorch pytorch-implementation scikitlearn tutorial

Last synced: 30 Dec 2025

https://github.com/tinaland101/credit-risk-classification

The purpose of this project is to build a credit risk classification model using machine learning techniques. This model helps identify the creditworthiness of borrowers based on historical lending data. Specifically, it uses a logistic regression model to predict whether a loan is healthy (0) or high-risk (1).

numpy pandas pathlib scikit-learn

Last synced: 24 Feb 2025

https://github.com/kingsley-ezenwaka/medical-data-visualizer

A data analysis project that investigates a dataset of anonymous patients' medical information, and explores the relationship between cardiac disease, body measurements, blood markers, and lifestyle choices.

analysis data matplotlib numpy pandas seaborn

Last synced: 24 Feb 2025

https://github.com/fatimatuzzahromutmainnah/predictive-maintenance-for-industrial-equipment

This project focuses on Predictive Maintenance for industrial equipment using machine learning. The goal is to predict potential machine failures before they occur, enabling proactive maintenance and reducing downtime.

decision-trees digitalocean docker feature-selection flask logistic-regression matplotlib modelevaluation numpy pandas predictive-maintenance predictive-modeling seaborn sklearn-library

Last synced: 24 Feb 2025

https://github.com/urvee1810/air-quality-prediction-using-arima

Analyzed Beijing's air quality data using time series analysis and ARIMA modeling to forecast PM2.5 pollution levels. Identified seasonal patterns and correlations between pollutants, revealing insights into urban air quality trends and potential contributing factors.

arima-model dickey-fuller-test matplotlib numpy pandas python seaborn statistical-analysis time-series-analysis

Last synced: 24 Feb 2025

https://github.com/albert-2002/robust-penalized-empirical-likelihood-estimation-method-for-linear-regression

Implementation for the paper titled: Robust Penalized Empirical Likelihood Estimation Method for Linear Regression

linear-regression machine-learning numpy pandas python regression regression-analysis statistics

Last synced: 24 Feb 2025

https://github.com/petridhsg/firewall-data-classification

A single implementation of a machine learning algorithm for a firewall data classification task

machine-learning matplotlib numpy python seaborn

Last synced: 16 Nov 2025

https://github.com/wisskirchenj/matrix-processor

Python app to process matrices using numpy

matrix-calculations numpy python3

Last synced: 12 Sep 2025

https://github.com/ugurcan222/a-different-approach--image-enhancement-with-imputation-and-regression-methods

This experimental work presents a different approach to increase the size and quality of an image by adding a blank pixel around each pixel in an image, enlarging the image, breaking it into parts, and generating these blank pixels by predicting them with models.

ai-image-upscaling computer-vision digital-image-processing gradient-boosting image-analysis image-enhancement image-enlargement image-interpolation image-processing imputation knn machine-learning numpy opencv pixel-prediction python randomforest regression-models super-resolution xgboost

Last synced: 17 Jan 2026

https://github.com/mani-prakash-n-r/book_recommendation_system_using_hybrid_technique

The Book Recommendation System uses collaborative and content filtering techniques to suggest books based on user preferences and content similarity.

csv html numpy pandas python sckiit-learn

Last synced: 24 Feb 2025

https://github.com/chdl17/credit-eda-casestudy

Credit EDA is a GitHub repository designed to provide educational materials and tools for exploring credit risk data. The repository contains notebooks, datasets, and code samples to help users learn and apply these concepts to credit data.

exploratory-data-analysis numpy pandas

Last synced: 03 Aug 2025

https://github.com/jonolav95/advent_of_code_2022

Using Python with Pandas and Numpy (www.adventofcode.com).

numpy pandas

Last synced: 05 Apr 2025

https://github.com/aqueeqazam/numpy-for-mathematical-computing

You're at the perfect spot if you're interested in developing your career in data science or machine learning, or if you just enjoy having fun with mathematical operations.

mathematics numpy numpy-arrays

Last synced: 04 Mar 2025

https://github.com/vbhvsingh0/cdc_immunization

This project explores the relationships in between different vaccines and the sex, age and other basic features in the data.

data-cleansing data-manipulation-with-pandas data-science numpy pandas-python python3

Last synced: 04 Mar 2025

https://github.com/vbhvsingh0/deforestation_rainfall_correlation

The aim of this project is to check if there is any correlation of rainfall with deforestation in Pennsylvania state of USA.

correlation-analysis data-science matplotlib-pyplot numpy pandas python3

Last synced: 04 Mar 2025

https://github.com/manojkp08/student-performance-analysis

The Student Performance Analyzer is your go-to solution for understanding and improving student performance. By blending the power of machine learning with interactive visualizations, this tool provides educators and learners with personalized insights into learning styles, performance gaps, and actionable improvements.

machine-learning numpy pandas python requests scikit-learn streamlit

Last synced: 30 Dec 2025

https://github.com/prarthana-singh/potato-disease-classification-using-deep-learning

🌱 Potato Disease Classification using Deep Learning – A deep learning-based model to detect and classify potato plant diseases using image processing and CNNs. Built with TensorFlow/Keras & Python.

agriculture cnn deep-learning image-classification keras machine-learning neural-network numpy pandas plant-disease-detection python3 streaming tensorflow

Last synced: 04 Mar 2025

https://github.com/supreme-gg-gg/beam-analysis

Fast, customisable, intuitive web application to simplify beam analysis. Certified by Bob the Builder.

civil-engineering matplotlib numpy python streamlit structural-analysis

Last synced: 30 Dec 2025

https://github.com/alexsolov28/vkrb

Выпускная квалификационная работа бакалавра «Система анализа алгоритмов машинного обучения для решения задач классификации с использованием Pandas»

jupyter-notebook matplotlib numpy pandas python scikitlearn seaborn streamlit

Last synced: 11 Sep 2025

https://github.com/harmanveer-2546/motor-vehicle-accidents-in-india

As per the report, a total of 4,61,312 road accidents have been reported by States and Union Territories (UTs) during the calendar year 2022, which claimed 1,68,491 lives and caused injuries to 4,43,366 persons.

accidents accidents-analysis darkgrid data-analysis eda exploratory-data-analysis indian-roads inline matplotlib motor-vehicles numpy pandas review seaborn visualization

Last synced: 19 Jan 2026

https://github.com/harmanveer-2546/guide-to-regularization

Regularization is a crucial technique in machine learning that helps to prevent overfitting. Overfitting occurs when a model becomes too complex and learns the training data so well that it fails to generalize to new, unseen data.

generalization inline l1 l2 matplotlib numpy overfitting overfitting-prevention overfitting-reduced pandas regularization regularization-methods regularization-techniques regularization-to-avoid-overfitting seaborn

Last synced: 05 Apr 2025

https://github.com/mariam-zaidi/bike-sharing-prediction-model

Build a multiple linear regression model for the prediction of demand for shared bikes.

linear-regression matplotlib numpy pandas rfe seaborn sklearn

Last synced: 24 Aug 2025

https://github.com/mchenryspagg/economic-variable-correlation-and-productivity-model

This report provides an in-depth analysis of Nigeria's economic performance over the past decade, spanning from 2014 to 2024.

correlation exchangerates inflation moneysupply numpy productivity python regression regression-analysis regression-models visualization

Last synced: 10 Sep 2025

https://github.com/Mike014/SoundBot

This Repository will contain the prototype of a ChatBot created for the playback of audio, or everything related to audio in the world of video games and not.

chatbot dsp nlp nlp-library nlp-machine-learning numpy python scipy sound-processing

Last synced: 12 Sep 2025

https://github.com/ianisdev/lab_ms_ai_programing_for_ai

Lab work for Programming for AI

kafka numpy pandas python

Last synced: 21 Mar 2025

https://github.com/matteopolak/chatterbox

RNN-based chat message generator.

numpy pandas python rnn tensorflow text-generation

Last synced: 12 May 2025

https://github.com/ankit21111/carpredict

This project predicts car prices using machine learning models, including Simple and Multiple Linear Regression. It covers data acquisition, feature selection, and optimization techniques like Ridge Regression. The best model, Multiple Linear Regression, achieved an R² score of 0.84. Check out the full analysis in the repository!

data-analysis data-visualization matplotlib numpy pandas pyhton scipy seaborn sklearn

Last synced: 13 Mar 2025

https://github.com/eesunmoon/genai_cor-recom

[Project] Outfit Coordination Recommender System using KoAlpaca

data-analysis fine-tuning generative-ai huggingface keyword llm numpy pandas python python3 selenium

Last synced: 30 Dec 2025

https://github.com/samaalharbi2/100-days-of-code

This repository is a collection of 100 Python projects built as part of the "100 Days of Code: Python Developer Challenge" course.

data-science game-development machine-learning numpy pandas python udemy

Last synced: 05 Apr 2025

https://github.com/hansalemaos/ffmpeg_stream_to_numpy

ffmpeg stream to numpy arrays

cv2 ffmpeg numpy python stream video

Last synced: 28 Nov 2025

https://github.com/rosacarla/projetodio_covid19

Projeto desenvolve modelos com Python e Machine Learning para prever a evolução do COVID-19 no Brasil

etl machine-learning numpy pandas python

Last synced: 24 Feb 2025

https://github.com/sultanazhari/customer-habit-analysis-model

Megaline company wants to develop a model that can analyze consumer behavior and recommend one of Megaline's two new plans: Smart or Ultra. In this classification task, we need to develop a model that is able to choose the right package

accuracy-score decision-tree-classifier logistic-regression matplotlib-pyplot numpy pandas python3 random-forest-classifier seaborn train-test-using-sklearn

Last synced: 31 Dec 2025

https://github.com/hari7261/playwithdata-python

This is one of the repository where I have put lot of data science and machine learning related questions on their solutions I hope you will find something better than some other platforms. Thank you Happy exploring

data-analysis data-science data-science-learning machienlearning matplotlib matplotlib-python ml numpy numpy-arrays numpy-library pandas pandas-dataframe pandas-library python python-script sklearn

Last synced: 31 Dec 2025

https://github.com/manorohith-03/attendance-system-using-face-recognition

The system leverages the Python programming language and several powerful libraries for image processing and face recognition. The ultimate goal is to provide a more efficient, accurate, and user-friendly attendance tracking solution.

face-recognition numpy opencv pandas python-3

Last synced: 12 May 2025

https://github.com/saurabh9136/data-analysis_using_pandas-numpy

A beginner-friendly repository exploring data analysis using NumPy and Pandas. Covers fundamental operations, data manipulation, and real-world dataset analysis.

data-analysis-python numpy pandas python scipy

Last synced: 05 Apr 2025

https://github.com/yash22222/analysis-of-amcat-aspiring-minds

Analyzing employment outcomes for engineering graduates based on a provided dataset. Specifically, it aims to verify a claim regarding the salary range for fresh graduates in certain engineering roles and investigate potential relationships between gender and specialization preferences.

amcat aspiring-minds charts computer-engineering data-analytics data-vizualisation dataset graph machine-learning matplotlib numpy pandas python seaborn

Last synced: 16 Oct 2025

https://github.com/dvamishkilapa/xolm2_py

Ported Xolm2 VSU

mathplotlib numpy pyqt5

Last synced: 23 Oct 2025

https://github.com/pinedah/sleep-data-analysis-exercise

Análisis de un dataset médico sobre el sueño, explorando duración, calidad y factores relacionados. Incluye limpieza de datos, EDA y visualizaciones con Python (pandas, numpy, matplotlib, seaborn, scipy).

data-analysis data-science escom numpy pandas python school-project scipy

Last synced: 31 Dec 2025

https://github.com/batthulavinay/ev-population

This repository contains a Jupyter Notebook focused on analyzing Electric Vehicle (EV) population data. The notebook includes data visualizations, exploratory analysis, and key insights.

data-science datacleaning datapreprocessing datavisualization jupyter-notebook matplotlib numpy pandas seaborn

Last synced: 13 Apr 2025

https://github.com/yalai92/master_academic_projects

This repository is dedicated to showcasing the academic projects completed during my Master in Data Science & AI. The main objective is to show a collection of projects in various data science fields, including: data cleaning & preprocessing, data analysis, data visualization, machine learning, clustering, among others.

automation clustering data-merging google-colab jupyter-notebook machine-learning matplotlib numpy pandas powerbi python sklearn xgboost

Last synced: 13 Jun 2025

https://github.com/muhkartal/fr-framework

A modular and extensible face recognition framework featuring real-time detection, facial landmark analysis, and recognition with API support.

dlib face-recognition fastapi numpy opencv pillow pydantic python-multipart requests scipy uvicorn

Last synced: 31 Dec 2025