An open API service indexing awesome lists of open source software.

NumPy

NumPy is an open source library for the Python programming language, adding support for large, multidimensional arrays, and matrices, along with a large collection of high-level mathematical functions to operate on these arrays.

https://github.com/debjyotisaha/web-application-projects

Web Applications developed with the help Python libraries and ML algorithms

algorithms cufflinks machine-learning matplotlib numpy pandas prediction python scikit-learn seaborn sklearn streamlit

Last synced: 13 Apr 2026

https://github.com/mehwishferoz/data-analysis-with-python-zero-to-pandas

This repository contains the Python code and projects I created while learning the Data Analysis with Python: Zero to Pandas course. The course covers essential topics such as data cleaning, analysis, and visualization using powerful Python libraries like Pandas, NumPy, Matplotlib, and Seaborn.

exploratory-data-analysis jovian matplotlib numpy pandas seaborn

Last synced: 10 May 2026

https://github.com/athul64/tmdb-dataset-analysis

This data set contains information about 10,000 movies extracted from TMDB. The dataset contains movies from 1960 to 2015. Including user ratings and revenue. Original data from Kaggle.

data-visualization dataframe eda numpy pandas python

Last synced: 14 Apr 2026

https://github.com/ljadhav25/data-engineering-poc

This repository contains a beginner-level Data Engineering Proof of Concept (POC) project designed for practice. The objective is to provide hands-on experience with data engineering concepts, including data extraction, transformation, loading (ETL), and basic data analysis. This project is ideal for those looking to build foundational skills in da

data-analysis etl matplotlib numpy pandas python

Last synced: 13 Apr 2026

https://github.com/lexiortiz/ibm-data-engineering-fundamentals

Notes, exercises, and projects from the IBM Data Engineering Fundamentals path via Verizon Skill Forward.

data-engineering numpy pandas postegresql python sql

Last synced: 13 Apr 2026

https://github.com/danhenriquex/data_science_and_machine_learning

A.I, Data Science & Machine Learning and Deep Learning with Tensorflow

deep-learning deep-neural-networks machine-learning matplotlib numpy pandas tensorflow

Last synced: 13 Apr 2026

https://github.com/ivancaez/analisis_dades_microbit

Data analysis of Micro:bit with maplotlib, numpy and pandas

csv jupyter-notebook matplotlib microbit numpy pandas python

Last synced: 13 Apr 2026

https://github.com/armahdavi/data_analytics_statistics_plotting_pm_airborne_sampling

All codes for the data pipelines processing, statistical modellings, descriptive statistics and plot visualizations from airborne phase of Mahdavi et al. (2021) (Environmental Pollution) Project Miestone: 2018 - 2021

data-science data-visualization machine-learning matplotlib-pyplot numpy pandas python scikit-learn scipy-stats statistics

Last synced: 13 Apr 2026

https://github.com/hilarionengarejr/movie-recommender-app

Sentiment analysis on user reviews for movie recommendations using Content Based Filtering.

docker flask nltk numpy pandas python3 scikit-learn selenium

Last synced: 10 Apr 2026

https://github.com/anarya22/accenture-north-america-data-analytics-and-visualization-job-simulation-on-forage

Completed a simulation focused on advising a hypothetical social media client as a Data Analyst at Accenture. Cleaned, modelled and analyzed 7 datasets to uncover insights into content trends to inform strategic decisions. Prepared a PowerPoint deck and video presentation to communicate key insights for the client and internal stakeholders.

analyzing-visualization data-cleaning data-visualization numpy pandas powerbi powerpoint-presentations

Last synced: 09 May 2026

https://github.com/pratyush905/farecast-nyc-taxifare-predictor

Machine learning models to predict nyc taxi fare based on given dataset

jupiter-notebook kaggle machine-learning matplotlib numpy python regression-models scikit-learn

Last synced: 13 Apr 2026

https://github.com/eesunmoon/aml_project

[Project] Analyzing Fitness Patterns & Building Predictive Models

matplotlib numpy pandas python3 recommender-system scikit-learn seaborn tensorflow

Last synced: 13 Apr 2026

https://github.com/dmarks84/coursework_project_text-mining-topic-modeling

Project for University of Michigan Applied Data Science Specialization -- Developed functions to score similarity between text passages.

data-modeling data-reporting data-visualization databases eda nlp numpy pandas python statistics text-mining

Last synced: 12 Apr 2026

https://github.com/pinedah/sleep-data-analysis-exercise

Análisis de un dataset médico sobre el sueño, explorando duración, calidad y factores relacionados. Incluye limpieza de datos, EDA y visualizaciones con Python (pandas, numpy, matplotlib, seaborn, scipy).

data-analysis data-science escom numpy pandas python school-project scipy

Last synced: 13 Apr 2026

https://github.com/evan-dg31/data-science

Exploratory Data Analysis (EDA), Predictive Modeling (Supervised and Unsupervised), Regression, Classification, Clustering

classification clustering data-analysis data-science data-visualization machine-learning matplotlib numpy pandas python regression-analysis seaborn

Last synced: 13 Apr 2026

https://github.com/tharushiimasha/python-practice

This repository contains Python programs and exercises from practice sessions. It covers Python basics, file handling, and libraries like NumPy and pandas for data manipulation.

api-integration csv-files file-handling json-files jupyter-notebook numpy oops-in-python pandas python web-scraping

Last synced: 13 Apr 2026

https://github.com/darshanpakhale250-gif/customer-churn-prediction-ml

A machine learning project to predict customer churn using regression and classification models including logistic regression, decision tree, and random forest. Performed EDA, visualizations, and model evaluation. The dataset is taken from Kaggle and implemented in Google Colab.

colab-notebook customer-churn-analysis data-science decision-trees kaggle logistic-regression machine ml numpy pandas python random-forest

Last synced: 13 Apr 2026

https://github.com/blairnangle/python3-numpy-ta-lib

Docker image with NumPy and TA-Lib installed. Built and uploaded to Docker Hub using GitHub Actions.

docker numpy python3 ta-lib

Last synced: 13 Apr 2026

https://github.com/venkat-0706/accenture-hackathon

Developing an e-commerce recommendation system involves utilizing technologies such as Python for programming, Pandas for data manipulation, SQL for database management, FastAPI for building APIs, PostgreSQL for data storage, and Docker for containerization.

alembic api docker fastapi machinelearningalgorithms matplotlib numpy postgresql pydantic python3 scipy seaborn sqlmodel

Last synced: 13 Apr 2026

https://github.com/badranalyst/tips-dataset-analysis-dashboard-with-streamlit-and-plotly

Interactive Streamlit dashboard analyzing the Seaborn 'tips' dataset, which records information on restaurant bills, including total bill amounts, tips, customer demographics (e.g., gender, smoking status), and dining details (e.g., day, time). Visualized with Plotly for insights into tipping patterns.

data-analysis data-analytics data-visualization dataset eda exploratory-data-analysis matplotlib matplotlib-pyplot numpy pandas plotly python seaborn streamlit

Last synced: 13 Apr 2026

https://github.com/annygutierrez/bank-ann

Basic Artificial Neural Network

keras matplotlib numpy pandas sklearn

Last synced: 13 Apr 2026

https://github.com/fearless-spider/spideroptic-cli

Spider Optic CLI is a command line application.

cement cli cli-app command-line makefile numpy oanda pypi python

Last synced: 01 May 2026

https://github.com/archishmansengupta/dnn

Digit Neural Network is a digit recognition network based on MNIST data set using numpy, pandas and matplotlib

matplotlib mnist neural-network numpy pandas python

Last synced: 13 Apr 2026

https://github.com/harmanveer-2546/prediction-of-ticket-cancellation

The objective is to develop a model that accurately predicts whether users will cancel their tickets. Each cancellation incurs a fine for the ticket registration site from the passenger company.

datetime evaluation gridsearchcv labelencoder numpy pandas standardscaler stratified-k-fold train-test-split xgboost-model

Last synced: 04 May 2026

https://github.com/hootbu/cyberpunk-filter-desing

This project aims to design filters with a cyberpunk aesthetic by performing low-level image manipulation in Python.

computer-vision image-manipulation matplotlib numpy python

Last synced: 08 May 2026

https://github.com/lingumd/school_district_analysis

Updated test score data and school district analysis using Python.

averages binning jupyterlab numpy pandas pandas-loc python

Last synced: 13 Apr 2026

https://github.com/csengupta1101/rock-paper-scissor-game

Rock Paper Scissor game built with Python 3. Jupyter notebook used as IDE. Code File , Read Me attached herewith.

game if-else-statements numpy python python3 random

Last synced: 11 May 2026

https://github.com/anjasfedo/data-analysis

Repo to Explore Data Analysis

data-analysis numpy

Last synced: 13 Apr 2026

https://github.com/prowilliam/maze_generator

A maze generator is created where you can modify its dimensions and shows a red route where it describes the solution of the maze, and it is exported in png format

numpy pillow python

Last synced: 28 Apr 2026

https://github.com/akansharajput280799/covid19-impact-analysis-usa

Data Analysis and Predictive Modeling to study COVID-19 impact across age groups, regions, and seasons in the USA.

classification-algorithm clustering-algorithm data-preprocessing data-visualization descriptive-statistics exploratory-data-analysis matplotlib numpy pandas seaborn

Last synced: 13 Apr 2026

https://github.com/nur-alam39/course-lab-experiments

The lab works: Code of academic courses-Digital Image Processing, Microprocessor, OOP, Numerical Analysis

cpp lab lex matplotlib numpy oop opencv python

Last synced: 13 Apr 2026

https://github.com/ondiekelijah/shadowcoach-formtracker

A computer vision-based workout form tracking application that uses AI to monitor exercise form, count repetitions, and provide real-time feedback.

mediapipe-pose numpy opencv-python

Last synced: 09 Apr 2025

https://github.com/saifalibaig/covid-19-infection-rate-analysis-using-python

Analysis of Covid-19 Infection rate and the world happiness report to identify if there is any relationship between infection rate and happiness

data-analysis data-visualization jupyter-notebook numpy pandas python3 sns

Last synced: 18 Apr 2026

https://github.com/hari7261/playwithdata-python

This is one of the repository where I have put lot of data science and machine learning related questions on their solutions I hope you will find something better than some other platforms. Thank you Happy exploring

data-analysis data-science data-science-learning machienlearning matplotlib matplotlib-python ml numpy numpy-arrays numpy-library pandas pandas-dataframe pandas-library python python-script sklearn

Last synced: 13 Apr 2026

https://github.com/animesh-chourey/loan-classifier

Trained machine learning algorithms (Logistic Regression, KNN, SVM, Decision Tree) specifically, after performing visualization and pre-preocessing tasks on a loan dataset. Executed the evaluation metrics such as F1-score, Log loss and jaccard-similarity score to assess the algorithms performance.

decision-tree f1-score jaccard-similarity knn logistic-regression logloss matplotlib numpy pandas preprocessing-data svm

Last synced: 06 May 2026

https://github.com/tatmorenno/python_para_data_science

Python para Data Science Introdução à linguagem e Numpy 🌎💻

jupyter-notebook numpy python

Last synced: 13 Apr 2026

https://github.com/a-parida12/numericalmethods

Teaching Material for Workshop for Engineering Students

numpy ode pde python3

Last synced: 13 Apr 2026

https://github.com/bhattbhavesh91/numpy-performance-test

Comparing the performance of Base Python Square root function with a numpy equivalent

list numpy python

Last synced: 20 Apr 2026

https://github.com/tnleite/loan-approval-prediction

Este repositório apresenta um modelo preditivo de aprovação de empréstimos, focado em minimizar o risco de inadimplência. Utilizando EDA e algoritmos de machine learning (Random Forest, XGBoost), ajustamos o threshold para maximizar o recall de inadimplentes, contribuindo para uma gestão de riscos eficiente.

classification-algorithm data-science exploratory-data-analysis machine-learning-algorithms machine-learning-models matplotlib numpy scikit-learn scipy seaborn xgboost-classifier

Last synced: 13 Apr 2026

https://github.com/chandkund/wine-quality-prediction

This project predicts wine quality based on physicochemical properties using machine learning models. By leveraging Random Forest Classifier, Logistic Regression, and SVM, the goal is to classify wines into quality categories and uncover the key factors that influence wine quality.

logistic-regression matplotlib numpy pandas-python random-forest-classifier svm-classifier

Last synced: 01 May 2026

https://github.com/harmanveer-2546/credit-card-fraud-detection

The Credit Card Fraud Detection Problem includes modeling past credit card transactions with the knowledge of the ones that turned out to be a fraud. This model is then used to identify whether a new transaction is fraudulent or not. Our aim here is to detect 100% of the fraudulent transactions while minimizing the incorrect fraud classifications.

ann catboost-classifier eda lightgbm machine-learning matplotlib neural-network numpy pandas python random-forest seaborn xbg

Last synced: 13 Apr 2026

https://github.com/chmohit91/introduction-to-python

A comprehensive collection of Python programming resources, from basics to advanced concepts. Built for learners, by learners

matplotlib numpy pandas plotly python python-library python3 seaborn

Last synced: 13 Apr 2026

https://github.com/n1k1f0rm/ml-predicts

Place where you can find in transparent way how ML algos works

machine-learning ml numpy python

Last synced: 21 Apr 2026

https://github.com/falakjatin/face-recognition

This is my final year project of engineering. It might not work because its not maintained.

haarcascade-frontalface numpy opencv pandas python3

Last synced: 13 Apr 2026

https://github.com/dbriane208/python-for-data-science

Machine Learning and Data Science repository. Love crafting Machine Learning models.

data-analysis data-science data-visualization machine-learning numpy pandas python seaborn

Last synced: 13 Apr 2026

https://github.com/ifte-13/early-stage-brain-stroke-detection

Predictive Analysis & Early Detection of Brain stroke using Machine Learning Algorithm

decision-tree imbalanced-learn knn matplotlib numpy pandas random-forest scikit-learn seaborn

Last synced: 06 Jul 2025

https://github.com/shivamkumar818/mbti-personality-prediction-from-text-data

MBTI Personality Prediction from Text Data This project leverages machine learning to predict Myers-Briggs Type Indicator (MBTI) personality types based on textual data, specifically from social media posts.

confusion-matrix correlation-matrix data-visualization dataset knn-classification linear-regression logistic-regression modeltraining navebayes numpy pandas python

Last synced: 13 Apr 2026

https://github.com/iv4n-ga6l/Simple-Image-Segmentation

Simple implementation of image segmentation using the Watershed algorithm

imagesegmentation matplotlib numpy opencv python watershed-algorithm

Last synced: 28 Apr 2025

https://github.com/iv4n-ga6l/Geometric-ShapesRecognizer-in-image

Geometric shapes recognizer API built in python

api-rest flask numpy opencv python shaperecognitionapi

Last synced: 28 Apr 2025

https://github.com/harmanveer-2546/finding-you-next-perfect-house

In this exploratory data analysis, we compare a dataset which consists of various features about renting of houses available on these renting platforms listed by owners of these houses, and try to derive some constructive conclusions by performing Descriptive statistics of the available features.

adaboostregressor decisiontreeregressor exploratory-data-analysis house-price-prediction inline linearregression matplotlib numpy onehotencoder pandas plotly python randomforestregressor renting seaborn standardscaler xgbregressor

Last synced: 13 Apr 2026

https://github.com/hajaarh/malaria_hematie_cnn

Algorithme de réseaux de neurones convolutionnels

numpy pandas python resnet-50 sklearn-library tensorflow vgg16

Last synced: 13 Apr 2026

https://github.com/soumyapro/house-price-prediction

Explored different attributes of Boston housing dataset then a part of dataset was used to train the linear regression algorithm after that trained model was used to give predictions on remaining part of dataset.

linear-regression matlplotlib numpy pandas sklearn svm

Last synced: 13 Apr 2026

https://github.com/eny2930/ai-ml-projects

A project demonstrating various libraries and tools in using algorithms in machine learning. This repository includes code samples, datasets, and detailed explanations.

iris-dataset keras linear-regression logistic-regression matplotlib numpy pandas pytorch sckiit-learn seaborn sentiment-analysis

Last synced: 13 Apr 2026

https://github.com/sorabh-kapoor/face-recognition-attendance-system

The Facial Recognition System is an AI-powered application USING FLASK designed to detect and recognize faces with high accuracy. This system can be integrated into various applications, including security systems, attendance management, and identity verification.

flask flask-application knn ml numpy opencv pandas python scikit-learn

Last synced: 13 Apr 2026

https://github.com/dmarks84/coursework_project_nlp-with-nltk

Project for University of Michigan Applied Data Science Specialization -- Utilized NLTK library to process natural language, and then built several spelling recommenders for a list of misspelled words.

data-modeling databases dataframes eda nlp numpy pandas python reporting statistics text-mining visualization

Last synced: 13 Apr 2026

https://github.com/nambukeerthi/copper_project

Data Analysis Skills – Work with real-world data. Machine Learning Experience – Build and train models. Web App Development – Create interactive dashboards

eda machine-learning numpy pandas python streamlit

Last synced: 13 Apr 2026

https://github.com/pramodyasahan/spaceship-titanic

This repository features a machine learning model designed to predict whether passengers of a space travel company are likely to be transported. The model employs CatBoostClassifier, a machine learning algorithm known for handling categorical data effectively.

machine-learning numpy pandas python scikit-learn

Last synced: 13 Apr 2026

https://github.com/shwetapardhi/assignment-04-simple-linear-regression-2

Q2) Salary_hike -> Build a prediction model for Salary_hike Build a simple linear regression model by performing EDA and do necessary transformations and select the best model using R or Python. EDA and Data Visualization. Correlation Analysis. Model Building. Model Testing. Model Predictions.

correlation-analysis data-visualization distplot eda feature-engineering model-building model-predictions model-template numpy ols-regression p-value pandas python r-square-values regression-plot seaborn simple-linear-regression smf statsmodels t-score

Last synced: 08 May 2026

https://github.com/saraasgari99/videoclassificationusingdeeplearning

ML pipeline for biological video analysis using cutting-edge CNN architectures and industry-standard data processing and ML techniques

computer-vision data-processing keras-tensorflow machine-learning numpy sklearn tensorflow

Last synced: 13 Apr 2026

https://github.com/udacity-machinelearning-internship/support-vector-machine

Implementing SVM's using pandas and sklearn in python

numpy pandas sklearn support-vector-machines svm

Last synced: 13 Apr 2026

https://github.com/rajan-bhateja/data-analysis-with-python

Cleaning, analysis and visualization on various datasets using Python and its various libraries via Jupyter Notebook

data-cleaning data-science jupyter-notebook matplotlib numpy pandas python seaborn

Last synced: 30 Apr 2026

https://github.com/ot-code/coca-cola-stock-prediction

This repo compares four predictive models—Linear Regression, ARIMA, XGBoost, and LSTM—to forecast Coca‑Cola FEMSA stock closing prices using Python and five years of historical data.

arima csv linear-regression lstm-neural-networks mae matplotlib mse numpy pandas python r2 scikit-learn seaborn tensorflow-keras xgboost

Last synced: 13 Apr 2026

https://github.com/csengupta1101/netflix-rating

The project revolves around Netflix shows and movies around the world. The problem statement that is being tried to address here is that what kind of show to come up with in future times and how well that will fit with the audience

jupyter-notebook matplotlib movies netflix numpy pandas plotly python python3 rating tvseries

Last synced: 13 Apr 2026

https://github.com/sadegh15khedry/housing-prices-prediction-using-randomforest

This repository contains an implementation of random forest model to predict housing prices using the Boston Housing dataset.

csv joblib jupyter-notebook matplotlib numpy pandas pil python random-forest seaborn sklearn

Last synced: 27 Feb 2025

https://github.com/5hraddha/eda-instacart-customers-shopping-habits

In this Exploratory Data Analysis (EDA) project we'll clean up the data and prepare a report that gives insight into the shopping habits of Instacart customers.

data-visualization exploratory-data-analysis instacart matlpotlib numpy pandas

Last synced: 13 Apr 2026

https://github.com/thanhloc81/rfm-model-project

✨ Build a flow to deploy Segmentation evaluation through Python programming.

customer-segmentation numpy pandas python rfm-analysis

Last synced: 03 May 2026

https://github.com/paulo-freitas-junior/ifood-dev-week2023

Repositório sobre a Dev Week iFood sobre uso de Python e Inteligência Artificial para realização de cálculo de NPS ( Net Promoter Score ) e a utilização de IA para análise de sentimentos dos comentários usando tecnologia do ChatGPT-4.

bert chatgpt colab-notebook matplotlib nltk numpy pandas python textblob-sentiment-analysis

Last synced: 13 Apr 2026

https://github.com/cyb3r-jak3/docker-alpine-pandas

Python alpine image with pandas, and numpy installed

alpine-image numpy pandas-python python-alpine

Last synced: 30 May 2026

https://github.com/thinker84/real-time-stock-price-prediction-and-market-analysis-using-machine-learning

Real-time stock price prediction app using LSTM, Streamlit, and historical data (2010–2023). Forecasts next 10 days & visualizes trends.

data-science django lstm machine-learning numpy pandas pandas-datareader scikit-learn stock-market stock-price-prediction stooq streamlit yahoo-finance yahoo-finance-api

Last synced: 13 Jul 2025

https://github.com/arrnikhilchauhan/book-recommendation

This Book Recommendation System is designed to provide personalized book suggestions based on user ratings and interaction. By leveraging the power of Pandas and NumPy, it filters and processes data efficiently, ensuring that recommendations are relevant and tailored to the user’s preferences

numpy pandas-library python

Last synced: 15 Apr 2026

https://github.com/jcm-ai/robust-mlops-elevating-machine-learning-to-production-scale

This project aims to address the challenge faced by the Office of Foreign Labor Certification in managing a high volume of job certification applications for employers seeking to hire foreign workers in the United States. By leveraging ML models, this project seeks to streamline the process of shortlisting visa applicants based on historical data.

aws-ec2 aws-ecr aws-s3 cicd-pipeline data-science docker-image end-to-end exploratory-data-analysis git github-actions html machine-learning-models mlops mlops-project model-deployment mongodb numpy pandas python yaml

Last synced: 09 Apr 2026

https://github.com/jfjlaros/online-array

Online array, pass a function as a NumPy array.

array function numpy online unbounded

Last synced: 13 May 2026

https://github.com/niel0503/probabilidade-e-amostragem

Desafio desarrollado para el bootcamp de WoMakersCode

estadistica matplotlib numpy pandas stats

Last synced: 18 Jan 2026

https://github.com/moonshallow5/solar-system-simulation

A solar system simulation using the Beeman scheme and the Euler method

matplotlib-animation matplotlib-pyplot numpy

Last synced: 13 May 2026

https://github.com/geetu040/fetcher

AI GAME: We have a random maze and 2 agents at random places. You control the Fetcher and find the way through the maze to get ot the target. You can automate the Fetcher to use trained models that help Fetcher fetch the target by itself. The models are based on deep neural networks, implemented by tensorflow and Keras API. The models are so efficiently trained, they find ways through the maze without hitting the walls and have an accuracy of more than 99%

artificial-intelligence css deep-learning html javascript jupyter-notebook machine-learning matplotlib neural-networks numpy python sklearn tensorflow web-design

Last synced: 13 Apr 2026

https://github.com/mastermindromii/car-price-prediction-model

Here is My Regression Project based on Predicting Price of Car using Linear Regression.

linear-regression matplotlib numpy pandas python scikit-learn seaborn

Last synced: 13 Apr 2026

https://github.com/fabriziomusacchio/ann_with_numpy

A how-to for building an artificial neural network from scratch using NumPy

ann artificial-neural-networks neural-network numpy

Last synced: 08 May 2026

https://github.com/vishant007/annadataa

A Website For Farmers To Guide Them Regarding Crop Prouction In Their Native Language

django flask-application google-collab kaggle machine-learning-algorithms numpy pandas python3 scikit-learn

Last synced: 13 Apr 2026

https://github.com/lesiaukr/mathematic_for_cs_colab_hw

Master's degree | HW | Mathematics for Computer Science course | Homeworks

goit google-colab home-works latex linear-algebra mathematics-for-computerscience matrices numpy python vectors

Last synced: 13 Apr 2026

https://github.com/ahmedabdalkreem/healty-care

The Complete project using neural network to arrive the best healt know he is stoke or not.

matplotlib neural-network numpy pandas python3 sklearn-library

Last synced: 13 Apr 2026