An open API service indexing awesome lists of open source software.

NumPy

NumPy is an open source library for the Python programming language, adding support for large, multidimensional arrays, and matrices, along with a large collection of high-level mathematical functions to operate on these arrays.

https://github.com/vasu7052/recognizing-handwritten-digits

This is a machine learning project created in Python using Neural Networks and Supervised Learning Algorithms.

machine-learning machine-learning-algorithms numpy python scikit-learn

Last synced: 13 Apr 2026

https://github.com/mahdibehoftadeh/polynomial-regression-co2-emissions

A simple machine learning polynomial regression using a large dataset to learn and predict CO2 emission of a car by its built features like engine size and cylinders

machine-learning matplotlib numpy nural-network pandas polynomial-regression python scikit-learn

Last synced: 22 Feb 2026

https://github.com/debjyotisaha/web-application-projects

Web Applications developed with the help Python libraries and ML algorithms

algorithms cufflinks machine-learning matplotlib numpy pandas prediction python scikit-learn seaborn sklearn streamlit

Last synced: 13 Apr 2026

https://github.com/mehwishferoz/data-analysis-with-python-zero-to-pandas

This repository contains the Python code and projects I created while learning the Data Analysis with Python: Zero to Pandas course. The course covers essential topics such as data cleaning, analysis, and visualization using powerful Python libraries like Pandas, NumPy, Matplotlib, and Seaborn.

exploratory-data-analysis jovian matplotlib numpy pandas seaborn

Last synced: 10 May 2026

https://github.com/athul64/tmdb-dataset-analysis

This data set contains information about 10,000 movies extracted from TMDB. The dataset contains movies from 1960 to 2015. Including user ratings and revenue. Original data from Kaggle.

data-visualization dataframe eda numpy pandas python

Last synced: 14 Apr 2026

https://github.com/ljadhav25/data-engineering-poc

This repository contains a beginner-level Data Engineering Proof of Concept (POC) project designed for practice. The objective is to provide hands-on experience with data engineering concepts, including data extraction, transformation, loading (ETL), and basic data analysis. This project is ideal for those looking to build foundational skills in da

data-analysis etl matplotlib numpy pandas python

Last synced: 13 Apr 2026

https://github.com/paulo-santos-ds/guia_de_precos_de_veiculos_com_machine_learn

Sistema de predição de preços de carros usados desenvolvido para a Empresa Rusty Bargain

catboost lgbm numpy pandas pyplot python seaborn sklearn time

Last synced: 13 Apr 2026

https://github.com/lexiortiz/ibm-data-engineering-fundamentals

Notes, exercises, and projects from the IBM Data Engineering Fundamentals path via Verizon Skill Forward.

data-engineering numpy pandas postegresql python sql

Last synced: 13 Apr 2026

https://github.com/samestrin/image-manipulation-api-digitalocean

A Python DigitalOcean App Platform based REST image manipulation API using Flask, NumPy, and OpenCV that runs in a Docker container.

api docker flask image image-processing numpy python python3 rest rest-api

Last synced: 13 Apr 2026

https://github.com/danhenriquex/data_science_and_machine_learning

A.I, Data Science & Machine Learning and Deep Learning with Tensorflow

deep-learning deep-neural-networks machine-learning matplotlib numpy pandas tensorflow

Last synced: 13 Apr 2026

https://github.com/ivancaez/analisis_dades_microbit

Data analysis of Micro:bit with maplotlib, numpy and pandas

csv jupyter-notebook matplotlib microbit numpy pandas python

Last synced: 13 Apr 2026

https://github.com/debjyotisaha/deep-learning-projects-phase-1

Deep Learning Projects which demonstrate the use of Python and its complex ML algorithms

deep-learning machine-learning neural-network numpy opencv pandas python seaborn streamlit

Last synced: 13 Apr 2026

https://github.com/hilarionengarejr/movie-recommender-app

Sentiment analysis on user reviews for movie recommendations using Content Based Filtering.

docker flask nltk numpy pandas python3 scikit-learn selenium

Last synced: 10 Apr 2026

https://github.com/pratyush905/farecast-nyc-taxifare-predictor

Machine learning models to predict nyc taxi fare based on given dataset

jupiter-notebook kaggle machine-learning matplotlib numpy python regression-models scikit-learn

Last synced: 13 Apr 2026

https://github.com/sanjaiveeran44/shopping-intent-analasys

This project predicts whether a visitor to an e-commerce website will make a purchase based on their browsing behavior. It includes data analysis, preprocessing, and machine learning modeling to understand online shopping patterns.

matplotlib matplotlib-pyplot numpy pandas python3 sklearn tensorflow

Last synced: 13 Apr 2026

https://github.com/dmarks84/coursework_project_text-mining-topic-modeling

Project for University of Michigan Applied Data Science Specialization -- Developed functions to score similarity between text passages.

data-modeling data-reporting data-visualization databases eda nlp numpy pandas python statistics text-mining

Last synced: 12 Apr 2026

https://github.com/evan-dg31/data-science

Exploratory Data Analysis (EDA), Predictive Modeling (Supervised and Unsupervised), Regression, Classification, Clustering

classification clustering data-analysis data-science data-visualization machine-learning matplotlib numpy pandas python regression-analysis seaborn

Last synced: 13 Apr 2026

https://github.com/joiceo/python

Projetos e exercícios em Python

eda machine-learning numpy pandas python seaborn sklearn

Last synced: 13 Apr 2026

https://github.com/raghavendranhp/youtube_data_harvesting

The "YouTube Data Analyzer" is a versatile tool for businesses and content creators, enabling them to gather, analyze, and harness valuable insights from multiple YouTube channels. With streamlined data collection, storage in MongoDB, migration to SQL, and a user-friendly Streamlit interface, it empowers users to make data-driven decisions

apiintegration data datacollection eda googleapi googleapiclient matplotlib mongodb mysql mysqlconnector numpy oops pandas pymongo python pythonoops sql sqlalchemy streamlit youtube-api

Last synced: 13 Apr 2026

https://github.com/blairnangle/python3-numpy-ta-lib

Docker image with NumPy and TA-Lib installed. Built and uploaded to Docker Hub using GitHub Actions.

docker numpy python3 ta-lib

Last synced: 13 Apr 2026

https://github.com/debjyotisaha/data-analytics-projects-phase-1

A set of projects on Data Analysis using Python and its libraries

matplotlib numpy pandas python seaborn sklearn streamlit

Last synced: 13 Apr 2026

https://github.com/badranalyst/tips-dataset-analysis-dashboard-with-streamlit-and-plotly

Interactive Streamlit dashboard analyzing the Seaborn 'tips' dataset, which records information on restaurant bills, including total bill amounts, tips, customer demographics (e.g., gender, smoking status), and dining details (e.g., day, time). Visualized with Plotly for insights into tipping patterns.

data-analysis data-analytics data-visualization dataset eda exploratory-data-analysis matplotlib matplotlib-pyplot numpy pandas plotly python seaborn streamlit

Last synced: 13 Apr 2026

https://github.com/annygutierrez/bank-ann

Basic Artificial Neural Network

keras matplotlib numpy pandas sklearn

Last synced: 13 Apr 2026

https://github.com/fearless-spider/spideroptic-cli

Spider Optic CLI is a command line application.

cement cli cli-app command-line makefile numpy oanda pypi python

Last synced: 01 May 2026

https://github.com/archishmansengupta/dnn

Digit Neural Network is a digit recognition network based on MNIST data set using numpy, pandas and matplotlib

matplotlib mnist neural-network numpy pandas python

Last synced: 13 Apr 2026

https://github.com/harmanveer-2546/eda-on-indian-railways

Indian Railways is a statutory body under the ownership of the Ministry of Railways of the Government of India that operates India's national railway system. As of 2023, it manages the fourth largest national railway system by size with a track length of 132,310 km, running track length of 106,493 km and route length of 68,584 km.

clean-data eda exploratory-data-analysis geometry geopandas indian-railways json linestring matplotlib numpy os pandas plotly python railway seaborn shapely train visualization

Last synced: 09 Apr 2026

https://github.com/lingumd/school_district_analysis

Updated test score data and school district analysis using Python.

averages binning jupyterlab numpy pandas pandas-loc python

Last synced: 13 Apr 2026

https://github.com/csengupta1101/rock-paper-scissor-game

Rock Paper Scissor game built with Python 3. Jupyter notebook used as IDE. Code File , Read Me attached herewith.

game if-else-statements numpy python python3 random

Last synced: 11 May 2026

https://github.com/anjasfedo/data-analysis

Repo to Explore Data Analysis

data-analysis numpy

Last synced: 13 Apr 2026

https://github.com/nikhilsree5/netflixcasestudy

Global Content Strategy: Leveraging Data Insights to Optimize Netflix's International Growth

eda numpy pandas python visualization

Last synced: 13 Apr 2026

https://github.com/alex1iv/asr_ru_numbers

Automatic Speech Recognition (ASR) system for Russian digits

audio-processing librosa numpy speech-recognition tensorflow

Last synced: 13 Apr 2026

https://github.com/akansharajput280799/covid19-impact-analysis-usa

Data Analysis and Predictive Modeling to study COVID-19 impact across age groups, regions, and seasons in the USA.

classification-algorithm clustering-algorithm data-preprocessing data-visualization descriptive-statistics exploratory-data-analysis matplotlib numpy pandas seaborn

Last synced: 13 Apr 2026

https://github.com/shlok-nahar/mnist-cnn-classifier

This repository trains and evaluates three CNN models on MNIST, providing performance comparisons and 5 unique visualizations.

confusion-matrix graph heatmap-visualization json machine matplotlib mnist numpy precision-recall python receiver-operating-characteristic seaborn sklearn tensorflow

Last synced: 13 Apr 2026

https://github.com/paulo-santos-ds/machinelearninglogisticregression

A fraude de cartão de crédito é uma forma comum de fraude que afeta milhões de pessoas em todo o mundo. Essa fraude pode ter consequências graves, tanto para as vítimas quanto para as instituições financeiras que emitem os cartões de crédito.

math matplot numpy pandas python3 sklearn

Last synced: 13 Apr 2026

https://github.com/yuu-eguci/cognitive-services-trial

Try to play with Cognitive Services!! [Cognitive Services] [OpenCV] [Numpy]

cognitive-services dotenv numpy opencv-python pipenv python python3

Last synced: 05 Jul 2025

https://github.com/ondiekelijah/shadowcoach-formtracker

A computer vision-based workout form tracking application that uses AI to monitor exercise form, count repetitions, and provide real-time feedback.

mediapipe-pose numpy opencv-python

Last synced: 09 Apr 2025

https://github.com/bhanuprakashnani/scit-tasks

Tasks for the SCIT Work Shop.

numpy python3

Last synced: 25 Apr 2026

https://github.com/kecnry/nparray

High-Level Wrappers for Building and Manipulating Numpy Arrays

numpy numpy-arrays python

Last synced: 08 Jan 2026

https://github.com/itshyphen/mass-mailing-script

A simple mass mailing script that sends personalized email to multiple emails importing from csv

numpy pandas python smtplib

Last synced: 13 Apr 2026

https://github.com/a-parida12/numericalmethods

Teaching Material for Workshop for Engineering Students

numpy ode pde python3

Last synced: 13 Apr 2026

https://github.com/tnleite/loan-approval-prediction

Este repositório apresenta um modelo preditivo de aprovação de empréstimos, focado em minimizar o risco de inadimplência. Utilizando EDA e algoritmos de machine learning (Random Forest, XGBoost), ajustamos o threshold para maximizar o recall de inadimplentes, contribuindo para uma gestão de riscos eficiente.

classification-algorithm data-science exploratory-data-analysis machine-learning-algorithms machine-learning-models matplotlib numpy scikit-learn scipy seaborn xgboost-classifier

Last synced: 13 Apr 2026

https://github.com/amir0234-afk/housing-price-prediction-dnn

Keras + NumPy baselines for housing price regression

keras numpy regression

Last synced: 13 Apr 2026

https://github.com/raeinlayeghpour/test-bench-for-signal-validation-simulated-3-phase-system

Automated Project - Test Bench for Signal Validation (Simulated 3-Phase System)

numpy pandas pycharm pytest python

Last synced: 13 Apr 2026

https://github.com/chandkund/wine-quality-prediction

This project predicts wine quality based on physicochemical properties using machine learning models. By leveraging Random Forest Classifier, Logistic Regression, and SVM, the goal is to classify wines into quality categories and uncover the key factors that influence wine quality.

logistic-regression matplotlib numpy pandas-python random-forest-classifier svm-classifier

Last synced: 01 May 2026

https://github.com/harmanveer-2546/credit-card-fraud-detection

The Credit Card Fraud Detection Problem includes modeling past credit card transactions with the knowledge of the ones that turned out to be a fraud. This model is then used to identify whether a new transaction is fraudulent or not. Our aim here is to detect 100% of the fraudulent transactions while minimizing the incorrect fraud classifications.

ann catboost-classifier eda lightgbm machine-learning matplotlib neural-network numpy pandas python random-forest seaborn xbg

Last synced: 13 Apr 2026

https://github.com/chmohit91/introduction-to-python

A comprehensive collection of Python programming resources, from basics to advanced concepts. Built for learners, by learners

matplotlib numpy pandas plotly python python-library python3 seaborn

Last synced: 13 Apr 2026

https://github.com/shibbir24/a-data-driven-approach-to-food-security-and-supermarket-accessibility

A Data-Driven Approach to Food Security and Supermarket Accessibility

data-analysis matplotlib numpy pandas python3 seaborn

Last synced: 13 Apr 2026

https://github.com/tromesh/attendance-predictor

Student Attendance predictor for Scheduling lecture sessions

matplotlib numpy pandas tensorflow

Last synced: 05 May 2026

https://github.com/dbriane208/python-for-data-science

Machine Learning and Data Science repository. Love crafting Machine Learning models.

data-analysis data-science data-visualization machine-learning numpy pandas python seaborn

Last synced: 13 Apr 2026

https://github.com/ifte-13/early-stage-brain-stroke-detection

Predictive Analysis & Early Detection of Brain stroke using Machine Learning Algorithm

decision-tree imbalanced-learn knn matplotlib numpy pandas random-forest scikit-learn seaborn

Last synced: 06 Jul 2025

https://github.com/harmanveer2546/recognizing-digits

I used the MNIST dataset for the implementation of a handwritten digit recognition app. To implement this, will be using a special type of deep neural network called Convolutional Neural Networks. In the end, I also build a Graphical user interface(GUI) where you can directly draw the digit and recognize it straight away.

dense flatten gui keras maxpooling mnist numpy pandas pillow python sequential

Last synced: 13 Apr 2026

https://github.com/iv4n-ga6l/Simple-Image-Segmentation

Simple implementation of image segmentation using the Watershed algorithm

imagesegmentation matplotlib numpy opencv python watershed-algorithm

Last synced: 28 Apr 2025

https://github.com/soumyapro/house-price-prediction

Explored different attributes of Boston housing dataset then a part of dataset was used to train the linear regression algorithm after that trained model was used to give predictions on remaining part of dataset.

linear-regression matlplotlib numpy pandas sklearn svm

Last synced: 13 Apr 2026

https://github.com/sorabh-kapoor/face-recognition-attendance-system

The Facial Recognition System is an AI-powered application USING FLASK designed to detect and recognize faces with high accuracy. This system can be integrated into various applications, including security systems, attendance management, and identity verification.

flask flask-application knn ml numpy opencv pandas python scikit-learn

Last synced: 13 Apr 2026

https://github.com/lmtreser/intro-data-science

Trayecto de formación en Ciencias de Datos con Python y Pandas

data-science matplotlib numpy pandas python seaborn

Last synced: 13 Apr 2026

https://github.com/pramodyasahan/spaceship-titanic

This repository features a machine learning model designed to predict whether passengers of a space travel company are likely to be transported. The model employs CatBoostClassifier, a machine learning algorithm known for handling categorical data effectively.

machine-learning numpy pandas python scikit-learn

Last synced: 13 Apr 2026

https://github.com/shwetapardhi/assignment-03-q2--hypothesis-testing

Anova ftest statistics A hospital wants to determine whether there is any difference in the average Turn Around Time (TAT) of reports of the laboratories on their preferred list. They collected a random sample and recorded TAT for reports of 4 laboratories. TAT is defined as sample collected to report dispatch. Analyze the data and determine wheth

annova ftest hypothesis-testing numpy pandas python scipy significance-testing stats

Last synced: 11 Apr 2026

https://github.com/saraasgari99/videoclassificationusingdeeplearning

ML pipeline for biological video analysis using cutting-edge CNN architectures and industry-standard data processing and ML techniques

computer-vision data-processing keras-tensorflow machine-learning numpy sklearn tensorflow

Last synced: 13 Apr 2026

https://github.com/rajan-bhateja/data-analysis-with-python

Cleaning, analysis and visualization on various datasets using Python and its various libraries via Jupyter Notebook

data-cleaning data-science jupyter-notebook matplotlib numpy pandas python seaborn

Last synced: 30 Apr 2026

https://github.com/sadia-khan13/data-preprocessing

Welcome to the Data preprocessing Repository! This repository is dedicated to showcase the comprehensive resources and implementations related to Data Preprocessing using Python and Jupyter Notebook.

artificial-intelligence data-analysis data-mining data-preprocessing data-science jupyter-notebook matplotlib numpy pandas python seaborn-python sklearn

Last synced: 11 Apr 2026

https://github.com/1adityakadam/uber_data_analytics

End to end Google Bigquery + Looker Studio Data Analytics Project Transforming NYC Taxi Data into Actionable Intelligence

bigquery looker-studio mage-ai-pipeline numpy pandas sql

Last synced: 13 Apr 2026

https://github.com/sadegh15khedry/housing-prices-prediction-using-randomforest

This repository contains an implementation of random forest model to predict housing prices using the Boston Housing dataset.

csv joblib jupyter-notebook matplotlib numpy pandas pil python random-forest seaborn sklearn

Last synced: 27 Feb 2025

https://github.com/abhisek-13/whatsapp-chat-analyzer

The WhatsApp Chat Analyzer is a data analysis project that provides insights into WhatsApp chats. It analyzes chat data to show metrics like the number of lines, most used letter, chatting duration, media files shared, most used emojis, and group member activity. The results are displayed on a user-friendly dashboard built with Streamlit.

data-analysis data-mining data-visualization eda machine-learning machine-learning-algorithms matplotlib numpy pandas python seaborn sklearn

Last synced: 13 Apr 2026

https://github.com/cyb3r-jak3/docker-alpine-pandas

Python alpine image with pandas, and numpy installed

alpine-image numpy pandas-python python-alpine

Last synced: 30 May 2026

https://github.com/nicolascoiado/nivel-mar

Este projeto realiza uma análise detalhada do nível médio global do mar (GMSL), utilizando uma base de dados pública que abrange medições históricas. O objetivo é explorar tendências, calcular a taxa média de elevação e visualizar os dados por meio de gráficos.

google-colab jupyter-notebook matplotlib numpy pandas python python3 scikit-learn

Last synced: 11 Mar 2025

https://github.com/arrnikhilchauhan/book-recommendation

This Book Recommendation System is designed to provide personalized book suggestions based on user ratings and interaction. By leveraging the power of Pandas and NumPy, it filters and processes data efficiently, ensuring that recommendations are relevant and tailored to the user’s preferences

numpy pandas-library python

Last synced: 15 Apr 2026

https://github.com/jcm-ai/robust-mlops-elevating-machine-learning-to-production-scale

This project aims to address the challenge faced by the Office of Foreign Labor Certification in managing a high volume of job certification applications for employers seeking to hire foreign workers in the United States. By leveraging ML models, this project seeks to streamline the process of shortlisting visa applicants based on historical data.

aws-ec2 aws-ecr aws-s3 cicd-pipeline data-science docker-image end-to-end exploratory-data-analysis git github-actions html machine-learning-models mlops mlops-project model-deployment mongodb numpy pandas python yaml

Last synced: 09 Apr 2026

https://github.com/jfjlaros/online-array

Online array, pass a function as a NumPy array.

array function numpy online unbounded

Last synced: 13 May 2026

https://github.com/dmarks84/coursework_project_network-analysis-node-link-prediction

Project for University of Michigan Applied Data Science Specialization -- Analyzed network nodes and edges, developing custom features based on various scoring metrics; used features to train classifier model to predict node attribute (employee salary type) and future edges (employee connections)

classification cross-validation data-reporting databases eda grid-search matplotlib network-analysis numpy pandas python scikit-learn statistics supervised-ml visualization

Last synced: 13 Apr 2026

https://github.com/dharma-acha/imageclassification

This project is an interactive Streamlit web application using the VGG-13 model to classify images from the CIFAR-10 dataset. Users can upload images to receive real-time predictions and visual explanations of the model's decisions. The goal is to accurately classify images into one of the ten CIFAR-10 classes: airplanes, automobiles, birds, cats,

colab-notebook matplotlib numpy pandas python3 pytorch scikit-learn seaborn streamlit

Last synced: 13 Apr 2026

https://github.com/vishant007/annadataa

A Website For Farmers To Guide Them Regarding Crop Prouction In Their Native Language

django flask-application google-collab kaggle machine-learning-algorithms numpy pandas python3 scikit-learn

Last synced: 13 Apr 2026

https://github.com/nyx1311/toxicity-detector-using-bilstm

🚀 What we built: An AI-powered Women’s Safety & Well-Being Detector — a web app that flags multiple forms of online abuse in real time and offers tools for emotional recovery. 📊 Under the hood: BiLSTM + Word2Vec embeddings for deep, context-aware detection Trained on 21K+ labeled comments across 7 toxicity categories Built with Python, Tensor

epoch genism gpu keras-tensorflow matplotlib model nlp nlp-machine-learning nlpaug-textual numpy pandas pandas-library python3 streamlit tensorflow word2vec

Last synced: 14 Apr 2026

https://github.com/hilalozdemirbuyukasik/deep-learning

A collection of deep learning projects demonstrating RNNs, BiLSTMs, CNNs, and basic neural networks applied to time series forecasting, text sentiment analysis, image classification, and tabular data tasks, with examples of data preprocessing, model training, evaluation, and visualization.

bilstm cnn keras matplotlib nn numpy rnn scikit-learn tensorflow

Last synced: 12 Apr 2026

https://github.com/lesiaukr/mathematic_for_cs_colab_hw

Master's degree | HW | Mathematics for Computer Science course | Homeworks

goit google-colab home-works latex linear-algebra mathematics-for-computerscience matrices numpy python vectors

Last synced: 13 Apr 2026

https://github.com/ahmedabdalkreem/healty-care

The Complete project using neural network to arrive the best healt know he is stoke or not.

matplotlib neural-network numpy pandas python3 sklearn-library

Last synced: 13 Apr 2026

https://github.com/ediloaz/linear-regression-with-lists-numpy-and-pytorch

This was done like the homework #01 of Machine Learning Course, of Computer Science Master.

arrays-raw jupyter-notebook machine-learning numpy python pytorch

Last synced: 13 Apr 2026

https://github.com/railab/nxscli-np

Nxscli extention with Numpy operations

numpy nxscli

Last synced: 21 Apr 2026

https://github.com/mmerlyn/analysis-of-tomato-prices

Forecasting tomato prices in Karnataka using machine learning to help farmers make better crop planning and selling decisions.

css flask html matplotlib numpy pandas python scikit-learn seaborn

Last synced: 06 Jul 2025