An open API service indexing awesome lists of open source software.

NumPy

NumPy is an open source library for the Python programming language, adding support for large, multidimensional arrays, and matrices, along with a large collection of high-level mathematical functions to operate on these arrays.

https://github.com/anarya22/accenture-north-america-data-analytics-and-visualization-job-simulation-on-forage

Completed a simulation focused on advising a hypothetical social media client as a Data Analyst at Accenture. Cleaned, modelled and analyzed 7 datasets to uncover insights into content trends to inform strategic decisions. Prepared a PowerPoint deck and video presentation to communicate key insights for the client and internal stakeholders.

analyzing-visualization data-cleaning data-visualization numpy pandas powerbi powerpoint-presentations

Last synced: 09 May 2026

https://github.com/badranalyst/titanic-survival-prediction-full-data-science-project-classification

This project predicts Titanic survivors using classification models. It includes data cleaning, pre-processing, exploratory data analysis (EDA), categorical feature conversion, model building, and evaluation. Python libraries like Pandas, NumPy, Matplotlib, and Seaborn are used to analyze and predict survival outcomes.

classification data-analysis data-science eda exploratory-data-analysis machine-learning matplo matplotlib-pyplot ml model numpy pandas predictive-modeling python seaborn

Last synced: 06 May 2026

https://github.com/cheese23456/ai-based_stock_analysis_and_portfolio_optimisation

📈 Analyze stocks and optimize portfolios with AI-driven insights, technical indicators, and clear investment signals for informed decision-making.

ai asset-allocation financial-analysis huggingface-spaces numpy pandas portfolio-optimisation python scipy-optimize streamlit

Last synced: 04 May 2026

https://github.com/eesunmoon/aml_project

[Project] Analyzing Fitness Patterns & Building Predictive Models

matplotlib numpy pandas python3 recommender-system scikit-learn seaborn tensorflow

Last synced: 13 Apr 2026

https://github.com/pinedah/sleep-data-analysis-exercise

Análisis de un dataset médico sobre el sueño, explorando duración, calidad y factores relacionados. Incluye limpieza de datos, EDA y visualizaciones con Python (pandas, numpy, matplotlib, seaborn, scipy).

data-analysis data-science escom numpy pandas python school-project scipy

Last synced: 13 Apr 2026

https://github.com/hariprasath-v/machinehack_analytics_olympiad_2023

Create a machine learning model to determine the likelihood of a customer defaulting on a loan based on credit history, payment behavior, and account details.

binaryclassification catboost exploratory-data-analysis machine-learning numpy pandas python scikit-learn shap

Last synced: 08 Apr 2026

https://github.com/tharushiimasha/python-practice

This repository contains Python programs and exercises from practice sessions. It covers Python basics, file handling, and libraries like NumPy and pandas for data manipulation.

api-integration csv-files file-handling json-files jupyter-notebook numpy oops-in-python pandas python web-scraping

Last synced: 13 Apr 2026

https://github.com/darshanpakhale250-gif/customer-churn-prediction-ml

A machine learning project to predict customer churn using regression and classification models including logistic regression, decision tree, and random forest. Performed EDA, visualizations, and model evaluation. The dataset is taken from Kaggle and implemented in Google Colab.

colab-notebook customer-churn-analysis data-science decision-trees kaggle logistic-regression machine ml numpy pandas python random-forest

Last synced: 13 Apr 2026

https://github.com/niteshchawla/logistics-nn-regression

The case study is about India's Largest Marketplace for Intra-City Logistics. This dataset has the required data to train a regression model that will do the delivery time estimation, based on all those features.

adam-optimizer data-visualization encoding exploratory-data-analysis feature-engineering hidden-layers hyperparameter-tuning keras-tensorflow kerastuner metrics neural-network numpy pandas regression relu scaling sequential-models

Last synced: 10 Apr 2026

https://github.com/venkat-0706/accenture-hackathon

Developing an e-commerce recommendation system involves utilizing technologies such as Python for programming, Pandas for data manipulation, SQL for database management, FastAPI for building APIs, PostgreSQL for data storage, and Docker for containerization.

alembic api docker fastapi machinelearningalgorithms matplotlib numpy postgresql pydantic python3 scipy seaborn sqlmodel

Last synced: 13 Apr 2026

https://github.com/jaweria-b/eda-basketball

The Streamlit app analyzes NBA player stats with user-selected filters, offering data download and intercorrelation heatmap.

matplotlib numpy python streamlit

Last synced: 10 Apr 2026

https://github.com/anujdutt9/language-model-using-rnn

Text Generation using basic RNN from scratch using Python and Numpy.

language-model numpy python rnn text-generation

Last synced: 08 May 2026

https://github.com/somyaaaaaa2004/travelling-sales-man-problem

This project is a soft computing-based solution to the Traveling Salesman Problem (TSP) using a Genetic Algorithm (GA), implemented entirely from scratch in Python on Google Colab. The goal of the TSP is to find the shortest possible route that visits each city exactly once and returns to the starting point. It's a classic NP-hard combinatorial

genetic-algorithm jupyter-notebook matplotlib-pyplot numpy pandas python travelling-salesman-problem

Last synced: 06 May 2026

https://github.com/harmanveer-2546/eda-on-indian-railways

Indian Railways is a statutory body under the ownership of the Ministry of Railways of the Government of India that operates India's national railway system. As of 2023, it manages the fourth largest national railway system by size with a track length of 132,310 km, running track length of 106,493 km and route length of 68,584 km.

clean-data eda exploratory-data-analysis geometry geopandas indian-railways json linestring matplotlib numpy os pandas plotly python railway seaborn shapely train visualization

Last synced: 09 Apr 2026

https://github.com/harmanveer-2546/tweets-cleaning-with-python

Twitter is one of the most used data sources for data analysis. The reason is that it’s open and free to collect unless you subscribe to the paid version one. Besides, it’s pretty simple to collect data from it.

filtering function-calling nlptk numpy os pandas punctuation python re removing-links tokenization tweet-cleaning twitter-data-analysis twitter-sentiment-analysis

Last synced: 10 Apr 2026

https://github.com/babagata/racunalna_fizika

Math and physics solved with python

matplotlib numpy random scipy sympy

Last synced: 10 Apr 2026

https://github.com/semihbugrasezer/rockvsmine

Rock vs Mine Prediction with Python | Machine Learning Project

numpy pandas python

Last synced: 05 May 2026

https://github.com/amr-yasser226/pagerank-power-method

Modular Python implementation of the PageRank algorithm via the power method, with discrete‑math foundations, end‑to‑end code, and experimental validation.

discrete-mathematics graph-algorithms markov-chain network-analysis networks numpy pagerank power-method python scipy

Last synced: 09 May 2026

https://github.com/jonad/titanic_exploratory_analysis

Exploratory Data Analysis on Titanic Dataset

jupyter-notebook numpy pandas python3

Last synced: 04 May 2026

https://github.com/charanpool/image-patch-mapping

A Python toolkit for image processing — edge detection, template matching, and patch detection using OpenCV

canny-edge-detector computer-vision edge-detection image-processing numpy opencv python sobel-filter template-matching

Last synced: 18 Jan 2026

https://github.com/anubhavkumar31/simple-heart_disease_prediction-using-logisticregression

Its a simple yet good model which predicts if a person have heart disease or not. This is a binary classification model i.e its output is either 0(dont have heart disease) or 1 (have heart disease).

logistic-regression machine-learning numpy python sklearn sklearn-linear-model sklearn-metrics

Last synced: 10 Apr 2026

https://github.com/abideen-olawuwo/nyc-taxi

Prediction the duration of New York Taxi trip

linear-regression matplotlib numpy pandas python seaborn sklearn

Last synced: 11 Apr 2026

https://github.com/nikhilsree5/netflixcasestudy

Global Content Strategy: Leveraging Data Insights to Optimize Netflix's International Growth

eda numpy pandas python visualization

Last synced: 13 Apr 2026

https://github.com/akansharajput280799/covid19-impact-analysis-usa

Data Analysis and Predictive Modeling to study COVID-19 impact across age groups, regions, and seasons in the USA.

classification-algorithm clustering-algorithm data-preprocessing data-visualization descriptive-statistics exploratory-data-analysis matplotlib numpy pandas seaborn

Last synced: 13 Apr 2026

https://github.com/paulo-santos-ds/machinelearninglogisticregression

A fraude de cartão de crédito é uma forma comum de fraude que afeta milhões de pessoas em todo o mundo. Essa fraude pode ter consequências graves, tanto para as vítimas quanto para as instituições financeiras que emitem os cartões de crédito.

math matplot numpy pandas python3 sklearn

Last synced: 13 Apr 2026

https://github.com/naveen88112/clustering_customer_invoice_data

Customer Invoice Data Clustering This project uses clustering methods on customer invoice data for segmentation analysis. It preprocesses data, normalizes features, and uses K-Means and DBSCAN to cluster customers according to spending habits and shared locations.

clustering data-preprocessing data-visualization numpy pandas python silhouette-score standardization

Last synced: 13 Apr 2026

https://github.com/hyperplasma/olympic-visualization-analysis

Multidimensional analysis and visualization of Olympic medals, economy, and happiness index.

data-analysis data-visualization matplotlib numpy pandas python wordcloud

Last synced: 04 May 2026

https://github.com/hari7261/playwithdata-python

This is one of the repository where I have put lot of data science and machine learning related questions on their solutions I hope you will find something better than some other platforms. Thank you Happy exploring

data-analysis data-science data-science-learning machienlearning matplotlib matplotlib-python ml numpy numpy-arrays numpy-library pandas pandas-dataframe pandas-library python python-script sklearn

Last synced: 13 Apr 2026

https://github.com/kecnry/nparray

High-Level Wrappers for Building and Manipulating Numpy Arrays

numpy numpy-arrays python

Last synced: 08 Jan 2026

https://github.com/tatmorenno/python_para_data_science

Python para Data Science Introdução à linguagem e Numpy 🌎💻

jupyter-notebook numpy python

Last synced: 13 Apr 2026

https://github.com/subhas-pramanik-09/car-detection-and-counting

This project counts the number of cars passing through a designated line in a video file using OpenCV and background subtraction techniques.

machine-learning numpy object-detection opencv

Last synced: 13 Apr 2026

https://github.com/tnleite/loan-approval-prediction

Este repositório apresenta um modelo preditivo de aprovação de empréstimos, focado em minimizar o risco de inadimplência. Utilizando EDA e algoritmos de machine learning (Random Forest, XGBoost), ajustamos o threshold para maximizar o recall de inadimplentes, contribuindo para uma gestão de riscos eficiente.

classification-algorithm data-science exploratory-data-analysis machine-learning-algorithms machine-learning-models matplotlib numpy scikit-learn scipy seaborn xgboost-classifier

Last synced: 13 Apr 2026

https://github.com/khaymanii/spam_mail_detection_model

This model was built using Python and Logistics Regression algorithm

matplotlib numpy pandas python sckiit-learn

Last synced: 10 Apr 2026

https://github.com/silbena/gibbssampler

Gibbs Sampler for a simple mortgage model.

gibbs-sampler gibbs-sampling numpy python

Last synced: 04 May 2026

https://github.com/mogankumar/virtual-mouse

A Python-based Virtual Mouse that uses hand gestures for cursor control, clicking, scrolling, and taking screenshots. Powered by OpenCV, PyAutoGUI, and a custom hand tracking module for a touch-free experience.

autopy numpy opencv pyautogui python3

Last synced: 10 Apr 2026

https://github.com/chandkund/wine-quality-prediction

This project predicts wine quality based on physicochemical properties using machine learning models. By leveraging Random Forest Classifier, Logistic Regression, and SVM, the goal is to classify wines into quality categories and uncover the key factors that influence wine quality.

logistic-regression matplotlib numpy pandas-python random-forest-classifier svm-classifier

Last synced: 01 May 2026

https://github.com/nordic96/lanedetection

Lane Detection Module using OpenCv and Python for educational purpose.

computer-vision lane-detection matplotlib-pyplot numpy opencv python3

Last synced: 04 May 2026

https://github.com/tromesh/attendance-predictor

Student Attendance predictor for Scheduling lecture sessions

matplotlib numpy pandas tensorflow

Last synced: 05 May 2026

https://github.com/ifte-13/early-stage-brain-stroke-detection

Predictive Analysis & Early Detection of Brain stroke using Machine Learning Algorithm

decision-tree imbalanced-learn knn matplotlib numpy pandas random-forest scikit-learn seaborn

Last synced: 06 Jul 2025

https://github.com/sahil210695/gradient-descent

A simplified explanation of gradient descent for linear regression in python using numpy

gradient-descent gradient-descent-algorithm linear-regression matplotlib mini-batch-gradient-descent numpy python stochastic-gradient-descent

Last synced: 03 May 2026

https://github.com/shivamkumar818/mbti-personality-prediction-from-text-data

MBTI Personality Prediction from Text Data This project leverages machine learning to predict Myers-Briggs Type Indicator (MBTI) personality types based on textual data, specifically from social media posts.

confusion-matrix correlation-matrix data-visualization dataset knn-classification linear-regression logistic-regression modeltraining navebayes numpy pandas python

Last synced: 13 Apr 2026

https://github.com/iv4n-ga6l/GenderDetection

Gender detection using gender classification model

genderclassification genderdetection numpy pil python resnet18 torch torchvision

Last synced: 28 Apr 2025

https://github.com/ashithapallath/comparison-of-ssd-sad-in-motion-estimation-of-video-coding

Course assignment for Algorithm and Massive Datasets comparing SAD and SSD for motion estimation. Includes analysis of accuracy, speed (Python/NumPy), and error sensitivity. Highlights trade-offs for speed-critical vs. precision-focused applications.

l1-normalization l2-norm numpy pytho video-cod

Last synced: 04 Apr 2025

https://github.com/pedasoft-consult/product_purchase_prediction

As a newly hired AI Engineer, your task is to predict customer behavior based on various features such as age, income, and gender. This exercise involves cleaning the data, training a decision tree model, and evaluating the model's performance to understand the key factors influencing customer purchasing decisions.

machine-learning matplotlib numpy pandas python seaborn sklearn

Last synced: 04 May 2026

https://github.com/soumyapro/house-price-prediction

Explored different attributes of Boston housing dataset then a part of dataset was used to train the linear regression algorithm after that trained model was used to give predictions on remaining part of dataset.

linear-regression matlplotlib numpy pandas sklearn svm

Last synced: 13 Apr 2026

https://github.com/eny2930/ai-ml-projects

A project demonstrating various libraries and tools in using algorithms in machine learning. This repository includes code samples, datasets, and detailed explanations.

iris-dataset keras linear-regression logistic-regression matplotlib numpy pandas pytorch sckiit-learn seaborn sentiment-analysis

Last synced: 13 Apr 2026

https://github.com/sorabh-kapoor/face-recognition-attendance-system

The Facial Recognition System is an AI-powered application USING FLASK designed to detect and recognize faces with high accuracy. This system can be integrated into various applications, including security systems, attendance management, and identity verification.

flask flask-application knn ml numpy opencv pandas python scikit-learn

Last synced: 13 Apr 2026

https://github.com/dmarks84/coursework_project_nlp-with-nltk

Project for University of Michigan Applied Data Science Specialization -- Utilized NLTK library to process natural language, and then built several spelling recommenders for a list of misspelled words.

data-modeling databases dataframes eda nlp numpy pandas python reporting statistics text-mining visualization

Last synced: 13 Apr 2026

https://github.com/aneeshmurali-n/project-ml-data-preprocessing

The main objective of this project is to design and implement a robust data preprocessing system that addresses common challenges such as missing values, outliers, inconsistent formatting, and noise. By performing effective data preprocessing, the project aims to enhance the quality, reliability, and usefulness of the data for machine learning.

data-analysis data-cleaning data-encoding data-exploration feature-scaling label-encoding matplotlib minmaxscaler numpy one-hot-encoding outlier-detection pandas standardscaler

Last synced: 02 May 2026

https://github.com/nambukeerthi/copper_project

Data Analysis Skills – Work with real-world data. Machine Learning Experience – Build and train models. Web App Development – Create interactive dashboards

eda machine-learning numpy pandas python streamlit

Last synced: 13 Apr 2026

https://github.com/yesenvidnath/facerecognitionapplight

This is a face recognition application wich is capable of recognizing a person based on thire face and take attendence from it

face-recognition facent flask numpy python

Last synced: 13 Apr 2026

https://github.com/ryan-bendelson/2024-summer-research

This is Python code that I worked with during my summer 2024 research project involving quantum physics.

density-matrices kronecker-product linear-algebra miniconda3 numpy numpy-arrays partial-trace python quantum-information

Last synced: 16 Apr 2026

https://github.com/udacity-machinelearning-internship/support-vector-machine

Implementing SVM's using pandas and sklearn in python

numpy pandas sklearn support-vector-machines svm

Last synced: 13 Apr 2026

https://github.com/swarup-ranjan-mondal/chat-analyzer

A streamlit app to analyze whatsapp personal and group chats.

matplotlib numpy pandas python seaborn streamlit

Last synced: 04 May 2026

https://github.com/munawar-code/car_price_predictor

This project is a ML-based car price prediction system. The model is built using Jupyter Notebook for training and evaluation, while a simple one-page website was developed using Pycharm to provide interface for users to input car details and get price predictions.

datapreprocessing datavisualization exploratory-data-analysis feature-engineering flask-application html-css-javascript linear-regression machine-learning-algorithms matplotlib numpy pandas python scikitlearn-machine-learning

Last synced: 13 Apr 2026

https://github.com/aritrakar/ai-projects

🤖A centralized collection of links to my AI projects roughly divided by topic.

numpy pandas python pytorch tensorflow

Last synced: 13 Apr 2026

https://github.com/sadegh15khedry/housing-prices-prediction-using-randomforest

This repository contains an implementation of random forest model to predict housing prices using the Boston Housing dataset.

csv joblib jupyter-notebook matplotlib numpy pandas pil python random-forest seaborn sklearn

Last synced: 27 Feb 2025

https://github.com/dino-kupinic/pocolocosolver

algorithmic solution with 3D visualization

algorithms numpy panda3d puzzle pytest python solver

Last synced: 19 Jun 2026

https://github.com/jkosla/neural_network_from_scratch_numpy

Neural Network From Scratch in Python | Build a simple neural network from scratch using pure Python and NumPy. Learn about forward propagation, backpropagation, and training with gradient descent. Accompanies my Medium article.

ai aritificial-intelligence medium nerual-networks numpy python3 tutorial

Last synced: 10 Apr 2026

https://github.com/thanhloc81/rfm-model-project

✨ Build a flow to deploy Segmentation evaluation through Python programming.

customer-segmentation numpy pandas python rfm-analysis

Last synced: 03 May 2026

https://github.com/anshpg/linearluminary

Greetings! I've developed a straightforward linear regression model from scratch to predict house prices in Bangalore. But before delving into coding, let me walk you through the algorithm's conceptualization. I considered various factors such as location, ocean proximity, plot size, finished state, and flat type.

algo linea mathematics matplotlib numpy pandas pyth

Last synced: 13 May 2026

https://github.com/cyb3r-jak3/docker-alpine-pandas

Python alpine image with pandas, and numpy installed

alpine-image numpy pandas-python python-alpine

Last synced: 30 May 2026

https://github.com/arrnikhilchauhan/book-recommendation

This Book Recommendation System is designed to provide personalized book suggestions based on user ratings and interaction. By leveraging the power of Pandas and NumPy, it filters and processes data efficiently, ensuring that recommendations are relevant and tailored to the user’s preferences

numpy pandas-library python

Last synced: 15 Apr 2026

https://github.com/seymencanates/pythonn

Here is my all personal educations , notes , basic apps I did. Here will be my backside for all supportive information about python I am gonna need.

django flask numpy pandas python3 scikitlearn-machine-learning tensorflow tkinter

Last synced: 08 Apr 2026

https://github.com/nirmaldeepponnada/codeclauseinternshipproject2

Python, NLTK, Scikit-Learn, Pandas, NumPy, Pickle, SciPy, and JSON are used for text preprocessing, feature engineering, multi-label classification, and model persistence.

nltk numpy pandas pickle python scikit-learn scipy

Last synced: 07 Apr 2026

https://github.com/farhad-here/median-performance-comparison

Benchmarking the performance of median calculation using vanilla Python vs NumPy.

data-analysis matplotlib numpy python

Last synced: 18 Apr 2026

https://github.com/niel0503/probabilidade-e-amostragem

Desafio desarrollado para el bootcamp de WoMakersCode

estadistica matplotlib numpy pandas stats

Last synced: 18 Jan 2026

https://github.com/hansalemaos/cppradixsort

parallel_radixsort from C++ for Python (Windows)

cpp numpy parallel-computing python sort

Last synced: 05 Feb 2026

https://github.com/dmarks84/coursework_project_network-analysis-node-link-prediction

Project for University of Michigan Applied Data Science Specialization -- Analyzed network nodes and edges, developing custom features based on various scoring metrics; used features to train classifier model to predict node attribute (employee salary type) and future edges (employee connections)

classification cross-validation data-reporting databases eda grid-search matplotlib network-analysis numpy pandas python scikit-learn statistics supervised-ml visualization

Last synced: 13 Apr 2026

https://github.com/jol79/python_exercises

Solving interesting python exercises on different topics

matplotlib-pyplot numpy pandas python3 pythonexercises scikit-learn seaborn

Last synced: 10 Apr 2026

https://github.com/shivammg/coursera-ml-numpy

Coursera Machine Learning course done using Python, NumPy, and Seaborn

machine-learning numpy python seaborn

Last synced: 09 May 2026

https://github.com/shaikats/python-assignment

Simulation and Modeling Lab assignments using Python, NumPy, Matplotlib, and Jupyter Notebook.

jupyter-notebook matplotlib numpy python simulation-modeling

Last synced: 09 May 2026

https://github.com/hilalozdemirbuyukasik/deep-learning

A collection of deep learning projects demonstrating RNNs, BiLSTMs, CNNs, and basic neural networks applied to time series forecasting, text sentiment analysis, image classification, and tabular data tasks, with examples of data preprocessing, model training, evaluation, and visualization.

bilstm cnn keras matplotlib nn numpy rnn scikit-learn tensorflow

Last synced: 12 Apr 2026

https://github.com/vijaykumarr1452/startup_success_predictor

This project demonstrates the use of Multiple Linear Regression to predict the profits of startups based on investment in R&D, Administration, and Marketing of dataset (50_Startups.csv)

machine-learning multi-linear-regression numpy pandas python regression rsquare-values scikit-learn

Last synced: 10 Apr 2026

https://github.com/railab/nxscli-np

Nxscli extention with Numpy operations

numpy nxscli

Last synced: 21 Apr 2026

https://github.com/ameykasbe/credit-card-fraud-detection-on-imbalanced-dataset

Examined data preprocessing techniques and performance of six different predictive models in Python to credit card fraud detection problem on an imbalanced dataset. Algorithms implemented - Logistic Regression, K Nearest Neighbours, Support Vector Classification, Naïve Bayes Classifier, Decision Tree Classifier, and Random Forest Classifier.

classification machine-learning matplotlib numpy pandas python scikit-learn seaborn

Last synced: 10 Apr 2026

https://github.com/mmerlyn/analysis-of-tomato-prices

Forecasting tomato prices in Karnataka using machine learning to help farmers make better crop planning and selling decisions.

css flask html matplotlib numpy pandas python scikit-learn seaborn

Last synced: 06 Jul 2025