An open API service indexing awesome lists of open source software.

NumPy

NumPy is an open source library for the Python programming language, adding support for large, multidimensional arrays, and matrices, along with a large collection of high-level mathematical functions to operate on these arrays.

https://github.com/abhijeet-bhambere/exploratory_data_analysis

A collection of several projects for conducting EDA on datasets from various sources

exploratory-data-analysis matplotlib numpy pandas python

Last synced: 04 May 2026

https://github.com/manuethomas/credit-default-risk-analysis-eda

This repository contains the detailed EDA analysis of Home Credit Group Dataset. The analysis aims to find demographic and financial factors associated with higher or lower default risks, providing actionable insights for risk mitigation and improved lending practices

bivariate-analysis correlation-analysis data-preprocessing exploratory-data-analysis exploratory-data-visualizations matplotlib numpy pandas seaborn univariate-analysis

Last synced: 04 May 2026

https://github.com/youssefyaser/scrape-the-imdb-site-for-the-top-250-movies

Web scraping the top 250 movies in IMDB site.

data-analysis numpy pandas python

Last synced: 04 May 2026

https://github.com/abhishekyadav915/e-commerce-sales-analysis

E-Commerce Sales Analysis is a data analysis project that explores sales data from an e-commerce platform to uncover insights and trends. The analysis includes visualizing sales performance, customer behavior, and product trends to help optimize business strategies and improve customer satisfaction.

matplotlib-pyplot numpy pandas-library plotly python3

Last synced: 04 May 2026

https://github.com/chathumiamarasinghe/nn-training-model

A comprehensive project for training neural networks to solve real-world problems. This repository includes customizable code for building, training, and evaluating neural network architectures using popular deep learning frameworks.

jupyter-notebook matplotlib numpy phyton scikit-learn

Last synced: 04 May 2026

https://github.com/pabs-code/img-cartoonizer-using-opencv

A streamline app using 3 ways to cartoonized an image using OpenCV and Python.

bilateral-filtering color-quantization edge-detection edge-enhancement laplacian-edge-detection numpy opencv python

Last synced: 04 May 2026

https://github.com/kaungkhantkyaw1997/face_recognition

A Computer Vision tool designed to identify and recognize characters in images.

face-recognition flask numpy opencv

Last synced: 05 May 2026

https://github.com/srevenant/data-science-alpine

A docker container for data science, using alpine linux and python3

alpine data numpy pandas python3 science scipy xgboost

Last synced: 05 May 2026

https://github.com/lin826/nanogpt-demo

Training and finetuning local GPTs.

gpt nanogpt numpy pytorch tqdm transformers

Last synced: 05 May 2026

https://github.com/vladd12/machine-learning-practice

Репозиторий для учебной практики и различных курсов, связанных с ИИ

json keras-tensorflow machine-learning numpy pandas pickle tensorflow2 torch xml

Last synced: 05 May 2026

https://github.com/farial-mahmod/nuclear-energy-generation-prediction-logistic-regression

Nuclear Energy Generation Prediction Logistic Regression project is aimed at predicting the nuclear energy generation based on the production (generation) data from 1991 to 2023 by using Logistic Regression.

energy kaggle logistic-regression machine-learning matplotlib nuclear-energy nuclear-power numpy power python3 sklearn

Last synced: 05 May 2026

https://github.com/nabilshadman/multiprocessing-time-series-data-simulation

A simulation using multiprocessing to generate 10,000 time dependent samples from an initial dataset of 20 samples

multiprocessing numpy pandas scipy simulation statistics time-series

Last synced: 05 May 2026

https://github.com/lruizap/tomography-with-algebraic-topology

This project aims to analyze images using algebraic topology. The goal is to extract structural information from images through mathematical concepts such as connected components and loops.

gudhi jupyter-notebook matplotlib numpy opencv python tomography

Last synced: 05 May 2026

https://github.com/cicku/en.650.672

HW of EN.650.672

analytics data-analysis numpy pandas

Last synced: 05 May 2026

https://github.com/chdl17/marvelvsdc

This GitHub repository contains Python and Tableau for analyzing Marvel Vs DC superheros.

excel jupyter-notebook numpy pandas python tableau-public

Last synced: 05 May 2026

https://github.com/anushkundu/crime-pattern-analysis

Analyzing Crime Patterns in Montgomery County, USA: An Inclusive Study Based on NIBRS Data (2016-2022)

data-analysis data-visualization descriptive-statistics matplotlib numpy pandas python seaborn

Last synced: 05 May 2026

https://github.com/theadeelahmed/mall-customers-segmention

This project segments mall customers based on income, age, and shopping score. Using clustering techniques, it identifies key customer groups for targeted marketing campaigns. Tools used: Pandas, Matplotlib, Seaborn, and Scikit-learn

matplotlib-pyplot numpy pandas python3 seaborn sklearn-clustering

Last synced: 05 May 2026

https://github.com/aathira2001/python-basics

Basic Python concepts with examples

functions numpy pandas python variables

Last synced: 05 May 2026

https://github.com/nandinimarepalli/ai_ml_internship_projects

Projects completed during my AI/ML and Data Expert internship, including EDA, machine learning models, and dashboard development using Python, pandas, scikit-learn, and visualization libraries.

matplotlib numpy pandas python scikit-learn seaborn

Last synced: 05 May 2026

https://github.com/amruta33/lead_scoring_case_study

An education company named X Education sells online courses to industry professionals. On any given day, many professionals who are interested in the courses land on their website and browse for courses.

bagging linear-regression numpy pandas python3

Last synced: 05 May 2026

https://github.com/waghraj1699/fraud-detection-project

A fraud detection system with machine learning will be able to detect risk based on your historical data

data-processing decision-tree-classifier machine-learning numpy pandas

Last synced: 05 May 2026

https://github.com/hms75/movie_rating_analysis

A movie rating analysis which identifies trends amongst a dataset of 5000 movies.

data-analysis data-visualization matplotlib-pyplot numpy pandas python

Last synced: 05 May 2026

https://github.com/vikkiezdev/ai-global-index-analysis

This project analyzes the AI readiness of 62 countries using key indicators like government strategy, commercial activity, research, development, and infrastructure. Through data cleaning, EDA, and visualization, it identifies key drivers of AI adoption and competitiveness.

cleaning-data correlation-analysis eda matplotlib numpy pandas python3 seaborn statistical-analysis

Last synced: 06 May 2026

https://github.com/shahriarha/programming

Python problem solving, HR-30 days of code, DSA, NumPy

algorithms-and-data-structures cpp numpy programming python

Last synced: 06 May 2026

https://github.com/ianjure/simple-neural-network

Implementing a simple neural network using only Numpy.

deep-learning neural-network notebook numpy scratch-implementation

Last synced: 06 May 2026

https://github.com/billgewrgoulas/recommendation-systems

Algorithms for joke rating prediction using the joke data-set from Kaggle.

algorithm clustering collaborative-filtering machine-learning numpy pandas recommender-system scikit-learn scypi

Last synced: 06 May 2026

https://github.com/sankaran-s2001/us-traffic-accidents-analysis-python-eda

Exploratory data analysis of US traffic accidents from 2016-2023, analyzing patterns by time, location, weather, and severity using Python data science libraries.

data-analysis data-science data-visualization eda matplolib numpy pandas python

Last synced: 06 May 2026

https://github.com/andrewsy1004/logistic-regression-spam-classifier

This project implements a spam email classifier using Logistic Regression.

numpy pandas scikit-learn

Last synced: 06 May 2026

https://github.com/kunalpisolkar24/gradientdescent

A Python implementation of gradient descent for linear regression, visualized with cost function history, parameter trajectory, and a 3D cost surface.

cost-function gradient-descent numpy visualization

Last synced: 06 May 2026

https://github.com/sopheakchan/dull-razor-algorithm

DullRazor hair removal for dermoscopic skin images.

numpy opencv python

Last synced: 06 May 2026

https://github.com/deanlogan/data-analysis-course

Code created when completing the Data Analysis with Python Course on freecodecamp.org

course data-analysis numpy pandas python python3

Last synced: 06 May 2026

https://github.com/fbi555/fashion-mnist-cnn-tf

🧵 Classify fashion images using a compact CNN model in TensorFlow/Keras, designed for the Fashion-MNIST dataset with easy execution in Google Colab.

adam classification cnn colab-notebook computer-vision convolutional-neural-networks custom-training-loop deep-learning fashion-mnist gans image-classification interactive jupyter-notebook keras lowlevelapi mnist-dataset numpy regularization

Last synced: 06 May 2026

https://github.com/christiansassi/signal-image-and-video-project

Project developed by Pietro Bologna (@bolognapietro) and Christian Sassi for the Signal, Image and Video course.

image-processing mri-segmentation numpy opencv python

Last synced: 06 May 2026

https://github.com/swapnilvishwakarma/twitter_sentiment_dashboard

Using Streamlit and Python, I created interactive data dashboards and used Pandas to manipulate data in data science workflows. I also used Plotly to create interactive graphs.

dashboard machine-learning matplotlib numpy pandas plotly sentiment-analysis streamlit-webapp webapp wordcloud

Last synced: 06 May 2026

https://github.com/prab9166/-web-content-extraction-and-sentiment-analysis-

web content extraction and sentiment analysis on URLs provided in an Excel file. Its divided into classes to manage different stages such as data loading, web scraping, text processing, sentiment scoring, and output generation

beautifulsoup lxml nltk-python numpy pandas python re requests xlsxwriter

Last synced: 06 May 2026

https://github.com/zotho/grav_euler

Euler gravity compute

gravity numpy python

Last synced: 06 May 2026

https://github.com/voorhs/ml-practice

ML algorithms implementation

aimasters lomonosov-msu machine-learning numpy python

Last synced: 07 May 2026

https://github.com/sriyaroy/coding-challenges

50 Machine Learning & Computer Vision challenges ranging from Easy to Hard to sharpen you ML engineering skills.

coding-challenges computer-vision interview-prep machine-learning numpy python pytorch

Last synced: 07 May 2026

https://github.com/suhas-005/jovian-data-analysis-course-assignment

These are my assignments for Data Analysis : Zero to Pandas course by Jovian.ai

data-analysis data-analytics numpy pandas python

Last synced: 07 May 2026

https://github.com/miss-mhv/retail_strategy_and_analytics

The purpose of this work is to analyze data from a supermarket to better understand the types of customers who buy chips and their buying behavior in the area.

jupyter-notebook numpy pandas python3

Last synced: 07 May 2026

https://github.com/matteopolak/chatterbox

RNN-based chat message generator.

numpy pandas python rnn tensorflow text-generation

Last synced: 07 May 2026

https://github.com/ayushmankumar7/siamese-network-with-one-shot-learning

Implementation of this paper - https://www.cs.cmu.edu/~rsalakhu/papers/oneshot1.pdf

numpy tensorflow tf-keras

Last synced: 07 May 2026

https://github.com/mgitrov/kulo-chatbot

A retrieval based chatbot aiming to simulate human-like conversation with the user via text messages on chat.

artificial-neural-networks bag-of-words deep-learning json keras lemmatization natural-language-processing nltk numpy tkinter tokenization

Last synced: 07 May 2026

https://github.com/imskully/python-cv-fault-detection

Fault detection algorithm that determines whether or not a fault exists within a ring using raw computer vision.

ccl computer-vision matplotlib morphology numpy opencv python

Last synced: 07 May 2026

https://github.com/pocketfullofdata/electric-vehicles-market-size-analysis

This project analyzes the growth, adoption trends, and future projections of the electric vehicle (EV) market. Using data analysis and visualization techniques, it examines key factors like sales trends, and consumer adoption to understand the evolving landscape of the EV industry.

analysis data jupyter-notebook matplotlib numpy python seaborn vscode

Last synced: 07 May 2026

https://github.com/kavayk29/audio-classification-using-python-library

This is a audio classification Project using python Libraries such as librosa to make the visual representation of the audio files, and using numpy to make array of data for manipulation and then extraction the features for classification to train and test of CNN model.

librosa matplotlib-pyplot mfcc-features numpy pandas sklearn-library

Last synced: 07 May 2026

https://github.com/jnayak1/naive_bayes_cars

Using Naive Bayes and SGD ML algorithms to predict if a new car will decrease in price

cars machine-learning naive-bayes numpy sklearn

Last synced: 07 May 2026

https://github.com/bettenoch/yolo_real_time_object_detector

The YOLO Real-Time Object Detector leverages the ultralytics package to provide robust object detection capabilities. This project is designed to identify and classify objects in images, video streams, and live webcam feeds with high accuracy and efficiency.

numpy object-detection object-tracking pandas python3 real-time-processing ultralytics yolov8

Last synced: 07 May 2026

https://github.com/r-mahesh45/reccomendation-system

This project involves building a recommender system using cosine similarity to measure the similarity between items or users. It leverages numerical data to provide personalized recommendations. The model focuses on extracting meaningful insights and improving decision-making processes.

numpy pandas-dataframe python3 scikitlearn-machine-learning

Last synced: 07 May 2026

https://github.com/momo790-m/startup-profit-prediction

Machine learning project to predict profits of new startups based on R&D, Admin, Marketing, and State data

assumptions business colab-notebook linear-regression machine-learning matplotlib numpy pandas preprocessing seaborn skit-learn sklearn

Last synced: 08 May 2026

https://github.com/alain-godo/spotify-popularity

A simple analysis and modeling for prediction of the popularity of songs on Spotify

linear-regression machine-learning numpy pandas random-forest sckiit-learn spotify tree-decision

Last synced: 08 May 2026

https://github.com/egrizq/bike_analysis

Exploring the bike sharing system to find the patterns behind user behavior by weather, season, hour, and months.

data-science matplot numpy python seaborn

Last synced: 08 May 2026

https://github.com/nova-land/kalmanfilter-python

Kalman Filter in PyTorch and Numpy

kalman-filter numpy python pytorch

Last synced: 08 May 2026

https://github.com/ditikrushna/identify-customer-segments

In this project, Bertelsmann partners AZ Direct and Arvato Financial Solutions have provided two datasets one with demographic information about the people of Germany, and one with that same information for customers of a mail-order sales company. I have looked at relationships between demographics features, organized the population into clusters, and saw how prevalent customers are in each of the segments obtained.

clustering clustering-algorithm numpy pca

Last synced: 08 May 2026

https://github.com/diegodscamara/whisperlive

🎙️ WhisperLive: Real-time audio transcription powered by OpenAI's Whisper model. Convert live speech to text with high accuracy, supporting multiple languages and real-time processing. Perfect for accessibility, content creation, and live captioning.

ai blackhole numpy python wave whisper

Last synced: 09 May 2026

https://github.com/raghavendranhp/predictive-analytics-in-automotive-manufacturing-testing-time-optimization

Explore machine learning for automotive testing optimization. Predictive analytics to reduce testing time and environmental impact.

elasticnet extratreesregressor fastica model-pipeline numpy pandas pca randomforestregressor svr-regression-prediction xgboost

Last synced: 09 May 2026

https://github.com/sameerjain/ai-snake-game

Implemented a Q-learning agent to play and learn the classic Snake game, leveraging PyTorch for Neural Network. Visualized using Matplotlib, showcasing agent's evolving performance over time. Built UI using Pygame to visualize Snake, including food, and current score.

anaconda conda matplotlib neural-network numpy pygame python3 pytorch q-learning

Last synced: 09 May 2026

https://github.com/rishieeee/spam-email-classifier

A simple machine learning project that classifies emails as spam or ham using TF-IDF and a Multinomial Naive Bayes model. The project covers data cleaning, text preprocessing, feature extraction, model training, and evaluation. A great beginner-friendly introduction to NLP and ML workflows.

multinomial-naive-bayes numpy pandas python sckit-learn tf-idf

Last synced: 09 May 2026

https://github.com/tonoy30/emotion-mask

A pytorch model for add face mask based on detected emotion.

emotion-detection numpy opencv-python python pytorch

Last synced: 09 May 2026

https://github.com/omogbolahan94/nobel-prize-winner-analysis

Data exploration, cleaning, visualization and presentation

markdown matplotlib numpy pandas plotly seaborn

Last synced: 09 May 2026

https://github.com/starlitdreams/lunar-landing

This project implements a DQN agent using PyTorch to solve the LunarLander-v2 environment from OpenAI Gym. The agent learns to control the lunar lander using experience replay and a target network, aiming to maximize rewards by landing smoothly. Uses CUDA for computation.

artificial-intelligence cuda deep-learning gymnasium neural-network neural-networks numpy nvidia-gpu python python3 torch

Last synced: 09 May 2026

https://github.com/ishaj72/analysis_model2

Dataset analysis model using matplotlib and other libraries

machine-learning matplotlib numpy pandas

Last synced: 09 May 2026

https://github.com/venkat-0706/titanic-survival-prediction

A machine learning project predicting Titanic passenger survival using data preprocessing, feature engineering, and model optimization with Logistic Regression, Random Forest, and XGBoost.

classification-report confusion-matrix gridsearchcv matplotlib numpy onehot-encoder pandas sckit-learn seaborn train-test-split xgboost

Last synced: 09 May 2026

https://github.com/nahiyanhkhan/data-processing-and-visualization

Loan Data Processing using Python's numpy and pandas libraries. For data visualization, matplotlib and seaborn are used.

data-analysis-python data-visualization matplotlib numpy pandas seaborn

Last synced: 09 May 2026

https://github.com/abdullah2020/hamoye_stageb

This is my Hamoye Stage B project. The project focuses on Predicting Energy Efficiency of Buildings. It implemented different Machine Learning algorithm technique that are not limited to Linear Regression, LASSO, Ridge etc.

eda lasso-regression linear-regression numpy pandas predictive-modeling regression-models ridge-regression rmse rsquare-values

Last synced: 09 May 2026

https://github.com/michaelzheng67/real_estate_ml

AI experiment developed to run against simulated real estate and investing environment

ai numpy python pytorch

Last synced: 09 May 2026

https://github.com/raduldev/datasciencepython

Interview Task From CSIR-IGIB, Delhi & maps for journal

biopython colab-notebook counter jupyter-notebook matplotlib numpy pandas

Last synced: 09 May 2026

https://github.com/arjunravi26/selenium

This repository contains code from my learning about Selenium for web scraping and automation.

numpy python selenium time undetected-chrome webdriver

Last synced: 09 May 2026

https://github.com/vviveksharma/barcodescanner

The Bar Code scanning Application built using Computer Vision library such as Opencv in python the main objective of the project is to read the barcode that is being shown in the camera and decoding it and telling if the Database or Text file that is attached to it has that permitted decoded number if yes then it shows "Permitted" is green else "Not-Permitted" in red

numpy opencv python3

Last synced: 10 May 2026

https://github.com/rebeccamorolong/market--basket--analysis

Market Basket Analysis using the Apriori algorithm to predict association rules and frequent itemsets from transactional data. The goal was to uncover product combinations that frequently co-occur in transactions, which can be used for recommendations, promotions, and inventory management.

apriori-algorithm numpy pandas seaborn

Last synced: 10 May 2026

https://github.com/jupfu/beck-view-movie

Assemble mp4 video from list of images

multithreading numpy opencv python3 threadpool tqdm

Last synced: 10 May 2026

https://github.com/mahmood-anaam/grid-world-value-iteration

An implementation of the Value Iteration algorithm for solving the Grid World problem. This project provides a function to compute the optimal value function for a grid-based environment where a robot navigates to maximize rewards while avoiding penalties.

dynamic-programming grid-world numpy optimization-problem python value-iteration-algorithm

Last synced: 10 May 2026

https://github.com/niteshchawla/loantap-logisticregression

Given a set of attributes for an Individual, determine if a credit line should be extended to them. If so, what should the repayment terms be in business recommendations?

confusion-matrix encoding feature-engineering featureimportance hyperparameter-tuning logistic-regression machine-learning numpy pandas-python precision-recall roc-auc-curve sklearn-metrics smote-sampling

Last synced: 10 May 2026

https://github.com/andersoncrs/aprendizaje_no_supervisado_kmeans_customers

Este repositorio contiene un análisis de datos de clientes de un centro comercial utilizando técnicas de aprendizaje no supervisado, específicamente K Means y clustering jerárquico. El objetivo del proyecto es segmentar a los clientes en grupos homogéneos para entender mejor sus comportamientos y características.

data-analysis kmeans-clustering matplotlib numpy seaborn visualization

Last synced: 10 May 2026

https://github.com/tnleite/real-estate-opportunities-analysis

Este repositório apresenta uma análise de oportunidades no mercado imobiliário, combinando séries temporais, clusterização e previsões para identificar estados com maior potencial de crescimento e orientar estratégias de expansão eficientes.

catboostregressor cluster-analysis data-science kmeans-clustering lightgbm-regressor machine-learning-algorithms numpy regression-models scikit-learn xgboost-regression

Last synced: 10 May 2026

https://github.com/monarch1108/customerinsights-kmeans

understanding customers using KMeans and RFM(recency, frequency & monetary) analysis

data-analysis data-visualization kmeans-clustering machine-learning matplotlib numpy pandas scikit-learn

Last synced: 11 May 2026

https://github.com/mihirh19/uber-analysis

Uber's 2016 dataset analysis offers insightful information about the company's operations and user behaviour. Patterns and trends can be discovered by looking at variables like trip time, distance travelled, and pickup/drop-off locations. Uber may use this information to detect high-demand locations, increase overall efficiency, and optimise driver

matplotlib numpy opendatasets pandas seaborn

Last synced: 11 May 2026

https://github.com/tfiers/unitlib

📐 Physical units for NumPy arrays \ ⏱ Fast • Simple • High voltage

numpy python scientific-computing unitlib units

Last synced: 13 May 2026