An open API service indexing awesome lists of open source software.

NumPy

NumPy is an open source library for the Python programming language, adding support for large, multidimensional arrays and matrices, along with a large collection of high-level mathematical functions to operate on these arrays.
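As a rough illustration of the description above, here is a minimal sketch of array creation, element-wise math, and a couple of high-level operations. The variable names are illustrative only and are not taken from any of the listed repositories.

```python
import numpy as np

# A 2-D array (matrix) built from nested Python lists.
a = np.array([[1.0, 2.0], [3.0, 4.0]])

# Element-wise math applies to the whole array without an explicit loop.
squared = a ** 2

# Reductions and linear algebra operate on the array directly.
column_means = a.mean(axis=0)  # mean of each column
product = a @ a                # matrix product of a with itself
```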

https://github.com/akansharajput280799/data-exploration-and-visualization-for-streaming-platform

Data analysis and visualization for a streaming platform, providing insights and recommendations to grow its user base.

data-visualization excel exploratory-data-analysis matplotlib numpy python seaborn

Last synced: 08 May 2026

https://github.com/amiegirl/ford_gobike-tripdata

This data set includes information about individual rides made in a bike-sharing system covering the greater San Francisco Bay area.

jupyter-notebook matplotlib numpy pandas python seaborn

Last synced: 08 May 2026

https://github.com/rohansoni45/ipl-win-predictor

A machine learning model that predicts the outcome of IPL matches based on historical data and player performance. Built using Python and libraries like Pandas, NumPy, and scikit-learn. This project demonstrates the use of classification algorithms to predict match outcomes.

classification data-science ipl machine-learning model-evaluation numpy pandas python sklearn sports-analytics

Last synced: 08 May 2026

https://github.com/antonio-f/simple-k-nearest-neighbors

Simple K-Nearest Neighbors using partial sort

101 k-nearest-neighbors knn numpy partial-sort python tutorial

Last synced: 08 May 2026

https://github.com/egrizq/bike_analysis

Exploring the bike sharing system to find the patterns behind user behavior by weather, season, hour, and month.

data-science matplot numpy python seaborn

Last synced: 08 May 2026

https://github.com/moustafamohamed01/data-analysis-ecommerce-purchases

Data analysis: analyzing the Ecommerce Purchases dataset from Kaggle.

jupiter-notebook numpy pandas python

Last synced: 08 May 2026

https://github.com/filnow/tinygimp

Project for an image processing class.

gimp numpy opencv pysimplegui python3

Last synced: 08 May 2026

https://github.com/samaalharbi2/100-days-of-code

This repository is a collection of 100 Python projects built as part of the "100 Days of Code: Python Developer Challenge" course.

data-science game-development machine-learning numpy pandas python udemy

Last synced: 08 May 2026

https://github.com/ishaan28malik/masters-admission-analyser

Admissions of Indian students for foreign universities.

admission classification machine-learning matplotlib numpy python students

Last synced: 08 May 2026

https://github.com/daniel-brai/alx-t-x-udacity-data-analytics-nanodegree

A 3-month nanodegree going through the intricacies of data analysis with Python and its various libraries, such as pandas, NumPy, Matplotlib, seaborn, and others.

matplotlib numpy pandas python3

Last synced: 08 May 2026

https://github.com/mzayles/analise-descritiva.py

Course: Programming in Python for Data Science | Descriptive analysis and measurement of data from a consolidated report using matplotlib.

matplotlib numpy pandas python

Last synced: 08 May 2026

https://github.com/saurabh274/aerofit-statistics-and-probability

The market research team at AeroFit wants to identify the characteristics of the target audience for each type of treadmill offered by the company, to provide a better recommendation of the treadmills to the new customers. The team decides to investigate whether there are differences across the product with respect to customer characteristics.

matplotlib-pyplot numpy pandas probability python seaborn statistics

Last synced: 08 May 2026

https://github.com/nova-land/kalmanfilter-python

Kalman Filter in PyTorch and Numpy

kalman-filter numpy python pytorch

Last synced: 08 May 2026

https://github.com/ditikrushna/identify-customer-segments

In this project, Bertelsmann partners AZ Direct and Arvato Financial Solutions have provided two datasets: one with demographic information about the people of Germany, and one with that same information for customers of a mail-order sales company. I looked at relationships between demographic features, organized the population into clusters, and examined how prevalent customers are in each of the resulting segments.

clustering clustering-algorithm numpy pca

Last synced: 08 May 2026

https://github.com/shaulamarquez/data-job-market

This is a Data Job Market analysis

analysis matplotlib numpy python seaborn

Last synced: 09 May 2026

https://github.com/shingiraibhengesa/house-price-predictor

A machine learning project that predicts house prices based on user input features such as square footage, number of bedrooms, and more.

machine-learning-models matplotlib numpy python scikit-learn seaborn

Last synced: 09 May 2026

https://github.com/alejandrolara11/data-preprocessing

Data preprocessing through the use of the libraries NumPy and pandas.

data-analysis data-cleaning data-preprocessing numpy pandas python

Last synced: 09 May 2026

https://github.com/jsnevt/series_temporais

Forecasting events that occur seasonally.

datetime matplotlib numpy pandas

Last synced: 09 May 2026

https://github.com/sameerjain/ai-snake-game

Implemented a Q-learning agent that plays and learns the classic Snake game, using PyTorch for the neural network. Training is visualized with Matplotlib, showing the agent's performance as it evolves over time. The UI is built with Pygame and displays the snake, the food, and the current score.

anaconda conda matplotlib neural-network numpy pygame python3 pytorch q-learning

Last synced: 09 May 2026

https://github.com/normyee/chatbot-pytorch_nlp

ChatBot utilizing neural networks, NLP techniques, and the Bag of Words model. Implements tokenization and stemming for efficient language processing.

bag-of-words chatbotai neural-network nlp-machine-learning nltk-python numpy python pytorch-nlp

Last synced: 09 May 2026

https://github.com/nihanthbhargav/time-series-stock-market

This project combines computer vision and NLP by segmenting pet images with a U-Net model and generating captions using CNN-RNN/LSTM. Using the Oxford-IIIT Pets dataset, it demonstrates a unified pipeline that integrates pixel-level segmentation with automatic caption generation for meaningful image understanding.

matplotlib numpy pandas plotly python scikit-learn seaborn

Last synced: 11 Apr 2026

https://github.com/kanika300393/loan_prediction

This project implements a Loan Prediction system using Support Vector Machine (SVM). It includes data preprocessing, visualization of features like income and education, and model evaluation. The goal is to predict loan approval based on the dataset. Clone the repo to explore the code and improve the model.

data-science machine-learning numpy pandas python scikit-learn svm-classifier

Last synced: 09 Apr 2026

https://github.com/manikantasanjay/emotion_prediction_kaggle_cnn

Real-time facial emotion recognition system with a CNN architecture, making use of transfer learning, data augmentation, and regularization techniques.

cnn convolutional-neural-networks dataset deep-learning jupyter-notebook matplotlib numpy pandas python3 pytorch transfer-learning

Last synced: 11 Apr 2026

https://github.com/jcardonamde/food-sales-predictions

A project that seeks to predict the sales of a food distributor by applying machine learning algorithms and data analysis.

data-science machine-learning machine-learning-algorithms numpy pandas pipelines python seaborn sklearn

Last synced: 05 Apr 2026

https://github.com/ramyacp14/sentimentanalysis

Implements a sentiment analysis model to determine the emotional tone behind text, helping understand attitudes, opinions, and emotions in online mentions.

machine-learning natural-language-processing nltk numpy pandas python scikit-learn

Last synced: 07 Apr 2026

https://github.com/jainish-prajapati/solar-flare-prediction

This repository contains code and data for predicting solar flare energy ranges using machine learning, based on NASA's RHESSI mission data. It includes preprocessing of FITS files into a unified CSV dataset and implements models like Gradient Boosting, Random Forest, and Decision Tree classifiers, achieving accuracies up to 87%.

data-visualization machine-learning numpy pandas python scikit-learn solar-flare-prediction

Last synced: 30 Dec 2025

https://github.com/maxbleu/f1analisys

A Python repository for analyzing and visualizing Formula 1 data. Includes tools for race statistics, performance comparisons, and custom visualizations.

data-science f1 f1-analisis f1api fastf1 formula1-analysis formula1-data-analysis matplotlib-python motorsport numpy pandas python seaborn

Last synced: 12 Sep 2025

https://github.com/hellenilda/ml-housing-price-prediction

A simple project for predicting house prices using linear regression.

artificial-intelligence machine-learning matplotlib numpy python sklearn streamlit

Last synced: 05 Apr 2026

https://github.com/philcs/electromag

Electromagnetism, Université Laval, 2017

matplotlib numpy python

Last synced: 18 May 2026

https://github.com/muzahid0408/student_marks_analysis_using_numpy

This project analyzes a student marks dataset to calculate key statistical measures such as mean, median, standard deviation, and more.

datasets jupyter-notebook numpy numpy-arrays pythoh

Last synced: 04 May 2026

https://github.com/abhisek-13/diabetes-prediction

The Diabetes Prediction Web App uses machine learning to predict if someone has diabetes based on their health data. Trained on the Kaggle diabetes dataset, it offers real-time predictions through a user-friendly Streamlit interface.

machine-learning matplotlib numpy pandas python sklearn streamlit

Last synced: 11 Apr 2026

https://github.com/mr-talhailyas/histogram-equalization-and-contrast-stretching

Histogram equalization and contrast stretching without using built-in library functions.

contrast-stretching histogram-equalization image-enhancement image-processing numpy

Last synced: 29 Apr 2026

https://github.com/jhaabhijeet864/numpy_for_machine_learning

A collection of Python notebooks comprising learning materials on NumPy for machine learning.

notebooks-jupyter numpy numpy-arrays numpy-library python

Last synced: 18 May 2026

https://github.com/pzuehlke/hands-on-numpy

An introduction to NumPy for undergraduate students, focusing on basic linear algebra

linear-algebra numpy numpy-tutorial python scipy

Last synced: 18 Apr 2026

https://github.com/ljadhav25/django-data-analyzer

Django Data Analyzer is a web application built using the Django framework, designed to streamline data analysis tasks. Users can upload CSV files containing data for analysis. The application utilizes the powerful data manipulation capabilities of Python libraries like pandas and numpy to perform various analyses on the uploaded data.

data-analysis data-visualization django-application matplotlib numpy pandas python seaborn

Last synced: 01 Mar 2026

https://github.com/akku-1206/spotify_data_analysis

Spotify Tracks Analysis: this project performs an exploratory data analysis (EDA) on Spotify music data to uncover insights and correlations related to song features, popularity, and genre.

matplotlib numpy pandas python seaborn

Last synced: 13 Apr 2026

https://github.com/yadavkaushal/datascience-e-commerce-shopping-details

This project analyzes customer purchase data, including details such as location, company, credit card usage, browser info, job roles, and purchase price. It explores patterns in payment methods, spending behavior, and online transactions. Using Pandas, Matplotlib, and Seaborn, we clean, analyze, and visualize key trends to derive actionable insights.

data datacleaning dataframe datapreprocessing dataset libraries matplotlib numpy pandas plots visulaization

Last synced: 06 May 2026

https://github.com/tapas-gope/telecommunication-customer-churn

This project involves predicting customer churn in a telecommunications company using machine learning techniques, exploring various features' impact, optimizing models, and identifying key factors influencing churn.

feature-engineering matplotlib-pyplot model-evaluation-and-validation numpy pandas python scikit-learn

Last synced: 12 Sep 2025

https://github.com/samankhamesian/rating-prediction-system

This project is an implementation of simple rating prediction systems for items from user

latent-factor-model machine-learning numpy pattern-recognition python rating-prediction

Last synced: 18 May 2026

https://github.com/chandkund/time-series-forecasting

A time series forecasting project using the Airline Passenger dataset. This project leverages ARIMA modeling to predict future passenger numbers by analyzing historical trends and seasonality from 1949-1960. The results demonstrate effective forecasting for better business planning.

arima-forecasting arima-model data-science numpy pandas python seaborn

Last synced: 07 Apr 2026

https://github.com/khushi130404/outlier_exterminator

Outlier_Exterminator is a Python tool for detecting and treating outliers using IQR, Z-Score, and Percentile methods. It supports trimming, capping, and Winsorization, demonstrated in a Jupyter Notebook.

capping iqr machine-learning numpy percentile-method trimming winsorization z-score

Last synced: 18 May 2026

https://github.com/harishtpj/rawnn

A minimal Neural Network framework written from scratch

framework neural-network numpy python

Last synced: 11 Jun 2026

https://github.com/petridhsg/firewall-data-classification

A single implementation of a machine learning algorithm for a firewall data classification task

machine-learning matplotlib numpy python seaborn

Last synced: 01 May 2026

https://github.com/rohansoni45/movie-recommendation-system

This project is a Content-Based Recommender System that suggests movies to users based on their preferences and watched history. The system leverages cosine similarity to find and recommend movies similar to a selected title. It is built using Python and libraries like Pandas, NumPy, and Scikit-learn.

content-based-filtering cosine-similarity data-analysis data-science machine-learning numpy pandas python recommender-system render scikit-learn

Last synced: 17 Apr 2026

https://github.com/dineshdhamodharan24/singapore_flat_resale_

This project focuses on developing a machine learning model to predict the resale values of apartments in Singapore. The goal is to create a user-friendly online application that enables users to obtain accurate predictions for the resale values of specific properties.

data-analysis flat json numpy pandas pickle project python streamlit

Last synced: 07 Apr 2026

https://github.com/vbhvsingh0/cdc_immunization

This project explores the relationships between different vaccines and sex, age, and other basic features in the data.

data-cleansing data-manipulation-with-pandas data-science numpy pandas-python python3

Last synced: 05 May 2026

https://github.com/vbhvsingh0/fraudulent_transactions

A few models were developed based on Decision trees and Logistic Regression to categorize fraudulent transactions

credit-card-fraud data-science logistic-regression machine-learning numpy numpy-python pandas pandas-python python3 random-forest-classifier

Last synced: 09 May 2026

https://github.com/mituskillologies/aiml-dypiemr-sep24

Programs conducted at DYPIEMR, Pune in training on AIML during September 2024.

artificial-intelligence data-analysis data-science machine-learning matplotlib neural-network numpy pandas python3

Last synced: 05 Apr 2025

https://github.com/chandkund/predicting-heart-disease

Welcome to the Heart Disease Prediction project! 🩺 This project focuses on developing a predictive model to assess heart disease risk based on health indicators like age, cholesterol levels, and blood pressure. By analyzing these features, we aim to create an effective tool for early diagnosis and heart disease prevention

machine-learning matplotlib numpy pandas python seaborn sklearn

Last synced: 18 Jan 2026

https://github.com/hansalemaos/stridesduplicatefinder

Calculate overlapping values between two arrays and return the results as a DataFrame

duplicates fast numexpr numpy strides

Last synced: 07 Mar 2026

https://github.com/orthoin3d/django-ndarrayfield

New Django field to store numpy ndarray.

django django-fields numpy numpy-arrays

Last synced: 07 Mar 2026

https://github.com/hansalemaos/ffmpeg_stream_to_numpy

ffmpeg stream to numpy arrays

cv2 ffmpeg numpy python stream video

Last synced: 29 Apr 2026

https://github.com/hamishivi/denserflow

A basic numpy neural net library. This achieved high marks in a uni assignment!

deep-learning neural-network numpy

Last synced: 18 May 2026

https://github.com/sanjanahombal/study-on-sentiment-analysis

This project explores the optimal combination of Bag-of-Words and TF-IDF vectorization with Naive Bayes and SVM for sentiment analysis. It evaluates performance using accuracy, precision, recall, and F1-score, addressing ethical concerns like data privacy and bias to improve sentiment classification in real-world applications.

bag-of-words confusionmatrix googlecollab gridsearch-crossvalidation matplotlib-pyplot naive-bayes-classifier numpy pandas seaborn sklearn svm-classifier tfidf-vectorizer

Last synced: 07 Jan 2026

https://github.com/dvamishkilapa/xolm2_py

Ported Xolm2 VSU

mathplotlib numpy pyqt5

Last synced: 23 Oct 2025

https://github.com/yalai92/master_academic_projects

This repository is dedicated to showcasing the academic projects completed during my Master in Data Science & AI. The main objective is to show a collection of projects in various data science fields, including: data cleaning & preprocessing, data analysis, data visualization, machine learning, clustering, among others.

automation clustering data-merging google-colab jupyter-notebook machine-learning matplotlib numpy pandas powerbi python sklearn xgboost

Last synced: 13 Jun 2025

https://github.com/mathensley/asimpleneuralnetworklib

A simple library in Python for training datasets using neural network models

deep-learning neural-networks numpy python

Last synced: 19 Apr 2026

https://github.com/blinksta1ker/spam-mail-classification

Supervised ML model using logistic regression to classify an email as ham or spam.

jupyter-notebook logistic-regression machine-learning numpy pandas python sklearn supervised-machine-learning

Last synced: 09 Apr 2026

https://github.com/kaikyMoura/BackCourt-api

Repository for the Basketball Advanced Stats API

fastapi nba-api nba-stats numpy pandas python swagger uvicorn

Last synced: 08 Apr 2026

https://github.com/ashishsingh789/customer_purchase_prediction_using_decision-tree-_classifier

Decision Tree Classifier to predict customer purchases using demographic and behavioral data. Key steps: data preprocessing, EDA, model training, evaluation, and feature importance analysis.

data datascience desiciontree eda machine-learning-algorithms matplotlib numpy pandas-dataframe python seaborn

Last synced: 11 Apr 2026

https://github.com/subratamondal1/machine-learning

Machine Learning Notes with tools like Numpy, Pandas, Scikit-Learn.

machine-learning numpy pandas scikit-learn

Last synced: 10 Apr 2026

https://github.com/ulf1/numpy-linreg

Linear Regression with numpy only.

linear-regression numpy

Last synced: 19 May 2026

https://github.com/pkx8326/blackjack-weighted-random-choices-visualization-with-numpy-and-python

This is a small project to test and visualize the result of the weighted probability capability of Python's numpy.random.choice()

blackjack blackjack-game choice jupyter-notebook numpy python random weighted-probability weighted-random

Last synced: 04 May 2026

https://github.com/msikorski93/linear-regression-from-scratch

Different solutions of linear and polynomial regression made from scratch.

from-scratch linear-regression numpy polynomial-regression tensorflow

Last synced: 08 May 2026

https://github.com/carlescn/snake_game

An implementation of the classic Snake game using Pygame and NumPy.

numpy pygame python snake-game

Last synced: 15 Apr 2026

https://github.com/emanuelefavero/machine-learning

This repository contains code for various machine learning concepts and algorithms

algorithms machine-learning numpy pandas python

Last synced: 13 Apr 2026

https://github.com/thiagohrcosta/calculator-app

Calculator APP is a Python-based API developed as part of the Rockeseat specialization in Python. The project simulates a real-world application while applying best practices in software design, architecture, and scalability. The main focus is on:

api design-patterns flask numpy python

Last synced: 08 May 2026

https://github.com/a-poor/qgates

Small helper library for working with quantum matrices

bell-states numpy pypi pypi-package python quantum-computing

Last synced: 04 May 2026

https://github.com/open-risk-academy/academy-course-dat31055

Support material for Academy course DAT31055

linear-algebra matrix-libraries ndarray numpy

Last synced: 20 May 2026

https://github.com/badranalyst/udemy-course-data-analysis-with-pandas-and-visualization-libraries

Explores Udemy course data using NumPy and Pandas for data manipulation. Provides insights into course topics, ratings, and pricing with visualizations created using Matplotlib, Seaborn, and Plotly.

data-science dataset eda exploratory-data-analysis matplotlib matplotlib-pyplot numpy pandas plotly python seaborn

Last synced: 16 Apr 2026

https://github.com/myahninsi/housing-price-prediction-ml

Final project for Big Data Visualization for Business Communications 01 (DSMM Group 1). Analyzes housing data, identifies key price factors, and builds predictive models using machine learning. Includes Power BI dashboards for interactive visualizations and Flask for deployment.

elastic-net lasso-regression linear-regression matplotlib numpy pandas ridge-regression scikit-learn seaborn

Last synced: 14 Apr 2026

https://github.com/separatrixxx/tmandcm_labs_3_sem

👓 Lab work for the 3rd semester at MAI on theoretical mechanics

matplotlib numpy python theoretical-mechanics

Last synced: 20 May 2026

https://github.com/jigyasag18/fake-news-prediction-project

The Fake News Prediction App repository offers a machine learning project that classifies news articles as fake or real. It uses a dataset of 20,000 articles and employs methods such as TF-IDF vectorization and the Porter stemming algorithm, achieving around 97% classification accuracy with a logistic regression model.

data datapreprocessing logistic-regression machine-learning machine-learning-algorithms numpy pandas prediction stemming vectorization

Last synced: 08 Jun 2026

https://github.com/MuhammedMorad/Egypt-Premier-Fashion-Sales-Analysis

Egypt’s Premier Fashion Sales Analysis is a data-driven project that provides interactive sales insights using Streamlit, Power BI, and Pandas. It analyzes sales trends, customer behaviors, and product performance while offering real-time dashboards and downloadable reports, helping businesses make informed, data-driven decisions. 🚀

matplotlib numpy pandas plotly python seaborn streamlit

Last synced: 30 Dec 2025

https://github.com/iankitnegi/datapy

Welcome to DATAPy! This repo contains my Python coding projects for data science, featuring data manipulation, analysis, and visualization. Stay tuned for updates and new content!

basic-programming matplotlib numpy pandas python seaborn

Last synced: 30 Apr 2026

https://github.com/ola-kaznowska/first-steps-in-numpy

My first steps in the NumPy library

mathematics numpy python python3

Last synced: 20 May 2026

https://github.com/kr1shnasomani/speedvision

Vehicle speed detection using Computer Vision, Deep Learning and Neural Network

bytetrack computer-vision deep-learning neural-network numpy opencv pytorch supervision tqdm yolo

Last synced: 05 Mar 2025

https://github.com/s-h-i-v-i-s/bangalore-housing-price-prediction

A Machine Learning Project to predict Bangalore House Prices.

jupyter-notebook machine-learning matplotlib numpy pandas python pythonflask sklearn

Last synced: 08 Apr 2026

https://github.com/annygutierrez/data-preprocessing

Preprocessing deep learning template.

matplotlib numpy pandas sklearn

Last synced: 02 May 2026