An open API service indexing awesome lists of open source software.

NumPy

NumPy is an open source library for the Python programming language, adding support for large, multidimensional arrays and matrices, along with a large collection of high-level mathematical functions to operate on these arrays.

https://github.com/ksharma67/k-means-algorithm-on-the-iris-dataset

Applied the K-Means algorithm on the Iris dataset, and utilized the Silhouette Score method to find the best value of K

eda elbow-method iris-dataset kmeans-clustering matplotlib numpy pandas python seaborn silhouette-score

Last synced: 29 Apr 2026

https://github.com/jack-thesparrow/tube-color-detect

A Python program that detects the colors stored in tubes and returns a JSON file.

matplotlib-pyplot numpy opencv python3 sklearn

Last synced: 29 Apr 2026

https://github.com/adalbertobrant/fundamentalista

Graham and Greenblatt analysis for picking cheap stocks with good long-term growth potential

graham greenblatt matplotlib numpy pandas python3 streamlit

Last synced: 29 Apr 2026

https://github.com/abhishek010397/tigerprediction-machinelearning

Machine Learning Prediction Model Using Multiple Regression to predict Tiger Population in India

keras-tensorflow matplotlib-pyplot multiple-linear-regression numpy pandas python3 r2 sklearn

Last synced: 29 Apr 2026

https://github.com/harsha-yuvaraj/data-compression-and-decompression-tool

A specialized lossless file compression and decompression tool designed mainly for plain text, including programming files.

data-structures huffman-compression-algorithm lossless-compression-algorithm numpy python tkinter

Last synced: 15 Apr 2026

https://github.com/gama1903/digital_image_processing

Implementation of Basic Algorithms of Digital Image Processing mainly through Numpy.

digital-image-processing numpy opencv python

Last synced: 08 May 2026

https://github.com/rebeccamorolong/mtn-customer-churn-prdiction

This project addresses a real-world business problem: predicting customer churn for MTN, a major telecommunications company. Customer churn significantly impacts profitability, and accurate predictions enable targeted retention efforts.

anova-test matplotlib-pyplot numpy pandas python seaborn

Last synced: 29 Apr 2026

https://github.com/balajig-24/titanic_data_analysics-

Titanic Survival Prediction: a classic machine learning problem that aims to predict whether a passenger survived the Titanic disaster based on features such as age, gender, and passenger class. This project demonstrates my ability to clean, analyze, and model.

jupyter-notebook matplotlib numpy pandas python scikit-learn seaborn

Last synced: 09 Apr 2026

https://github.com/chandadiya2004/movie-recommendation-system

A Movie Recommendation System built using TfidfVectorizer and cosine similarity. The model processes a large dataset of movies and recommends similar movies based on a given input movie by analyzing textual features and calculating similarity scores.

cosine-similarity numpy pandas python sklearn tfidf-vectorizer

Last synced: 29 Apr 2026

https://github.com/x-nimesh/panoramer-dockerized

Panoramer is a cutting-edge panoramic image stitching web application designed to transform your images into stunning, professional-quality panoramas. Whether you're a seasoned photographer or a hobbyist, Panoramer's intuitive interface and precise alignment tools make it easy for users of all skill levels to create seamless panoramas.

flask image-processing numpy python react

Last synced: 29 Apr 2026

https://github.com/prakashjha1/stock-investment-analysis

The Stock Investment Analysis project can help investors select better-performing stocks.

data-analysis data-science numpy pandas pandas-datareader parallel-programming python

Last synced: 08 May 2026

https://github.com/alexgustafsson/word-frequencies

Data and tools to compile word frequencies, trigrams and more for use with NLP, spelling correction etc.

language nlp nltk numpy python python3 sklearn spelling-correction trigram

Last synced: 10 May 2026

https://github.com/gvatsal60/ds-on-kaggle

A collection of data science projects, experiments, and insights from Kaggle competitions and datasets

data data-science data-visualization numpy pandas python3

Last synced: 29 Apr 2026

https://github.com/sukanyadutta52/topic_modeling

What are the most pressing concerns regarding ‘Climate Change’ among tweeters according to Topic Modeling?

climate-change gensim matplotlib nltk numpy pandas pyldavis regular-expression spacy

Last synced: 09 Apr 2026

https://github.com/shwetapardhi/assignment-04-simple-linear-regression-1

Q1) Delivery_time -> Predict delivery time using sorting time. Build a simple linear regression model by performing EDA and do necessary transformations and select the best model using R or Python. EDA and Data Visualization, Feature Engineering, Correlation Analysis, Model Building, Model Testing and Model Predictions using simple linear regressi

correlation-analysis data-visualization distplot eda feature-engineering model-building model-prediction model-testing numpy ols-regression p-value pandas python regression-plot rsquare-values seaborn simple-linear-regression smf statsmodel t-score

Last synced: 30 Apr 2026

https://github.com/amirali5/numpy

This repo is all about NumPy, a Python library for working with multidimensional arrays. It also has functions for working in the domains of linear algebra, Fourier transforms, and matrices. NumPy was created in 2005 by Travis Oliphant. It is an open source project and you can use it freely.

numpy

Last synced: 30 Apr 2026

https://github.com/victor-gp/tfg-h16b

The code for my bachelor's thesis, a numerical exploration of the second part of Hilbert's 16th problem

bachelor-thesis dynamical-systems matplotlib-pyplot numpy odeint

Last synced: 30 Apr 2026

https://github.com/samiksha29-patil/hr-employee-data-analysis-visualization-in-python

This project focuses on analyzing an HR Employee Dataset that contains details about employees such as demographics, job status, salaries, performance reviews, satisfaction levels, and attrition reasons.

csv-files data data-visualization dataanalysis matplotlib numpy pandas python seaborn

Last synced: 30 Apr 2026

https://github.com/sjain2580/simple-linear-regression-model

This project demonstrates a simple, yet robust, multiple linear regression model built with Python and scikit-learn to predict median house values in California.

joblib linear-regression matplotlib matplotlib-pyplot numpy python scikit-learn

Last synced: 30 Apr 2026

https://github.com/shohail-dev/python-sales-analysis

Python analysis project: analyze Diwali sales data to improve customer experience and sales.

matplotlib numpy pandas python-lambda python3 seaborn

Last synced: 30 Apr 2026

https://github.com/gill-singh-a/radial-velocity-of-star

A program that approximates the number of exoplanets in a star system and their data (mass, radius of revolution, and time period of revolution), given the noisy radial velocity of the star

astronomical-data-analysis astronomy exoplanets fourier-transform matplotlib numpy physics python python-3 science stars

Last synced: 30 Apr 2026

https://github.com/niteshchawla/nc-walmart-casestudy

The management team at Walmart Inc. wants to analyze customer purchase behavior (specifically, purchase amount) against the customer’s gender and various other factors to help the business make better decisions. They want to understand whether spending habits differ between male and female customers.

central-limit-theorem confidence-intervals matplotlib-pyplot numpy pandas python scipy-stats seaborn stats

Last synced: 08 May 2026

https://github.com/ebadshabbir/naive_bayes_classification-gaussain-

A Python project that applies Naive Bayes classification to predict user purchases based on age and salary using the Social Network Ads dataset. The project includes data preprocessing, model training, prediction, and visualization of decision boundaries for both training and test datasets.

classification gaussian jupyter-notebook machine-learning matplotlib-pyplot naive-bayes-classifier numpy pandas python

Last synced: 30 Apr 2026

https://github.com/blankscreen-exe/triple-trial-card-maker

Triple Triad card maker for a game inspired by Final Fantasy

card-game numpy opencv python tripletriad

Last synced: 30 Apr 2026

https://github.com/mayankfreelancer/advanced-sales-analytics-dashboard-power-bi-

This interactive Power BI dashboard provides a comprehensive analysis of sales data across regions, categories, and time periods. The project aims to uncover key trends in total sales, profit, quantity sold, and product performance, using advanced visualizations and forecasting techniques. 🛠 Tools & Techniques Used: Power BI

dashboard data-science data-visualization excel numpy pandas powerbi python sales-analysis sql

Last synced: 30 Apr 2026

https://github.com/himanshugoyal77/movie-recommendation

Cosine similarity based Recommendation Engine

nltk numpy pandas requests-library-python tmdb-api vectorization

Last synced: 08 May 2026

https://github.com/redinfinitypro/python-task-manager

Rating: 7/10. The Python Task Manager is a user-friendly interface, built with the tkinter library, for system monitoring and management. It displays detailed system process information using psutil, with interactive features.

matplotlib numpy psutil pygetwindow tkinter

Last synced: 30 Apr 2026

https://github.com/alz-11/ftoc

Fahrenheit to Celsius conversion analysis, comparing accurate and approximate methods with visualizations and error analysis.

approximation celsius conversion data-visualization error-analysis fahrenheit fahrenheit-to-celsius jupyter jupyter-notebook matplotlib numpy python python-3 python3

Last synced: 08 May 2026

https://github.com/favalos/myway-to-nn

Project to store my understanding of Neural Networks.

jupiter-notebook neural-network numpy

Last synced: 30 Apr 2026

https://github.com/johassel/feed_forward_neural_network

Fully-connected feed-forward neural network based on the course "Python for Machine Learning" at TU Berlin WiSe 24/25

machine-learning neural-networks numpy python

Last synced: 30 Apr 2026

https://github.com/ddeepanshu-997/datascience-e-commerce-shopping-details-

In this project I apply data preprocessing techniques to clean the dataset, derive insights to find the most popular shopping items, and use data visualization libraries to surface trends and other aspects, as a small contribution to the field of data science.

cleaning-data data data-science data-visualization dataframe datapreprocessing dataset libraries matplotlib-pyplot numpy pandas plots python visualization

Last synced: 30 Apr 2026

https://github.com/darinochka/course_python_iad

Homework and seminars for the "Intelligent Data Analysis" minor

machine-learning numpy pandas python3

Last synced: 30 Apr 2026

https://github.com/abhiram-epuru/ai-powered-movie-recommender

A Movie Recommendation System using the MovieLens dataset, leveraging collaborative filtering techniques such as KNN (k-Nearest Neighbors) and Singular Value Decomposition (SVD). The project demonstrates a personalized recommendation model capable of suggesting movies based on user preferences and interaction data.

matplotlib numpy pandas python sckiit-learn seaborn

Last synced: 30 Apr 2026

https://github.com/maximlevchenko/python-course-convolution-orb-detector-data-processing-2d-game

This repository contains a collection of projects developed during a university Python course. Each project explores different aspects of data processing, analysis, and computer vision.

2dgame bomberman-game computer-vision convolution harris-corner-detector image-processing keypoint-detection numpy pandas python statistical-analysis

Last synced: 30 Apr 2026

https://github.com/mogankumar/bird-flock-simulation-with-pyspark

A PySpark-based bird flock simulation modeling alignment, cohesion, and separation forces with a leader guiding the flock. Efficiently handles large-scale simulations and generates a visualized GIF of the flock’s movement.

argparse matplotlib numpy pyspark python3

Last synced: 01 May 2026

https://github.com/amruta33/melanoma-detection

To build a CNN based model which can accurately detect melanoma. Melanoma is a type of cancer that can be deadly if not detected early. It accounts for 75% of skin cancer deaths. A solution that can evaluate images and alert dermatologists about the presence of melanoma has the potential to reduce a lot of manual effort needed in diagnosis.

keras machine-learning numpy pandas python3 sckiit-learn tensorflow

Last synced: 09 Apr 2026

https://github.com/vsrc/project-cardionaut

AI predictor for Heart Attack likelihood

numpy pandas pickle python streamlit

Last synced: 01 May 2026

https://github.com/splithor1zon/edu-barriermethodlp

Educational material for learning and teaching Linear Programming using the Barrier Method. Presentation with commentary and Python code available.

barrier-method convex-optimization cvxpy educational-project linear-programming numpy presentation python

Last synced: 01 May 2026

https://github.com/thekartikeyamishra/data-preprocessor

A Google Colab module for interactive data preprocessing. Handles missing values, categorical encoding (One-Hot, Label), and numerical scaling (Standard, MinMax). Outputs a cleaned dataset

ipywidgets numpy pandas python scikit-learn

Last synced: 08 May 2026

https://github.com/lexxai/goit_python_ds_hw_03

Module 3. Classical machine learning. Overfitting. Linear regression. LaTeX formulas.

latex linear-regression matplotlib numpy pandas python scikit-learn

Last synced: 09 Apr 2026

https://github.com/sandip2512/play-store-app-review-analysis

EDA on Play Store apps and user sentiment reviews

eda heatmaps matplotlib numpy pandas plotly seaborn sentiment-analysis

Last synced: 01 May 2026

https://github.com/leandro-bertoluzzi/parchis-probabilities

Small weekend project to play around with NumPy and help me make decisions while playing parchís

matplotlib numpy python

Last synced: 08 May 2026

https://github.com/filnow/tinygimp

project for image processing class

gimp numpy opencv pysimplegui python3

Last synced: 08 May 2026

https://github.com/deusares/funpil

A collection of PIL, Numpy and OpenCV functions to manipulate images

batch image-processing lazy media numpy opencv pil pillow python script

Last synced: 01 May 2026

https://github.com/686f6c61/numeros-primos-gemelos

This project generates and visualizes twin primes in different ways to help understand their patterns and distribution. It includes multiple visualizations and advanced statistical analyses.

numpy primos python scipy

Last synced: 10 May 2026

https://github.com/dostuffthatmatters/jax-capability-analysis

demo repository containing the experiments for my master's seminar @ TUM

autograd click deep-learning huggingface jax jit-compilation mnist numpy python pytorch

Last synced: 01 May 2026

https://github.com/dhruvv1402/spam-detection-python-

This project is a Spam Detection System built using Python. It classifies SMS messages as spam or ham (not spam) using machine learning techniques.

countvectorizer kaggle-dataset nlp-machine-learning nltk numpy pandas python scikit-learn supervised-machine-learning tf-idf

Last synced: 01 May 2026

https://github.com/rgandhi8/inventory-management-system

This project is an AI-driven Inventory Management System designed to optimize supply chain efficiency by leveraging machine learning techniques for demand forecasting. The system integrates ARIMA and ANNs to predict inventory needs, minimize shortages, and reduce holding costs. It also enhances decision-making by automating inventory planning.

dashboard inventory-management-system machine-learning numpy pandas python seaborn

Last synced: 08 Jun 2026

https://github.com/dpb44/exploring-the-intuition-of-neural-networks-on-a-classification-problem-using-only-numpy

Implementing a softmax-based neural network from scratch using NumPy to classify the Iris dataset, leveraging vectorization, gradient descent, and decision boundary visualization.

deep-learning neural-network numpy softmax-classifier

Last synced: 02 May 2026

https://github.com/ducha-aiki/laf2ell

Utils for LAF to ellipse and back conversion. Also function for drawing LAFs.

ellipse laf local-features numpy visualization

Last synced: 02 May 2026

https://github.com/v-goncharenko/freespace

Implementation of signal propagation in free space

numpy python signal-processing

Last synced: 02 May 2026

https://github.com/voidkeishi/computational-science

This repository contains implementations of algorithms from the course "Computational Science" at HUST

computational-science numpy python

Last synced: 02 May 2026

https://github.com/shreeparab1890/unicorns-of-india-till-sep-2022-analysis-eda

This IPython notebook is an exploratory data analysis (EDA) of the unicorns of India through Sep 2022.

analysis data-analysis eda exploratory-data-analysis matplotlib-pyplot numpy pandas plotly

Last synced: 02 May 2026

https://github.com/chdl17/credit-eda-casestudy

Credit EDA is a GitHub repository designed to provide educational materials and tools for exploring credit risk data. The repository contains notebooks, datasets, and code samples to help users learn and apply these concepts to credit data.

exploratory-data-analysis numpy pandas

Last synced: 08 May 2026

https://github.com/fabriciocovalesci/sistema-vendas-graficos

Simple sales and inventory system, with charts made using Matplotlib

matplotlib matplotlib-pyplot numpy python-script python3 venv virtualenv

Last synced: 02 May 2026

https://github.com/bharath-tars/streamlit_diabsynth

Predictive Health Analytics for Diabetic Risk Assessment and Personalized Reporting WebApp using Streamlit

deeplearning mahine-learning numpy onrender-deploy pandas seaborn streamlit

Last synced: 02 May 2026

https://github.com/shibam120302/all_about_python

Here I upload Python material from basic to advanced: OOP in Python, DSA using Python, system design, NumPy, pandas, data science, and ML. Follow @shibam120302 and star this repo.

numpy numpy-arrays oops-in-python pandas python python-dsa python-library python3 system-design tabula

Last synced: 10 May 2026

https://github.com/ksharma67/intel-stock-predication-wiith-eda

We are trying to design a model that can predict the price of stock using different methods and algorithms.

eda linear-regression machine-learning-algorithms matplotlib numpy pandas prediction python scaler seaborn skit-learn

Last synced: 02 May 2026

https://github.com/saurabh274/aerofit-statistics-and-probability

The market research team at AeroFit wants to identify the characteristics of the target audience for each type of treadmill offered by the company, to provide a better recommendation of the treadmills to the new customers. The team decides to investigate whether there are differences across the product with respect to customer characteristics.

matplotlib-pyplot numpy pandas probability python seaborn statistics

Last synced: 08 May 2026

https://github.com/hajdu-patrik/nlp_music-mood-classifier_bme_hf

This project is a Python-based NLP system that analyzes song lyrics to determine their emotional profile and recommends similar songs.

nlp nlp-machine-learning numpy pandas python

Last synced: 10 Nov 2025

https://github.com/daksh90a/wine-quality-analysis

The Wine Quality Analysis project is an AI/ML-based data analysis initiative aimed at predicting and understanding the factors that influence the quality of wine.

matplotlib-python numpy pandas seaborn

Last synced: 12 May 2026

https://github.com/raduldev/biopython

Biopython record work and a few extra coding exercises

bioinformatics biopython gromacs jupyter-notebook matplotlib numpy pandas python3

Last synced: 03 May 2026

https://github.com/kisaa-fatima/foreground-segmentation-using-kmeans-face-recognization-using-knn

Implement a basic version of the interactive image cut-out/segmentation approach called Lazy Snapping. The program uses K-Means Clustering to segment images into foreground and background based on user-provided seed pixels.

face-recognition foreground-segmentation kmeans-clustering knn-classification matplotlib numpy opencv2 pil

Last synced: 03 May 2026

https://github.com/chandkund/movie-recommender-system

Unlocking the magic of personalized movie recommendations! This project uses the Cosine Similarity Algorithm to suggest movies based on user-selected titles, making movie discovery more enjoyable.

cosine-similarity matplotlib nltk numpy pandas-python seaborn

Last synced: 12 May 2026

https://github.com/k-ilyas/ai-projects-handwritten-digit-recognition-titanic-survivor-analysis

AI Projects: Handwritten Digit Recognition & Titanic Survivor Analysis using scikit-learn

machine-learning numpy pandas sickit-learn

Last synced: 03 May 2026

https://github.com/niteshchawla/netflix-case-study

Analyze the data and generate insights that could help Netflix decide which types of shows/movies to produce and how to grow the business in different countries

matplotlib numpy pandas python seaborn

Last synced: 09 Apr 2026

https://github.com/shoaib1522/credit-card-fraud-analysis

A comprehensive project on Credit Card Fraud Detection combining Exploratory Data Analysis, Machine Learning, and an interactive Streamlit web app for real-world applicability.

eda epochs mlp-classifier neural-network numpy pandas-python perceptron-learning-algorithm python regression sckiit-learn seaborn streamlit streamlit-webapp tensorflow

Last synced: 09 Apr 2026

https://github.com/nak-dt/deep-decision-forest

A custom ensemble tree model inspired by Deep Forest

ai machine-learning machine-learning-algorithms numpy pandas python sklearn

Last synced: 03 May 2026

https://github.com/prathicashettym/machinelearningalgorithms

Contains mini projects and implementations using ML algorithms

jupyter-notebook linear-regression matplotlib numpy pandas panfas python seaborn

Last synced: 03 May 2026

https://github.com/jsnevt/decisiontreecls

Data Science - Random trees - RandomForestClassifier

decision-tree-classifier matplotlib numpy pandas seaborn

Last synced: 03 May 2026

https://github.com/shreenidhi7700/auto_price_prediction

We are required to use Machine Learning algorithms to predict the price of cars with the available independent variables such as Horsepower, no-of-cylinders, engine-type, city_mpg, highway_mpg etc.

decision-tree-regression ensemble knn-regression linear-regression machine-learning-algorithms matplotlib-pyplot numpy pandas python3 random-forest-regression regression seaborn xgboost-regression

Last synced: 04 May 2026

https://github.com/vinicius999/icu-beds-forecast-covid-19

Forecasting ICU bed occupancy in Brazil due to the COVID-19 pandemic, using Support Vector Regression (SVR)

covid-19 matplotlib numpy pandas python sklearn support-vector-regression

Last synced: 03 May 2026

https://github.com/pramodyasahan/binary-classifier

This repository houses the code for a machine learning model designed to predict customer churn. The model is built using Support Vector Machine (SVM) from the scikit-learn library and incorporates preprocessing, pipeline, and grid search techniques for optimal performance.

numpy pandas scikit-learn

Last synced: 03 May 2026

https://github.com/zongxr/dcic2025-riverwaterprediction

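The Min River is the largest river in Fujian Province that flows independently into the sea, and its basin receives abundant rainfall. With the new-type power system and a new phase of high-quality water-resources development, easing flood-control pressure in the basin and improving water-use efficiency are top priorities for the high-quality development of hydropower companies. Flood forecasts based on traditional hydrological principles already reach Grade A accuracy 9 hours ahead, but there is still room to improve both lead time and forecast accuracy. Using artificial intelligence to accurately predict reservoir inflow in the basin is therefore of major strategic importance for safe reservoir operation, flood control, and a stable supply of clean energy.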

lightgbm machine-learning numpy pandas sklearn time-series xgboost

Last synced: 03 May 2026

https://github.com/manikantasanjay/youtube_exploratory_data_analytics

A mini project that uses Python libraries such as NumPy, pandas, and Matplotlib to perform data analytics on the US Video and Comments dataset.

jupyter-notebook matplotlib numpy pandas plotly python seaborn wordcloud

Last synced: 03 May 2026

https://github.com/syed-m-nofel/python-data-science-fundamentals

Python notebooks for data manipulation (Pandas/NumPy) and API workflows – from basics to practical examples.

api beginner-friendly data-analysis data-science http-requests jupyter-notebook numpy pandas pandas-dataframe python tutorial

Last synced: 03 May 2026

https://github.com/ankitmishralive/machinelearning

Continuously deepening my understanding of and advancing my expertise in Machine Learning through ongoing education and hands-on practical experience.

artificial-intelligence data-analysis data-cleaning data-gathering machine-learning machinel-learning-algorithms matplotlib numpy pandas python seaborn

Last synced: 22 Mar 2025

https://github.com/lmizner/codecademy_product_defects

Practicing rules of probability, set theory, and distributions

cdf jupyter-notebook numpy pmf ppf python rvs scipy-stats

Last synced: 03 May 2026

https://github.com/nidomika/digit-recognition

Digit recognition project in Python

digit-recognition numpy opencv python tensorflow

Last synced: 12 Apr 2026