An open API service indexing awesome lists of open source software.

NumPy

NumPy is an open source library for the Python programming language, adding support for large, multidimensional arrays and matrices, along with a large collection of high-level mathematical functions to operate on these arrays.

https://github.com/kheriberto/knn_project

This is a simple project that uses dummy data to practice and demonstrate my knowledge of the KNN algorithm.

data-analysis knn-classifier numpy python scikit-learn seaborn

Last synced: 30 Dec 2025

https://github.com/sandyg6/breast-cancer-classification

Building a breast cancer classification model using a Random Forest classifier and deploying it as a web application with the Flask framework.

flask html-css-javascript numpy pandas python random-forest-classifier

Last synced: 30 Dec 2025

https://github.com/adirbella37/safety-analytics-project

Final project in Safety Management: analytics and predictive modeling for occupational incidents. Includes EDA, logistic regression, Poisson/Negative Binomial with overdispersion checks, ROC/AUC, and prediction exercises.

classification data-visualization drunk-and-drive eda logistic-regression matplotlib negative-binomial numpy occupational-safety overdispersion pandas poisson-regression python road-safety roc-auc scikit-learn seaborn statmodels

Last synced: 30 Dec 2025

https://github.com/akash-207/ecommerce-monetization-analysis

“Revenue and monetization analysis of Brazilian e-commerce data (Olist dataset). Includes KPIs, trends, and business insights.”

business-analytics matplotlib mysql numpy pandas python seaborn

Last synced: 30 Dec 2025

https://github.com/dhanas3kar/python-libraries

Practice code using Python libraries like NumPy, SciPy, Matplotlib, Pandas, and others. Covers basics, examples, and simple exercises for learning data handling, numerical computing, and visualization.

coding-excersise data-analytics-project data-v matplotlib numpy pandas practice python python-libraries scipy

Last synced: 30 Dec 2025

https://github.com/yalai92/master_academic_projects

This repository showcases the academic projects completed during my Master's in Data Science & AI. The main objective is to present a collection of projects in various data science fields, including data cleaning & preprocessing, data analysis, data visualization, machine learning, and clustering, among others.

automation clustering data-merging google-colab jupyter-notebook machine-learning matplotlib numpy pandas powerbi python sklearn xgboost

Last synced: 13 Jun 2025

https://github.com/pramodyasahan/titanic-survival

This repository contains a machine learning project focused on predicting the survival of passengers on the Titanic. The project uses a Support Vector Regression (SVR) model from the sklearn library and involves data preprocessing and prediction.

data-preprocessing matplotlib numpy pandas python scikit-learn support-vector-regression

Last synced: 30 Dec 2025

https://github.com/davityak03/basics-of-python-

A collection of Python programs written to develop basic Python skills.

array dictionary lists matrix numpy pandas programming python sets

Last synced: 30 Dec 2025

https://github.com/poziloi/-math-

Computer mathematics systems, assignments

jupyter-notebook math matplotlib numpy pandas python scipy seaborn

Last synced: 30 Dec 2025

https://github.com/lruizap/tomography-with-algebraic-topology

This project aims to analyze images using algebraic topology. The goal is to extract structural information from images through mathematical concepts such as connected components and loops.

gudhi jupyter-notebook matplotlib numpy opencv python tomography

Last synced: 28 Mar 2025

https://github.com/navindafernando/heart-disease-prediction

Deploying a regression model that predicts heart risk level in a web application using Flask 💮

flask html joblib machine-learning numpy supervised

Last synced: 28 Oct 2025

https://github.com/tberchanov/clustering-k-means

Sample program that implements clustering with the K-means algorithm, along with its visualisation.

clustering-algorithm kmeans-clustering matplotlib numpy python

Last synced: 23 Feb 2025

https://github.com/ashwathdazur/understanding-ds-using-python

Python NumPy, Pandas, Matplotlib and Seaborn for Data Analysis, Data Science and ML. Pre-machine learning Analysis.

data-science dataanalysis matplotlib numpy panda python3

Last synced: 07 Jul 2025

https://github.com/haydencordeiro/terafeed

Terafeed - Addressing Zero Hunger in Africa (Sustainability Goal SDG 2)

javscript numpy pandas powerbi python scikit-learn tableau vuejs

Last synced: 10 Apr 2025

https://github.com/bishopce16/movies-etl

The goal of this project was to create an automated pipeline for a company, Amazing Prime, that is hosting a hackathon.

json numpy pandas pgadmin4 psycopg2 python regular-expressions sql sqlalchemy time

Last synced: 30 Dec 2025

https://github.com/mgitrov/lifespan-x-population-status

A machine learning project aiming to predict animals' lifespan and population status.

bs4 data-science machine-learning matplotlib numpy pandas python regular-expressions requests seaborn sklearn

Last synced: 30 Dec 2025

https://github.com/xmcp/midify

Experimental WAVE to MIDI converter

midi numpy waveform

Last synced: 12 Jul 2025

https://github.com/beyzabasarir/northwind-traders-analysis

Northwind dataset analysis using PostgreSQL, Python, and Power BI. Focused on sales, customers, shipping, and performance insights.

dashboard data-analysis data-visualization jupyter-notebook matplotlib numpy pandas postgresql powerbi python seaborn

Last synced: 30 Dec 2025

https://github.com/linsanity03/algorithmic_trading

Trading strategy based on the intersection of short-term and long-term moving averages.

jupyter-notebook matplotlib-pyplot numpy pandas

Last synced: 23 Jun 2025

https://github.com/murugavl/flower-prediction

Flower Prediction is a machine learning project that uses the Iris dataset to classify iris flowers into three species: Setosa, Versicolor, and Virginica. The project includes data analysis, model training with various algorithms, and deployment via a Flask web application for user-friendly predictions.

flask machine-learning matplotlib numpy pandas python scikit-learn seaborn

Last synced: 30 Dec 2025

https://github.com/gnikit/fileio

A template for reading multiple columns from tab- or space-delimited files. Similar in functionality to numpy.loadtxt().

cpp file-io library numpy

Last synced: 27 Mar 2025

https://github.com/abdur-rahim-mygit/ml-movie-recommendation-system

This project is a Machine Learning-based Movie Recommendation System developed during my internship at YBI Foundation. The system uses collaborative filtering techniques to recommend movies based on user preferences and behavior. It was implemented on Google Colab using Python and popular machine learning libraries.

google-colab machinelearning movie-recomendation-system numpy pandas pyhton

Last synced: 02 Feb 2026

https://github.com/chubi-x/sort-it-image-classifier

Sort-it is an image classification application that classifies images of cats, dogs, bicycles, cars, and motorcycles.

image-classification machine-learning numpy pandas python streamlit tensorflow

Last synced: 30 Dec 2025

https://github.com/saivarun2611/potato_disease_classifier

This project is a Potato Disease Classifier that uses a Convolutional Neural Network (CNN) to classify potato leaves as healthy, affected by early blight, or affected by late blight. The model helps with early detection of diseases, which can be crucial in maintaining crop health and yield.

cnn keras matplotlib numpy pandas python3 tensorflow

Last synced: 23 Feb 2025

https://github.com/synthstellar/data-preprocessing-with-python

A data preprocessing repository focused on cleaning, transforming, and preparing datasets for machine learning tasks. It includes functions for handling missing values, scaling, encoding, and feature engineering for improved model performance.

data data-cleaning feature-engineering machine-learning numpy pandas preprocessing python scikit-learn

Last synced: 27 Mar 2025

https://github.com/tharushi11/titanic-data-analysis

This repository contains a simple prediction model and survival predictions based on the Titanic dataset from Kaggle, along with detailed visualizations using Power BI.

dax machine-learning numpy pandas powerbi sklearn titanic-survival-prediction visualization

Last synced: 27 Mar 2025

https://github.com/danish-jamil-01/image-dehazer

Removes haze from an image using the dark channel prior and a guided filter.

dark-channel-prior guided-filter guided-filtering ndimage numpy open-cv opencv opencv-python pillow python scipy skimage tkinter

Last synced: 30 Dec 2025

https://github.com/krumyakimov/house-price-developments-in-bulgaria

I developed this project for my final exam in the Data Science course at SoftUni.

data-science jupyter-notebook matplotlib numpy pandas scipy seaborn

Last synced: 08 Oct 2025

https://github.com/tahir1072a/titanic-veri-analizi

A data analysis project built with NumPy and Pandas using the Kaggle Titanic dataset.

data-science kaggle numpy pandas python titanic-survival-prediction

Last synced: 30 Dec 2025

https://github.com/redinfinitypro/ai_projects

Rating (7/10): The code is a chatbot that interacts with users based on article content. However, issues include incorrect sorting, typos, incorrect indentation, and a lack of explanations for each part, potentially affecting the bot's robustness and understanding of context.

newspaper nltk numpy python random sklearn string warnings

Last synced: 20 Jul 2025

https://github.com/rohansoni45/ipl-win-predictor

A machine learning model that predicts the outcome of IPL matches based on historical data and player performance. Built using Python and libraries like Pandas, NumPy, and scikit-learn. This project demonstrates the use of classification algorithms to predict match outcomes.

classification data-science ipl machine-learning model-evaluation numpy pandas python sklearn sports-analytics

Last synced: 28 Mar 2025

https://github.com/shefreenkaur/web-scraping-and-word-frequencies

This project analyzes word frequencies in BC Legislative documents using Stanford CoreNLP and Python. The program extracts text from PDF documents, processes it using natural language processing techniques, and generates a comprehensive word frequency analysis.

analytics chromedriver easyocr nlp numpy pandas pymupdf python selenium stanfordcorenlp webscraping wordfrequency

Last synced: 28 Mar 2025

https://github.com/orliluq/inmersion-datos-python

Develop machine learning models to predict the probability of customer credit default, using different classification algorithms (Logistic Regression, Decision Trees, Random Forest, Naive Bayes).

colab-notebook numpy pandas python scikit-learn

Last synced: 30 Dec 2025

https://github.com/paulinhok14/csgo-datascience-project

📊 Analysis of CS:GO grenade usage patterns and their impact on match outcomes using data science and statistical methods.

matplotlib mlflow numpy python scikit-learn scipy seaborn

Last synced: 30 Dec 2025

https://github.com/abidhasanrafi/pharma-sales-analytics

A Streamlit-powered web application for analyzing pharmaceutical sales performance across teams, products, and territories.

matplotlib numpy pandas plotly sales-analysis scikit-learn seaborn streamlit

Last synced: 30 Dec 2025

https://github.com/jolivas51565/streamlit-hotkeys

Streamlit Hotkeys adds app-wide keyboard shortcuts to Streamlit apps, with edge-triggered events and a simple Python API for Ctrl/Cmd/Alt/Shift bindings. 🐙

base64 browser-game cmdk game-development hotkeys keybindings keyboard-input numpy pillow pixel-art png-encoding python realtime shortcuts streamlit streamlit-component streamlit-hotkeys ui

Last synced: 11 Sep 2025

https://github.com/vigneshvaranasi/breast_cancer_detection

This project employs machine learning, focusing on Logistic Regression, to detect breast cancer using tumor-related features. The dataset is preprocessed, and the model achieves 100% accuracy on the test set. The goal is to gain insights into breast cancer factors and provide an effective detection solution.

jupyter machine-learning matplotlib numpy pandas python scikit-learn

Last synced: 30 Dec 2025

https://github.com/aksoni07/movie-recommendation

A hybrid movie recommendation system designed to deliver personalized and accurate suggestions by combining user preferences, item attributes, and collaborative patterns, ensuring a seamless and engaging experience.

clustering content-based-filtering data-analysis embeddings jupyter-notebook numpy ollaborative-filtering pandas personalization python recommendation-systems scikit-learn user-item-interactions

Last synced: 30 Dec 2025

https://github.com/eljandoubi/disasterresponsepipeline

The project aims to build a Natural Language Processing (NLP) model to categorize messages in real time.

flask nltk numpy pandas plotly scikit-learn scipy sqlalchemy

Last synced: 30 Dec 2025

https://github.com/zalkklop/lvsm

Official code for "LVSM: A Large View Synthesis Model with Minimal 3D Inductive Bias" (ICLR 2025 Oral)

attention-mechanisms deep-learning github-config large-view-synthesis matplotlib multi-label-classification nltk numpy one-vs-rest python seaborn sql tensorflow transformers

Last synced: 09 Apr 2025

https://github.com/shoaib1522/introduction-to-data-science

Unlock the world of data science with hands-on tutorials, interactive notebooks, and essential resources! Whether you're a beginner or looking to refine your skills, this repo offers a comprehensive guide to mastering Python, Git, NumPy, Pandas, and more—empowering you to analyze, visualize, and transform data like a pro! 🚀📊💻

confusion-matrix github-config kaggle matplotlib numpy pandas python3 script visualization

Last synced: 07 Jul 2025

https://github.com/apbetioli/neural-network

A simple neural network implementation

machine-learning neural-network numpy python3

Last synced: 02 Apr 2025

https://github.com/13anush/python-libraries-

A collection of essential Python libraries—NumPy, Pandas, Matplotlib, and Seaborn—perfect for anyone starting out in data analysis.

data-analysis matplotlib numpy pandas python seaborn

Last synced: 09 Apr 2025

https://github.com/kavayk29/speech-recognition-using-tdnn-and-data-augmentation

Developed a speech recognition system using TDNN, preprocessing audio, extracting MFCC features, and training the model. Fine-tuning with augmented data (19,000 rows) improved accuracy from 9% to 80% on training and 40% on validation. Data augmentation proved crucial for enhancing model performance and generalization. Still working to increase the accuracy.

deep-learning keras-tensorflow numpy os pandas tdnn tensorflow

Last synced: 01 Sep 2025

https://github.com/shefreenkaur/nlp_query_documents

This repository contains two implementations of an NLP document query system that processes PDF documents and ranks them based on relevance to user queries.

easyocr naive-bayes nlp numpy ppmi pymupdf tf-idf

Last synced: 29 Dec 2025

https://github.com/bbergerud/tenops

Creating a common interface for numpy, pytorch, tensorflow, etc.

numpy pytorch tensorflow

Last synced: 20 Jan 2026

https://github.com/ledsouza/python-powerbi

Develop a dashboard for financial market analysis using Python and Power BI

etl matplotlib numpy pandas powerbi python seaborn

Last synced: 30 Dec 2025

https://github.com/tasninanika/k-means-clustering

An interactive and insightful customer segmentation project using K-Means Clustering.

matplotlib numpy pandas plotly python3 scikit-learn seaborn

Last synced: 30 Dec 2025

https://github.com/nikhilfuke1/movies-recommendation-system-python-project-

This project is a simple movie recommendation system built using Python. It suggests movies based on the similarity of titles and a predefined similarity score matrix.

difflib numpy pandas python sklearn

Last synced: 30 Dec 2025

https://github.com/hansalemaos/ffmpeg_stream_to_numpy

ffmpeg stream to numpy arrays

cv2 ffmpeg numpy python stream video

Last synced: 28 Nov 2025

https://github.com/darkdk123/handwashing-discovery-analysis

A guided bootcamp project analysing the original data used in the discovery of viruses & hand washing by Dr. Ignaz Semmelweis at Vienna General Hospital in the 1840s.

data-analysis data-science data-visualization matplotlib-pyplot numpy pandas plotly-python python seaborn-plots

Last synced: 30 Dec 2025

https://github.com/hoyirul/flask-starter-fs

Flask-Starter-FS is a concise full-stack starter kit for Flask web development, offering quick setup and pre-configured settings for seamless integration of front-end and back-end components. Accelerate your full-stack web application development with Flask-Starter-FS.

flask machine-learning numpy pandas python sklearn

Last synced: 24 Jun 2025

https://github.com/aminkhavari78/-maternal-health-risk-data-set

In this part I'm working on Maternal Health Risk Prediction

data-science linearsvc machine-learning matplotlib numpy pandas scatter-plot sklearn

Last synced: 24 Jun 2025

https://github.com/wondermongering/nonlineardatagen

A flexible Python framework for generating, fitting, and visualizing noisy nonlinear data. Perfect for educational purposes, algorithm testing, and demonstrating statistical concepts. Includes tools for various noise models, custom function fitting, robust error metrics, and publication-quality visualizations

curve-fitting data-simulation data-visualization education educational-tool interpolation mathematical-modelling matplotlib noise-modeling numpy regression-analysis scientific-computing scipy signal-processing statistics teaching-resource time-series-analysis

Last synced: 22 Aug 2025

https://github.com/sameerjain/ai-snake-game

Implemented a Q-learning agent to play and learn the classic Snake game, leveraging PyTorch for the neural network. Visualized using Matplotlib, showcasing the agent's evolving performance over time. Built a UI using Pygame to visualize the snake, the food, and the current score.

anaconda conda matplotlib neural-network numpy pygame python3 pytorch q-learning

Last synced: 03 Sep 2025

https://github.com/kasraskari/tumor-predict

Streamlit app for predicting tumor malignancy using logistic regression.

logistic-regression machine-learning numpy pandas python scikit-learn streamlit tumor-detection

Last synced: 30 Dec 2025

https://github.com/yash-dave/creditcard-fraud-detector

An end-to-end Streamlit app for detecting credit card fraud using unsupervised anomaly detection techniques like Isolation Forest, One-Class SVM, LOF, and Autoencoders.

numpy pandas plotly python seaborn streamlit tensorflow

Last synced: 30 Dec 2025

https://github.com/s-h-i-v-i-s/bangalore-housing-price-prediction

A Machine Learning Project to predict Bangalore House Prices.

jupyter-notebook machine-learning matplotlib numpy pandas python pythonflask sklearn

Last synced: 23 Feb 2025

https://github.com/carmoreno/aviones-aldeas

✈️ Final project of the Complexity and Optimisation course 🏡

algorithms numpy pyqt4 scipy

Last synced: 23 Feb 2025

https://github.com/sanjurajveer/market_basket_analysis

Finding patterns in purchased items through market basket analysis

apriori-algorithm association-rules numpy pandas python seaborn

Last synced: 13 Apr 2025

https://github.com/mrnust/ai-powered-socialmediapostoptimizer_llm

AI-Powered Social Media Post Optimizer enhances your social media content by optimizing text posts with relevant tags and keywords, predicting engagement metrics, and analyzing image quality. Upload your post and image to receive tailored suggestions for improving engagement and visual appeal.

groq langchain large-language-model llama-3-70b matplotlib numpy opencv streamlit textblob

Last synced: 29 Dec 2025

https://github.com/dulajkavinda/handwritten-recognition

🛢 Handwritten Digit Recognition using OpenCV

added machinelearning numpy opencv python

Last synced: 30 Dec 2025

https://github.com/toon-van-berkel/imagetovideofx

ImageToVideoFX is a simple Python project that takes an image (JPG/PNG) and transforms it into a short video with a smooth zoom effect. You can optionally add background music. This is perfect for content creators or developers who want to automate turning images into engaging video content.

imagetovideo moviepy numpy pillow python

Last synced: 08 May 2025

https://github.com/jai0212/ai-virtual-painter

Real-time finger and object painting using advanced computer vision technology with custom hand gestures and customizable colours

artificial-intelligence cmake computer-vision cpp mediapipe numpy opencv python

Last synced: 30 Dec 2025

https://github.com/themysterysolver/python_basics

"This repo includes simple Python programs for beginners, covering various concepts, object-oriented programming, and different module usages, to help with understanding and learning Python."

basic-python-syntax collections dsa matplotlib numpy oops-in-python pandas python re scipy

Last synced: 30 Dec 2025

https://github.com/marksikaundi/30daysofdeeplearning

This challenge is designed to help you build a strong foundation in Python programming and deep learning over the course of 30 days. By the end of this challenge, you will have developed a comprehensive understanding of deep learning concepts.

data-science nlp numpy python pytorch tensorflow

Last synced: 30 Dec 2025