An open API service indexing awesome lists of open source software.

NumPy

NumPy is an open source library for the Python programming language, adding support for large, multidimensional arrays and matrices, along with a large collection of high-level mathematical functions to operate on these arrays.

https://github.com/ronverse17/automate-data-cleaning

This project automates messy data cleaning tasks, such as fixing column names, filling missing values, and spotting outliers, so analysts and data scientists can spend more time on insights rather than preprocessing.

numpy pandas pipeline python

Last synced: 28 Apr 2026

https://github.com/hai4320/ml_ai_notebook

All my notes about ML, AI, and Data Science

ai machine-learning numpy pandas scikit-learn

Last synced: 28 Apr 2026

https://github.com/hadson0/chess-live-ratings-data

A study project focused on web scraping the live chess ratings from chess.com, with data analysis and visualization on nearly 5000 players in the classical world ranking.

beautifulsoup chess data-analysis data-visualization numpy pandas python seaborn web-scraping

Last synced: 28 Apr 2026

https://github.com/dwade-eng/amazon-product-recommender-prototype-

This project is a content-based product recommendation engine inspired by Amazon's "Customers who viewed this item also viewed" feature. It uses a dataset of product metadata and user interactions to suggest similar items based on product titles, brands, and categories using TF-IDF vectorization and cosine similarity.

html numpy pandas python3 scikit-learn

Last synced: 28 Apr 2026

https://github.com/yanisdib/somint

Somint - AI-powered Trading Card Grading

ai api devcontainer docker numpy opencv python yolov8

Last synced: 28 Apr 2026

https://github.com/anshpg/popularity-based-book-recommendation-system

I developed a popularity-based book recommendation system using Google Colab, leveraging libraries like pandas, numpy, and matplotlib. The dataset was obtained from Kaggle.com, consisting of three CSV files: books.csv, ratings.csv, and users.csv.

collab csv kaggle-dataset matplotlib-pyplot numpy pandas python

Last synced: 28 Apr 2026

https://github.com/shreeparab1890/indian-elections-2019-analysis-eda

This IPython notebook is an exploratory data analysis (EDA) of the 2019 Indian Lok Sabha elections.

data data-analysis data-science data-visualization eda exploratory-data-analysis matplotlib numpy pandas plotly python python3 visualization

Last synced: 28 Apr 2026

https://github.com/ddeepanshu-997/datascience--olympics-games

In this repository I apply data science techniques to the dataset to extract valuable insights with the help of Python libraries, packages, etc.

data-science exploratory-data-analysis insights matplotlib-pyplot numpy pandas-dataframe problem-solving python visualization

Last synced: 28 Apr 2026

https://github.com/technologicat/pylu

Small nogil-compatible linear equation system solver

cython linear-equations numerical numpy python python2 python27 python3 python34 solver

Last synced: 28 Apr 2026

https://github.com/khalidkhankakar/numpy-for-data-science

Exploring the full NumPy library for data science

data-science dataexploration jupyter numpy python

Last synced: 28 Apr 2026

https://github.com/mahmood-anaam/wheeled-robot-simulation

A Python simulation of a robot equipped with four Swedish wheels, focusing on accurately modeling its kinematics and dynamics to analyze specific motion scenarios.

kinematics-dynamics matplotlib-python numpy python robotics-simulation roboticstoolbox-python swedish-wheels

Last synced: 28 Apr 2026

https://github.com/josualimbu/face-recognition

Face recognition with names; the dataset needs only one image per person to run.

face-detection face-recognition numpy python simple-face-recognition

Last synced: 28 Apr 2026

https://github.com/linsanity03/fraud_detection_using_autoencoders

Leverage autoencoders, a type of neural network, for detecting fraudulent financial transactions by identifying anomalies in transaction patterns that deviate from the norm.

matplotlib numpy pandas pytorch seaborn

Last synced: 28 Apr 2026

https://github.com/alonmell/modulnet

Lightweight NumPy-based neural network framework with core layers, loss functions, optimizers, and data utilities

deep-learning from-scratch machine-learning neural-network numpy

Last synced: 29 Apr 2026

https://github.com/fimbres/numpy-start

A crash project covering the most important NumPy functions and methods used in data science, AI, and machine learning.

data-science numpy python python-script

Last synced: 29 Apr 2026

https://github.com/thekartikeyamishra/interactive-ai-model-trainer

A Google Colab notebook for interactively training simple AI/ML classification models. Features CSV upload, dummy data generation, feature/target selection, model choice, and basic evaluation. Includes a user-friendly UI. Ideal for educational purposes. See screenshots below!

google googlecolab ipywidgets matpl numpy pandas python scikit-learn seaborn

Last synced: 29 Apr 2026

https://github.com/skypse/santander-coders-data_science-course

Data Science course offered by Santander, using Python!

jupyter-notebook numpy pandas-python python scikit-learn

Last synced: 29 Apr 2026

https://github.com/muazify/pyequationsolver

A Python script that solves mathematical equations for the variable 'x' using a hybrid approach, combining symbolic solving (for exact solutions) with numerical methods (for approximations when symbolic solving is insufficient).

equation-solver numpy python3 sympy

Last synced: 29 Apr 2026

https://github.com/bilgeswe/datascience

A statistics and data science project in Python using Google Colab tools, CSV, and XLSX

box-plot colab-notebook cvs data-science data-scraping data-visualization heatmaps numpy python statistical-analysis statistics xlsx

Last synced: 29 Apr 2026

https://github.com/0nom4d/306radiator

Mathematical EPITECH Project - This project reproduces the behaviour of a radiator placed at a position in a room of a given size.

epitech epitech-project numpy pytest python3

Last synced: 29 Apr 2026

https://github.com/w13b3/ssim-py

Structural Similarity (SSIM) index, where the only core dependency is NumPy

image-comparison numpy opencv-python pillow python3 ssim tested unittests

Last synced: 29 Apr 2026

https://github.com/giordanodaloisio/demv

Debiaser for Multiple Variables, a model- and data- agnostic method to improve fairness in binary and multi-class classification tasks

bias-mitigation numpy pandas python

Last synced: 29 Apr 2026

https://github.com/thaivincent/fft_image_compression

A simple image compression program using FFT and NumPy

fft image-processing numpy

Last synced: 29 Apr 2026

https://github.com/antonio-f/naive_convnet

Naive Convolutional Neural Network with TensorFlow 2

101 convolutional-neural-networks deep-learning easy matplotlib numpy simple tensorflow2 tutorial

Last synced: 29 Apr 2026

https://github.com/solomonbaez/koios

Neural Network Framework, built from scratch in NumPy.

machine-learning neural-network neural-networks numpy

Last synced: 29 Apr 2026

https://github.com/al-ghaly/e-commerce-a-b-testing

A statistical analysis project in which I performed an A/B test to analyze the effect of changing the user interface of an e-commerce company's website.

data-analysis matplotlib numpy pandas python python-data-analysis seaborn statistical-analysis statistics

Last synced: 29 Apr 2026

https://github.com/shariqayan/diwali_sales_analysis_python

The Diwali Sales Analysis project focuses on analyzing sales data during the Diwali festival to gain insights into customer behavior, improve customer experience, and optimize sales strategies.

data-visualization matplotlib numpy pandas python seaborn

Last synced: 29 Apr 2026

https://github.com/chandkund/predicting-student-grades

This project predicts student grades (G1, G2, G3) based on various features such as demographic information, study habits, family background, and other academic factors. The aim is to develop a predictive model that can assist in identifying students who may need additional support.

classification-algorithm ensemble-learning exploratory-data-analysis machine-learning matplotlib numpy pandas regression-models seaborn

Last synced: 29 Apr 2026

https://github.com/armahdavi/ai_ml_assisted_breast_cancer_tumor_detection

Consolidating tutorial codes for breast cancer tumor detection, covering ML fundamentals like classification, feature engineering, training, evaluation, and key performance metrics.

bias-variance feature-engineering logistic-regression machine-learning machine-learning-algorithms medical-application numpy pandas python random-forest recall-precision sklearn xgboost-classifier

Last synced: 29 Apr 2026

https://github.com/ksharma67/k-means-algorithm-on-the-iris-dataset

Applied the K-Means algorithm on the Iris dataset, and utilized the Silhouette Score method to find the best value of K

eda elbow-method iris-dataset kmeans-clustering matplotlib numpy pandas python seaborn silhouette-score

Last synced: 29 Apr 2026

https://github.com/djdurga/upi-transactions-eda

The dataset is a comprehensive record of Unified Payments Interface (UPI) transactions from January 2023 to December 2023. UPI has emerged as a prominent digital payment system in India, facilitating instant fund transfers between bank accounts.

matplotlib numpy pandas python

Last synced: 29 Apr 2026

https://github.com/prithviraj-2003/cognifyz-data-science-internship

🎓 Data Science Internship at Cognifyz Technologies 📅 Duration: 2 Months 🧠 Worked on real-world restaurant data 🗂️ Completed structured tasks across 3 levels 📌 Tasks focused on EDA, data preprocessing, visualization, and analysis 📎 Task descriptions provided in an attached PDF

data-analysis data-science data-visualization matplotlib numpy pandas python3

Last synced: 29 Apr 2026

https://github.com/adalbertobrant/fundamentalista

Graham and Greenblatt analysis for selecting cheap stocks with good long-term growth potential

graham greenblatt matplotlib numpy pandas python3 streamlit

Last synced: 29 Apr 2026

https://github.com/dmarks84/ind_project_readme-generator

Independent (personal) project in which I automatically generate README files for each of my repositories from my coursework

dataframes etl numpy pandas programming python

Last synced: 29 Apr 2026

https://github.com/rebeccamorolong/mtn-customer-churn-prdiction

This project addresses a real-world business problem: predicting customer churn for MTN, a major telecommunications company. Customer churn significantly impacts profitability, and accurate predictions enable targeted retention efforts.

anova-test matplotlib-pyplot numpy pandas python seaborn

Last synced: 29 Apr 2026

https://github.com/alfastrek/textinsight-pro

Webscraper with Natural Language Processing

beautifulsoup jupyter-notebook nltk numpy pandas python webscraping

Last synced: 29 Apr 2026

https://github.com/chandadiya2004/movie-recommendation-system

A Movie Recommendation System built using TfidfVectorizer and cosine similarity. The model processes a large dataset of movies and recommends similar movies based on a given input movie by analyzing textual features and calculating similarity scores.

cosine-similarity numpy pandas python sklearn tfidf-vectorizer

Last synced: 29 Apr 2026

https://github.com/monddavila/online-retail-data-analysis

Online Retail Exploratory Data Analysis with Python

data-analysis jupyter-notebook matplotlib numpy pandas python seaborn

Last synced: 29 Apr 2026

https://github.com/muhammadusman-khan/e-commerce-store-eda

Exploratory Data Analysis on E-commerce store data to uncover insights about sales trends, customer behavior, and product performance using Python libraries like Pandas, NumPy, and Matplotlib/Seaborn.

data-analysis data-science data-visualization e-commerce eda exploratory-data-analysis jupyter-notebook matplotlib numpy pandas python seaborn

Last synced: 29 Apr 2026

https://github.com/snehasunilnair/image-processing-projects

Image pre-processing techniques using OpenCV.

numpy opencv python3

Last synced: 29 Apr 2026

https://github.com/nellyndj52/breast-cancer-classification-ml

This project aims to build machine learning models to predict whether a tumor is malignant or benign based on the Breast Cancer dataset. Various classification algorithms are tested, including Logistic Regression, Random Forest, and Support Vector Machine (SVM).

breast-cancer cancer dataset flask-api heroku-deployment lime machine-learning mlp-networks numpy perceptron-neural-networks pickle segmentation support-vector-machines svm

Last synced: 03 Apr 2025

https://github.com/hamishivi/denserflow

A basic numpy neural net library. This achieved high marks in a uni assignment!

deep-learning neural-network numpy

Last synced: 18 May 2026

https://github.com/johanntf/direccion_maximo_crecimiento

Direction of maximum increase of a multivariable function, given a starting point

customtkinter matplotlib numpy python

Last synced: 18 May 2026

https://github.com/sanjanahombal/study-on-sentiment-analysis

This project explores the optimal combination of Bag-of-Words and TF-IDF vectorization with Naive Bayes and SVM for sentiment analysis. It evaluates performance using accuracy, precision, recall, and F1-score, addressing ethical concerns like data privacy and bias to improve sentiment classification in real-world applications.

bag-of-words confusionmatrix googlecollab gridsearch-crossvalidation matplotlib-pyplot naive-bayes-classifier numpy pandas seaborn sklearn svm-classifier tfidf-vectorizer

Last synced: 07 Jan 2026

https://github.com/sultanazhari/customer-habit-analysis-model

The company Megaline wants to develop a model that can analyze consumer behavior and recommend one of its two new plans: Smart or Ultra. In this classification task, we need to develop a model that chooses the right plan.

accuracy-score decision-tree-classifier logistic-regression matplotlib-pyplot numpy pandas python3 random-forest-classifier seaborn train-test-using-sklearn

Last synced: 11 Apr 2026

https://github.com/lefteris-souflas/economic-connectedness-analysis

A Jupyter notebook replicating studies on social capital from the journal Nature, analyzing economic connectedness, upward income mobility, and more. Python and the relevant datasets are used to recreate figures and analyses.

analytics choropleth-map json jupyter-notebook matplotlib matplotlib-pyplot numpy pandas plotly-express python3 regplot scatterplot seaborn stata visualization

Last synced: 07 Apr 2026

https://github.com/santiagoenriquega/custom_neural_net

This repository focuses on building a neural network from scratch using only base Python and NumPy.

gradient-descent neural-network numpy optimization python

Last synced: 18 May 2026

https://github.com/toon-van-berkel/imagetovideofx

ImageToVideoFX is a simple Python project that takes an image (JPG/PNG) and transforms it into a short video with a smooth zoom effect. You can optionally add background music. This is perfect for content creators or developers who want to automate turning images into engaging video content.

imagetovideo moviepy numpy pillow python

Last synced: 08 May 2025

https://github.com/trups39/nyc-green-taxi-trip-analysis

Analyzes NYC Green Taxi data, predicting fares and optimizing pickup locations with regression machine learning models to uncover travel patterns.

numpy pandas predictive-modeling python3 regression-models skit

Last synced: 17 Feb 2026

https://github.com/nazmul-1117/100-days-of-machine-learning

I'm Nazmul, and I'm so excited to start a new journey: 100 Days of Machine Learning. It's February 8, 2025. I'm so excited; let's see what happens, insha'Allah.

data-science machine-learning numpy pandas-dataframe python3 scikit-learn statistics

Last synced: 11 Aug 2025

https://github.com/dor-sketch/audio-visual-synth

Combining real-time graphics, audio synthesis, and interactive elements to create an engaging, cross-platform music visualizer.

image-processing midi-player music-composition musicsheet notes-tool numpy pygame pygbag pyopengl

Last synced: 18 May 2026

https://github.com/rajesh9943/developing-a-bike-buyer-sales-report-dashboard-using-excel

This dataset has details of 1000 users from different backgrounds and whether or not they buy a bike. This data can be used to build the dashboard in Google Sheets. There are some NA (Null / Empty) values injected in the dataset. Use this dataset for Data Cleaning, Exploration, and Visualization.

excel google-sheets numpy pandas-library python3

Last synced: 17 May 2026

https://github.com/yash22222/analysis-of-amcat-aspiring-minds

Analyzing employment outcomes for engineering graduates based on a provided dataset. Specifically, it aims to verify a claim regarding the salary range for fresh graduates in certain engineering roles and investigate potential relationships between gender and specialization preferences.

amcat aspiring-minds charts computer-engineering data-analytics data-vizualisation dataset graph machine-learning matplotlib numpy pandas python seaborn

Last synced: 12 Apr 2026

https://github.com/yesenvidnath/facerecognito-pythonfacialrecognitionapplication

A web application that detects faces from image links supplied in an Excel file. The project uses PyTorch and Flask.

facent flask numpy python

Last synced: 10 Apr 2026

https://github.com/dvamishkilapa/xolm2_py

Ported Xolm2 VSU

mathplotlib numpy pyqt5

Last synced: 23 Oct 2025

https://github.com/biocpy/mopsy

Matrix operations

matrix numpy scipy

Last synced: 07 Mar 2026

https://github.com/wondermongering/nonlineardatagen

A flexible Python framework for generating, fitting, and visualizing noisy nonlinear data. Perfect for educational purposes, algorithm testing, and demonstrating statistical concepts. Includes tools for various noise models, custom function fitting, robust error metrics, and publication-quality visualizations

curve-fitting data-simulation data-visualization education educational-tool interpolation mathematical-modelling matplotlib noise-modeling numpy regression-analysis scientific-computing scipy signal-processing statistics teaching-resource time-series-analysis

Last synced: 22 Aug 2025

https://github.com/aminkhavari78/-maternal-health-risk-data-set

In this part I'm working on Maternal Health Risk Prediction

data-science linearsvc machine-learning matplotlib numpy pandas scatter-plot sklearn

Last synced: 05 May 2026

https://github.com/kr1shnasomani/facetronix

A toolkit for face detection, annotation, and mask recognition.

computer-vision deep-learning keras neural-network numpy opencv tensorflow

Last synced: 05 Apr 2026

https://github.com/yalai92/master_academic_projects

This repository is dedicated to showcasing the academic projects completed during my Master in Data Science & AI. The main objective is to show a collection of projects in various data science fields, including: data cleaning & preprocessing, data analysis, data visualization, machine learning, clustering, among others.

automation clustering data-merging google-colab jupyter-notebook machine-learning matplotlib numpy pandas powerbi python sklearn xgboost

Last synced: 13 Jun 2025

https://github.com/muhkartal/fr-framework

A modular and extensible face recognition framework featuring real-time detection, facial landmark analysis, and recognition with API support.

dlib face-recognition fastapi numpy opencv pillow pydantic python-multipart requests scipy uvicorn

Last synced: 13 Apr 2026

https://github.com/nikhilfuke1/movies-recommendation-system-python-project-

This project is a simple movie recommendation system built using Python. It suggests movies based on the similarity of titles and a predefined similarity score matrix.

difflib numpy pandas python sklearn

Last synced: 30 Dec 2025

https://github.com/guifernandess7/ciencia-de-dados-e-ml-notebooks

Notebooks using the Pandas, NumPy, and Matplotlib modules for data analysis and plotting, and TensorFlow and seaborn for machine learning and visualization

machine-learning matplotlib numpy pandas python tensorflow

Last synced: 04 Apr 2026

https://github.com/lgope/book-recommender-system

Book recommender system: a collaborative-filtering-based recommender system using machine learning, NumPy, Pandas, Python, Flask, and a Kaggle dataset

collaborative-filtering flask html-css jupyter-notebook kaggle-dataset machine-learning numpy pandas python

Last synced: 13 Apr 2026

https://github.com/jai0212/gpt-rnn-poetry-generator

A generative pre-trained transformer (GPT) using a recurrent neural network (RNN) to generate poetry with customizable length and creativity index.

ai csv-files gpt machine-learning neural-networks nlp-machine-learning numpy pandas-library poetry-generator rnn-tensorflow training-dataset

Last synced: 12 Aug 2025

https://github.com/zalkklop/lvsm

Official code for "LVSM: A Large View Synthesis Model with Minimal 3D Inductive Bias" (ICLR 2025 Oral)

attention-mechanisms deep-learning github-config large-view-synthesis matplotlib multi-label-classification nltk numpy one-vs-rest python seaborn sql tensorflow transformers

Last synced: 09 Apr 2025

https://github.com/sudarshanc00/brain-tumor-classification

This project uses a deep learning model in PyTorch to classify brain MRI images into four tumor types, aiding early diagnosis and treatment planning. Two ResNet-based models were developed and optimized, achieving high accuracy to support healthcare professionals in identifying tumor categories.

matplotlib numpy pytorch resnet scikit-learn streamlit

Last synced: 10 Apr 2026

https://github.com/mathensley/asimpleneuralnetworklib

A simple library in Python for training datasets using neural network models

deep-learning neural-networks numpy python

Last synced: 19 Apr 2026

https://github.com/firewind25/gdp-data-extraction-and-processing

Extracts data from a web resource using web scraping and request APIs, processes it with the Pandas and NumPy libraries, and exports it to a CSV file for further analysis.

numpy pandas python webscraping

Last synced: 12 Jul 2025

https://github.com/blinksta1ker/spam-mail-classification

Supervised ML model using logistic regression to classify an email as ham or spam

jupyter-notebook logistic-regression machine-learning numpy pandas python sklearn supervised-machine-learning

Last synced: 09 Apr 2026

https://github.com/mnitin-reddy/image-compression-using-k-means-clustering

This project implements the K-means algorithm for clustering and image compression. It reduces the number of colors in an image using K-means, achieving compression while maintaining key visual features. The project demonstrates the process on a sample dataset and a real image.

image-processing imagecompression kmeans-clustering machine-learning matplotlib numpy

Last synced: 19 May 2026

https://github.com/paulinhok14/csgo-datascience-project

📊 Analysis of CS:GO grenade usage patterns and their impact on match outcomes using data science and statistical methods.

matplotlib mlflow numpy python scikit-learn scipy seaborn

Last synced: 30 Dec 2025

https://github.com/christabelsakyi/employee_attrition_prediction

This project implements a machine learning pipeline for predicting employee attrition using HR data. The system analyzes various factors that contribute to employee turnover and provides predictions to help HR departments take preventive measures.

machine-learning numpy pandas plotly python sklearn streamlit

Last synced: 09 Apr 2026

https://github.com/Madhuresh2011/Leetcode-Questions-Analysis-Using-Python

The LeetCode Question Analysis is a Python-based project designed to help users analyze and gain insights into LeetCode problems. It allows users to track, categorize, and evaluate their progress, identify patterns in problem-solving, and optimize their preparation.

analysis csv-files matplotlib-pyplot numpy pandas python python-project seaborn

Last synced: 13 Sep 2025

https://github.com/danish-jamil-01/image-dehazer

Removes haze from an image using the dark channel prior and a guided filter

dark-channel-prior guided-filter guided-filtering ndimage numpy open-cv opencv opencv-python pillow python scipy skimage tkinter

Last synced: 09 Apr 2026

https://github.com/redinfinitypro/ai_projects

Rating (7/10) The code is a chatbot that interacts with users based on article content. However, issues include incorrect sorting, typos, incorrect indentation, and lack of explanations for each part, potentially affecting the bot's robustness and understanding of context.

newspaper nltk numpy python random sklearn string warnings

Last synced: 20 Jul 2025

https://github.com/thavinduushan/linear-regression

A linear regression model to predict the premium price of an insurance policy. The dataset includes around 1,000 observations. The project, aimed at learning, was implemented in Jupyter notebooks using NumPy, Matplotlib, Pandas, Keras, and TensorFlow.

matplotlib numpy pandas python tensorflow

Last synced: 09 Apr 2026

https://github.com/tahir1072a/titanic-veri-analizi

A data analysis project done with NumPy and Pandas using the Kaggle Titanic dataset.

data-science kaggle numpy pandas python titanic-survival-prediction

Last synced: 30 Dec 2025

https://github.com/chubi-x/sort-it-image-classifier

Sort-it is an image classification application that classifies images of cats, dogs, bicycles, cars, and motorcycles.

image-classification machine-learning numpy pandas python streamlit tensorflow

Last synced: 05 Apr 2026