An open API service indexing awesome lists of open source software.

NumPy

NumPy is an open source library for the Python programming language, adding support for large, multidimensional arrays, and matrices, along with a large collection of high-level mathematical functions to operate on these arrays.

https://github.com/lohiyah/dataset_summariser

A Python tool for generating concise summaries of datasets using Pandas and NumPy for analysis, and Matplotlib and Seaborn for insightful visualizations. Ideal for quick exploratory data analysis (EDA).

matplotlib numpy pandas seaborn

Last synced: 05 May 2026

https://github.com/ranaessam03/climate-prediction

A machine learning project to predict rainfall using Decision Tree, k-Nearest Neighbors (kNN), and Naรฏve Bayes, with preprocessing, model evaluation, and custom kNN implementation.

decision-trees knn-classification machine-learning naive-bayes-classifier numpy sklearn

Last synced: 05 May 2026

https://github.com/batrick-swaistan/nalaiya_thiran

Nalaiya Thiran IBM Course Training

matplotlib numpy pandas-dataframe python

Last synced: 05 May 2026

https://github.com/pyladiesams/python-oss-sprints-mar2024

Sprints to contribute to the most popular data and AI Python OSS libraries

conda dask geopandas jupyter matplotlib numpy pandas pymc sklearn

Last synced: 05 May 2026

https://github.com/jonad/quora_data_analysis

Exploratory data analysis and feature engineering on Quora dataset.

exploratory-data-analysis feature-extraction numpy pandas python3 quora-dataset

Last synced: 05 May 2026

https://github.com/amruta33/lead_scoring_case_study

An education company named X Education sells online courses to industry professionals. On any given day, many professionals who are interested in the courses land on their website and browse for courses.

bagging linear-regression numpy pandas python3

Last synced: 05 May 2026

https://github.com/supernovasatsangi23/modifying-biomarker-gene-identification-for-effective-cancer-categorization

A project that focuses on implementing a hybrid approach that modifies the identification of biomarker genes for better categorization of cancer. The methodology is a fusion of MRMR filter method for feature selection, steady state genetic algorithm and a MLP classifier.

dataset deep-learning deep-neural-networks feature-selection genetic-algorithm machine-learning machine-learning-algorithms mlp-classifier mrmr neural-network numpy pandas-dataframe python python3 scikit-learn scikit-learn-python tkinter-gui tkinter-python

Last synced: 05 May 2026

https://github.com/hms75/movie_rating_analysis

A movie rating analysis which identifies trends amongst a dataset of 5000 movies.

data-analysis data-visualization matplotlib-pyplot numpy pandas python

Last synced: 05 May 2026

https://github.com/mat-kubiak/manual-ml

DIY ML framework (using only NumPy!) + various demos

artificial-intelligence examples machine-learning ml ml-framework neural-networks numpy

Last synced: 06 May 2026

https://github.com/chopikus/raytracing-bench

Benchmarking ray tracers in Python, Java, Numpy, Cupy, Pypy

cupy java numpy pypy python raytracing

Last synced: 06 May 2026

https://github.com/sopheakchan/dull-razor-algorithm

DullRazor hair removal for dermoscopic skin images.

numpy opencv python

Last synced: 06 May 2026

https://github.com/deanlogan/data-analysis-course

Code created when completing the Data Analysis with Python Course on freecodecamp.org

course data-analysis numpy pandas python python3

Last synced: 06 May 2026

https://github.com/fazal-coder/capturing_img

python code for Creating blue color image

loop numpy opencv processing python scheduler threading

Last synced: 06 May 2026

https://github.com/cycle-sync-ai/student-score-analysis

A data-driven student performance analysis project using UCI dataset (396 students, 33 features). Implements machine learning models (K-means, PCA, Decision Tree, Random Forest, Linear Regression) to analyze academic patterns and predict student scores based on lifestyle, health, and study habits.

clustering clustering-algorithm decision-trees feature-engineering learning-management-system linear-regression machine-learning machine-learning-algorithms matplotlib numpy pandas pca pickle prediction prediction-algorithm scikit-learn score seaborn student

Last synced: 06 May 2026

https://github.com/naveen88112/genetic_algos_maze_solver

Genetic Algorithm Maze Solver This project utilizes a Genetic Algorithm (GA) to solve the optimization of a maze problem. It evolves the population of promising solutions to the best solution finding shortest path between start and end point using selection, crossover, and mutation.

deap-library genetic-algorithms matplotlib maze-solver numpy python

Last synced: 06 May 2026

https://github.com/albert-2002/robust-penalized-empirical-likelihood-estimation-method-for-linear-regression

Implementation for the paper titled: Robust Penalized Empirical Likelihood Estimation Method for Linear Regression

linear-regression machine-learning numpy pandas python regression regression-analysis statistics

Last synced: 06 May 2026

https://github.com/christiansassi/signal-image-and-video-project

Project developed by Pietro Bologna (@bolognapietro) and Christian Sassi for the Signal, Image and Video course.

image-processing mri-segmentation numpy opencv python

Last synced: 06 May 2026

https://github.com/kavayk29/text-summarization-fine-tuning-pegasus-model

This project uses the Pegasus model to generate concise, coherent text summaries. Fine-tuned on large datasets, it excels in abstractive summarization, ideal for automating tasks like summarizing articles and research papers.

datasets huggingface-transformers matplotlib nltk numpy pandas teansform tqdm

Last synced: 06 May 2026

https://github.com/kumar-o/customer-churn-prediction

๐Ÿ” Predict customer churn using a synthetic dataset with advanced models and metrics to enhance business retention strategies and decision-making.

churn-prediction customer-churn-prediction customer-survival-analysis data-science factorization-machines flask-application gridsearchcv libsvm machine-learning model-evaluation numpy pipelines roc-auc scikit-learn seaborn smote survival-analysis xgboost4j

Last synced: 06 May 2026

https://github.com/prab9166/-web-content-extraction-and-sentiment-analysis-

web content extraction and sentiment analysis on URLs provided in an Excel file. Its divided into classes to manage different stages such as data loading, web scraping, text processing, sentiment scoring, and output generation

beautifulsoup lxml nltk-python numpy pandas python re requests xlsxwriter

Last synced: 06 May 2026

https://github.com/lunanvarg/bid-data-cac22039

En este repositorio se vรฉ mi TP Integrador del programa Codo a Codo: Big Data

colab data-studio-google deepnote excel matplotlib mysql numpy pandas-library python seaborn

Last synced: 06 May 2026

https://github.com/sahilmate/gesturepainter

Virtual Painting Application

gesturepainter mediapi numpy opencv virtualpaint

Last synced: 06 May 2026

https://github.com/piero24/lossfuncition-4sn

This research will show an innovative method useful in the segmentation of polyps during the screening phases of colonoscopies. To do this we have adopted a new approach which consists in merging the hybrid semantic network (HSNet) architecture model with the Reagion-wise(RW) as a loss function for the backpropagation process.

dataset deep-learning deep-neural-networks loss loss-functions matplotlib neural-network numpy opencv polyp-detection polyp-segmentation python python3 pytorch segmentation segmentation-models torchvision transformers unipd unipd-dei

Last synced: 06 May 2026

https://github.com/vandanac20/advanced-real-estate-valuation-with-ensemble-regression-models

Advanced Real Estate Valuation with Ensemble Regression Models

machine-learning numpy pandas python sklearn

Last synced: 06 May 2026

https://github.com/bartosz-ziolkowski/social_data

Data analysis of San Francisco's crime data set from 2003 to 2018

data-science data-visualization jupyter-notebook numpy pandas python

Last synced: 06 May 2026

https://github.com/jayanthpotluri5513/heart-disease-prediction-using-ml

This project aims to develop a machine learning model for heart disease prediction, leveraging the potential of advanced algorithms and a comprehensive dataset.

machine-learning matplotlib numpy pandas python

Last synced: 06 May 2026

https://github.com/jai0212/course-selector-uoft

A program that suggests university courses based on your program and provides a custom GUI to customize your course plan. Created to help UofT students

beautifulsoup csv graphs graphs-algorithms networkx-graph numpy os parser pickle pillow plotly python tkinter-gui tkmacosx

Last synced: 06 May 2026

https://github.com/abdulrahmanaymann/data-storage-and-information-retrieval

This is a Python project that performs tokenization, stop word removal, positional indexing, phrase query searching, term frequency-inverse document frequency (TF-IDF) calculation, cosine similarity computation, and document ranking.

cosine-similarity information-retrieval nltk numpy pandas postional-index python ranking stop-word-removal tf-idf tokenization

Last synced: 06 May 2026

https://github.com/aleksibovellan/ai-2048-game

AI Heuristics in Python for solving the 2048 Game

2048 2048-game 2048-solver ai expectimax game heuristics numba numpy python python3

Last synced: 06 May 2026

https://github.com/shimu-i/pythonmastery

Diving into Python essentials, libraries, and tricks to become an AI & ML pro. Coding, learning, and having fun while building the future!

numpy pandas python

Last synced: 07 May 2026

https://github.com/malintha-induwara/python-for-ds

This repository contains my personal notes, exercises, and examples for Python

numpy pandas python webscraping

Last synced: 07 May 2026

https://github.com/sriyaroy/coding-challenges

50 Machine Learning & Computer Vision challenges ranging from Easy to Hard to sharpen you ML engineering skills.

coding-challenges computer-vision interview-prep machine-learning numpy python pytorch

Last synced: 07 May 2026

https://github.com/ly16302/mask

mask generation library for wyvern

f2py fortran mpi4py numpy openmp testing

Last synced: 21 Mar 2025

https://github.com/abdulsamie10/pythonbasics

This repository contains few tasks, which I developed just to get a strong grip on Python Programming Langauge.

ai labtasks lambda numpy python pythonlab

Last synced: 20 Apr 2026

https://github.com/hamishivi/denserflow

A basic numpy neural net library. This achieved high marks in a uni assignment!

deep-learning neural-network numpy

Last synced: 18 May 2026

https://github.com/johanntf/direccion_maximo_crecimiento

Direcciรณn de mรกximo crecimiento en una funciรณn multivariable dado un punto de partida

customtkinter matplotlib numpy python

Last synced: 18 May 2026

https://github.com/sanjanahombal/study-on-sentiment-analysis

This project explores the optimal combination of Bag-of-Words and TF-IDF vectorization with Naive Bayes and SVM for sentiment analysis. It evaluates performance using accuracy, precision, recall, and F1-score, addressing ethical concerns like data privacy and bias to improve sentiment classification in real-world applications.

bag-of-words confusionmatrix googlecollab gridsearch-crossvalidation matplotlib-pyplot naive-bayes-classifier numpy pandas seaborn sklearn svm-classifier tfidf-vectorizer

Last synced: 07 Jan 2026

https://github.com/sultanazhari/customer-habit-analysis-model

Megaline company wants to develop a model that can analyze consumer behavior and recommend one of Megaline's two new plans: Smart or Ultra. In this classification task, we need to develop a model that is able to choose the right package

accuracy-score decision-tree-classifier logistic-regression matplotlib-pyplot numpy pandas python3 random-forest-classifier seaborn train-test-using-sklearn

Last synced: 11 Apr 2026

https://github.com/lefteris-souflas/economic-connectedness-analysis

Jupyter notebook, replicating studies on social capital from Nature journal, analyze economic connectedness, upward income mobility, and more. Python and relevant datasets are utilized to recreate figures and analyses.

analytics choropleth-map json jupyter-notebook matplotlib matplotlib-pyplot numpy pandas plotly-express python3 regplot scatterplot seaborn stata visualization

Last synced: 07 Apr 2026

https://github.com/santiagoenriquega/custom_neural_net

This repository focuses on building a neural network from scratch using only base python and Numpy.

gradient-descent neural-network numpy optimization python

Last synced: 18 May 2026

https://github.com/toon-van-berkel/imagetovideofx

ImageToVideoFX is a simple Python project that takes an image (JPG/PNG) and transforms it into a short video with a smooth zoom effect. You can optionally add background music. This is perfect for content creators or developers who want to automate turning images into engaging video content.

imagetovideo moviepy numpy pillow python

Last synced: 08 May 2025

https://github.com/trups39/nyc-green-taxi-trip-analysis

Analyze NYC Green Taxi data, predicting fares and optimizing pickup locations using Regression machine learning models uncover travel patterns.

numpy pandas predictive-modeling python3 regression-models skit

Last synced: 17 Feb 2026

https://github.com/hansalemaos/group_by_continuous_sequence

Groups lists/NumPy arrays by continuous sequence

group list numpy python

Last synced: 19 Feb 2026

https://github.com/dor-sketch/audio-visual-synth

Combining real-time graphics, audio synthesis, and interactive elements to create an engaging, cross-platform music visualizer.

image-processing midi-player music-composition musicsheet notes-tool numpy pygame pygbag pyopengl

Last synced: 18 May 2026

https://github.com/rajesh9943/developing-a-bike-buyer-sales-report-dashboard-using-excel

This dataset has details of 1000 users from different backgrounds and whether or not they buy a bike. This data can be used to build the dashboard in Google Sheets. There are some NA (Null / Empty) values injected in the dataset. Use this dataset for Data Cleaning, Exploration, and Visualization.

excel google-sheets numpy pandas-library python3

Last synced: 17 May 2026

https://github.com/yash22222/analysis-of-amcat-aspiring-minds

Analyzing employment outcomes for engineering graduates based on a provided dataset. Specifically, it aims to verify a claim regarding the salary range for fresh graduates in certain engineering roles and investigate potential relationships between gender and specialization preferences.

amcat aspiring-minds charts computer-engineering data-analytics data-vizualisation dataset graph machine-learning matplotlib numpy pandas python seaborn

Last synced: 12 Apr 2026

https://github.com/yesenvidnath/facerecognito-pythonfacialrecognitionapplication

this is a web application created to detect faces using a pre-given image links through Excel format, this project uses Pytorch & Flask

facent flask numpy python

Last synced: 10 Apr 2026

https://github.com/dvamishkilapa/xolm2_py

Ported Xolm2 VSU

mathplotlib numpy pyqt5

Last synced: 23 Oct 2025

https://github.com/biocpy/mopsy

Matrix operations

matrix numpy scipy

Last synced: 07 Mar 2026

https://github.com/wondermongering/nonlineardatagen

A flexible Python framework for generating, fitting, and visualizing noisy nonlinear data. Perfect for educational purposes, algorithm testing, and demonstrating statistical concepts. Includes tools for various noise models, custom function fitting, robust error metrics, and publication-quality visualizations

curve-fitting data-simulation data-visualization education educational-tool interpolation mathematical-modelling matplotlib noise-modeling numpy regression-analysis scientific-computing scipy signal-processing statistics teaching-resource time-series-analysis

Last synced: 22 Aug 2025

https://github.com/aminkhavari78/-maternal-health-risk-data-set

In this part I'm working on Maternal Health Risk Prediction

data-science linearsvc machine-learning matplotlib numpy pandas scatter-plot sklearn

Last synced: 05 May 2026

https://github.com/kr1shnasomani/facetronix

A toolkit for face detection, annotation, and mask recognition.

computer-vision deep-learning keras neural-network numpy opencv tensorflow

Last synced: 05 Apr 2026

https://github.com/yalai92/master_academic_projects

This repository is dedicated to showcasing the academic projects completed during my Master in Data Science & AI. The main objective is to show a collection of projects in various data science fields, including: data cleaning & preprocessing, data analysis, data visualization, machine learning, clustering, among others.

automation clustering data-merging google-colab jupyter-notebook machine-learning matplotlib numpy pandas powerbi python sklearn xgboost

Last synced: 13 Jun 2025

https://github.com/muhkartal/fr-framework

A modular and extensible face recognition framework featuring real-time detection, facial landmark analysis, and recognition with API support.

dlib face-recognition fastapi numpy opencv pillow pydantic python-multipart requests scipy uvicorn

Last synced: 13 Apr 2026

https://github.com/nikhilfuke1/movies-recommendation-system-python-project-

This project is a simple movie recommendation system built using Python. It suggests movies based on the similarity of titles and a predefined similarity score matrix.

difflib numpy pandas python sklearn

Last synced: 30 Dec 2025

https://github.com/guifernandess7/ciencia-de-dados-e-ml-notebooks

Notebooks com uso dos mรณdulos Pandas, Numpy e Matplotlib para anรกlise de dados e criaรงรฃo de grรกficos e tensorFlow e seaborn para machine learning e visualizaรงรฃo

machine-learning matplotlib numpy pandas python tensorflow

Last synced: 04 Apr 2026

https://github.com/lgope/book-recommender-system

Book recommender system a Collaborative Filtering Based Recommender System using Machine Learning, Numpy, Pandas Python, Flask, Kaggle Dataset

collaborative-filtering flask html-css jupyter-notebook kaggle-dataset machine-learning numpy pandas python

Last synced: 13 Apr 2026

https://github.com/nellyndj52/breast-cancer-classification-ml

This project aims to build a machine learning models to predict whether a tumor is malignant or benign based on the Breast Cancer dataset. Various classification algorithms are tested, including Logistic Regression, Random Forest, and Support Vector Machine (SVM).

breast-cancer cancer dataset flask-api heroku-deployment lime machine-learning mlp-networks numpy perceptron-neural-networks pickle segmentation support-vector-machines svm

Last synced: 03 Apr 2025

https://github.com/zalkklop/lvsm

Official code for "LVSM: A Large View Synthesis Model with Minimal 3D Inductive Bias" (ICLR 2025 Oral)

attention-mechanisms deep-learning github-config large-view-synthesis matplotlib multi-label-classification nltk numpy one-vs-rest python seaborn sql tensorflow transformers

Last synced: 09 Apr 2025

https://github.com/sudarshanc00/brain-tumor-classification

This project uses a deep learning model in PyTorch to classify brain MRI images into four tumor types, aiding early diagnosis and treatment planning. Two ResNet-based models were developed and optimized, achieving high accuracy to support healthcare professionals in identifying tumor categories.

matplotlib numpy pytorch resnet scikit-learn streamlit

Last synced: 10 Apr 2026

https://github.com/mathensley/asimpleneuralnetworklib

A simple library in Python for training datasets using neural network models

deep-learning neural-networks numpy python

Last synced: 19 Apr 2026

https://github.com/firewind25/gdp-data-extraction-and-processing

Data Extraction from a web resource using Web-Scraping and request APIs. Processed it using Pandas and NumPy libraries. Finally exporting it to csv file for further analysis

numpy pandas python webscraping

Last synced: 12 Jul 2025

https://github.com/blinksta1ker/spam-mail-classification

Supervised ML model using Logistic Regression model to classify an email as Ham or Spam

jupyter-notebook logistic-regression machine-learning numpy pandas python sklearn supervised-machine-learning

Last synced: 09 Apr 2026

https://github.com/mnitin-reddy/image-compression-using-k-means-clustering

This project implements the K-means algorithm for clustering and image compression. It reduces the number of colors in an image using K-means, achieving compression while maintaining key visual features. The project demonstrates the process on a sample dataset and a real image.

image-processing imagecompression kmeans-clustering machine-learning matplotlib numpy

Last synced: 19 May 2026

https://github.com/paulinhok14/csgo-datascience-project

๐Ÿ“Š Analysis of CS:GO grenade usage patterns and their impact on match outcomes using data science and statistical methods.

matplotlib mlflow numpy python scikit-learn scipy seaborn

Last synced: 30 Dec 2025

https://github.com/kasraskari/learn-python-codes

A beginner-friendly repository to learn Python programming through structured examples and scripts, covering topics from basics to advanced concepts.

jupyter-notebook matplotlib matplotlib-tutorial numpy numpy-tutorial pandas pandas-tutorial python python-learning python-tutorial python-tutorials scipy scipy-tutorial tutorials

Last synced: 09 Apr 2026

https://github.com/Madhuresh2011/Leetcode-Questions-Analysis-Using-Python

The LeetCode Question Analysis is a Python-based project designed to help users analyze and gain insights into LeetCode problems. It allows users to track, categorize, and evaluate their progress, identify patterns in problem-solving, and optimize the preparation.

analysis csv-files matplotlib-pyplot numpy pandas python python-project seaborn

Last synced: 13 Sep 2025

https://github.com/magzybog12/real-time-object-distance-measurement-using-opencv-python

Using an A4 paper as a guide, performing the distance measurements to find the height and width of objects placed in the region

numpy opencv-python pycharm-ide

Last synced: 05 May 2026

https://github.com/christabelsakyi/employee_attrition_prediction

This project implements a machine learning pipeline for predicting employee attrition using HR data. The system analyzes various factors that contribute to employee turnover and provides predictions to help HR departments take preventive measures.

machine-learning numpy pandas plotly python sklearn streamlit

Last synced: 09 Apr 2026

https://github.com/redinfinitypro/ai_projects

Rating (7/10) The code is a chatbot that interacts with users based on article content. However, issues include incorrect sorting, typos, incorrect indentation, and lack of explanations for each part, potentially affecting the bot's robustness and understanding of context.

newspaper nltk numpy python random sklearn string warnings

Last synced: 20 Jul 2025

https://github.com/danish-jamil-01/image-dehazer

Remove the haze from the image using dark channel prior and guided filter

dark-channel-prior guided-filter guided-filtering ndimage numpy open-cv opencv opencv-python pillow python scipy skimage tkinter

Last synced: 09 Apr 2026

https://github.com/tahir1072a/titanic-veri-analizi

Kaggle Titanic veri seti kullanฤฑlarak NumPy ve Pandas ile yapฤฑlan veri analizi projesi.

data-science kaggle numpy pandas python titanic-survival-prediction

Last synced: 30 Dec 2025

https://github.com/chubi-x/sort-it-image-classifier

Sort-it is an image classification application that classifies images of cats, dogs, bicycles, cars, and motorcycles.

image-classification machine-learning numpy pandas python streamlit tensorflow

Last synced: 05 Apr 2026

https://github.com/memgonzales/order-brushing-shopee2020

Solution to the Week 1 Contest (Student Category) of the 2020 Shopee Code League on order brushing detection

data-analytics e-commerce fraud-detection jupyter-notebook numpy order-brushing pandas python rule-based

Last synced: 11 Apr 2026

https://github.com/omarnahhass/crypto-cast

CryptoCast is a machine learning-powered tool that showcases three fundamental prediction models for forecasting cryptocurrency prices

cryptocurrency machine-learning numpy pandas python

Last synced: 05 May 2026

https://github.com/pramodyasahan/custom-image-classification-model

This repository hosts a TensorFlow-based custom image classification model. The model is designed to classify images into binary classes, labeled as 'Happy' and 'Sad'.

computer-vision convolutional-neural-networks cv2 keras matplotlib numpy tensorflow

Last synced: 11 Apr 2026

https://github.com/bishopce16/movies-etl

The goal of this project was to create an automated pipeline for a company Amazing Prime that is hosting a hackathon.

json numpy pandas pgadmin4 psycopg2 python regular-expressions sql sqlalchemy time

Last synced: 11 Apr 2026

https://github.com/elmezianech/email-spam-ham-classifier-lr

Email Classifier: A machine learning project using Python that categorizes emails into spam and ham (non-spam). Utilizes the Scikit-Learn library, employing logistic regression and TF-IDF (Term Frequency-Inverse Document Frequency) vectorization for text analysis and classification.

ai emails jupyter-notebook logistic-regression machine-learning ml numpy pandas python spam-detection spam-filtering tfidfvectorizer

Last synced: 09 Apr 2026

https://github.com/sagravela/plotground

Plotting Tool with Matplotlib and Tkinter. Video Demo:

matplotlib numpy python tkinter

Last synced: 19 May 2026

https://github.com/urvee1810/bitcoin-price-forecasting-using-arma

The analysis reveals the challenges of predicting Bitcoin prices during highly volatile periods and demonstrates how traditional time series models perform under different market conditions. The project includes comparative analysis of model performance during stable and volatile market phases.

arima arma augmented-dickey-fuller-test feature-engineering machine-learning matplotlib mplfina numpy pandas python random-forest randomforestregressor scikit-learn seaborn statsmodels time-series-analysis

Last synced: 06 Mar 2026