An open API service indexing awesome lists of open source software.

NumPy

NumPy is an open source library for the Python programming language, adding support for large, multidimensional arrays and matrices, along with a large collection of high-level mathematical functions to operate on these arrays.

https://github.com/sandipkushwaha20/brain_tumour_detector

This project utilizes deep learning to detect and classify brain tumors from MRI images. The primary objective is to determine whether a tumor is present and, if so, classify its type with high accuracy.

deeplearning keras numpy pandas python tensorflow

Last synced: 12 Apr 2026

https://github.com/wisdom-osborn/data-analytics-course-online-

🔍 Data Analytics with Python: hands-on course materials. Jupyter notebooks, projects, and datasets based on the freeCodeCamp Data Analysis with Python certification. Learn NumPy, Pandas, data cleaning, and visualization through real-world examples.

data data-analysis data-science data-visualization freecodecamp numpy pandas pandas-dataframe project python

Last synced: 19 Apr 2026

https://github.com/supriya811106/whatsapp-chat-analyzer-app

Analyze WhatsApp chats with Python, Streamlit, and data visualization. Explore messaging patterns, content trends, and emoji usage to uncover insights from your conversations.

analyzer-web-app chat-analytics chat-analyzer data-preprocessing data-visualization emojis machine-learning matplotlib natural-language-processing nltk numpy pandas plotly python3 seaborn sentiment-analysis streamlit-webapp text-analysis user-engagement

Last synced: 30 Dec 2025

https://github.com/trissim/polystore

Framework-agnostic multi-backend storage abstraction for ML and scientific computing

backend data io jax ml multi-framework numpy pytorch scientific-computing storage tensorflow zarr

Last synced: 12 Apr 2026

https://github.com/ahmedabdalkreem/skin-cancer

This project extracts features from images using a CNN and builds a neural network to determine whether a patient's skin cancer is malignant or benign.

cnn deep-learning matplotlib neural-network numpy pandas python3 sklearn-library

Last synced: 12 Apr 2026

https://github.com/ishtufinquant/portfolio-optimizer

Optimizes stock portfolios using simulation and real-world constraints (SLSQP).

backtesting finance numpy pandas portfolio-optimization python quant scipy yfinance

Last synced: 12 Apr 2026

https://github.com/nidomika/digit-recognition

Digit recognition project in Python

digit-recognition numpy opencv python tensorflow

Last synced: 12 Apr 2026

https://github.com/frolov-andrey2405/prime-numbers-and-polar-coordinate-system

Prime numbers and polar coordinate system

matplotlib numba numpy python

Last synced: 01 May 2026

https://github.com/devbigboy/php-crud-project

PHP-CRUD-Project

jupyter numpy pandas

Last synced: 20 Apr 2026

https://github.com/elifftosunn/data-labeling

This tool was built to automate the labeling step needed before using machine learning models: after the data scraping stage, it labels the data according to the desired category and deletes the data that will not be used from the database.

base64 kaleido numpy os pandas plotly postgresql psycopg2 psycopg2-binary time

Last synced: 12 Apr 2026

https://github.com/testetson22/abelian_sandpiles

Simple App that implements, builds and tests an approximation of the Abelian sandpile model

github-actions numpy pytest

Last synced: 12 Apr 2026

https://github.com/yosrak5/predictive_maintenance

End-to-end predictive analytics that predicts hardware maintenance before it occurs, using data preprocessing and machine learning classification models (XGBoost, Random Forest).

data-factory data-perspective machine-learning-workbench matplotlib numpy pandas python random-forest seaborn xgboost

Last synced: 25 Feb 2026

https://github.com/daksh777/expressando

Expressando is a rock, paper, scissors game made using Python (TensorFlow, Keras and OpenCV)

keras keras-tensorflow machine-learning numpy opencv opencv-python tensorflow

Last synced: 13 Apr 2026

https://github.com/bachboy0/montecarlo

Python-based Monte Carlo simulations for statistical analysis and modeling.

monte-carlo-methods numpy python

Last synced: 21 Apr 2026

https://github.com/ayhem18/tinyad

A simple Python library to better understand automatic differentiation

automatic-differentiation mathematical-programming numpy python3

Last synced: 29 Apr 2026

https://github.com/alphacrypto246/stock-price-movement-prediction

A project leveraging Polynomial Regression to predict stock price movements based on historical data. Includes data preprocessing, feature engineering, visualization, and model evaluation to provide insights for informed trading decisions.

machine-learning numpy pandas polynomial-regression scikit-learn yfinance

Last synced: 13 Apr 2026

https://github.com/williamfridh/umu_5tf078_paper_keyboard_tracker

Project exploring how a CNN alongside computer vision can be used to detect drawn keys and track finger input.

cnn-classification computer-vision keras-tensorflow numpy python tensorflow

Last synced: 07 Jan 2026

https://github.com/rkarahul/ok.win-big-small-predictor

Predict the next “Big” or “Small” outcome on the OK.Win lottery-style game using OCR + time-series features + ML.

joblib numpy opencv-python paddleocr paddlepaddle paddlepaddle-gpu pandas python sckit-learn

Last synced: 13 Apr 2026

https://github.com/alinenog/desenvolve_gb_2022

Grupo Boticário's Desenvolve 2022 training program in the data field

data-analysis data-science googlesheet machine-learning numpy pandas python

Last synced: 13 Apr 2026

https://github.com/vasu7052/recognizing-handwritten-digits

This is a machine learning project created in Python using Neural Networks and Supervised Learning Algorithms.

machine-learning machine-learning-algorithms numpy python scikit-learn

Last synced: 13 Apr 2026

https://github.com/athul64/tmdb-dataset-analysis

This data set contains information about 10,000 movies extracted from TMDB. The dataset contains movies from 1960 to 2015, including user ratings and revenue. Original data from Kaggle.

data-visualization dataframe eda numpy pandas python

Last synced: 14 Apr 2026

https://github.com/murugavl/customer_churn_prediction

Customer churn prediction is a crucial task in business analytics, helping companies identify customers who are likely to leave and take preventive actions. This project utilizes Machine Learning (ML) techniques to analyze customer behavior and predict churn.

jupyter-notebook machine-learning matplotlib numpy pandas pickle python scikit-learn streamlit

Last synced: 13 Apr 2026

https://github.com/cezlul/analyse-ventes-immobilier

ML solution for Parisian real-estate analysis: automatic classification of apartments vs. commercial premises (K-means, 91%) and price prediction (regression, R²=0.98) on 26K transactions. Values a €169M portfolio, with data-driven strategic recommendations.

data-analysis jupyter-notebook machine-learning matplotlib numpy pandas python sklearn

Last synced: 13 Apr 2026

https://github.com/samestrin/image-manipulation-api-digitalocean

A Python DigitalOcean App Platform based REST image manipulation API using Flask, NumPy, and OpenCV that runs in a Docker container.

api docker flask image image-processing numpy python python3 rest rest-api

Last synced: 13 Apr 2026

https://github.com/danhenriquex/data_science_and_machine_learning

AI, Data Science, Machine Learning, and Deep Learning with TensorFlow

deep-learning deep-neural-networks machine-learning matplotlib numpy pandas tensorflow

Last synced: 13 Apr 2026

https://github.com/chandkund/predicting-diabetes-onset

The "Predicting Diabetes Onset" project aims to build a machine learning model that predicts whether an individual has diabetes based on various health-related features. The dataset used for this project includes attributes related to medical history and physical measurements.

deep-learning numpy pandas python seaborn sklearn-library sklearn-metrics visualization

Last synced: 13 Apr 2026

https://github.com/armahdavi/data_analytics_statistics_plotting_pm_airborne_sampling

All code for the data pipeline processing, statistical modelling, descriptive statistics, and plot visualizations from the airborne phase of Mahdavi et al. (2021) (Environmental Pollution). Project milestone: 2018-2021.

data-science data-visualization machine-learning matplotlib-pyplot numpy pandas python scikit-learn scipy-stats statistics

Last synced: 13 Apr 2026

https://github.com/hilarionengarejr/movie-recommender-app

Sentiment analysis on user reviews for movie recommendations using Content Based Filtering.

docker flask nltk numpy pandas python3 scikit-learn selenium

Last synced: 10 Apr 2026

https://github.com/tharushiimasha/python-practice

This repository contains Python programs and exercises from practice sessions. It covers Python basics, file handling, and libraries like NumPy and pandas for data manipulation.

api-integration csv-files file-handling json-files jupyter-notebook numpy oops-in-python pandas python web-scraping

Last synced: 13 Apr 2026

https://github.com/darshanpakhale250-gif/customer-churn-prediction-ml

A machine learning project to predict customer churn using regression and classification models including logistic regression, decision tree, and random forest. Performed EDA, visualizations, and model evaluation. The dataset is taken from Kaggle and implemented in Google Colab.

colab-notebook customer-churn-analysis data-science decision-trees kaggle logistic-regression machine ml numpy pandas python random-forest

Last synced: 13 Apr 2026

https://github.com/blairnangle/python3-numpy-ta-lib

Docker image with NumPy and TA-Lib installed. Built and uploaded to Docker Hub using GitHub Actions.

docker numpy python3 ta-lib

Last synced: 13 Apr 2026

https://github.com/asier-ortiz/python-for-data-science-and-machine-learning-bootcamp

Python for Data Science and Machine Learning Bootcamp: NumPy, Pandas, Seaborn, Matplotlib, Plotly, Scikit-Learn, TensorFlow, and more

matplotlib numpy pandas plotty python scikit-learn seaborn tensorflow

Last synced: 13 Apr 2026

https://github.com/anujdutt9/language-model-using-rnn

Text generation using a basic RNN built from scratch with Python and NumPy.

language-model numpy python rnn text-generation

Last synced: 08 May 2026

https://github.com/somyaaaaaa2004/travelling-sales-man-problem

This project is a soft computing-based solution to the Traveling Salesman Problem (TSP) using a Genetic Algorithm (GA), implemented entirely from scratch in Python on Google Colab. The goal of the TSP is to find the shortest possible route that visits each city exactly once and returns to the starting point. It's a classic NP-hard combinatorial

genetic-algorithm jupyter-notebook matplotlib-pyplot numpy pandas python travelling-salesman-problem

Last synced: 06 May 2026

https://github.com/harmanveer-2546/eda-on-indian-railways

Indian Railways is a statutory body under the ownership of the Ministry of Railways of the Government of India that operates India's national railway system. As of 2023, it manages the fourth largest national railway system by size with a track length of 132,310 km, running track length of 106,493 km and route length of 68,584 km.

clean-data eda exploratory-data-analysis geometry geopandas indian-railways json linestring matplotlib numpy os pandas plotly python railway seaborn shapely train visualization

Last synced: 09 Apr 2026

https://github.com/harmanveer-2546/prediction-of-ticket-cancellation

The objective is to develop a model that accurately predicts whether users will cancel their tickets. Each cancellation incurs a fine for the ticket registration site from the passenger company.

datetime evaluation gridsearchcv labelencoder numpy pandas standardscaler stratified-k-fold train-test-split xgboost-model

Last synced: 04 May 2026

https://github.com/hootbu/cyberpunk-filter-desing

This project aims to design filters with a cyberpunk aesthetic by performing low-level image manipulation in Python.

computer-vision image-manipulation matplotlib numpy python

Last synced: 08 May 2026

https://github.com/lingumd/school_district_analysis

Updated test score data and school district analysis using Python.

averages binning jupyterlab numpy pandas pandas-loc python

Last synced: 13 Apr 2026

https://github.com/csengupta1101/rock-paper-scissor-game

Rock Paper Scissors game built with Python 3, using Jupyter Notebook as the IDE. Code file and README included.

game if-else-statements numpy python python3 random

Last synced: 11 May 2026

https://github.com/prowilliam/maze_generator

A maze generator with adjustable dimensions that draws the maze's solution as a red route and exports the result in PNG format.

numpy pillow python

Last synced: 28 Apr 2026

https://github.com/alepm1990/nba-analytics

Analyses and projects in Python using sports databases

matplotlib-pyplot numpy pandas pyhon python3 seaborn

Last synced: 05 May 2026

https://github.com/zvdy/movie_recommendation

Movie Recommendation Search Engine using Jupyter Notebooks, Pandas, NumPy, scikit-learn, and ipywidgets

data-science jupyter-notebook machine-learning numpy pandas python scikit-learn

Last synced: 13 Apr 2026

https://github.com/mayankmittal29/algovision-statistical_methods_in_ai

Implementation of various machine learning algorithms from scratch, including Linear Regression, K-Nearest Neighbors, Decision Trees, and K-Means clustering. Also includes EDA on the data and implementations of the LSH, IVF, and SLIC algorithms, with evaluation metrics.

decision-tree-classifier eda gradient-descent image-segmentation ivf knn-classification linear-regression lsh-implementation matplotlib-pyplot numpy pandas python3 seaborn sgd-optimizer sklearn slic-superpixel-algorithm

Last synced: 11 Apr 2026

https://github.com/hari7261/playwithdata-python

This is one of the repositories where I have put a lot of data science and machine learning questions and their solutions. I hope you will find something better here than on some other platforms. Thank you, and happy exploring.

data-analysis data-science data-science-learning machienlearning matplotlib matplotlib-python ml numpy numpy-arrays numpy-library pandas pandas-dataframe pandas-library python python-script sklearn

Last synced: 13 Apr 2026

https://github.com/kecnry/nparray

High-Level Wrappers for Building and Manipulating NumPy Arrays

numpy numpy-arrays python

Last synced: 08 Jan 2026

https://github.com/tnleite/loan-approval-prediction

This repository presents a predictive loan-approval model focused on minimizing default risk. Using EDA and machine learning algorithms (Random Forest, XGBoost), we tune the threshold to maximize recall on defaulters, contributing to efficient risk management.

classification-algorithm data-science exploratory-data-analysis machine-learning-algorithms machine-learning-models matplotlib numpy scikit-learn scipy seaborn xgboost-classifier

Last synced: 13 Apr 2026

https://github.com/raeinlayeghpour/test-bench-for-signal-validation-simulated-3-phase-system

Automated Project - Test Bench for Signal Validation (Simulated 3-Phase System)

numpy pandas pycharm pytest python

Last synced: 13 Apr 2026

https://github.com/harmanveer-2546/credit-card-fraud-detection

The Credit Card Fraud Detection Problem involves modeling past credit card transactions with the knowledge of which ones turned out to be fraudulent. This model is then used to identify whether a new transaction is fraudulent or not. Our aim here is to detect 100% of the fraudulent transactions while minimizing the incorrect fraud classifications.

ann catboost-classifier eda lightgbm machine-learning matplotlib neural-network numpy pandas python random-forest seaborn xbg

Last synced: 13 Apr 2026

https://github.com/shibbir24/a-data-driven-approach-to-food-security-and-supermarket-accessibility

A Data-Driven Approach to Food Security and Supermarket Accessibility

data-analysis matplotlib numpy pandas python3 seaborn

Last synced: 13 Apr 2026

https://github.com/n1k1f0rm/ml-predicts

A place where you can see, in a transparent way, how ML algorithms work

machine-learning ml numpy python

Last synced: 21 Apr 2026

https://github.com/iv4n-ga6l/Simple-Image-Segmentation

Simple implementation of image segmentation using the Watershed algorithm

imagesegmentation matplotlib numpy opencv python watershed-algorithm

Last synced: 28 Apr 2025

https://github.com/hajaarh/malaria_hematie_cnn

Convolutional neural network algorithm

numpy pandas python resnet-50 sklearn-library tensorflow vgg16

Last synced: 13 Apr 2026

https://github.com/nambukeerthi/copper_project

Data Analysis Skills – Work with real-world data. Machine Learning Experience – Build and train models. Web App Development – Create interactive dashboards

eda machine-learning numpy pandas python streamlit

Last synced: 13 Apr 2026

https://github.com/shwetapardhi/assignment-03-q2--hypothesis-testing

ANOVA F-test statistics. A hospital wants to determine whether there is any difference in the average Turn Around Time (TAT) of reports of the laboratories on their preferred list. They collected a random sample and recorded TAT for reports of 4 laboratories. TAT is defined as the time from sample collection to report dispatch. Analyze the data and determine wheth

annova ftest hypothesis-testing numpy pandas python scipy significance-testing stats

Last synced: 11 Apr 2026

https://github.com/saraasgari99/videoclassificationusingdeeplearning

ML pipeline for biological video analysis using cutting-edge CNN architectures and industry-standard data processing and ML techniques

computer-vision data-processing keras-tensorflow machine-learning numpy sklearn tensorflow

Last synced: 13 Apr 2026

https://github.com/rajan-bhateja/data-analysis-with-python

Cleaning, analysis and visualization on various datasets using Python and its various libraries via Jupyter Notebook

data-cleaning data-science jupyter-notebook matplotlib numpy pandas python seaborn

Last synced: 30 Apr 2026

https://github.com/1adityakadam/uber_data_analytics

End-to-end Google BigQuery + Looker Studio data analytics project: transforming NYC taxi data into actionable intelligence

bigquery looker-studio mage-ai-pipeline numpy pandas sql

Last synced: 13 Apr 2026

https://github.com/abhisek-13/whatsapp-chat-analyzer

The WhatsApp Chat Analyzer is a data analysis project that provides insights into WhatsApp chats. It analyzes chat data to show metrics like the number of lines, most used letter, chatting duration, media files shared, most used emojis, and group member activity. The results are displayed on a user-friendly dashboard built with Streamlit.

data-analysis data-mining data-visualization eda machine-learning machine-learning-algorithms matplotlib numpy pandas python seaborn sklearn

Last synced: 13 Apr 2026

https://github.com/thanhloc81/rfm-model-project

✨ Build a flow to deploy Segmentation evaluation through Python programming.

customer-segmentation numpy pandas python rfm-analysis

Last synced: 03 May 2026

https://github.com/cyb3r-jak3/docker-alpine-pandas

Python Alpine image with pandas and NumPy installed

alpine-image numpy pandas-python python-alpine

Last synced: 30 May 2026

https://github.com/jfjlaros/online-array

Online array, pass a function as a NumPy array.

array function numpy online unbounded

Last synced: 13 May 2026

https://github.com/nyx1311/toxicity-detector-using-bilstm

🚀 What we built: An AI-powered Women’s Safety & Well-Being Detector, a web app that flags multiple forms of online abuse in real time and offers tools for emotional recovery. 📊 Under the hood: BiLSTM + Word2Vec embeddings for deep, context-aware detection. Trained on 21K+ labeled comments across 7 toxicity categories. Built with Python, Tensor

epoch genism gpu keras-tensorflow matplotlib model nlp nlp-machine-learning nlpaug-textual numpy pandas pandas-library python3 streamlit tensorflow word2vec

Last synced: 14 Apr 2026

https://github.com/hilalozdemirbuyukasik/deep-learning

A collection of deep learning projects demonstrating RNNs, BiLSTMs, CNNs, and basic neural networks applied to time series forecasting, text sentiment analysis, image classification, and tabular data tasks, with examples of data preprocessing, model training, evaluation, and visualization.

bilstm cnn keras matplotlib nn numpy rnn scikit-learn tensorflow

Last synced: 12 Apr 2026

https://github.com/ediloaz/linear-regression-with-lists-numpy-and-pytorch

Completed as homework #01 of the Machine Learning course in a Computer Science master's program.

arrays-raw jupyter-notebook machine-learning numpy python pytorch

Last synced: 13 Apr 2026

https://github.com/abhinavraj009/x_ray-conversion-csv-

To convert an X-ray image into a CSV format for machine learning or data analysis, you'll typically want to represent the image as numerical pixel values.

machine-learning numpy opencv pandas python

Last synced: 13 Apr 2026

https://github.com/takana671/partiallytransparentimage

A simple tool to make an image partially transparent

geomipterrain numpy opencv pillow python3 transparent-image

Last synced: 07 May 2026

https://github.com/alexpeits/indicator-plot

Gtk3 indicator for generating 2d plots from user input

gtk indicator numpy plot sympy

Last synced: 13 Apr 2026

https://github.com/aiwithqasim/ab-testing-for-shoefly.com

In this project, I analyzed the SHOEFLY.COM data set to find valuable insights and determine which platform is more useful for advertising.

numpy pandas python3 sqlite3

Last synced: 13 Apr 2026

https://github.com/spear97/montecarlo-simulaton

This project involved developing a Monte Carlo Simulation using Python, implemented with libraries like NumPy, Pandas, and Matplotlib within Visual Studio. The program's primary goal is to generate figures illustrating compound interest results over time for a specified amount.

data-science matplotlib matplotlib-figures numpy pandas-python python python-3 python3

Last synced: 13 Apr 2026

https://github.com/subratamondal1/heart-attack-prediction

Heart Attack Prediction of patients based on the required data. Data Ingestion - Data Preparation - Exploratory Data Analysis (EDA) - Modelling - Evaluation.

data-analysis data-science data-visualization kaggle-dataset machine-learning matplotlib-pyplot numpy pandas python3 scikit-learn seaborn

Last synced: 09 Apr 2026

https://github.com/patex1987/ghcn-temperature-dataset

Analysis of the GHCN (Global Historical Climatology Network) temperature dataset using NumPy

climate-data jupyter-notebook numpy python temperature-data

Last synced: 10 May 2026

https://github.com/18mahi/digital_cave

An intermediate-level deep learning project that compares Convolutional Neural Networks (CNN) and Multi-Layer Perceptrons (MLP) on the MNIST handwritten digits dataset. This project demonstrates data augmentation, learning rate scheduling, and visual comparison of model performance

cnn confusion-matrix data-augmentation data-science deep-learning evaluation-metrics jupyter-notebook keras learning-rate-scheduler machine-learning matplotlib mlp numpy python3 scikit-learn seaborn tensorflow

Last synced: 13 Apr 2026

https://github.com/pranavsp108/time-series-forcasting

A time-series forecasting project to predict hourly energy consumption using Python, Pandas, and an XGBoost regression model.

data-analysis data-science energy-consumption forecasting matplotlib numpy pandas python scikit-learn sustainability time-series xgboost

Last synced: 10 Apr 2026

https://github.com/ahmadbuilds/fake-news-classifier

Classifies news articles as real or fake using an NLP pipeline with TF-IDF + n-grams and machine learning models. Includes text preprocessing, feature engineering, model training, and evaluation.

fastapi logistic-regression matplotlib n-grams nextjs nltk numpy pandas python3 random-forest-classifier react scikit-learn seaborn supervised-learning tf-idf typescript xgboost-classifier

Last synced: 11 Apr 2026

https://github.com/nihanthbhargav/time-series-stock-market

This project combines computer vision and NLP by segmenting pet images with a U-Net model and generating captions using CNN-RNN/LSTM. Using the Oxford-IIIT Pets dataset, it demonstrates a unified pipeline that integrates pixel-level segmentation with automatic caption generation for meaningful image understanding.

matplotlib numpy pandas plotly python scikit-learn seaborn

Last synced: 11 Apr 2026

https://github.com/dibsthegreat/titantic-dataset-analytics

DASC4850 Final Project where I did EDA to determine the survivability of Titanic guests depending on Age, Gender, Wealth, etc.

data-science data-visualization matplotlib numpy pandas python random-forest-classifier

Last synced: 13 Apr 2026

https://github.com/sweta-kaundilya/finance-data-project

This data project focuses on exploratory data analysis of bank stock prices from the financial crisis through early 2016.

financial-analysis jupyter-notebook matplotlib numpy pandas-python seaborn

Last synced: 16 May 2026