An open API service indexing awesome lists of open source software.

NumPy

NumPy is an open source library for the Python programming language, adding support for large, multidimensional arrays, and matrices, along with a large collection of high-level mathematical functions to operate on these arrays.

https://github.com/shreeparab1890/movie-recommender-system

This notebook is trying to build a model which will recommend the movie based on given movie and genre. In this we use Popularity Based Recommendation, Content Based Recommendation and Collaborative Filtering based Recommendation.

bag-of-words cosine-similarity matplotlib numpy pandas python scikit-learn sklearn vectorization

Last synced: 09 Apr 2026

https://github.com/ankushmallick1100/Diabetes-Prediction-of-Females-using-Maching-Learning-Techniques

This is a machine learning project that uses various machine learning algorithms to predict whether a patient is diabetic or not. Here various machine learning algorithms like SVM, RF Classifier, DT Classifier, KNN, LR , LRwith CV, NB Classifier, and XGB are used. For this work, a website is made with Python Streamlit library. Paper is ongoing.

classification cross-validation decision-tree-classifier diabetes diabetes-prediction jupyter jupyter-notebook logistic-regression machine-learning machine-learning-algorithms numpy pandas python random-forest random-forest-classifier regression streamlit support-vector-machine xgboost xgboost-classifier

Last synced: 29 Sep 2025

https://github.com/malwarebo/carcinet

Basic breast cancer detection using Machine Learning

deep-learning keras machine-learning matplotlib numpy python3 scipy tensorflow

Last synced: 09 Apr 2026

https://github.com/matheussc017/irisclassification_sklearn

Application of Machine Learning technique to classify 3 types of iris flower

data-science machine-learning numpy pandas python

Last synced: 09 Apr 2026

https://github.com/ankushmallick1100/django-insurance-premium-predictor-web-app

This is a web app where a user can signup to the website first and then login to access the website. Then, he/she can give their age, select his/her gender, bmi, number of children, select whether he/she is a smoker or not, and select his/her region. Gradient Boosting Regressor is used in this project which gives the best accuracy of 89.798.

bootstrap bootstrap5 css django gradientboosting gradientboostingregressor html insurance-prediction insurance-premium-predictor machine-learning matplotlib numpy pandas python regression regression-models sklearn tailwind tailwind-css tailwindcss

Last synced: 09 Apr 2026

https://github.com/aakashks/ct-reconstruction-2d

Python Implementation of Computed Tomography (CT) reconstruction for 2D images

computed-tomography ct-reconstruction numpy

Last synced: 09 Aug 2025

https://github.com/ZauJulio/FeaturesAnalyzer

Python data analyze and machine learning tool

gtk machine-learning numpy pandas python sklearn

Last synced: 10 Aug 2025

https://github.com/dwarkadhish2005/house-price-predictor

Applied neural network for machine learning and done many analysis of data

keras matplotlib numpy pandas python seaborn sklearn

Last synced: 09 Apr 2026

https://github.com/paulo-santos-ds/desenvolvimento_pocos_petroleo

Este projeto tem como objetivo desenvolver um modelo que possa prever o melhor local para a perfuração de poços de petróleo. Utilizando modelo de Machine Learning para poder prever o lucro de cada poço.

math matplotlib-pyplot numpy pandas python3 sklearn

Last synced: 09 Apr 2026

https://github.com/amanpatel-1234/automatic_data_analyzer_app

📊 Automatic Data Analyzer is a Streamlit web app that performs complete data analysis with just a single file upload. Simply upload your CSV file, and the app automatically generates: Data Overview: Preview, summary statistics, and data types. Visual Analysis: Histograms, distribution plots, and interactive charts for numeric columns.

app data-science data-visualization matplotlib-pyplot numpy pandas python3 seaborn seaborn-plots streamlit

Last synced: 09 Apr 2026

https://github.com/c0sogi/python-dcm

A high-performance Python package for handling ETAS DCM(Data Conversion Format) files used in engine calibration tools like INCA, MDA, EHANDBOOK, and CANape.

automotive dcm ecu-calibration etas inca numpy pandas python

Last synced: 12 Aug 2025

https://github.com/alexandrbig1/py-design-analysis-algo

Advanced Python project on the design and analysis of algorithms. Includes practical modules on dynamic programming, graph theory, big data (Bloom filter, HyperLogLog), parallel computing (MapReduce), and optimization. Uses libraries like NumPy, Matplotlib, NetworkX, BTrees, PrettyTable, and Datasketch for efficient computation and visualization.

algorithms asyncio git heapq matplotlib numpy python python3 turtle

Last synced: 09 Apr 2026

https://github.com/htlinh0604/invoice-data-extraction

This project demonstrates a classic OCR pipeline. This Flask app takes an image, applies an OpenCV preprocessing pipeline, and uses Tesseract OCR to digitize Vietnamese invoices (Bách Hóa Xanh)..

flask numpy ocr-recognition opencv pandas python regex tesseract-ocr

Last synced: 10 Nov 2025

https://github.com/mhmudfzli/exploring-mental-health-data

This project demonstrates a comprehensive approach to solving a regression problem using various machine learning models. The notebook includes: Data Preprocessing, Exploratory Data Analysis (EDA), Model Training, Hyperparameter Tuning, Model Evaluation, Feature Importance

catboost lightgbm matplotlib numpy pandas scikit-learn seaborn xgboost

Last synced: 09 Apr 2026

https://github.com/jswong65/machine_learning_nanodegree

Projects of Udacity Machine Learning nanodegree

machine-learning numpy pandas python scikit-learn scipy

Last synced: 09 Apr 2026

https://github.com/nirmalyabag20/wine-quality-prediction-machine-learning

This project analyzes the chemical properties of wines to identify key factors influencing quality. By leveraging machine learning techniques, i aim to develop predictive models that accurately classify wine quality, providing valuable insights for producers and enthusiasts alike.

k-neighbors-classifier logistic-regression machine-learning matplotlib numpy pandas python random-forest seaborn svc

Last synced: 09 Apr 2026

https://github.com/thedevreda/python

My Full Backup Packagaes & Files Guide for Python 👷‍♂️

jupyter-notebook matplotlib numpy pandas python seaborn

Last synced: 20 Apr 2026

https://github.com/Tritonix711/FractureAI

This tool helps people upload X-rays to find broken bones. It uses a machine to mark where the breaks are and gives users marked pictures to download. A smart computer also helps people understand their broken bones and gives them advice.

css cv2 flask gorq html javascript matplotlib npm numpy pandas pydantic python react scikit-learn torch torchvision ultralytics

Last synced: 19 Aug 2025

https://github.com/mohamed3nan/udacity

Udacity Data Analysis Nanodegree Program

data-analysis data-visualization numpy pandas python

Last synced: 10 Apr 2026

https://github.com/mousazourob/stockdataresampler

Python script that resamples an instruments minute by minute candle stick data to any interval

alpha-vantage-api numpy pandas python

Last synced: 07 May 2026

https://github.com/yogeshnile/flight-delay-prediction

In this repo i developed predict a flight delay using machine learning.

flight-delay-prediction jupyter-notebook machine-learning numpy pandas python3 seaborn sklearn

Last synced: 13 Feb 2026

https://github.com/kunalshelke90/predict-bank-credit-risk-using-south-german-credit-data

This is an end-to-end ML project, which aims at developing a classification model for the problem of classifying a given customer profile into either of the risk category (safe or not safe). The final classifier used for this project is CatBoost classifier. Deployed in AWS.

aws cassandra catboost-classifier classification credit-risk data data-science dataanalysis dockerfile finance financial-analysis flask github-actions logging machine-learning mlflow numpy pandas python

Last synced: 03 Jan 2026

https://github.com/nikita620/phishing-website-detection-by-machine-learning-techniques

Phishing is one of the familiar attacks that trick users to access malicious content and gain their information. In terms of website interface and uniform resource locator (URL), most phishing webpages look identical to the actual webpages. Various strategies for detecting phishing websites, such as blacklist, heuristic, Etc.

bagging boosting decision-trees numpy pandas python random-forest sckiit-learn sql

Last synced: 10 Apr 2026

https://github.com/gandhasiri-rahul-mohan/random-forests-q2-company_data

About the data: Let’s consider a Company dataset with around 10 variables and 400 records.

data-science machine-learning numpy pandas python random-forest seaborn-plots

Last synced: 07 Apr 2026

https://github.com/thamirisq/data_analysis_culinary_journey

Project to analyze key information from Culinary Jorney, a marketplace Company, and develop an initial strategy to lead the entire team effectively.

folium-maps groupby-method jupyter-notebook markercluster marketplace-ds matplotlib-pyplot numpy pandas pandas-dataframe streamlit

Last synced: 10 Apr 2026

https://github.com/williyam-m/movie-recommendation-system

Developed a web app with a cosine similarity machine learning model for personalized recommendations based on user history, likes, bookmarks, and activity. Implemented user auth and CRUD operations for movies.

django machine-learning numpy pandas prediction-model python scikit-learn

Last synced: 10 Apr 2026

https://github.com/aryansk/fake-news-detection

A sophisticated machine learning solution to detect fake news using multiple classification algorithms. Identify the credibility of news articles with advanced text analysis techniques!

fake-news-detection machine-learning machine-learning-algorithms matplotlib numpy pandas python random-forest-classifier scikit-learn seaborn

Last synced: 10 Apr 2026

https://github.com/datarohit/fifa-2020--data-analysis

This is dataset is from Kaggle.com which contains data of 18000+ fifa players with more than 100 features about them for analysis. Simple analysis performed on this Dataset.

exploratory-data-analysis exploratory-data-visualizations matplotlib-pyplot numpy pandas seaborn

Last synced: 10 Apr 2026

https://github.com/g-eoj/cv-tl-keras

Use the cross validation functions from scikit-learn to evaluate image classification transfer learning with Keras models.

cross-validation keras numpy scikit-learn transfer-learning

Last synced: 10 Apr 2026

https://github.com/datarohit/titanic-survival-data--classification

This is a Titanic Survival Data from Kaggle Competition with Multiple Classification Algorithm testing and Detailed Exploratory Data Analysis.

classification-algorithm kaggle-competition machine-learning-algorithms matplotlib-pyplot numpy pandas seaborn sklearn

Last synced: 10 Apr 2026

https://github.com/cploutarchou/covid-19-tracker-django-app

COVID-19 Tracker Dashboard for Cyprus.

django numpy pandas plotly redis

Last synced: 11 Apr 2026

https://github.com/smolodtsova13/website

Веб-сайт для стоматологии

bootstrap django numpy pandas python

Last synced: 11 Apr 2026

https://github.com/sizzlins/kalkulator-ai

A Simple Command Line Input Symbolic Regression Engine and Computer Algebra System (CAS) capable of discovering the laws of the universe, solving calculus, algebra, and trigonometrics.

calculator calculus cli computer-algebra-system curve-fitting machine-learning mathematics numpy physics python scientific-computing scikit-learn sparse-regression symbolic-regression sympy

Last synced: 13 Jan 2026

https://github.com/tasninanika/callifornia-housing-price-prediction-svr

Support Vector Regression (SVR) is a type of Support Vector Machine used for predicting continuous values.

matplotlib numpy pandas python3 scikit-learn seaborn svm-regression

Last synced: 11 Apr 2026

https://github.com/ryannapp12/quant_trading_engine

A modular, and scalable quantitative trading engine built in Python. This project demonstrates efficient data caching with SQLite, concurrent backtesting, and advanced risk analytics, showcasing best practices in clean code architecture and performance optimization.

algorithmic-trading backtesting dash data-analysis data-visualization fintech lstm machine-learning numpy pandas plotly python quantitative-finance real-time risk-management sqlite technical-analysis tensorflow time-series-analysis trading-strategies

Last synced: 11 Apr 2026

https://github.com/dagdelenbunyamin/barcode-scanner-flb

Ein interaktives Barcode-Scanner-Tool zur Verwaltung von Schülerabmeldungen. Mit Webcam-Unterstützung, automatischer Zeiterfassung und PDF-Export für Lehrkräfte – entwickelt mit Streamlit, OpenCV und SQLite.

fpdf numpy opencv pandas python pyzbar streamlit

Last synced: 11 Apr 2026

https://github.com/shanmukhsrisaivedullapalli/automatic-ticket-classification

This project processes customer complaint data using pandas for data manipulation and applies text preprocessing techniques, including lemmatization, to clean and normalize complaint text. The `tqdm` library provides progress bars for efficient tracking of text processing tasks.

matplotlib neural-networks nlp numpy pandas python3 scikit-learn seaborn tensorflow tqdm wordcloud

Last synced: 11 Apr 2026

https://github.com/mihirh19/python

a repository for collaborative development and version control using Git that houses Python code files, documentation, tests, and other project-related files.

algorithms data-structures django fastapi flask machine-learning numpy pandas python

Last synced: 11 Apr 2026

https://github.com/nvinayvarma189/deep-learning-with-tensorflow

This repo consists all the basics of tensorflow and all major Deep Learning algorithms and techniques implemented in tensorflow. It will explain you the basics of tensorflow. You can visualize how tensors flow through a grapg while you create one

autoencoder cnn deep-learning gan matplotlib neural-network numpy pandas python3 reinforcement-learning rnn tensorflow

Last synced: 11 Apr 2026

https://github.com/tasninanika/k-means-clustering

An interactive and insightful customer segmentation project using K-Means Clustering.

matplotlib numpy pandas plotly python3 scikit-learn seaborn

Last synced: 11 Apr 2026

https://github.com/suraj-sedai/lstm-deep-learning

A step-by-step guide to mastering sequence prediction using TensorFlow and LSTM. This repository covers everything from basic linear predictions to advanced multi-step forecasting, many-to-many LSTM models, and real-world applications like stock price forecasting and sales prediction.

lstm-neural-networks lstm-sentiment-analysis numpy pandas python sklearn tensorflow

Last synced: 11 Apr 2026

https://github.com/avijit-jana/used-car-price-prediction

A Streamlit-based application utilizing machine learning to predict used car prices based on features like make, model, year, fuel type, and transmission, enhancing customer experience and streamlining the pricing process.

carpriceprediction data-science jupyter-notebooks machine-learning numpy pandas pickle pricing-data-science python3 streamlit streamlit-webapp webapplication

Last synced: 11 Apr 2026

https://github.com/rahulvictor12/the-movie-database-data-scrapper

A Python web scraper that collects movie data from The Movie Database (TMDB). It uses `requests`, `BeautifulSoup`, and `pandas` to extract titles, ratings, genres, and cast details from multiple pages. The data is structured into DataFrames and saved as a CSV, perfect for analysis or integration into projects.

beautifulsoup colab-notebook dataframes numpy pandas python requests testing webscraping

Last synced: 11 Apr 2026

https://github.com/tasninanika/mammographic-masses-analysis-dt

This project uses a Decision Tree Classifier to predict whether a detected mammographic mass is benign (0) or malignant (1) based on input features.

decision-tree-classifier numpy pandas pyhton3 scikit-learn

Last synced: 11 Apr 2026

https://github.com/coqui123/tradegpt

TradeGPT is a full-stack cryptocurrency trading application that combines a modern Fresh (Deno) frontend with a Python (FASTAPI) backend for Coinbase integration and Azure AI Services for intelligent trading analysis. 💹

analytics automation cryptocurrency data deno fastapi fresh numpy python trading-algorithms trading-strategies tradingbot typescript

Last synced: 11 Apr 2026

https://github.com/sergio11/breast_cancer_diagnosis_mlp

🩺🔬 MLP-based Breast Cancer Diagnosis: Predicts tumor malignancy from image features, aiding in early detection. 📊🤖

deep-learning deep-neural-networks machine-learning mlp-classifier mlp-networks numpy sklearn sklearn-classify

Last synced: 21 Feb 2026

https://github.com/narenkhatwani/arkouda-projects

This repository contains the source codes of the projects done using Arkouda (a software package that allows a user to interactively issue massive parallel computations on distributed data using functions and syntax that mimic NumPy, the underlying computational library used in most Python data science workflows.)

arkouda data-analysis data-analytics data-science high-performance high-performance-computing highperformancecomputing numpy pandas parallel-computing parallel-processing parallelization python

Last synced: 17 Apr 2026

https://github.com/ishutak/disease_prediction

An AI-powered disease prediction system that uses machine learning to predict diseases based on symptoms. The system employs an ensemble of models including Random Forest and Neural Networks to provide accurate predictions with confidence levels.

css3 htlm5 javascript jquery numpy pandas pytorch scikit-learn select2

Last synced: 11 Apr 2026

https://github.com/m-rishab/housing-price-prediction

The Housing Price Prediction Accuracy Improvement project is a data-driven initiative focused on enhancing the precision and reliability of housing price predictions. This project encompasses a multidisciplinary approach, combining data science, machine learning, and real estate insights to optimize the accuracy of forecasts in the housing market.

gridsearchcv hyperparameter-tuning matplotlib matplotlib-pyplot numpy pandas pickle python3 randomforestregressor scipy sklearn

Last synced: 11 Apr 2026

https://github.com/elazzouzihassan/si-fraud-detection-prototype

Système de Détection des Fraudes avec Python (Prototype).

googlecolab matplotlib numpy pandas python scikit-learn seaborn

Last synced: 11 Apr 2026

https://github.com/rkarahul/machinelearning

This is for my all machine learning code

analysis computer-vision matplotlib mysql nlp numpy pandas python

Last synced: 11 Apr 2026

https://github.com/prithivsakthiur/qwen-image-edit-2509-loras-fast

Qwen-Image-Edit-2509-LoRAs-Fast is a high-performance, user-friendly web application built with Gradio that leverages the advanced Qwen/Qwen-Image-Edit-2509 model from Hugging Face for seamless image editing tasks.

aoti diffusers diffusion-models flash-attention-3 huggingface-spaces huggingface-transformers kernel numpy peft python pytorch qwen-image-edit qwen-image-edit-2509 qwen2-5-vl qwen3-vl torch torchvision

Last synced: 03 May 2026

https://github.com/mrtaz77/numerical-analysis

Hub of numerical methods, implemented using python and its data analysis libraries like numpy and matplotlib.

interpolation matplotlib-pyplot newton-raphson numpy regression

Last synced: 19 Jun 2026

https://github.com/gaurangdave/house_price_predictions

Machine Learning Application to predict House Prices

hands-on learning-by-doing machine-learning numpy pandas python scikit-learn

Last synced: 11 Apr 2026

https://github.com/hamdaniqhmqd/project-predict-saham-bbri

Repository Project-Predict-Saham-BBRI is a group assignment project that uses Streamlit, scikit-learn, and related technologies to build a BBRI stock price prediction application based on day, week, and month input.

data-visualization numpy pandas python sklearn streamlit

Last synced: 11 Apr 2026

https://github.com/harmanveer2546/movie-industry

Investigate the film industry to gain sufficient understanding of what attributes to success and in turn utilize this analysis to create actionable recommendations for companies to enter the industry.

business business-analytics dataanalysis datetime matplotlib numpy pandas python seaborn

Last synced: 11 Apr 2026

https://github.com/beliavsky/array-compiler

Translate Python/NumPy and other array languages to Fortran

array-programming fortran linear-algebra modern-fortran numpy python scientific-computing transpiler

Last synced: 27 May 2026

https://github.com/harsh8088/py_cluster

Grouping in data values.

numpy python sklearn

Last synced: 11 Apr 2026

https://github.com/jayakrishnagaddam/deep_sheild_

This is a Machine Learning applied Artificial intelligence project. Deepsheild is a cutting-edge deepfake detection system designed to safeguard the integrity of visual content in an era where synthetic media poses unprecedented challenges. Deepsheild stands as a robust fortress against the proliferation of manipulated videos and images.

artificial-intelligence bootstrap css cv2 deep-learning flask html machine-learning mongodb numpy tensorflow torch

Last synced: 11 Jan 2026

https://github.com/serhatderya/fall_detection

This project implements a computer vision-based fall detection system using deep learning techniques. The system can detect whether a person has fallen in images or video frames.

ai artificial-intelligence artificial-neural-networks computer-vision deep-learning deep-neural-networks deeplearning falldetection matplo neural-network numpy opencv pil pillow pytorch tensorflow yolo yolov5 yolov8

Last synced: 11 Apr 2026

https://github.com/pradipece/two_dimensional_data_visualization

This repository for different kinds of Data science and Machine learning project

data-science data-visualization jupyter-notebook machine-learning matplotlib numpy pandas python seaborn sklearn statistics

Last synced: 11 Apr 2026

https://github.com/poornesh-v/bank-customer-churn-prediction-system

The repository is meant to predict customer churn in a banking dataset based on transaction history and demographics.

jupyter-notebook matplotlib numpy pandas-python python seaborn

Last synced: 11 Apr 2026

https://github.com/vidhi1290/zomato-data-analysis

Zomato Data Analysis - Explore the world of Zomato restaurant data through Python and data analysis. Uncover trends and insights using Pandas for data manipulation and Matplotlib for visualization. Join us in this journey to reveal the hidden stories within the data!

data-analysis data-analysis-python data-science data-visualization dataprocessing machine-learning machine-learning-algorithms matplotlib numpy pandas python scikit-learn zomato-data-analysis

Last synced: 11 Apr 2026

https://github.com/michael-insights/portfolio

This repository showcases my projects and skills in Data Analytics, Data Science, and Machine Learning. It includes hands-on work in data analysis, predictive modeling, and machine learning algorithms, aimed at solving real-world problems.

data-analytics data-science data-visualization datapreprocessing jupyter-notebooks machine-learning matplotlib numpy pandas predictive-modeling python scikit-learn sql

Last synced: 11 Apr 2026

https://github.com/bogdang888g/shad_homework_2024-2025

Репозиторий с домашними работами из курса "Школы Аналитиков Данных" от МТС

matplotlib numpy pandas python scipy seaborn

Last synced: 11 Apr 2026

https://github.com/adityashinde716/liam-the-healthcare-chatbot

Made a healthcare chatbot Liam which will help you to detect your health problem and gives precautions.

csv numpy pandas python python3 random random-forest random-forest-classifier requests sklearn sklearn-library sklearn-model

Last synced: 05 May 2026

https://github.com/kaleharshavardhan07/spam_mail-_detector_ai_model

This project implements a spam detection system for SMS messages using machine learning techniques.

mathplotlib nltk numpy panda python scikit-learn seaborn

Last synced: 11 Apr 2026

https://github.com/zazi2002/neuroscience-project-lfp-signalanalysis

This project is part of the Neuroscience of Learning, Memory, Cognition course. The goal of this assignment is to introduce students to Local Field Potential (LFP) signals, which are critical for understanding brain activity at a local neuronal level.

jupiter-notebook lfp-analysis matplotlib numpy signal-processing

Last synced: 01 May 2026

https://github.com/mathpfreitas/top-hits-spotify-2000-to-2019-

# 🎧 Top Hits Spotify (2000 - 2019)Explore music trends from 2000 to 2019 with this dataset of songs, artists, and genres. Use the insights to understand what makes a hit in today's music landscape. 🐙💻

analysis analytics chart data-analysis data-visualization exploratory-data-analysis hits interactivedashboards jupyter-notebook matplotlib music musical numpy pandas plotly python timeseries track-hits

Last synced: 01 Jul 2025

https://github.com/aarryasutar/linear_multivariate_regression_on_football_statistics

Linear regression models are used to predict football player attacking stats based on attributes like finishing and passing, with the model trained, evaluated, and applied for predictions. Multiple features improve accuracy, and performance is assessed using metrics like MSE and R-squared.

datasets feature-selection football-stats linear-reg machine-learning mae mse multivariate-regression numpy pandas rmse scatter-plot train-test-split

Last synced: 14 Apr 2026