An open API service indexing awesome lists of open source software.

NumPy

NumPy is an open source library for the Python programming language that adds support for large, multidimensional arrays and matrices, along with a large collection of high-level mathematical functions to operate on these arrays.

https://github.com/403errors/offset-to-hydrostatic-curve

Group project for generating hydrostatic curves from a given ship offset table using Python libraries.

data-visualization matplotlib-pyplot numpy python3 sklearn

Last synced: 10 May 2026

https://github.com/dhruvv1402/ai-based-realtime-hand-detection-using-opencv-and-mediapipe

A real-time AI system for detecting and tracking hands using Python, OpenCV, and MediaPipe. This project demonstrates how to identify hand landmarks and extract gesture data for further applications such as gesture control and sign language interpretation.

mediapipe numpy opencv os python python3 time

Last synced: 12 Apr 2026

https://github.com/otaboyevsardorbek1/math_web

Checking the mutual convergence of mathematical functions

css fast-work flask js numpy python web3

Last synced: 12 Apr 2026

https://github.com/alibowndyn/flickerforge

A customizable image and GIF generator mimicking old television static, offering control over colors, blur, dimensions, pixel size, and many more.

numpy pillow pyqt6 python

Last synced: 05 May 2026
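
The pixel-size control mentioned in the flickerforge description can be illustrated with a minimal NumPy sketch. This is not the project's code: the function name `static_frame` and its parameters are hypothetical, and the approach (draw coarse random noise, then repeat each cell into a square block) is just one plausible way to get chunky television-style static.

```python
import numpy as np

def static_frame(width, height, pixel_size=4, seed=None):
    """Return a (height, width) uint8 array of blocky grayscale noise.

    Hypothetical helper for illustration; not taken from flickerforge.
    """
    rng = np.random.default_rng(seed)
    # Number of noise cells needed to cover the frame (ceiling division).
    rows = -(-height // pixel_size)
    cols = -(-width // pixel_size)
    coarse = rng.integers(0, 256, size=(rows, cols), dtype=np.uint8)
    # Repeat every cell into a pixel_size x pixel_size block.
    frame = np.repeat(np.repeat(coarse, pixel_size, axis=0), pixel_size, axis=1)
    # Crop in case the frame size is not a multiple of pixel_size.
    return frame[:height, :width]

frame = static_frame(width=90, height=50, pixel_size=4, seed=0)
```

An array like this could then be handed to an imaging library such as Pillow (for example via `Image.fromarray`) to save a single frame; producing a GIF would mean generating several frames with different seeds.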

https://github.com/mattdelaune/saas_cohort_analysis

This project analyzes customer segmentation and behavior using data science and cohort analysis. Key metrics like CRR, NRR, CLR, and CLV are examined through detailed charts, including the cohort layer cake and CLR vs. CLV cost efficiency analysis. Exploratory Data Analysis and systematic data manipulation reveal actionable insights.

anaconda git jupyter-notebook matplotlib numpy pandas python saas-data sales-data sales-data-analysis-project seaborn

Last synced: 03 Jan 2026

https://github.com/gandhasiri-rahul-mohan/random-forests-q2-company_data

About the data: Let’s consider a Company dataset with around 10 variables and 400 records.

data-science machine-learning numpy pandas python random-forest seaborn-plots

Last synced: 07 Apr 2026

https://github.com/codehass/facial-emotional-detection

Implement a complete AI pipeline in Computer Vision, from face detection to emotion classification using CNNs (TensorFlow/Keras) and Haar Cascades (OpenCV), and integrate this model into a FastAPI API connected to a PostgreSQL database.

cnn-model fastapi keras mathplotlib numpy opencv postgresql pydantic pytest sqlalchemy tensorflow

Last synced: 12 Apr 2026

https://github.com/thamirisq/data_analysis_culinary_journey

Project to analyze key information from Culinary Journey, a marketplace company, and develop an initial strategy to lead the entire team effectively.

folium-maps groupby-method jupyter-notebook markercluster marketplace-ds matplotlib-pyplot numpy pandas pandas-dataframe streamlit

Last synced: 10 Apr 2026

https://github.com/uhstray-io/pyrizon

Data Collection, Analysis, Mapping, Pipelining & Transformation, & API using Python

api data-engineering etl numpy pandas plotly python pytorch raw-data scikit-learn seaborne sql sqlite tensorflow

Last synced: 09 Apr 2026

https://github.com/yyhsong/ipydataanalysis

Python data analysis and presentation

matplotlib numpy pandas scipy

Last synced: 12 Apr 2026

https://github.com/mgobeaalcoba/survival_predictor_on_the_titanic_scikit_learn

Titanic Survival Predictor using Scikit-Learn: Machine learning model and analysis to predict passenger survival on the Titanic based on historical data.

matplotlib numpy pandas python3 scikit-learn seaborn titanic-dataset titanic-kaggle titanic-survival-prediction

Last synced: 10 Apr 2026

https://github.com/lakhidaksh-01/carinsightanalyzer

Interactive Car Ratings & Insights

cars matplotlib numpy pandas python seaborn streamlit

Last synced: 10 Apr 2026

https://github.com/msthamizh/singapore-resale-flat-prices-predicting

Developing a Streamlit application that allows users to interactively predict the resale prices of flats in Singapore using machine learning. The application supports regression tasks, enabling users to input various features, such as location, flat type, floor area, and lease start date, and receive predictions for resale prices.

exploratory-data-analysis machine-learning matplotlib numpy pandas python seaborn streamlit

Last synced: 09 Apr 2026

https://github.com/lolosoares/dataanalises

This repository shows the projects I made on my journey learning about data analysis, data science, data engineering, and ML.

acces data-science dax excel mysql numpy pandas powerbi powerquery python r seaborn sql vba

Last synced: 10 Apr 2026

https://github.com/yashika-malhotra/micromobility-service-provider---hypothesis-testing

Examined factors influencing demand for shared electric micro-mobility cycles. Performed exploratory analysis and hypothesis testing, revealing the distinct influence of the weather-season association on hourly counts.

colab-notebook data-visualization eda exploratory-data-analysis hypothesis-testing jupyter-notebook matplotlib matplotlib-pyplot numpy pandas python scipy-library scipy-stats seaborn skit-learn

Last synced: 12 Apr 2026

https://github.com/rushikeshbihade/django_bsased_dataanalyzer_webapp

Data Analyzer is a Django web application that enables users to upload CSV files, perform data analysis using pandas and numpy, and view results and visualizations on an interactive web interface. It simplifies data analysis by offering a user-friendly platform for data upload, processing, and visualization.

backend-development bootstrap css3 dataanalysis dataprocessing django-application django-framework djangotemplates html5 javascript numpy pandas plotly-express python3 seaborn-plots webapp

Last synced: 02 Mar 2026

https://github.com/tasninanika/heart-disease-analysis

The Heart Disease Analysis project is a comprehensive machine learning study aimed at predicting the presence of heart disease using the Heart Disease UCI Dataset.

knn logistic-regression matplotlib numpy pandas python3 random-forest scikit-learn seaborn

Last synced: 09 Apr 2026

https://github.com/aahnik/gdsc-ml-ds-bootcamp-2023

This repo contains files given by my seniors as well as assignments and final project done by me during the bootcamp.

data-science machine-learning ml numpy pandas python3 scikit-learn

Last synced: 28 Oct 2025

https://github.com/hansalemaos/cythonfastsort

C++ parallel sorting algorithms through Cython - up to 8 times faster than NumPy

c cpp cython numpy parallel-computing parallel-programming python sorting

Last synced: 04 Feb 2026

https://github.com/arpandatta011/facial_image_recognition

This project introduces facial expression recognition and surveys recent research to identify an effective and efficient method for recognizing facial expressions.

cv2 keras matplotlib numpy pandas tensorflow

Last synced: 09 Apr 2026

https://github.com/KernFerm/PresFutImpactSim

This repository contains a Python simulation model that forecasts political, economic, and social factors under a hypothetical presidency. Users input initial values, and the model simulates their progression through defined interactions and random variations.

economic-model forecasting matplotlib numpy political-model python scipy simpy simulation social-model

Last synced: 16 Oct 2025

https://github.com/nirmalyabag20/diabetes-prediction-using-machine-learning

This project focuses on predicting diabetes using machine learning algorithms based on health metrics like glucose levels, blood pressure, and BMI. By comparing different models, the goal is to identify the most accurate approach for early diabetes detection, showcasing the potential of machine learning in healthcare.

decision-tree-classifier jupyter-notebook kneighborsclassifier logistic-regression matplotlib numpy pandas python random-forest-classifier scikit-learn seaborn svc

Last synced: 18 Jan 2026

https://github.com/achalesh27022003/python-assignments-iwt

This repository is intended to hold all the assignments and projects related to IWT (Semester 3).

cosinewave frame grid image imagetk labels matplotlib-pyplot numpy numpy-arrays numpy-library photoimage pil python3 sinewave spyder tkinter tkinter-gui window

Last synced: 10 Apr 2026

https://github.com/shreeyas-48/creditcardfrauddetection

Project for detecting credit card frauds using neural networks and logistic regression

autoencoder keras logistic-regression matplotlib neural-networks numpy pandas python scikit-learn

Last synced: 12 Apr 2026

https://github.com/earth-app/doc2lora

Generate LoRA Adapters from documents

ai cloudflare-ai cloudflare-workers lora numpy py python torch

Last synced: 16 May 2026

https://github.com/mgobeaalcoba/linear_algebra_for_machine_learning

Explore fundamental linear algebra concepts essential for machine learning in this repository, with code examples and explanations. Get a solid foundation for ML!

machine-learning matplotlib numpy pandas python3 scikit-learn scipy seaborn

Last synced: 12 Apr 2026

https://github.com/mardavsj/weather-prediction

Weather prediction model which mainly focuses on visualization.

data-analysis data-visualization matplotlib numpy pandas pandas-dataframe

Last synced: 10 Apr 2026

https://github.com/rayyan9477/machine-learning-driven-backorder-prediction-system

Experience a state-of-the-art Django web application designed to predict product backorders with exceptional accuracy. This platform leverages advanced machine learning techniques, incorporating pre-trained Random Forest Classifier, Decision Tree, and LGBM models.

matplotlib notebook numpy pandas python scikit-learn

Last synced: 12 Apr 2026

https://github.com/bhavyemathur/redstone-ml

High-performance Machine Learning, Dynamic Auto-Differentiation and Tensor Algebra crate for Rust

autograd-engine linear-algebra machine-learning numpy pytorch tensor

Last synced: 17 Apr 2026

https://github.com/aisurjyasamantaray/customer-purchase-analysis

This project involves an in-depth analysis of customer purchasing behavior and sales performance to drive business insights and strategies.

customer-purchasing customer-segmentation data-visualization dataanalysis dataanalysis-projects marketing-insights matplotlib numpy pandas python revenue-and-performance-analysis seaborn-plots

Last synced: 09 Apr 2026

https://github.com/danhenriquex/data-science-project

The main goal of this project was to apply the concepts of data visualization and analysis.

data-analysis data-science numpy pandas python

Last synced: 12 Apr 2026

https://github.com/esvs2202/credit-card-fraud-detection-system

This fraud detection system is powered by a Machine Learning model, which accurately identifies whether an initiated transaction is fraudulent.

azurewebapp flask gunicorn-web-server html-css machine-learning matplotlib-pyplot numpy pandas powerbi python3 randomforestclassifier seaborn

Last synced: 12 Apr 2026

https://github.com/bunu23/image-classification-resnet50-streamlit

This project demonstrates an image classification system using ResNet50, fine-tuned on a custom dataset. It includes a Streamlit application for user interaction, allowing image uploads and classification results. It integrates Google's generative AI models for enhanced analysis.

cnn deep-learning generative-ai keras matplotlib numpy pillow python resnet-50 streamlit tensorflow

Last synced: 07 Apr 2026

https://github.com/anish202020/weather_prediction_model_21int68_internship

Predicts weather using data analysis and machine learning neural networks. The project reads data from a CSV file, processes it, trains a neural network, and visualizes the results.

keras-neural-networks keras-tensorflow matplotlib numpy pandas sklearn

Last synced: 12 Apr 2026

https://github.com/gracysapra/pandas-numpy-data-visualisation

This repository contains essential Python scripts and notebooks for data analysis and visualization. It includes: pandas: Data manipulation and analysis, including operations on series and dataframes. NumPy: Efficient numerical computations and array processing. Data Visualization: Creating insightful visualizations using Matplotlib and Seaborn.

data-science data-visualization matplotlib numpy numpy-arrays pandas pandas-dataframe pandas-series seaborn

Last synced: 07 May 2026

https://github.com/kavayk29/quora-duplicate-question-pair

This project improves information retrieval by detecting duplicate question pairs in the Quora dataset using data exploration, text preprocessing, feature engineering, and models like Random Forest and LSTM, aiming to streamline question-answering.

beautifulsoup4 bilstm gensim keras lstm matplotlib numpy pandas pytorch random-forest seaborn sklearn tensorflow xgboost

Last synced: 21 Jan 2026

https://github.com/datarohit/nlp-course-files

This repo contains the files for the online NLP course from Udemy.com that I completed.

nlp nlp-machine-learning nltk numpy panda python sklearn spacy

Last synced: 12 Apr 2026

https://github.com/datarohit/garbage-image-classification

This is an end-to-end garbage image classification project built using Python and Streamlit.

cnn-classification cnn-keras inceptionv3 keras-tensorflow matplotlib numpy

Last synced: 30 Apr 2026

https://github.com/nomadsdev/pulse-detect

PulseDetect is a Python tool that detects audio frequencies in real time. It captures sound from the microphone and identifies the dominant frequency using pyaudio and numpy.

numpy pulse-detect pyaudio python scipy

Last synced: 07 Jan 2026
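
The dominant-frequency step that the PulseDetect description mentions can be sketched with NumPy's FFT routines. This is an illustrative sketch, not the project's implementation: the function name `dominant_frequency` is hypothetical, the Hann window is an assumption, and a synthetic 440 Hz tone stands in for a chunk of microphone samples so that no audio hardware is needed.

```python
import numpy as np

def dominant_frequency(samples, sample_rate):
    """Return the frequency in Hz of the strongest non-DC component.

    Hypothetical helper for illustration; not taken from PulseDetect.
    """
    samples = np.asarray(samples, dtype=float)
    # A Hann window reduces spectral leakage at the edges of the chunk.
    windowed = samples * np.hanning(len(samples))
    spectrum = np.abs(np.fft.rfft(windowed))
    freqs = np.fft.rfftfreq(len(samples), d=1.0 / sample_rate)
    spectrum[0] = 0.0  # ignore any constant (DC) offset
    return float(freqs[np.argmax(spectrum)])

# Synthetic stand-in for one captured chunk: a pure 440 Hz sine tone.
SAMPLE_RATE = 44100
t = np.arange(4096) / SAMPLE_RATE
chunk = np.sin(2 * np.pi * 440.0 * t)
peak_hz = dominant_frequency(chunk, SAMPLE_RATE)
```

The result is quantized to the FFT bin spacing, `sample_rate / len(samples)` (about 10.8 Hz for these values), so the estimate lands within one bin of 440 Hz rather than exactly on it. Longer chunks give finer resolution at the cost of latency.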

https://github.com/pranav-bot/ml-dl-models-from-scratch

This repository contains the most commonly used machine learning and deep learning models written from scratch, without the use of any libraries except NumPy, which is used for calculations.

deep-learning descion-tree from-scratch linear-regression logistic-regression machine-learning naive-bayes neural-networks numpy perceptron pytorch qlearning random-forest regression reinforcement-learning support-vector-machines svm

Last synced: 15 Apr 2026

https://github.com/zainashrafofficial/ml_project_2024

A university semester machine learning project implementing classification and regression models to predict house prices and number of rooms using a dataset with over 160,000 entries.

ai ann artificial-neural-networks classification decision-trees jupyter-notebook linear-regression logistic-regression matplotlib ml numpy pandas python random-forest regression-models svm tensorflow

Last synced: 13 Apr 2026

https://github.com/nazli-d/binary-classification-using-cnn

This project utilizes a CNN model to classify cat and dog images through training and testing processes. The model is created using the Keras library on the TensorFlow backend.

binary-classification cat-and-dog-classifier cnn convolutional-neural-networks flatten keras max-pooling numpy opencv python relu-activation sigmoid-function tensorflow

Last synced: 13 Apr 2026

https://github.com/abtaaahi/smartxi

SmartXI — A Streamlit app to recommend football teams based on budget, formation, and style, plus predict player market values using machine learning models. Easy setup and interactive visualizations included.

machine-learning machine-learning-algorithms numpy pandas plotly python random-forest streamlit

Last synced: 13 Apr 2026

https://github.com/manjillama/machine-learning-101

Introduction to machine learning, conda environment, Jupyter Notebook, Pandas, NumPy and Matplotlib.

conda-environment jupyter-notebook machine-learning matplotlib numpy pandas python

Last synced: 13 Apr 2026

https://github.com/mihirh19/python

A repository for collaborative development and version control using Git that houses Python code files, documentation, tests, and other project-related files.

algorithms data-structures django fastapi flask machine-learning numpy pandas python

Last synced: 11 Apr 2026

https://github.com/reddyprasade/bicycle-sharing-system-in-us

A bicycle-sharing system, public bicycle system, or bike-share scheme, is a service in which bicycles are made available for shared use to individuals on a short term basis for a price or free. Many bike share systems allow people to borrow a bike from a "dock" and return it at another dock belonging to the same system. Docks are special bike racks that lock the bike, and only release it by computer control. The user enters payment information, and the computer unlocks a bike. The user returns the bike by placing it in the dock, which locks it in place. Other systems are dockless. For many systems, smartphone mapping apps show nearby available bikes and open docks.

dataanalysis jupyter maplotlib numpy pandas python r

Last synced: 29 Aug 2025

https://github.com/nitsc/spell-from-threebodytrilogy

Implemented the process of extrapolating from Gaia stellar data, to 3D visualizations, to three-views, to three-view signals, to three-view audio of signals, and even their inversions. This project proves the feasibility of the Logic (Luoji)'s “spell” from “The Three Body Problem” trilogy.

3d 3d-graphics astronomy astronomy-astrophysics audio audio-processing data data-science data-visualization gaia graph information-technology information-visualization numpy python python-3 python3 signal signal-processing visiualization

Last synced: 02 May 2026

https://github.com/raduldev/gromacs

Molecular Dynamics Simulation using GROMACS in Jupyter notebook

bash gromacs matplotlib molecular-dynamics-simulation numpy pandas python3 shell

Last synced: 09 Apr 2026

https://github.com/gitfrandu4/40982-p1

Repository for Practice 1 in Computer Vision, exploring the use of OpenCV to create artistic images, manipulate pixels, and capture video. Includes exercises such as designing a chessboard, a Mondrian-style image, and a “Pop Art” effect with the webcam.

computer-vision image-processing mondrian-style numpy opencv pop-art python visual-art webcam-capture

Last synced: 09 Apr 2026

https://github.com/jigyasag18/gold-price-prediction-project-using-machine-learning

This repository contains a machine learning project focused on predicting gold prices (GLD) using historical stock market data, including indicators such as SPX, USO, SLV, and EUR/USD. The project implements a Random Forest Regressor for price forecasting, complete with data visualization, correlation analysis, and model evaluation metrics.

data dataset jupyter-notebook jupyter-notebooks machine-learning machinelearing machinelearningalgorithms machinelearningmodel machinelearningprojects matplotlib mlproject numpy pandas randomforestregressor seaborn

Last synced: 23 Jul 2025

https://github.com/rahulvictor12/the-movie-database-data-scrapper

A Python web scraper that collects movie data from The Movie Database (TMDB). It uses `requests`, `BeautifulSoup`, and `pandas` to extract titles, ratings, genres, and cast details from multiple pages. The data is structured into DataFrames and saved as a CSV, perfect for analysis or integration into projects.

beautifulsoup colab-notebook dataframes numpy pandas python requests testing webscraping

Last synced: 11 Apr 2026

https://github.com/tasninanika/mammographic-masses-analysis-dt

This project uses a Decision Tree Classifier to predict whether a detected mammographic mass is benign (0) or malignant (1) based on input features.

decision-tree-classifier numpy pandas pyhton3 scikit-learn

Last synced: 11 Apr 2026

https://github.com/savitar-hub/image-classification

Notebooks on how image classification can help businesses make better decisions and improve the customer experience.

image-classification image-processing jupyter-notebook numpy opencv pandas pytorch tensorflow

Last synced: 08 Apr 2026

https://github.com/coqui123/tradegpt

TradeGPT is a full-stack cryptocurrency trading application that combines a modern Fresh (Deno) frontend with a Python (FastAPI) backend for Coinbase integration and Azure AI Services for intelligent trading analysis. 💹

analytics automation cryptocurrency data deno fastapi fresh numpy python trading-algorithms trading-strategies tradingbot typescript

Last synced: 11 Apr 2026

https://github.com/sebsikora/curve_fitting

A short guide to using Python tools to perform arbitrary curve-fitting of research data via constrained minimisation.

curve-fitting minimisation numpy optimisation python scipy

Last synced: 13 Apr 2026

https://github.com/shanmukhsrisaivedullapalli/smsspamclassification

SMSSpamClassification is a machine learning project aimed at accurately classifying SMS messages as either spam or ham (non-spam). It employs natural language processing techniques to extract relevant features from the text data and utilizes various classification algorithms to build a robust spam detection model.

jupyter-notebook numpy pandas pickle python3 sklearn spam-classification spam-detection

Last synced: 11 Apr 2026

https://github.com/urme-b/multimodal-multisensor

Longitudinal neurophysiological study of adult psychometric testing.

keras matplotlib numpy pandas python pytorch scikit-learn seaborn tensorflow

Last synced: 13 Apr 2026

https://github.com/harmanveer-2546/dust-detection-on-solar-panel-using

The accumulation of dust, snow, bird droppings, etc. on the surface of solar panels reduces the efficiency of the solar modules and hence the amount of energy produced. Monitoring and cleaning solar panels is a crucial task, so developing an optimal procedure to monitor and clean these panels is very important for increasing module efficiency.

classification dirty inceptionv3-model keras numpy pandas plotly python regularizer sequential solar-panels tensorflow visualization

Last synced: 13 Apr 2026

https://github.com/faisal-khann/diwali-sales-analysis

The "Diwali Sales Analysis" project analyzes sales data from the Diwali festival period to uncover insights and trends that can help improve future marketing strategies and sales performance.

csv data-analysis eda jupyter-notebook matplotlib numpy pandas python seaborn

Last synced: 11 Apr 2026

https://github.com/kientech/calories-burnt-prediction-with-machine-learning

The Calories Burnt Prediction with Machine Learning project aims to predict the number of calories burned during physical activities based on various factors such as duration, activity type, age, weight, and more.

fastapi heroku machine-learning matplotlib-pyplot modeling numpy pandas python reactjs tailwindcss

Last synced: 02 Aug 2025

https://github.com/otuemre/simplenet

A simple neural network framework built from scratch using Python and NumPy. Designed as a learning tool, SimpleNET includes essential layers, activation functions, and loss functions, enabling users to build and train basic neural networks from the ground up.

backpropagation deep-learning from-scratch machine-learning neural-network numpy python

Last synced: 13 Apr 2026

https://github.com/narenkhatwani/arkouda-projects

This repository contains the source codes of the projects done using Arkouda (a software package that allows a user to interactively issue massive parallel computations on distributed data using functions and syntax that mimic NumPy, the underlying computational library used in most Python data science workflows.)

arkouda data-analysis data-analytics data-science high-performance high-performance-computing highperformancecomputing numpy pandas parallel-computing parallel-processing parallelization python

Last synced: 17 Apr 2026

https://github.com/zion-dev1/e-commerce-analysis

Short machine learning project which analyses customer behaviour 🔎

analysis clustering ecommerce machine-learning numpy pandas sklearn

Last synced: 05 Jul 2025

https://github.com/tasninanika/k-nearest_neighbors

K-Nearest Neighbors (KNN) is a simple and powerful machine learning algorithm used for classification and regression.

knn matplotlib numpy pandas python3 seaborn

Last synced: 13 Apr 2026

https://github.com/vortexv7/engagevision-yt

Thumbnail Engager is a Streamlit-based web app that uses AI to analyze YouTube thumbnails, providing actionable feedback to improve engagement. Upload a thumbnail to get insights on its strengths, weaknesses, and tips for maximizing views. Perfect for creators looking to optimize their visuals!

gcp numpy pandas python3 streamlit streamlit-cloud tensorflow youtubeapiv3

Last synced: 09 Apr 2026

https://github.com/docsallover/spam-detection

Building a Spam Filter with Python: Using Machine Learning to Combat Spam

datascience flask jinja2 machine-learning numpy numpy-library pandas pandas-python python python3 scikit-learn

Last synced: 09 Apr 2026

https://github.com/mgobeaalcoba/matplotlib_y_seaborn

Here I will keep visualization work done with both Python libraries.

data-analysis data-science data-visualization dataset matplotlib numpy pandas python seaborn

Last synced: 13 Apr 2026

https://github.com/shivendrra/Seeker

This project will help to write new scripts for YouTube videos from scratch using fine-tuned Llama-7B

ai artificial-intelligence finetuning google-custom-search-api google-custom-search-engine llama2 llm numpy pandas python pytorch react-native reactjs transformers youtube-api

Last synced: 30 Dec 2025

https://github.com/parmeetbhamrah/air-quality-india-analysis

Exploratory data analysis of real-time air quality data from Indian cities using Python, Pandas, Matplotlib, and Seaborn.

air-quality data-analysis eda exploratory-data-analysis government-data india matplotlib numpy pandas python seaborn

Last synced: 05 May 2026

https://github.com/vitroid/tilecycles

Tessellate a graph into cycles.

genice2 graph numpy tiling

Last synced: 27 Feb 2025

https://github.com/m-rishab/housing-price-prediction

The Housing Price Prediction Accuracy Improvement project is a data-driven initiative focused on enhancing the precision and reliability of housing price predictions. This project encompasses a multidisciplinary approach, combining data science, machine learning, and real estate insights to optimize the accuracy of forecasts in the housing market.

gridsearchcv hyperparameter-tuning matplotlib matplotlib-pyplot numpy pandas pickle python3 randomforestregressor scipy sklearn

Last synced: 11 Apr 2026

https://github.com/rkarahul/machinelearning

This repository holds all my machine learning code.

analysis computer-vision matplotlib mysql nlp numpy pandas python

Last synced: 11 Apr 2026

https://github.com/willprice/arsenal

Personal arsenal of python utilities

data-science machine-learning numpy pandas personal-tools python

Last synced: 09 Apr 2026

https://github.com/iv4n-ga6l/Realtime-Car-Traffic-Tracking

Real-time car traffic tracking using a YOLOv8 model

cartracking numpy opencv python ultralytics yolov8 yolov8n

Last synced: 28 Apr 2025

https://github.com/shibam120302/plant_deseases_prediction

Plant disease forecasting is a management system used to predict the occurrence or change in severity of plant diseases. At the field scale, these systems are used by growers to make economic decisions about disease treatments for control.

keras numpy opencv python

Last synced: 13 Apr 2026

https://github.com/theanujsinha01/rainfall-prediction-using-machine-learning

This project predicts whether it will rain or not based on weather features like pressure, humidity, dew point, cloud cover, sunshine, wind direction, and wind speed. We use a Random Forest Classifier, a popular ML algorithm, trained on historical weather data. The model learns patterns and helps us forecast rain chances.

classification data-analysis eda machine-learning-algorithms matplotlib numpy pandas python scikit-learn seaborn supervised-learning

Last synced: 11 Apr 2026

https://github.com/aarryasutar/credit_eda

This project focuses on cleaning and analyzing a loan application dataset to gain insights into the factors influencing loan defaults. Through systematic data cleaning, visualization, and merging with previous application data, it provides a robust foundation for further predictive modeling.

binning boxplot correlation-matrix data-cleaning data-splitting dataframe feature-engineering heatmap jupyter-notebook matplotlib numpy pandas python scikit-learn seaborn

Last synced: 13 Apr 2026

https://github.com/paulo-santos-ds/rotatividade_de_clientes

The communications operator InternetGO is interested in predicting customer churn. If a user is identified as planning to switch operators, the company can offer promotional codes and special plan options to avoid losing that customer.

catboost numpy pandas pyplot python seaborn sklearn

Last synced: 09 Apr 2026

https://github.com/gaurangdave/house_price_predictions

Machine Learning Application to predict House Prices

hands-on learning-by-doing machine-learning numpy pandas python scikit-learn

Last synced: 11 Apr 2026

https://github.com/mahnoorsheikh16/Credit-Card-Default-Prediction

This project focuses on predicting whether a customer will default on their credit card payment in the upcoming month. Utilizing historical transaction data and customer demographics, the project employs various machine learning algorithms to distinguish between risky and non-risky customers for better credit risk management.

chi-square-test encoding hiplot imblearn json knn-imputer matplotlib numpy pandas pca-analysis pillow plotly robust-scalar scipy seaborn sklearn smote streamlit ttest visualization

Last synced: 01 Mar 2025

https://github.com/dpgitaccount/file-handling-data-analytics-toolkits

This repo provides file handling resources, an inventory management system, and data toolkits. These are essential for managing, storing, and manipulating data in various formats, such as JSON files. JSON (JavaScript Object Notation) is widely used for data exchange due to its simplicity and readability.

filehandling inventory-management json jupyter-notebook numpy os

Last synced: 13 Apr 2026

https://github.com/dipeshgoyal013/ipl_win_probability

A project that helps you check the win probability of the batting team in the second innings.

machine-learning matplotlib numpy pandas python sklearn

Last synced: 05 Feb 2026

https://github.com/aymen016/image-processing-algorithms

Explore Python-based image processing from scratch with core algorithms like interpolation, convolution, and filtering!

matplotlib numpy pillow python

Last synced: 13 Apr 2026