NumPy
NumPy is an open source library for the Python programming language, adding support for large, multidimensional arrays, and matrices, along with a large collection of high-level mathematical functions to operate on these arrays.
- GitHub: https://github.com/topics/numpy
- Wikipedia: https://en.wikipedia.org/wiki/NumPy
- Repo: https://github.com/numpy/numpy
- Created by: Travis Oliphant
- Related Topics: python, scipy,
- Last updated: 2026-06-26 00:18:09 UTC
- JSON Representation
https://github.com/raufjatoi/ai-programming
Programming for AI labs
ai deep-learning jupyter-notebooks machine-learning matplotlib numpy pandas python pytorch seaborn sklearn tensorflow
Last synced: 11 Apr 2026
https://github.com/coueghlani/nlp
Proyecto de Procesamiento de Lenguaje Natural y Análisis de Datos
mineria-de-datos nlp nlp-machine-learning nltk numpy procesadores-de-lenguajes sklearn spacy
Last synced: 08 Feb 2026
https://github.com/fimbres/matplotlib-start
A crash project with the most important matplotlib functions and methods, used in data science, AI and machine learning.
data-science matplotlib matplotlib-pyplot numpy pandas python python-script
Last synced: 13 Apr 2026
https://github.com/18mahi/digital_cave
An intermediate-level deep learning project that compares Convolutional Neural Networks (CNN) and Multi-Layer Perceptrons (MLP) on the MNIST handwritten digits dataset. This project demonstrates data augmentation, learning rate scheduling, and visual comparison of model performance
cnn confusion-matrix data-augmentation data-science deep-learning evaluation-metrics jupyter-notebook keras learning-rate-scheduler machine-learning matplotlib mlp numpy python3 scikit-learn seaborn tensorflow
Last synced: 13 Apr 2026
https://github.com/pranavsp108/time-series-forcasting
A time-series forecasting project to predict hourly energy consumption using Python, Pandas, and an XGBoost regression model.
data-analysis data-science energy-consumption forecasting matplotlib numpy pandas python scikit-learn sustainability time-series xgboost
Last synced: 10 Apr 2026
https://github.com/ahmadbuilds/fake-news-classifier
Classifies news articles as real or fake using an NLP pipeline with TF-IDF + n-grams and machine learning models. Includes text preprocessing, feature engineering, model training, and evaluation.
fastapi logistic-regression matplotlib n-grams nextjs nltk numpy pandas python3 random-forest-classifier react scikit-learn seaborn supervised-learning tf-idf typescript xgboost-classifier
Last synced: 11 Apr 2026
https://github.com/dhanish03/credit_card_fraud_detection
Developed and implemented an advanced CCFDS using ML algorithms and pattern recognition techniques. Integrated real-time monitoring and adaptive learning capabilities into the system to dynamically adjust fraud detection parameters, ensuring effectiveness in identifying emerging fraud patterns.
kaggle-dataset numpy pandas-dataframe python3 sklearn
Last synced: 16 Apr 2026
https://github.com/abideen-olawuwo/sign-language
Multi Class Image Classification
convolutional-neural-networks deep-learning keras machine-learning matplotlib neural-network numpy pandas tensorflow
Last synced: 11 Apr 2026
https://github.com/nihanthbhargav/time-series-stock-market
This project combines computer vision and NLP by segmenting pet images with a U-Net model and generating captions using CNN-RNN/LSTM. Using the Oxford-IIIT Pets dataset, it demonstrates a unified pipeline that integrates pixel-level segmentation with automatic caption generation for meaningful image understanding.
matplotlib numpy pandas plotly python scikit-learn seaborn
Last synced: 11 Apr 2026
https://github.com/elvisrock/government_analysis
Esse projeto, tem como objetivo, analisar, explorar algumas técnicas e propor uma sugestão ao governo de forma geral, sem qualquer viés politico.
data-science machine-learning numpy pandas python sklearn
Last synced: 05 Apr 2026
https://github.com/patilsukanya/assignment-06.-logistic-regression
Used libraries and functions as follows:
classifier concatination confusion-matrix eda linear-models logistic-regression logistic-regression-algorithm logit-model matplotlib-pyplot numpy one-hot-encoding pandas python roc-auc-score roc-curve seaborn
Last synced: 05 Oct 2025
https://github.com/timothyjan/intro-machine-learning-classifiers
We will use the scikit-learn library, which is a higher-level machine learning library that will work with NumPy data, and Pandas, a library that makes it easier to manipulate data. We will explore a variety of classification algorithms, and compare their performance on a “real-world” dataset, which will introduce its own set of challenges.
numpy pandas python scikit-learn
Last synced: 11 Apr 2026
https://github.com/giihcastro/trabalho-fisica
3D VPython Electromagnetic Particle Motion Simulator
Last synced: 15 Apr 2026
https://github.com/mehradi-github/ref-jupyter-2510
using python in machine learning
matplotlib numpy pandas python sklearn statistics
Last synced: 11 Apr 2026
https://github.com/alexixrugis/perceptronvisualization
Visualization of training and operation of a perceptron written from scratch in numpy
ai machine-learning numpy python
Last synced: 11 Feb 2026
https://github.com/smahala02/materials-science-introduction
Introduction to Materials Science concepts using Python for array manipulation and visualization with NumPy and Matplotlib.
data-visualization materials-science matplotlib numpy python scientific-computing
Last synced: 09 Feb 2026
https://github.com/rampal-punia/data-science-toolkit
Your Go-To Resource for Essential Data Science Related Commands, Concepts, Quick Overviews and Useful Functions.
artificial-intelligence data-science keras machine-learning matplotlib nlp nlp-machine-learning numpy pandas pythorch sql tensorflow
Last synced: 09 Feb 2026
https://github.com/farhad-here/height-distribution-analysis
Statistical comparison of height distributions in two groups using mean, standard deviation, and boxplots.
coefficient-of-variation data-analysis interquartile-ranges matplotlib mean numpy python scipy standard-deviation variance
Last synced: 13 Apr 2026
https://github.com/sarowarahmed/predicting-kolkata-house-price
🏠 Predicting Kolkata House Price: A web app powered by a Machine Learning model, built with Numpy, Pandas, Scikit-learn, and Streamlit, to predict house prices in Kolkata. Deployed on Streamlit Cloud for easy access and real-time predictions.
app kolkata linear-regression machine-learning numpy pandas scikit-learn streamlit
Last synced: 26 Feb 2026
https://github.com/kirtipratihar/python_libraries_for_ds
This repository serves as a comprehensive guide to Python programming for Data Science. It covers essential topics like data manipulation, data visualization, machine learning, and statistical analysis using popular libraries such as Pandas, NumPy, Matplotlib, Seaborn, and Scikit-Learn.
artificial-intelligence machine-learning numpy pandas python scikit-learn tensorflow
Last synced: 11 Apr 2026
https://github.com/swatisinghit/treadmill-customer-profiling-for-aerofit
Create comprehensive customer profiles for each AeroFit treadmill product through descriptive analytics. Develop two-way contingency tables and analyze conditional and marginal probabilities to discern customer characteristics, facilitating improved product recommendations and informed business decisions.
analytics conditional-probability data-analysis data-science data-visualization eda numpy pandas probability statistics
Last synced: 08 May 2026
https://github.com/ashishsingh789/bcg_virtual_internship
This repository showcases my BCG X virtual internship project on customer churn analysis for PowerCo, covering business understanding, EDA, feature engineering, and modeling using Python and machine learning.
data-manipulation data-science dataanalysis datavisualization eda machine-learning matplotlib numpy pandas python random-forest scikit-learn seaborn
Last synced: 09 Apr 2026
https://github.com/dmarks84/coursework_project_apache-airflow-kafka-on-toll-booth-data
Project for IBM Data Engineering & Python course on ETL & Big Data -- Read in live toll booth data, wrangles and transformed, and wrote into a SQL database
apache-airflow apache-kafka automation dags data-modeling databases eda elt etl mysql numpy pandas pipelines python sql
Last synced: 11 Apr 2026
https://github.com/sducournau/ign_lidar_hd_dataset
🏗️ Comprehensive Python library for processing IGN LiDAR HD data into machine learning-ready datasets for Building Level of Detail (LOD) classification. Features GPU/CPU processing, smart data management, and complete ML pipeline integration.
building-classification data-processing dataset france geospatial gis ign lidar lidar-hd numpy point-cloud scikit-learn
Last synced: 20 Jan 2026
https://github.com/ejw-data/python-bitwise-operators
Difference between bitwise and logical operators when using pandas
Last synced: 08 May 2026
https://github.com/ashishgoswami13/handgesture_recognition
Real-time hand gesture recognition using computer vision and deep learning. Collect gesture images, train a model with Google Teachable Machine, and classify gestures live for applications like drone control or human-computer interaction.
computer-vision cvzone handgesture-recognition keras numpy python teachable-machine
Last synced: 18 Apr 2026
https://github.com/ahmed-maher77/diabetes-prediction-app-using-machine-learning
Diabetes Prediction: Using machine learning to classify individuals as diabetic or non-diabetic based on health data, enabling early intervention and improved healthcare outcomes.
ai css data-science gradientboostinclassifier javascript logisticregression machine-learning matplotlib numpy pandas python randomforestclassifier seaborn streamlit supportvectormachine webdevelopment
Last synced: 11 Apr 2026
https://github.com/jigyasag18/fake-news-prediction-app
The Fake News Prediction App Repository offers a machine learning project that focuses on identifying the authenticity of news articles as fake or real. It uses a dataset of 20,000 articles and employs methods such as TF-IDF vectorization and the Lemmatization algorithm, achieving ~95% classification accuracy with random forest classifier model
data datapreprocessing logistic-regression machine-learning machine-learning-algorithms numpy pandas prediction stemming streamlit streamlit-webapp vectorization
Last synced: 11 Apr 2026
https://github.com/18mahi/tweet-sentiment-analysis
Classify tweets into happy, sad, angry, excited, and neutral with this interactive Python model. Combining TF-IDF text features with engineered numeric features like emoji sentiment, polarity, subjectivity, and punctuation counts, it demonstrates intermediate-level NLP, feature engineering, and visualization skills.
data-science data-visualization jupyter-notebook machine-learning matplotlib nlp numpy pandas sklearn string textblob
Last synced: 30 Apr 2026
https://github.com/isratafrin826/linear-regression-model
Linear Regression is a statistical method used to model the relationship between a dependent variable (target) and one or more independent variables (features) by fitting a straight line (regression line) to the data. It predicts the target value based on the linear relationship and is commonly used for forecasting and trend analysis.
googlecolaboratory numpy pandas sklearn
Last synced: 03 May 2026
https://github.com/raveenaross/bulldozer-price-prediction
Predicting the sale price of bulldozers using machine learning
matplotlib numpy pandas sklearn time
Last synced: 07 Oct 2025
https://github.com/zolppy/recommendation-system
This project demonstrates the creation of a content-based image recommendation system. It leverages a pre-trained VGG16 deep learning model to extract meaningful feature vectors from images. These features are then compared using cosine similarity to identify and recommend visually similar images.
computer-vision deep-learning keras machine-learning numpy recommendation-system sklearn tensorflow vgg16
Last synced: 12 Apr 2026
https://github.com/ricardorobledo/malicious_server_hack_detection
Predictive model to detect malicious hacking patterns in banking servers. Utilizes advanced Machine Learning techniques such as SMOTE, Gradient Boosting, and probability calibration to predict attacks befor
anaconda cibersecurity imbalanced-data imbalanced-learning imblearn kaggle matplotlib numpy pandas pandas-library python3 sklearn
Last synced: 14 Apr 2026
https://github.com/privatepeople/machinelearning-deeplearning
머신러닝과 딥러닝 공부를 위한 저장소
deep-learning deep-neural-networks deeplearning dl machine-learning machinelearning ml numpy pandas python scipy tensorflow
Last synced: 11 Apr 2026
https://github.com/kishanlalchoudhary/te-sem-6
TE SEM 6 Assignments
cpp data-science dsa-cpp matplotlib nltk numpy pandas python salesforce scikit-learn seaborn
Last synced: 11 Apr 2026
https://github.com/codealphaprojects/spotify-music-recommender
CodeAlpha Machine Learning Task-1
jupyter-notebook machine-learning matplotlib numpy pandas pycharm-ide python scikit-learn spotify spotify-api streamlit vscode
Last synced: 09 Feb 2026
https://github.com/pedasoft-consult/employee-attrition-prediction
This project implements a machine learning pipeline for predicting employee attrition using HR data. The system analyzes various factors that contribute to employee turnover and provides predictions to help HR departments take preventive measures.
joblib numpy pandas python3 sklearn streamlit
Last synced: 10 Apr 2026
https://github.com/roluochke/learning-ds-ml
Learning DS and ML
keras matplotlib numpy pandas python seaborn sklearn tensorflow
Last synced: 11 Apr 2026
https://github.com/yahiazakaria445/arabic-answer-grading-using-sequence-models
An NLP project on Arabic data using LSTM model
matplotlib nltk numpy pandas scikit-learn tensorflow
Last synced: 08 Oct 2025
https://github.com/thkox/uni_2023-content-based-image-retrieval
This project is a team assignment for the "Image Analysis" course, offered in the 7th semester of the 2023 academic year at the University of Piraeus, Department of Informatics.
7th-semester cartesian-product content-based-image-retrieval department-of-informatics hyperedge-similarity-calculation hypergraph image-analysis matplotlib networkx numpy python rank-normalization scipy torchvision university-of-piraeus
Last synced: 08 Oct 2025
https://github.com/NetBr3ak/Planetary-System-Simulation
Planetary System Simulation enables users to simulate the trajectories of two planets in a gravitational system. Through a simple web interface, users can predict planetary movements using technologies such as Flask, NumPy, and SciPy.
astronomy flask gravitational-dynamics numerical-analysis numpy physics planetary-systems scipy simulation web-application
Last synced: 08 Oct 2025
https://github.com/ercodex/predicting-new-medicines-with-sgd-classifier
"Shittiest Model of the Competition" award winner!
classifier-model dask kaggle-competition machine-learning numpy python scikit-learn
Last synced: 14 Apr 2026
https://github.com/georgehanymilad/startups-profit-prediction
Machine Learning Project
anaconda data-science dataanalysis kaggle machine-learning machine-learning-algorithms matplotlib numpy pandas python python3 regression seaborn
Last synced: 11 Apr 2026
https://github.com/pardhuu66/college-id-validator
FastAPI-based offline College ID Validator with Docker support
base64 dnn docker easyocr fastapi mobilenetv2 numpy onnx onnxruntime opencv pillow pydantic python tensorflow uvicorn
Last synced: 11 Apr 2026
https://github.com/lluispp/hunting_for_exoplanets_with_ai
An AI-powered tool that classifies exoplanets using NASA datasets and provides an interactive web interface for real-time exploration and discovery.
ai challenge exoplanets nasa numpy python spaceappchallenge-2025 spaceapps swai web
Last synced: 18 May 2026
https://github.com/simon2k/ieee-cis-fraud-detection
Can you detect fraud from customer transactions?
customer-transactions fraud-detection fraud-relation machine-learning numpy pandas scikit-learn
Last synced: 11 Apr 2026
https://github.com/towaquimbayo/comp-3948
BCIT Computer Systems Technology (CST) - COMP 3948 (Predictive Modelling)
classification matplotlib numpy pandas prediction-model predictive-analytics predictive-modeling python regression scikit-learn scipy sklearn statsmodels
Last synced: 09 Apr 2026
https://github.com/farhad-here/data-visualization-analysis-dva
This is my data analysis project. Users can use this project to clean and preprocessing the date or data visualization. Individuals can impute or ecnode ther dataset.
altair bokeh data-analysis data-analysis-python io matplotlib numpy pandas plotly python sklearn streamlit
Last synced: 11 Apr 2026
https://github.com/ahmrina/unetbraintumorseg
UNetBrainTumorSeg is a 3D Slicer extension named "seg_module" that integrates a 3D UNet trained on BraTS2020 dataset to segment brain tumor area from MRI volumes
3d-slicer-extension monai numpy python pytorch slicer unet vtk
Last synced: 07 May 2026
https://github.com/toluwalase-taiwo/ml-zoomcamp
Machine learning repository
algorithms jupyter-notebook machine-learning matplotlib numpy pandas python3 scikit-learn
Last synced: 11 Apr 2026
https://github.com/hiftd22/wpattern
📈 Analyze and visualize stock patterns with wPattern for better trading insights. Simplify your investment decisions through data-driven analysis.
cli finance financial-analysis financial-data matplotlib numpy pandas pattern-recognition python scikit-learn stock-scanner technical-analysis yfinance
Last synced: 29 Apr 2026
https://github.com/alex-s2222/mem_chat_bot
телеграмм бот для создания gif из текста введённого пользователем
cv2 google-api moviepy numpy python python-telegram-bot requests
Last synced: 08 Oct 2025
https://github.com/manjotkaurgill/agritech
Enter details of your soil and weather, and find best suitable crop for farming. With our advanced AI system, you can make informed decisions and optimize your agricultural practices.
flask generative-ai insight-generation machine-learning matplotlib mongodb nextjs numpy pandas python scikit-learn seaborn
Last synced: 12 Apr 2026
https://github.com/chandkund/stock-price-prediction-
Predict Tesla stock prices using historical data. The project preprocesses data, applies normalization and standardization, trains a linear regression model, and evaluates its performance. Visualizations include box plots and time series plots.
data-science linear-regression numpy pandas plotly python sklearn sklearn-model
Last synced: 21 Jan 2026
https://github.com/davityak03/object-detection-in-images-using-yolo
This project demonstrates object detection using the YOLO (You Only Look Once) model, providing a solution for detecting and labeling multiple objects within images or video streams. This implementation covers image preprocessing, model inference, and visualization of detected objects.
computer-vision matplotlib-pyplot numpy object-detection opencv-python pandas python yolo
Last synced: 18 Jan 2026
https://github.com/greatepee/book-recommendation-system
Book Recommendation System
collaborative-filtering cosine-similarity evaluation-metrics k-precision kaggle kaggledatasets mae numpy pandas python scikit-learn scipy svd svd-matrix-factorisation
Last synced: 12 Mar 2026
https://github.com/piyushxbhardwaj/credit-card-fraud-detection
In this project, we focus on developing a machine learning model to detect fraudulent transactions within credit card data. Credit card fraud poses a significant challenge to financial institutions and consumers globally, with traditional rule-based detection systems often struggling to adapt to new fraud tactics.
matplotlib numpy pandas python scikit-image seaborn
Last synced: 16 Feb 2026
https://github.com/null-byte-00/fargasht
A simple evolution simulator written in python
ai data-science evolution evolutionary-algorithms machine-learning neural-network numpy pygame
Last synced: 18 Jan 2026
https://github.com/mnitin-reddy/collaborative-filtering-based-recommendation-system
This project is a Book Recommendation System that uses two main approaches: Popularity-Based and Collaborative Filtering. It recommends top books based on their rating frequency and average ratings, and also provides personalized book suggestions by analyzing user interactions.
collaborative-filtering numpy pandas popularity-based-recommendation python recommendation-system scikit-learn
Last synced: 11 Apr 2026
https://github.com/paulbrenker/decision-tsp
Scripts from a university data analysis project to approximate the decision tsp problem using heuristics and machine learning
data-science machine-learning numpy poetry python
Last synced: 26 May 2026
https://github.com/sc0v0ne/ai-discipline-work
AI Discipline Work - Movie recommendation
jupyter-notebook machine-learning numpy pandas python python3
Last synced: 15 Apr 2025
https://github.com/lewis-morris/image_augment
Image augmentation for CNN's. Aiming to increase generalization.
augmentation image numpy python
Last synced: 18 Feb 2026
https://github.com/hmotez/medclaimml
An AI-powered machine learning application designed to process healthcare reimbursement claims. It analyzes medical documents, client information, insurance policies, and legal regulations to predict accurate reimbursement amounts, ensuring efficiency, compliance, and fraud detection.
healthcare jest-tests mern-stack mongodb nodejs nosql numpy pytorch react scikit-learn tailwindcss
Last synced: 12 Apr 2026
https://github.com/amish5ingh/cricket-data-analytics-ipl
Data analysis and visualization of IPL 2022 matches using Python, Pandas, Matplotlib, and Seaborn. Includes insights on match outcomes, player performances, toss trends, and venue stats with 12+ charts.
data-analysis data-visualization ipl-data-analysis ipl-data-visualization jupiter-notebook matplotlib-pyplot numpy pandas python seaborn
Last synced: 09 May 2026
https://github.com/samuelpuerta/modelosiasaberprocolombia
PROYECTO KAGGLE - Crear un modelo para predecir el redimiento de los estudiantes en las pruebas Saber Pro
ai ia jupyter-notebook machine-learning numpy pandas prediction-model predictive-modeling python
Last synced: 06 May 2026
https://github.com/mahmud-kandawala/simulating-covid-19-python-
This repository contains a Python program that simulates the spread of COVID-19 using Numpy and Matplotlib for calculations and visualization. The simulation offers an interactive way to understand virus dynamics and the impact of epidemiological parameters.
Last synced: 16 Apr 2026
https://github.com/gherrada22/spanish-sign-translator
This project uses Python and the OpenCV and MediaPipe libraries to detect and translate signs in real time.
language matplotlib mediapipe numpy opencv pycharm-ide python python3 spanish-language toml
Last synced: 18 Jan 2026
https://github.com/yappy2000d/pps-ndarray
Multi-dimensional array implementation
Last synced: 18 Jan 2026
https://github.com/carterbox/libimage
Provides large (2k) test images as NumPy arrays.
Last synced: 15 Apr 2026
https://github.com/yeisson8a/tallerdataanalysis
Taller de Data Analysis utilizando Python y las librerías Pandas, NumPy, Seaborn, Matplotlib para manipulación de datos de un archivo, así como la librería Statsmodels para regresiones lineales.
analisis-de-datos linear-regression matplotlib numpy pandas python3 seaborn statsmodels
Last synced: 14 Apr 2026
https://github.com/akshatkmistry/parkinsons_disease_predictor-voice_measures
This project implements a machine learning system to detect Parkinson's disease using voice measurements. The application uses a Random Forest classifier trained on voice feature data to predict the likelihood of Parkinson's disease with high accuracy (94%).
machine-learning matplotlib numpy pandas random-forest-classifier seaborn sklearn streamlit
Last synced: 09 Apr 2026
https://github.com/crvargasm/moore-penrose-matrix
A small implementation of an algorithm that is capable of computing the Moore-Penrose inverse of any matrix.
numerical-methods numpy python
Last synced: 10 Apr 2026
https://github.com/shakebshamsi/youtube-scraping-using-api
Youtube Scraping
jupyter-notebook numpy pandas python seaborn
Last synced: 10 Apr 2026
https://github.com/bhavnanahar/coral_project
ChatGPT said: This project analyzes and predicts the health of coral reefs in the Florida Keys by examining coral cover and species richness trends over time.
colab-notebook matplotlib numpy pandas plotly python random-forest scikit-learn seabo
Last synced: 14 Apr 2026
https://github.com/navindafernando/feature-extraction
Heart Risk Level Predicting Regression Model & Web using Feature Engineering and Data Preprocessing :baby_chick:
categorical-encoding feature-engineering flask handling-outlier html5 joblib label-encoding machine-learning numpy pandas polynomial-features quantile-transformer scaling
Last synced: 10 Apr 2026
https://github.com/03-bunny-06/japan-heart-attack-analysis
This project analyzes heart attack data in Japan, focusing on gender-based differences in risk factors such as smoking, diabetes, alcohol consumption, and regional variations. The dataset is cleaned and processed using Pandas and NumPy, extracting insights on affected individuals based on lifestyle and health conditions.
data-cleaning dataframes numpy pandas python risk-analysis
Last synced: 20 Apr 2026
https://github.com/arsh-pixel-cmd/ai-image-classifier
AI Image Classifier is a web app that uses AI and deep learning to classify images in real-time. Built with Streamlit, TensorFlow (MobileNetV2), OpenCV, and NumPy, it allows users to upload single or multiple images and get the top-3 predictions with confidence scores instantly in the browser.
docker numpy opencv python3 streamlit tensorflow2 uv
Last synced: 10 Apr 2026
https://github.com/lc-rezende/eqx_boston_dataset
Exploratory data analysis, clustering, and forecasting on Boston crime data (2011-2015), revealing key crime trends, hotspots, and temporal patterns to support data-driven insights for urban safety and policing strategies.
data-analysis exploratory-data-analysis jupyter-notebook kmeans matplotlib numpy pandas prophet-facebook python scikit-learn seaborn
Last synced: 09 Apr 2026
https://github.com/amruta33/telecom-churn-case-study
Building a classification model for reducing the churn rate for a telecom company.
logistic-regression matplotlib numpy pandas plotly python seaborn skit-learn statmodels
Last synced: 14 Apr 2026
https://github.com/azaz9026/data_cleaning
Welcome to the Data Cleaning repository! This collection is dedicated to showcasing techniques and methods for cleaning and preparing datasets for analysis.
data-analysis data-engineering data-structures data-visualization eda feature-engineering machine-learning numpy outliers pandas python seaborn
Last synced: 13 Apr 2026
https://github.com/vishalgaud17/stroke
A simple Streamlit web app that predicts stroke risk based on user input features like age, BMI, glucose level, and lifestyle factors, using a pre-trained machine learning model.
machine-learning numpy pandas python scikit-learn streamlit
Last synced: 14 Apr 2026
https://github.com/tuokri/rs2simlib
Python library for Rising Storm 2: Vietnam physics objects simulation and automated data collection.
numba numpy physics rising-storm-2-vietnam simulation unreal-engine-3
Last synced: 23 Feb 2026
https://github.com/ahmedabdalkreem/connected_component_labeling
Technique used to detect small object in the image like shapes and number can used this technique in OCR.
computer-vision connected-components matplotlib numpy object-detection python rgb2gray threshold
Last synced: 11 Apr 2026
https://github.com/gurramankit/censusproject_ml_randomforest
The objective of this project is to build a classification model using the Census Income dataset from the UCI Machine Learning Repository. The model predicts whether an individual's income exceeds $50,000 per year, based on their demographic and employment-related attributes.
matplotlib numpy pandas python random-forest scikit-learn seaborn
Last synced: 14 Apr 2026
https://github.com/elon-fask/nlp_num1
Natural Language Processing with Disaster Tweets
ai machine-learning nlp nlp-machine-learning numpy pandas python text-processing
Last synced: 10 Apr 2026
https://github.com/thekartikeyamishra/colorextractor
A Python-based project that extracts the dominant colors from an image using the KMeans clustering algorithm. This project is useful for applications like image processing, color palette generation, or analyzing the most prominent colors in images.
kmeans-clustering matplotlib matplotlib-pyplot numpy p pillow python sklearn
Last synced: 05 May 2026
https://github.com/rscr1/summer_practice_mai_2
Решение всех заданий летней практики
matplotlib numpy pandas python scipy sympy
Last synced: 14 Apr 2026
https://github.com/gustavodinizmonteiro/pagehank
gephi information-retrieval numpy pagerank pandas pipenv python python3
Last synced: 14 Apr 2026
https://github.com/tnel7/data-science-yandex-practicum
Проекты из Яндекс Практикума "Специалист по Data Science"
data-science machine-learning nlp-machine-learning numpy pandas seaborn sklearn
Last synced: 19 Jan 2026
https://github.com/badranalyst/titanic-survival-prediction-full-data-science-project-classification
This project predicts Titanic survivors using classification models. It includes data cleaning, pre-processing, exploratory data analysis (EDA), categorical feature conversion, model building, and evaluation. Python libraries like Pandas, NumPy, Matplotlib, and Seaborn are used to analyze and predict survival outcomes.
classification data-analysis data-science eda exploratory-data-analysis machine-learning matplo matplotlib-pyplot ml model numpy pandas predictive-modeling python seaborn
Last synced: 06 May 2026
https://github.com/amiridk/statistics-and-algebra-in-python
simple and important statics operations and Famous statistical problems , also algebra operations
algebra itertools matplotlib numpy pandas python scipy seaborn static
Last synced: 14 Apr 2026
https://github.com/jaweria-b/eda-basketball
The Streamlit app analyzes NBA player stats with user-selected filters, offering data download and intercorrelation heatmap.
matplotlib numpy python streamlit
Last synced: 10 Apr 2026
https://github.com/adi3042/diamond_price_prediction
🔮💎 Explore the Diamond Price Oracle! Predicting diamond values based on your preferences and market trends. Your ultimate tool for informed decisions in the diamond world. Join us for the future of pricing! 💎✨🚀 #DiamondPredictor
anaconda datetime diamond flask functools html ipykernel jupyter-notebook matplotlib numpy pandas price-prediction readme regressor sckiit-learn setuptools venv
Last synced: 14 Apr 2026
https://github.com/anuuragg/numpyneuralnet
machine-learning matplotlib neural-network numpy pandas
Last synced: 18 Jan 2026