NumPy
NumPy is an open-source library for the Python programming language that adds support for large, multidimensional arrays and matrices, along with a large collection of high-level mathematical functions to operate on these arrays.
- GitHub: https://github.com/topics/numpy
- Wikipedia: https://en.wikipedia.org/wiki/NumPy
- Repo: https://github.com/numpy/numpy
- Created by: Travis Oliphant
- Related Topics: python, scipy
- Last updated: 2026-06-26 00:18:09 UTC
https://github.com/sudarshanc00/mnist-digit-classification
This project uses a machine learning model to classify handwritten digits (0-9) from the MNIST dataset, a popular collection of grayscale images for image classification benchmarking. It includes a Jupyter Notebook to train a neural network, enabling accurate recognition and classification of digits.
jupyter-notebook matplotlib numpy python tensorflow
Last synced: 12 Apr 2026
https://github.com/trissim/polystore
Framework-agnostic multi-backend storage abstraction for ML and scientific computing
backend data io jax ml multi-framework numpy pytorch scientific-computing storage tensorflow zarr
Last synced: 12 Apr 2026
https://github.com/ahmedabdalkreem/skin-cancer
This project extracts features from images using a CNN and builds a neural network to classify a patient's skin cancer as malignant or benign.
cnn deep-learning matplotlib neural-network numpy pandas python3 sklearn-library
Last synced: 12 Apr 2026
https://github.com/ankitmishralive/machinelearning
Continuously deepening my understanding of and expertise in Machine Learning through ongoing education and hands-on practical experience.
artificial-intelligence data-analysis data-cleaning data-gathering machine-learning machinel-learning-algorithms matplotlib numpy pandas python seaborn
Last synced: 22 Mar 2025
https://github.com/ishtufinquant/portfolio-optimizer
Optimizes stock portfolios using simulation and real-world constraints (SLSQP).
backtesting finance numpy pandas portfolio-optimization python quant scipy yfinance
Last synced: 12 Apr 2026
https://github.com/subhamsarangi/pdnp
Data Analysis using Python
matplotlib numpy pandas python3
Last synced: 12 Apr 2026
https://github.com/llwusill/firepredictioncnn
Forest Fire Prediction/Detection System using a CNN
matplotlib numpy opencv-python pandas sckiit-learn seaborn tensorflow
Last synced: 12 Apr 2026
https://github.com/samkazan/structural_discovery_of_macromolecules_data_analysis
This research project uses machine learning techniques and neural networks in Python and R to uncover key factors that contribute to successful protein structure discovery.
classification clustering ipython-notebook jupyter-notebook keras-neural-networks keras-tensorflow machine-learning neural-network numpy python r rmarkdown scikit-learn scipy tensorflow
Last synced: 02 Feb 2026
https://github.com/frolov-andrey2405/prime-numbers-and-polar-coordinate-system
Prime numbers and polar coordinate system
Last synced: 01 May 2026
https://github.com/dahsie/machine_learning_from_scratch
This project aims to implement some basic machine learning techniques (e.g. MinMaxScaler, StandardScaler, TF-IDF, PCA, Logistic Regression, LDA, KNN, Naive Bayes Classifier) using only Python, NumPy and pandas. This will enable me to hone my data science skills.
classification clustering data data-processing datascience machienlearning nlp nltk numpy pandas python regression
Last synced: 04 May 2026
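As a rough illustration of what two of the techniques named in this entry involve, here is a minimal NumPy-only sketch of min-max scaling and standardization. The function names `min_max_scale` and `standard_scale` are hypothetical and are not taken from the repository, whose actual implementation may differ.

```python
import numpy as np


def min_max_scale(x):
    """Rescale each column of x linearly to the range [0, 1]."""
    x = np.asarray(x, dtype=float)
    col_min = x.min(axis=0)
    col_range = x.max(axis=0) - col_min
    # Assumption: constant columns are left at 0 rather than dividing by zero.
    col_range = np.where(col_range == 0, 1.0, col_range)
    return (x - col_min) / col_range


def standard_scale(x):
    """Shift each column to zero mean and scale it to unit variance."""
    x = np.asarray(x, dtype=float)
    mean = x.mean(axis=0)
    std = x.std(axis=0)  # population standard deviation (ddof=0)
    std = np.where(std == 0, 1.0, std)
    return (x - mean) / std


data = np.array([[1.0, 10.0], [2.0, 20.0], [3.0, 30.0]])
scaled = min_max_scale(data)
standardized = standard_scale(data)
print(scaled)
print(standardized)
```

Both helpers work column by column, which matches the usual convention of treating rows as samples and columns as features.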
https://github.com/chandkund/iris-classification
Iris Classification project using machine learning to classify iris flowers into Setosa, Versicolour, and Virginica species. Includes data exploration, model training with SVM, Logistic Regression, and Decision Tree, and performance evaluation. Contributions welcome! Python, Pandas, Scikit-learn, Seaborn, Matplotlib.
machine-learning matplotlib-pyplot numpy pandas python sklearn
Last synced: 21 Jan 2026
https://github.com/tralahm/kaggle-titanic-competition
Predicting Titanic Passenger Survival Using Machine Learning
data-analysis jupyter-notebook kaggle-competition kaggle-dataset machine-learning matplotlib numpy pandas predictive-modeling python3 sklearn tralahm tralahtek
Last synced: 13 Apr 2026
https://github.com/gustavo-venceslau/housing-price
A Machine Learning project to create a model that predicts housing prices. The project is based on the example in chapter 2 of the book Hands-On Machine Learning with Scikit-Learn, Keras, and TensorFlow by Aurélien Géron.
conda machine-learning matplotlib numpy pandas python sklearn
Last synced: 13 Apr 2026
https://github.com/jim-by/tweet-sentiment-analysis
Sentiment analysis of tweets using TextBlob for labeling and RandomForest for classification.
nltk nltk-tokenizer numpy pandas python random-forest-classifier sklearn textblob-sentiment-analysis
Last synced: 13 Apr 2026
https://github.com/rkarahul/ok.win-big-small-predictor
Predict the next “Big” or “Small” outcome on the OK.Win lottery-style game using OCR + time-series features + ML.
joblib numpy opencv-python paddleocr paddlepaddle paddlepaddle-gpu pandas python sckit-learn
Last synced: 13 Apr 2026
https://github.com/miss-mhv/udacityproject_dataviz
Data analysis of a bicycle rental system. The goal is to understand how trips vary with age, time of year, day of the week, and time of day.
jupyter-notebook matplotlib numpy pandas python3 seaborn-plots
Last synced: 13 Apr 2026
https://github.com/rajsinha7/hand-gesture-recognition
This project serves as a foundation for integrating hand gesture controls into interactive applications like games, presentations, or robotic controls. It features real-time hand tracking with MediaPipe Hands and live visualization of hand landmarks and gesture labels.
Last synced: 13 Apr 2026
https://github.com/vvipjain/iphone-sales-analysis
iPhone Sales Analysis
jupyter-notebook numpy numpy-arrays numpy-library pandas pandas-dataframe pandas-library pandas-python plotly plotly-express plotly-python python python3
Last synced: 13 Apr 2026
https://github.com/kucera-lukas/imagesecrets
RESTful API for encoding/decoding messages into/from images.
api asyncio docker fastapi heroku mypy numpy pillow poetry postgresql pydantic pytest pytest-cov pytest-mock python python-typing restful-api steganography swagger tortoise-orm
Last synced: 28 Feb 2025
https://github.com/mahdibehoftadeh/polynomial-regression-co2-emissions
A simple machine learning polynomial regression that uses a large dataset to learn and predict a car's CO2 emissions from build features such as engine size and number of cylinders.
machine-learning matplotlib numpy nural-network pandas polynomial-regression python scikit-learn
Last synced: 22 Feb 2026
https://github.com/siddhi-c19/tourism_sid_tournaut
A comprehensive analysis of tourism data of various countries
data-science dataset mathplot mathplotlib numpy plotly statistics streamlit
Last synced: 06 May 2026
https://github.com/djdhairya/car-parking-detection
cnn cv2 deep deep-learning epoch flask keras numpy numpy-arrays opencv pickle tensorflow
Last synced: 13 Apr 2026
https://github.com/cezlul/analyse-ventes-immobilier
ML solution for Parisian real-estate analysis: automatic classification of apartments vs. commercial premises (K-means, 91%) and price prediction (regression, R²=0.98) on 26K transactions. Values a €169M portfolio with data-driven strategic recommendations.
data-analysis jupyter-notebook machine-learning matplotlib numpy pandas python sklearn
Last synced: 13 Apr 2026
https://github.com/damiieibikun/web-scrapping-and-python-data-visualization-on-top-500-movies-imdb
Web scraping and Python data visualization of the top 500 movies on IMDb
beautifulsoup4 data-analysis data-visualization matplotlib-pyplot numpy pandas plotly-express python requests seaborn web-scraping
Last synced: 13 Apr 2026
https://github.com/bbhargavpanchal/brain-tumor-segmentation-unet
U-Net based Convolutional Neural Network for brain tumor segmentation using MRI images. Implemented in Python with TensorFlow, OpenCV, and Scikit-learn.
keras-tensorflow numpy opencv pandas python scikit-learn sklearn tensorflow
Last synced: 13 Apr 2026
https://github.com/ksprugevics/magic-square
Some simple algorithm implementations for solving (ordinary) magic squares using NumPy
algorithms mathematics numpy python
Last synced: 21 Apr 2026
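For readers unfamiliar with the term, an ordinary (normal) magic square of order n uses each integer from 1 to n² once, and every row, column and both diagonals sum to n(n² + 1)/2. The sketch below only checks that property with NumPy; `is_magic_square` is a hypothetical helper and is not one of the solving algorithms in the repository.

```python
import numpy as np


def is_magic_square(square):
    """Return True if square is a normal magic square."""
    a = np.asarray(square)
    if a.ndim != 2 or a.shape[0] != a.shape[1]:
        return False
    n = a.shape[0]
    # A normal magic square contains each of 1..n^2 exactly once.
    if not np.array_equal(np.sort(a, axis=None), np.arange(1, n * n + 1)):
        return False
    target = n * (n * n + 1) // 2  # the magic constant
    return bool(
        np.all(a.sum(axis=0) == target)            # column sums
        and np.all(a.sum(axis=1) == target)        # row sums
        and np.trace(a) == target                  # main diagonal
        and np.trace(np.fliplr(a)) == target       # anti-diagonal
    )


lo_shu = [[2, 7, 6], [9, 5, 1], [4, 3, 8]]
print(is_magic_square(lo_shu))
```

A checker like this is a convenient way to validate the output of any solver, whichever search strategy it uses.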
https://github.com/chandkund/predicting-diabetes-onset
The "Predicting Diabetes Onset" project aims to build a machine learning model that predicts whether an individual has diabetes based on various health-related features. The dataset used for this project includes attributes related to medical history and physical measurements.
deep-learning numpy pandas python seaborn sklearn-library sklearn-metrics visualization
Last synced: 13 Apr 2026
https://github.com/athari22/investigating-netflix-movies-and-guest-stars-in-the-office
Apply basic Python skills in Introduction to Python and Intermediate Python by processing and visualizing film and television data.
data-analysis data-science data-visualization loop loops matplotlib matplotlib-pyplot netflix numpy office pandas python
Last synced: 11 Apr 2026
https://github.com/mosalem149/data-mining-assignments
A repo for data mining quizzes and tasks.
anaconda data-mining dataframe jupyter-notebook matplotlib mlxtend numpy pandas python quizzes scikit-learn seaborn tasks
Last synced: 13 Apr 2026
https://github.com/tharushiimasha/python-practice
This repository contains Python programs and exercises from practice sessions. It covers Python basics, file handling, and libraries like NumPy and pandas for data manipulation.
api-integration csv-files file-handling json-files jupyter-notebook numpy oops-in-python pandas python web-scraping
Last synced: 13 Apr 2026
https://github.com/darshanpakhale250-gif/customer-churn-prediction-ml
A machine learning project to predict customer churn using regression and classification models including logistic regression, decision tree, and random forest. Performed EDA, visualizations, and model evaluation. The dataset is taken from Kaggle and implemented in Google Colab.
colab-notebook customer-churn-analysis data-science decision-trees kaggle logistic-regression machine ml numpy pandas python random-forest
Last synced: 13 Apr 2026
https://github.com/asier-ortiz/python-for-data-science-and-machine-learning-bootcamp
Python for Data Science and Machine Learning Bootcamp: NumPy, Pandas, Seaborn, Matplotlib, Plotly, Scikit-Learn, TensorFlow, and more
matplotlib numpy pandas plotty python scikit-learn seaborn tensorflow
Last synced: 13 Apr 2026
https://github.com/anujdutt9/language-model-using-rnn
Text generation with a basic RNN built from scratch using Python and NumPy.
language-model numpy python rnn text-generation
Last synced: 08 May 2026
https://github.com/archishmansengupta/dnn
Digit Neural Network is a digit recognition network based on MNIST data set using numpy, pandas and matplotlib
matplotlib mnist neural-network numpy pandas python
Last synced: 13 Apr 2026
https://github.com/lingumd/school_district_analysis
Updated test score data and school district analysis using Python.
averages binning jupyterlab numpy pandas pandas-loc python
Last synced: 13 Apr 2026
https://github.com/csengupta1101/rock-paper-scissor-game
Rock Paper Scissors game built with Python 3, using Jupyter Notebook as the IDE. Code file and README included.
game if-else-statements numpy python python3 random
Last synced: 11 May 2026
https://github.com/unvercan/activation-function-comparison-pytorch
Comparison of common activation functions on MNIST dataset using PyTorch.
activation-functions dataset deep-learning image-classification machine-learning matplotlib mnist neural-network numpy python pytorch relu sigmoid tanh
Last synced: 13 Jun 2025
https://github.com/vedanty3/supermarket-sales-data-analysis
This project contains data visualization techniques (using pandas and matplotlib) to explore different aspects of three months of supermarket sales data.
data-analysis data-science jupyter-notebook matplotlib numpy pandas python
Last synced: 08 May 2026
https://github.com/abideen-olawuwo/nyc-taxi
Predicting the duration of New York taxi trips
linear-regression matplotlib numpy pandas python seaborn sklearn
Last synced: 11 Apr 2026
https://github.com/prowilliam/maze_generator
A maze generator with configurable dimensions that draws the maze's solution as a red route and exports the result in PNG format.
Last synced: 28 Apr 2026
https://github.com/ondiekelijah/shadowcoach-formtracker
A computer vision-based workout form tracking application that uses AI to monitor exercise form, count repetitions, and provide real-time feedback.
mediapipe-pose numpy opencv-python
Last synced: 09 Apr 2025
https://github.com/mayankmittal29/algovision-statistical_methods_in_ai
Implementation of various machine learning algorithms from scratch, including Linear Regression, K-Nearest Neighbors, Decision Trees, and K-Means clustering. Also includes EDA on the data and implementations of the LSH, IVF, and SLIC algorithms, with evaluation metrics.
decision-tree-classifier eda gradient-descent image-segmentation ivf knn-classification linear-regression lsh-implementation matplotlib-pyplot numpy pandas python3 seaborn sgd-optimizer sklearn slic-superpixel-algorithm
Last synced: 11 Apr 2026
https://github.com/hari7261/playwithdata-python
This is one of the repositories where I have collected many data science and machine learning questions along with their solutions. I hope you will find it more useful than some other platforms. Thank you, and happy exploring.
data-analysis data-science data-science-learning machienlearning matplotlib matplotlib-python ml numpy numpy-arrays numpy-library pandas pandas-dataframe pandas-library python python-script sklearn
Last synced: 13 Apr 2026
https://github.com/kecnry/nparray
High-Level Wrappers for Building and Manipulating Numpy Arrays
Last synced: 08 Jan 2026
https://github.com/itshyphen/mass-mailing-script
A simple mass mailing script that sends personalized emails to multiple addresses imported from a CSV file
Last synced: 13 Apr 2026
https://github.com/subhas-pramanik-09/car-detection-and-counting
This project counts the number of cars passing through a designated line in a video file using OpenCV and background subtraction techniques.
machine-learning numpy object-detection opencv
Last synced: 13 Apr 2026
https://github.com/bhattbhavesh91/numpy-performance-test
Comparing the performance of the base Python square root function with a NumPy equivalent
Last synced: 20 Apr 2026
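A comparison of this kind could be set up roughly as follows, assuming `math.sqrt` as the base Python function and `np.sqrt` as the NumPy equivalent. The array size, the repeat count and the helper names are arbitrary choices made for this sketch, not details from the repository.

```python
import math
import timeit

import numpy as np

# Assumption: 100,000 positive floats is a representative workload.
values = np.arange(1, 100_001, dtype=float)


def sqrt_python(xs):
    """Element-by-element square root using the standard library."""
    return [math.sqrt(x) for x in xs]


def sqrt_numpy(xs):
    """Vectorized square root over the whole array at once."""
    return np.sqrt(xs)


# Time five runs of each approach on the same input.
python_time = timeit.timeit(lambda: sqrt_python(values), number=5)
numpy_time = timeit.timeit(lambda: sqrt_numpy(values), number=5)

print(f"math.sqrt loop: {python_time:.4f} s")
print(f"np.sqrt:        {numpy_time:.4f} s")
```

Absolute timings depend on the machine and on the array size, so the numbers are only meaningful relative to each other on the same run.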
https://github.com/msikorski93/Linear-Regression-From-Scratch
Different solutions of linear and polynomial regression made from scratch.
from-scratch gradient-descent linear-regression numpy ordinary-least-squares polynomial-regression tensorflow
Last synced: 27 Apr 2025
https://github.com/harmanveer-2546/credit-card-fraud-detection
The Credit Card Fraud Detection Problem involves modeling past credit card transactions using the knowledge of which ones turned out to be fraudulent. This model is then used to identify whether a new transaction is fraudulent or not. Our aim here is to detect 100% of the fraudulent transactions while minimizing the incorrect fraud classifications.
ann catboost-classifier eda lightgbm machine-learning matplotlib neural-network numpy pandas python random-forest seaborn xbg
Last synced: 13 Apr 2026
https://github.com/themusharraf/loan-predictor
fastapi jobl ml numpy pandas sckiit-learn
Last synced: 13 Apr 2026
https://github.com/ivangael/image-toolbox-backend
api-rest flask numpy opencv python tensorflow
Last synced: 28 Apr 2025
https://github.com/dbriane208/python-for-data-science
Machine Learning and Data Science repository. Love crafting Machine Learning models.
data-analysis data-science data-visualization machine-learning numpy pandas python seaborn
Last synced: 13 Apr 2026
https://github.com/ifte-13/early-stage-brain-stroke-detection
Predictive Analysis & Early Detection of Brain stroke using Machine Learning Algorithm
decision-tree imbalanced-learn knn matplotlib numpy pandas random-forest scikit-learn seaborn
Last synced: 06 Jul 2025
https://github.com/iv4n-ga6l/GenderDetection
Gender detection using gender classification model
genderclassification genderdetection numpy pil python resnet18 torch torchvision
Last synced: 28 Apr 2025
https://github.com/harmanveer-2546/finding-you-next-perfect-house
In this exploratory data analysis, we examine a dataset of features describing houses listed for rent by their owners on rental platforms, and try to draw constructive conclusions from descriptive statistics of those features.
adaboostregressor decisiontreeregressor exploratory-data-analysis house-price-prediction inline linearregression matplotlib numpy onehotencoder pandas plotly python randomforestregressor renting seaborn standardscaler xgbregressor
Last synced: 13 Apr 2026
https://github.com/hajaarh/malaria_hematie_cnn
Convolutional neural network algorithm
numpy pandas python resnet-50 sklearn-library tensorflow vgg16
Last synced: 13 Apr 2026
https://github.com/eny2930/ai-ml-projects
A project demonstrating various libraries and tools for applying machine learning algorithms. This repository includes code samples, datasets, and detailed explanations.
iris-dataset keras linear-regression logistic-regression matplotlib numpy pandas pytorch sckiit-learn seaborn sentiment-analysis
Last synced: 13 Apr 2026
https://github.com/intscription/machine-learning
Machine Learning and its advanced concepts
adaboost numpy pandas pca-analysis pipeline random-forest scikit-learn svm
Last synced: 28 Apr 2026
https://github.com/nambukeerthi/copper_project
Data Analysis Skills – Work with real-world data. Machine Learning Experience – Build and train models. Web App Development – Create interactive dashboards
eda machine-learning numpy pandas python streamlit
Last synced: 13 Apr 2026
https://github.com/yesenvidnath/facerecognitionapplight
This is a face recognition application that can recognize a person based on their face and record attendance from it.
face-recognition facent flask numpy python
Last synced: 13 Apr 2026
https://github.com/shwetapardhi/assignment-05-multiple-linear-regression-2
Prepare a prediction model for profit on the 50_startups data. Apply transformations to get better predictions of profit and make a table containing the R^2 value for each prepared model. R&D Spend -- research and development spend in the past few years; Administration -- spend on administration in the past few years; Marketing Spend -- spend on marketing in t
collinearity-diagnostics cooks-distance correlation-analysis eda heteroscedasticity homoscedasticity leverage-score multi-linear-regression numpy ols-regression p-value pair-plot python r-square-values regress-exog residual-analysis smf statsmodels vif
Last synced: 09 May 2026
https://github.com/saraasgari99/videoclassificationusingdeeplearning
ML pipeline for biological video analysis using cutting-edge CNN architectures and industry-standard data processing and ML techniques
computer-vision data-processing keras-tensorflow machine-learning numpy sklearn tensorflow
Last synced: 13 Apr 2026
https://github.com/bropenguin847/python-learning
Learning Python for Scientific Programming. Intended for beginners with some knowledge of Python.
array-manipulations convolution jupyter matlab matplotlib-pyplot numpy pandas-python pep8 python python3 scientific-programming scipy spyder-ide vscode
Last synced: 13 Apr 2026
https://github.com/rajan-bhateja/data-analysis-with-python
Cleaning, analysis and visualization on various datasets using Python and its various libraries via Jupyter Notebook
data-cleaning data-science jupyter-notebook matplotlib numpy pandas python seaborn
Last synced: 30 Apr 2026
https://github.com/aritrakar/ai-projects
🤖A centralized collection of links to my AI projects roughly divided by topic.
numpy pandas python pytorch tensorflow
Last synced: 13 Apr 2026
https://github.com/abhisek-13/whatsapp-chat-analyzer
The WhatsApp Chat Analyzer is a data analysis project that provides insights into WhatsApp chats. It analyzes chat data to show metrics like the number of lines, most used letter, chatting duration, media files shared, most used emojis, and group member activity. The results are displayed on a user-friendly dashboard built with Streamlit.
data-analysis data-mining data-visualization eda machine-learning machine-learning-algorithms matplotlib numpy pandas python seaborn sklearn
Last synced: 13 Apr 2026
https://github.com/5hraddha/eda-instacart-customers-shopping-habits
In this Exploratory Data Analysis (EDA) project we'll clean up the data and prepare a report that gives insight into the shopping habits of Instacart customers.
data-visualization exploratory-data-analysis instacart matlpotlib numpy pandas
Last synced: 13 Apr 2026
https://github.com/paulo-freitas-junior/ifood-dev-week2023
Repository for the iFood Dev Week on using Python and Artificial Intelligence to calculate NPS (Net Promoter Score) and using AI to analyze the sentiment of comments with ChatGPT-4 technology.
bert chatgpt colab-notebook matplotlib nltk numpy pandas python textblob-sentiment-analysis
Last synced: 13 Apr 2026
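For context on the NPS calculation mentioned in this entry: under the standard definition, respondents scoring 9 or 10 count as promoters, those scoring 0 to 6 as detractors, and the score is the percentage of promoters minus the percentage of detractors. A minimal NumPy sketch follows; `net_promoter_score` and the sample data are hypothetical and do not come from the repository.

```python
import numpy as np


def net_promoter_score(scores):
    """Return the NPS (between -100 and 100) for 0-10 survey scores."""
    s = np.asarray(scores)
    promoters = np.mean(s >= 9)    # fraction scoring 9 or 10
    detractors = np.mean(s <= 6)   # fraction scoring 0 through 6
    return float(100 * (promoters - detractors))


# Made-up survey responses, for illustration only.
sample_scores = [10, 9, 8, 7, 6, 0, 10, 9, 3, 10]
nps = net_promoter_score(sample_scores)
print(round(nps, 1))
```

Scores of 7 and 8 (passives) affect the result only through the total number of respondents.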
https://github.com/thinker84/real-time-stock-price-prediction-and-market-analysis-using-machine-learning
Real-time stock price prediction app using LSTM, Streamlit, and historical data (2010–2023). Forecasts next 10 days & visualizes trends.
data-science django lstm machine-learning numpy pandas pandas-datareader scikit-learn stock-market stock-price-prediction stooq streamlit yahoo-finance yahoo-finance-api
Last synced: 13 Jul 2025
https://github.com/nirmaldeepponnada/codeclauseinternshipproject2
Python, NLTK, Scikit-Learn, Pandas, NumPy, Pickle, SciPy, and JSON are used for text preprocessing, feature engineering, multi-label classification, and model persistence.
nltk numpy pandas pickle python scikit-learn scipy
Last synced: 07 Apr 2026
https://github.com/jcm-ai/robust-mlops-elevating-machine-learning-to-production-scale
This project aims to address the challenge faced by the Office of Foreign Labor Certification in managing a high volume of job certification applications for employers seeking to hire foreign workers in the United States. By leveraging ML models, this project seeks to streamline the process of shortlisting visa applicants based on historical data.
aws-ec2 aws-ecr aws-s3 cicd-pipeline data-science docker-image end-to-end exploratory-data-analysis git github-actions html machine-learning-models mlops mlops-project model-deployment mongodb numpy pandas python yaml
Last synced: 09 Apr 2026
https://github.com/niel0503/probabilidade-e-amostragem
Challenge developed for the WoMakersCode bootcamp
estadistica matplotlib numpy pandas stats
Last synced: 18 Jan 2026
https://github.com/moonshallow5/solar-system-simulation
A solar system simulation using the Beeman scheme and the Euler method
matplotlib-animation matplotlib-pyplot numpy
Last synced: 13 May 2026
https://github.com/rishabhmathur06/pytorch
This repository contains code from the PyTorch tutorial provided by Patrick Loeber
neural-network numpy pandas python pytorch sklearn
Last synced: 13 Apr 2026
https://github.com/mastermindromii/car-price-prediction-model
A regression project that predicts car prices using linear regression.
linear-regression matplotlib numpy pandas python scikit-learn seaborn
Last synced: 13 Apr 2026
https://github.com/aaaastark/wordassociation-mutualinformation-nlp-project
Word Association and Mutual Information Project
gensim machine-learning matplotlib natural-language-processing nlp nltk numpy pandas python seaborn sklearn wordcloud
Last synced: 13 Apr 2026
https://github.com/shreeparab1890/heart-attack-analysis-prediction-accuracy-85.24
This IPython Notebook implements analysis and prediction for heart attacks. The analysis is done on the dataset to identify the key features that have an impact on heart attacks. After the analysis and data cleaning, the dataset is used to train models using different ML algorithms.
eda exploratory-data-analysis matplotlib numpy pandas plotly prediction python scikit sklearn
Last synced: 13 Apr 2026
https://github.com/hilalozdemirbuyukasik/deep-learning
A collection of deep learning projects demonstrating RNNs, BiLSTMs, CNNs, and basic neural networks applied to time series forecasting, text sentiment analysis, image classification, and tabular data tasks, with examples of data preprocessing, model training, evaluation, and visualization.
bilstm cnn keras matplotlib nn numpy rnn scikit-learn tensorflow
Last synced: 12 Apr 2026
https://github.com/lesiaukr/mathematic_for_cs_colab_hw
Master's degree | HW | Mathematics for Computer Science course | Homeworks
goit google-colab home-works latex linear-algebra mathematics-for-computerscience matrices numpy python vectors
Last synced: 13 Apr 2026
https://github.com/jbayardo/dip-tps
coursework image-processing introduction jupyter-notebook matplotlib notebook numpy python scikit-image scipy
Last synced: 13 Apr 2026
https://github.com/takana671/partiallytransparentimage
A simple tool to make an image partially transparent
geomipterrain numpy opencv pillow python3 transparent-image
Last synced: 07 May 2026
https://github.com/mohamed15058/amazon-sales
amazon-sales
dashboard excel machine-learning-algorithms matplotlib numpy pandas powerbi report seaborn
Last synced: 13 Apr 2026
https://github.com/lupin3000/augmentedreality
Introduction to augmented reality (AR) with Python, OpenCV, and ArUco markers.
aruco aruco-marker camera-calibration marker-detection numpy opencv pose-estimation python
Last synced: 13 Apr 2026
https://gitlab.com/DLR-SY/mcodac
MCODAC is a Fortran library for the numerical evaluation of fiber composite damage. The library contains analysis methods specifically tailored to fiber composites, from micromechanical homogenization approaches to macroscopic fatigue models of orthotropic multilayer composites.
Last synced: 02 Apr 2025
https://github.com/lingesh81051/similar-template-document-matching-and-fraud-detection
An automated system for a health insurance company that streamlines document processing, including template matching and fraud detection, and reduces processing time.
numpy opencv opencv-python pillow pymupdf pytesseract pytesseract-ocr python tkinter
Last synced: 13 Apr 2026
https://github.com/adithaker/basic_nlp
🖨 Basic Data Viz; Sentiment Analysis and Cosine Similarity for starting NLP
cosine-similarity datavisualization jupyter-notebook matplotlib-pyplot nlp numpy pandas python sentiment-analysis sklearn textsimilarity wordembeddings
Last synced: 15 Apr 2026
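Cosine similarity, one of the techniques this entry names, measures the angle between two vectors: the dot product divided by the product of the vector norms. Here is a minimal NumPy sketch; the helper name and the toy vectors are assumptions for illustration and are not taken from the repository.

```python
import numpy as np


def cosine_similarity(u, v):
    """Cosine of the angle between vectors u and v."""
    u = np.asarray(u, dtype=float)
    v = np.asarray(v, dtype=float)
    denom = np.linalg.norm(u) * np.linalg.norm(v)
    # Assumption: a zero vector is treated as having similarity 0.
    if denom == 0:
        return 0.0
    return float(np.dot(u, v) / denom)


# Toy term-count vectors standing in for two short documents.
doc_a = [1, 2, 3]
doc_b = [2, 4, 6]
print(cosine_similarity(doc_a, doc_b))
```

In text applications the inputs would typically be term-count or embedding vectors, so the measure ignores document length and compares direction only.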
https://github.com/18mahi/digital_cave
An intermediate-level deep learning project that compares Convolutional Neural Networks (CNN) and Multi-Layer Perceptrons (MLP) on the MNIST handwritten digits dataset. This project demonstrates data augmentation, learning rate scheduling, and visual comparison of model performance
cnn confusion-matrix data-augmentation data-science deep-learning evaluation-metrics jupyter-notebook keras learning-rate-scheduler machine-learning matplotlib mlp numpy python3 scikit-learn seaborn tensorflow
Last synced: 13 Apr 2026
https://github.com/pranavsp108/time-series-forcasting
A time-series forecasting project to predict hourly energy consumption using Python, Pandas, and an XGBoost regression model.
data-analysis data-science energy-consumption forecasting matplotlib numpy pandas python scikit-learn sustainability time-series xgboost
Last synced: 10 Apr 2026
https://github.com/ahmadbuilds/fake-news-classifier
Classifies news articles as real or fake using an NLP pipeline with TF-IDF + n-grams and machine learning models. Includes text preprocessing, feature engineering, model training, and evaluation.
fastapi logistic-regression matplotlib n-grams nextjs nltk numpy pandas python3 random-forest-classifier react scikit-learn seaborn supervised-learning tf-idf typescript xgboost-classifier
Last synced: 11 Apr 2026
https://github.com/lmizner/codecademy_airline_analysis
Using visualization to explore a dataset
histplot jupyter-notebook lmplot math matplotlib-pyplot numpy pandas python seaborn
Last synced: 08 May 2026
https://github.com/farhad-here/height-distribution-analysis
Statistical comparison of height distributions in two groups using mean, standard deviation, and boxplots.
coefficient-of-variation data-analysis interquartile-ranges matplotlib mean numpy python scipy standard-deviation variance
Last synced: 13 Apr 2026
https://github.com/sducournau/ign_lidar_hd_dataset
🏗️ Comprehensive Python library for processing IGN LiDAR HD data into machine learning-ready datasets for Building Level of Detail (LOD) classification. Features GPU/CPU processing, smart data management, and complete ML pipeline integration.
building-classification data-processing dataset france geospatial gis ign lidar lidar-hd numpy point-cloud scikit-learn
Last synced: 20 Jan 2026