An open API service indexing awesome lists of open source software.

NumPy

NumPy is an open source library for the Python programming language, adding support for large, multidimensional arrays, and matrices, along with a large collection of high-level mathematical functions to operate on these arrays.

https://github.com/vedikasnehil/my-data-science-projects

This repository is a comprehensive collection of resources and implementations dedicated to the field of Data Science. It serves as a platform for exploring various aspects of data science, ranging from data preprocessing and exploratory data analysis (EDA) to machine learning and deep learning.

data data-science deep-learning machine-learning matplotlib numpy python sql visualization

Last synced: 10 Apr 2026

https://github.com/jain1shh/solar-flare-prediction

This repository contains code and data for predicting solar flare energy ranges using machine learning, based on NASA's RHESSI mission data. It includes preprocessing of FITS files into a unified CSV dataset and implements models like Gradient Boosting, Random Forest, and Decision Tree classifiers, achieving accuracies up to 87%.

data-visualization machine-learning numpy pandas python scikit-learn solar-flare-prediction

Last synced: 09 Apr 2026

https://github.com/kgruiz/linalg-practice

LinAlg-Practice is a Python library developed to deepen my understanding of linear algebra through hands-on implementation of various matrix operations. It includes comprehensive tests that compare the results with established libraries like NumPy to ensure accuracy and reliability.

algorithms data-science linear-algebra math matrix-operations numpy python sympy

Last synced: 21 Apr 2026

https://github.com/mayankmittal29/tensortinker_statistical_methods_in_ai

This repository contains implementations of various machine learning algorithms from scratch, including Multi-Layer Perceptron (MLP), Gaussian Mixture Models (GMM), Principal Component Analysis (PCA), Autoencoders, and Variational Autoencoders.

autoencoder-mnist cupy gmm-clustering image-segmentation matplotlib-pyplot mlp-classifier mlp-regressor mnist-dataset numpy pandas pca python3 pytorch roc-auc seaborn torch variational-autoencoder

Last synced: 09 Apr 2026

https://github.com/vetrivel07/flight-price-prediction

Developed a flight price prediction model using Python, analyzing historical data to forecast airfare prices and help travelers make informed booking decisions

data-analysis data-visualization jupyter-notebook numpy pandas python

Last synced: 15 Jun 2025

https://github.com/nguyenanhtuan1912/datatable-image-to-text

Repository này là đồ án môn học Computer Vision

computer-vision javascript nodejs numpy opencv python tesseract

Last synced: 09 Apr 2026

https://github.com/clchinkc/zombie

Personal project, Python, NumPy, Matplotlib, Pygame, Scikit-learn, TensorFlow, Docker

algorithms data-analysis docker machine-learning matplotlib numpy pygame python sklearn tensorflow zombie-simulation

Last synced: 05 Apr 2026

https://github.com/vinicius999/eda-imdb-top1000-films

Análise exploratória dos Top 1000 filmes no IMDB até 2020

eda numpy pandas python

Last synced: 07 May 2026

https://github.com/buildwithlal/introduction-to-data-science-in-python-coursera

introduction to data science in python, part of Applied Data Science using Python Specialization from University of Michigan offered by Coursera

data-analysis matplotlib numpy pandas

Last synced: 03 May 2026

https://github.com/vyjayanthipolapragada/image_classifier_cnn_data_augmentation

A deep learning project using Convolutional Neural Networks (CNNs) to classify CIFAR-10 images. The model leverages data augmentation, batch normalization, and ReLU activation to improve performance and generalization. Includes training and evaluation scripts for multi-class image classification.

adam-optimizer convolutional-neural-networks data-augmentation deep-learning image-classification jupyter-notebook neural-networks numpy optimizer pandas python pytorch relu-layer

Last synced: 09 Apr 2026

https://github.com/abdullahashfaqvirk/sms-spam-detection

A machine learning application designed to classify SMS messages as spam or non-spam, offering real-time analysis to identify potentially harmful content.

css3 docker flask html5 javascript matplotlib nltk numpy pandas python scikit-learn seaborn tailwindcss xgboost

Last synced: 02 Apr 2026

https://github.com/aravinda-1402/covid-detection-model-using-chest-x-ray

The objective of this project is to develop a Deep Learning Model to identify the X-Rays of healthy vs. Pneumonia (Corona) afflicted patients using the Chest X-Ray dataset, and use this model to power the AI application to test the Corona Virus in a faster phase.

classification cnn covid flask keras numpy pandas tensorflow

Last synced: 09 Apr 2026

https://github.com/v-mayya/resale-revolution

Empower Hacks Hackathon Submission

html matplotlib numpy pandas python sqlite streamlit

Last synced: 09 Apr 2026

https://github.com/mirzaazwad/tymbert

TYMBert is our submission for NCIM 2025, a spam classifier that makes use of knowledge distillation to compress the model while preserving accuracy

bert huggingface-transformers knowledge-distillation machine-learning matplotlib numpy pandas python3 scikit-learn tiny-bert torch

Last synced: 09 Apr 2026

https://github.com/jcardonamde/datasets_ml

This project analyzes cab and limousine travel data in New York City. This with the goal of predicting the total duration of trips within the city. Machine learning models were used.

data-science machine-learning machine-learning-algorithms matplotlib numpy pandas pipelines python seaborn sklearn

Last synced: 09 Apr 2026

https://github.com/shafaq-aslam/predicting-heart-disease-risk-with-logistic-regression-techniques

Develop a predictive model using logistic regression techniques to assess heart disease risk based on patient health metrics and data analysis.

data-analysis heart-disease logistic-regression machine-learning machine-learning-models matplotlib numpy pandas python scikit-learn seaborn

Last synced: 09 Apr 2026

https://github.com/stefagnone/text_adventure_game

A text-based adventure game project using Python fundamentals

matplotlib numpy pandas python r scikit-learn seaborn sql

Last synced: 09 Apr 2026

https://github.com/Khushi130404/CatNet

CatNet is a simple machine learning project that classifies images as either a cat or not a cat using logistic regression. The dataset consists of labeled images of cats and non-cats, preprocessed and used to train a binary classification model.

h5py matplotlib numpy pillow scipy

Last synced: 19 Sep 2025

https://github.com/n-t-raghava/the_sweet_16

This project detects faces in real-time from a webcam feed or an uploaded image and predicts age (in bins of 5 years) and gender (Male/Female). The model is based on OpenCV’s Deep Neural Network (DNN) module and pre-trained models for face, age, and gender detection.

caffemodel deep-learning flask numpy opencv python

Last synced: 09 Apr 2026

https://github.com/dmarks84/coursework_project_ml-model-eval-refine

Project for IBM Data Science course on ML Models & Analysis -- Read in large dataset of home sales and utilized polynomial linear regression analysis to make predictions of future home sales prices

classification communication data-modeling dataframes machine-learning matplotlib numpy pandas programming python regression scikit-learn scipy seaborn supervised-ml visualization

Last synced: 09 Apr 2026

https://github.com/shauryashaurya/marty_mcfly

Code, text and notebooks on a tutorial for Introduction to Machine Learning using open sources

anaconda jupyter-notebooks machine-learning machine-learning-tutorials notebook numpy python regression scikit-learn scipy tutorial

Last synced: 09 Apr 2026

https://github.com/nandit123/python_on_excel

Data Analysis using python libraries on excel data

csv data-analysis data-science fill fluctuations graph numpy python python-library

Last synced: 16 May 2026

https://github.com/haseebulhassan437/lstm-next-word-predictor

A deep learning-based LSTM model for predicting the next word in a sequence using natural language processing techniques.

keras-tensorflow lstm numpy tensorflow

Last synced: 18 Apr 2026

https://github.com/rajireddy15/employee-attrition-prediction-hr-analytics-

Employee Attrition Prediction (HR Analytics) helps organizations analyze employee data, identify factors driving turnover, and predict attrition using machine learning and visual dashboards, enabling data-driven HR decisions and retention strategies.

data-cleaning data-collection data-manipulation data-preprocessing data-science data-visualization eda feature-engineering imbalanced-data machine-learning mysql-database numpy pandas

Last synced: 04 May 2026

https://github.com/ghaniketrajputp005/adobe-gensolve-hackathon-2024

This Project aims to identify, regularize, and beautify curves in 2D Euclidean space.

cv2 keras-tensorflow numpy os sklearn

Last synced: 07 Feb 2026

https://github.com/marcinz20/sortingalgorithms

This is a basic console program which contains just a few, basic sorting algorithms and showcases their usage

algorithms numpy object-oriented-programming python

Last synced: 15 May 2026

https://github.com/qanastek/parseur-pdf

https://trello.com/b/SbT2XGyF/g%C3%A9nie-logiciel-scrum

beautifulsoup4 numpy python

Last synced: 15 May 2026

https://github.com/mustafadanabasi/python-linearregression-evfiyatlari

Ev Fiyatlarını Linear Regrasyon ile tahminleme çalışması.

linear-regression numpy pandas python

Last synced: 06 Apr 2026

https://github.com/franciscomartinez45/Social-Network-Analysis

Applied to analyze how misinformation propagates within communities. With the goal of addressing health disparities and improving health literacy particularly in minority populations, the project explores both supervised and unsupervised learning approaches to understand patterns in graph-structured data using a custom Graph Attention Network

matplotlib ml networkx numpy pytorch

Last synced: 11 Apr 2025

https://github.com/dmarks84/ind_project_data-science-london-scikit-learn--kaggle

Independent Project - Kaggle Competition -- I worked on the Data Science London data set for the Data Science London + Scikit-learn competition.

classification cross-validation data-modeling data-reporting data-visualization dataframes eda grid-search matplotlib numpy pandas python sklearn statistics supervised-ml

Last synced: 06 Apr 2026

https://github.com/ondiekelijah/data-analysis-udacity

Udacity Data Analysis Nano Degree course resources

jupyter-notebook matplotlib numpy pandas python

Last synced: 17 Mar 2025

https://github.com/ajxxxs/spotify-music-analysis

spotify Music (web scraped playlists ) analysis (over 3 states) , trends, features and a music recommendation system.

matplotlib numpy panda scikit-learn seaborn

Last synced: 28 Jul 2025

https://github.com/kenwuqianghao/ml-zoomcamp

Code and homework for ML Zoomcamp

machinelearning numpy pandas python3 tensorflow

Last synced: 06 Apr 2026

https://github.com/jalijuhola/amazon-textual-reviews-recommender-

predicting score and recommending using amazon textual reviews

numpy pandas python scikit-learn typescript

Last synced: 09 Apr 2026

https://github.com/mzayles/vendas_ficticias_dataprep

💻📊✅ Curso de Programação em Python para Data Science | Analisando e tratando dados fictícios.

numpy pandas python

Last synced: 15 Jun 2025

https://github.com/kikoveiga/feup-ia1

Artificial Intelligence (IA) First Project (2023/2024): BSc in Informatics and Computing Engineering @ FEUP

feup feup-ia feup-leic genetic-algorithms hill-climbing numpy pandas python simulated-annealing tabu-search

Last synced: 06 May 2026

https://github.com/mgitrov/coce

A deep learning-based project aiming to classify images out of 10 classes.

computer-vision convolutional-neural-networks deep-learning docker fastapi keras matplotlib numpy pillow regularization

Last synced: 06 Apr 2026

https://github.com/eesunmoon/genai_cor-recom

[Project] Outfit Coordination Recommender System using KoAlpaca

data-analysis fine-tuning generative-ai huggingface keyword llm numpy pandas python python3 selenium

Last synced: 06 Apr 2026

https://github.com/lfgodoi/rpm-deep-guesser

A deep learning-based RPM estimator based on spectral features extracted from vibration signals of rotating machines.

condition-monitoring deep-learning docker flask machine-learning neural-networks numpy python pytorch scipy signal-processing spectral-analysis

Last synced: 02 Mar 2025

https://github.com/shreyasdankhade/portfolio_optimatization_project

The Portfolio Optimization Project uses optimization techniques to balance risk and return, helping investors make efficient asset allocation decisions.

flask flask-application matplotlib numpy pandas pandas-python porfolio-optimization portfolio python

Last synced: 06 Apr 2026

https://github.com/shambac/shamboflow

Fierce tensorflow competitor

cuda cupy machine-learning numpy pypi-package

Last synced: 19 Feb 2026

https://github.com/t-lak/decision-tree

This project implements a basic Decision Tree classifier. It supports visualizing the tree and calculating performance metrics (accuracy, precision, F1-score, and recall).

decision-trees graphviz metrics numpy python3 scikit-learn

Last synced: 07 May 2026

https://github.com/mallocode300/colour_palette_generator

Upload an image and immediately obtain a colour palette with the 10 most common colors in HEX codes and RGB

numpy python

Last synced: 29 Jul 2025

https://github.com/morsalinislamshapon/diabetes-prediction-systemv3

This repository contains a machine learning model that predicts diabetes using user health data. It features an interactive web interface built with Streamlit and provides insights into model predictions through SHAP and permutation importance. 🐙🌟

jupyter-notebook machine-learning matplotlib numpy pandas plotly randomforestclassifier sckit-learn seaborn streamlit-webapp

Last synced: 29 Jul 2025

https://github.com/harsha-yuvaraj/Data-Compression-and-Decompression-Tool

A specialized lossless file compression and decompression tool designed mainly for plain text, including programming files.

data-structures huffman-compression-algorithm lossless-compression-algorithm numpy python tkinter

Last synced: 02 May 2025

https://github.com/elijahondiek/shadowcoach-formtracker

A computer vision-based workout form tracking application that uses AI to monitor exercise form, count repetitions, and provide real-time feedback.

mediapipe-pose numpy opencv-python

Last synced: 18 Apr 2026

https://github.com/justinzhang17/comp-5801-queens

Reinforcement learning task of teaching an agent to play the game queens/star-battle

gymnasium numpy pygame python

Last synced: 15 May 2026

https://github.com/arjunravi26/glucose-monitoring

Glucose Monitoring system to monitor patients and notify whenever glucose goes high.

machine-learning numpy pandas plotly python3 streamlit xgbboostregressor

Last synced: 30 Jul 2025

https://github.com/nathadriele/transaction_fraud_prevention_pipeline

Uma solução de detecção e prevenção de fraudes em transações financeiras, combinando Machine Learning, regras de negócio e análises estatísticas avançadas. O sistema oferece um dashboard interativo para monitoramento em tempo real, análise de dados e gestão de alertas de fraude.

data-analysis data-visualization docker fraud-prevention machine-learning matplotlib numpy pandas pipeline pytest python scikit-learn scipy seaborn streamlit tensorflow transaction xgboost

Last synced: 10 Apr 2026

https://github.com/vipinchaudhary31122002/valuevanguard

ValueVanguard is a machine learning project for accurate house price predictions. Using advanced algorithms and real-world data, it empowers users to analyze and forecast property values efficiently. Perfect for developers, data enthusiasts, and real estate professionals. 🏠📈

machine-learning matplotlib numpy pandas python sklearn streamlit

Last synced: 12 Apr 2026

https://github.com/zeeshan4002911/data-analysis-hub

Quality control, data processing, data cleaning, data ploting

jupyter-notebook jupyterlab matplotlib numpy pandas seaborn

Last synced: 03 May 2026

https://github.com/gilevatanya/yandex-practicum-projects

Кейсы решенные на курсах Яндекс Практикума.

bert bootstrap catboost keras lightgbm matplotlib nltk numpy pandas postgresql python pytorch scikit-learn scipy seaborn sql

Last synced: 06 Jan 2026

https://github.com/ledsouza/MedPhys-BI

Aplicativo desenvolvido para gestão e análise de dados do Serviço de Física Médica em Medicina Nuclear do HCPA

aws boto3 cloud-computing cloud-run docker dotenv gcp mongodb numpy os pandas plotly pymongo python s3 streamlit terraform vitrinedev yaml

Last synced: 22 Sep 2025

https://github.com/1ayanabil1/100-days-of-python-bootcamp

Join me on my journey to code in Python every day for 100 days! 🐍 This challenge is designed to sharpen my programming skills, explore Python libraries, and build cool projects along the way.

data-structures data-structures-and-algorithms data-visualization django flask machine-learning matplotlib numpy pandas python seaborn web-development

Last synced: 09 Apr 2026

https://github.com/youssefali11997/free-style-coding

Scripts written as practice, trying new technology or just playing with the code !

bash javascript numpy python3 ruby-on-rails scripts vuejs

Last synced: 12 Apr 2026

https://github.com/abeed04/face-recognition-using-computer-vision-cv2

OpenCV (cv2) can be used for face recognition by detecting faces, extracting facial features, and comparing them to a database of known faces.

cmake cv2-library dlib-face-recognition face-recognition flask numpy pycharm-ide python

Last synced: 08 Feb 2026

https://github.com/lovesaroha/learning-neural-networks

Various concepts of neural networks applied in python (numpy) to help people get started with AI.

batch-normalization dropout gradient-descent logistic-regression neural-network numpy python regularization

Last synced: 08 May 2026

https://github.com/giacomolat/realestateai-solutions---a-forecasting-model-for-the-housing-market

This project applies regularization techniques (Ridge, Lasso, and Elastic Net) to improve real estate price forecasting. This project focuses on reducing overfitting and increasing the stability of regression models' predictions

cross-validation elasticnet-regression lasso-regression-model machine-learning-algorithms matplotlib matplotlib-pyplot numpy pandas python regularization-methods regularization-to-avoid-overfitting ridge-regression-model seaborn standardization

Last synced: 01 May 2026

https://github.com/chernyakid/russian-film-distribution

Исследование российского кинопроката

jupyter numpy pandas python seaborn

Last synced: 15 May 2026

https://github.com/ourway/simple-cnn

A simple CNN implementation using numpy

convolutional-neural-networks numpy

Last synced: 10 May 2026

https://github.com/alejoduarte23/fast_fdd

Fast implementation of frequency domain decomposition (FDD) in python with multiple identification techniques

numpy scipy-signal

Last synced: 10 May 2026

https://github.com/paul-bokelman/nn

Basic neural network in python

machine-learning neural-networks numpy python

Last synced: 09 May 2026

https://github.com/valmir-unicap/deteccao-de-mascaras-faciais

Experimento baseado no artigo: A Face Mask Detection Algorithm Based on YOLO

kaggle-dataset mobilenetv2 numpy opencv python torch yolo

Last synced: 12 Apr 2026

https://github.com/werctfourth/python-border-autocrop2

Python script that crops borders from images v2

border crop image-processing libvips numba numpy python python3

Last synced: 17 May 2026

https://github.com/vuanhtuan1012/data-scientist-with-python

Notes on career track "Data Scientist with Python" at DataCamp

importing-data matlab numpy pandas python3 sqlalchemy

Last synced: 09 Apr 2026

https://github.com/kiok46/subset-sum-problem

Solving the Subset Sum Problem using Python, Pandas and Numpy.

numpy pandas python subset-sum

Last synced: 05 May 2026

https://github.com/bagusperdanay7/absa-with-bilstm-undergraduate-thesis

My undergraduate thesis program, Aspect-Based Sentiment Analysis Towards Matket Place Application Review Using Bidirectional Long Short-Term Memory used Python, Keras and Tensorflow

ai aspect-based-sentiment-analysis bilstm deep-learning gensim imbalanced-learning ipython-notebook keras machine-learning matplotlib natural-language-processing nltk numpy pandas python scikit-learn seaborn tensorflow

Last synced: 11 Apr 2026

https://github.com/xkomil/datasciencesummerstudy

I want to document in this repository all my studying data science topics

data-science data-visualization machine-learning meteostat numpy pandas seaborn sklearn streamlit-webapp

Last synced: 05 May 2026

https://github.com/adijo/multilayeredperceptron

An implementation of a multi-layered perceptron.

deep-learning machine-learning numpy

Last synced: 01 May 2026

https://github.com/vigneshvaranasi/breast_cancer_detection

This project employs machine learning, focusing on Logistic Regression, to detect breast cancer using tumor-related features. The dataset is preprocessed, and the model achieves 100% accuracy on the test set. The goal is to gain insights into breast cancer factors and provide an effective detection solution.

jupyter machine-learning matplotlib numpy pandas python scikit-learn

Last synced: 09 Apr 2026

https://github.com/darkdk123/handwashing-discovery-analysis

A Guided Project in a Boot camp to Analyse the Original Data used in the Discovery of Viruses & Hand Washing By Dr. Ignaz Semmelweis in Vienna General Hospital in the 1840s.

data-analysis data-science data-visualization matplotlib-pyplot numpy pandas plotly-python python seaborn-plots

Last synced: 09 Apr 2026

https://github.com/sakshithbillava/expense-manager

A web-based expense tracking app built with Python and Streamlit, featuring real-time updates, data visualization, user authentication, and MongoDB integration.

authentication data-visualization expense-manager matplotlib mongodb numpy pandas personal-finance python streamlit webapp

Last synced: 09 Apr 2026

https://github.com/nagipragalathan/python_tutorial_for_data-science

This repository is a comprehensive guide for learning data science using Python. It covers various essential libraries and tools commonly used in the field of data science, including Jupyter Notebook, Matplotlib, NumPy, Pandas, Scikit-learn, and PyTorch.

datascience datavisualization deeplearning jupyter jupyter-notebook learning-by-doing learningresources machinelearning matplotlib numpy opensource pandas python python-script python3 pytorch pytorch-implementation scikitlearn tutorial

Last synced: 09 Apr 2026

https://github.com/muavia1/roman-urdu-poetry-generation-using-lstm

Here’s a short description you can add to your GitHub project: Roman Urdu Poetry Generator A deep learning project using LSTM and TensorFlow to generate Roman Urdu poetry. Trained on a poetry dataset and deployed with a Gradio interface for interactive text generation.

deep-learning gradio gradio-interface lstm model-training numpy pandas poetry-generator python tensorflow

Last synced: 09 Apr 2026

https://github.com/sridharyadav07/machine-learning-project-bankruptcy-prevention-

The project explores multiple machine learning algorithms and evaluates their performance using various metrics, such as accuracy and confusion matrices. The models tested include Logistic Regression, K-Nearest Neighbors (KNN), Naive Bayes, and Support Vector Machine (SVM). In addition, regularization techniques (L1, L2) are used to avoid overfit.

data-preprocessing evaluation machine-learning-models matplotlib-pyplot modelbuilding modeldeployment numpy pandas python scikit-learn seaborn

Last synced: 09 Apr 2026

https://github.com/jpgiant/training_project

Analyzing whether there is a difference between the average death ages of left handers and right handers using Bayesian Conditional Probability Theorem.

bayesian-statistics data-analysis data-visualization numpy pandas-dataframe python

Last synced: 30 Apr 2026