An open API service indexing awesome lists of open source software.

NumPy

NumPy is an open source library for the Python programming language, adding support for large, multidimensional arrays and matrices, along with a large collection of high-level mathematical functions to operate on these arrays.

https://github.com/kkkaan/tensor-library-403-hw1

A NumPy-like library for tensor (gergen) operations.

deep-learning numpy python tensor

Last synced: 23 Feb 2025

https://github.com/winterwind/mlp_and_cnn

A short two-part project applying an MLP and a CNN to ECG signal data.

cnn convolutional-neural-network csv-files jupyter jupyter-notebook mlp neur numpy python pytorch

Last synced: 16 Jun 2025

https://github.com/1ayanabil1/100-days-of-python-bootcamp

Join me on my journey to code in Python every day for 100 days! 🐍 This challenge is designed to sharpen my programming skills, explore Python libraries, and build cool projects along the way.

data-structures data-structures-and-algorithms data-visualization django flask machine-learning matplotlib numpy pandas python seaborn web-development

Last synced: 23 Feb 2025

https://github.com/chopikus/raytracing-bench

Benchmarking ray tracers in Python, Java, NumPy, CuPy, and PyPy

cupy java numpy pypy python raytracing

Last synced: 16 Jun 2025

https://github.com/xanonymous-github/multimediax

A collection of class projects from the Multimedia Technologies and Applications class, NTUT.

deep-learning machine-learning numpy opencv-python

Last synced: 22 Feb 2025

https://github.com/azaz9026/house_price_prediction_with_machine_learning

🏡 House Price Prediction with Machine Learning Project 🤖💰 Seeking to revolutionize the real estate industry with cutting-edge technology? Explore my latest project, where I leveraged machine learning algorithms to predict house prices accurately.

linear-regression machine-learning-algorithms metaploit numpy pandas pickle python seaborn visualization

Last synced: 12 Sep 2025

https://github.com/pyladiesams/python-oss-sprints-mar2024

Sprints to contribute to the most popular data and AI Python OSS libraries

conda dask geopandas jupyter matplotlib numpy pandas pymc sklearn

Last synced: 22 Feb 2025

https://github.com/separatrixxx/crypto_labs_6_sem

👓 Laboratory work for the 6th semester of MAI on cryptography

cryptography matplotlib numpy python

Last synced: 29 Dec 2025

https://github.com/mohammedsaqibms/planar_data_classification_with_onehidden_layer

This repository implements a simple neural network for binary classification of 2D planar data using Python and NumPy. It compares logistic regression with neural networks and includes code for forward/backward propagation, gradient descent, and decision boundary visualization.

backpropagation binary-classification gradient-descent logistic-regression machine-learning neural-networks numpy python3

Last synced: 10 Jul 2025

https://github.com/mbarbetti/hello-snakemake

Simple implementation of snakemake to sample and plot data

hello-world matplotlib numpy python snakemake

Last synced: 22 Feb 2025

https://github.com/vishal-verma-96/honey_production_analysis_using_python

In this project, I analyzed U.S. honey production from 1998 to 2021 across different U.S. cities, explored yields, and examined production values using Matplotlib and Seaborn visualizations.

matplotlib numpy pandas python-3 python-visualization seaborn

Last synced: 22 Feb 2025

https://github.com/hariprasath-v/av-dataverse-hack---insurance-claim-prediction

Create a machine learning model to predict if the policyholder will file a claim in the next 6 months or not based on the set of car and policy features.

analyticsvidhya classification exploratory-data-analysis f1-score matplotlib numpy pandas python randomforest-classification scikit-learn seaborn shap

Last synced: 02 Mar 2025

https://github.com/gregoritsch3/project_python_jobs_data_analysis

A multi-library Data Analysis Python project that draws useful insights from existing job data. It provides useful exploratory insight about job postings in the Data sector, such as the distribution of remote/non-remote jobs, jobs with/without healthcare, top paying and most demanded data roles, and more.

matplotlib numpy pandas python scipy seaborn

Last synced: 30 Dec 2025

https://github.com/hariprasath-v/av_job-a-thon-june-2022

Machine learning model to predict if the user would buy the product in the next 3 months or not.

matplotlib numpy pandas pycaret python seaborn sklearn

Last synced: 01 Sep 2025

https://github.com/hariprasath-v/av-job-a-thon-november-2022

Build a machine learning/deep learning approach to forecast the total energy demand on an hourly basis for the next 3 years based on past trends.

exploratory-data-analysis kaggle lightgbm-regressor matplotlib numpy pandas python rmse-score seaborn sklearn statsmodels timeseries-forecasting

Last synced: 02 Mar 2025

https://github.com/hariprasath-v/hackerearth_get-a-room-ml-hackathon

Build a Machine Learning model to identify the habitability score of the property based on the property's basic information and location-based information.

exploratory-data-analysis machine-learning numpy pandas pycaret python randomforestregressor regression scikit-learn seaborn

Last synced: 02 Mar 2025

https://github.com/hariprasath-v/machinehack_analytics_olympiad_2023

Create a machine learning model to determine the likelihood of a customer defaulting on a loan based on credit history, payment behavior, and account details.

binaryclassification catboost exploratory-data-analysis machine-learning numpy pandas python scikit-learn shap

Last synced: 02 Mar 2025

https://github.com/moritzsalla/cci-advanced-visualization

Classwork for Mick Grierson's module at the Creative Computing Institute

matplotlib numpy openframeworks python tensorflow

Last synced: 13 Apr 2025

https://github.com/hariprasath-v/zindi_umojahack-india-income-prediction-challenge

Create a machine learning model to predict whether an individual earns above 50,000 in a specific currency or not.

catboost-classifier classification exploratory-data-analysis f1-score matplotlib numpy pandas python scikit-learn seaborn shap zindi

Last synced: 02 Mar 2025

https://github.com/kunalshelke90/students_marks_prediction_in_exams

Student performance evaluation using feature engineering, feature extraction, data manipulation, data analysis, and data visualization. Later deployed on AWS with the help of Docker.

dokcerfile flask githubactions machine-learning matplotlib mlops numpy pandas python seaborn

Last synced: 22 Feb 2025

https://github.com/davityak03/english-audio-to-hindi-text-using-pretrained-model

The primary goal of this project is to transcribe English audio and then translate the transcribed text into Hindi.

adam-weight-decay css flask html nlp numpy python tensorflow tokenizer transformers translation

Last synced: 22 Feb 2025

https://github.com/arserchung/ml_cnn

Basic CNN intro and a project to recognize cats and dogs

glob keras keras-models numpy opencv os sklearn tensorflow

Last synced: 08 Sep 2025

https://github.com/ahmedsamir45/demograghic-data-analyzer

Answers some questions about the data

demographic-data-analyzer numpy pandas

Last synced: 11 Apr 2025

https://github.com/werctfourth/python-border-autocrop

A Python script that crops borders from images.

border crop image-processing numpy pillow python python3

Last synced: 17 Mar 2025

https://github.com/eoastafurov/numpy-neural-net

Simple and extendable library for deep learning in NumPy

deep-learning mlp-classifier numpy

Last synced: 02 Mar 2025

https://github.com/syedsaadahmed/playing-with-pandas-numpy-matplotlib

Practicing pandas, Matplotlib, and NumPy on different datasets, with all the pieces collected in this repository

jupyter-notebook matplotlib numpy pandas pandas-numpy-matplotlib python3

Last synced: 22 Feb 2025

https://github.com/iadtya/hate-text-classification

Training an RNN (LSTM) on hate speech with TensorFlow and Keras

fastapi keras lstm nltk numpy pandas tenserflow

Last synced: 22 Feb 2025

https://github.com/thoughtscript/python_api_2023

Some fun with Flask, Python, and Machine Learning

angular docker docker-compose flask machine-learning mysql numpy

Last synced: 30 Dec 2025

https://github.com/amnydv17/algorithm-visualizer

This project aims to offer a visual representation of how different algorithms work, making it easier for developers, students, and enthusiasts to understand their functionalities and behaviors through graphical demonstrations.

matplotlib numpy python3 seaborn

Last synced: 22 Feb 2025

https://github.com/pradyumnakr/facial-recognition-using-opencv

Uses Haar cascades to recognize facial features and patterns, then performs feature extraction followed by labelling the data to recognize faces on a live webcam.

numpy opencv python3

Last synced: 13 Apr 2025

https://github.com/prashhhant213/cardioflex-treadmill-analysis-using-descriptive-statistics-probability

Descriptive analysis and visualization of CardioFlex treadmill data to provide insights and recommendations for improving their user base.

colab-notebook numpy pandas probability python stats

Last synced: 30 Dec 2025

https://github.com/amnydv17/audio-analysis-and-classification

This project demonstrates how to analyse audio datasets and apply existing machine learning/deep learning techniques to classify various environmental sounds...

jupyter-notebook matplotlib numpy pandas

Last synced: 22 Feb 2025

https://github.com/dmarks84/ind_project_data-science-london-scikit-learn--kaggle

Independent Project - Kaggle Competition -- I worked on the Data Science London data set for the Data Science London + Scikit-learn competition.

classification cross-validation data-modeling data-reporting data-visualization dataframes eda grid-search matplotlib numpy pandas python sklearn statistics supervised-ml

Last synced: 30 Dec 2025

https://github.com/yalai92/cava_imp_exp_analysis

Analysis, visualization, preprocessing and clustering of global sparkling wine trade (2017–2024) using Python in Colab and ML to reveal trends and country profiles.

clustering colab-notebook data-science data-visualization machine-learning matplotlib numpy pandas preprocessing python sklearn tableau-public

Last synced: 25 Jan 2026

https://github.com/abhinav330/credit-score-classification-problem

This project aims to analyze credit card customer data, clean the dataset using libraries such as Pandas and NumPy, and then create a machine learning model using sklearn.

credit-card credit-card-fraud credit-card-fraud-detection credit-risk credit-scoring data-science machinelearning numpy pandas python sklearn

Last synced: 08 Sep 2025

https://github.com/nischalkshaj/image-identification

This is a repository for AI image training for beginners.

express mongodb node numpy pillow python3 pytorch reactjs

Last synced: 30 Dec 2025

https://github.com/arnie1x/massive-dataset-lab

This project involves basic data manipulation with JSON files, focusing on tasks related to data processing of the MASSive dataset and file management.

data-science numpy pandas pandas-dataframe python3

Last synced: 17 Jun 2025

https://github.com/anufrievsy/man_women

This script was created to find subjectively interesting episodes of the TV show "Мужское Женское"

bs4 clear-output fuzzywuzzy ipython ipython-display jupyter jupyter-notebook jupyter-notebooks matplotlib matplotlib-pyplot numpy pandas pandas-dataframe python requests time youtube youtube-search

Last synced: 11 Jul 2025

https://github.com/evernext10/coursera_capstone

This repository was created for submitting the IBM Data Science Lab and will be used primarily for the Capstone project.

coursera-data-science ibm numpy pandas

Last synced: 17 Jun 2025

https://github.com/batthulavinay/indian-food-insights---exploratory-data-analysis-eda

This project performs an Exploratory Data Analysis (EDA) on an Indian food dataset to uncover insights about different cuisines, ingredients, nutritional values, and regional food trends.

data-visualization exploratory-data-analysis matplotlib numpy pandas python seaborn

Last synced: 18 Jun 2025

https://github.com/mah-22/room-occupancy-prediction-using-environmental-sensor-data

This project uses environmental sensor data to predict room occupancy, providing valuable insights for efficient energy management and space utilization in buildings. By analyzing factors like temperature, humidity, and light levels, the model aims to accurately forecast when rooms will be occupied, optimizing resources and enhancing overall buildi

classification data-science data-visualization exploratory-data-analysis machine-learning numpy pandas python seaborn time-series

Last synced: 11 Jul 2025

https://github.com/saket-sk/data-science-lab

List of assignments I completed as part of the Honorary Course of Data Science and Machine Learning Lab.

data-cleaning data-science jupyter-notebook jupyter-notebook-extension numpy numpy-arrays pandas pandas-dataframe pandas-python python

Last synced: 08 Aug 2025

https://github.com/bishopce16/school_district_analysis

The school board requested an analysis on the various performance metrics for the school district.

data-analysis jupyter-notebook numpy pandas python visual-studio-code

Last synced: 11 Jul 2025

https://github.com/yulia-momotyuk/dla-data-analysis-practice

This repository contains my homework assignments completed during the "Data Analyst in IT" course at Data Loves Academy.

analytics data-analysis data-visualization excel mysql numpy pandas postgres powerbi python seaborn sql tableau

Last synced: 18 Jun 2025

https://github.com/mpbeatriz/santander_coders_2024

Repository of exercises and projects from Ada's Data Engineering course in partnership with Santander (in progress).

apache-kafka apache-spark api data-engineering data-extraction fastapi flask numpy oop pandas python webhooks

Last synced: 30 Dec 2025

https://github.com/crazy-dot/zomato-data-analysis

This project analyzes 50k Bengaluru restaurants from Zomato, focusing on 17 features like location and ratings. It cleans, explores, and visualizes data to improve services. Key visualizations include delivery, booking, location, and cost. The goal is to provide insights for better customer experiences.

data-cleaning-and-preprocessing data-manipulation-with-pandas inferential-statistics kaggle-dataset numpy pandas-python python zomato-data-analysis

Last synced: 21 Mar 2025

https://github.com/amr-yasser226/pagerank-power-method

Modular Python implementation of the PageRank algorithm via the power method, with discrete‑math foundations, end‑to‑end code, and experimental validation.

discrete-mathematics graph-algorithms markov-chain network-analysis networks numpy pagerank power-method python scipy

Last synced: 18 Jun 2025

https://github.com/pointer2alvee/machine-learning-algorithms-scratch

Core machine learning concepts, algorithms, and maths implemented completely from scratch using only Python and NumPy.

machine-learning-algorithms numpy pandas probabilistic-machine-learning python statistical-models

Last synced: 08 Sep 2025

https://github.com/dummumounika/ecommerce-sales-categorization

This repository contains Python code for text classification and analysis of e-commerce sales data. The script processes textual descriptions of products and categorizes them into predefined categories using a Naive Bayes classifier. It also includes various analysis and visualization methods to explore the dataset.

machine-learning matplotlib-pyplot ntlk numpy pandas python scikit-learn

Last synced: 22 Feb 2025

https://github.com/rijul007/smartwatch-data-analysis-using-python

Smartwatch Data Analysis to uncover insights into health and activity patterns using Python for data cleaning, exploratory analysis, and interactive visualizations.

data-analysis data-science data-visualization exploratory-data-analysis jupyter-notebook matplotlib numpy pandas plotly python

Last synced: 19 Jun 2025

https://github.com/rakibhhridoy/differentprojects

Some of the learning projects I practiced on while getting started in data science. Not all of them, only the few that were stored in my local repository. They may be useful for beginner data science enthusiasts. Explore and learn!

data-science deep-learning machine-learning mathematics matplotlib numpy pandas python scikit-learn seaborn statistics

Last synced: 30 Dec 2025

https://github.com/fandreuz/python-serialization-benchmark

Low-effort benchmark of several serialization libraries for scientific applications

avro json numpy protobuf python serialization

Last synced: 29 Dec 2025

https://github.com/subhashpolisetti/crispdm-semma-kdd-workflows

This repository demonstrates three data mining methodologies applied to various real-world datasets: CRISP-DM (Weather Analysis), KDD (Social Media Ads Analysis), and SEMMA (Spotify Recommendation System). Each project includes data exploration, preprocessing, modeling, and evaluation steps, along with comprehensive documentation, supporting files,

clustering-algorithm crisp-dm kdd latex-template machine-learning numpy pandas random-forest research-paper semma

Last synced: 19 Jun 2025

https://github.com/sadegh-khedry/pneumonia-detection-using-cnn

Classifying chest X-ray images into two categories: Pneumonia or Normal using a Convolutional Neural Network (CNN).

cnn convolutional-neural-networks cv2 deep-learning harris-corners joblib jupyter-notebook keras kmeans-clustering matplotlib numpy orb pandas python seaborn shutil sifit tensorflow

Last synced: 19 Jun 2025

https://github.com/neetigyab/pdf_form_parser

Older test version of the designated PDFReader project

excel forms matplotlib numpy ocr opencv pandas pdf pdf-converter pdfplumber pytesseract python

Last synced: 17 Mar 2025

https://github.com/hyperplasma/olympic-visualization-analysis

Multidimensional analysis and visualization of Olympic medals, economy, and happiness index. Explore the relationship between national sports performance and socioeconomic indicators with rich statistical charts.

data-analysis data-visualization matplotlib numpy pandas python wordcloud

Last synced: 19 Jun 2025

https://github.com/aqueeqazam/a-deep-dive-introduction-to-tensorflow

TensorFlow is a deep learning system that provides an interface for expressing deep learning algorithms and implementing them. It uses dataflow graphs to represent computation, shared state, and the operations that mutate that state.

deep-learning neural-network numpy tensor tensorflow

Last synced: 03 Aug 2025

https://github.com/aysh2603/credit-card-fraud-detection

The Credit Card Fraud Detection project aims to identify fraudulent transactions from a dataset of credit card transactions. The project addresses the challenge of class imbalance and employs advanced machine learning techniques to build an effective fraud detection model.

ensemble-learning hyperparameter-tuning numpy pandas python3 scikit-learn streamlit

Last synced: 11 Jul 2025

https://github.com/abdelrahman-amen/face_recognition

This project implements a real-time face recognition system using Python, OpenCV, and the face_recognition library. It identifies individuals by comparing facial features from live video streams with pre-stored encodings, enabling fast and accurate recognition.

cmake cv2 dlib-face-detection face-recognition numpy opencv os python

Last synced: 17 Mar 2025

https://github.com/sorna-fast/customer_personality_clustering

Customer behavior analysis, data analysis, data frame analysis, and clustering using machine learning

matplotlib-pyplot numpy pandas-dataframe plotly-express seaborn-plots sklearn-clustering sklearn-decomposition sklearn-metrics

Last synced: 16 Jun 2025

https://github.com/parbhat-cpp/suicidal-ml

A machine learning/NLP-based system to identify signs of suicidal ideation from user text inputs.

bash cicd classification docker fastapi githubactions jinja2 jupyter-notebook machine-learning natural-language-processing nlp numpy pandas python scikit-learn

Last synced: 30 Dec 2025

https://github.com/itzzmeakhi/ab-testing

This project was done as part of Udacity's Data Analyst Nanodegree program. A/B testing is performed on users in treatment and control groups to decide whether changes should be made to a website or whether it is better to keep the existing one, based on the performance of the old and new pages. Hypothesis testing, a z-test, and logistic regression are used to predict results.

bootstrapping hypothesis-testing jupyter-notebook matplotlib numpy pandas python statistics z-test

Last synced: 20 Jun 2025

https://github.com/rakibhhridoy/predictingsequence-timeseriesanalysis

Easy handling of time series data in TensorFlow using functional programming in Python. Time series data is handled and pre-processed in several steps, such as windowing the data; splitting the data for modeling also differs from other processing methods. Everything is combined here for easy access.

forecasting functional-programming keras lstm lstm-neural-networks moving-average numpy python tensorflow time-series windowed

Last synced: 30 Dec 2025

https://github.com/hadson0/dip-candy-classification

Candy classification project using digital image processing. Includes dataset creation, model development, and evaluation tools with TensorFlow, OpenCV, and Albumentations. Complete setup and instructions provided for building and training the model.

albumentations bounding-boxes cnn computer-vision cvat image-classification image-processing keras machine-learning matplotlib numpy opencv python seaborn sklearn tensorflow

Last synced: 29 Sep 2025

https://github.com/akshaypatra/cardiovascular_disease_detection

AI-driven ECG classification model that detects cardiovascular abnormalities such as arrhythmia and atrial fibrillation using a hybrid CNN-LSTM deep learning approach.

keras matplotlib numpy pandas python3 scikit-learn seaborn tensorflow wfdb

Last synced: 21 Mar 2025

https://github.com/reshiadavan/musica

A music composer AI that produces music and associated transcripts when given MIDI files.

keras lstm-ann music21 numpy pickle python

Last synced: 02 Mar 2025

https://github.com/vishal-verma-96/Pre-Owned-Car-Price-prediction-using-Streamlit-App

Capstone Project by skill Academy: exploratory analysis, visualization, and prediction of used car prices. The highest-scoring model is deployed as a Streamlit web app.

data-analysis data-science jupyter-notebook machine-learning machine-learning-algorithms matplotlib numpy pandas python3 regression-algorithms scikit-learn seaborn streamlit

Last synced: 02 Mar 2025

https://github.com/lorenzorottigni/ml-interview

Machine Learning Python bootcamp: K Nearest Neighbors job interview simulation dataset

ipynb k-nearest-neighbours machine-learning numpy pandas python scikit-learn seaborn

Last synced: 30 Dec 2025

https://github.com/michaeldehaney94/opencv

Python OpenCV computer vision projects

matplotlib numpy opencv-python python

Last synced: 20 Jun 2025

https://github.com/prakashjha1/email-sms-spam-classifier

Many email services today provide spam filters that are able to classify emails into spam and non-spam email with high accuracy.

machine-learning natural-language-processing numpy pandas python

Last synced: 20 Jun 2025

https://github.com/azaz9026/loan_approval_prediction

Welcome to the Loan Approval Prediction repository! This project aims to build a predictive model that can determine whether a loan application should be approved or denied based on various features. Purpose: The goal of this repository is to develop a machine learning model that can accurately predict loan approval decisio

data data-analysis data-visualization eda machine-learning numpy pandas python statistics

Last synced: 30 Dec 2025

https://github.com/dmarks84/coursework_project_nlp-with-nltk

Project for the University of Michigan Applied Data Science Specialization -- Utilized the NLTK library to process natural language, and then built several spelling recommenders for a list of misspelled words.

data-modeling databases dataframes eda nlp numpy pandas python reporting statistics text-mining visualization

Last synced: 28 Oct 2025

https://github.com/mayankmittal29/duplifinder-quora-clone-catcher

An advanced system for detecting semantically duplicate question pairs using cutting-edge NLP techniques. Combines traditional ML models (XGBoost, SVM, Random Forest) with deep learning architectures (BiLSTM, Siamese Networks, Transformers) and contextual embeddings (BERT, RoBERTa). Features engineered using token similarity, fuzzy matching, and em

bert bilstm cross-validation eda fastext fuzzy-matching glove numpy pandas python3 quora-question-pairs random-forest roberta seaborn stemming svm tf-idf transformers word2vec xgboost

Last synced: 21 Mar 2025

https://github.com/sridharyadav07/machine-learning-project-bankruptcy-prevention-

The project explores multiple machine learning algorithms and evaluates their performance using various metrics, such as accuracy and confusion matrices. The models tested include Logistic Regression, K-Nearest Neighbors (KNN), Naive Bayes, and Support Vector Machine (SVM). In addition, regularization techniques (L1, L2) are used to avoid overfit.

data-preprocessing evaluation machine-learning-models matplotlib-pyplot modelbuilding modeldeployment numpy pandas python scikit-learn seaborn

Last synced: 30 Dec 2025

https://github.com/pranavv34/customer-segmentation

Predictive modeling and customer segmentation project using neural networks to forecast sales and categorize customers for targeted marketing in the online retail sector.

keras matplotlib numpy pandas python seaborn sklearn tensorflow

Last synced: 30 Dec 2025

https://github.com/saif-gitreps/face-mask-detection

A CNN classification model, built from scratch and scoring 94% accuracy, that detects whether a person in an image is wearing a mask.

cnn-classification image-classification image-processing kaggle keras numpy pandas

Last synced: 30 Dec 2025

https://github.com/mdaltamashalam/uber-fare-prediction-models

Predicts the fare amount of Uber rides based on various factors such as pickup/drop-off coordinates, passenger count, and trip distance.

catboost data-analysis data-cleaning data-visualization lgbm-regressor machine-learning matplotlib numpy pandas python random-forest regression-models skit-learn xgboost-algorithm

Last synced: 20 Jun 2025

https://github.com/fimbres/emotion-recognition

This is an AI project developed in Python to recognize emotions in people.

ai kaggle-dataset keras numpy opencv python tensorflow

Last synced: 30 Dec 2025