An open API service indexing awesome lists of open source software.

NumPy

NumPy is an open source library for the Python programming language, adding support for large, multidimensional arrays and matrices, along with a large collection of high-level mathematical functions to operate on these arrays.

https://github.com/lexxai/goit_python_ds_hw_03

Module 3. Classical machine learning. Overfitting. Linear regression. LaTeX formulas.

latex linear-regression matplotlib numpy pandas python scikit-learn

Last synced: 09 Apr 2026

https://github.com/linsanity03/algorithmic_trading

Trading strategy based on the intersection of short-term and long-term moving averages.

jupyter-notebook matplotlib-pyplot numpy pandas

Last synced: 04 May 2026
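
As a rough illustration of the moving-average crossover idea described in this entry, here is a minimal pure-Python sketch. It is not taken from the repository: the function names, the window sizes (3 and 5), and the buy/sell labels are all assumptions chosen for the example.

```python
def moving_average(values, window):
    """Simple moving average; entry i averages values[i : i + window]."""
    return [sum(values[i:i + window]) / window
            for i in range(len(values) - window + 1)]


def crossover_signals(prices, short_window=3, long_window=5):
    """Return (day_index, 'buy' | 'sell') pairs where the short average
    crosses the long average. Window sizes are illustrative assumptions."""
    short_ma = moving_average(prices, short_window)
    long_ma = moving_average(prices, long_window)
    # Drop the leading short-window entries so both series end on the same day.
    short_ma = short_ma[long_window - short_window:]

    signals = []
    for i in range(1, len(long_ma)):
        prev_diff = short_ma[i - 1] - long_ma[i - 1]
        diff = short_ma[i] - long_ma[i]
        day = i + long_window - 1  # index into the original price list
        if prev_diff <= 0 < diff:
            signals.append((day, "buy"))   # short average crossed above long
        elif prev_diff >= 0 > diff:
            signals.append((day, "sell"))  # short average crossed below long
    return signals


prices = [10, 9, 8, 7, 6, 7, 9, 12, 15, 14, 11, 8, 5]
print(crossover_signals(prices))
```

A real strategy would likely use longer windows and account for transaction costs; the sketch only shows how the crossing points are detected.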

https://github.com/secary/maths7027

Mathematical Foundations of Data Science

latex mathematics numpy pandas

Last synced: 04 May 2026

https://github.com/asuquoaa/cdc_2017_immunization_data_analysis

This project analyzes data from the CDC's 2017 National Immunization Survey Public Use File (NISPUF17) to answer research questions about immunization, child health, and vaccine-related behaviors.

numpy

Last synced: 12 Apr 2026

https://github.com/humankernel/hands-on-ml

Practice of Machine Learning

machine-learning matplotlib numpy pandas sklearn

Last synced: 08 Jun 2026

https://github.com/thorbeorn/epsi-m1-datamanagement-etude-de-cas

This project implements a complete HR data processing pipeline following a company merger.

dashboard duckdb epsi etl kaleido numpy pandas plotly python pyyaml

Last synced: 11 Apr 2026

https://github.com/testetson22/abelian_sandpiles

Simple App that implements, builds and tests an approximation of the Abelian sandpile model

github-actions numpy pytest

Last synced: 12 Apr 2026

https://github.com/bilgeswe/datascience

A statistics and data science project in Python using Google Colab tools with CSV and XLSX files.

box-plot colab-notebook cvs data-science data-scraping data-visualization heatmaps numpy python statistical-analysis statistics xlsx

Last synced: 29 Apr 2026

https://github.com/manoj-kumar-c/data_analyst_projects

Live Data Analyst Projects.

excel numpy pandas powerbi

Last synced: 07 Feb 2026

https://github.com/chandkund/iris-classification

Iris Classification project using machine learning to classify iris flowers into Setosa, Versicolour, and Virginica species. Includes data exploration, model training with SVM, Logistic Regression, and Decision Tree, and performance evaluation. Contributions welcome! Python, Pandas, Scikit-learn, Seaborn, Matplotlib.

machine-learning matplotlib-pyplot numpy pandas python sklearn

Last synced: 21 Jan 2026

https://github.com/elmezianech/email-spam-ham-classifier-nb-hard

Naive Bayes Email Classifier: An implementation of a 'hard' Naive Bayes classifier in Python to categorize emails as spam or ham. This code performs extensive data preprocessing, probability calculations, and model training for email classification using the raw Naive Bayes algorithm.

ai emails jupyter-notebook machine-learning ml naive-bayes naive-bayes-classifier numpy pandas python spam-detection spam-filtering

Last synced: 11 Apr 2026

https://github.com/nambukeerthi/youtube_dataproject

Build a Streamlit App – A user-friendly interface. Use Google API – Fetch YouTube channel data. Store in SQL Database – Save channel details for easy access. Search & Join Tables – Allow users to query and view data

api numpy pandas python sql streamlit

Last synced: 09 Apr 2026

https://github.com/richardbmk/datascience_machinelearning

Projects related to data science and machine learning.

data-science machine-learning matplotlib numpy pandas scikit-learn scipy seaborn

Last synced: 11 Apr 2026

https://github.com/0nom4d/306radiator

Mathematical EPITECH Project - This project reproduces the behaviour of a radiator placed at a position in a room of a given size.

epitech epitech-project numpy pytest python3

Last synced: 29 Apr 2026

https://github.com/zhanziyuan/pixelpuzzle-web

Encode/decode images using Base64 or shuffle/recover the pixels of images.

anime encryption image image-processing numpy python

Last synced: 11 Apr 2026

https://github.com/yosrak5/predictive_maintenance

End-to-end predictive analytics to predict hardware maintenance before it occurs, using data preprocessing and machine learning classification models (XGBoost, Random Forest).

data-factory data-perspective machine-learning-workbench matplotlib numpy pandas python random-forest seaborn xgboost

Last synced: 25 Feb 2026

https://github.com/shwetapardhi/assignment-1-q11--basic-statistics-level-1

Q11) Suppose we want to estimate the average weight of an adult male in Mexico. We draw a random sample of 2,000 men from a population of 3,000,000 men and weigh them. We find that the average person in our sample weighs 200 pounds, and the standard deviation of the sample is 30 pounds. Calculate the 94%, 96%, and 98% confidence intervals.

confidence-interval normalization numpy pandas python scipy stats

Last synced: 13 Apr 2026
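
The question in this entry can be worked through with a short sketch using only the Python standard library. It is not the repository's code. It assumes a normal (z) approximation, which is reasonable for a sample of 2,000, and ignores the finite-population correction because the sample is a tiny fraction of the population; the function name is hypothetical.

```python
from math import sqrt
from statistics import NormalDist


def mean_confidence_interval(sample_mean, sample_sd, n, confidence):
    """Two-sided z-interval for a population mean (normal approximation)."""
    # Critical value z such that the central `confidence` mass lies within +/- z.
    z = NormalDist().inv_cdf(1 - (1 - confidence) / 2)
    margin = z * sample_sd / sqrt(n)  # z times the standard error of the mean
    return sample_mean - margin, sample_mean + margin


# Values from the question: mean 200 lb, standard deviation 30 lb, n = 2,000.
for level in (0.94, 0.96, 0.98):
    low, high = mean_confidence_interval(200, 30, 2000, level)
    print(f"{level:.0%} CI: ({low:.2f}, {high:.2f})")
```

Higher confidence levels give wider intervals, since the critical value z grows while the standard error stays fixed.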

https://github.com/murugavl/loan_approval_prediction

This project is a Loan Approval Prediction System that uses Machine Learning to determine whether a loan application should be approved or rejected based on various factors. It is deployed using Streamlit for an interactive user experience.

jupyter-notebook machine-learning numpy pandas python random-forest-classifier scikit-learn stremlit

Last synced: 13 Apr 2026

https://github.com/pedasoft-consult/house-price-prediction

This project is a machine learning-powered web application for predicting house prices based on various features such as area, bedrooms, bathrooms, and more. It uses Flask for the backend and HTML/CSS/JavaScript for the frontend.

flask joblib matplotlib numpy pandas r2 random-forest regression seaborn sklearn

Last synced: 13 Apr 2026

https://github.com/bachboy0/montecarlo

Python-based Monte Carlo simulations for statistical analysis and modeling.

monte-carlo-methods numpy python

Last synced: 21 Apr 2026

https://github.com/kzon94/torn-market-analyzer

Streamlit app that parses Torn Add Listing text, matches items with a custom dictionary, fetches market data via the public API, and generates KPIs and price recommendations using a modular Python analytics pipeline.

data-analysis data-engineering fuzzy-matching market-analytics numpy pandas python streamlit torn-city torn-city-api

Last synced: 11 Apr 2026

https://github.com/dineshdhamodharan24/industrial-copper-modeling

A project that models industrial copper data using Python and several libraries such as pandas, NumPy, and scikit-learn, which are widely used for data manipulation, analysis, and machine learning tasks.

bussiness-solution numpy pandas pickle pickle-file seaborn sklearn-library streamlit

Last synced: 13 Apr 2026

https://github.com/hansalemaos/cinpy

C/C++ in Python for Dummies

c cpp ctypes numpy python

Last synced: 13 Apr 2026

https://github.com/alphacrypto246/stock-price-movement-prediction

A project leveraging Polynomial Regression to predict stock price movements based on historical data. Includes data preprocessing, feature engineering, visualization, and model evaluation to provide insights for informed trading decisions.

machine-learning numpy pandas polynomial-regression scikit-learn yfinance

Last synced: 13 Apr 2026

https://github.com/jim-by/tweet-sentiment-analysis

Sentiment analysis of tweets using TextBlob for labeling and RandomForest for classification.

nltk nltk-tokenizer numpy pandas python random-forest-classifier sklearn textblob-sentiment-analysis

Last synced: 13 Apr 2026

https://github.com/ashraf-khabar/bank-marketing-data-analysis

This project is focused on analyzing bank marketing data using PyTorch, pandas, numpy, and scikit-learn. The goal is to build a predictive model that can help identify potential customers who are more likely to subscribe to a bank's term deposit.

data-cleaning data-science data-visualization dataset deep-learning deep-neural-networks feedforward-neural-network learning neural-networks numpy pandas python pytorch sklearn

Last synced: 09 Apr 2026

https://github.com/williamfridh/umu_5tf078_paper_keyboard_tracker

Project exploring how a CNN alongside computer vision can be used to detect drawn keys and track finger input.

cnn-classification computer-vision keras-tensorflow numpy python tensorflow

Last synced: 07 Jan 2026

https://github.com/karanyeole/movie-rating-analysis

Analyzing movie ratings from various sources to gain insights into audience preferences, trends, and patterns in movie ratings. The project utilizes data analysis and visualization techniques to explore and interpret movie rating data, providing valuable information for movie enthusiasts, filmmakers, and industry professionals.

matplotlib numpy pandas python seaborn

Last synced: 13 Apr 2026

https://github.com/rkarahul/ok.win-big-small-predictor

Predict the next “Big” or “Small” outcome on the OK.Win lottery-style game using OCR + time-series features + ML.

joblib numpy opencv-python paddleocr paddlepaddle paddlepaddle-gpu pandas python sckit-learn

Last synced: 13 Apr 2026

https://github.com/preethiphilip/prosper-loan-data-analysis

This project explores the Prosper loan dataset to uncover trends, borrower behaviors, and factors influencing loan performance. Using Python and data visualization techniques, the analysis examines key variables such as credit scores, loan status, income ranges, and debt-to-income ratios to gain insights into credit risk and lending patterns.

jupyter-notebook matplotlib numpy pandas python3 seaborn

Last synced: 13 Apr 2026

https://github.com/miss-mhv/udacityproject_dataviz

Data analysis of a bicycle rental system. The goal is to understand how trips are made according to age, time of year, day of the week, time of day

jupyter-notebook matplotlib numpy pandas python3 seaborn-plots

Last synced: 13 Apr 2026

https://github.com/prashhhant213/cardioflex-treadmill-analysis-using-descriptive-statistics-probability

Descriptive analysis and visualization of CardioFlex treadmill data to provide insights and recommendations for improving their user base.

colab-notebook numpy pandas probability python stats

Last synced: 11 Apr 2026

https://github.com/alinenog/desenvolve_gb_2022

Desenvolve 2022 training program from Grupo Boticário in the data field.

data-analysis data-science googlesheet machine-learning numpy pandas python

Last synced: 13 Apr 2026

https://github.com/imdeepmind/knn_from_scratch

KNN classifier in pure Numpy and Python

knn knn-classification machine-learning numpy python

Last synced: 11 Apr 2026

https://github.com/dulajkavinda/document-similarity-ml

📑 Check similarity between documents using TF-IDF and cosine similarity.

machinelearning numpy python scikit

Last synced: 29 Apr 2026

https://github.com/costopoulos/ntua-telecommunications

📻 📞 NTUA ECE Telecommunications Course Code & Report

numpy nyquist python sampling scipy telecommunications

Last synced: 20 Apr 2026

https://github.com/vyjayanthipolapragada/fraud_detection_creditcard

Detecting fraudulent credit card transactions by training a Decision Tree model using Scikit-learn and SnapML.

classification-model data-preprocessing decision-tree-classifier kaggle-dataset machine-learning numpy pandas python scikit-learn snapml time tree-model

Last synced: 11 Apr 2026

https://github.com/ebadshabbir/logistic_regression-binomial-

Logistic Regression on the Social Network Ads dataset. This project applies Logistic Regression to predict whether a user will purchase a product based on their age and estimated salary, using the Social Network Ads dataset. The data is split into training and test sets, with feature scaling applied for normalization.

classification jupyter-notebook logistic-regression machine-learning matplotlib-pyplot numpy pandas python sklearn

Last synced: 11 Apr 2026

https://github.com/vasu7052/recognizing-handwritten-digits

This is a machine learning project created in Python using Neural Networks and Supervised Learning Algorithms.

machine-learning machine-learning-algorithms numpy python scikit-learn

Last synced: 13 Apr 2026

https://github.com/guptamudit/whatsapp-chat-analyser

An online WhatsApp chat analysis tool designed to extract valuable insights from your group or one-on-one conversations. By analyzing your chat data, WhatsChat provides detailed statistics and visualizations.

matplotlib numpy pandas python seaborn

Last synced: 11 Apr 2026

https://github.com/mahdibehoftadeh/polynomial-regression-co2-emissions

A simple machine learning polynomial regression that uses a large dataset to learn and predict a car's CO2 emissions from its build features, such as engine size and number of cylinders.

machine-learning matplotlib numpy nural-network pandas polynomial-regression python scikit-learn

Last synced: 22 Feb 2026

https://github.com/samuele-lolli/data-analytics-techniques

A practical approach to data analytics pipeline.

numpy pandas pytorch scikit-learn

Last synced: 11 Apr 2026

https://github.com/amiriiw/text_classification

Welcome to the Text Classification Project! This project is designed to train a model for classifying texts based on their emotional content and then using it to categorize new texts into corresponding emotional categories.

keras numpy pandas pickle scikit-learn tensorflow text-classification

Last synced: 20 Jan 2026

https://github.com/debjyotisaha/web-application-projects

Web applications developed with the help of Python libraries and ML algorithms.

algorithms cufflinks machine-learning matplotlib numpy pandas prediction python scikit-learn seaborn sklearn streamlit

Last synced: 13 Apr 2026

https://github.com/athul64/tmdb-dataset-analysis

This dataset contains information about 10,000 movies extracted from TMDB. It covers movies from 1960 to 2015, including user ratings and revenue. Original data from Kaggle.

data-visualization dataframe eda numpy pandas python

Last synced: 14 Apr 2026

https://github.com/pranavsuriya-sr/e-hub_energysystems

Python-based web application for analysis and visualization of concepts in Energy Systems.

css energy-systems-analysis html javascript matplotlib numpy python schemdraw streamlit

Last synced: 11 Apr 2026

https://github.com/romerorodriguezd/homography-with-opencv

Homography operations applied to insert images inside others regardless of the perspective.

computer-vision homography matrices-transformation numpy opencv opencv-python

Last synced: 29 Apr 2026

https://github.com/ljadhav25/data-engineering-poc

This repository contains a beginner-level Data Engineering Proof of Concept (POC) project designed for practice. The objective is to provide hands-on experience with data engineering concepts, including data extraction, transformation, loading (ETL), and basic data analysis. This project is ideal for those looking to build foundational skills in da

data-analysis etl matplotlib numpy pandas python

Last synced: 13 Apr 2026

https://github.com/shubhamahobia/lstm-hamlet

The LSTM Next Word Prediction project trains a model on Hamlet to predict the next word in a sequence. After preprocessing the text, an LSTM network learns patterns in the language. The trained model can then generate text by predicting the next word in a given sequence, with its accuracy evaluated to ensure it effectively models the language.

deep-learning github hamlet lstm lstm-neural-network lstm-neural-networks machine-learning neural-network nltk numpy pandas pickle python

Last synced: 11 Apr 2026

https://github.com/amnydv17/landmark-detection

This project aims to leverage the power of deep learning models to automatically detect and pinpoint landmarks such as famous monuments, buildings, natural landmarks, and other recognizable structures within images.

machine-learning matplotlib numpy pandas python3 scikit-learn seaborn tensorflow

Last synced: 11 Apr 2026

https://github.com/nordic96/lanedetection

Lane Detection Module using OpenCV and Python for educational purposes.

computer-vision lane-detection matplotlib-pyplot numpy opencv python3

Last synced: 04 May 2026

https://github.com/samestrin/image-manipulation-api-digitalocean

A Python DigitalOcean App Platform based REST image manipulation API using Flask, NumPy, and OpenCV that runs in a Docker container.

api docker flask image image-processing numpy python python3 rest rest-api

Last synced: 13 Apr 2026

https://github.com/anshpg/vectorwave

The Jupyter Notebook (ipynb file) contained within this repository serves as the primary codebase for this project. It demonstrates how to preprocess the input data, train the logistic regression classifier, evaluate its performance, and save the trained model for future use.

collab jupyter-notebook matplotlib-pyplot numpy pandas python sklearn

Last synced: 11 Apr 2026

https://github.com/adamouization/python-machine-learning-data-science-notes

📙 Jupyter notebooks containing useful Python code and notes for general Machine Learning and Data Science projects.

data data-science data-visualization guide jupyter jupyter-notebook machine-learning matplotlib notes numpy pandas pandas-dataframe python seaborn

Last synced: 11 Apr 2026

https://github.com/jminjares4/selected-areas-in-networks

Assignments for Selected Areas in Networks (EE 5390)

matplotlib network-analysis numpy pandas python

Last synced: 08 May 2026

https://github.com/jbalooshie/movies-etl

Exercise working with movie datasets from Kaggle and Wikipedia. Python is used to extract, clean, and combine the data, and then it is loaded into a postgreSQL database.

data-analysis data-science jupyter-notebook numpy pandas postgresql postgresql-database python sqlalchemy

Last synced: 11 Apr 2026

https://github.com/danhenriquex/data_science_and_machine_learning

AI, Data Science, Machine Learning, and Deep Learning with TensorFlow

deep-learning deep-neural-networks machine-learning matplotlib numpy pandas tensorflow

Last synced: 13 Apr 2026

https://github.com/giordanodaloisio/demv

Debiaser for Multiple Variables, a model- and data- agnostic method to improve fairness in binary and multi-class classification tasks

bias-mitigation numpy pandas python

Last synced: 29 Apr 2026

https://github.com/unknownman/101-numpy

Learning NumPy through 101 short videos and Jupyter Notebook.

numpy

Last synced: 29 Apr 2026

https://github.com/ayushtiwari134/machine_learning_models

A repo where I upload all the models I train during my journey of learning machine learning from scratch.

linear-regression logistic-regression machinelearning matplotlib numpy pandas python random-forest scikit-learn

Last synced: 11 Apr 2026

https://github.com/chandkund/predicting-diabetes-onset

The "Predicting Diabetes Onset" project aims to build a machine learning model that predicts whether an individual has diabetes based on various health-related features. The dataset used for this project includes attributes related to medical history and physical measurements.

deep-learning numpy pandas python seaborn sklearn-library sklearn-metrics visualization

Last synced: 13 Apr 2026

https://github.com/armahdavi/data_analytics_statistics_plotting_pm_airborne_sampling

All code for the data pipeline processing, statistical modeling, descriptive statistics, and plot visualizations from the airborne phase of Mahdavi et al. (2021) (Environmental Pollution). Project milestone: 2018-2021.

data-science data-visualization machine-learning matplotlib-pyplot numpy pandas python scikit-learn scipy-stats statistics

Last synced: 13 Apr 2026

https://github.com/yanglr/mlnotes-python

Machine learning notes for Python 3 (NumPy, TensorFlow, etc.), with interactive Jupyter Notebook.

jupyter jupyter-notebook numpy numpy-notes

Last synced: 21 May 2026

https://github.com/hilarionengarejr/movie-recommender-app

Sentiment analysis on user reviews for movie recommendations using Content Based Filtering.

docker flask nltk numpy pandas python3 scikit-learn selenium

Last synced: 10 Apr 2026

https://github.com/sohan181204/airline-ticket-price-analysis

The analysis of airline ticket prices explores the dynamic and competitive nature of the aviation industry in India. With rapid economic growth and increasing air travel demand, understanding ticket price trends is crucial for airlines, passengers, and policymakers.

numpy pandas python selenium

Last synced: 11 Apr 2026

https://github.com/pratyush905/farecast-nyc-taxifare-predictor

Machine learning models to predict NYC taxi fares based on a given dataset.

jupiter-notebook kaggle machine-learning matplotlib numpy python regression-models scikit-learn

Last synced: 13 Apr 2026

https://github.com/kmkurn/text2array

Python library to convert text datasets into arrays.

deep-learning natural-language-processing neural-networks numpy

Last synced: 14 May 2026

https://github.com/sanjaiveeran44/shopping-intent-analasys

This project predicts whether a visitor to an e-commerce website will make a purchase based on their browsing behavior. It includes data analysis, preprocessing, and machine learning modeling to understand online shopping patterns.

matplotlib matplotlib-pyplot numpy pandas python3 sklearn tensorflow

Last synced: 13 Apr 2026

https://github.com/eesunmoon/aml_project

[Project] Analyzing Fitness Patterns & Building Predictive Models

matplotlib numpy pandas python3 recommender-system scikit-learn seaborn tensorflow

Last synced: 13 Apr 2026

https://github.com/aciluca/credit-approval-system

A machine learning application to predict credit card approval using Python, TensorFlow, and Docker.

data-science keras machine-learning numpy pandas python sklearn tensorflow

Last synced: 11 Apr 2026

https://github.com/antonio-f/naive_convnet

Naive Convolutional Neural Network with TensorFlow 2

101 convolutional-neural-networks deep-learning easy matplotlib numpy simple tensorflow2 tutorial

Last synced: 29 Apr 2026

https://github.com/ly16302/mask

mask generation library for wyvern

f2py fortran mpi4py numpy openmp testing

Last synced: 21 Mar 2025

https://github.com/abdulsamie10/pythonbasics

This repository contains a few tasks that I developed to get a strong grip on the Python programming language.

ai labtasks lambda numpy python pythonlab

Last synced: 20 Apr 2026

https://github.com/hamishivi/denserflow

A basic numpy neural net library. This achieved high marks in a uni assignment!

deep-learning neural-network numpy

Last synced: 18 May 2026

https://github.com/johanntf/direccion_maximo_crecimiento

Direction of maximum increase of a multivariable function, given a starting point.

customtkinter matplotlib numpy python

Last synced: 18 May 2026

https://github.com/sanjanahombal/study-on-sentiment-analysis

This project explores the optimal combination of Bag-of-Words and TF-IDF vectorization with Naive Bayes and SVM for sentiment analysis. It evaluates performance using accuracy, precision, recall, and F1-score, addressing ethical concerns like data privacy and bias to improve sentiment classification in real-world applications.

bag-of-words confusionmatrix googlecollab gridsearch-crossvalidation matplotlib-pyplot naive-bayes-classifier numpy pandas seaborn sklearn svm-classifier tfidf-vectorizer

Last synced: 07 Jan 2026