An open API service indexing awesome lists of open source software.

NumPy

NumPy is an open source library for the Python programming language, adding support for large, multidimensional arrays, and matrices, along with a large collection of high-level mathematical functions to operate on these arrays.

https://github.com/sadegh15khedry/creditcard-transaction-anomaly-detection-using-isolationforest

detecting fraudulent transactions in credit card data using anomaly detection techniques.

anomaly-detection matplotlib numpy pandas python seaborn sklearn

Last synced: 27 Feb 2025

https://github.com/marvin-vw/3dengine_python

3D engine written in Python using OpenCV and NumPy

3dengine numpy opencv python

Last synced: 04 Feb 2026

https://github.com/webintellectual/coloured-image-to-b-w-converter-web-app

A Streamlit web application to convert a coloured image to black and white image.

numpy opencv pil pillow python3 streamlit streamlit-webapp

Last synced: 12 Apr 2026

https://github.com/offthetab/matcha_code_generation

Fine-tunning MatCha model for better python matplot lib code generation by given chart.

fine-tuning huggingface matcha matplotlib numpy pandas pil sklearn torch

Last synced: 20 Jan 2026

https://github.com/gurpreet0022/unveiling-pcos

Data Driven approach to get insights about PCOS

analysis eda insights matplotlib numpy pandas python3 scipy-stats seaborn visualisation

Last synced: 12 Apr 2026

https://github.com/amruta33/customer-support-on-twitter

The Customer Support on Twitter dataset is a large, modern corpus of tweets and replies to aid innovation in natural language understanding and conversational models, and for study of modern customer support practices and impact.

nlp numpy pandas python regex sklearn

Last synced: 12 Apr 2026

https://github.com/id-andyyy/alfahack

πŸ“ˆπŸ’° Investment propensity prediction model

catboost hackathon-project jupyter lightgbm numpy optuna pandas python scikit-learn

Last synced: 12 Apr 2026

https://github.com/adarshpheonix2810/fake-job-post-detection

This project focuses on detecting fake job posts using machine learning. Fake job advertisements are often created to scam individuals by stealing personal information or money.

data-analysis deep-learning joblib machine-learning nlp-machine-learning numpy pandas python scikit-learn tkinter

Last synced: 12 Apr 2026

https://github.com/hairymax/python-for-science-lecture-notes

Tutorial material on the scientific Python ecosystem in Russian

matplotlib numpy python scipy sympy

Last synced: 12 Apr 2026

https://github.com/winterwind/housingpricesproject

A two-part project involving making predictions using various regressors and then implementing linear regression from scratch and predicting that way

csv csv-files data-science decision-tree gradient-descent jupyter jupyter-notebook knearest-neighbors knn linear-regression linear-regression-scratch machine-learning matplotlib matplotlib-pyplot numpy pandas python random-forest regression sklearn

Last synced: 12 Apr 2026

https://github.com/khushi130404/echobot

EchoBot is a chatbot built using TensorFlow to classify user inputs into predefined intents and respond accordingly. It uses a neural network trained with the data in intents.json.

keras nlp nltk numpy pickle tensorflow

Last synced: 12 Apr 2026

https://github.com/3rd-son/movie-streaming-service-analysis

Exploratory Data Analysis of the Streaming Services like Neflix, Hulu, Disney+ etc

data-analysis exploratory-data-analysis jupyter-notebook matplotlib numpy pandas plotly python seaborn

Last synced: 18 Apr 2026

https://github.com/nicolas-giacomelli/modelo-polinomial-api-streamlit

Codigo focado em treinamento de modelo de previsΓ£o de salario por tempo de empresa e nivel dentro da empresa disponibilizado por api e usando front-end streamlit para calcular basta inserir tempo de empresa e nivel e calcular o resultado de salario compativel

api fastapi joblib machine-learning matplotlib numpy pandas pingouin pydantic scikit-learn seaborn streamlit uvicorn

Last synced: 12 Apr 2026

https://github.com/nazwright/daria

Real-time fraud detection architecture powered by AWS Kinesis, KaggleHub, and SMOTE-augmented data β€” the foundation of DARIA, the Detection And Risk-Intelligence Agent.

aws evm fraud fraud-detection-using-machine-learning kaggle kinesis machine-learning math numpy pandas python random web3

Last synced: 12 Apr 2026

https://github.com/allenvox/statistics

Workspace for applied problems of probability theory & mathematical statistics & modelling class

acf autocorrelation distribution k-means k-means-clustering maths matplotlib numpy probability-theory python sklearn statistics

Last synced: 17 Apr 2026

https://github.com/nisch-mhrzn/book-recommender-system

A Flask-based application that suggests books tailored to user preferences, leveraging collaborative filtering and a curated dataset.

flask numpy pandas python

Last synced: 12 Apr 2026

https://github.com/fimbres/emotion-recognition

This is an AI project developed in Python to recognize emotions in people.

ai kaggle-dataset keras numpy opencv python tensorflow

Last synced: 12 Apr 2026

https://github.com/urvee1810/eda-time-series

A comprehensive time series analysis of French retail quarterly sales data from 2012 to 2017. The project focuses on analyzing sales patterns, seasonal decomposition, and trend analysis using various statistical techniques and visualizations.

arima-modeling data-visualization exploratory-data-analysis matplotlib numpy pandas pmdarima python scikit-learn seaborn statsmodels time-series-analysis trend-analysis

Last synced: 12 Apr 2026

https://github.com/manojkp08/student-performance-analysis

The Student Performance Analyzer is your go-to solution for understanding and improving student performance. By blending the power of machine learning with interactive visualizations, this tool provides educators and learners with personalized insights into learning styles, performance gaps, and actionable improvements.

machine-learning numpy pandas python requests scikit-learn streamlit

Last synced: 12 Apr 2026

https://github.com/jvictor011/analise_de_precos_de_materias-primas_agricolas

Este projeto concentra-se na anΓ‘lise dos dados de preΓ§os de vΓ‘rias matΓ©rias-primas agrΓ­colas ao longo do tempo. O conjunto de dados usado para esta anΓ‘lise Γ© chamado de agricultural_raw_material.csv.

matplotlib numpy pandas python seaborn

Last synced: 07 May 2025

https://github.com/parkjungyoon/til-data_study

✍ 데이터뢄석, λ°μ΄ν„°λ² μ΄μŠ€, ν΄λΌμš°λ“œ μ»΄ν“¨νŒ… 등을 κ³΅λΆ€ν•˜κ³  κΈ°λ‘ν•˜λŠ” μ €μž₯μ†Œ μž…λ‹ˆλ‹€.

crawling machine-learning monodb mysql numpy pandas python

Last synced: 12 Apr 2026

https://github.com/mzayles/iris-seaborn_visualizacao-dados

πŸ’»πŸ“Šβœ… Curso de ProgramaΓ§Γ£o em Python para Data Science | GrΓ‘ficos utilizando a base de vendas da Seaborn e a biblioteca Matplotlib.

matplotlib numpy pandas python seaborn

Last synced: 24 Mar 2025

https://github.com/mzayles/report_consolidado_analise-descritiva

πŸ’»πŸ“Šβœ… Curso de ProgramaΓ§Γ£o em Python para Data Science | Mensuramento de dados com Matplotlib.

matplotlib numpy pandas python

Last synced: 24 Mar 2025

https://github.com/mzayles/base_vendas_dataprep

πŸ’»πŸ“Šβœ… Curso de ProgramaΓ§Γ£o em Python para Data Science | Preparando e tratando dados com Pandas e NumPy.

numpy pandas python

Last synced: 24 Mar 2025

https://github.com/jvictor011/big-five-personality-test

UsandoΒ machine learning para fazer o agrupamento de um sistema de traΓ§o de personalidade

matplotlib numpy os pandas python seaborn

Last synced: 07 May 2025

https://github.com/vikpires/ds_tips-dataset

Projeto individual do bootcamp de ciΓͺncia de dados avanti 2024.2, com o objetivo de analisar e observar padrΓ΅es no conjunto de dados "Tips".

data-analysis data-science data-visualization exploratory-data-analysis jupyter-notebook matplotlib numpy pandas python seaborn tips

Last synced: 17 Sep 2025

https://github.com/sujalk777/signal_systems_lab

This repository contains the assignments for the Signal Systems Laboratory course offered at IIT Jammu Autumn 24

jupyter-notebook librosa linux matplotlib numpy python raspberry-pi

Last synced: 12 Apr 2026

https://github.com/ifrazaib/pythoncodes

In this repository I have Added my handwritten python codes to master the language and includes all the basic concepts of python.

classes inheritance lists numpy pandas scipy tupels

Last synced: 06 Jan 2026

https://github.com/shandilya1998/neurorobotics

Python tools for robotics, deep reinforcement learning and neuroscience research.

computational-neuroscience mujoco mujoco-py numpy openai-gym pytorch reinforcement-learning robotics

Last synced: 12 Apr 2026

https://github.com/raoulnormand/easygrader

A library to easily calculate course grades.

gradebook gradescope numpy pandas teaching-tool

Last synced: 10 Mar 2025

https://github.com/hansalemaos/cycolorcrop

Crops an image based on allowed colors

color crop cython image numpy opencv

Last synced: 01 May 2026

https://github.com/raduldev/ml-projects

End To End Machine Learning Project guided by Krish Naik from Ineuron.

catboost dill flask-application numpy pandas python scikit-learn xgboost

Last synced: 12 Apr 2026

https://github.com/RedInfinityPro/AI_projects

Rating (7/10) The code is a chatbot that interacts with users based on article content. However, issues include incorrect sorting, typos, incorrect indentation, and lack of explanations for each part, potentially affecting the bot's robustness and understanding of context.

newspaper nltk numpy python random sklearn string warnings

Last synced: 30 Sep 2025

https://github.com/kunalshelke90/students_marks_prediction_in_exams

Students Performance Evaluation using Feature Engineering, Feature Extraction, Manipulation of Data, Data Analysis, Data Visualization .Later on deployed on AWS with help of docker

dokcerfile flask githubactions machine-learning matplotlib mlops numpy pandas python seaborn

Last synced: 12 Apr 2026

https://github.com/azaz9026/car_price_prediction_model

This repository contains a machine learning model designed to predict car prices based on various features. Using historical data on car attributes such as make, model, year, mileage, and other relevant factors, the model aims to provide accurate and reliable price estimates for used cars.

data-analysis data-engineering liner-regestion machine-learning modeling numpy pandas python3 rendering

Last synced: 09 Apr 2026

https://github.com/alphan26/humidityregressor

This project involves developing a model to predict relative humidity using PyTorch. The model was trained on a dataset containing monthly data, with the goal of forecasting the relative humidity for the subsequent month.

matplotlib numpy pandas python pytorch

Last synced: 11 Apr 2026

https://github.com/jeslipriya/emotion-detection

This project is an enhanced real-time facial emotion detection system that uses deep learning (OpenCV + mini-XCEPTION) to accurately recognize human emotions through a webcam or image input. It features advanced preprocessing, temporal smoothing, and a clean visual interface for smooth and stable emotion analysis.

7-emotions deep-learning emotion-detection fer2013 numpy opencv opencv-python python tensorflow

Last synced: 03 Jul 2025

https://github.com/c-mohammed-zaid/python

A diverse collection of Python projects showcasing skills in game development, data manipulation, and utility tools, including classic games, encryption, and data analysis.

jupyter-notebook matplotlib numpy pandas python scipy turtle

Last synced: 06 Jan 2026

https://github.com/chmohit91/Introduction-to-Python

A comprehensive collection of Python programming resources, from basics to advanced concepts. Built for learners, by learners

matplotlib numpy pandas plotly python python-library python3 seaborn

Last synced: 21 Jul 2025

https://github.com/kareem74x/data-mining-uni-project

A project focused on applying data mining techniques to extract insights from datasets

data-mining matplotlib numpy pandas python

Last synced: 25 Feb 2026

https://github.com/akashash01/navie_bayes

It's an simple probabilistic classifiers based on applying Bayes theorem with strong independence assumptions between the features. Highly effective for text sequence oriented data(independent variables).

data-science machine-learning numpy pandas python sklearn supervised-learning

Last synced: 09 Apr 2026

https://github.com/akashash01/random_forest

It enable learning method for classification, regression and other task. Operates by building multiple decision tree ay training time. It is much more robust than Decision Tree algorithm.

data-science machine-learning numpy pandas python random-forest-classifier sklearn supervised-learning

Last synced: 01 Apr 2025

https://github.com/urvee1810/market_basket_analysis

A data mining project analyzing Instacart's 3 million grocery orders to uncover customer shopping patterns and product associations. Using market basket analysis and the Apriori algorithm, the project reveals key insights about shopping behavior, product combinations, and temporal patterns, providing valuable recommendations for retail strategy

apriori-algorithm data-mining data-visualization machine-learning market-basket-analysis matplotlib mlxtend numpy pandas python seaborn

Last synced: 12 Apr 2026

https://github.com/rohitinu6/image_cartoonify

To Cartoonify an Image using opencv-python

image-processing matplotlib numpy opencv opencv-python python

Last synced: 12 Apr 2026

https://github.com/prashhhant213/customer-behavior-analysis-for-walmart-black-friday-sales

This project is a Walmart case study analyzing customer purchase behavior by gender and demographics to inform business decisions, especially around spending habits during events like Black Friday.

matplotlib numpy pandas python scipy seaborn

Last synced: 06 Jan 2026

https://github.com/arrmansa/leetcode-nice-solutions

Some solutions for leetcode problems that I found fun - either the algorithm or using numpy bruteforce where it shouldn't be

leetcode leetcode-python leetcode-python3 leetcode-solutions numpy python python3

Last synced: 16 Mar 2025

https://github.com/rahulsm20/insurance-data

A data analytics project dealing with risk assessment and it's effects in health insurance.

data-analysis data-analytics machine-learning matplotlib numpy pandas python scikit-learn

Last synced: 12 Apr 2026

https://github.com/hrolive/fundamentals-of-accelerated-data-science

How to use GPU-accelerated tools to conduct data science faster, leading to more scalable, reliable, and cost-effective results.

accelerated-computing bokeh cudf cugraph cuml cupy dask data-science jupyter numpy pandas python rapids xgboost

Last synced: 12 Apr 2026

https://github.com/naveen88112/healthcare

HealthCare Data Analysis and Forecasting This project examines healthcare data by processing missing values with KNN imputation, preprocessing features, and training classification models (Logistic Regression and Random Forest). The output includes performance metrics such as accuracy, confusion matrix, precision, recall, and ROC analysis.

data-visualization feature-engineering machine-learning model-evaluation numpy pandas python scikitlearn-machine-learning

Last synced: 12 Apr 2026

https://github.com/apal21/tensorflow-linear-regression-fit-data

Fit a line passing through data with some random noise using Tensorflow linear regression with Gradient Descent Optimizer.

gradient-descent linear-regression numpy python3 tensorflow

Last synced: 12 Apr 2026

https://github.com/mahmoudnamnam/superstore-analysis

This project explores the SuperStore dataset to uncover insights into sales, profit, and customer behavior. It identifies key trends, regional variations, and product performance, using data analysis and machine learning techniques to guide business strategy and optimize performance.

clustering data-analysis data-science data-visualization geopandas jupyter-notebook machine-learning numpy pandas plotly regression seaborn sklearn

Last synced: 12 Apr 2026

https://github.com/mostafa-ghorab/global-happiness-analysis

An analysis of global happiness rankings based on various factors like GDP, family support, health, and freedom from the World Happiness Report (2015-2017). This project provides data visualizations and statistical insights into how these factors influence happiness scores in different regions.

business-analysis data-analysis data-visualization matplotlib numpy pandas python seaborn

Last synced: 12 Apr 2026

https://github.com/thariniselvakumar/kidney-disease-prediction

This project is about the kidney disease prediction using machine learning algorithms

machine-learning matplotlib numpy pandas scikit-learn seaborn

Last synced: 12 Apr 2026

https://github.com/ledsouza/python-powerbi

Desenvolver uma dashboard para anΓ‘lise do mercado financeiro utilizando Python e Power BI

etl matplotlib numpy pandas powerbi python seaborn

Last synced: 12 Apr 2026

https://github.com/gregoritsch3/project_radioactive_decay

An analytic ( system of coupled ODEs), numeric (Runge-Kutta 4, Newton-Raphson) and stochastic (Monte Carlo) treatment of the chained radioactive decay of Bi[210, 83] into Po[210,84], as well as the emission of alpha-particles. Associated comments and paper are in Croatian.

cplusplus matplotlib montecarlo-simulation numerical-methods numpy python radioactive-decay scipy

Last synced: 12 Apr 2026

https://github.com/themihirmathur/soiligator

Soiligator is an advanced machine learning project designed to optimize irrigation management by predicting whether irrigation is necessary based on environmental and soil-related data.

auc-score logistic-regression machine-learning matplotlib numpy pandas python random-forest-classifier roc-curve scikit-learn seaborn standardscaler support-vector-machine

Last synced: 12 Apr 2026

https://github.com/emhash/machine-learning-and-ai

Personal tasks or codes of Machine Learning and Artificial Intelligence. Practice codes and projects of Ml/AI will be share here.

ai datasets ipynb jupyter-notebook keras linear-regression logistic-regression machine-learning machine-learning-algorithms numpy opencv pandas python seaborn sklearn tensorflow

Last synced: 05 Jan 2026

https://github.com/nadahamdy217/movies-data-etl-using-python-gcp

Developed a comprehensive ETL pipeline for movie data using Python, Docker, and a GCP Pub/Sub emulator. Successfully processed and published the data in a local Docker environment, showcasing advanced data engineering skills.

analytics data data-engineering data-ingestion data-preparation data-preprocessing data-processing data-project docker etl etl-pipeline gcp matplotlib matplotlib-pyplot numpy pandas pubsub python scipy seaborn

Last synced: 06 Jan 2026

https://github.com/lijesh010/employeeanalysis-

This is a simple data analysis project developed as part of my Entri Elevate Data Science and Machine Learning course. It serves as a practical demonstration of my skills in data analysis.

jupyter-notebook matplotlib numpy pandas python seaborn

Last synced: 12 Apr 2026

https://github.com/lorenzorottigni/ml-universities

Machine Learning python bootcamp: K mean clustering with public/private universities dataset

k-mean-clustering machine-learning numpy pandas python scikit-learn seaborn

Last synced: 05 Apr 2026

https://github.com/lorenzorottigni/ml-random

Machine Learning course of Piero Savastano 7: matplotlib, accuracy_score, precision_score, recall_score, f1_score, classification_report

machine-learning matplotlib metrics numpy

Last synced: 04 May 2026

https://github.com/mpoojithavigneswari/bangalore-house-price-prediction

This project involves creating a website that predicts Bangalore house prices with 94.65% accuracy using a machine learning algorithm.

data-analysis data-science flask-server machine-learning matplotlib numpy pandas python scikit-learn seaborn

Last synced: 12 Apr 2026

https://github.com/alisonmitchell/countries-of-the-world

Exploratory Data Analysis of US Government data from the World Factbook to investigate GDP ($ per capita), Literacy (%) and Phones (per 1000).

data-science machine-learning matplotlib numpy pandas python seaborn

Last synced: 12 Apr 2026

https://github.com/constraintautomaton/client_pgu1

The client part of the PgU1 project it. It command the robot and do the video processing

numpy opencv paramiko pyqt5 python3 raspberry-pi-3

Last synced: 12 Apr 2026

https://github.com/22p31a0512/sentimental-analysis

Build a model to classify text as positive, negative, or neutral. Apply NLP techniques for preprocessing and machine learning for classification. Aim for accurate sentiment prediction on various text formats.

feature-engineering machine-learning natural-language-processing numpy pandas python supervised-learning text-classification text-processing tokenization

Last synced: 12 Apr 2026

https://github.com/akashkg03/facial-expression-image-classification

This notebook involves to build a facial expression image classifier which categorizes facial expressions into one of seven emotions: anger, disgust, fear, happiness, sadness, surprise, and neutral.

jupiter-notebook numpy pandas python

Last synced: 28 Apr 2026

https://github.com/ewakamoto/03.-python-projects

This is my collection of Python Projects

jupyter jupyter-notebook matplotlib numpy pandas python seaborn

Last synced: 12 Apr 2026

https://github.com/hariprasath-v/av-job-a-thon-november-2022

Build a machine learning/deep learning approach to forecast the total energy demand on an hourly basis for the next 3 years based on past trends.

exploratory-data-analysis kaggle lightgbm-regressor matplotlib numpy pandas python rmse-score seaborn sklearn statsmodels timeseries-forecasting

Last synced: 12 Apr 2026

https://github.com/prarthana-singh/potato-disease-classification-using-deep-learning

🌱 Potato Disease Classification using Deep Learning – A deep learning-based model to detect and classify potato plant diseases using image processing and CNNs. Built with TensorFlow/Keras & Python.

agriculture cnn deep-learning image-classification keras machine-learning neural-network numpy pandas plant-disease-detection python3 streaming tensorflow

Last synced: 12 Apr 2026

https://github.com/Sadegh-Khedry/Pneumonia-Detection-Using-CNN

Classifying chest X-ray images into two categories: Pneumonia or Normal using a Convolutional Neural Network (CNN).

cnn convolutional-neural-networks cv2 deep-learning harris-corners joblib jupyter-notebook keras kmeans-clustering matplotlib numpy orb pandas python seaborn shutil sifit tensorflow

Last synced: 12 Apr 2026

https://github.com/yahiazakaria445/image-classification-using-vgg-16

computer vision model for 10 classes classification

matplotlib numpy opencv pandas sklearn tensorflow

Last synced: 12 Apr 2026

https://github.com/alejoduarte23/bs6472

Serviceability assessment as per BS 6472 for human exposure to vibration in buildings Vibration sources other than blasting

matplotlib numpy pydantic scipy

Last synced: 07 Jan 2026

https://github.com/otuemre/housepricingml

A machine learning project predicting house prices using regression models. Covers data preprocessing, feature engineering, and model comparison to achieve accurate results. Developed for a Kaggle competition, focusing on effective ML workflows and model interpretability.

eda encoding evaluation-metrics kaggle-competition lightgbm-regressor machine-learning matplotlib-pyplot neural-networks numpy pandas preprocessing python ridge-regression scikit-learn seaborn tensorflow xgboost-regression

Last synced: 13 Apr 2026

https://github.com/reckonsys/datascience-docker

A docker image with numpy, scipy, nltk and pandas pre-installed with pip

docker docker-image dockerfile nltk numpy pandas scipy

Last synced: 10 Mar 2025