An open API service indexing awesome lists of open source software.

NumPy

NumPy is an open source library for the Python programming language, adding support for large, multidimensional arrays, and matrices, along with a large collection of high-level mathematical functions to operate on these arrays.

https://github.com/dipeshgoyal013/salary-data-analysis

Salary Analysis according department and agency.

analysis matplotlib numpy pandas salary sklearn-library

Last synced: 15 Apr 2026

https://github.com/nmerka/practicum_rep

Проекты курса Аналитик данных (Яндекс.Практикум)

jupyter-notebook matplotlib numpy pandas plotly python scipy seaborn sql tableau

Last synced: 31 Jan 2026

https://github.com/mattia-hulathduwage/wine-quality-analyzer

A machine learning project that analyzes wine quality using clustering, regression, and classification techniques. The model predicts wine quality scores based on chemical properties and determines the most influential features affecting quality.

machine-learning matplotlib numpy pandas python scikit-learn seaborn

Last synced: 16 Apr 2026

https://github.com/suhas-005/jovian-data-analysis-course-assignment

These are my assignments for Data Analysis : Zero to Pandas course by Jovian.ai

data-analysis data-analytics numpy pandas python

Last synced: 07 May 2026

https://github.com/amruta33/credit_card_analysis

The loan providing companies find it hard to give loans to the people due to their insufficient or non-existent credit history. Because of that, some consumers use it as their advantage by becoming a defaulter.

numpy pandas python3

Last synced: 15 Apr 2026

https://github.com/awaisfraaz/python-libraries

Here’s a repository description for a project where youA repository dedicated to exploring and experimenting with Python’s main libraries, featuring sample code, documentation, and practical examples for learning and reference.

matplotlib numpy pandas python seaborn

Last synced: 15 Feb 2026

https://github.com/karishmagupta05/airbnb-data-scraping-and-analysis

This project involves web scraping Airbnb listings using Python and BeautifulSoup, followed by data analysis with Pandas. The goal is to extract meaningful insights from the data, clean and preprocess it, and visualize the results using libraries like Matplotlib and Seaborn.

beautifulsoup4 matplotlib numpy pandas python scraping seaborn visualization

Last synced: 15 Apr 2026

https://github.com/l-gre/data_analytics_for_finance

Comprehensive course materials for the Data Analytics for Finance - Master Programme, covering data manipulation, statistical analysis, visualisation, automation, and real-world case studies using industry-standard tools.

automation data-cleaning data-manipulation data-visualization excel hypothesis-testing industry-applications matplotlib numpy pandas python real-world-case-studies regression-analysis seaborn sql statistical-analysis tableau workflow-automation

Last synced: 15 Apr 2026

https://github.com/sarmad426/ai

AI basic to advanced featuring Machine Learning, Deep Learning and Data Science.

ai data-science deep-learning hugging-face machine-learning numpy pandas python scikit-learn

Last synced: 15 Apr 2026

https://github.com/type0-1/salary-truth-predictor

A supervised machine learning regression model. Includes problem statement, approach to solution, code, images, dataset, and Jupyter Notebook for interactive analysis.

linear-regression machine-learning matplotlib-pyplot ml numpy pandas polynomial-regression projects scikitlearn-machine-learning support-vector-regression

Last synced: 16 Apr 2026

https://github.com/theamnabb/my-ai-brain

A complete, structured AI learning journey from scratch to advanced. Includes daily logs, mini projects, roadmaps, and useful resources for Artificial Intelligence, Machine Learning, Deep Learning, and Generative AI.

matplotlib numpy pandas pyhton3

Last synced: 07 May 2026

https://github.com/lorenzorottigni/ml-advertising

Machine Learning python bootcamp: logistic regression on advertising dataset

ipynb logistic-regression machine-learning numpy pandas python scikit-learn seaborn

Last synced: 16 Apr 2026

https://github.com/pramodyasahan/health-insurance-cost-prediction

This project focuses on predicting health insurance costs using a polynomial regression model. By employing machine learning techniques in Python, the project aims to accurately estimate insurance costs based on various personal attributes. The model takes into account several features including age, sex, BMI, number of children, smoking status etc

machine-learning matplotlib numpy pandas python3 scikit-learn

Last synced: 16 Apr 2026

https://github.com/baschin1103/machine-learning-linear-regression-tsi

The model is the multiple linear regression loaded from scikit-learn. The aim is to create a model to forecast the so called Total-Skill-Index (TSI) of a player from hattrick.org. More information on: https://wiki.hattrick.org/wiki/Total_Skill_Index. In this repository is the programm and necessary data.

csv linear-regression machine-learning numpy pandas python statistics

Last synced: 11 May 2026

https://github.com/urvee1810/air-quality-prediction-using-arima

Analyzed Beijing's air quality data using time series analysis and ARIMA modeling to forecast PM2.5 pollution levels. Identified seasonal patterns and correlations between pollutants, revealing insights into urban air quality trends and potential contributing factors.

arima-model dickey-fuller-test matplotlib numpy pandas python seaborn statistical-analysis time-series-analysis

Last synced: 16 Apr 2026

https://github.com/aadityasikder/neural-network-from-scratch-

This notebook demonstrates a neural network implementation using NumPy, without TensorFlow or PyTorch. Trained on the MNIST dataset, it features an architecture with input layer (784 neurons), two hidden layers (132 and 40 neurons), and an output layer (10 neurons) with sigmoid activation.

backward-propagation evaluation forward-propagation mnist-dataset neural-network neural-networks-from-scratch numpy training

Last synced: 16 Apr 2026

https://github.com/shwetapardhi/assignment-03-q4-hypothesis-testing

Chi2 contengency independence test Q4. TeleCall uses 4 centers around the globe to process customer order forms. They audit a certain % of the customer order forms. Any error in order form renders it defective and has to be reworked before processing. The manager wants to check whether the defective % varies by centre. Please analyze the data at 5

alternate-hypothesis chi2-contingency contingency-analysis hypothesis-testing norm null-hypothesis numpy numpy-arrays pandas python scipy significance-testing stats

Last synced: 31 Jan 2026

https://github.com/ashwin331133/liver_disease_detection

This dataset consists of 416 liver patient records and 167 non-liver patient records collected from North East of Andhra Pradesh, India. And The main objective of this project is to use classification algorithms to detect liver patients from healthy individuals.

data-visualization machine-learning numpy pandas python

Last synced: 16 Apr 2026

https://github.com/ishinzoo/songrecommendation

This project is a machine learning-based system that recommends songs based on the user's detected emotions. The application uses facial expression recognition to determine the user's current emotional state and suggests songs that align with that emotion. This system can be particularly useful for personalized music streaming services, helping use

machine-learning mediapipe numpy opencv os python tenserflow

Last synced: 25 Feb 2026

https://github.com/hossein-cod/practicing-deep-learning

with keras data set we load a data base and train our machine for test

deep deep-learning jupyter-notebook keras machine-learning matplotlib numpy py python sklearn tensorflow

Last synced: 25 Feb 2026

https://github.com/ljadhav25/decision-tree-random-forest-algorithm-data-science-

This repository contains an implementation of decision tree and random forest algorithms from scratch in Python. Decision trees and random forests are popular machine learning algorithms used for classification and regression tasks. The goal of this project is to provide a clear and understandable implementation of these algorithms

data-analysis data-science decision-trees machine-learning-algorithms matplotlib numpy pandas python random-forest-classifier

Last synced: 15 Apr 2026

https://github.com/matteopolak/chatterbox

RNN-based chat message generator.

numpy pandas python rnn tensorflow text-generation

Last synced: 07 May 2026

https://github.com/martincastroalvarez/search-keras-gensim-elasticsearch

Search Engine using Word Embeddings, GloVe, Neural Networks, BART and Elasticsearch

elasticsearch gensim gensim-word2vec keras nlp numpy python scipy spacy word2vec

Last synced: 15 Apr 2026

https://github.com/hercules-peters/learn-ml-with-me

Welcome to **ML-Journey**, my personal progress tracker for mastering machine learning! This repository showcases my projects, notebooks, and explorations as I advance my skills, starting with hands-on projects.

machine-learning numpy pandas python sklearn

Last synced: 15 Apr 2026

https://github.com/abhishekpsonawane07/loan-repayment-prediction-ml

Loan Approval Prediction uses machine learning to analyze factors like income and credit history, automating and improving the loan approval process. The project ensures accurate, efficient, and user-friendly predictions with a Gradio interface.

artificial-neural-networks confusion-matrix decision-trees gradio-interface k-nearest-neighbours loan-dataset loan-prediction-analysis loan-repayment logistic-regression numpy pandas python random-forest sckiit-learn seaborn support-vector-machines

Last synced: 16 Apr 2026

https://github.com/iankitnegi/python_projects

Data Analyst Toolkit: A comprehensive collection of Python scripts and notebooks designed for data analysis tasks. Features data cleaning, visualization, statistical analysis, and machine learning models. Ideal for analysts seeking efficient, reproducible workflows.

matplotlib numpy pandas python seaborn

Last synced: 20 Apr 2026

https://github.com/mugambi645/exploring-ebay-car-sales-data

Exploring ebay car sales dataset

car-sales data-analysis numpy pandas

Last synced: 16 Apr 2026

https://github.com/s-hirata0831/graphicsprocessing

Pythonによる画像処理基礎実験(with OpenCV)

numpy opencv python

Last synced: 07 May 2026

https://github.com/sahiltiwariiii/dssp

Predicting student math scores ! This project utilizes advanced machine learning techniques and MLOps tools like DVC and MLflow to predict a student's math score based on various factors such as gender, race/ethnicity, parental level of education, lunch type, test preparation course, writing etc

docker dotenv dvc flask machine-learning mlflow mlops mysql mysql-connector-python numpy pandas pymysql python python-dotenv scikit-learn seaborn sklearn-library statistics streamlit

Last synced: 27 Mar 2026

https://github.com/asherk7/house-price-prediction

House Prices - Advanced Regression Techniques - Predict sales prices and practice feature engineering, RFs, and gradient boosting

data-science numpy pandas regression scikit-learn

Last synced: 15 Apr 2026

https://github.com/gopikrsmscs/mycinepick-recommendation-system

MyCinePick is a user-specific movie recommendation system built using collaborative filtering techniques. This project aims to provide personalized movie suggestions tailored to each user's unique viewing preferences and habits.

collaborative-filtering movie-recommendation-app numpy python3 recommendation-system sklearn streamlit

Last synced: 14 Apr 2026

https://github.com/michaelgermini/python-3d-programming-book

Complete Python & 3D Programming Book: 30 chapters, 120+ examples, real-time ray tracing, ML graphics, OpenGL, NumPy

3d-graphics computer-graphics educational game-development graphics-programming machine-learning numpy opengl python ray-tracing tutorial

Last synced: 16 Apr 2026

https://github.com/subh888999/car-prices--analysis-projects

This repository houses projects focused on data collection, assessment, cleaning, visualization, and analysis. It includes workflows and methodologies for handling data, from initial gathering and evaluation to processing, visualizing insights, and performing in-depth analysis

jupyter-notebook matplotlib numpy panda seaborn statistics

Last synced: 03 May 2026

https://github.com/laurencewarne/eratosthenes

Random implementations of the sieve of Eratosthenes

algorithm eratosthenes numpy python3

Last synced: 16 Apr 2026

https://github.com/shwetapardhi/assignment-03-q5--hypothesis-testing

Chi2 contengency independence test Q5. Fantaloons Sales managers commented that % of males versus females walking in to the store differ based on day of the week. Analyze the data and determine whether there is evidence at 5 % significance level to support this hypothesis. Assume Null Hypothesis as Ho: Independence of categorical variables (% of

alternate-hypothesis chi2-contingency contingency-analysis hypothesis-testing null-hypothesis numpy pandas python scipy significance-testing stats

Last synced: 28 Jan 2026

https://github.com/kganitis/bam-engine

High-performance Python implementation of the macroeconomic agent-based BAM model using ECS architecture

agent-based-modeling computational-economics macroeconomics numpy python

Last synced: 04 Mar 2026

https://github.com/farhannirzhor/python_eda_project_on_diwali_sales_analysis

In this project, I was given a sales data. I analyzed their sales data and shared a summary

data-cleaning data-manipulation exploratory-data-analysis matplotlib numpy pandas python seaborn

Last synced: 04 May 2026

https://github.com/bishopce16/school_district_analysis

The school board requested an analysis on the various performance metrics for the school district.

data-analysis jupyter-notebook numpy pandas python visual-studio-code

Last synced: 16 Apr 2026

https://github.com/jay6430/python-for-data-science

This Repository includes all the Python concepts that are necessary for Data science.

csv data-visualization datascience-machinelearning html-css ipynb-jupyter-notebook matplotlib numpy pandas python3 xml

Last synced: 04 Mar 2026

https://github.com/krishbharadwaj5678/gestureninja

Play Fruit Ninja using just your index finger with real time gesture tracking. No mouse, no keyboard — just your webcam and smooth slicing.

automation computer-vision cvzone finger-tracking fruit-ninja game-development gamedev gesture-recognition hand-tracking mediapipe mouse mouse-control numpy opencv pyautogui python realtime virtualmouse webcam

Last synced: 07 May 2026

https://github.com/eckelsjd/copier-numpy

Template for a numpy-based scientific computing Python package.

copier-template numpy pdm python

Last synced: 05 Mar 2026

https://github.com/alekiie/streamlit-dashboard

A dashboard that utilizes the power of streamlit charts to create intuitive and easy to understand charts for data visualization.

data-visualization matplotlib numpy pandas python3 streamlit

Last synced: 07 May 2026

https://github.com/hansalemaos/cv2_stack_images

Stacks 2 images (horizontal/vertical)

concat cv2 images numpy python stack

Last synced: 28 Jan 2026

https://github.com/supershivam5/python_projects

💻 Python programming with Numpy, Pandas, Matplotlib.🌟 Love exploring new technologies. Check out my projects!

matplotlib-pyplot numpy pandas scikit-learn seaborn

Last synced: 17 Apr 2026

https://github.com/priyanshul28/ml_eda_regression_energyconsumptionforecasting

An EDA and Machine Learning Time-Series Regression Forecasting exercise on the PMJE Energy Consumption dataset demonstrating time-series analysis and the use of Time-Series Split, XGBoost, etc. The model is optimized using hyperparameter tuning through GridSearchCV. A Rob Mulla guided exercise.

forcasting machine-learning numpy pandas scikit-learn time-series-analysis

Last synced: 17 Apr 2026

https://github.com/akashash01/decision_tree

A very specific type of probability tree that enables you to make a decision on certain process(chance event outcomes, resources cost, utility).

data-science decision-tree-classifier machine-learning numpy pandas-dataframe python sklearn

Last synced: 11 May 2026

https://github.com/mnitin-reddy/coffee-shop-sales-analytics

This project analyzes coffee shop sales data to uncover key insights, focusing on customer behavior, product performance, and sales trends. Using Pandas, Matplotlib, and Seaborn, the analysis identifies peak sales hours, popular product categories, and opportunities for upselling or bundling.

datasceince datavisualization exploratory-data-analysis matplotlib numpy pandas python seaborn

Last synced: 07 May 2026

https://github.com/sa001gar/password-manager-on-the-top-of-streamlit

Secure Password Manager is a Python-based application developed by Sagar Kundu, designed to provide users with a secure and convenient way to manage their passwords. With features like user authentication, password generation, and secure storage using bcrypt and MongoDB, it ensures the safety of sensitive information while offering an intuitive use

bcrypt gui-application mongodb numpy numpy-library pandas-dataframe pandas-library pandas-python password-generator password-manager pymongo python python3 streamlit-application streamlit-webapp urllib

Last synced: 15 Apr 2026

https://github.com/hansalemaos/npchessimage

Creates a chessboard pattern as an image.

chessboard cv2 image numpy

Last synced: 05 Mar 2026

https://github.com/jaybfn/single-molecule-electrophysiology-data-analysis

Pynanopore serves as a sophisticated toolkit for analyzing single molecules, designed to interpret complex states within nanopore data automatically.

docker numpy pandas plotly pytest python3 streamlit streamlit-webapp

Last synced: 28 Jan 2026

https://github.com/gautam-cs/data_science

In this project, I was trying to get most of the relevent information of the data using library like numpy, pandas. I have worked on data like aadhar data, Olympics data

aadhar-data csv data-science json numpy olympics-data pandas python35

Last synced: 17 Apr 2026

https://github.com/syedzaheerabbas/risk-analytics-with-python

This project focuses on developing a basic understanding of risk analytics in banking and financial services and understand how data is used to minimize the risk of losing money while lending to customers.

eda hypothesis-testing numpy pandas python risk-analysis seaborn

Last synced: 17 Apr 2026

https://github.com/yanxue06/housing-price-predictor

Python-based California housing price predictor

jupyter numpy pandas python seaborn

Last synced: 06 Mar 2026

https://github.com/vaishnavis03/finlatics_ml_program

This repository contains the .ipynb files for 3 datasets, along with a PPT for each. The datasets included are Facebook Marketplace Data, Sales Prediction Data, and Wine Quality data.

correlation data-analysis data-science data-visualization knn linear-regression machine-learning matplotlib numpy pandas random-forest-classifier scikit-learn

Last synced: 17 Apr 2026

https://github.com/slovnicki/matematicki-softver

homeworks and projects for Mathematical Software course

ipython jupyter latex markdown mathplotlib numpy pandas scikit sympy

Last synced: 15 Apr 2026

https://github.com/radinshahdaei/ce40215-nc

Theoretical and practical assignments for "Numerical Computation".

jupyter-notebook numpy python sympy

Last synced: 17 Apr 2026

https://github.com/lavkalsi/creditcardfrauddetector

Credit Card Fraud Detector is a React web app that predicts if a credit card transaction is fraudulent using a Python machine learning model. Users can input transaction data, and Flask facilitates communication between the backend and frontend. Backend files are located in the res folder. This app provides simple UI for user interaction.

fraud-detection machine-learning nodejs numpy pandas python reactjs sklearn-library

Last synced: 15 Apr 2026

https://github.com/esraamhmd/intelligent-skin-cancer-image-based-detection-models

This project combines traditional machine learning approaches with advanced deep learning techniques to assist healthcare professionals in early diagnosis and improve patient outcomes.

ai deep-learning ham10000 isic kaggle machine-learning matplotlib numpy pandas python3 pytorch seaborn skin-cancer skin-detection sklearn time torchvision tqdm

Last synced: 02 Apr 2026

https://github.com/nathadriele/ifood-data-governance-pipeline

Este projeto demonstra uma solução completa de Data Governance com foco em qualidade, rastreabilidade, segurança e conformidade com LGPD. Utiliza tecnologias modernas como Streamlit, Airflow, dbt e Pydantic para implementar um ecossistema funcional e interativo com dashboard de governança de dados.

airflow dashboard data-analysis data-catalog data-engineering data-governance data-quality data-visualization dbt ifood lgpd matplotlib numpy observability-data pandas pipeline pyspark redis seaborn streamlit

Last synced: 02 Apr 2026

https://github.com/akshitvats026/heart_disease_prediction

An ML-based Heart Disease Prediction System that predicts the likelihood of heart disease based on user health parameters. Built using Python, Pandas, and Scikit-learn, the system performs data preprocessing, trains a predictive model, and provides real-time predictions with high accuracy.

accuracy-score logistic-regression machine-learning matplotlib-pyplot numpy pandas python scikit-learn

Last synced: 02 Apr 2026

https://github.com/satyas567/weatherdataanalysis

Comprehensive Weather Data Analysis with Python: Explore trends, visualize patterns, detect outliers, and predict temperature using humidity and wind speed

jupyter-notebook linear-regression matplotlib numpy pandas python scikit-learn seaborn

Last synced: 02 Apr 2026

https://github.com/amanimagdi/data-flux-fullstack

This repository is for a full stack application provides in-depth analysis and manipulation of various data types (tabular data, RGB images and textual data), Flask on BE and ReactJs on FE.

flask javascript numpy pandas python reactjs tailwindcss

Last synced: 02 Apr 2026

https://github.com/pranavv34/customer-segmentation

Predictive modeling and customer segmentation project using neural networks to forecast sales and categorize customers for targeted marketing in the online retail sector.

keras matplotlib numpy pandas python seaborn sklearn tensorflow

Last synced: 02 Apr 2026

https://github.com/lintangwisesa/python_pillow_image_array

Convert 🖼 image to 📜 Numpy array using Pillow [Python]

image numpy pillow python

Last synced: 07 May 2026

https://github.com/loong64/pytorch

Tensors and Dynamic neural networks in Python with strong GPU acceleration

autograd deep-learning gpu loong64 loongarch64 machine-learning neural-network numpy python tensor

Last synced: 02 Apr 2026

https://github.com/mac2bua/autoalgo

Autonomous algorithm optimization research - 19x speedup achieved through self-driven iterations

algorithm-optimization autoresearch numpy performance python

Last synced: 02 Apr 2026

https://github.com/mnagaharshithrao/diamondpriceprediction

Diamond Price Prediction is a machine learning project that predicts the price of diamonds based on features like carat, cut, color, clarity, and more. Using models such as XGBoost, Random Forest, and Gradient Boosting, the project aims to accurately forecast diamond prices, helping buyers and sellers in the diamond industry make informed decisions

machine-learning numpy pandas sckiit-learn xgboost

Last synced: 17 Apr 2026