An open API service indexing awesome lists of open source software.

NumPy

NumPy is an open source library for the Python programming language, adding support for large, multidimensional arrays, and matrices, along with a large collection of high-level mathematical functions to operate on these arrays.

https://github.com/chawthinn/car-price-prediction-regression-ml

This project predicts used car prices using regression models including Linear, Ridge, Random Forest, XGBoost, and LightGBM. It covers preprocessing, EDA, model evaluation, hyperparameter tuning, and model persistence using Scikit-learn and related libraries.

car-price-prediction lgbmregressor linearregression numpy pandas python scikit-learn xgbregressor

Last synced: 08 Apr 2026

https://github.com/poziloi/-image_processing-

Методы и алгоритмы цифровой обработки изображений, задания

cv2 matplotlib numpy python

Last synced: 20 Jan 2026

https://github.com/asut00/python-piscine_42ai

Python Bootcamp: A one-week intensive course with 42AI at École 42, covering Python fundamentals, data manipulation, and introductory AI concepts.

matplotlib numpy pandas python

Last synced: 07 May 2026

https://github.com/chandkund/image-classification-using-the-mnist-dataset

Image Classification using the MNIST dataset. This project leverages a Convolutional Neural Network (CNN) to recognize and classify handwritten digits with high accuracy. Includes data preprocessing, model architecture, and evaluation. Explore the code and results here!

computer-vision data-science machine-learning matplotlib numpy pandas python

Last synced: 08 Apr 2026

https://github.com/raghavendranhp/airbnb-data-analysis

The Airbnb Data Analysis project focuses on analyzing Airbnb data using MongoDB Atlas, Python scripting, data preprocessing, visualization, and interactive geospatial insights. We delve into the world of property management and tourism to uncover trends, pricing variations, and location-based analysis.

eda jupyter-notebook mongodb numpy pandas powerbi preprocessing

Last synced: 08 Apr 2026

https://github.com/mikma03/simulation_modeling

Simulation models using Python. Practical use of Python in real-world examples and additional resourses.

matplotlib numpy pandas portfolio python simulation stocks

Last synced: 08 May 2026

https://github.com/pabs-code/face-detection-using-haar-cascade-classifier

This is a Streamlit-based face detection application that uses the Haar Cascade classifier to detect faces in uploaded images.

face-detection haar-cascade-classifier numpy opencv python streamlit

Last synced: 08 Apr 2026

https://github.com/iliailmer/numpy_learn

A small machine learning mock-library written entirely in numpy and shamelessly imitating PyTorch.

deep-learning machine-learning mnist numpy scipy

Last synced: 09 May 2026

https://github.com/gmork2/covid-19

A mathematical analysis of the infection growth

coronavirus covid-19 jupyter-notebook numpy pandas python scikit-learn

Last synced: 08 Apr 2026

https://github.com/shahbazshaddy/explainable-multimodal-ai-for-breast-cancer-and-pneumonia-prediction

A deep learning-based framework integrating explainable multimodal AI for accurate prediction and transparent diagnosis of breast cancer and pneumonia.

deep-learning explainable-ai grad-cam groq-api llm machine-learning matplotlib multimodal numpy pandas python pytorch scikit-learn seaborn streamlit

Last synced: 08 Apr 2026

https://github.com/seymencanates/pythonn

Here is my all personal educations , notes , basic apps I did. Here will be my backside for all supportive information about python I am gonna need.

django flask numpy pandas python3 scikitlearn-machine-learning tensorflow tkinter

Last synced: 08 Apr 2026

https://github.com/sarthakm402/rabbi

A Ml library that can be used ot clean,and use models to predict on data

numpy optuna pandas pypi-package python sklearn

Last synced: 20 Apr 2026

https://github.com/barraharrison/seoul-bike-sharing

Performing EDA on a kaggle dataset to look at the distribution of Seoul's bike-sharing system

jupyterlab matplotlib numpy pandas python scikit-learn seaborn

Last synced: 23 Jul 2025

https://github.com/eljandoubi/disasterresponsepipeline

Project aim is to build a Natural Language Processing (NLP) model to categorize messages on a real time basis.

flask nltk numpy pandas plotly scikit-learn scipy sqlalchemy

Last synced: 09 Apr 2026

https://github.com/harmanveer-2546/campus-placement-prediction

Student Campus-Placement Prediction ML Project using Python

decision-trees machine-learning matplotlib numpy python random-forest scipy seaborn

Last synced: 08 Mar 2026

https://github.com/abyshergill/ultralytics_yolo_object_detection_testing_gui

This is a desktop application for real-time object detection using the YOLO (You Only Look Once) model build with ultralytics libarary. It's built with PyQt5 for the graphical user interface and OpenCV for video and image processing. The application allows users to load a YOLO model, adjust detection settings, and perform object detection on media.

numpy opencv-python pytorch qt5-gui threading ultralytics yolo

Last synced: 08 May 2026

https://github.com/prachi005748/website-performance-data-analysis-project

Briefly describe the objective of the project—e.g., analyzing website performance metrics over time, uncovering trends in user engagement, or evaluating channel-wise traffic quality.

data-analyst data-cleaning data-preprocessing data-visualization data-visualization-python exploratory-data-analysis jupyter-notebook matplotlib numpy pandas python seaborn storytelling

Last synced: 01 May 2026

https://github.com/samiksha29-patil/e-commerce-sales-insights-dashboard

This project focuses on analyzing e-commerce sales data through data visualization. It highlights customer behavior, popular sales channels, product category trends, and city-wise performance to provide actionable business insights.

analytics customer-insights data-visualization ecommerce matplotlib numpy pandas python sales-analysis seaborn

Last synced: 03 May 2026

https://github.com/princeoncada/quant-pca-risk

Applies Principal Component Analysis (PCA) to daily returns of 20 US equities (2015–2025) to uncover hidden risk factors. Explores variance explained, scree, loadings, factor returns, covariance reconstruction, and Varimax rotation. Results show 3–5 PCs capture ~75% of portfolio risk.

correlation-analysis covariance-matrix dimensionality-reduction factor-models matplotlib numpy pandas pca portfolio-risk principal-component-analysis python quantitative-finance time-series-analysis variance varimax

Last synced: 06 May 2026

https://github.com/labex-labs/numpy-for-beginners

This comprehensive course covers the fundamental concepts and practical techniques of NumPy, the essential library for numerical computing in Python. Learn to create, manipulate, and analyze arrays efficiently.

array-manipulation array-slicing beginner-friendly course data-analysis data-science data-structures fast-computation hands-on labex labs linear-algebra matrix-operations numerical-computing numpy programming python python-programming scientific-computing vectorized-operations

Last synced: 20 Jun 2026

https://github.com/abhishek-patidar066/synthetic-datasets

Synthetic datasets are artificially generated data used for training machine learning models, simulating real-world data while ensuring privacy.

clustering datasets jupyter-notebook libraries matplotlib-pyplot numpy pandas-dataframe python random sklearn

Last synced: 15 Apr 2026

https://github.com/anushkapal28/maatri-care

MaatriCare is an AI-based healthcare application designed to provide personalized nutrition advice and maternal care recommendations for pregnant women, lactating mothers, and women in general.

gemini-api jupyter-notebook labelencoder numpy pandas python3 random-forest streamlit svm-training

Last synced: 18 Sep 2025

https://github.com/v-mayya/resale-revolution

Empower Hacks Hackathon Submission

html matplotlib numpy pandas python sqlite streamlit

Last synced: 09 Apr 2026

https://github.com/shafaq-aslam/predicting-heart-disease-risk-with-logistic-regression-techniques

Develop a predictive model using logistic regression techniques to assess heart disease risk based on patient health metrics and data analysis.

data-analysis heart-disease logistic-regression machine-learning machine-learning-models matplotlib numpy pandas python scikit-learn seaborn

Last synced: 09 Apr 2026

https://github.com/ajxxxs/spotify-music-analysis

spotify Music (web scraped playlists ) analysis (over 3 states) , trends, features and a music recommendation system.

matplotlib numpy panda scikit-learn seaborn

Last synced: 28 Jul 2025

https://github.com/shambac/shamboflow

Fierce tensorflow competitor

cuda cupy machine-learning numpy pypi-package

Last synced: 19 Feb 2026

https://github.com/mallocode300/colour_palette_generator

Upload an image and immediately obtain a colour palette with the 10 most common colors in HEX codes and RGB

numpy python

Last synced: 29 Jul 2025

https://github.com/arjunravi26/glucose-monitoring

Glucose Monitoring system to monitor patients and notify whenever glucose goes high.

machine-learning numpy pandas plotly python3 streamlit xgbboostregressor

Last synced: 30 Jul 2025

https://github.com/youssefali11997/free-style-coding

Scripts written as practice, trying new technology or just playing with the code !

bash javascript numpy python3 ruby-on-rails scripts vuejs

Last synced: 12 Apr 2026

https://github.com/abeed04/face-recognition-using-computer-vision-cv2

OpenCV (cv2) can be used for face recognition by detecting faces, extracting facial features, and comparing them to a database of known faces.

cmake cv2-library dlib-face-recognition face-recognition flask numpy pycharm-ide python

Last synced: 08 Feb 2026

https://github.com/vigneshvaranasi/breast_cancer_detection

This project employs machine learning, focusing on Logistic Regression, to detect breast cancer using tumor-related features. The dataset is preprocessed, and the model achieves 100% accuracy on the test set. The goal is to gain insights into breast cancer factors and provide an effective detection solution.

jupyter machine-learning matplotlib numpy pandas python scikit-learn

Last synced: 09 Apr 2026

https://github.com/dmarks84/ind_project_european-soccer-top-points-contributors--kaggle

Independent Project - Kaggle Dataset-- I worked on the European Soccer Dataset, using SQL (SQLite) to read in the data and then data wrangling before running statistical analysis and hypothesis testing on questions of who helps earn the most points for their team.

data-wrangling hypothesis-testing numpy p-values pandas python scipy-stats statistics t-test

Last synced: 09 Apr 2026

https://github.com/mr-chang95/twitter_datawrangling

Twitter Data Wrangling for Udacity's Data Analyst Nanodegree Program

data-visualization data-wrangling dogs matplotlib numpy pandas python twitter

Last synced: 09 Apr 2026

https://github.com/blinksta1ker/co2-prediction

EDA and Supervised ML model using Regression to predict Co2 Emissions from Vehicles

jupyter-notebook machine-learning numpy pandas python regression seaborn sklearn supervised-learning

Last synced: 09 Apr 2026

https://github.com/hansalemaos/npsearchsequence

Sequence / string search with NumPy

fast numpy search sequence string

Last synced: 19 May 2026

https://github.com/0xunkn0wn4m1r/data_engineering_banking_project

🏦 Build a complete data engineering workflow for a banking system, showcasing ETL processes, data transformations, and an interactive financial dashboard.

automation data-analysis data-cleaning data-science feature-engineering fintech-bank flask-api loan-default-prediction machine-learning mlops model-explainability numpy postgresql scikit-learn segmentation shap sql unsupervised-learning

Last synced: 09 Apr 2026

https://github.com/rezapace/machine-learning-sound-detection

Welcome to the Machine Learning Detection Sound project! This project harnesses the power of machine learning to analyze car sounds, enabling the detection of vehicles based on their audio signatures.

colab-notebook googlecolab jupyter-notebook machine-learning numpy pathlib python python3 sound source tensorflow voice voice-recognition

Last synced: 09 Apr 2026

https://github.com/niteshchawla/aerofit-casestudy

The market research team at AeroFit wants to identify the characteristics of the target audience for each type of treadmill offered by the company, to provide a better recommendation of the treadmills to the new customers. The team decides to investigate whether there are differences across the product with respect to customer characteristics.

matplotlib-pyplot numpy pandas probability python seaborn statistics

Last synced: 09 Apr 2026

https://github.com/gitluffywd/advanced_video_stabilization_and_roi_pixel_analysis

This repository offers tools for advanced video stabilization and pixel analysis, enhancing your video processing capabilities. Explore features like optical flow tracking and ROI selection to dive deeper into your video data. 🛠️🎥

data-structures debugging english-language git github html markdown numpy opencv project-management python

Last synced: 20 Jan 2026

https://github.com/nahiyanhkhan/stock-market-data-analysis_capstone-project

In this course, learned and solved assignments on SQL and Python. Final capstone project was on analyzing "Stock Market Data". Achieved 100% score in every assignment.

data-analysis data-analytics matplotlib mysql mysql-database numpy pandas python sql

Last synced: 09 Apr 2026

https://github.com/azaz9026/eda

Exploratory Data Analysis (EDA) refers to the method of studying and exploring record sets to apprehend their predominant traits, discover patterns, locate outliers, and identify relationships between variables. EDA is normally carried out as a preliminary step before undertaking extra formal statistical analyses or modeling.

data-cleaning data-visualization encoding machine-learning matplotlib numpy pandas plotly python3 seaborn sklearn-library

Last synced: 15 Apr 2026

https://github.com/harmanveer-2546/house-price-prediction-

We all have experienced a time when we have to look up for a new house to buy. But then the journey begins with a lot of frauds, negotiating deals, researching the local areas and so on. So to deal with this kind of issues Today, I prepared a MACHINE LEARNING Based model, trained on the House Price Prediction Dataset.

catboost-classifier linear-regression machine-learning matplotlib numpy pandas python random-forest seaborn svm

Last synced: 09 Apr 2026

https://github.com/kasraskari/tumor-predict

Streamlit app for predicting tumor malignancy using logistic regression.

logistic-regression machine-learning numpy pandas python scikit-learn streamlit tumor-detection

Last synced: 09 Apr 2026

https://github.com/mobinabatebi/electricity_consumption_project

The project aimed at estimating electricity consumption in administrative offices, along with analyzing the impacts of such consumption on power outages, is critical for addressing the pressing issue of summer electricity shortages.

jupyter-notebook numpy pandas python

Last synced: 28 Sep 2025

https://github.com/abdulsamie10/pythonbasics

This repository contains few tasks, which I developed just to get a strong grip on Python Programming Langauge.

ai labtasks lambda numpy python pythonlab

Last synced: 20 Apr 2026

https://github.com/christabelsakyi/employee_attrition_prediction

This project implements a machine learning pipeline for predicting employee attrition using HR data. The system analyzes various factors that contribute to employee turnover and provides predictions to help HR departments take preventive measures.

machine-learning numpy pandas plotly python sklearn streamlit

Last synced: 09 Apr 2026

https://github.com/thavinduushan/linear-regression

A linear regression model to predict premium price of an insurance. The dataset includes around 1,000 observations. The project, aimed at learning, was implemented in Jupyter notebooks using NumPy, Matplotlib, Pandas, Keras, and TensorFlow.

matplotlib numpy pandas python tensorflow

Last synced: 09 Apr 2026

https://github.com/dulajkavinda/handwritten-recognition

🛢Handwritten Digit Recognition using OpenCV

added machinelearning numpy opencv python

Last synced: 09 Apr 2026

https://github.com/fazal-coder/data-science-business-analytics-tsf

I have completed two tasks in this repository , task 3 and task 4

matplotlib numpy pandas python scipy

Last synced: 09 Apr 2026

https://github.com/omdoshi13/pricing-of-laptops-using-ml

Data Analysis, training Machine Learning models, and Model Evaluation and Refinement for Pricing of Laptops dataset.

data-analysis data-analysis-project datascience google-colab jupyter-notebook machine-learning matplotlib model-evaluation model-refinement numpy pandas python scikit-learn

Last synced: 09 Apr 2026

https://github.com/v41bh4vr4jput/data-analysis-with-python

This repository is a comprehensive collection of data analysis projects and tutorials using Python's most powerful libraries: NumPy, Pandas, Seaborn, and Matplotlib. It is designed to help you explore, clean, visualize, and analyze data efficiently.

api data data-analysis data-visualization matplotlib numpy pandas python sakila-db seaborn

Last synced: 09 Apr 2026

https://github.com/prashhhant213/exploratory-data-analysis-for-multinational-retail-corporation

Analysis via CLT and Visualization on Multinational Retail Corporation's data to provide insights and recommendations to improve their userbase.

math matplotlib numpy pandas python scipy-stats seaborn stats

Last synced: 09 Apr 2026

https://github.com/raghulrajn/machine-learning-d-r-y

This repository contains quick python scripts that are repeatedly used in EDA on dataset

data-science numpy pandas python

Last synced: 09 Apr 2026

https://github.com/jahanostg/healthguard_with-smart-ai-diagnosis-hub

This project is a Flask-based platform for patients, donors, and doctors, integrating hospital management in one solution. It uses Machine Learning models for predictive health analysis and Power BI for interactive dashboards. Through this project, I enhanced my skills in full-stack development, applied ML, and healthcare data solutions.

css flask-api google-colaboratory-notebooks html5 js machine-learning mysql numpy pandas pkl-model powerbi python-3

Last synced: 09 Apr 2026

https://github.com/josafatburmeister/circle_detection

A Python package for circle fitting.

circle numpy objectdetection pointcloud python

Last synced: 09 Apr 2026

https://github.com/srikarveluvali/dataanalysis

The "Dataset - Extraction, Analysis, and Visualization" project is a Python-based data analysis venture that focuses on exploring and interpreting the "Video Game Sales Analysis" dataset.

css data-analysis html javascript matplotlib numpy pandas python seaborn tableau

Last synced: 09 Apr 2026

https://github.com/faisal-khann/banking_portfolio_risk_analysis

This analysis explores the banking customer dataset to uncover key patterns in account holdings, loan portfolios, income profiles, and transactional behavior. By integrating both financial and demographic variables, we examine customer engagement, lending trends, and risk exposure across different segments.

data-manipulation eda matplotlib numpy pandas powerbi python seaborn

Last synced: 21 Aug 2025

https://github.com/prath0m/python-numpy

Python basic Numpy Program

machine-learning numpy python python3

Last synced: 09 Apr 2026

https://github.com/singhvishal003/bert-sentiment

Sentiment Analysis Model To Detect the Sentiments.

beautifulsoup numpy pandas requests torch torchvision

Last synced: 09 Apr 2026

https://github.com/mani-prakash-n-r/book_recommendation_system_using_hybrid_technique

The Book Recommendation System uses collaborative and content filtering techniques to suggest books based on user preferences and content similarity.

csv html numpy pandas python sckiit-learn

Last synced: 09 Apr 2026

https://github.com/amandeep-gupta19/chatbot

Created a custom chatbot using Langchain. Here's a summary of what I did: Data Extraction: I gathered data about technical courses from the Brainlox website using Langchain’s URL loaders. Embedding Creation & Storage: I converted this data into embeddings and stored it in a vector store for efficient searching. API Development: I built a Flask

data-extraction faiss-vector-database flask-restful langchain numpy scikit-learn vector-database webbaseloader

Last synced: 09 Apr 2026

https://github.com/baranwaljii/data-science-projects

This Repository contains projects related to the domain of data science based on python and jupyter notebook

data-science jupyter-notebook machine-learning numpy pandas python

Last synced: 09 Apr 2026

https://github.com/ashraf-khabar/bank-marketing-data-analysis

This project is focused on analyzing bank marketing data using PyTorch, pandas, numpy, and scikit-learn. The goal is to build a predictive model that can help identify potential customers who are more likely to subscribe to a bank's term deposit.

data-cleaning data-science data-visualization dataset deep-learning deep-neural-networks feedforward-neural-network learning neural-networks numpy pandas python pytorch sklearn

Last synced: 09 Apr 2026

https://gitlab.com/DLR-SY/beos

Beos is a legacy Fortran tool used to calculate the static and dynamic buckling behavior of flat/curved fiber composite structures.

fortran numpy python

Last synced: 16 Aug 2025

https://github.com/sukanyadutta52/topic_modeling

What are the most pressing concerns regarding ‘Climate Change’ among tweeters according to Topic Modeling?

climate-change gensim matplotlib nltk numpy pandas pyldavis regular-expression spacy

Last synced: 09 Apr 2026

https://github.com/subhadipsinha722133/credit-card-fraud-dection

Web application for detecting fraudulent credit card transactions using machine learning

data-visualization fraud-detection machine-learning matplotlib numpy pandas seborn sklearn streamlit

Last synced: 10 Apr 2026

https://github.com/amithjoseph777/gourmethaven-case-competition

Predictive analytics case competition project from the Master of Science in Business Analytics course at the College of Business, University of Louisville. Our team analyzed customer response trends for Gourmet Haven, developed a predictive model, and ranked in the Top 5 out of 25 teams.

google-colab jupyter-notebook numpy pandas r scikit-learn

Last synced: 10 Apr 2026

https://github.com/terrelbrinkley/python-projects

Python Scripts & Data Analyst Projects

aws numpy pandas python selenium

Last synced: 10 Apr 2026

https://github.com/cyberoctane29/epa-air-quality-aqi-analysis

This project involved analyzing air quality data from the EPA, focusing on the Air Quality Index (AQI). I used Python data structures like dictionaries and sets to manage and process the data, simulating real-world data analysis to assess pollution levels and their health implications.

data-analysis numpy pandas python statistics

Last synced: 10 Apr 2026

https://github.com/liblaf/array-typing

🧮 Type-safe array operations for NumPy, JAX, and PyTorch

array code-quality documentation jax numpy python static-analysis torch type-hints typing

Last synced: 19 Aug 2025

https://github.com/kaladabrio2020/calculusnumerics

Trabalho da cadeira de Métodos Numérico I

metodos-numericos numpy python tkinter-gui

Last synced: 18 May 2026

https://github.com/beyzabasarir/northwind-traders-analysis

Northwind dataset analysis using PostgreSQL, Python, and Power BI. Focused on sales, customers, shipping, and performance insights.

dashboard data-analysis data-visualization jupyter-notebook matplotlib numpy pandas postgresql powerbi python seaborn

Last synced: 10 Apr 2026

https://github.com/dhaase-de/dito

Yet another toolbox for the daily work with OpenCV under Python.

numpy opencv python

Last synced: 22 Aug 2025

https://github.com/balakrish001/python-course

The Python course provides a thorough foundation in Python programming, starting with core concepts and progressing to OOP. It covers essential libraries like NumPy and Pandas for data manipulation and analysis. You'll also learn how to send emails using Python's SMTP library and interact with APIs for real-world applications,

api numpy oops-in-python pandas problem-solving python smtp

Last synced: 10 Apr 2026

https://github.com/aymen016/cosmic-mystery-challenge-2912

"Explore the depths of space and unravel cosmic mysteries in the year 2912 with our Cosmic Mystery Challenge repository. Dive into data science adventures as you predict the fate of passengers aboard the Spaceship Titanic after a collision with a spacetime anomaly. Join us in reshaping history and saving lives across the universe!"

kaggle matplotlib matplotlib-pyplot numpy pandas pandas-dataframe python scikit-learn scikitlearn-machine-learning seaborn

Last synced: 10 Apr 2026

https://github.com/prince-pastakiya/human-resources-tableau-project

👥 Interactive Tableau dashboard for HR analytics — includes workforce overview, demographics, income analysis, and detailed employee records with full filtering.

chatgpt data-analysis data-visualization human-resources numpy python python-faker tableau-dashboards tableau-public

Last synced: 18 Apr 2026

https://github.com/sc0v0ne/blueflix

Simple Application Recommend Movies and Tv Shows

csv k-means k-means-clustering kaggle movies numpy pandas python recommends-movies sklearn streamlit

Last synced: 10 Apr 2026

https://github.com/mays-m/deeplearning

In deep learning, with the help of data and training, neural networks are created and trained, which create better predictions than before with the help of neural networks.

anaconda artificial-intelligence conda-environment matplotlib neural-networks numpy pandas python3 tensorflow tensorflow-gpu transortfow-

Last synced: 11 Apr 2026

https://github.com/quickheaven/scs-3251-statistics-for-data-science

Repository dedicated to Term Project of UofT Statistics for Data Science Course

data-science jupyter-notebook logistic-regression numpy pandas python

Last synced: 10 Apr 2026

https://github.com/thatguychandan/adoptimization

This project implements an ad optimization system using a hybrid approach combining Thompson Sampling and Upper Confidence Bound (UCB) algorithms. The system learns to select the most effective ads based on user context and historical performance.

numpy pandas plotly python pytorch reinforcement-learning scikit-learn streamlit thompson-sampling upper-confidence-bound

Last synced: 10 Apr 2026

https://github.com/hasnainroopawalla/sudoku-generator

This algorithm can generate and solve a random Sudoku Puzzle Board using Backtracking

backtracking numpy python sudoku-generator

Last synced: 16 May 2026