An open API service indexing awesome lists of open source software.

NumPy

NumPy is an open source library for the Python programming language, adding support for large, multidimensional arrays, and matrices, along with a large collection of high-level mathematical functions to operate on these arrays.

https://github.com/nowon1/insurance-claim-prediction_version

This project aims to predict the insurance claim amounts based on various customer attributes using machine learning techniques. The project involves data preprocessing, exploratory data analysis, feature engineering, and model training and evaluation.

data-preprocessing data-science data-visualization exploratory-data-analysis feature-engineering insurance jupyter-notebook machine-learning numpy pandas predictive-modeling python random-forest regression-analysis scikit-learn

Last synced: 31 Dec 2025

https://github.com/sasivakulrithwik/059_iva_assignment-1

Image processing and object detection using OpenCV, including color detection, smoothing, edge detection, and face detection with Haar cascades. 🚀

ipython-notebook numpy opencv python3

Last synced: 31 Dec 2025

https://github.com/chinmayee4/diwali_sales_analysis_using_python

Analyze Diwali Sales Data to improve customer experience and sales.

data-science exploratory-data-analysis matplotlib numpy pandas python seaborn

Last synced: 28 Aug 2025

https://github.com/sarincr/training-on-artificial-intelligence

Entree Academy 10 Days free training on Artificial Intelligence. Course will be conducted in a Blended learning way with Daily one hour online training and 3 hour project based training

artificial-intelligence artificial-intelligence-algorithms data-analysis data-science data-visualization decision-trees deep-learning deeplearning logistic-regression machine-learning machine-learning-algorithms machinelearning num numpy pandas regression scikit-learn scipy sklearn

Last synced: 31 Dec 2025

https://github.com/sudarshanc00/brain-tumor-classification

This project uses a deep learning model in PyTorch to classify brain MRI images into four tumor types, aiding early diagnosis and treatment planning. Two ResNet-based models were developed and optimized, achieving high accuracy to support healthcare professionals in identifying tumor categories.

matplotlib numpy pytorch resnet scikit-learn streamlit

Last synced: 31 Dec 2025

https://github.com/sagravela/plotground

Plotting Tool with Matplotlib and Tkinter. Video Demo:

matplotlib numpy python tkinter

Last synced: 21 Mar 2025

https://github.com/hossein-cod/parking-space-detection

Identifying empty spaces in the parking lot with openCV

cv2 jupyter-notebook numpy opencv pickle python

Last synced: 21 Mar 2025

https://github.com/mathensley/asimpleneuralnetworklib

A simple library in Python for training datasets using neural network models

deep-learning neural-networks numpy python

Last synced: 21 Mar 2025

https://github.com/nikhilsree5/netflixcasestudy

Global Content Strategy: Leveraging Data Insights to Optimize Netflix's International Growth

eda numpy pandas python visualization

Last synced: 21 Mar 2025

https://github.com/firewind25/gdp-data-extraction-and-processing

Data Extraction from a web resource using Web-Scraping and request APIs. Processed it using Pandas and NumPy libraries. Finally exporting it to csv file for further analysis

numpy pandas python webscraping

Last synced: 12 Jul 2025

https://github.com/blinksta1ker/spam-mail-classification

Supervised ML model using Logistic Regression model to classify an email as Ham or Spam

jupyter-notebook logistic-regression machine-learning numpy pandas python sklearn supervised-machine-learning

Last synced: 30 Dec 2025

https://github.com/djdurga/google_play_store_apps_analysis

This data analysis project focuses on exploring and understanding the Google Play Store Apps dataset.

numpy pandas python

Last synced: 21 Mar 2025

https://github.com/dhruvil-26/python-projects

This repository contains Python projects showcasing data analysis and visualization. 1. IMDB Movie Analysis: Analyzing movie trends, genres, and ratings. 2. Loan Default Analysis EDA: Exploring factors contributing to loan defaults.

eda imdb-dataset loan-default-analysis matplotlib numpy pandas python seaborn visualization

Last synced: 21 Mar 2025

https://github.com/nikhilsree5/aerofitcasestudy

Customer Profiling and Market Segmentation for AeroFit Treadmills: A Data-Driven Approach

customerprofile eda numpy pandas python visualization

Last synced: 21 Mar 2025

https://github.com/andrealopezp/accommodationanalysis

Accommodation - Exploratory Data Analysis & Visualization - Market Trends

dashboard data-cleaning exploratory-data-analysis matplotlib numpy pandas python seaborn tableau-public

Last synced: 21 Mar 2025

https://github.com/nikhilsree5/walmartcasestudy

Analysis of Customer Spending Habits at Walmart Inc

clt eda numpy pandas python3 statistics visualization

Last synced: 21 Mar 2025

https://github.com/jeffson-dsilva/smartphone-price-prediction

Smartphone Price Prediction is a machine learning project that predicts the price category of a smartphone based on its features. This project provides insights into how various specifications, such as RAM, battery capacity, and processor speed, influence the pricing of smartphones.

dataanalysis machine-learning matplotlib modelbuilding numpy pandas price-prediction python seaborn smartphone

Last synced: 21 Mar 2025

https://github.com/matkorussovich/student-performance-analysis

Este repositorio contiene un análisis del desempeño académico de estudiantes, realizado como parte del módulo "Introducción al Data Science" en el Máster en Data Science de la Universidad Europea de Madrid.

jupyter-notebook matplotlib-pyplot numpy pandas python

Last synced: 21 Mar 2025

https://github.com/zombietdv/rfl-13_8

Implementaion for Genetic algorithm and Q-Learning.

genetic-algorithm grid-world machine-learning numpy pygame q-learning

Last synced: 21 Mar 2025

https://github.com/ccolpasm/pageviewanalyzer

PageViewAnalyzer analyzes and visualizes time series data of forum visits (2016-2019). It uses Pandas, Matplotlib, and Seaborn to create line plots, bar charts, and box plots, highlighting trends, monthly averages, and seasonality in page views. Perfect for data exploration with Python.

data-science matplotlib numpy pandas python

Last synced: 21 Mar 2025

https://github.com/quinnphil/math-for-programmers

Work-alone code for Math for Programmers book by Paul Orland

geometry mathematics matplotlib numpy python

Last synced: 21 Mar 2025

https://github.com/robinhosz/traffic-data-analyzer

This repository presents a project for urban traffic signal optimization using Reinforcement Learning (RL) to improve traffic flow, increase safety at intersections, and promote energy efficiency in cities. Using the "Traffic Volume Counts" dataset from Kaggle.

data-analysis-python kaggle-dataset numpy pandas python traffic-analysis

Last synced: 21 Mar 2025

https://github.com/samuele-lolli/data-analytics-techniques

A practical approach to data analytics pipeline.

numpy pandas pytorch scikit-learn

Last synced: 31 Dec 2025

https://github.com/m-rishab/job-recruitment-prediction-and-hr-dashboard-using-plotly

This project features make it ideal for dynamic HR dashboards, offering insights into candidate profiles and recruitment processes.

correlation-analysis flask kmeans-clustering numpy pandas plotly python scikit-learn seaborn standardscaler

Last synced: 31 Dec 2025

https://github.com/vipinchaudhary31122002/bitcoinforecast

BitcoinForecast is a deep learning-based Bitcoin price prediction project using historical data, time series analysis, and LSTMs to forecast BTC trends. 🚀💰

deep-learning machine-learning matplotlib numpy pandas python3 scikit-learn sklearn streamlit

Last synced: 31 Dec 2025

https://github.com/shibam120302/dog_breed_prediction_app

The Dog Scanner app will identify your dog's breed reliably in just a few seconds! Besides taking a picture, you can also record a video or upload an image from your gallery. Got a mixed breed? No problem, the Dog Scanner app also recognizes mixed breeds!

keras numpy opencv python

Last synced: 31 Dec 2025

https://github.com/shibam120302/indian_sign_language_translator

This repository consists of the code utilized for creation of an Indian Sign Language Translator satisfying the following criteria : Near-Real-Time Application Achieve background independence Attain Illumination independence

imageai keras numpy opencv pillow tensorflow

Last synced: 31 Dec 2025

https://github.com/avin27/credit-card-fraud-detection

A python script to analyse credit card data to detect fraudulent transactions

data-science numpy pandas sklearn

Last synced: 31 Dec 2025

https://github.com/quickheaven/scs-3251-statistics-for-data-science

Repository dedicated to Term Project of UofT Statistics for Data Science Course

data-science jupyter-notebook logistic-regression numpy pandas python

Last synced: 31 Dec 2025

https://github.com/Madhuresh2011/Leetcode-Questions-Analysis-Using-Python

The LeetCode Question Analysis is a Python-based project designed to help users analyze and gain insights into LeetCode problems. It allows users to track, categorize, and evaluate their progress, identify patterns in problem-solving, and optimize the preparation.

analysis csv-files matplotlib-pyplot numpy pandas python python-project seaborn

Last synced: 13 Sep 2025

https://github.com/anumit-web/python-for-data-science-for-dummies-3rd-edition

Python® for Data Science For Dummies®, 3rd Edition John Paul Mueller

numpy pandas pandas-dataframe python

Last synced: 24 Feb 2025

https://github.com/mr-chang95/udacity-starbucks-challenge

Data Science Project for Udacity's Data Scientist Program. Using Python in Jupyter Notebook.

data data-science data-visualization numpy pandas sklearn

Last synced: 21 Mar 2025

https://github.com/ben854719/wasde-revenue-in-2024

The project will use AI to analyze the revenue from the dataset of the World Agricultural Supply and Demands Estimates.

gemini langchain large-language-models matplotlib numpy pandas-python

Last synced: 31 Dec 2025

https://github.com/naveen88112/clustering_customer_invoice_data

Customer Invoice Data Clustering This project uses clustering methods on customer invoice data for segmentation analysis. It preprocesses data, normalizes features, and uses K-Means and DBSCAN to cluster customers according to spending habits and shared locations.

clustering data-preprocessing data-visualization numpy pandas python silhouette-score standardization

Last synced: 31 Dec 2025

https://github.com/shivam5509/power-bi-project

Expert in creating interactive dashboards and reports using Power BI, utilizing 10+ visual tools like cards, slicers, and charts. Skilled in cleaning and transforming large datasets with Power Query Editor. Proficient in advanced DAX functions (SUMX, FILTER, CALCULATE) to derive insights and drive data-driven decisions.

advanced-excel computer-science data-analysis data-mining data-visualization engineering mysql numpy pandas powerbi pyhton3 sql sql-server

Last synced: 31 Dec 2025

https://github.com/ljadhav25/data-engineering-poc

This repository contains a beginner-level Data Engineering Proof of Concept (POC) project designed for practice. The objective is to provide hands-on experience with data engineering concepts, including data extraction, transformation, loading (ETL), and basic data analysis. This project is ideal for those looking to build foundational skills in da

data-analysis etl matplotlib numpy pandas python

Last synced: 31 Dec 2025

https://github.com/cyberoctane29/epa-air-quality-aqi-analysis

This project involved analyzing air quality data from the EPA, focusing on the Air Quality Index (AQI). I used Python data structures like dictionaries and sets to manage and process the data, simulating real-world data analysis to assess pollution levels and their health implications.

data-analysis numpy pandas python statistics

Last synced: 31 Dec 2025

https://github.com/memgonzales/order-brushing-shopee2020

Solution to the Week 1 Contest (Student Category) of the 2020 Shopee Code League on order brushing detection

data-analytics e-commerce fraud-detection jupyter-notebook numpy order-brushing pandas python rule-based

Last synced: 31 Dec 2025

https://github.com/theanujsinha01/seoul-bike-sharing-demand-prediction

This project predicts the demand for bike-sharing services in Seoul using machine learning algorithms. It utilizes historical data and environmental factors to provide accurate predictions that can aid in resource optimization and planning.

data-cleaning data-science exploratory-data-analysis feature-engineering jupyter-notebook machine-learning matplotlib model numpy pandas seaborn

Last synced: 31 Dec 2025

https://github.com/omarnahhass/crypto-cast

CryptoCast is a machine learning-powered tool that showcases three fundamental prediction models for forecasting cryptocurrency prices

cryptocurrency machine-learning numpy pandas python

Last synced: 29 Mar 2025

https://github.com/burhanali2211/marksanalyzer

This is a simple CLI-based tool that helps analyze student marks by calculating key statistics such as the highest and lowest scores.

analysis analyzer arrays marks numpy

Last synced: 29 Mar 2025

https://github.com/hitthecodelabs/petalanalyticsstreamlit

Web application developed with Streamlit that predicts the Iris flower type based on its physical features

matplotlib model numpy pickle python scikit-learn sklearn streamlit

Last synced: 20 Jun 2025

https://github.com/pramodyasahan/custom-image-classification-model

This repository hosts a TensorFlow-based custom image classification model. The model is designed to classify images into binary classes, labeled as 'Happy' and 'Sad'.

computer-vision convolutional-neural-networks cv2 keras matplotlib numpy tensorflow

Last synced: 31 Dec 2025

https://github.com/jhonatanthiago/projeto_analise_de_dados_com_python

A empresa contratou você para analisar dados de mais de 800 mil clientes, a maioria dos quais são inativos, ou seja, já cancelaram o serviço. O objetivo é entender os principais motivos desses cancelamentos e identificar as ações mais eficazes para reduzir esse número.

jupyter-notebook nbformat numpy openxyl pandas plotly python

Last synced: 31 Dec 2025

https://github.com/dpgitaccount/project-customer-segmentation-analysis-with-power-bi

This project segments customers based on their purchasing behavior to identify different target groups. It demonstrates skills in data analysis, clustering, and visualization using Power BI.

kmeans-clustering-algorithm linear-regression numpy pandas powerbi python seaborn segmentation

Last synced: 31 Dec 2025

https://github.com/kahngjoonkoh/randomshapegenerator

A program that will generate images with random shapes and background colours. Can be customized and generated in bulk.

generative-art numpy opencv python threading tkinter

Last synced: 31 Dec 2025

https://github.com/monddavila/online-retail-data-analysis

Online Retail Exploratory Data Analysis with Python

data-analysis jupyter-notebook matplotlib numpy pandas python seaborn

Last synced: 14 Jul 2025

https://github.com/pramodyasahan/car-safe-predictor

This repository contains a machine learning project that applies the K-Nearest Neighbors (KNN) classification algorithm to predict car safety ratings. The project uses a dataset of cars, with features such as buying price, maintenance cost, number of doors, persons, lug boot size, and safety.

classification k-nearest-neighbours machine-learning numpy pandas scikit-learn

Last synced: 31 Dec 2025

https://github.com/hanzopgp/tensorflowtest

Learning how to use jupyter notebook, tensorflow, numpy and other librairies for machine learning projects. Following Thibaut Neveu's tutorial.

artificial-intelligence convolutional-neural-networks deep-learning lstm-neural-networks machine-learning matplotlib multilayer-neural-networks numpy personal recurrent-neural-networks sklearn tensorflow

Last synced: 31 Dec 2025

https://github.com/elmezianech/email-spam-ham-classifier-lr

Email Classifier: A machine learning project using Python that categorizes emails into spam and ham (non-spam). Utilizes the Scikit-Learn library, employing logistic regression and TF-IDF (Term Frequency-Inverse Document Frequency) vectorization for text analysis and classification.

ai emails jupyter-notebook logistic-regression machine-learning ml numpy pandas python spam-detection spam-filtering tfidfvectorizer

Last synced: 31 Dec 2025

https://github.com/elam-parithi/singapore_flatprice_predicting

Flat price prediction with Machine learning tools and python.

matplotlib numpy pandas scikit-learn seaborn

Last synced: 31 Dec 2025

https://github.com/elmezianech/email-spam-ham-classifier-nb-simple

Count Vectorizer Naive Bayes Email Classifier: This Python project utilizes a simple Naive Bayes approach with Count Vectorizer to classify emails as spam or ham. The implementation focuses on word frequency for classification.

countvectorizer emails jupyter-notebook machine-learning ml naive-bayes naive-bayes-classifier numpy pandas python spam-detection spam-filtering

Last synced: 31 Dec 2025

https://github.com/iaroslavbelavskii/statkemapp

Here's the code for my 'StatKemApp' app, that can be used to calculate statistics for the experimental laboratory data (e.g. titration). Feel free to use it and report any bugs!

chemical-engineering chemistry matplotlib-pyplot numpy pandas python scipy

Last synced: 31 Dec 2025

https://github.com/martinkersner/kmeans-meetup

Presentation about k-Means for Seoul AI Meetup on July 22, 2017.

kmeans numpy python scikit-learn

Last synced: 25 Jun 2025

https://github.com/sridharyadav07/code_alpha-_task-4

This project demonstrates the typical steps in a machine learning pipeline from data preprocessing and cleaning to training a model and evaluation its performance. The use of Random Forest is appropriate here given the complexity of the dataset, and with further tuning and improvements, this model could be used to make accurate predictions on heart

confusion-matrix jypyternotebook labelencoder matplotlib numpy pandas python random-forest-classifier scikit-learn

Last synced: 24 Feb 2025

https://github.com/hairymax/python-for-science-lecture-notes

Tutorial material on the scientific Python ecosystem in Russian

matplotlib numpy python scipy sympy

Last synced: 31 Dec 2025

https://github.com/ahmedabdalkreem/naive-bayes

Show what types of naive-bayes and how to use model and deal with data ( Numerical, Categorical and Mixed between Numerical and Categorical )

numpy pandas python3

Last synced: 31 Dec 2025

https://github.com/tsar123/coffee-quality-analysis

machine learning using RandomForestRegressor and dataset with coffee data

coffee ml numpy pandas randomforestregressor regressor-models sklearn

Last synced: 25 Jun 2025

https://github.com/chandkund/recipe-recommendations

The Recipe Recommendations project leverages natural language processing (NLP) techniques to process recipe descriptions and classify them into categories like desserts, main courses, and more. The system provides personalized recommendations by analyzing the text input and suggesting relevant recipes.

embeddings matplotlib nlp numpy pandas softmax tensorflow

Last synced: 31 Dec 2025

https://github.com/nambukeerthi/copper_project

Data Analysis Skills – Work with real-world data. Machine Learning Experience – Build and train models. Web App Development – Create interactive dashboards

eda machine-learning numpy pandas python streamlit

Last synced: 04 Mar 2025

https://github.com/caterinatasinato/python-projects

Projects I worked on as Trainee in Data Analytics at ProfessionAI

matplotlib numpy pandas python3

Last synced: 31 Dec 2025

https://github.com/ahmedabdalkreem/preprocessing

All you need to be dataset ready to work in model without any Noise.

numpy pandas python3

Last synced: 31 Dec 2025

https://github.com/ashishsingh789/titanic_dataset_eda_and_visualization

This repository contains an exploratory data analysis (EDA) of the Titanic dataset. Key analyses include survival rates by gender, passenger class, age distribution, family size, and correlation heatmaps.

data data-science dataanalysis matplotlib numpy pandas pandas-dataframe python seborn visualisation

Last synced: 31 Dec 2025

https://github.com/gautamthampy/color-detection

Provides colour names and HTML/RGB mappings in various output formats.

c numpy opencv python

Last synced: 31 Dec 2025

https://github.com/naveen88112/healthcare

HealthCare Data Analysis and Forecasting This project examines healthcare data by processing missing values with KNN imputation, preprocessing features, and training classification models (Logistic Regression and Random Forest). The output includes performance metrics such as accuracy, confusion matrix, precision, recall, and ROC analysis.

data-visualization feature-engineering machine-learning model-evaluation numpy pandas python scikitlearn-machine-learning

Last synced: 24 Dec 2025

https://github.com/danhenriquex/data_science_and_machine_learning

A.I, Data Science & Machine Learning and Deep Learning with Tensorflow

deep-learning deep-neural-networks machine-learning matplotlib numpy pandas tensorflow

Last synced: 31 Dec 2025

https://github.com/satyacoder29/crm-analytics-power-bi

CRM Analytics Dashboard – An interactive dashboard using Tableau, SQL, and Salesforce CRM Analytics (CRMA) to analyze sales performance, customer segmentation, and churn prediction. Features automated ETL pipelines, predictive analytics, and real-time insights for data-driven decision-making. 🚀📊

advanced-excel data-analysis data-cleaning data-collection data-transformation data-visualization matplotlib numpy pandas powerbi python seaborn sql tableau

Last synced: 29 Mar 2025

https://github.com/guptamudit/whatsapp-chat-analyser

Created an online Whats App chat analysis tool which is designed to extract valuable insights from your group or one-on-one conversations. By analyzing your chat data, WhatsChat provides detailed statistics and visualizations.

matplotlib numpy pandas python seaborn

Last synced: 31 Dec 2025

https://github.com/ashishsingh789/customer_purchase_prediction_using_decision-tree-_classifier

Decision Tree Classifier to predict customer purchases using demographic and behavioral data. Key steps: data preprocessing, EDA, model training, evaluation, and feature importance analysis.

data datascience desiciontree eda machine-learning-algorithms matplotlib numpy pandas-dataframe python seaborn

Last synced: 31 Dec 2025

https://github.com/jass024/bitcoin-historical-analysis

Performed comprehensive time-series analysis on 14+ years of Bitcoin historical data using Python (Pandas, NumPy, Plotly). Calculated key technical indicators (SMA, EMA, RSI, volatility) to identify market trends, potential anomalies, and areas of regulatory concern. Visualized results using interactive Plotly charts.

numpy pandas python-3

Last synced: 28 Mar 2025

https://github.com/evan-dg31/data-science

Exploratory Data Analysis (EDA), Predictive Modeling (Supervised and Unsupervised), Regression, Classification, Clustering

classification clustering data-analysis data-science data-visualization machine-learning matplotlib numpy pandas python regression-analysis seaborn

Last synced: 31 Dec 2025

https://github.com/rotantos/ml-projects

Includes various ML projects created for educational purposes.

classification-model jupyter-notebook machine-learning numpy pandas python sklearn-library

Last synced: 31 Dec 2025

https://github.com/balakrish001/python-course

The Python course provides a thorough foundation in Python programming, starting with core concepts and progressing to OOP. It covers essential libraries like NumPy and Pandas for data manipulation and analysis. You'll also learn how to send emails using Python's SMTP library and interact with APIs for real-world applications,

api numpy oops-in-python pandas problem-solving python smtp

Last synced: 31 Dec 2025

https://github.com/ramyacp14/salesforecasting

Forecasts future sales for a retail company using time series analysis with Facebook Prophet. The project involves data preprocessing, exploratory data analysis (EDA), and forecasting with holiday effects considered.

data-preprocessing data-visualization exploratory-data-analysis fbprophet machine-learning matplotlib model-evaluation numpy pandas python seaborn time-series-forecasting

Last synced: 31 Dec 2025

https://github.com/hariprasath-v/machinehack-odetocode_predicting_weather_using_alien_fruit_properties

Identify the type of climate the exoplanet has based on the properties of the fruit by using machine learning.

catboost machine-learning matplotlib numpy pandas seaborn shap sklearn

Last synced: 31 Dec 2025

https://github.com/abhijeet107/household-transactions-analysis-report

This project analyzes household financial transactions spanning from 2015 to 2018, focusing on income, expenses, net savings, and spending patterns across categories and modes of payment.

matplotlib numpy pandas powerbi python seborn

Last synced: 31 Dec 2025