An open API service indexing awesome lists of open source software.

NumPy

NumPy is an open source library for the Python programming language, adding support for large, multidimensional arrays, and matrices, along with a large collection of high-level mathematical functions to operate on these arrays.

https://github.com/anmamun0/data-analysis-home-cleaning-services

This repository contains the analysis and visualization of data from a home cleaning services dataset. The project provides valuable insights into revenue generation, customer trends, and regional performance, helping businesses make data-driven decisions.

matplotlib numpy pandas

Last synced: 05 Mar 2025

https://github.com/pabs-code/face-detection-using-haar-cascade-classifier

This is a Streamlit-based face detection application that uses the Haar Cascade classifier to detect faces in uploaded images.

face-detection haar-cascade-classifier numpy opencv python streamlit

Last synced: 08 Apr 2026

https://github.com/vyjayanthipolapragada/image_classifier_model_hotdog

Building an Image classifier model to train and test a dataset and classify the given images into hotdog and not-hotdog.

artificial-intelligence dataset image-classification image-processing machine-learning matplotlib neural-networks numpy pandas python pytorch tensor torchvision transfer-learning

Last synced: 08 Apr 2026

https://github.com/muhkartal/e-forecast

machine learning-powered energy consumption prediction system that analyzes historical data to forecast future energy usage trends, optimizing efficiency and sustainability.

fastapi joblib matplotlib numpy pandas pydantic pytest sckit-learn seaborn tensorflow tqdm uvicorn xgboost yaml

Last synced: 08 Apr 2026

https://github.com/sanjana-bongale/cancer_survival_data_analysis_and_prediction_using_logistic_regression

This project performs data analysis using Python to predict cancer patient survival outcomes. It involves data cleaning, exploratory analysis, and visualizations to explore factors like cancer type, stage, and treatments. A logistic regression model is built to predict patient survival based on demographic and medical data.

data-analysis data-cleaning data-science data-visualization eda jupyter-notebook kaggle logistic-regression machine-learning matplotlib numpy pandas predictive-modeling python scikit-learn seaborn

Last synced: 08 Apr 2026

https://github.com/tabotcharlesbessong/python-errors

This repository will contain all python errors i will encounter in my life as a python plus their solutions

matplotlib-animation matplotlib-pyplot numpy pandas python-script python3 scikit-learn seaborn

Last synced: 16 Apr 2026

https://github.com/shdrn2402/budget-analytics

Analyze a family's two-year spending in Israel to uncover trends, optimize budgets, and compare consumption patterns using Python, Pandas, and visualization tools.

jupyter-notebook matplotlib numpy pandas python seaborn visualization

Last synced: 08 Apr 2026

https://github.com/secary/compsci7211

Foundations of Computer Science B

algorithms git numpy pandas shell unix

Last synced: 10 Apr 2026

https://github.com/singhvishal003/stock-price-predict

Stock Price Prediction model using yfinance data .

keras matplotlib numpy pandas yfinance

Last synced: 08 Apr 2026

https://github.com/mondalbidisha/landmark-recognition

A solution for identifying and recognizing landmarks from images, addressing key challenges and leveraging both algorithmic and human expertise to achieve high accuracy and reliability.

jupyter-notebook kaggle kaggle-datasets numpy pandas python3

Last synced: 08 Apr 2026

https://github.com/gmork2/covid-19

A mathematical analysis of the infection growth

coronavirus covid-19 jupyter-notebook numpy pandas python scikit-learn

Last synced: 08 Apr 2026

https://github.com/beolawork-art/novabank-churn-analysis

NovaBank has noticed that customers are closing accounts or going inactive, and they want to understand why.

data-analysis data-science-projects data-visualization eda machine-learning numpy pandas python scikit-learn sql

Last synced: 08 Apr 2026

https://github.com/lorenzorottigni/ml-lending-club

Machine Learning python bootcamp: random forest classifier on LendingClub dataset

ipynb machine-learning numpy pandas python random-forest-classifier scikit-learn seaborn

Last synced: 08 Apr 2026

https://github.com/shahbazshaddy/explainable-multimodal-ai-for-breast-cancer-and-pneumonia-prediction

A deep learning-based framework integrating explainable multimodal AI for accurate prediction and transparent diagnosis of breast cancer and pneumonia.

deep-learning explainable-ai grad-cam groq-api llm machine-learning matplotlib multimodal numpy pandas python pytorch scikit-learn seaborn streamlit

Last synced: 08 Apr 2026

https://github.com/colesmcintosh/pycuda-numpy-vector-ops

Accelerating NumPy Vector Operations with PyCUDA

cuda-programming numpy pycuda

Last synced: 28 Apr 2026

https://github.com/dmarks84/coursework_project_boston-data-project

Project for IBM Data Science course on Statistics -- Read in a large data set and performed several statistical analyses and hypothesis testing

communication data-modeling data-reporting dataframes eda hypothesis-testing matplotlib numpy pandas probability python scipy seaborn statistics visualization

Last synced: 08 Apr 2026

https://github.com/seymencanates/pythonn

Here is my all personal educations , notes , basic apps I did. Here will be my backside for all supportive information about python I am gonna need.

django flask numpy pandas python3 scikitlearn-machine-learning tensorflow tkinter

Last synced: 08 Apr 2026

https://github.com/shreyavats23/sign_language_recognition

Sign Language Recognition using Deep Learning : ~ Built a model to recognize American Sign Language (ASL) for letters A–Z and digits 1–9 ~ Achieved ~89% validation accuracy using a Convolutional Neural Network (CNN) ~ Used data augmentation techniques to improve generalization ~ Visualized predictions to test the model on real samples

artificial-intelligence keras machine-learning matplotlib numpy python pytorch seaborn sklearn tensorflow

Last synced: 08 Apr 2026

https://github.com/sarthakm402/rabbi

A Ml library that can be used ot clean,and use models to predict on data

numpy optuna pandas pypi-package python sklearn

Last synced: 20 Apr 2026

https://github.com/hariprasath-v/machinehack_analytics_olympiad_2023

Create a machine learning model to determine the likelihood of a customer defaulting on a loan based on credit history, payment behavior, and account details.

binaryclassification catboost exploratory-data-analysis machine-learning numpy pandas python scikit-learn shap

Last synced: 08 Apr 2026

https://github.com/barraharrison/seoul-bike-sharing

Performing EDA on a kaggle dataset to look at the distribution of Seoul's bike-sharing system

jupyterlab matplotlib numpy pandas python scikit-learn seaborn

Last synced: 23 Jul 2025

https://github.com/alphacrypto246/insurance-charges-prediction

The Predicting Insurance Charges project uses Decision Tree Regression to predict insurance charges based on features like age, sex, BMI, and smoking habits. It involves data preprocessing, feature scaling, and model evaluation with metrics like MAE and R².

machine-learning numpy pandas scikit-learn scikitlearn-machine-learning

Last synced: 03 May 2026

https://github.com/elijah-1994/modelling-airbnbs-property-listing-dataset

Training and Evaluating classification and regression model use cases on the Airbnb dataset.

data-science machine-learning numpy pandas plotly python pytorch sklearn

Last synced: 10 Apr 2026

https://github.com/ashkarin/rawimage

Read and write RAW images (or data arrays) without a headache.

image-processing imagej ndarray numpy raw-data raw-image readers writers

Last synced: 28 Apr 2026

https://github.com/dmarks84/ind_project_superstore-sales-time-series-analysis--kaggle

Independent Project - Kaggle Dataset-- I worked on the Superstore Sales Dataset, performing (as Part 1) data cleaning and preparation and exploratory data analysis. The main task was to make predictions for future sales based on time-series analysis, which is found in Part 2.

chloropleth data-modeling data-visualization eda linear-regression matplotlib numpy pandas python seaborn sklearn statistics statsmodels supervised-ml time-series-analysis

Last synced: 09 Apr 2026

https://github.com/bassemhalim/webrtc-demo

webrtc demo using aiortc library

numpy opencv python3 webrtc

Last synced: 09 Apr 2026

https://github.com/eljandoubi/disasterresponsepipeline

Project aim is to build a Natural Language Processing (NLP) model to categorize messages on a real time basis.

flask nltk numpy pandas plotly scikit-learn scipy sqlalchemy

Last synced: 09 Apr 2026

https://github.com/armahdavi/analytics_statistics_ML_plotting_dust_extraction_hvac_filters_ph2

PhD Technical Paper 1 - Phase 2 - Mahdavi & Siegel (2020) (Aerosol Science & Technology; AS&T) - Sharing all the data pipelines, processing codes, descriptive statistics, statistical modellings, and plotting/visualizations - Project Miestone: 2017 - 2020 - Full-length article is available

data-pipelines data-science data-visualization machine-learning matplotlib-pyplot numpy pandas-dataframe python scipy-stats sklearn statistics

Last synced: 17 Sep 2025

https://github.com/armahdavi/data_pipeline_analytics_statistics_ML_PM_PSD_residential_QFF

Sharing all the data pipelines and processing codes, statistical modellings, descriptive statistics, plot visualizations, and machine learning from Mahdavi & Siegel (2021) (Indoor Air) Project Miestone: 2017 - 2020 Full-length article: https://onlinelibrary.wiley.com/doi/abs/10.1111/ina.12782

data-science data-visualization dust hvac indoor-air-quality jupyter-notebook machine-learning matplotlib-pyplot numpy pandas python scikit-learn scipy-stats spyder spyder-python-ide statistics

Last synced: 17 Sep 2025

https://github.com/abeertechcamus/documentdata

The dataset was cleaned and queried using Python inside Jupyter Notebook and visualizes using PowerBI Document Data Analysis Projects

dax jupyter-notebook numpy pandas powerbi python

Last synced: 04 May 2026

https://github.com/dakshsammi/arkaid

Arkaid is a game performance analytics platform developed for the Information Integration Architecture Course - CSE656 (IIIT Delhi). It uses a data warehouse approach to analyze gaming data from multiple sources and provides insights via an AI-driven interface.

ai airflow aws database-management datawarehouse docker etl flask information-integration numpy openai postgresql python togetherai

Last synced: 09 Apr 2026

https://github.com/roluochke/real-estate-price-prediction-app

This is a data science project for a real estate price prediction website.

flask html-css-javascript jupyter-notebook matplotlib numpy python sklearn

Last synced: 09 Apr 2026

https://github.com/dsowles/miniproject1davidsowles2025

INF601 Mini Project 1

numpy yfinance

Last synced: 16 May 2026

https://github.com/abyshergill/ultralytics_yolo_object_detection_testing_gui

This is a desktop application for real-time object detection using the YOLO (You Only Look Once) model build with ultralytics libarary. It's built with PyQt5 for the graphical user interface and OpenCV for video and image processing. The application allows users to load a YOLO model, adjust detection settings, and perform object detection on media.

numpy opencv-python pytorch qt5-gui threading ultralytics yolo

Last synced: 08 May 2026

https://github.com/prachi005748/website-performance-data-analysis-project

Briefly describe the objective of the project—e.g., analyzing website performance metrics over time, uncovering trends in user engagement, or evaluating channel-wise traffic quality.

data-analyst data-cleaning data-preprocessing data-visualization data-visualization-python exploratory-data-analysis jupyter-notebook matplotlib numpy pandas python seaborn storytelling

Last synced: 01 May 2026

https://github.com/samiksha29-patil/e-commerce-sales-insights-dashboard

This project focuses on analyzing e-commerce sales data through data visualization. It highlights customer behavior, popular sales channels, product category trends, and city-wise performance to provide actionable business insights.

analytics customer-insights data-visualization ecommerce matplotlib numpy pandas python sales-analysis seaborn

Last synced: 03 May 2026

https://github.com/princeoncada/quant-pca-risk

Applies Principal Component Analysis (PCA) to daily returns of 20 US equities (2015–2025) to uncover hidden risk factors. Explores variance explained, scree, loadings, factor returns, covariance reconstruction, and Varimax rotation. Results show 3–5 PCs capture ~75% of portfolio risk.

correlation-analysis covariance-matrix dimensionality-reduction factor-models matplotlib numpy pandas pca portfolio-risk principal-component-analysis python quantitative-finance time-series-analysis variance varimax

Last synced: 06 May 2026

https://github.com/kushalagarwalla/netflix-movie-data-analysis

🚀 Netflix Data Analytics Project 🎬📊 | Analyzed 9K+ movies to uncover insights on genres, popularity, votes & release trends. Includes EDA, KPIs & visualizations using Python (Pandas, NumPy, Matplotlib, Seaborn). Supports data-driven content & engagement strategy.

data-analysis data-visualization jupyter-notebook numpy pandas python seaborn

Last synced: 06 May 2026

https://github.com/kandil2001/lid-cavity-evolution

Lid Cavity Evolution is an open-source CFD suite for the lid-driven cavity problem, featuring MATLAB, Python, and parallel solvers with benchmark comparisons.

benchmarking cfd cfd-simulation lid-driven-cavity matlab mpi numpy open-source openfoam openmp parallel-computing python scientific-computing simple-algorithm star-ccm

Last synced: 06 May 2026

https://github.com/4ngelojr/ai-ml-cheatsheets

🗂️ Access essential AI and ML concepts with quick-reference cheatsheets for effective learning and project implementation.

bigdata cheatsheet computer-science convolutional-neural-networks data-science deep-learning dl excel hadoop keras linux neural-networks numpy pig plotly probability sql statistics

Last synced: 01 May 2026

https://github.com/labex-labs/numpy-for-beginners

This comprehensive course covers the fundamental concepts and practical techniques of NumPy, the essential library for numerical computing in Python. Learn to create, manipulate, and analyze arrays efficiently.

array-manipulation array-slicing beginner-friendly course data-analysis data-science data-structures fast-computation hands-on labex labs linear-algebra matrix-operations numerical-computing numpy programming python python-programming scientific-computing vectorized-operations

Last synced: 20 Jun 2026

https://github.com/jaketherealone/machinelearningtoolset

Collection of primary tools used in the field of machine learning - Written for numpy projects

machine-learning numpy python toolset

Last synced: 19 Jun 2026

https://github.com/rohanbanerjee1234567-cell/prediction-of-expected-salary-using-machine-learning

Here is my first Project Repository where I have made a Machine Learning Project using Python. The Problem statement was to train a model based on the given Dataset and from there we need to Predict the Expected Salary of an Employee who will have similar profiles.

exploratory-data-analysis linearregression matplotlib-pyplot numpy pandas randomforest randomforestregressor scikit-learn scikitlearn-machine-learning searborn visualization

Last synced: 27 Apr 2026

https://github.com/radithsandeepa/robot-path-visualizer

Welcome to Robot Path Visualizer! This tiny project uses Python, Pygame, and NumPy to replay a precomputed trajectory (x, y, θ). It rotates a robot sprite frame-by-frame and draws a trail, making it handy for diff-drive path playback, simulation demos, and quick visual debugging of robotics experiments.

animation numpy pygame python robot-simulation robotics trajectory-visualization

Last synced: 18 Sep 2025

https://github.com/shotadft/vecposlib

This is a library that enables vector calculations in Python.

math math-library numpy python python-lib python-library python3

Last synced: 18 Sep 2025

https://github.com/abhishek-patidar066/synthetic-datasets

Synthetic datasets are artificially generated data used for training machine learning models, simulating real-world data while ensuring privacy.

clustering datasets jupyter-notebook libraries matplotlib-pyplot numpy pandas-dataframe python random sklearn

Last synced: 15 Apr 2026

https://github.com/crvargasm/metnumun2021i

Universidad Nacional de Colombia - Métodos Numéricos ~ Grupo 1 2021-I

matplotlib numerical-analysis numpy pandas python quantopian

Last synced: 12 Apr 2026

https://github.com/akash-207/ecommerce-monetization-analysis

“Revenue and monetization analysis of Brazilian e-commerce data (Olist dataset). Includes KPIs, trends, and business insights.”

business-analytics matplotlib mysql numpy pandas python seaborn

Last synced: 09 Apr 2026

https://github.com/noxs1d/cv-emotion-classification

This project is a deep learning-based system for detecting human emotions from facial images using TensorFlow, OpenCV, and NumPy.

cnn cnn-classification dockerfile keras numpy opencv tensorflow transfer-learning

Last synced: 09 Apr 2026

https://github.com/spacebakery/crunchie-munchies-project

Analyze Data with Pyhton

numpy python statistics

Last synced: 27 Jul 2025

https://github.com/mayankmittal29/tensortinker_statistical_methods_in_ai

This repository contains implementations of various machine learning algorithms from scratch, including Multi-Layer Perceptron (MLP), Gaussian Mixture Models (GMM), Principal Component Analysis (PCA), Autoencoders, and Variational Autoencoders.

autoencoder-mnist cupy gmm-clustering image-segmentation matplotlib-pyplot mlp-classifier mlp-regressor mnist-dataset numpy pandas pca python3 pytorch roc-auc seaborn torch variational-autoencoder

Last synced: 09 Apr 2026

https://github.com/vyjayanthipolapragada/image_classifier_cnn_data_augmentation

A deep learning project using Convolutional Neural Networks (CNNs) to classify CIFAR-10 images. The model leverages data augmentation, batch normalization, and ReLU activation to improve performance and generalization. Includes training and evaluation scripts for multi-class image classification.

adam-optimizer convolutional-neural-networks data-augmentation deep-learning image-classification jupyter-notebook neural-networks numpy optimizer pandas python pytorch relu-layer

Last synced: 09 Apr 2026

https://github.com/mirzaazwad/tymbert

TYMBert is our submission for NCIM 2025, a spam classifier that makes use of knowledge distillation to compress the model while preserving accuracy

bert huggingface-transformers knowledge-distillation machine-learning matplotlib numpy pandas python3 scikit-learn tiny-bert torch

Last synced: 09 Apr 2026

https://github.com/jcardonamde/datasets_ml

This project analyzes cab and limousine travel data in New York City. This with the goal of predicting the total duration of trips within the city. Machine learning models were used.

data-science machine-learning machine-learning-algorithms matplotlib numpy pandas pipelines python seaborn sklearn

Last synced: 09 Apr 2026

https://github.com/stefagnone/text_adventure_game

A text-based adventure game project using Python fundamentals

matplotlib numpy pandas python r scikit-learn seaborn sql

Last synced: 09 Apr 2026

https://github.com/Khushi130404/CatNet

CatNet is a simple machine learning project that classifies images as either a cat or not a cat using logistic regression. The dataset consists of labeled images of cats and non-cats, preprocessed and used to train a binary classification model.

h5py matplotlib numpy pillow scipy

Last synced: 19 Sep 2025

https://github.com/n-t-raghava/the_sweet_16

This project detects faces in real-time from a webcam feed or an uploaded image and predicts age (in bins of 5 years) and gender (Male/Female). The model is based on OpenCV’s Deep Neural Network (DNN) module and pre-trained models for face, age, and gender detection.

caffemodel deep-learning flask numpy opencv python

Last synced: 09 Apr 2026

https://github.com/shauryashaurya/marty_mcfly

Code, text and notebooks on a tutorial for Introduction to Machine Learning using open sources

anaconda jupyter-notebooks machine-learning machine-learning-tutorials notebook numpy python regression scikit-learn scipy tutorial

Last synced: 09 Apr 2026

https://github.com/nandit123/python_on_excel

Data Analysis using python libraries on excel data

csv data-analysis data-science fill fluctuations graph numpy python python-library

Last synced: 16 May 2026

https://github.com/rajireddy15/employee-attrition-prediction-hr-analytics-

Employee Attrition Prediction (HR Analytics) helps organizations analyze employee data, identify factors driving turnover, and predict attrition using machine learning and visual dashboards, enabling data-driven HR decisions and retention strategies.

data-cleaning data-collection data-manipulation data-preprocessing data-science data-visualization eda feature-engineering imbalanced-data machine-learning mysql-database numpy pandas

Last synced: 04 May 2026

https://github.com/ghaniketrajputp005/adobe-gensolve-hackathon-2024

This Project aims to identify, regularize, and beautify curves in 2D Euclidean space.

cv2 keras-tensorflow numpy os sklearn

Last synced: 07 Feb 2026

https://github.com/shambac/shamboflow

Fierce tensorflow competitor

cuda cupy machine-learning numpy pypi-package

Last synced: 19 Feb 2026

https://github.com/mallocode300/colour_palette_generator

Upload an image and immediately obtain a colour palette with the 10 most common colors in HEX codes and RGB

numpy python

Last synced: 29 Jul 2025

https://github.com/morsalinislamshapon/diabetes-prediction-systemv3

This repository contains a machine learning model that predicts diabetes using user health data. It features an interactive web interface built with Streamlit and provides insights into model predictions through SHAP and permutation importance. 🐙🌟

jupyter-notebook machine-learning matplotlib numpy pandas plotly randomforestclassifier sckit-learn seaborn streamlit-webapp

Last synced: 29 Jul 2025

https://github.com/arjunravi26/glucose-monitoring

Glucose Monitoring system to monitor patients and notify whenever glucose goes high.

machine-learning numpy pandas plotly python3 streamlit xgbboostregressor

Last synced: 30 Jul 2025

https://github.com/nathadriele/transaction_fraud_prevention_pipeline

Uma solução de detecção e prevenção de fraudes em transações financeiras, combinando Machine Learning, regras de negócio e análises estatísticas avançadas. O sistema oferece um dashboard interativo para monitoramento em tempo real, análise de dados e gestão de alertas de fraude.

data-analysis data-visualization docker fraud-prevention machine-learning matplotlib numpy pandas pipeline pytest python scikit-learn scipy seaborn streamlit tensorflow transaction xgboost

Last synced: 10 Apr 2026

https://github.com/zeeshan4002911/data-analysis-hub

Quality control, data processing, data cleaning, data ploting

jupyter-notebook jupyterlab matplotlib numpy pandas seaborn

Last synced: 03 May 2026

https://github.com/ledsouza/MedPhys-BI

Aplicativo desenvolvido para gestão e análise de dados do Serviço de Física Médica em Medicina Nuclear do HCPA

aws boto3 cloud-computing cloud-run docker dotenv gcp mongodb numpy os pandas plotly pymongo python s3 streamlit terraform vitrinedev yaml

Last synced: 22 Sep 2025

https://github.com/abeed04/face-recognition-using-computer-vision-cv2

OpenCV (cv2) can be used for face recognition by detecting faces, extracting facial features, and comparing them to a database of known faces.

cmake cv2-library dlib-face-recognition face-recognition flask numpy pycharm-ide python

Last synced: 08 Feb 2026

https://github.com/werctfourth/python-border-autocrop2

Python script that crops borders from images v2

border crop image-processing libvips numba numpy python python3

Last synced: 17 May 2026

https://github.com/vigneshvaranasi/breast_cancer_detection

This project employs machine learning, focusing on Logistic Regression, to detect breast cancer using tumor-related features. The dataset is preprocessed, and the model achieves 100% accuracy on the test set. The goal is to gain insights into breast cancer factors and provide an effective detection solution.

jupyter machine-learning matplotlib numpy pandas python scikit-learn

Last synced: 09 Apr 2026

https://github.com/darkdk123/handwashing-discovery-analysis

A Guided Project in a Boot camp to Analyse the Original Data used in the Discovery of Viruses & Hand Washing By Dr. Ignaz Semmelweis in Vienna General Hospital in the 1840s.

data-analysis data-science data-visualization matplotlib-pyplot numpy pandas plotly-python python seaborn-plots

Last synced: 09 Apr 2026

https://github.com/sakshithbillava/expense-manager

A web-based expense tracking app built with Python and Streamlit, featuring real-time updates, data visualization, user authentication, and MongoDB integration.

authentication data-visualization expense-manager matplotlib mongodb numpy pandas personal-finance python streamlit webapp

Last synced: 09 Apr 2026

https://github.com/muavia1/roman-urdu-poetry-generation-using-lstm

Here’s a short description you can add to your GitHub project: Roman Urdu Poetry Generator A deep learning project using LSTM and TensorFlow to generate Roman Urdu poetry. Trained on a poetry dataset and deployed with a Gradio interface for interactive text generation.

deep-learning gradio gradio-interface lstm model-training numpy pandas poetry-generator python tensorflow

Last synced: 09 Apr 2026

https://github.com/sridharyadav07/machine-learning-project-bankruptcy-prevention-

The project explores multiple machine learning algorithms and evaluates their performance using various metrics, such as accuracy and confusion matrices. The models tested include Logistic Regression, K-Nearest Neighbors (KNN), Naive Bayes, and Support Vector Machine (SVM). In addition, regularization techniques (L1, L2) are used to avoid overfit.

data-preprocessing evaluation machine-learning-models matplotlib-pyplot modelbuilding modeldeployment numpy pandas python scikit-learn seaborn

Last synced: 09 Apr 2026

https://github.com/dmarks84/ind_project_european-soccer-top-points-contributors--kaggle

Independent Project - Kaggle Dataset-- I worked on the European Soccer Dataset, using SQL (SQLite) to read in the data and then data wrangling before running statistical analysis and hypothesis testing on questions of who helps earn the most points for their team.

data-wrangling hypothesis-testing numpy p-values pandas python scipy-stats statistics t-test

Last synced: 09 Apr 2026

https://github.com/mr-chang95/twitter_datawrangling

Twitter Data Wrangling for Udacity's Data Analyst Nanodegree Program

data-visualization data-wrangling dogs matplotlib numpy pandas python twitter

Last synced: 09 Apr 2026

https://github.com/blinksta1ker/co2-prediction

EDA and Supervised ML model using Regression to predict Co2 Emissions from Vehicles

jupyter-notebook machine-learning numpy pandas python regression seaborn sklearn supervised-learning

Last synced: 09 Apr 2026

https://github.com/hansalemaos/npsearchsequence

Sequence / string search with NumPy

fast numpy search sequence string

Last synced: 19 May 2026

https://github.com/ashar97/advanced-machine-learning

Advanced Machine Learning MOOC Specialisation Assignments

keras numpy python3 tensorflow

Last synced: 09 Apr 2026

https://github.com/0xunkn0wn4m1r/data_engineering_banking_project

🏦 Build a complete data engineering workflow for a banking system, showcasing ETL processes, data transformations, and an interactive financial dashboard.

automation data-analysis data-cleaning data-science feature-engineering fintech-bank flask-api loan-default-prediction machine-learning mlops model-explainability numpy postgresql scikit-learn segmentation shap sql unsupervised-learning

Last synced: 09 Apr 2026

https://github.com/jjwizardmp/gallery-art-problem

Resolve the Gallery Art Problem implement BFS and Delaunay Triangulation

jupyter jupyter-notebook jupyterlab mathplotlib numpy pandas python python-env python3 scipy shapely

Last synced: 09 Apr 2026