An open API service indexing awesome lists of open source software.

NumPy

NumPy is an open source library for the Python programming language, adding support for large, multidimensional arrays, and matrices, along with a large collection of high-level mathematical functions to operate on these arrays.

https://github.com/oguzhansarigol/Expected-Goals-xG-Data-Analysis

We will analyze in our code the expected goal locations of football players based on which areas of the field, at what times, with which feet, using which parts of their bodies, and from which angles and distances they are most likely to score.

data-analysis-python expected-goals numpy pandas-python python

Last synced: 20 Aug 2025

https://github.com/lorenzorottigni/ml-houses

Machine Learning python bootcamp: linear regression on houses model

ipynb linear-regression machine-learning numpy pandas python scikit-learn seaborn

Last synced: 10 Apr 2026

https://github.com/nickklos10/compressive-strenght-prediction

This project predicts concrete compressive strength using a neural network regression model built with Keras.

jupyter-notebook keras matplotlib numpy pandas python scikit-learn

Last synced: 10 Apr 2026

https://github.com/hariprasath-v/hackerearth_transunion-data-science-analytics-hiring-challenge_2022

Machine learning model to classify the credit score based on people bank details and credit related information.

catboost exploratory-data-analysis klib machine-learning matplotlib numpy optuna pandas python seaborn sklearn

Last synced: 10 Apr 2026

https://github.com/mayankmittal29/pagesage-ai_urls_content_scraper_webapp

PageSage is a modular web scraping and semantic retrieval system using BeautifulSoup for extraction, SentenceTransformers for dense embeddings, and ChromaDB for scalable vector search—enabling efficient data indexing for advanced RAG pipelines.

beautifulsoup chromadb fastapi gpt-turbo numpy openai-api python3 question-answering rag react requests sentence-transformers tailwindcss web-scraping

Last synced: 10 Apr 2026

https://github.com/kaladabrio2020/calculusnumerics

Trabalho da cadeira de Métodos Numérico I

metodos-numericos numpy python tkinter-gui

Last synced: 18 May 2026

https://github.com/mpbeatriz/santander_coders_2024

Repositório dos exercícios e projetos do curso de Engenharia de Dados da Ada em parceria com o Santander (em andamento).

apache-kafka apache-spark api data-engineering data-extraction fastapi flask numpy oop pandas python webhooks

Last synced: 10 Apr 2026

https://github.com/beyzabasarir/northwind-traders-analysis

Northwind dataset analysis using PostgreSQL, Python, and Power BI. Focused on sales, customers, shipping, and performance insights.

dashboard data-analysis data-visualization jupyter-notebook matplotlib numpy pandas postgresql powerbi python seaborn

Last synced: 10 Apr 2026

https://github.com/trivediayush/Accident-and-Drowsiness-Detection

Our college minor project is a comprehensive solution aimed at enhancing road safety. The project comprises two main components: an Accident Detection Model and a Real-Time Driver Drowsiness System.

accident-detection accidents-analysis college-project cvzone dlib-face-detection dlib-face-recognition face-detection face-recognition imutils minor-project numpy opencv pandas python python3 yolov8

Last synced: 22 Aug 2025

https://github.com/oyebamiji-micheal/data-analysis-with-python-zero-to-pandas

This repository contains all assignments and project completed when I took a course, "Data Analysis with Python: Zero to Pandas", on Jovian

data-analysis numpy pandas python

Last synced: 10 Apr 2026

https://github.com/nishi1612/knight-tour-problem

IT485 Logic of Inference Project on Knight's Tour. A hamiltonian path problem to determine path of knight to travel entire chessboard with every cell visited only once.

backtracking-algorithm bootstrap flask html knights-tour localhost numpy pygame python tkinter warnsdorff

Last synced: 11 Apr 2026

https://github.com/amanyadav-07/customer-churn-prediction

Machine Learning project to predict customer churn using Logistic Regression, Random Forest, and XGBoost. Includes data preprocessing, feature engineering, SMOTE balancing, model training, evaluation, and business insights.

accuracy-metrics data-analysis data-visualization logistic-regression machine-learning matplotlib numpy pandas python3 random-forest-classifier seaborn sklearn xgboost-classifier

Last synced: 11 Apr 2026

https://github.com/arthurdsant/dataanalysis-agricultural_raw_material

This Python project performs analysis and visualization of agricultural raw material price data using a Kaggle dataset. Based on Jupiter Notebook and Python.

jupyter-notebook matplotlib numpy pandas python seaborn

Last synced: 26 Jan 2026

https://github.com/aksoni07/movie-recommendation

A hybrid movie recommendation system designed to deliver personalized and accurate suggestions by combining user preferences, item attributes, and collaborative patterns, ensuring a seamless and engaging experience.

clustering content-based-filtering data-analysis embeddings jupyter-notebook numpy ollaborative-filtering pandas personalization python recommendation-systems scikit-learn user-item-interactions

Last synced: 11 Apr 2026

https://github.com/rahatmoktadir03/customer-churn-prediction

A machine learning project for predicting customer churn, enabling businesses to identify at-risk customers and develop retention strategies.

business-analytics customer-churn-prediction data-science google-colab machine-learning numpy pandas python sklearn strreamlit xception-model

Last synced: 11 Apr 2026

https://github.com/nachtfeuer/covid19

Python script(s) for visualizing corona data

csv json matplotlib numpy pandas python requests tkinter

Last synced: 05 May 2026

https://github.com/eduardoprofe666/mn-api

🐍📦 Paquete de Python con implementaciones de métodos numéricos

mn-api numerical-methods numpy pandas python scipy simpy tabulate

Last synced: 04 Jan 2026

https://github.com/amanda-ucc/market-tracker

Automated portfolio creator that generates a portoflio which mimics a bench mark index by minimizing the tracking error.

beta covariance numpy pandas scipy sharpe-ratio tracking-error variance yfinance

Last synced: 11 Apr 2026

https://github.com/egorumaev/2023-cirrhosis-outcomes

Прогнозирование исхода лечения пациентов с циррозом печени

catboost imblearn iqr lda matplotlib numpy pandas pca phik pipeline sklearn t-sne xgboost

Last synced: 08 May 2026

https://github.com/allanreda/telco-customer-churn-predictor-app

A web-based machine learning application that predicts customer churn using a logistic regression model. Built with Scikit-Learn for model training, Gradio for the user interface, and deployed on Google Cloud App Engine. The app allows users to input customer data and receive predictions on churn risk to support business decision-making.

app-engine data-visualization deployment google-cloud gradio hyperparameter-tuning logistic-regression machine-learning numpy pandas scikit-learn

Last synced: 16 Apr 2026

https://github.com/dastogirrudro/machine-learning-and-deep-learning

This is my thesis project which i have done in varsity.Here i used machine learning and deep learning i used LSTM as deep learning.This can identify aggresive spam message. Here i used pandas scikit-learn and many more framework i used python as a programming language.I used many algorithm for highering the accuracy of my project.

deep-learning lstm machine-learning numpy pandas python scikit-learn

Last synced: 11 Apr 2026

https://github.com/bhawnamehbubani/advanced-zomato-restaurant-clustering-and-sentiment-analysis

This project leverages this data to perform clustering and sentiment analysis, providing actionable insights for customers and business growth.

matplotlib-pyplot nltk numpy pandas seaborn sklearn wordcloud

Last synced: 05 May 2026

https://github.com/shwetapardhi/assignment-03-q1--hypothesis-testing

Q1.A F&B manager wants to determine whether there is any significant difference in the diameter of the cutlet between two units. A randomly selected sample of cutlets was collected from both units and measured? Analyze the data and draw inferences at 5% significance level. Please state the assumptions and tests that you carried out to check validit

hypothesis-testing numpy p-value pandas python scipy significance-testing stats t-test

Last synced: 11 Apr 2026

https://github.com/shreyasmehta05/sortsync

A custom sorting algorithm combining parallel merge and count sort, with detailed performance comparisons against standard sorting techniques.

c matplotlib numpy python3

Last synced: 04 Feb 2026

https://github.com/alexsolov28/ml_course

Курс "Технология машинного обучения"

colab-notebooks jupyter-notebook matplotlib numpy pandas python scikit-learn seaborn

Last synced: 05 Apr 2025

https://github.com/matiassenia/modelo-prediccion-temperatura

Proyecto de machine learning que utiliza TensorFlow para predecir temperaturas en grados Fahrenheit a partir de temperaturas en grados Celsius. Incluye un script de Python para entrenar el modelo y realizar predicciones, así como datos de entrenamiento y visualizaciones

adam-optimizer machine-learning numpy python python3 tensorflow

Last synced: 11 Apr 2026

https://github.com/djdurga/google_play_store_apps_analysis

This data analysis project focuses on exploring and understanding the Google Play Store Apps dataset.

numpy pandas python

Last synced: 11 Apr 2026

https://github.com/kkufieta/carnd-vehicle-detection

Vehicle Detection & Tracking - Self Driving Cars

cv2 hog-features numpy self-driving-car sklearn vehicle-detection

Last synced: 11 Apr 2026

https://github.com/apfirebolt/numpy-and-pandas-examples

Some examples and sample datasets to learn numpy, pandas and other data science libraries in Python

data-analysis jupyter-notebook numpy pandas python

Last synced: 17 Apr 2026

https://github.com/yoavram/numpycharm

Some examples on using PyCharm with Scientific Python

cython numpy pycharm python scientific-computing training-materials

Last synced: 11 Apr 2026

https://github.com/anubagre/predicting-price-of-pre-owned-cars

This project is solution to data science problem based on regression model to predict the price of pre owned cars. Programming language used is Python.

data-science machine-learning numpy pandas price-prediction python seaborn sklearn streamlit

Last synced: 11 Apr 2026

https://github.com/harmanveer-2546/wafer-fault-detection

The goal is to eliminate manual work in identifying faulty wafers. Opening and handling suspected wafers disrupts the entire process. False negatives result in wasted time, manpower, and costs.

clustering data-transformation feature-selection machine-learning matplotlib numpy pandas python random-forest roc-auc-curve roc-auc-score seaborn sklearn svc xgboost

Last synced: 11 Apr 2026

https://github.com/lohiyah/real-estate-price-forecast

A Python-based app predicting real estate prices using machine learning. Built with Pandas, NumPy, Scikit-learn, Matplotlib, and Seaborn for data processing and visualization, and Flask for the web interface.

flask matplotlib numpy pandas python3 scikit-learn seaborn

Last synced: 11 Apr 2026

https://github.com/biasandrade/rede-neural-do-zero-classificacao-binaria

🧠 Rede Neural Artificial construída do ZERO (só NumPy) para detectar fraudes em transações | Forward/Backward Propagation | Sigmoid | Gradiente Descendente

backpropagation binary-classification deep-learning fraud-detection from-scratch gradient-descent machine-learning neural-network numpy python

Last synced: 18 Nov 2025

https://github.com/chokzb/covid19_vaccination_analysis

An EDA project examining global COVID-19 vaccination progress. The notebook investigates vaccination trends by country, daily vaccination rates, timeline patterns, and dose distribution. The project includes visualisations created with Matplotlib, Seaborn, and Plotly.

covid-19 data-analysis data-visualization jupyter-notebook matplotlib numpy pandas plotly python seaborn vaccination

Last synced: 07 May 2026

https://github.com/stdlib-js/blas-ext-linspace

Return a new ndarray filled with linearly spaced values over a specified interval along one or more ndarray dimensions.

arange arrange javascript linear linspace math mathematics matlab ndarray node node-js nodejs numpy seq sequence statistics stats stdlib

Last synced: 04 May 2026

https://github.com/bhuvan-s-prasad/streamlit-regression

A machine learning project that predicts housing prices in California using regression techniques. This project includes comprehensive exploratory data analysis, feature engineering, linear regression modeling, and an interactive Streamlit web application for making predictions.

california-housing-price-prediction exploratory-data-analysis linear-regression machine-learning matplotlib numpy pandas python scikit-learn seaborn streamlit supervised-learning

Last synced: 11 Apr 2026

https://github.com/faizantkhan/numpy

NumPy Practice Repository Dive into the world of numerical computing with our collection of NumPy exercises. Perfect for beginners and seasoned programmers alike, this repository offers a range of challenges to sharpen your array manipulation and data analysis skills.

data-science library machine-learning numpy numpy-arrays numpy-library numpy-python outlier outlier-detection python pythonlibrarires tutorial

Last synced: 21 May 2026

https://github.com/chirindaopensource/non_linear_forecasting_backcasting

Python implementation of Gourieroux-Jasiak's (2025) mixed causal-noncausal VAR models. Features probabilistic forecasting, nonlinear innovation filtering, and state-dependent IRF analysis for financial time series with explosive dynamics. Enables robust risk assessment and structural analysis of speculative behavior.

bootstrap bubble-analysis financial-modeling forecasting jupyter-notebook monte-carlo nonlinear-modeling numpy pandas python quantitative-finance research-implementation risk-management scipy statistical-modeling time-series uncertainty-quantification var-models

Last synced: 11 Apr 2026

https://github.com/xiaohaoo/machine-learning-examples

机器学习使用不同框架的常见模型的例子

keras matplotlib numpy python pytorch sklearn tensorflow

Last synced: 11 Apr 2026

https://github.com/mashalk1/computer-vision-attendance-system

A computer vision based attendance system, using cv2, firebase, numpy, cvzone and face_recognition library in python

computer-vision cv2 cvzone face-recognition numpy

Last synced: 04 Feb 2026

https://github.com/4211421036/githubiotpy

GitHubIoT is a comprehensive toolkit designed to simplify the visualization of IoT (Internet of Things) data with seamless GitHub integration. The application provides an intuitive graphical interface for real-time data monitoring, analysis, and configuration

cli esp32 esp8266 github-actions github-iot matplotlib numpy pypi-packages python tkinter

Last synced: 16 Apr 2025

https://github.com/alexquar/whatisthatdog

Mobile App built using react-native that will send api requests for classifying dog breeds to a flask api server containing a custom built CNN made with pytorch trained on the Stanford dog dataset. To take a look at some examples checkout some images from my portfolio linked here.

aws ec2 expo flask gunicorn matplotlib-pyplot numpy pandas pytorch react-native seaborn typescript

Last synced: 11 Apr 2026

https://github.com/parbhat-cpp/suicidal-ml

A machine learning/NLP-based system to identify signs of suicidal ideation from user text inputs.

bash cicd classification docker fastapi githubactions jinja2 jupyter-notebook machine-learning natural-language-processing nlp numpy pandas python scikit-learn

Last synced: 11 Apr 2026

https://github.com/chaudharypraveen98/lungcancerdetection

To distribute the work of doctors and process the large amount of data to produce accurate results on the go

numpy pandas pillow python scipy tenserflow

Last synced: 16 Apr 2026

https://github.com/dhirajmerit/diabetes-prediction-models

This project implements multiple machine learning algorithms to predict diabetes diagnoses based on medical diagnostic data. The project compares the performance of various models, providing insights into which algorithms are most effective for this task.

ai machine-learning numpy pandas tensorflow

Last synced: 11 Apr 2026

https://github.com/gkannan-codes/habitableexos

With Earth’s habitability under strain, we ask: which known exoplanets could humans live on? Using NASA’s Exoplanet Archive, we score planets 0–1 (1 ≈ Earth) from five Earth-normalized features to rank top candidates.

data html kaggle matplotlib-pyplot numpy pandas plotly python seaborn visualization

Last synced: 11 Apr 2026

https://github.com/vyjayanthipolapragada/marketing_statistical_analysis

Statistical analysis of customer data and their impact on the sales of products based on marketing campaigns

customer-data data-analysis dataframes marketing matplotlib numpy pandas python seaborn statistical-analysis

Last synced: 11 Apr 2026

https://github.com/abhi227070/wine-quality-prediction

The Wine Quality Prediction project utilizes machine learning to assess wine quality based on various parameters. It offers a user-friendly interface built with Streamlit for easy interaction. Deployed on AWS, it provides scalable access to quality predictions for wine batches.

classification-algorithm machine-learning machine-learning-algorithms machinelearning matplotlib numpy pandas python3 seaborn

Last synced: 11 Apr 2026

https://github.com/anibalalpizar/python-machine-learning-example

This code reads and preprocesses a dataset for classification using pandas, numpy, matplotlib and scikit-learn. The dataset is split into three parts for training, validation and testing. The data is then scaled and optionally oversampled for balanced classes.

machine-learning matplotlib numpy pandas python scikit-learn

Last synced: 11 Apr 2026

https://github.com/hitthecodelabs/weatherforecast-lstm

TensorFlow and Python for analyzing and forecasting weather data

cnn keras lstm matplotlib numpy pandas python tensorflow weather weather-forecast

Last synced: 11 Apr 2026

https://github.com/f-ssemwanga/advanced-programming-york

This repository holds the extensive python programming completed during the advanced programming unit of my master's programme

comprehension csv-parser json lambda-functions list matplotlib mongodb numpy oops-in-python pandas-dataframe postgresql python3 tkinter-gui

Last synced: 11 Apr 2026

https://github.com/naveen88112/final_education

Student Performance Prediction This project examines the student performance data, pre-processes the features, and implements machine learning methods (Random Forest) for the forecasting of final grades. The data is subjected to exploratory data analysis (EDA) and feature engineering prior to model training and assessment.

data-preprocessing exploratory-data-analysis machine-learning model-evaluation numpy pandas python

Last synced: 11 Apr 2026

https://github.com/sohan181204/airline-ticket-price-analysis

The analysis of airline ticket prices explores the dynamic and competitive nature of the aviation industry in India. With rapid economic growth and increasing air travel demand, understanding ticket price trends is crucial for airlines, passengers, and policymakers.

numpy pandas python selenium

Last synced: 11 Apr 2026

https://github.com/dmarks84/coursework_project_text-mining-spam-analysis

Project for University of Michigan Applied Data Science Specialization -- Performed NLP in order to build features of email messages; trained various classification models to help predict if a message was spam.

classification databases eda nlp numpy pandas python scikit seaborn sentiment-analysis statistics supervised-ml text-mining unsupervised-ml visualization

Last synced: 11 Apr 2026

https://github.com/jbalooshie/movies-etl

Exercise working with movie datasets from Kaggle and Wikipedia. Python is used to extract, clean, and combine the data, and then it is loaded into a postgreSQL database.

data-analysis data-science jupyter-notebook numpy pandas postgresql postgresql-database python sqlalchemy

Last synced: 11 Apr 2026

https://github.com/jminjares4/selected-areas-in-networks

Assignments for Selected Areas in Networks (EE 5390)

matplotlib network-analysis numpy pandas python

Last synced: 08 May 2026

https://github.com/adamouization/python-machine-learning-data-science-notes

:orange_book: Jupyter notebooks containing useful Python code and notes for general Machine Learning and Data Science projects.

data data-science data-visualization guide jupyter jupyter-notebook machine-learning matplotlib notes numpy pandas pandas-dataframe python seaborn

Last synced: 11 Apr 2026

https://github.com/ansh-info/industrial-scale-penicillin-simulation

Optimizing industrial-scale penicillin production using machine learning and data analysis.

jupyter-notebook machine-learning matplotlib numpy pandas python scikit-learn

Last synced: 11 Apr 2026

https://github.com/chernyakid/bank-clients-churn-research

Анализ оттока клиентов в Метанпромбанке

jupyter numpy pandas python scipy seaborn

Last synced: 11 Apr 2026

https://github.com/shubhamahobia/lstm-hamlet

The LSTM Next Word Prediction project trains a model on Hamlet to predict the next word in a sequence. After preprocessing the text, an LSTM network learns patterns in the language. The trained model can then generate text by predicting the next word in a given sequence, with its accuracy evaluated to ensure it effectively models the language.

deep-learning github hamlet lstm lstm-neural-network lstm-neural-networks machine-learning neural-network nltk numpy pandas pickle python

Last synced: 11 Apr 2026

https://github.com/pranavsuriya-sr/e-hub_energysystems

Python based Web Application for analysis and visualization of concepts in Energy Systems.

css energy-systems-analysis html javascript matplotlib numpy python schemdraw streamlit

Last synced: 11 Apr 2026

https://github.com/samuele-lolli/data-analytics-techniques

A practical approach to data analytics pipeline.

numpy pandas pytorch scikit-learn

Last synced: 11 Apr 2026

https://github.com/ashishsingh789/titanic_dataset_eda_and_visualization

This repository contains an exploratory data analysis (EDA) of the Titanic dataset. Key analyses include survival rates by gender, passenger class, age distribution, family size, and correlation heatmaps.

data data-science dataanalysis matplotlib numpy pandas pandas-dataframe python seborn visualisation

Last synced: 11 Apr 2026

https://github.com/guptamudit/whatsapp-chat-analyser

Created an online Whats App chat analysis tool which is designed to extract valuable insights from your group or one-on-one conversations. By analyzing your chat data, WhatsChat provides detailed statistics and visualizations.

matplotlib numpy pandas python seaborn

Last synced: 11 Apr 2026

https://github.com/thaisgarcia/convert-censo-file

Este repositório contém scripts Python para processar dados destinados ao censo do MEC, incluindo a leitura de um arquivo TXT, a geração de um arquivo Excel e a conversão desse arquivo Excel de volta para um formato de texto.

numpy pandas python

Last synced: 11 Apr 2026

https://github.com/pd2871/streamlit_automl

Auto Exploratory Data Analysis using Streamlit App

matplotlib numpy pandas python streamlit streamlit-webapp

Last synced: 11 Apr 2026

https://github.com/sakhileln/multimodal-chatbot-with-memory

A lightweight, CLI-based conversational agent that processes text and image inputs, remembers past interactions.

chatbot machine-learning nlp numpy opencv spacy tensorflow

Last synced: 11 Apr 2026

https://github.com/costopoulos/ntua-telecommunications

:radio: :telephone_receiver: NTUA ECE Telecommunications Course Code & Report

numpy nyquist python sampling scipy telecommunications

Last synced: 20 Apr 2026

https://github.com/nagipragalathan/zk-connect

In the conventional hiring process, candidate applications are met with limited assessment, leaving the crucial task of evaluating suitability to recruiters. ZKConnect disrupts this norm by introducing a transformative approach.

css django html nextjs numpy opencv pandas python

Last synced: 11 Apr 2026

https://github.com/imdeepmind/knn_from_scratch

KNN classifier in pure Numpy and Python

knn knn-classification machine-learning numpy python

Last synced: 11 Apr 2026

https://github.com/dharma-acha/resnet18_imageclassification_cnn

In this part of the project, we implement ResNet-18 from scratch using PyTorch and train it on an image dataset to achieve over 75% accuracy. We apply techniques to prevent overfitting and optimize performance, aiming for an accuracy of 80% or higher.

matplotlib numpy python3 pytorch scikit-learn seaborn

Last synced: 11 Apr 2026

https://github.com/shwetajanwekar/hr-data-analytics

Analyze the HR data through exploratory data analysis, we identified key factors influencing employee attrition, satisfaction levels, and performance. Machine learning models enabled us to predict employee churn and classify potential candidates for promotion.

matplotlib numpy pandas seaborn

Last synced: 19 Apr 2026

https://github.com/prashhhant213/cardioflex-treadmill-analysis-using-descriptive-statistics-probability

Description Analysis and Visualization on CardioFlex Treadmill data to provide insights and recommendations to improve their userbase.

colab-notebook numpy pandas probability python stats

Last synced: 11 Apr 2026

https://github.com/kzon94/torn-market-analyzer

Streamlit app that parses Torn Add Listing text, matches items with a custom dictionary, fetches market data via the public API, and generates KPIs and price recommendations using a modular Python analytics pipeline.

data-analysis data-engineering fuzzy-matching market-analytics numpy pandas python streamlit torn-city torn-city-api

Last synced: 11 Apr 2026

https://github.com/trim0500/fe-stats-classifier

An experiment to create a machine learning model via PyTorch to classify select Fire Emblem unit base stat distributions.

creational-patterns data-analysis data-science data-visualization design-patterns excel jupyter jupyter-notebook matplotlib-pyplot numpy pandas python python-modules python3 pytorch singleton

Last synced: 11 Apr 2026