An open API service indexing awesome lists of open source software.

NumPy

NumPy is an open source library for the Python programming language that adds support for large, multidimensional arrays and matrices, along with a large collection of high-level mathematical functions to operate on these arrays.

https://github.com/gkannan-codes/habitableexos

With Earth’s habitability under strain, we ask: which known exoplanets could humans live on? Using NASA’s Exoplanet Archive, we score planets 0–1 (1 ≈ Earth) from five Earth-normalized features to rank top candidates.

data html kaggle matplotlib-pyplot numpy pandas plotly python seaborn visualization

Last synced: 11 Apr 2026

https://github.com/ashishsingh789/traffic_accident_analysis

Analyzed traffic accident data to identify patterns related to road conditions, weather, and time of day. Visualized accident hotspots using heatmaps and bar plots.

excel matplotlib numpy pandas pandas-dataframe python seborn

Last synced: 11 Apr 2026

https://github.com/vyjayanthipolapragada/marketing_statistical_analysis

Statistical analysis of customer data and their impact on the sales of products based on marketing campaigns

customer-data data-analysis dataframes marketing matplotlib numpy pandas python seaborn statistical-analysis

Last synced: 11 Apr 2026

https://github.com/abhi227070/wine-quality-prediction

The Wine Quality Prediction project utilizes machine learning to assess wine quality based on various parameters. It offers a user-friendly interface built with Streamlit for easy interaction. Deployed on AWS, it provides scalable access to quality predictions for wine batches.

classification-algorithm machine-learning machine-learning-algorithms machinelearning matplotlib numpy pandas python3 seaborn

Last synced: 11 Apr 2026

https://github.com/tolumie/exploratory-data-analytics-projects

Exploratory Data Analytics – A collection of projects covering data exploration, feature engineering, hypothesis testing, and predictive modeling across diverse datasets, including insurance, real estate, laptops, cars, COVID-19, and the Olympics.

data-analysis data-visualization data-wrangling exploratory-data-analysis-eda feature-engineering hypothesis-testing machine-learning matplotlib numpy pandas predictive-modeling python seaborn statistical-analysis

Last synced: 11 Apr 2026

https://github.com/anibalalpizar/python-machine-learning-example

This code reads and preprocesses a dataset for classification using pandas, numpy, matplotlib and scikit-learn. The dataset is split into three parts for training, validation and testing. The data is then scaled and optionally oversampled for balanced classes.

machine-learning matplotlib numpy pandas python scikit-learn

Last synced: 11 Apr 2026
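
The three-way split and scaling described for python-machine-learning-example can be sketched roughly as follows. This is a minimal, dependency-free illustration, not the repository's code: the helper names (three_way_split, fit_scaler, scale), the 60/20/20 proportions, and the toy data are all assumptions, and the optional oversampling step is left out. The one design point it tries to show is that the scaling statistics come from the training part only and are then reused for the validation and test parts.

```python
import random
import statistics


def three_way_split(rows, train=0.6, valid=0.2, seed=0):
    """Shuffle rows and cut them into train, validation, and test lists."""
    shuffled = rows[:]
    random.Random(seed).shuffle(shuffled)
    n_train = int(len(shuffled) * train)
    n_valid = int(len(shuffled) * valid)
    return (shuffled[:n_train],
            shuffled[n_train:n_train + n_valid],
            shuffled[n_train + n_valid:])


def fit_scaler(values):
    """Return (mean, std) estimated from the training values only."""
    return statistics.fmean(values), statistics.pstdev(values)


def scale(values, mean, std):
    """Standardize values with statistics learned on the training set."""
    return [(v - mean) / std for v in values]


# Hypothetical one-feature dataset, used only to exercise the helpers.
data = [float(i) for i in range(100)]
train_set, valid_set, test_set = three_way_split(data)

mean, std = fit_scaler(train_set)
train_scaled = scale(train_set, mean, std)
valid_scaled = scale(valid_set, mean, std)  # reuses the training statistics
test_scaled = scale(test_set, mean, std)    # reuses the training statistics
```

In a project built on the libraries listed in the tags, scikit-learn's train_test_split and StandardScaler would typically cover the same two steps.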

https://github.com/harmanveer-2546/nasa-asteroid-classification

Classifying whether an asteroid is hazardous or not.

eda matplotlib numpy pandas python seaborn visualization xgboost

Last synced: 11 Apr 2026

https://github.com/hitthecodelabs/weatherforecast-lstm

TensorFlow and Python for analyzing and forecasting weather data

cnn keras lstm matplotlib numpy pandas python tensorflow weather weather-forecast

Last synced: 11 Apr 2026

https://github.com/vishal-verma-96/pre-owned-car-price-prediction-using-streamlit-app

Capstone project by Skill Academy: exploratory analysis, visualization, and prediction of used car prices. The highest-scoring model is deployed as a Streamlit web app.

data-analysis data-science jupyter-notebook machine-learning machine-learning-algorithms matplotlib numpy pandas python3 regression-algorithms scikit-learn seaborn streamlit

Last synced: 11 Apr 2026

https://github.com/rahul-shrivastav/digit-recognition-nn

Web application that recognizes handwritten digits using a deep learning model

deep-learning flask numpy python reactjs tensorflow

Last synced: 11 Apr 2026

https://github.com/aciluca/credit-approval-system

A machine learning application to predict credit card approval using Python, TensorFlow, and Docker.

data-science keras machine-learning numpy pandas python sklearn tensorflow

Last synced: 11 Apr 2026

https://github.com/f-ssemwanga/advanced-programming-york

This repository holds the extensive Python programming work completed during the advanced programming unit of my master's programme.

comprehension csv-parser json lambda-functions list matplotlib mongodb numpy oops-in-python pandas-dataframe postgresql python3 tkinter-gui

Last synced: 11 Apr 2026

https://github.com/naveen88112/final_education

Student Performance Prediction: This project examines student performance data, preprocesses the features, and applies a machine learning method (Random Forest) to forecast final grades. The data undergoes exploratory data analysis (EDA) and feature engineering before model training and assessment.

data-preprocessing exploratory-data-analysis machine-learning model-evaluation numpy pandas python

Last synced: 11 Apr 2026

https://github.com/sohan181204/airline-ticket-price-analysis

The analysis of airline ticket prices explores the dynamic and competitive nature of the aviation industry in India. With rapid economic growth and increasing air travel demand, understanding ticket price trends is crucial for airlines, passengers, and policymakers.

numpy pandas python selenium

Last synced: 11 Apr 2026

https://github.com/yanglr/mlnotes-python

Machine learning Notes for Python 3 (NumPy, TensorFlow etc.), with interactive Jupyter notebook.

jupyter jupyter-notebook numpy numpy-notes

Last synced: 21 May 2026

https://github.com/dmarks84/coursework_project_text-mining-spam-analysis

Project for University of Michigan Applied Data Science Specialization -- Performed NLP in order to build features of email messages; trained various classification models to help predict if a message was spam.

classification databases eda nlp numpy pandas python scikit seaborn sentiment-analysis statistics supervised-ml text-mining unsupervised-ml visualization

Last synced: 11 Apr 2026

https://github.com/ayushtiwari134/machine_learning_models

A repo where I upload all the models I train during my journey of learning machine learning from scratch.

linear-regression logistic-regression machinelearning matplotlib numpy pandas python random-forest scikit-learn

Last synced: 11 Apr 2026

https://github.com/jbalooshie/movies-etl

Exercise working with movie datasets from Kaggle and Wikipedia. Python is used to extract, clean, and combine the data, which is then loaded into a PostgreSQL database.

data-analysis data-science jupyter-notebook numpy pandas postgresql postgresql-database python sqlalchemy

Last synced: 11 Apr 2026

https://github.com/arnauog/ironhack_data_analytics

Weekly labs to practice everything taught in class during the Ironhack Data Analytics bootcamp in Barcelona (Sept 2024 - March 2025).

machine-learning mongodb mysql numpy pandas python seaborn tableau webscraping

Last synced: 11 Apr 2026

https://github.com/jminjares4/selected-areas-in-networks

Assignments for Selected Areas in Networks (EE 5390)

matplotlib network-analysis numpy pandas python

Last synced: 08 May 2026

https://github.com/supreme-gg-gg/beam-analysis

Fast, customisable, intuitive web application to simplify beam analysis. Certified by Bob the Builder.

civil-engineering matplotlib numpy python streamlit structural-analysis

Last synced: 11 Apr 2026

https://github.com/ansh-info/industrial-scale-penicillin-simulation

Optimizing industrial-scale penicillin production using machine learning and data analysis.

jupyter-notebook machine-learning matplotlib numpy pandas python scikit-learn

Last synced: 11 Apr 2026

https://github.com/anshpg/vectorwave

The Jupyter Notebook (ipynb file) contained within this repository serves as the primary codebase for this project. It demonstrates how to preprocess the input data, train the logistic regression classifier, evaluate its performance, and save the trained model for future use.

collab jupyter-notebook matplotlib-pyplot numpy pandas python sklearn

Last synced: 11 Apr 2026

https://github.com/chernyakid/bank-clients-churn-research

Customer churn analysis at Metanprombank

jupyter numpy pandas python scipy seaborn

Last synced: 11 Apr 2026

https://github.com/rakibhhridoy/differentprojects

Some of the learning projects I practiced on to get started in data science. Not all of them, only a few that were stored in my local repository. They may be useful for beginner data science enthusiasts. Explore and learn!

data-science deep-learning machine-learning mathematics matplotlib numpy pandas python scikit-learn seaborn statistics

Last synced: 11 Apr 2026

https://github.com/amnydv17/landmark-detection

This project aims to leverage the power of deep learning models to automatically detect and pinpoint landmarks such as famous monuments, buildings, natural landmarks, and other recognizable structures within images.

machine-learning matplotlib numpy pandas python3 scikit-learn seaborn tensorflow

Last synced: 11 Apr 2026

https://github.com/shubhamahobia/lstm-hamlet

The LSTM Next Word Prediction project trains a model on Hamlet to predict the next word in a sequence. After preprocessing the text, an LSTM network learns patterns in the language. The trained model can then generate text by predicting the next word in a given sequence, with its accuracy evaluated to ensure it effectively models the language.

deep-learning github hamlet lstm lstm-neural-network lstm-neural-networks machine-learning neural-network nltk numpy pandas pickle python

Last synced: 11 Apr 2026

https://github.com/pranavsuriya-sr/e-hub_energysystems

Python based Web Application for analysis and visualization of concepts in Energy Systems.

css energy-systems-analysis html javascript matplotlib numpy python schemdraw streamlit

Last synced: 11 Apr 2026

https://github.com/samuele-lolli/data-analytics-techniques

A practical approach to data analytics pipeline.

numpy pandas pytorch scikit-learn

Last synced: 11 Apr 2026

https://github.com/ashishsingh789/titanic_dataset_eda_and_visualization

This repository contains an exploratory data analysis (EDA) of the Titanic dataset. Key analyses include survival rates by gender, passenger class, age distribution, family size, and correlation heatmaps.

data data-science dataanalysis matplotlib numpy pandas pandas-dataframe python seborn visualisation

Last synced: 11 Apr 2026

https://github.com/guptamudit/whatsapp-chat-analyser

An online WhatsApp chat analysis tool designed to extract valuable insights from your group or one-on-one conversations. By analyzing your chat data, WhatsChat provides detailed statistics and visualizations.

matplotlib numpy pandas python seaborn

Last synced: 11 Apr 2026

https://github.com/ebadshabbir/logistic_regression-binomial-

Logistic Regression on the Social Network Ads Dataset: This project applies logistic regression to predict whether a user will purchase a product based on their age and estimated salary, using the Social Network Ads dataset. The data is split into training and test sets, with feature scaling applied for normalization.

classification jupyter-notebook logistic-regression machine-learning matplotlib-pyplot numpy pandas python sklearn

Last synced: 11 Apr 2026

https://github.com/sakhileln/multimodal-chatbot-with-memory

A lightweight, CLI-based conversational agent that processes text and image inputs and remembers past interactions.

chatbot machine-learning nlp numpy opencv spacy tensorflow

Last synced: 11 Apr 2026

https://github.com/vyjayanthipolapragada/fraud_detection_creditcard

Detecting fraudulent credit card transactions by training a Decision Tree model using Scikit-learn and SnapML

classification-model data-preprocessing decision-tree-classifier kaggle-dataset machine-learning numpy pandas python scikit-learn snapml time tree-model

Last synced: 11 Apr 2026

https://github.com/nagipragalathan/zk-connect

In the conventional hiring process, candidate applications are met with limited assessment, leaving the crucial task of evaluating suitability to recruiters. ZKConnect disrupts this norm by introducing a transformative approach.

css django html nextjs numpy opencv pandas python

Last synced: 11 Apr 2026

https://github.com/dharma-acha/resnet18_imageclassification_cnn

In this part of the project, we implement ResNet-18 from scratch using PyTorch and train it on an image dataset to achieve over 75% accuracy. We apply techniques to prevent overfitting and optimize performance, aiming for an accuracy of 80% or higher.

matplotlib numpy python3 pytorch scikit-learn seaborn

Last synced: 11 Apr 2026

https://github.com/shwetajanwekar/hr-data-analytics

Analyzing HR data through exploratory data analysis, we identified key factors influencing employee attrition, satisfaction levels, and performance. Machine learning models enabled us to predict employee churn and classify potential candidates for promotion.

matplotlib numpy pandas seaborn

Last synced: 19 Apr 2026

https://github.com/prashhhant213/cardioflex-treadmill-analysis-using-descriptive-statistics-probability

Descriptive analysis and visualization of CardioFlex Treadmill data to provide insights and recommendations for growing their user base.

colab-notebook numpy pandas probability python stats

Last synced: 11 Apr 2026

https://github.com/kzon94/torn-market-analyzer

Streamlit app that parses Torn Add Listing text, matches items with a custom dictionary, fetches market data via the public API, and generates KPIs and price recommendations using a modular Python analytics pipeline.

data-analysis data-engineering fuzzy-matching market-analytics numpy pandas python streamlit torn-city torn-city-api

Last synced: 11 Apr 2026

https://github.com/trim0500/fe-stats-classifier

An experiment to create a machine learning model via PyTorch to classify select Fire Emblem unit base stat distributions.

creational-patterns data-analysis data-science data-visualization design-patterns excel jupyter jupyter-notebook matplotlib-pyplot numpy pandas python python-modules python3 pytorch singleton

Last synced: 11 Apr 2026

https://github.com/bagusperdanay7/fcc-da-mean-variance-standard-deviation-calculator

One of the Data Analysis with Python (freeCodeCamp) tasks: a Mean-Variance-Standard Deviation Calculator.

data-analysis freecodecamp-project numpy python

Last synced: 06 May 2026

https://github.com/snghrsw/kikagaku-ml-learning

Simple and multiple regression analysis in Python, plus regression and classification with deep learning

liner-regestion multiple-regression numpy pandas python scikit-learn

Last synced: 11 Apr 2026

https://github.com/elmezianech/email-spam-ham-classifier-nb-hard

Naive Bayes Email Classifier: An implementation of a 'hard' Naive Bayes classifier in Python to categorize emails as spam or ham. This code performs extensive data preprocessing, probability calculations, and model training for email classification using the raw Naive Bayes algorithm.

ai emails jupyter-notebook machine-learning ml naive-bayes naive-bayes-classifier numpy pandas python spam-detection spam-filtering

Last synced: 11 Apr 2026

https://github.com/pvlbzn/datascience

Learning data science 🤓

data-science numpy pandas

Last synced: 09 May 2026

https://github.com/aaaastark/textclassification-nlp-project

Dataset preparation, Feature Engineering, Model Building, Summary Statistics, Data Exploration by Visualizations, and Save File CSV

matplotlib nltk numpy pandas python seaborn sklearn textblob wordcloud

Last synced: 11 Apr 2026

https://github.com/nikhil-donthusaram/heartdiseaseprediction

Heart Disease Prediction App is a machine learning web application that predicts the likelihood of heart disease based on user medical inputs. Built using a Decision Tree Classifier and deployed with Streamlit for an interactive, user-friendly interface.

data-analysis descision-tree joblib jupyter-notebook machine-learning matplotlib numpy pandas python3 seaborn sklearn streamlit vscode

Last synced: 11 Apr 2026

https://github.com/junaidsumsaal/advanceparkingsystem

A web application that integrates AI-powered parking spot predictions, real-time notifications, and role-based dashboards for Drivers, Attendants, and Providers. This system is designed to streamline parking operations, enhance user experience, and leverage machine learning for future availability predictions.

axios-react chakra-ui-react data-science met metrics ml numpy pand python3 react sklearn tailwindcss typescript

Last synced: 11 Apr 2026

https://github.com/kaguya163/marketing_campaigns

Analysis of marketing effectiveness in sports retail.

ab-testing machine-learning matplotlib numpy pandas python scikit-learn scipy sqlite3

Last synced: 11 Apr 2026

https://github.com/sudarsann27/basic_machine_learning_algorithms

Basic Machine learning algorithms using scikit-learn and other fundamental libraries

data-science data-visualization ensemble-model kaggle numpy pandas scikit-learn supervised-machine-learning

Last synced: 20 Jan 2026

https://github.com/coderjolly/employee-information-retrieval-system

An information retrieval system constructed using Flask to perform ETL operations while ingesting New York Payroll data and provide a frontend user interface for manipulating filters and queries.

etl flask mvc numpy pandas python sqlite3

Last synced: 11 Apr 2026

https://github.com/tanmay-312/pacman-game-python

"Embark on a nostalgic journey with my Python Pac-Man game! 🎮🕹️ Crafted using pygame, tcod, numpy, and enum libraries, it's a modern twist on a classic. Navigate the maze, chomp pellets, and evade ghosts in this retro-inspired adventure! 🚀👻 #PythonGameDev #PacManRevival #ClassicGaming"

enum game game-development gamedev numpy pygame pygame-games python python-3 python3 tcod

Last synced: 22 Apr 2026

https://github.com/alphan26/breastcancerclassification

This project involves a binary classification problem where we applied PyTorch to detect breast cancer.

matplotlib numpy pandas python pytorch

Last synced: 11 Apr 2026

https://github.com/hmourad2023/tweets_sentiments_emotions_analysis

Social networks, and Twitter in particular, are essential platforms where users express their feelings and emotions every day. Analyzing tweets can provide valuable information about the collective mood and people's concerns. This project uses a dataset of tweets available on Kaggle

nltk numpy pandas sklearn

Last synced: 01 Aug 2025

https://github.com/muhammad-rebaal/eda-with-python-and-pandas

Explore, Analyze, and Visualize Data with Python and Pandas: Dive into the world of data analysis and visualization using Python and Pandas with this repository. Discover valuable insights, clean and preprocess your data, and create stunning visualizations to make data-driven decisions.

calmap matplotlib numpy pandas python python3 seaborn

Last synced: 07 May 2026

https://github.com/psycho-poodle/simple_rnn_imdb

This repository contains a simple web application for sentiment analysis of movie reviews using a pre-trained RNN model. The application is built with TensorFlow and Streamlit, making it easy to use and deploy.

numpy python streamlit tensorflow

Last synced: 11 Apr 2026

https://github.com/dane-meister/machine-learning-algos

Repository of several ML algorithms implemented for different environments and robot control vectors.

colab-notebook gymnasium jupyter-notebook machine-learning-algorithms numpy opencv pybullet python pytorch

Last synced: 11 Apr 2026

https://github.com/nickklos10/league-of-legends-match-predictor

Predicting League of Legends Match Outcomes Using Logistic Regression and PyTorch.

jupyter-notebook matplotlib numpy pandas pytorch scikit-learn seaborn

Last synced: 11 Apr 2026

https://github.com/tigureis/aula_deployment

This project not only demonstrates a machine learning pipeline but also serves as a template for deploying models into production. By leveraging the project's structure and code, you can streamline the process of deploying your own machine learning models.

numpy pandas pickle sklearn

Last synced: 11 Apr 2026

https://github.com/collins-kimotho/wrangle-and-analyze-project

Data Wrangling and Analysis Project: Analyzing WeRateDogs Twitter Account Data

explanatory-data-analysis exploratory-data-analysis matplotlib numpy pandas python twitter twitter-api

Last synced: 11 May 2026

https://github.com/bhuvilol/objdetect_img

Object detection from an image input

numpy opencv python3 pytorch yolov8

Last synced: 04 Feb 2026

https://github.com/rezapace/number-generator

This project is a program that generates a sequence of numbers with a given step and appends text after each number. The program supports several methods for generating the numbers, including using CuPy to take advantage of the GPU and using threading to improve performance.

colab cupy custom generator jupiter kumal link number numpy pyhon webkumal webkumal-projek

Last synced: 11 Apr 2026

https://github.com/vishal-verma-96/Honey_Production_Analysis_using_Python

In this project, I analyzed U.S. honey production from 1998 to 2021 across different U.S. cities, explored yields, and examined production values using Matplotlib and Seaborn visualizations.

matplotlib numpy pandas python-3 python-visualization seaborn

Last synced: 19 Apr 2025

https://github.com/bilalm04/autonomous-lane-detection

An autonomous lane detection system leveraging OpenCV to process images and videos, accurately identifying lanes and providing key visualizations.

artificial-intelligence computer-vision lane-detection machine-learning matplotlib numpy opencv python

Last synced: 20 Jan 2026

https://github.com/pymivn/vinastats

Explore facts about Vietnam with statistics and figures from the General Statistics Office of Vietnam

numpy pandas statistics vietnam

Last synced: 02 Apr 2026

https://github.com/waynewbishop/bishop-algorithms-quiver-package

Quiver is a Swift package that provides vector operations for Swift applications.

data-science linear-algebra numpy swift vectors

Last synced: 31 Mar 2025

https://github.com/l-gre/tradefx-ai

A Python-based project exploring algorithmic trading strategies, including backtesting, real-time data integration, and predictive modelling with TensorFlow and Keras. Key topics include technical indicators, risk management, and leveraging AWS and broker APIs for automated trading

algorithmic-trading aws backtesting broker-api keras machine-learning matplotlib numpy pandas portfolio-optimisation python real-time-data-integration risk-management technical-indicators tensorflow

Last synced: 12 Apr 2026

https://github.com/shwetapardhi/assignment-3-q3-hypothesis-testing

Chi2 contingency independence test of buyer ratio. Assume the null hypothesis H0: independence of categorical variables (male-female buyer ratios are similar across regions; they do not vary and are not related). Thus the alternate hypothesis Ha: dependence of categorical variables (male-female buyer ratios are NOT similar across regions (does v

chi2-contingency contingency-analysis hypothesis-testing norm numpy pandas python scipy stats

Last synced: 12 Apr 2026
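
The chi-square independence test named in the assignment-3-q3-hypothesis-testing entry can be illustrated with a small standard-library sketch. It computes the Pearson statistic by hand rather than calling SciPy, applies no continuity correction, and uses a made-up 2x2 table; the function name chi2_independence and the counts are assumptions for illustration, not data or code from that repository.

```python
def chi2_independence(observed):
    """Return (statistic, degrees_of_freedom) for a contingency table.

    `observed` is a list of rows of counts. Expected counts assume the
    row and column variables are independent (the null hypothesis).
    """
    row_totals = [sum(row) for row in observed]
    col_totals = [sum(col) for col in zip(*observed)]
    total = sum(row_totals)

    statistic = 0.0
    for i, row in enumerate(observed):
        for j, count in enumerate(row):
            expected = row_totals[i] * col_totals[j] / total
            statistic += (count - expected) ** 2 / expected

    dof = (len(observed) - 1) * (len(col_totals) - 1)
    return statistic, dof


# Hypothetical counts: rows are male/female buyers, columns are two regions.
table = [[30, 20],
         [20, 30]]
stat, dof = chi2_independence(table)

# 3.841 is the chi-square critical value for 1 degree of freedom at alpha = 0.05.
reject_h0 = stat > 3.841
```

With these toy counts every expected cell is 25, so the statistic is 4.0 and H0 would be rejected at the 5% level. scipy.stats.chi2_contingency also returns a p-value, but it applies Yates' continuity correction to 2x2 tables by default, so its statistic for this table would differ.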

https://github.com/saopicc/ddfacet-kernels

Modular implementation of the DDFacet Gridding Kernels

numpy radio-astronomy

Last synced: 26 Jan 2026

https://github.com/shhiivvaam/traffic_prediction

This project explores the application of advanced neural network architectures, including Multi-Layer Perceptron (MLP), Long Short-Term Memory (LSTM), and Gated Recurrent Unit (GRU), to forecast traffic volume. 👍👍✅

conv1d conv2d keras lstm matplotlib numpy pandas seaborn sklearn statsmodels tensorflow

Last synced: 12 Apr 2026

https://github.com/iamkushvanth/marketing-campaign-analysis

Data analysis comparing the performance of Facebook and AdWords ad campaigns in 2019, using statistical methods such as A/B testing and focusing on clicks, conversions, cost-effectiveness, and seasonal trends.

ab-testing hypothesis-testing linear-regression matplotlib-pyplot numpy pandas python regression-analysis scikit-learn scipy stastistics

Last synced: 07 Apr 2026

https://github.com/prakshal0809/machine_learning_algorithms

This project showcases basic machine learning programs I developed during my studies, written in Python, intended for educational use or as a foundation for more advanced projects.

machine-learning matplotlib numpy pandas python

Last synced: 12 Apr 2026

https://github.com/marvin-vw/3dengine_python

3D engine written in Python using OpenCV and NumPy

3dengine numpy opencv python

Last synced: 04 Feb 2026

https://github.com/alwaysvivek/next-word-prediction

🔮 Predicts the next word in a text sequence using either an N-gram statistical model or an LSTM-based neural network.

argparse laplace-smoothing machine-learning neural-network ngrams nlp nltk numpy python3 tensorflow

Last synced: 12 Apr 2026
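
The N-gram option mentioned in the next-word-prediction entry, together with the laplace-smoothing tag, suggests a model along these lines. This is only a sketch of a bigram predictor with add-one smoothing, assuming whitespace tokenization and a toy corpus; the function names train_bigrams and next_word_probs are hypothetical and not taken from the repository.

```python
from collections import Counter, defaultdict


def train_bigrams(tokens):
    """Count how often each word follows each other word."""
    follow = defaultdict(Counter)
    for prev, nxt in zip(tokens, tokens[1:]):
        follow[prev][nxt] += 1
    return follow


def next_word_probs(follow, vocab, prev, alpha=1.0):
    """Laplace-smoothed P(word | prev) over the whole vocabulary."""
    counts = follow.get(prev, Counter())
    denom = sum(counts.values()) + alpha * len(vocab)
    return {word: (counts[word] + alpha) / denom for word in vocab}


# Toy corpus, used only for illustration.
tokens = "the cat sat on the mat the cat ran".split()
vocab = sorted(set(tokens))

model = train_bigrams(tokens)
probs = next_word_probs(model, vocab, "the")
best = max(probs, key=probs.get)  # most likely word after "the"
```

Smoothing gives every vocabulary word a non-zero probability, so a word never seen after "the" in training can still be proposed, just with low weight.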