An open API service indexing awesome lists of open source software.

NumPy

NumPy is an open source library for the Python programming language, adding support for large, multidimensional arrays, and matrices, along with a large collection of high-level mathematical functions to operate on these arrays.

https://github.com/gregoritsch3/ml_eda_clustering_aidassessment

An EDA and Machine Learning Clustering exercise on the Country Aid Assessment dataset demonstrating the use of PCA, KMeans and DBSCAN clustering, Elbow Methods, etc. The clustering algorithm successfully demarcates countries that are in most dire need of aid based on their GDPP and Child Mortality rate.

anova dbscan kmeans machine-learning matplotlib numpy pandas pca scikit-learn seaborn statistics

Last synced: 16 Apr 2026

https://github.com/eljandoubi/genre_classification

Create an ML pipeline for Genre Classification using MLflow.

hydra machine-learning mlflow numpy pandas pandas-profiling pytest scikit-learn scipy wandb

Last synced: 11 Apr 2026

https://github.com/audy21/datacamp

Learning portfolio documenting my progress, while taking Data Analyst & Data Science certifications from DataCamp.

data-analysis data-science machine-learning matplotlib numpy pandas python scikit-learn seaborn

Last synced: 11 Apr 2026

https://github.com/nachtfeuer/covid19

Python script(s) for visualizing corona data

csv json matplotlib numpy pandas python requests tkinter

Last synced: 05 May 2026

https://github.com/egorumaev/2023-cirrhosis-outcomes

Прогнозирование исхода лечения пациентов с циррозом печени

catboost imblearn iqr lda matplotlib numpy pandas pca phik pipeline sklearn t-sne xgboost

Last synced: 08 May 2026

https://github.com/bunu23/image-classification

This repository contains a notebook implementing a Convolutional Neural Network for multi-class image classification using transfer learning with a pre-trained ResNet-50 model. Covers dataset handling, model architecture customization, training, evaluation, fine-tuning, and external image prediction.

keras matplotlib numpy pil python tensorflow

Last synced: 11 Apr 2026

https://github.com/udhaya2823/dataspark-illuminating-insights-for-global-electronics

✨DataSpark✨ is a powerful analytics project transforming raw retail data into actionable insights for Global Electronics. By leveraging Python, SQL, and interactive visualizations, it uncovers trends in customer behavior, sales performance, and product popularity, driving smarter business decisions and boosting growth.

data-science data-visualization database-management datacleaning exploratory-data-analysis matplotlib numpy pandas powerbi python seaborn sql version-control

Last synced: 11 Apr 2026

https://github.com/project54321/stocksniper

"An interactive Stock Dashboard that visualizes historical prices, calculates key financial metrics, displays fundamental statements, and aggregates the latest news for any ticker — all in a sleek, easy-to-use Streamlit interface."

alphavantage finance numpy pandas plotly pypi python stock-market stocknews streamlit

Last synced: 11 Apr 2026

https://github.com/apfirebolt/numpy-and-pandas-examples

Some examples and sample datasets to learn numpy, pandas and other data science libraries in Python

data-analysis jupyter-notebook numpy pandas python

Last synced: 17 Apr 2026

https://github.com/Gregoritsch3/Exercise_Pandas_1

A Pandas exercise demonstrating the loading, cleaning and reorganization of data in .xlsx or .csv files, descriptive statistics, data visualization, statistical approximation of data with the normal distribution, etc.). Libraries include Pandas, NumPy, Scipy, SymPy, MatplotLib,

data-cleaning data-visualization descriptive-statistics matplotlib numpy pandas scipy sympy

Last synced: 01 May 2025

https://github.com/ikbalcaus/HandSketch

Drawing on Canvas with Hand Gestures + AI for Letter Recognition

mediapipe numpy ocr-recognition opencv python pytorch tkinter

Last synced: 31 Mar 2025

https://github.com/dmarks84/ind_project_obesity-multi-class-classification--kaggle

Independent Project - Kaggle Competition -- I worked on the obesity classification data set as part of a Kaggle Competition of the same name, scoring (for accuracy) above 0.9

classification correlation-analysis cross-validation data-modeling data-visualization dataframes eda gridsearchcv matplotlib multiclass-classification numpy pandas python seaborn sklearn statistics supervised-ml

Last synced: 11 Apr 2026

https://github.com/lohiyah/real-estate-price-forecast

A Python-based app predicting real estate prices using machine learning. Built with Pandas, NumPy, Scikit-learn, Matplotlib, and Seaborn for data processing and visualization, and Flask for the web interface.

flask matplotlib numpy pandas python3 scikit-learn seaborn

Last synced: 11 Apr 2026

https://github.com/pavanreddy565/infosys_python_for_datascience

Data Science Course Tasks Repository This repository contains tasks and projects completed during the Data Science course from Infosys Springboard. It includes setup instructions for running Jupyter Notebook sessions in a virtual environment.

matplotlib numpy pandas

Last synced: 29 Apr 2026

https://github.com/bhuvan-s-prasad/streamlit-regression

A machine learning project that predicts housing prices in California using regression techniques. This project includes comprehensive exploratory data analysis, feature engineering, linear regression modeling, and an interactive Streamlit web application for making predictions.

california-housing-price-prediction exploratory-data-analysis linear-regression machine-learning matplotlib numpy pandas python scikit-learn seaborn streamlit supervised-learning

Last synced: 11 Apr 2026

https://github.com/chirindaopensource/non_linear_forecasting_backcasting

Python implementation of Gourieroux-Jasiak's (2025) mixed causal-noncausal VAR models. Features probabilistic forecasting, nonlinear innovation filtering, and state-dependent IRF analysis for financial time series with explosive dynamics. Enables robust risk assessment and structural analysis of speculative behavior.

bootstrap bubble-analysis financial-modeling forecasting jupyter-notebook monte-carlo nonlinear-modeling numpy pandas python quantitative-finance research-implementation risk-management scipy statistical-modeling time-series uncertainty-quantification var-models

Last synced: 11 Apr 2026

https://github.com/eljandoubi/trading-with-momentum

A trading strategy based on a momentum indicator.

momentum numpy pandas scipy trading trading-algorithms trading-strategies

Last synced: 05 May 2026

https://github.com/pramodyasahan/model-selection

This repository explores and compares different regression models for predicting continuous outcomes. This repository includes implementations and evaluations of five key regression models. The primary goal is to demonstrate how each model works, evaluate their performance using R-squared values, and guide users in selecting the best model.

machine-learning modelselection numpy pandas python regression scikit-learn

Last synced: 08 Mar 2025

https://github.com/vyjayanthipolapragada/marketing_statistical_analysis

Statistical analysis of customer data and their impact on the sales of products based on marketing campaigns

customer-data data-analysis dataframes marketing matplotlib numpy pandas python seaborn statistical-analysis

Last synced: 11 Apr 2026

https://github.com/hitthecodelabs/weatherforecast-lstm

TensorFlow and Python for analyzing and forecasting weather data

cnn keras lstm matplotlib numpy pandas python tensorflow weather weather-forecast

Last synced: 11 Apr 2026

https://github.com/naveen88112/final_education

Student Performance Prediction This project examines the student performance data, pre-processes the features, and implements machine learning methods (Random Forest) for the forecasting of final grades. The data is subjected to exploratory data analysis (EDA) and feature engineering prior to model training and assessment.

data-preprocessing exploratory-data-analysis machine-learning model-evaluation numpy pandas python

Last synced: 11 Apr 2026

https://github.com/arnauog/ironhack_data_analytics

Weekly labs to practice everything taught in class during the Data Analytics bootcamp of Ironhack in Barcelona of Sept 2024 - March 2025.

machine-learning mongodb mysql numpy pandas python seaborn tableau webscraping

Last synced: 11 Apr 2026

https://github.com/chernyakid/bank-clients-churn-research

Анализ оттока клиентов в Метанпромбанке

jupyter numpy pandas python scipy seaborn

Last synced: 11 Apr 2026

https://github.com/samuele-lolli/data-analytics-techniques

A practical approach to data analytics pipeline.

numpy pandas pytorch scikit-learn

Last synced: 11 Apr 2026

https://github.com/iaroslavbelavskii/statkemapp

Here's the code for my 'StatKemApp' app, that can be used to calculate statistics for the experimental laboratory data (e.g. titration). Feel free to use it and report any bugs!

chemical-engineering chemistry matplotlib-pyplot numpy pandas python scipy

Last synced: 11 Apr 2026

https://github.com/sakhileln/multimodal-chatbot-with-memory

A lightweight, CLI-based conversational agent that processes text and image inputs, remembers past interactions.

chatbot machine-learning nlp numpy opencv spacy tensorflow

Last synced: 11 Apr 2026

https://github.com/dharma-acha/resnet18_imageclassification_cnn

In this part of the project, we implement ResNet-18 from scratch using PyTorch and train it on an image dataset to achieve over 75% accuracy. We apply techniques to prevent overfitting and optimize performance, aiming for an accuracy of 80% or higher.

matplotlib numpy python3 pytorch scikit-learn seaborn

Last synced: 11 Apr 2026

https://github.com/elmezianech/email-spam-ham-classifier-nb-hard

Naive Bayes Email Classifier: An implementation of a 'hard' Naive Bayes classifier in Python to categorize emails as spam or ham. This code performs extensive data preprocessing, probability calculations, and model training for email classification using the raw Naive Bayes algorithm.

ai emails jupyter-notebook machine-learning ml naive-bayes naive-bayes-classifier numpy pandas python spam-detection spam-filtering

Last synced: 11 Apr 2026

https://github.com/lorenzorottigni/dl-lending-club

Deep Learning python bootcamp: deep learning on Lending Club dataset

deep-learning ipynb keras machine-learning numpy pandas python scikit-learn seaborn tensorflow

Last synced: 11 Apr 2026

https://github.com/junaidsumsaal/advanceparkingsystem

Its a web application that integrates AI-powered parking spot predictions, real-time notifications, and role-based dashboards for Drivers, Attendants, and Providers. This system is designed to streamline parking operations, enhance user experience, and leverage machine learning for future availability predictions.

axios-react chakra-ui-react data-science met metrics ml numpy pand python3 react sklearn tailwindcss typescript

Last synced: 11 Apr 2026

https://github.com/moustafamohamed01/car-price-prediction

Car Price Prediction using Linear Regression A machine learning model that predicts car prices based on features like name, company, year, kilometers driven, and fuel type. Built with Linear Regression and deployed as a Streamlit web app.

data-science jupyter-notebook linear-regression machine-learning numpy pandas python sickit-learn sklearn streamlit

Last synced: 11 Apr 2026

https://github.com/kaguya163/marketing_campaigns

Анализ маркетинговой эффективности в спортивном ритейле.

ab-testing machine-learning matplotlib numpy pandas python scikit-learn scipy sqlite3

Last synced: 11 Apr 2026

https://github.com/tanmay-312/pacman-game-python

"Embark on a nostalgic journey with my Python Pac-Man game! 🎮🕹️ Crafted using pygame, tcod, numpy, and enum libraries, it's a modern twist on a classic. Navigate the maze, chomp pellets, and evade ghosts in this retro-inspired adventure! 🚀👻 #PythonGameDev #PacManRevival #ClassicGaming"

enum game game-development gamedev numpy pygame pygame-games python python-3 python3 tcod

Last synced: 22 Apr 2026

https://github.com/csengupta1101/housing-price-prediction

Based on given features , this project deals with the prediction of house prices. From data extraction till deployment , everything is detailed in here.

jupyter-notebook matplotlib numpy pandas python python3 scikit-learn vscode

Last synced: 11 Apr 2026

https://github.com/npodlozhniy/dl-tinkoff

This is a warehouse of my homeworks for master MIPT deep learning course from Tinkoff FinTech

deep-learning numpy pytorch

Last synced: 08 May 2026

https://github.com/psycho-poodle/simple_rnn_imdb

This repository contains a simple web application for sentiment analysis of movie reviews using a pre-trained RNN model. The application is built with TensorFlow and Streamlit, making it easy to use and deploy.

numpy python streamlit tensorflow

Last synced: 11 Apr 2026

https://github.com/tigureis/aula_deployment

This project not only demonstrates a machine learning pipeline but also serves as a template for deploying models into production. By leveraging the project's structure and code, you can streamline the process of deploying your own machine learning models.

numpy pandas pickle sklearn

Last synced: 11 Apr 2026

https://github.com/rezapace/number-generator

Proyek ini adalah sebuah program yang digunakan untuk menghasilkan deretan angka dengan langkah tertentu dan menambahkan teks setelah setiap angka. Program ini mendukung berbagai metode untuk menghasilkan angka, termasuk penggunaan CuPy untuk memanfaatkan GPU, serta penggunaan threading untuk meningkatkan kinerja.

colab cupy custom generator jupiter kumal link number numpy pyhon webkumal webkumal-projek

Last synced: 11 Apr 2026

https://github.com/beberiche/jeon-woo-chi

지역 축제를 기반으로 하는, 게이미피케이션을 접목시킨 지역 상권 맞춤 추천 서비스

aws django docker elasticsearch grafana jenkins kibana logstash numpy pandas phaserjs prometheus react react-query recoil redis spring-boot tensorflow typescript zipkin

Last synced: 10 Mar 2026

https://github.com/shudhanshusaurabh001/super_market-data-analysis-using-python

This project focuses on analyzing supermarket sales data using Python. The goal is to extract meaningful insights from the dataset, such as sales trends, customer purchasing behavior, and product performance.

analysis csv data insights matplotlib numpy pandas project python seaborn

Last synced: 06 Apr 2026

https://github.com/itssahilwhat/AI-Fundamentals

A curated collection of fundamental AI concepts, algorithms, and code implementations — including Machine Learning, Deep Learning, and Computer Vision — built from scratch and with practical examples.

computer-vision deep-learning machine-learning numpy pandas python pytorch scikit-learn

Last synced: 02 Jul 2025

https://github.com/bilalm04/autonomous-lane-detection

An autonomous lane detection system leveraging OpenCV to process images and videos, accurately identifying lanes and providing key visualizations.

artificial-intelligence computer-vision lane-detection machine-learning matplotlib numpy opencv python

Last synced: 20 Jan 2026

https://github.com/rnuv/artificial-neural-network

A Simple 2 layered Artificial Neural Network library made from scratch in Python and NumPy. Fitted with a feed forward method and backpropagation.

neural-network numpy python tensorflow

Last synced: 18 Mar 2025

https://github.com/klihe/astar-visualization

I am trying to implement *a path finding algorithm and create a visualization of it with Python and numpy library

algorithm astar-algorithm numpy pathfinding pygame python visualization

Last synced: 06 Apr 2025

https://github.com/shwetapardhi/assignment-3-q3-hypothesis-testing

Chi2 contengency independence test image of Buyer Ratio Assume Null Hypothesis as Ho: Independence of categorical variables (male-female buyer rations are similar across regions (does not vary and are not related) Thus Alternate Hypothesis as Ha: Dependence of categorical variables (male-female buyer rations are NOT similar across regions (does v

chi2-contingency contingency-analysis hypothesis-testing norm numpy pandas python scipy stats

Last synced: 12 Apr 2026

https://github.com/mohith-2002/plotchart

PlotChart is a web application created by the Python library Streamlit.

matplotlib numpy pandas python streamlit

Last synced: 12 Apr 2026

https://github.com/lexxai/goit_python_ds_hw_05

Метод опорних векторів -Support Vector Machine, SVM. Дерева рішень - RandomForestClassifier, XGBClassifier

classification numpy pandas python random-forest-classifier reporting scikit-learn svm-classifier svm-model xgbclassifier

Last synced: 12 Apr 2026

https://github.com/prakshal0809/machine_learning_algorithms

This project showcases basic machine learning programs I developed during my studies, written in Python, intended for educational use or as a foundation for more advanced projects.

machine-learning matplotlib numpy pandas python

Last synced: 12 Apr 2026

https://github.com/webintellectual/coloured-image-to-b-w-converter-web-app

A Streamlit web application to convert a coloured image to black and white image.

numpy opencv pil pillow python3 streamlit streamlit-webapp

Last synced: 12 Apr 2026

https://github.com/id-andyyy/alfahack

📈💰 Investment propensity prediction model

catboost hackathon-project jupyter lightgbm numpy optuna pandas python scikit-learn

Last synced: 12 Apr 2026

https://github.com/adarshpheonix2810/fake-job-post-detection

This project focuses on detecting fake job posts using machine learning. Fake job advertisements are often created to scam individuals by stealing personal information or money.

data-analysis deep-learning joblib machine-learning nlp-machine-learning numpy pandas python scikit-learn tkinter

Last synced: 12 Apr 2026

https://github.com/khushi130404/echobot

EchoBot is a chatbot built using TensorFlow to classify user inputs into predefined intents and respond accordingly. It uses a neural network trained with the data in intents.json.

keras nlp nltk numpy pickle tensorflow

Last synced: 12 Apr 2026

https://github.com/mark1708/neurointerfaces-of-information-systems

Laboratory work on the discipline "Neurointerfaces of information systems"

numpy pandas python scikit-learn

Last synced: 12 Apr 2026

https://github.com/mariam-zaidi/credit_risk_analytics

Performing Exploratory Data analysis for loan application approvals and understanding Risk analytics

exploratory-data-analysis matplotlib numpy pandas risk-analytics seaborn

Last synced: 06 May 2026

https://github.com/nazwright/daria

Real-time fraud detection architecture powered by AWS Kinesis, KaggleHub, and SMOTE-augmented data — the foundation of DARIA, the Detection And Risk-Intelligence Agent.

aws evm fraud fraud-detection-using-machine-learning kaggle kinesis machine-learning math numpy pandas python random web3

Last synced: 12 Apr 2026

https://github.com/manuelfay/numpydeeplearning

Deep Learning Framework built entirely using numpy

deep-learning deep-learning-framework minigrad numpy pytorch

Last synced: 06 May 2026

https://github.com/jvictor011/big-five-personality-test

Usando machine learning para fazer o agrupamento de um sistema de traço de personalidade

matplotlib numpy os pandas python seaborn

Last synced: 07 May 2025

https://github.com/alphan26/humidityregressor

This project involves developing a model to predict relative humidity using PyTorch. The model was trained on a dataset containing monthly data, with the goal of forecasting the relative humidity for the subsequent month.

matplotlib numpy pandas python pytorch

Last synced: 11 Apr 2026

https://github.com/darkusphantom/newton-method

7 ejercicios practicos del método de Newton en la programación computacional

computational-maths numpy python

Last synced: 25 Mar 2025

https://github.com/kareem74x/data-mining-uni-project

A project focused on applying data mining techniques to extract insights from datasets

data-mining matplotlib numpy pandas python

Last synced: 25 Feb 2026

https://github.com/ds-python-programmers-spectrum/flipkart-sentimental_analysis-app

The Flipkart Sentiment Analysis project aims to analyze customer reviews of Flipkart products using Natural Language Processing (NLP) and visualize insights through a Streamlit-based web application. This system will classify customer feedback as positive, negative, or neutral

machine-learning nltk numpy pandas sentiment-analysis streamlit-webapp textblob vader-sentiment-analysis

Last synced: 22 Jun 2025

https://github.com/byteshiva/getmaplocation

Automate Dummy data from the open data source. Parse International Address, Geopy to automate map locations using lat and lon, Cache geopy map location calls using sqlite3, Use Pandas to parse and convert CSV to JSON

csv-parser geopy json numpy pandas pandas-dataframe parser pickle python3 sqlite3

Last synced: 18 Apr 2026

https://github.com/sayan520/titanic-data-insights

Conducting data analysis on Kaggle's Titanic: Machine Learning from Disaster dataset using essential data wrangling, exploratory data analysis (EDA), and visualization techniques to uncover insights, identify patterns, and explore factors influencing passenger survival.

jupyter-notebook kaggle matplotlib numpy pandas python scikit-learn seaborn

Last synced: 12 Apr 2026

https://github.com/tnleite/credit-card-customer-clustering

Este repositório apresenta um projeto de segmentação e predição de clientes de cartões de crédito. Utilizando EDA, clusterização (K-Means) e machine learning, o objetivo é prever o grupo de novos clientes, apoiando estratégias de marketing personalizadas.

classification-algorithm clustering-algorithm clustering-analysis data-science exploratory-data-analysis kmeans-clustering logistic-regression machine-learning-algorithms machine-learning-models matplotlib numpy scikit-learn seaborn

Last synced: 07 May 2026

https://github.com/arrmansa/leetcode-nice-solutions

Some solutions for leetcode problems that I found fun - either the algorithm or using numpy bruteforce where it shouldn't be

leetcode leetcode-python leetcode-python3 leetcode-solutions numpy python python3

Last synced: 16 Mar 2025

https://github.com/rahulsm20/insurance-data

A data analytics project dealing with risk assessment and it's effects in health insurance.

data-analysis data-analytics machine-learning matplotlib numpy pandas python scikit-learn

Last synced: 12 Apr 2026

https://github.com/thariniselvakumar/kidney-disease-prediction

This project is about the kidney disease prediction using machine learning algorithms

machine-learning matplotlib numpy pandas scikit-learn seaborn

Last synced: 12 Apr 2026

https://github.com/filip-kustura/python-covid-19-behaviors-analysis

Using Jupyter Notebook, this university project analyzes attitudes and behaviors related to the COVID-19 pandemic using a two-year survey from Imperial College London and YouGov research company. Utilizing Pandas, NumPy and Matplotlib, the data analysis focuses on three countries, exploring trends and insights throughout the pandemic.

covid-19 data-analysis data-visualization jupyter-notebook matplotlib numpy pandas python university-project

Last synced: 12 Apr 2026

https://github.com/mpoojithavigneswari/bangalore-house-price-prediction

This project involves creating a website that predicts Bangalore house prices with 94.65% accuracy using a machine learning algorithm.

data-analysis data-science flask-server machine-learning matplotlib numpy pandas python scikit-learn seaborn

Last synced: 12 Apr 2026