An open API service indexing awesome lists of open source software.

NumPy

NumPy is an open source library for the Python programming language, adding support for large, multidimensional arrays, and matrices, along with a large collection of high-level mathematical functions to operate on these arrays.

https://github.com/linsanity03/algorithmic_trading

Trading strategy based on the intersection of short-term and long-term moving averages.

jupyter-notebook matplotlib-pyplot numpy pandas

Last synced: 04 May 2026

https://github.com/nordic96/lanedetection

Lane Detection Module using OpenCv and Python for educational purpose.

computer-vision lane-detection matplotlib-pyplot numpy opencv python3

Last synced: 04 May 2026

https://github.com/abhijeet-bhambere/exploratory_data_analysis

A collection of several projects for conducting EDA on datasets from various sources

exploratory-data-analysis matplotlib numpy pandas python

Last synced: 04 May 2026

https://github.com/shamiul5201/selfie_app_features_application

OpenCV project to build skills in image processing: features include Cartoonify and Pencil Sketch effects in a Jupyter Notebook, blemish removal in a Python script, and chroma keying for videos. Perfect for exploring creative and practical computer vision techniques.

computer-vision numpy opencv python

Last synced: 04 May 2026

https://github.com/0nom4d/308reedpipes

Mathematical EPITECH Project - This project computes a representation of a reedpipe.

coverage epitech epitech-project matplotlib-pyplot numpy pytest python scipy

Last synced: 04 May 2026

https://github.com/manuethomas/credit-default-risk-analysis-eda

This repository contains the detailed EDA analysis of Home Credit Group Dataset. The analysis aims to find demographic and financial factors associated with higher or lower default risks, providing actionable insights for risk mitigation and improved lending practices

bivariate-analysis correlation-analysis data-preprocessing exploratory-data-analysis exploratory-data-visualizations matplotlib numpy pandas seaborn univariate-analysis

Last synced: 04 May 2026

https://github.com/abhishekyadav915/e-commerce-sales-analysis

E-Commerce Sales Analysis is a data analysis project that explores sales data from an e-commerce platform to uncover insights and trends. The analysis includes visualizing sales performance, customer behavior, and product trends to help optimize business strategies and improve customer satisfaction.

matplotlib-pyplot numpy pandas-library plotly python3

Last synced: 04 May 2026

https://github.com/soumyapro/house-price-prediction

Explored different attributes of Boston housing dataset then a part of dataset was used to train the linear regression algorithm after that trained model was used to give predictions on remaining part of dataset.

linear-regression matlplotlib numpy pandas sklearn svm

Last synced: 13 Apr 2026

https://github.com/aniketwithpython/fundamentals-of-scientific-computing

Fundamentals of Scientific Computing (MS1980) Notebook

jupyter matplotlib numpy pandas python

Last synced: 01 May 2026

https://github.com/sorabh-kapoor/face-recognition-attendance-system

The Facial Recognition System is an AI-powered application USING FLASK designed to detect and recognize faces with high accuracy. This system can be integrated into various applications, including security systems, attendance management, and identity verification.

flask flask-application knn ml numpy opencv pandas python scikit-learn

Last synced: 13 Apr 2026

https://github.com/batthulavinay/which-anime-dominates-the-world

This Kaggle notebook analyzes the global popularity of anime across different countries. Using data visualization and statistical insights, we explore which anime dominate in various regions and uncover trends in global anime consumption.

data-visualization exploratory-data-analysis matplotlib numpy pandas python seaborn

Last synced: 24 Mar 2025

https://github.com/dmarks84/coursework_project_nlp-with-nltk

Project for University of Michigan Applied Data Science Specialization -- Utilized NLTK library to process natural language, and then built several spelling recommenders for a list of misspelled words.

data-modeling databases dataframes eda nlp numpy pandas python reporting statistics text-mining visualization

Last synced: 13 Apr 2026

https://github.com/thorbeorn/epsi-m1-datamanagement-etude-de-cas

Ce projet implémente une chaîne complète de traitement de données RH après fusion d’entreprise

dashboard duckdb epsi etl kaleido numpy pandas plotly python pyyaml

Last synced: 11 Apr 2026

https://github.com/elmezianech/email-spam-ham-classifier-nb-hard

Naive Bayes Email Classifier: An implementation of a 'hard' Naive Bayes classifier in Python to categorize emails as spam or ham. This code performs extensive data preprocessing, probability calculations, and model training for email classification using the raw Naive Bayes algorithm.

ai emails jupyter-notebook machine-learning ml naive-bayes naive-bayes-classifier numpy pandas python spam-detection spam-filtering

Last synced: 11 Apr 2026

https://github.com/nambukeerthi/copper_project

Data Analysis Skills – Work with real-world data. Machine Learning Experience – Build and train models. Web App Development – Create interactive dashboards

eda machine-learning numpy pandas python streamlit

Last synced: 13 Apr 2026

https://github.com/shwetapardhi/assignment-05-multiple-linear-regression-2

Prepare a prediction model for profit of 50_startups data. Do transformations for getting better predictions of profit and make a table containing R^2 value for each prepared model. R&D Spend -- Research and devolop spend in the past few years Administration -- spend on administration in the past few years Marketing Spend -- spend on Marketing in t

collinearity-diagnostics cooks-distance correlation-analysis eda heteroscedasticity homoscedasticity leverage-score multi-linear-regression numpy ols-regression p-value pair-plot python r-square-values regress-exog residual-analysis smf statsmodels vif

Last synced: 09 May 2026

https://github.com/shwetapardhi/assignment-03-q2--hypothesis-testing

Anova ftest statistics A hospital wants to determine whether there is any difference in the average Turn Around Time (TAT) of reports of the laboratories on their preferred list. They collected a random sample and recorded TAT for reports of 4 laboratories. TAT is defined as sample collected to report dispatch. Analyze the data and determine wheth

annova ftest hypothesis-testing numpy pandas python scipy significance-testing stats

Last synced: 11 Apr 2026

https://github.com/richardbmk/datascience_machinelearning

projects related with data science and machine learning projects.

data-science machine-learning matplotlib numpy pandas scikit-learn scipy seaborn

Last synced: 11 Apr 2026

https://github.com/saraasgari99/videoclassificationusingdeeplearning

ML pipeline for biological video analysis using cutting-edge CNN architectures and industry-standard data processing and ML techniques

computer-vision data-processing keras-tensorflow machine-learning numpy sklearn tensorflow

Last synced: 13 Apr 2026

https://github.com/udacity-machinelearning-internship/support-vector-machine

Implementing SVM's using pandas and sklearn in python

numpy pandas sklearn support-vector-machines svm

Last synced: 13 Apr 2026

https://github.com/muhammadadilnaeem/bcg-data-science-job-simulation-on-forage-august-2024

This repository contains all the tasks, code, and documentation completed during the BCG Data Science job simulation on The Forage platform. The simulation focused on analyzing customer churn, building predictive models, and presenting insights for a major utility company.

bcg customer-churn-prediction-with-machine-learning data data-science forage numpy pandas

Last synced: 01 May 2026

https://github.com/zhanziyuan/pixelpuzzle-web

Encode/decode images using Base64 or shuffle/recover the pixels of images.

anime encryption image image-processing numpy python

Last synced: 11 Apr 2026

https://github.com/sadia-khan13/data-preprocessing

Welcome to the Data preprocessing Repository! This repository is dedicated to showcase the comprehensive resources and implementations related to Data Preprocessing using Python and Jupyter Notebook.

artificial-intelligence data-analysis data-mining data-preprocessing data-science jupyter-notebook matplotlib numpy pandas python seaborn-python sklearn

Last synced: 11 Apr 2026

https://github.com/1adityakadam/uber_data_analytics

End to end Google Bigquery + Looker Studio Data Analytics Project Transforming NYC Taxi Data into Actionable Intelligence

bigquery looker-studio mage-ai-pipeline numpy pandas sql

Last synced: 13 Apr 2026

https://github.com/rgandhi8/inventory-management-system

This project is an AI-driven Inventory Management System designed to optimize supply chain efficiency by leveraging machine learning techniques for demand forecasting. The system integrates ARIMA and ANNs to predict inventory needs, minimize shortages, and reduce holding costs also enhances decision-making by automating inventory planning.

dashboard inventory-management-system machine-learning numpy pandas python seaborn

Last synced: 08 Jun 2026

https://github.com/csengupta1101/netflix-rating

The project revolves around Netflix shows and movies around the world. The problem statement that is being tried to address here is that what kind of show to come up with in future times and how well that will fit with the audience

jupyter-notebook matplotlib movies netflix numpy pandas plotly python python3 rating tvseries

Last synced: 13 Apr 2026

https://github.com/sadegh15khedry/housing-prices-prediction-using-randomforest

This repository contains an implementation of random forest model to predict housing prices using the Boston Housing dataset.

csv joblib jupyter-notebook matplotlib numpy pandas pil python random-forest seaborn sklearn

Last synced: 27 Feb 2025

https://github.com/mat-kubiak/manual-ml

DIY ML framework (using only NumPy!) + various demos

artificial-intelligence examples machine-learning ml ml-framework neural-networks numpy

Last synced: 06 May 2026

https://github.com/5hraddha/eda-instacart-customers-shopping-habits

In this Exploratory Data Analysis (EDA) project we'll clean up the data and prepare a report that gives insight into the shopping habits of Instacart customers.

data-visualization exploratory-data-analysis instacart matlpotlib numpy pandas

Last synced: 13 Apr 2026

https://github.com/thanhloc81/rfm-model-project

✨ Build a flow to deploy Segmentation evaluation through Python programming.

customer-segmentation numpy pandas python rfm-analysis

Last synced: 03 May 2026

https://github.com/kzon94/torn-market-analyzer

Streamlit app that parses Torn Add Listing text, matches items with a custom dictionary, fetches market data via the public API, and generates KPIs and price recommendations using a modular Python analytics pipeline.

data-analysis data-engineering fuzzy-matching market-analytics numpy pandas python streamlit torn-city torn-city-api

Last synced: 11 Apr 2026

https://github.com/shwetajanwekar/hr-data-analytics

Analyze the HR data through exploratory data analysis, we identified key factors influencing employee attrition, satisfaction levels, and performance. Machine learning models enabled us to predict employee churn and classify potential candidates for promotion.

matplotlib numpy pandas seaborn

Last synced: 19 Apr 2026

https://github.com/sankaran-s2001/us-traffic-accidents-analysis-python-eda

Exploratory data analysis of US traffic accidents from 2016-2023, analyzing patterns by time, location, weather, and severity using Python data science libraries.

data-analysis data-science data-visualization eda matplolib numpy pandas python

Last synced: 06 May 2026

https://github.com/mstrlc/iss-project

FIT VUT – ISS – signal processing - synthesizing and resynthesizing of a MIDI keyboard

fit iss matplotlib numpy python scipy signal-processing vut

Last synced: 11 Mar 2025

https://github.com/cyb3r-jak3/docker-alpine-pandas

Python alpine image with pandas, and numpy installed

alpine-image numpy pandas-python python-alpine

Last synced: 30 May 2026

https://github.com/dharma-acha/resnet18_imageclassification_cnn

In this part of the project, we implement ResNet-18 from scratch using PyTorch and train it on an image dataset to achieve over 75% accuracy. We apply techniques to prevent overfitting and optimize performance, aiming for an accuracy of 80% or higher.

matplotlib numpy python3 pytorch scikit-learn seaborn

Last synced: 11 Apr 2026

https://github.com/thinker84/real-time-stock-price-prediction-and-market-analysis-using-machine-learning

Real-time stock price prediction app using LSTM, Streamlit, and historical data (2010–2023). Forecasts next 10 days & visualizes trends.

data-science django lstm machine-learning numpy pandas pandas-datareader scikit-learn stock-market stock-price-prediction stooq streamlit yahoo-finance yahoo-finance-api

Last synced: 13 Jul 2025

https://github.com/mgitrov/ann-from-scratch

An artificial neural network implemented from scratch using numpy only.

artificial-neural-networks deep-learning guide numpy

Last synced: 02 May 2026

https://github.com/nirmaldeepponnada/codeclauseinternshipproject2

Python, NLTK, Scikit-Learn, Pandas, NumPy, Pickle, SciPy, and JSON are used for text preprocessing, feature engineering, multi-label classification, and model persistence.

nltk numpy pandas pickle python scikit-learn scipy

Last synced: 07 Apr 2026

https://github.com/voidkeishi/computational-science

This repository contains implementations of algorithms from the course "Computational Science" at HUST

computational-science numpy python

Last synced: 02 May 2026

https://github.com/imdeepmind/knn_from_scratch

KNN classifier in pure Numpy and Python

knn knn-classification machine-learning numpy python

Last synced: 11 Apr 2026

https://github.com/nagipragalathan/zk-connect

In the conventional hiring process, candidate applications are met with limited assessment, leaving the crucial task of evaluating suitability to recruiters. ZKConnect disrupts this norm by introducing a transformative approach.

css django html nextjs numpy opencv pandas python

Last synced: 11 Apr 2026

https://github.com/lipskiyp/trading_dashboard

Pandas framework to analyze financial time-series and calculate various performance metrics (e.g. Sharpe Ratio and Max Drawdown Duration).

financial-analysis numpy pandas

Last synced: 02 May 2026

https://github.com/vyjayanthipolapragada/fraud_detection_creditcard

Detecting the fraudulent credit card transactions by training Decision Tree model using Scikit-learn and SnapML

classification-model data-preprocessing decision-tree-classifier kaggle-dataset machine-learning numpy pandas python scikit-learn snapml time tree-model

Last synced: 11 Apr 2026

https://github.com/moonshallow5/solar-system-simulation

A solar system simulation using the Beeman scheme and the Euler method

matplotlib-animation matplotlib-pyplot numpy

Last synced: 13 May 2026

https://github.com/sakhileln/multimodal-chatbot-with-memory

A lightweight, CLI-based conversational agent that processes text and image inputs, remembers past interactions.

chatbot machine-learning nlp numpy opencv spacy tensorflow

Last synced: 11 Apr 2026

https://github.com/pd2871/streamlit_automl

Auto Exploratory Data Analysis using Streamlit App

matplotlib numpy pandas python streamlit streamlit-webapp

Last synced: 11 Apr 2026

https://github.com/rishabhmathur06/pytorch

This repository contains codes related to the PyTorch tutorial provided by Patrick Loeber

neural-network numpy pandas python pytorch sklearn

Last synced: 13 Apr 2026

https://github.com/mastermindromii/car-price-prediction-model

Here is My Regression Project based on Predicting Price of Car using Linear Regression.

linear-regression matplotlib numpy pandas python scikit-learn seaborn

Last synced: 13 Apr 2026

https://github.com/dmarks84/coursework_project_network-analysis-node-link-prediction

Project for University of Michigan Applied Data Science Specialization -- Analyzed network nodes and edges, developing custom features based on various scoring metrics; used features to train classifier model to predict node attribute (employee salary type) and future edges (employee connections)

classification cross-validation data-reporting databases eda grid-search matplotlib network-analysis numpy pandas python scikit-learn statistics supervised-ml visualization

Last synced: 13 Apr 2026

https://github.com/dharma-acha/imageclassification

This project is an interactive Streamlit web application using the VGG-13 model to classify images from the CIFAR-10 dataset. Users can upload images to receive real-time predictions and visual explanations of the model's decisions. The goal is to accurately classify images into one of the ten CIFAR-10 classes: airplanes, automobiles, birds, cats,

colab-notebook matplotlib numpy pandas python3 pytorch scikit-learn seaborn streamlit

Last synced: 13 Apr 2026

https://github.com/thaisgarcia/convert-censo-file

Este repositório contém scripts Python para processar dados destinados ao censo do MEC, incluindo a leitura de um arquivo TXT, a geração de um arquivo Excel e a conversão desse arquivo Excel de volta para um formato de texto.

numpy pandas python

Last synced: 11 Apr 2026

https://github.com/singhvishal003/brain_stroke_prediction

Brain Stroke Prediction according to age and Health Condition.

matplotlib numpy pandas sklearn

Last synced: 13 Apr 2026

https://github.com/shreeparab1890/heart-attack-analysis-prediction-accuracy-85.24

This iPython Notebook implements the Analysis and Predictions for Heart Attack. The Analysis is done on the dataset to identify the key features which has a impact on Heart Attack. After the analysis and data cleaning, the dataset is used to train the model using different ML algorithm.

eda exploratory-data-analysis matplotlib numpy pandas plotly prediction python scikit sklearn

Last synced: 13 Apr 2026

https://github.com/marianoviola/reim

🕸️ A Computational Framework for Inferring System Truth from Distributed Noisy Observations

fastapi numpy pandas python3 scipy

Last synced: 11 Jun 2026

https://github.com/iaroslavbelavskii/statkemapp

Here's the code for my 'StatKemApp' app, that can be used to calculate statistics for the experimental laboratory data (e.g. titration). Feel free to use it and report any bugs!

chemical-engineering chemistry matplotlib-pyplot numpy pandas python scipy

Last synced: 11 Apr 2026

https://github.com/erick957/saleprice-prediction-dataset-analysis-and-cleaning-advance-regression

🏠 Predict house prices using advanced regression techniques with this comprehensive analysis and cleaning project, from data loading to model deployment.

data-analysis data-science eda google-colab machine-learning numpy pandas python scikit-learn scikit-learn-python

Last synced: 06 May 2026

https://github.com/nyx1311/toxicity-detector-using-bilstm

🚀 What we built: An AI-powered Women’s Safety & Well-Being Detector — a web app that flags multiple forms of online abuse in real time and offers tools for emotional recovery. 📊 Under the hood: BiLSTM + Word2Vec embeddings for deep, context-aware detection Trained on 21K+ labeled comments across 7 toxicity categories Built with Python, Tensor

epoch genism gpu keras-tensorflow matplotlib model nlp nlp-machine-learning nlpaug-textual numpy pandas pandas-library python3 streamlit tensorflow word2vec

Last synced: 14 Apr 2026

https://github.com/hilalozdemirbuyukasik/deep-learning

A collection of deep learning projects demonstrating RNNs, BiLSTMs, CNNs, and basic neural networks applied to time series forecasting, text sentiment analysis, image classification, and tabular data tasks, with examples of data preprocessing, model training, evaluation, and visualization.

bilstm cnn keras matplotlib nn numpy rnn scikit-learn tensorflow

Last synced: 12 Apr 2026

https://github.com/lesiaukr/mathematic_for_cs_colab_hw

Master's degree | HW | Mathematics for Computer Science course | Homeworks

goit google-colab home-works latex linear-algebra mathematics-for-computerscience matrices numpy python vectors

Last synced: 13 Apr 2026

https://github.com/ediloaz/linear-regression-with-lists-numpy-and-pytorch

This was done like the homework #01 of Machine Learning Course, of Computer Science Master.

arrays-raw jupyter-notebook machine-learning numpy python pytorch

Last synced: 13 Apr 2026

https://github.com/abhinavraj009/x_ray-conversion-csv-

To convert an X-ray image into a CSV format for machine learning or data analysis, you'll typically want to represent the image as numerical pixel values.

machine-learning numpy opencv pandas python

Last synced: 13 Apr 2026

https://github.com/tigureis/analise_dataset_campanha_marketing

O projeto visa melhorar a eficácia das campanhas de marketing através da análise de dados e da construção de um modelo preditivo.

numpy pandas plotly seaborn sklearn xgbo

Last synced: 13 Apr 2026

https://github.com/amiriiw/text_classification

Welcome to the Text Classification Project! This project is designed to train a model for classifying texts based on their emotional content and then using it to categorize new texts into corresponding emotional categories.

keras numpy pandas pickle scikit-learn tensorflow text-classification

Last synced: 20 Jan 2026

https://github.com/mmerlyn/analysis-of-tomato-prices

Forecasting tomato prices in Karnataka using machine learning to help farmers make better crop planning and selling decisions.

css flask html matplotlib numpy pandas python scikit-learn seaborn

Last synced: 06 Jul 2025

https://github.com/vedikasnehil/daily-leetcode-

Daily LeetCode Using Python is a project dedicated to solving coding challenges consistently using Python. It focuses on improving problem-solving skills, mastering Python techniques, and building a collection of clean, well-documented solutions. Perfect for interview preparation, learning algorithms, or daily coding practice!

matplotlib numpy pandas python python3 seaborn

Last synced: 13 Apr 2026

https://github.com/amnydv17/landmark-detection

This project aims to leverage the power of deep learning models to automatically detect and pinpoint landmarks such as famous monuments, buildings, natural landmarks, and other recognizable structures within images.

machine-learning matplotlib numpy pandas python3 scikit-learn seaborn tensorflow

Last synced: 11 Apr 2026

https://github.com/csengupta1101/career-survey---data-scientist

This repository holds an in depth analysis of Scrapped data from naukri.com. which showcases in in demand skills and opportunities in Data science across the subcontinent.

analysis career data-science matplotlib naukri numpy pandas python3

Last synced: 07 May 2026

https://github.com/harmanveer-2546/covid-19-chances-detection

The main goal of this project is to build a machine learning model that, given a Covid-19 patient's current symptom, status, and medical history, will predict whether the patient is in high risk or not.

covid-19 covid-data logistic-regression matplotlib mlp-classifier numpy pandas python random-forest seaborn sklearn-library xgboost

Last synced: 10 Apr 2026

https://github.com/chernyakid/bank-clients-churn-research

Анализ оттока клиентов в Метанпромбанке

jupyter numpy pandas python scipy seaborn

Last synced: 11 Apr 2026

https://github.com/krook1024-uni-archive/kmeans

kmeans implementation for a university course

kmeans kmeans-clustering machine-learning numpy python

Last synced: 02 May 2026

https://github.com/aiwithqasim/ab-testing-for-shoefly.com

In this project, I had analyzed the data set of SHOEFLY.COM to find out valuable insights to check which platform is more useful for advertisement.

numpy pandas python3 sqlite3

Last synced: 13 Apr 2026

https://github.com/bharath-tars/streamlit_diabsynth

Predictive Health Analytics for Diabetic Risk Assessment and Personalized Reporting WebApp using Streamlit

deeplearning mahine-learning numpy onrender-deploy pandas seaborn streamlit

Last synced: 02 May 2026

https://github.com/danny-1k/nnpy

Neural network library written from scratch in numpy

deep-learning neural-networks-from-scratch numpy python

Last synced: 17 May 2026

https://github.com/legalaspro/rnn_gru_lstm_experiments

RNN, GRU, LSTM implementation using PyTorch and Numpy

gru ltsm numpy pytorch rnn

Last synced: 21 Apr 2026

https://github.com/lupin3000/augmentedreality

Introduction into augmented reality (AR) with Python OpenCV ArUco markers.

aruco aruco-marker camera-calibration marker-detection numpy opencv pose-estimation python

Last synced: 13 Apr 2026

https://github.com/tkc/node-neural-network

Built with only pure typescript, neural network iris sample.

javascript machine-learning neural-network node numpy typescript

Last synced: 09 Apr 2026

https://gitlab.com/DLR-SY/mcodac

MCODAC is a Fortran library for the numerical evaluation of fiber composite damage. The library contains analysis methods specifically tailored to fiber composites, from micromechanical homogenization approaches to macroscopic fatigue models of orthotropic multilayer composites.

fortran numpy python

Last synced: 02 Apr 2025

https://github.com/parthshah02/customer_churn_dashboard

This repository features a comprehensive project showcasing data analysis and interactive dashboard using Python

data-analysis matplotlib numpy pandas python

Last synced: 13 Apr 2026

https://github.com/supreme-gg-gg/beam-analysis

Fast, customisable, intuitive web application to simplify beam analysis. Certified by Bob the Builder.

civil-engineering matplotlib numpy python streamlit structural-analysis

Last synced: 11 Apr 2026

https://github.com/lingesh81051/similar-template-document-matching-and-fraud-detection

An automated system for a health insurance company to streamline document processing, including template matching and fraud detection, resulting in reduction of processing time.

numpy opencv opencv-python pillow pymupdf pytesseract pytesseract-ocr python tkinter

Last synced: 13 Apr 2026