An open API service indexing awesome lists of open source software.

NumPy

NumPy is an open source library for the Python programming language that adds support for large, multidimensional arrays and matrices, along with a large collection of high-level mathematical functions to operate on these arrays.

https://github.com/arn4/labtools

Package with useful scripts for Physics Laboratory Courses

numpy python3 scipy uncertainties

Last synced: 05 Jan 2026

https://github.com/saliola/nonnegative_integer_matrices

Code to generate and count nonnegative integer matrices with prescribed row and column sums (also known as contingency tables)

cython cython-examples numpy numpy-examples python3

Last synced: 18 Apr 2026

https://github.com/lijesh010/ml_project_data_preprocessing

The main objective of this project is to design and implement a robust data preprocessing system that addresses common challenges such as missing values, outliers, inconsistent formatting, and noise. By performing effective data preprocessing, the project aims to enhance the quality, reliability, and usefulness of the data for machine learning.

data-cleaning data-exploration data-preprocessing machine-learning numpy pandas-python python scikit-learn

Last synced: 11 Apr 2026

https://github.com/mani-prakash-n-r/stock_market_prediction_system

This project uses LSTM networks to predict stock prices based on historical data, providing insights for informed investment decisions. Built with LSTM, NumPy, Scikit-Learn, Matplotlib, yfinance, and TensorFlow.

lstm matplotlib numpy python sckiit-learn tensorflow yfinance

Last synced: 11 Apr 2026

https://github.com/mahmoudnamnam/superstore-analysis

This project explores the SuperStore dataset to uncover insights into sales, profit, and customer behavior. It identifies key trends, regional variations, and product performance, using data analysis and machine learning techniques to guide business strategy and optimize performance.

clustering data-analysis data-science data-visualization geopandas jupyter-notebook machine-learning numpy pandas plotly regression seaborn sklearn

Last synced: 12 Apr 2026

https://github.com/sufiyan1803/student-result-analysis

This project explores student exam data to find out what affects their scores. It covers everything from cleaning the data to analyzing score distributions, the effects of parental education and marital status, and comparisons by gender and lunch type. It’s a great way to see how different factors influence student performance.

matpolotlib numpy pandas python

Last synced: 03 Feb 2026

https://github.com/mostafa-ghorab/global-happiness-analysis

An analysis of global happiness rankings based on various factors like GDP, family support, health, and freedom from the World Happiness Report (2015-2017). This project provides data visualizations and statistical insights into how these factors influence happiness scores in different regions.

business-analysis data-analysis data-visualization matplotlib numpy pandas python seaborn

Last synced: 12 Apr 2026

https://github.com/thariniselvakumar/kidney-disease-prediction

This project predicts kidney disease using machine learning algorithms.

machine-learning matplotlib numpy pandas scikit-learn seaborn

Last synced: 12 Apr 2026

https://github.com/aadityasikder/neural-network-from-scratch-

This notebook demonstrates a neural network implementation using NumPy, without TensorFlow or PyTorch. Trained on the MNIST dataset, it features an architecture with an input layer (784 neurons), two hidden layers (132 and 40 neurons), and an output layer (10 neurons) with sigmoid activation.

backward-propagation evaluation forward-propagation mnist-dataset neural-network neural-networks-from-scratch numpy training

Last synced: 16 Apr 2026

https://github.com/emhash/numpy-tutorial

Simple practice tasks for Python's NumPy library

numpy numpy-library numpy-tutorial python

Last synced: 11 Apr 2026

https://github.com/shreyasmehta05/sortsync

A custom sorting algorithm combining parallel merge and count sort, with detailed performance comparisons against standard sorting techniques.

c matplotlib numpy python3

Last synced: 04 Feb 2026

https://github.com/ledsouza/python-powerbi

Develop a dashboard for financial market analysis using Python and Power BI

etl matplotlib numpy pandas powerbi python seaborn

Last synced: 12 Apr 2026

https://github.com/costrouc/xnd-notebooks

A collection of notebooks that demonstrate xnd, ndtypes, and gumath features

binder gumath ndtypes notebooks numba numpy xnd

Last synced: 16 Apr 2026

https://github.com/mnitin-reddy/summer-olympics-data-analysis-web-app

An interactive web app for exploring trends in Olympic Games history, analyzing overall medal tallies, country-wise performance, and athlete demographics. Built with Python and Streamlit, this app offers insights through visualizations and data-driven statistics.

dataanalytics matplotlib numpy pandas python seaborn streamlit

Last synced: 12 Apr 2026

https://github.com/shwetapardhi/assignment-03-q1--hypothesis-testing

Q1. An F&B manager wants to determine whether there is any significant difference in the diameter of the cutlet between two units. A randomly selected sample of cutlets was collected from both units and measured. Analyze the data and draw inferences at the 5% significance level. Please state the assumptions and tests that you carried out to check validit

hypothesis-testing numpy p-value pandas python scipy significance-testing stats t-test

Last synced: 11 Apr 2026

https://github.com/andersoncrs/prediccion_precio_vehiculos_statsmodels

This project uses a linear regression model to predict vehicle prices based on their main characteristics. The analysis includes problem definition, data exploration and cleaning, conversion of categorical variables to numeric ones, correlation evaluation, and model training.

analisis-de-datos analisis-exploratorio-de-datos matplotlib numpy seaborn statsmodels visualizacion-de-datos

Last synced: 26 Apr 2026

https://github.com/udhaya2823/dataspark-illuminating-insights-for-global-electronics

✨DataSpark✨ is a powerful analytics project transforming raw retail data into actionable insights for Global Electronics. By leveraging Python, SQL, and interactive visualizations, it uncovers trends in customer behavior, sales performance, and product popularity, driving smarter business decisions and boosting growth.

data-science data-visualization database-management datacleaning exploratory-data-analysis matplotlib numpy pandas powerbi python seaborn sql version-control

Last synced: 11 Apr 2026

https://github.com/jigyasag18/iit-guhawati-final-capstone-project

Smart Dynamic Parking Price Optimization System that adjusts parking fees in real-time based on demand, traffic, and competition. It employs adaptive pricing models and rerouting logic to enhance parking utilization and reduce congestion. The system is visualized via an interactive Streamlit dashboard, enabling users to simulate dynamic pricing.

bokeh bokeh-server bokehplots capstone-project data dataset deployment machine-learning machine-learning-algorithms matplotlib matplotlib-pyplot mlproject normalisation numpy pandas pathway python streamlit

Last synced: 05 Apr 2026

https://github.com/bhawnamehbubani/advanced-zomato-restaurant-clustering-and-sentiment-analysis

This project performs clustering and sentiment analysis on Zomato restaurant data, providing actionable insights for customers and business growth.

matplotlib-pyplot nltk numpy pandas seaborn sklearn wordcloud

Last synced: 05 May 2026

https://github.com/erikbrinkman/hilbert-bytes

A Python library for converting between d-dimensional points and indices on a Hilbert curve

hilbert-curve numba numpy python

Last synced: 08 May 2025

https://github.com/nadahamdy217/movies-data-etl-using-python-gcp

Developed a comprehensive ETL pipeline for movie data using Python, Docker, and a GCP Pub/Sub emulator. Successfully processed and published the data in a local Docker environment, showcasing advanced data engineering skills.

analytics data data-engineering data-ingestion data-preparation data-preprocessing data-processing data-project docker etl etl-pipeline gcp matplotlib matplotlib-pyplot numpy pandas pubsub python scipy seaborn

Last synced: 06 Jan 2026

https://github.com/magnusrodseth/disaster-tweets

📚 Assignments in the course IT3212 - Data Driven Software at NTNU. Our task is to classify whether a tweet is related to a disaster or not.

adaboost jupyter-notebook logistic-regression numpy pandas python python3 random-forest support-vector-machines xgboost

Last synced: 16 Apr 2026

https://github.com/lijesh010/employeeanalysis-

This is a simple data analysis project developed as part of my Entri Elevate Data Science and Machine Learning course. It serves as a practical demonstration of my skills in data analysis.

jupyter-notebook matplotlib numpy pandas python seaborn

Last synced: 12 Apr 2026

https://github.com/ashwin331133/liver_disease_detection

This dataset consists of 416 liver patient records and 167 non-liver patient records collected from the north east of Andhra Pradesh, India. The main objective of this project is to use classification algorithms to distinguish liver patients from healthy individuals.

data-visualization machine-learning numpy pandas python

Last synced: 16 Apr 2026

https://github.com/bunu23/image-classification

This repository contains a notebook implementing a Convolutional Neural Network for multi-class image classification using transfer learning with a pre-trained ResNet-50 model. Covers dataset handling, model architecture customization, training, evaluation, fine-tuning, and external image prediction.

keras matplotlib numpy pil python tensorflow

Last synced: 11 Apr 2026

https://github.com/dastogirrudro/machine-learning-and-deep-learning

This is my university thesis project. It uses machine learning and deep learning (an LSTM) to identify aggressive spam messages. It is written in Python with pandas, scikit-learn, and other frameworks, and it tries many algorithms to increase the accuracy of the project.

deep-learning lstm machine-learning numpy pandas python scikit-learn

Last synced: 11 Apr 2026

https://github.com/lorenzorottigni/ml-universities

Machine Learning Python bootcamp: K-means clustering with a public/private universities dataset

k-mean-clustering machine-learning numpy pandas python scikit-learn seaborn

Last synced: 05 Apr 2026

https://github.com/allanreda/telco-customer-churn-predictor-app

A web-based machine learning application that predicts customer churn using a logistic regression model. Built with Scikit-Learn for model training, Gradio for the user interface, and deployed on Google Cloud App Engine. The app allows users to input customer data and receive predictions on churn risk to support business decision-making.

app-engine data-visualization deployment google-cloud gradio hyperparameter-tuning logistic-regression machine-learning numpy pandas scikit-learn

Last synced: 16 Apr 2026

https://github.com/rizkyrajitha/ann_101

A really simple neural network for binary classification written from scratch using Python

jupyter-notebook numpy pandas

Last synced: 08 May 2026

https://github.com/alisonmitchell/countries-of-the-world

Exploratory Data Analysis of US Government data from the World Factbook to investigate GDP ($ per capita), Literacy (%) and Phones (per 1000).

data-science machine-learning matplotlib numpy pandas python seaborn

Last synced: 12 Apr 2026

https://github.com/constraintautomaton/client_pgu1

The client part of the PgU1 project. It commands the robot and does the video processing.

numpy opencv paramiko pyqt5 python3 raspberry-pi-3

Last synced: 12 Apr 2026

https://github.com/akashkg03/facial-expression-image-classification

This notebook builds a facial expression image classifier that categorizes facial expressions into one of seven emotions: anger, disgust, fear, happiness, sadness, surprise, and neutral.

jupiter-notebook numpy pandas python

Last synced: 28 Apr 2026

https://github.com/egorumaev/2023-cirrhosis-outcomes

Predicting treatment outcomes for patients with liver cirrhosis

catboost imblearn iqr lda matplotlib numpy pandas pca phik pipeline sklearn t-sne xgboost

Last synced: 08 May 2026

https://github.com/kahngjoonkoh/randomshapegenerator

A program that will generate images with random shapes and background colours. Can be customized and generated in bulk.

generative-art numpy opencv python threading tkinter

Last synced: 11 Apr 2026

https://github.com/ewakamoto/03.-python-projects

This is my collection of Python Projects

jupyter jupyter-notebook matplotlib numpy pandas python seaborn

Last synced: 12 Apr 2026

https://github.com/amanda-ucc/market-tracker

Automated portfolio creator that generates a portfolio which mimics a benchmark index by minimizing the tracking error.

beta covariance numpy pandas scipy sharpe-ratio tracking-error variance yfinance

Last synced: 11 Apr 2026

https://github.com/badranalyst/student-tests-data-analysis-application

Python-based analysis of student test scores in math, reading, and writing, examining correlations with parental education, lunch type, and test preparation. Includes data cleaning, visualization, and statistical insights into factors influencing academic performance.

data-analysis data-visualization dataset matplotlib numpy pandas python sklearn

Last synced: 05 May 2026

https://github.com/sothulthorn/stock-prediction-portal

The Stock Prediction Portal is a full-stack web application that demonstrates how to use machine learning models to analyze stock market data and forecast future trends.

axios django django-rest-framework javascript keras matplotlib numpy pandas python react tensorflow yfinance

Last synced: 02 Apr 2026

https://github.com/eduardoprofe666/mn-api

🐍📦 Python package with implementations of numerical methods

mn-api numerical-methods numpy pandas python scipy simpy tabulate

Last synced: 04 Jan 2026

https://github.com/nachtfeuer/covid19

Python script(s) for visualizing corona data

csv json matplotlib numpy pandas python requests tkinter

Last synced: 05 May 2026

https://github.com/hariprasath-v/av-job-a-thon-november-2022

Build a machine learning/deep learning approach to forecast the total energy demand on an hourly basis for the next 3 years based on past trends.

exploratory-data-analysis kaggle lightgbm-regressor matplotlib numpy pandas python rmse-score seaborn sklearn statsmodels timeseries-forecasting

Last synced: 12 Apr 2026

https://github.com/winterwind/ecg_signal_classification

Two-part project that involves detecting the R-peaks in an ECG signal to extract the individual ECG beats and making a machine learning model to classify them

csv csv-files data-science decision-trees ecg ecg-classification ecg-signal jupyter jupyter-notebook knearest-neighbors knn machine-learning matplotlib matplotlib-pyplot numpy pandas pyplot python random-forest scipy

Last synced: 11 Apr 2026

https://github.com/chintanboghara/rocket-simulation

A comprehensive web-based orbital mechanics simulator with advanced mission planning, real-time tracking, and educational features.

docker flask html javascript numpy plotly python

Last synced: 11 Apr 2026

https://github.com/alejoduarte23/bs6472

Serviceability assessment as per BS 6472 for human exposure to vibration in buildings (vibration sources other than blasting)

matplotlib numpy pydantic scipy

Last synced: 07 Jan 2026

https://github.com/rahatmoktadir03/customer-churn-prediction

A machine learning project for predicting customer churn, enabling businesses to identify at-risk customers and develop retention strategies.

business-analytics customer-churn-prediction data-science google-colab machine-learning numpy pandas python sklearn strreamlit xception-model

Last synced: 11 Apr 2026

https://github.com/aksoni07/movie-recommendation

A hybrid movie recommendation system designed to deliver personalized and accurate suggestions by combining user preferences, item attributes, and collaborative patterns, ensuring a seamless and engaging experience.

clustering content-based-filtering data-analysis embeddings jupyter-notebook numpy ollaborative-filtering pandas personalization python recommendation-systems scikit-learn user-item-interactions

Last synced: 11 Apr 2026

https://github.com/manikantasanjay/data_analysis_using_python_libraries_series

This series contains data analysis projects performed on different Kaggle datasets, providing valuable insights into the data using Python libraries.

exploratory-data-analysis kaggle-datasets machine-learning matplotlib numpy opendatasets pandas python visualization

Last synced: 12 Apr 2026

https://github.com/the-developer-306/fake-review-detector

This project is a machine learning-based review classification system that predicts whether a product review is GENUINE or FAKE. It preprocesses review text, analyzes sentiment, and uses numerical features like ratings and helpfulness to make predictions. The model is deployed via a Flask web application for user interaction.

classification flask logistic-regression machine-learning numpy pandas python renderdeploy scikit-learn sentiment-analysis

Last synced: 12 Apr 2026

https://github.com/andrewzhoumq123/sci_graph_api

Generates different types of detailed plots for scientific data

fastapi h5py json matplotlib numpy pandas scipy

Last synced: 12 Apr 2026

https://github.com/mgitrov/lifespan-x-population-status

A machine learning project aiming to predict animals' lifespan and population status.

bs4 data-science machine-learning matplotlib numpy pandas python regular-expressions requests seaborn sklearn

Last synced: 11 Apr 2026

https://github.com/ahmedabdalkreem/animals_classification

This project classifies faces of cats, dogs, and wild animals using multiple models, including a CNN and pretrained models such as ResNet and MobileNet. It shows each model's result on the image using a heatmap to reveal which part of the image drives the model's decision.

classification cnn heatmap matplotlib mobilenet numpy opencv pandas pretrained-models python restnet

Last synced: 12 Apr 2026

https://github.com/chandkund/sentiment-analysis-using-nlp

This project focuses on Sentiment Analysis using the textual content from product reviews. The goal is to analyze user sentiments based on their written feedback, particularly focusing on the "reviewText" column in the dataset.

lemmatization matplotlib nlp nltk numpy pandas seaborn tokenization

Last synced: 31 Jan 2026

https://github.com/swarnabhaghosh/house-price-prediction-model

Built an end-to-end regression pipeline to predict house prices using Linear Regression with automated preprocessing (PowerTransform, StandardScaling) via Scikit-learn's Pipeline and ColumnTransformer.

column-transformer linear-regression matplotlib-pyplot numpy pandas pipeline python scikit-learn seaborn

Last synced: 11 Apr 2026

https://github.com/soumyapro/heart-disease-prediction

This project is aimed at predicting the likelihood of coronary heart disease (CHD) in individuals over the next ten years using Logistic Regression.

logistic-regression numpy pandas simpleimputer sklearn

Last synced: 21 Jan 2026

https://github.com/arthurdsant/dataanalysis-agricultural_raw_material

This Python project performs analysis and visualization of agricultural raw material price data using a Kaggle dataset. Based on Jupyter Notebook and Python.

jupyter-notebook matplotlib numpy pandas python seaborn

Last synced: 26 Jan 2026

https://github.com/amanyadav-07/customer-churn-prediction

Machine Learning project to predict customer churn using Logistic Regression, Random Forest, and XGBoost. Includes data preprocessing, feature engineering, SMOTE balancing, model training, evaluation, and business insights.

accuracy-metrics data-analysis data-visualization logistic-regression machine-learning matplotlib numpy pandas python3 random-forest-classifier seaborn sklearn xgboost-classifier

Last synced: 11 Apr 2026

https://github.com/nakshjainsonigara/football-eda

This EDA offers a comprehensive exploration of football analytics, leveraging Python libraries to analyze players, club games, and various other aspects of the sport. Through statistical analysis and visualization, we uncover insights into player performance, club strategies, and league dynamics providing valuable insights for football enthusiasts.

dash exploratory-data-analysis matplotlib numpy pandas plotly python scipy seaborn statistics

Last synced: 07 Jan 2026

https://github.com/nishi1612/knight-tour-problem

IT485 Logic of Inference project on the Knight's Tour. A Hamiltonian path problem: determine a path for the knight to traverse the entire chessboard, visiting every cell exactly once.

backtracking-algorithm bootstrap flask html knights-tour localhost numpy pygame python tkinter warnsdorff

Last synced: 11 Apr 2026

https://github.com/audy21/datacamp

Learning portfolio documenting my progress while taking Data Analyst and Data Science certifications from DataCamp.

data-analysis data-science machine-learning matplotlib numpy pandas python scikit-learn seaborn

Last synced: 11 Apr 2026

https://github.com/gregoritsch3/exercise_pandas_1

A Pandas exercise demonstrating the loading, cleaning, and reorganization of data in .xlsx or .csv files, descriptive statistics, data visualization, statistical approximation of data with the normal distribution, etc. Libraries include Pandas, NumPy, SciPy, SymPy, Matplotlib,

data-cleaning data-visualization descriptive-statistics matplotlib numpy pandas scipy sympy

Last synced: 12 Apr 2026

https://github.com/baranwaljii/data-science-projects

This repository contains data science projects based on Python and Jupyter Notebook

data-science jupyter-notebook machine-learning numpy pandas python

Last synced: 09 Apr 2026

https://github.com/ieCecchetti/Python_ML_DL_examples

A variety of machine learning and deep learning scripts in Python. Some theoretical background is included in the README.

bayes-classifier bayesian-statistics deep-learning kernel machine-learning matplotlib neural-network numpy pandas pca pca-analysis python scikitlearn-machine-learning scipy shi

Last synced: 10 Mar 2025

https://github.com/simonusher/multi-layer-perceptron

My solution to an assignment on the multi-layer perceptron. Implemented in Python and NumPy. Contains the forward and backward pass code, as well as some utility functions.

multi-layer-perceptron neural-network numpy python

Last synced: 19 Apr 2026

https://github.com/abideen-olawuwo/logistic-regression-house-prediction

Using a logistic regression model to predict house prices

logistic-regression matplotlib numpy pandas python seaborn sklearn

Last synced: 12 Apr 2026

https://github.com/talapanenivarshithchowdary/asteroid-detection-ml

This project uses Machine Learning to detect and classify asteroids based on trajectory and size, aiding in Near-Earth Object detection and planetary defense.

classification data-science decision-trees jupyter-notebook knn logistic-regression machine-lea matplotlib numpy pandas pillow prediction python3 random-forest scikit-learn

Last synced: 11 Apr 2026

https://github.com/eljandoubi/genre_classification

Create an ML pipeline for Genre Classification using MLflow.

hydra machine-learning mlflow numpy pandas pandas-profiling pytest scikit-learn scipy wandb

Last synced: 11 Apr 2026

https://github.com/nilayhangarge/data-analysis-with-python

This repository provides a practical introduction to data acquisition and analysis using Pandas. It covers loading datasets, exploring data, manipulating data, and gaining insights through statistical summaries. Ideal for beginners, it offers code examples and explanations to enhance your data manipulation skills using Pandas for Python.

data-acquisition data-analysis data-analytics data-binning data-cleaning data-engineering data-fundamentals data-insights data-integration data-preprocessing data-science data-wrangling numpy pandas python

Last synced: 12 Apr 2026

https://github.com/vishnu-vamshii/heart-disease-prediction-using-ml

This project presents an end-to-end data analysis and machine learning pipeline for predicting heart disease using a publicly available dataset. The project includes data exploration, visualization, and implementation of various machine learning models to predict the likelihood of heart disease based on a set of clinical attributes.

machine-learning matplotlib numpy pandas python seaborn sklearn

Last synced: 11 Apr 2026

https://github.com/charles-l/rayboi

A raytracer written in Futhark/Python

futhark numpy pathtracing python3 raytracing

Last synced: 19 Apr 2026

https://github.com/shreeya-methuku/predictive_analysis

This project leverages machine learning to perform time-series forecasting on key financial metrics, including sales (income), expenses, cash flow, and profit.

arima financial-forecasting numpy pandas sarima xgboost

Last synced: 20 Apr 2026

https://github.com/ngangawairimu/data-validation-using-python

An agricultural dataset validated for use with Python code. Builds a data pipeline that ingests and cleans data at the press of a button.

jupyter-notebook numpy pandas pytest python

Last synced: 13 Apr 2026

https://github.com/khaymanii/calories-burnt-prediction-model

This model was built using Python and the XGBoost regression algorithm

matplotlib numpy pandas python scikit-learn

Last synced: 16 Apr 2026

https://github.com/konynour/facedectection

This project demonstrates real-time face detection using OpenCV and a pre-trained SSD (Single Shot MultiBox Detector) model. The system captures video from a camera, processes the frames, detects faces, and displays the results with bounding boxes and confidence scores. This application showcases the use of deep learning models for object detection

cv2 jupyter-notebook lambda-functions matplotlib numpy opencv python3

Last synced: 12 Apr 2026

https://github.com/wisdom-osborn/data-analytics-course-online-

🔍 Data Analytics with Python: Hands-on Course Materials. Jupyter notebooks, projects, and datasets based on the freeCodeCamp Data Analysis with Python certification. Learn NumPy, Pandas, data cleaning, and visualization through real-world examples.

data data-analysis data-science data-visualization freecodecamp numpy pandas pandas-dataframe project python

Last synced: 19 Apr 2026

https://github.com/aqueeqazam/numpy-for-mathematical-computing

You're at the perfect spot if you're interested in developing your career in data science or machine learning, or if you just enjoy having fun with mathematical operations.

mathematics numpy numpy-arrays

Last synced: 19 Apr 2026

https://github.com/chenshuliu/python-guide-to-numpy

This repository is dedicated to explaining the functionality of NumPy in Python

data-structures numpy python

Last synced: 18 Jan 2026