An open API service indexing awesome lists of open source software.

NumPy

NumPy is an open source library for the Python programming language, adding support for large, multidimensional arrays and matrices, along with a large collection of high-level mathematical functions to operate on these arrays.

https://github.com/christabelsakyi/sentiment_analysis

This project involves analyzing customer reviews to classify them as positive or negative using Logistic Regression. The workflow includes text preprocessing, feature extraction, training a model, making predictions, and evaluating its performance. Dataset

imblearn machine-learning nltk numpy python sklearn

Last synced: 07 May 2026

https://github.com/lmizner/codecademy_biodiversity

Using knowledge of data analysis to interpret data about endangered animals for the National Park Service

barplot jupyter-notebook matplotlib-pyplot numpy pandas python seaborn

Last synced: 30 Apr 2026

https://github.com/ola-kaznowska/summer-programming-camp-gp-travel-2024

Projects implemented at summer programming camp https://github.com/ArsBeneMoriendi

ai artificial-intelligence cv2 json math neural-network numpy python3 tensorflow

Last synced: 30 Apr 2026

https://github.com/johnrobertsantos/exploratory-data-analysis-using-python

This project represents the culmination of my Python learning journey.

jupyter matplotlib numpy pandas python seaborn

Last synced: 30 Apr 2026

https://github.com/manorohith-03/attendance-system-using-face-recognition

The system leverages the Python programming language and several powerful libraries for image processing and face recognition. The ultimate goal is to provide a more efficient, accurate, and user-friendly attendance tracking solution.

face-recognition numpy opencv pandas python-3

Last synced: 30 Apr 2026

https://github.com/badranalyst/e-commerce-customer-analysis-data-science-foundations-case-study

This case study explores e-commerce customer data through data exploration, pre-processing, and splitting. It includes model building and training to analyze customer behavior. Python libraries like Pandas, NumPy, Matplotlib, and Seaborn are used for the analysis and model development.

data-analysis data-science dataset eda exploratory-data-analysis machine-learning matplotlib ml model-building model-training numpy pandas pre-processing python seaborn

Last synced: 01 May 2026

https://github.com/bjpcjp/optimize-mktg-spend-with-lagrange

simple demo of marketing (2 channel) spend optimization

lagrange matplotlib matplotlib-pyplot numpy python scipy

Last synced: 01 May 2026

https://github.com/macnianios/fandango

a udemy project for seaborn data visualizations and basic pandas commands

numpy pandas python seaborn seaborn-plots udemy-course-project

Last synced: 01 May 2026

https://github.com/vsrc/project-cardionaut

AI predictor for Heart Attack likelihood

numpy pandas pickle python streamlit

Last synced: 01 May 2026

https://github.com/aimin-nur/dataanalyst-cohortanalysis

A data analyst project to find out customer purchasing patterns over a given period of time (Cohort Analysis).

cohort-analysis numpy pandas pytz

Last synced: 07 May 2026

https://github.com/splithor1zon/edu-barriermethodlp

Educational material for learning and teaching Linear Programming using the Barrier Method. Presentation with commentary and Python code available.

barrier-method convex-optimization cvxpy educational-project linear-programming numpy presentation python

Last synced: 01 May 2026

https://github.com/sairupeshl/portfolio-risk-calculator

A portfolio risk engine using Python, integrating the Yahoo Finance API to ingest and preprocess historical multi-asset price data

numpy pandas python yfinance

Last synced: 08 Jun 2026

https://github.com/zsailer/np2d

Common 2-D Numpy operations

array numpy

Last synced: 01 May 2026

https://github.com/hghyhghy/notebook

Repo consisting of NumPy, Matplotlib, and PyMongo aggregation

jupyter-notebook matplotlib numpy pymongo python-3

Last synced: 01 May 2026

https://github.com/sandip2512/play-store-app-review-analysis

EDA on Play Store apps and user sentiment reviews

eda heatmaps matplotlib numpy pandas plotly seaborn sentiment-analysis

Last synced: 01 May 2026

https://github.com/ahmedsamir45/demograghic-data-analyzer

answer some questions about the data

demographic-data-analyzer numpy pandas

Last synced: 01 May 2026

https://github.com/4strium/predict-french-tv-audience

🎬 A beginner's machine learning project whose aim is to predict the audience for films on TV in France, based on previous data provided by the CNC and Médiamétrie.

machine-learning numpy pandas python tv xgboost

Last synced: 07 May 2026

https://github.com/mohhasan1/machine-learning

This repository is where I learn and store information about various machine learning libraries, such as Pandas, scikit-learn, and others. It serves as a personal reference for exploring and experimenting with different ML tools and techniques.

machine-learning matplotlib numpy pandas seaborn sklearn

Last synced: 01 May 2026

https://github.com/cbhihe/nn_binary-classifier

NN-based binary classification with logistic regression loss, back-propagation with gradient descent and optional regularization

classifier gradient-descent logistic-regression neural-network numpy python regularization sklearn

Last synced: 01 May 2026

https://github.com/abdoomohamedd/python-data-analysis-projects

A collection of data analysis projects that I have worked on. Each project involves cleaning data, performing exploratory data analysis (EDA), and creating visualizations to extract insights. The projects utilize various Python libraries, including pandas, numpy, matp

data-analysis data-analysis-python data-cleaning data-visualization matplotlib matplotlib-pyplot numpy pandas python

Last synced: 01 May 2026

https://github.com/louisblankemeier/numpytex

LaTeX script generator that produces an aesthetically appealing table from a NumPy array.

latex numpy script-generator table

Last synced: 07 May 2026

https://github.com/emmanueltobi/ai-academy-university

Code samples from the Intel Nervana AI Academy University held in FUTA

ai classification-algorithm dataset intel numpy regression regression-models sklearn

Last synced: 01 May 2026

https://github.com/hansalemaos/a_pandas_ex_image_tools

Uses pandas/numpy/numexpr for operations on pictures - very fast

colors image locate numexpr numpy opencv pictures python

Last synced: 01 May 2026

https://github.com/harshavardhanbommalata/knn_tutorial

K-Nearest Neighbors (KNN) is a simple yet powerful machine learning algorithm. Unlike models that learn parameters during training, KNN uses lazy learning—it stores the dataset and predicts by finding the closest neighbors, making decisions through majority voting.

machine-learning-algorithms mathematical-modelling matplotlib numpy pandas

Last synced: 20 Jun 2026

https://github.com/dkm/stlsplit

Tool to split an STL file into several objects

numpy python stl

Last synced: 17 Jun 2026

https://github.com/deusares/funpil

A collection of PIL, Numpy and OpenCV functions to manipulate images

batch image-processing lazy media numpy opencv pil pillow python script

Last synced: 01 May 2026

https://github.com/satyamtripathi8/tools_for_data_science

Introduction to Data Science Tools(Python)

data-analysis-python matplotlib-pyplot numpy pandas

Last synced: 01 May 2026

https://github.com/offthetab/nn_homework

Practical assignments on neural networks, MIREA, 3rd year.

gensim ignite matplotlib neural-network nltk numpy pandas scipy-stats seaborn torch torchvision

Last synced: 01 May 2026

https://github.com/dostuffthatmatters/jax-capability-analysis

demo repository containing the experiments for my master's seminar @ TUM

autograd click deep-learning huggingface jax jit-compilation mnist numpy python pytorch

Last synced: 01 May 2026

https://github.com/mayankanand007/betago

Platform for tracking Go moves using Computer Vision. (StarterHacks 2018 submission)

java matlab matplotlib numpy opencv-python python3

Last synced: 01 May 2026

https://github.com/dhruvv1402/spam-detection-python-

This project is a Spam Detection System built using Python. It classifies SMS messages as spam or ham (not spam) using machine learning techniques.

countvectorizer kaggle-dataset nlp-machine-learning nltk numpy pandas python scikit-learn supervised-machine-learning tf-idf

Last synced: 01 May 2026

https://github.com/azaz9026/python_library_numpy

NumPy is a Python library used for working with arrays. It also has functions for working in the domains of linear algebra, Fourier transforms, and matrices...

array jupyter-notebooks numpy python3

Last synced: 08 May 2026

https://github.com/anubagre/predicting-price-of-pre-owned-cars

This project is a solution to a data science problem, using a regression model to predict the price of pre-owned cars. The programming language used is Python.

data-science machine-learning numpy pandas price-prediction python seaborn sklearn streamlit

Last synced: 11 Apr 2026

https://github.com/rgandhi8/inventory-management-system

This project is an AI-driven Inventory Management System designed to optimize supply chain efficiency by leveraging machine learning techniques for demand forecasting. The system integrates ARIMA and ANNs to predict inventory needs, minimize shortages, and reduce holding costs. It also enhances decision-making by automating inventory planning.

dashboard inventory-management-system machine-learning numpy pandas python seaborn

Last synced: 08 Jun 2026

https://github.com/Gregoritsch3/Exercise_Pandas_1

A Pandas exercise demonstrating the loading, cleaning, and reorganization of data in .xlsx or .csv files, descriptive statistics, data visualization, statistical approximation of data with the normal distribution, etc. Libraries include Pandas, NumPy, SciPy, SymPy, Matplotlib,

data-cleaning data-visualization descriptive-statistics matplotlib numpy pandas scipy sympy

Last synced: 01 May 2025

https://github.com/ducha-aiki/laf2ell

Utils for LAF-to-ellipse conversion and back. Also includes a function for drawing LAFs.

ellipse laf local-features numpy visualization

Last synced: 02 May 2026

https://github.com/v-goncharenko/freespace

Implementation of signal propagation in free space

numpy python signal-processing

Last synced: 02 May 2026

https://github.com/mgitrov/ann-from-scratch

An artificial neural network implemented from scratch using numpy only.

artificial-neural-networks deep-learning guide numpy

Last synced: 02 May 2026

https://github.com/lipskiyp/trading_dashboard

Pandas framework to analyze financial time-series and calculate various performance metrics (e.g. Sharpe Ratio and Max Drawdown Duration).

financial-analysis numpy pandas

Last synced: 02 May 2026

https://github.com/shreeparab1890/unicorns-of-india-till-sep-2022-analysis-eda

This ipython notebook is the Exploratory data analysis (EDA) of the Unicorns of India till Sep 2022.

analysis data-analysis eda exploratory-data-analysis matplotlib-pyplot numpy pandas plotly

Last synced: 02 May 2026

https://github.com/parham1998/linear_regression_pytorch

Implementation of a simple Linear Regression model with Numpy & PyTorch library

backpropagation gradient-descent linear-regression loss-functions numpy python pytorch

Last synced: 02 May 2026

https://github.com/amishidesai04/emergency-calls-data-analysis-project

Welcome to the Emergency Calls Data Analysis project repository. This project is dedicated to extracting, processing, and visualizing data from the "Emergency – 911 Calls, Montgomery County" dataset, sourced from Kaggle. The main objective is to analyze trends in emergency calls in Montgomery County, Pennsylvania, spanning multiple years.

analysis data-analysis data-extraction data-processing data-science data-visualization numpy pandas python seaborn

Last synced: 02 May 2026

https://github.com/fabriciocovalesci/sistema-vendas-graficos

Simple sales and inventory system, with charts using Matplotlib

matplotlib matplotlib-pyplot numpy python-script python3 venv virtualenv

Last synced: 02 May 2026

https://github.com/bharath-tars/streamlit_diabsynth

Predictive Health Analytics for Diabetic Risk Assessment and Personalized Reporting WebApp using Streamlit

deeplearning mahine-learning numpy onrender-deploy pandas seaborn streamlit

Last synced: 02 May 2026

https://github.com/momo790-m/startup-profit-prediction

Machine learning project to predict profits of new startups based on R&D, Admin, Marketing, and State data

assumptions business colab-notebook linear-regression machine-learning matplotlib numpy pandas preprocessing seaborn skit-learn sklearn

Last synced: 08 May 2026

https://github.com/andremenezesds/machine_learning_test

Testing among various Machine Learning models and parameters, in order to further study their behaviour for Classification, Regression and Clustering analysis.

classification-models clustering-models machine-learning numpy pandas regression-models sklearn

Last synced: 02 May 2026

https://github.com/akansharajput280799/data-exploration-and-visualization-for-streaming-platform

Data Analysis and Visualization for streaming platform to provide insights and recommendations to improve their userbase.

data-visualization excel exploratory-data-analysis matplotlib numpy python seaborn

Last synced: 08 May 2026

https://github.com/amiegirl/ford_gobike-tripdata

This data set includes information about individual rides made in a bike-sharing system covering the greater San Francisco Bay area.

jupyter-notebook matplotlib numpy pandas python seaborn

Last synced: 08 May 2026

https://github.com/baranovserv/h-s-diagram

Parser of data for constructing H-S diagrams for substances

diagrams json matplotlib numpy parser requests

Last synced: 02 May 2026

https://github.com/project54321/stocksniper

"An interactive Stock Dashboard that visualizes historical prices, calculates key financial metrics, displays fundamental statements, and aggregates the latest news for any ticker — all in a sleek, easy-to-use Streamlit interface."

alphavantage finance numpy pandas plotly pypi python stock-market stocknews streamlit

Last synced: 11 Apr 2026

https://github.com/himanshugoyal77/movie-recommendation

Cosine similarity based Recommendation Engine

nltk numpy pandas requests-library-python tmdb-api vectorization

Last synced: 08 May 2026

https://github.com/ifte-13/digit-classification-with-mnist

MNIST Digit Classification: Comparative Analysis of Optimization Algorithms

matplotlib mnist-dataset numpy tensorflow

Last synced: 02 May 2026

https://github.com/baschin1103/machine-learning-linear-regression-tsi

The model is a multiple linear regression from scikit-learn. The aim is to create a model to forecast the so-called Total Skill Index (TSI) of a player from hattrick.org. More information: https://wiki.hattrick.org/wiki/Total_Skill_Index. This repository contains the program and the necessary data.

csv linear-regression machine-learning numpy pandas python statistics

Last synced: 11 May 2026

https://github.com/akashash01/decision_tree

A very specific type of probability tree that enables you to make a decision about a process (chance event outcomes, resource costs, utility).

data-science decision-tree-classifier machine-learning numpy pandas-dataframe python sklearn

Last synced: 11 May 2026

https://github.com/sahil210695/numpy-basics

A brief overview of NumPy basics

data-science numpy numpy-tutorial python

Last synced: 08 May 2026

https://github.com/mani-prakash-n-r/stock_market_prediction_system

This project uses LSTM networks to predict stock prices based on historical data, providing insights for informed investment decisions. Uses LSTM, NumPy, scikit-learn, Matplotlib, yfinance, and TensorFlow.

lstm matplotlib numpy python sckiit-learn tensorflow yfinance

Last synced: 11 Apr 2026

https://github.com/bhavna-kale/cars-eda-project

Project analyzing used car market data to identify high-impact price drivers and depreciation curves, presented through an interactive web application.

data-analysis excel matplotlib numpy pandas python3 searborn streamlit

Last synced: 03 May 2026

https://github.com/krumyakimov/house-price-developments-in-bulgaria

I developed this project for my final exam in the Data Science course at SoftUni.

data-science jupyter-notebook matplotlib numpy pandas scipy seaborn

Last synced: 03 May 2026

https://github.com/raduldev/biopython

Biopython record work and a few extra coding exercises

bioinformatics biopython gromacs jupyter-notebook matplotlib numpy pandas python3

Last synced: 03 May 2026

https://github.com/kisaa-fatima/foreground-segmentation-using-kmeans-face-recognization-using-knn

Implement a basic version of the interactive image cut-out/segmentation approach called Lazy Snapping. The program uses K-Means Clustering to segment images into foreground and background based on user-provided seed pixels.

face-recognition foreground-segmentation kmeans-clustering knn-classification matplotlib numpy opencv2 pil

Last synced: 03 May 2026

https://github.com/alexbie98/bietorch

neural nets with numpy

autograd neural-nets numpy torch

Last synced: 03 May 2026

https://github.com/sunnyrao07/water-quality-analysis

A machine learning project that predicts water potability based on chemical and physical attributes, using models like Logistic Regression, Random Forest, and XGBoost.

data-cleaning label-encoding logistic-regression matplotlib model-evaluation numpy pandas pyhton random-forest sckiit-learn seaborn smote standard-scaler xgboost

Last synced: 08 May 2026

https://github.com/fandredev/ml-my-guide

my own annotations about ML/DS using pandas, matplotlib, numpy, scikit learn

anaconda matplotlib numpy pandas plotly scikit-learn seaborn

Last synced: 03 May 2026

https://github.com/djdhairya/crop-recommendation

Crop Recommendation System is a powerful tool for enhancing agricultural decision-making. By leveraging data-driven insights, it empowers farmers to maximize yield and ensure sustainable practices.

adaboostclassifier bagging-classifier csv decision-trees gaussian html knn-classification logistic-regression machine-learning machine-learning-algorithms matplotlib model numpy pandas random-forest random-forest-classifier scikit-learn seaborn svc

Last synced: 11 Apr 2026

https://github.com/daniel-brai/alx-t-x-udacity-data-analytics-nanodegree

A 3-month nanodegree going through the intricacies of data analysis with Python and its various libraries, such as Pandas, NumPy, Matplotlib, seaborn, and others

matplotlib numpy pandas python3

Last synced: 08 May 2026

https://github.com/emhash/numpy-tutorial

Simple practice tasks for the NumPy library in Python

numpy numpy-library numpy-tutorial python

Last synced: 11 Apr 2026

https://github.com/satvikpraveen/numpymasterpro

A hands-on, production-ready toolkit to master NumPy — from first principles to real-world applications. Includes modular Jupyter notebooks, reusable utility scripts, cheatsheets, and advanced projects like K-Means clustering from scratch.

broadcasting data-analysis data-science data-source data-visualization jupyter-notebook kmeans-clustering linear-algebra machine-learning matrix-algebra numerical-computation numpy numpy-broadcasting numpy-examples numpy-tutorial open-source python scientific-computing standardization vectorization

Last synced: 08 May 2026

https://github.com/shwetapardhi/assignment-03-q1--hypothesis-testing

Q1. An F&B manager wants to determine whether there is any significant difference in the diameter of the cutlet between two units. A randomly selected sample of cutlets was collected from both units and measured. Analyze the data and draw inferences at the 5% significance level. Please state the assumptions and tests that you carried out to check validit

hypothesis-testing numpy p-value pandas python scipy significance-testing stats t-test

Last synced: 11 Apr 2026

https://github.com/hansalemaos/a_pandas_ex_fastloc

Up to 25 times faster than df.loc by combining np.select and numexpr.evaluate (works with utf-8)

dfloc fast iloc loc numexpr numpy pandas python

Last synced: 03 May 2026

https://github.com/farhannaeem00/data-visualization-practice

Learning matplotlib fundamentals - seeking feedback and code reviews

jupyter matplotlib numpy pandas python

Last synced: 03 May 2026

https://github.com/nak-dt/deep-decision-forest

A custom ensemble tree model inspired by Deep Forest

ai machine-learning machine-learning-algorithms numpy pandas python sklearn

Last synced: 03 May 2026

https://github.com/jsnevt/decisiontreecls

Data Science - Random trees - RandomForestClassifier

decision-tree-classifier matplotlib numpy pandas seaborn

Last synced: 03 May 2026

https://github.com/codeslash21/tmdb_data_analysis

We analyzed the TMDB dataset, which contains details of around 11,000 movies, to find some interesting facts about it.

data-analysis data-visualization matplotlib nanodegree-project numpy pandas python tmdb-movie

Last synced: 03 May 2026

https://github.com/akhil-krishnan-r/super_market_analysis

Supermarkets are growing in the most populated cities, and market competition is high. This dataset contains the historical sales of a supermarket company, recorded in 3 different branches over 3 months. Predictive data analytics methods are easy to apply to this dataset.

data-visualization matplotlib numpy pandas seaborn

Last synced: 03 May 2026

https://github.com/abrarshahok/electric-vehicle-charging-station-energy-consumption-prediction

With the rapid adoption of electric vehicles, optimizing energy usage at charging stations has become crucial for improving operational efficiency and ensuring customer satisfaction. This tool leverages predictive modeling to forecast energy consumption for charging sessions based on various input features.

matplotlib numpy pandas plotly python3 scikit-learn xgboost

Last synced: 09 Jun 2026