An open API service indexing awesome lists of open source software.

NumPy

NumPy is an open source library for the Python programming language, adding support for large, multidimensional arrays, and matrices, along with a large collection of high-level mathematical functions to operate on these arrays.

https://github.com/parham1998/linear_regression_pytorch

Implementation of a simple Linear Regression model with Numpy & PyTorch library

backpropagation gradient-descent linear-regression loss-functions numpy python pytorch

Last synced: 02 May 2026

https://github.com/krook1024-uni-archive/kmeans

kmeans implementation for a university course

kmeans kmeans-clustering machine-learning numpy python

Last synced: 02 May 2026

https://github.com/vermenea/masters---ad-signaling-pathway-using-hsp90-inhibitor

Influence of 17-AAG a Hsp90 inhibitor on signaling pathway in Atopic Dermatitis

matplotlib numpy pandas python seaborn statmodels

Last synced: 02 May 2026

https://github.com/ifte-13/digit-classification-with-mnist

MNIST Digit Classification: Comparative Analysis of Optimization Algorithms

matplotlib mnist-dataset numpy tensorflow

Last synced: 02 May 2026

https://github.com/badranalyst/movie-correlation-analysis-in-python

This project analyzes movie data correlations using Python libraries like Pandas, NumPy, Seaborn, and Matplotlib. It examines relationships between attributes such as ratings, genres, and box office performance to uncover trends that inform recommendations and enhance understanding of movie success factors.

data data-analysis dataset jupyter jupyter-notebook matplotlib matplotlib-pyplot numpy pandas python seaborn

Last synced: 03 May 2026

https://github.com/helenaden/data-science-fundamentals

This project delves into fundamental data science concepts using Python libraries like NumPy and Pandas

data-analysis datascience datasets datavisualization datawrangling heatmap numpy pandas patterns python

Last synced: 03 May 2026

https://github.com/krumyakimov/house-price-developments-in-bulgaria

I developed this project for my final exam in the Data Science course at SoftUni.

data-science jupyter-notebook matplotlib numpy pandas scipy seaborn

Last synced: 03 May 2026

https://github.com/isinghabhishek/data_analysis_with_python

Introduction to Data Analysis covering the basics of Python, Numpy, Pandas, Data Visualization, and Exploratory Data Analysis.

data-visualization exploratory-data-analysis numpy pandas python

Last synced: 03 May 2026

https://github.com/chethannazre/railway-track-obstacle-detection-system

RAILWAY TRACK OBSTACLE DETECTION AND AUTOMATION (RAILTECH AUTOMATION)

automation cv2-library internship numpy ppt project python report

Last synced: 03 May 2026

https://github.com/hansalemaos/a_pandas_ex_fastloc

Up to 25 times faster than df.loc by combining np.select and numexpr.evaluate (works with utf-8)

dfloc fast iloc loc numexpr numpy pandas python

Last synced: 03 May 2026

https://github.com/nak-dt/deep-decision-forest

A custom ensemble tree model inspired by Deep Forest

ai machine-learning machine-learning-algorithms numpy pandas python sklearn

Last synced: 03 May 2026

https://github.com/jsnevt/decisiontreecls

Data Science - Árvores aleatórias - RandomForestClassifier

decision-tree-classifier matplotlib numpy pandas seaborn

Last synced: 03 May 2026

https://github.com/codeslash21/tmdb_data_analysis

We analysed TMDB dataset which contains around 11000 movies details. We analyzed to find some interesting facts about the dataset.

data-analysis data-visualization matplotlib nanodegree-project numpy pandas python tmdb-movie

Last synced: 03 May 2026

https://github.com/vinicius999/icu-beds-forecast-covid-19

Predição da ocupação de leitos de UTI no Brasil devido a pandemia do COVID-19, usando Support Vector Regression (SVR)

covid-19 matplotlib numpy pandas python sklearn support-vector-regression

Last synced: 03 May 2026

https://github.com/kaustavmodak/business-aided-customer-feedback-assessment-system

A Streamlit-based sentiment analysis app that classifies customer reviews into Positive, Neutral, or Negative using a pre-trained ML mode

framework machine-learning matplotlib nlp nltk numpy pandas pickle regex scikit-learn seaborn sentiment-analysis streamlt tfidf-vectorizer

Last synced: 03 May 2026

https://github.com/carol-neto/sprint-9-making-data-driven-business-decisions

In this project I evaluate hypotheses using ICE and RICE and perform A/B testing analysis.

abtesting dataanalysis hypothesis-testing matplotlib numpy pandas python scipy-stats seaborn

Last synced: 03 May 2026

https://github.com/muskanmi/data_analysis_python

Data analysis on students result dataset using python libraries.

boxplot countplots data-analysis numpy pandas pie-chart python3 seaborn

Last synced: 03 May 2026

https://github.com/syed-m-nofel/python-data-science-fundamentals

Python notebooks for data manipulation (Pandas/NumPy) and API workflows – from basics to practical examples.

api beginner-friendly data-analysis data-science http-requests jupyter-notebook numpy pandas pandas-dataframe python tutorial

Last synced: 03 May 2026

https://github.com/ankitgmishra/machinelearning

Continuously deep diving in understanding & advancing my expertise in Machine Learning through ongoing education and hands on experience with practical learning.

artificial-intelligence data-analysis data-cleaning data-gathering machine-learning machinel-learning-algorithms matplotlib numpy pandas python seaborn

Last synced: 03 May 2026

https://github.com/asayda01/airport_operations_analytics

The project analyzes the performance of Hold Baggage Screening X-ray Machines, identifying trends in system load, bottlenecks, and operator efficiency. Using an interactive Streamlit app, users can explore data-driven insights to optimize airport screening processes and improve overall operational performance.

data-science dataanalysis datainsights datavisualization numpy pandas python streamlit

Last synced: 03 May 2026

https://github.com/hansalemaos/np_zoom_in

Zoom in on a picture

image numpy python zoom

Last synced: 03 May 2026

https://github.com/nicolashaubricht/analise-dados-pib-per-capita

Este projeto tem como objetivo a análise de dados sobre as variações do PIB per capita em diversos países ao longo dos anos.

numpy pandas plotly python

Last synced: 03 May 2026

https://github.com/atchayaah/home-value-insights-kc

Data-driven project predicting King County housing prices using EDA, regression models, and ML techniques, developed as part of IBM’s Data Analysis with Python course on Coursera.

joblib matplotlib numpy pandas pickle python scikit-learn seaborn

Last synced: 03 May 2026

https://github.com/grachale/convolution_image

The implementation of a function that takes an image (RGB or gray one) and a filter (kernel) and applies a convolution between them.

convolution image-processing numpy python

Last synced: 03 May 2026

https://github.com/akashparley/ipl-data-analysis

The IPL Data Analysis project focuses on extracting valuable insights from IPL match data using various data analytics techniques. By analyzing historical match outcomes, player performances, team comparisons, and venue statistics, the project visualizes trends and patterns through graphs like bar charts, line graphs, and scatter plots.

analytics data-visualization excel metplotlib numpy pandas python seaborn tableau-dashboards

Last synced: 04 May 2026

https://github.com/egorumaev/2024-ab-tests-metrics

Анализ соответствия размера выборки и плановых значений метрик A/B-теста

latex monte-carlo-method numpy pandas statmodels

Last synced: 04 May 2026

https://github.com/anas436/ibm-cloud-data-analysis-of-cars

Exploratory Data Analysis of Cars in Python

ibmwatson matplotlib numpy pandas python3 seaborn

Last synced: 04 May 2026

https://github.com/aymanmomin/facebook-ego-network-analysis-for-political-outreach-optimization

Analyzing Facebook ego networks to identify influential users and communities for optimizing political advertising. Leverages network science techniques (Louvain communities, centrality metrics, threshold models) to simulate influence spread.

community-detection course-project facebook-network-analysis louvain-algorithm louvain-community-detection matplotlib network-analysis network-graphs networkx-library numpy pandas python3 pyvisgraph research-paper scipy-stats sklearn-metrics tqdm

Last synced: 04 May 2026

https://github.com/gnikit/fileio

A template allowing to read multiple columns of tab or space delimited files. Similar functionality to numpy.loadtxt()

cpp file-io library numpy

Last synced: 04 May 2026

https://github.com/ortanav2/parabola-visualizer

Simple parabola-visualizer made with matplot+numpy [Formula: a(x+d)²+e]

mathematics matplotlib numpy parabola visualization

Last synced: 04 May 2026

https://github.com/jalijuhola/neurcomputing-course

Some of tasks from neurocomputing course

mypy numpy python

Last synced: 04 May 2026

https://github.com/damisparks/become_data_analyst

Are you new to Data Analysis ? Here you will find simple notebook that will help through your journey. These are personal projects I work on and still working.

data data-analysis data-visualization matplotlib numpy pandas-tutorial

Last synced: 04 May 2026

https://github.com/abhivur/graduate-income-forecaster

Contributors: Abdussalam Raheem, Chiara Su, and Joseph Botros

matplotlib numpy pandas python scikit-learn seaborn

Last synced: 04 May 2026

https://github.com/miozilla/fescarefine

fescarefine :skier::mount_fuji::guide_dog: : Refine & Test ML Models # Features Scaling # Normalization # Datasets

gradient-descent linear-regression matplotlib model-testing mse norm numpy ols overfit pandas scatter seaborn sklearn standardization statsmodels underfit

Last synced: 04 May 2026

https://github.com/florianwoelki/salary_linear_regression

This project is for salary prediction using linear regression without any machine learning library.

jupyter linear-regression machine-learning matplotlib numpy pandas python regression

Last synced: 04 May 2026

https://github.com/arjunravi26/deep_learning

This repo contains codes and notes to learn deep learning

ann autoencoders cnn deep-learning deepseek-r1 lstm numpy rnn-tensorflow tensorflow transformer

Last synced: 04 May 2026

https://github.com/sweta-kaundilya/python_for_data_analysis

Learning Python and all the relevant libraries in python for Data field.

cufflinks data-analysis data-science matplotlib numpy pandas plotly python seaborn

Last synced: 04 May 2026

https://github.com/jonad/titanic_exploratory_analysis

Exploratory Data Analysis on Titanic Dataset

jupyter-notebook numpy pandas python3

Last synced: 04 May 2026

https://github.com/charlenry/machine_learning_estimer_prix_voiture

Projet de Machine Learning pour estimer le prix d'une voiture d'occasion via une application

joblib jupyter-notebook matplotlib-pyplot numpy oop-python pandas plotly-express python sklearn-linear-model sklearn-linear-regression

Last synced: 04 May 2026

https://github.com/florasteve/ml-foundations-day1

Day-1 ML foundations focused on linear algebra: vectors, dot products, norms, angles, projections, and basic matrix operations—implemented in a Jupyter notebook with NumPy/Matplotlib, clear 2D visuals, a self-quiz, and a brief reflection. Emphasizes how these math primitives map to core ML/AI ideas (similarity, least squares, geometric transforms).

ai data-science education jupyter-notebook linear-algebra machine-learning matplotlib numpy python3 terminal-mac tutorial visualization vscode

Last synced: 04 May 2026

https://github.com/hyperplasma/olympic-visualization-analysis

Multidimensional analysis and visualization of Olympic medals, economy, and happiness index.

data-analysis data-visualization matplotlib numpy pandas python wordcloud

Last synced: 04 May 2026

https://github.com/linsanity03/algorithmic_trading

Trading strategy based on the intersection of short-term and long-term moving averages.

jupyter-notebook matplotlib-pyplot numpy pandas

Last synced: 04 May 2026

https://github.com/silbena/gibbssampler

Gibbs Sampler for a simple mortgage model.

gibbs-sampler gibbs-sampling numpy python

Last synced: 04 May 2026

https://github.com/wwwmisla/gerador-mapas-calor

Sistema de visão computacional para gerar mapas de calor com base na movimentação em espaços públicos, auxiliando no planejamento urbano e uso eficiente do espaço.

color computer-vision demo google-colab gradio heatmap marchine-learning matplotlib model numpy opencv people-detection python smart-city ufrn visao-computacional vision-computer yolo yolov8

Last synced: 04 May 2026

https://github.com/saifalibaig/crop-yield-prediction

🌾 A machine learning-based crop production prediction system using historical Indian agricultural data with advanced regression models and hyperparameter tuning.

eda feature-encoding feature-selection kaggle-dataset lasso-regression linear-regression matplotlib-pyplot numpy onehot-encoding pandas python3 ridge-regression seaborn simpleimputer xgboost-regression

Last synced: 04 May 2026

https://github.com/ronitjariwala/-prodigy_ds_01

Prodigy InfoTech Data Science Internship Task-1

matplotlib numpy pandas python seaborn

Last synced: 04 May 2026

https://github.com/pedasoft-consult/product_purchase_prediction

As a newly hired AI Engineer, your task is to predict customer behavior based on various features such as age, income, and gender. This exercise involves cleaning the data, training a decision tree model, and evaluating the model's performance to understand the key factors influencing customer purchasing decisions.

machine-learning matplotlib numpy pandas python seaborn sklearn

Last synced: 04 May 2026

https://github.com/0nom4d/308reedpipes

Mathematical EPITECH Project - This project computes a representation of a reedpipe.

coverage epitech epitech-project matplotlib-pyplot numpy pytest python scipy

Last synced: 04 May 2026

https://github.com/swarup-ranjan-mondal/chat-analyzer

A streamlit app to analyze whatsapp personal and group chats.

matplotlib numpy pandas python seaborn streamlit

Last synced: 04 May 2026

https://github.com/manuethomas/credit-default-risk-analysis-eda

This repository contains the detailed EDA analysis of Home Credit Group Dataset. The analysis aims to find demographic and financial factors associated with higher or lower default risks, providing actionable insights for risk mitigation and improved lending practices

bivariate-analysis correlation-analysis data-preprocessing exploratory-data-analysis exploratory-data-visualizations matplotlib numpy pandas seaborn univariate-analysis

Last synced: 04 May 2026

https://github.com/youssefyaser/scrape-the-imdb-site-for-the-top-250-movies

Web scraping the top 250 movies in IMDB site.

data-analysis numpy pandas python

Last synced: 04 May 2026

https://github.com/hanzopgp/randomkagglesolving

Trying to build models to resolve some interesting Kaggle datasets thanks to deep learning.

artificial-intelligence deep-learning jupyter-notebook machine-learning matplotlib numpy personal tensorflow

Last synced: 04 May 2026

https://github.com/dyrits/parite

OPENCLASSROOMS | PERFECTIONNEZ-VOUS EN PYTHON

matplotlib numpy pandas python

Last synced: 04 May 2026

https://github.com/pabs-code/img-cartoonizer-using-opencv

A streamline app using 3 ways to cartoonized an image using OpenCV and Python.

bilateral-filtering color-quantization edge-detection edge-enhancement laplacian-edge-detection numpy opencv python

Last synced: 04 May 2026

https://github.com/asuquoaa/applied_data_science_with_python_study

Notes, codes and practicals on the Applied Data Science with Python specialization by UM

matplotlib numpy pandas seaborn

Last synced: 04 May 2026

https://github.com/kaungkhantkyaw1997/face_recognition

A Computer Vision tool designed to identify and recognize characters in images.

face-recognition flask numpy opencv

Last synced: 05 May 2026

https://github.com/dmkk01/mlp-python

Implementation of a multilayer perceptron using Pytorch and Numpy libraries

mlp numpy pytorch

Last synced: 05 May 2026

https://github.com/srevenant/data-science-alpine

A docker container for data science, using alpine linux and python3

alpine data numpy pandas python3 science scipy xgboost

Last synced: 05 May 2026

https://github.com/lin826/nanogpt-demo

Training and finetuning local GPTs.

gpt nanogpt numpy pytorch tqdm transformers

Last synced: 05 May 2026

https://github.com/jokoum/sentiment-classification

Multi-layer Perceptron classification of product reviews written in Greek language

flask machine-learning multilayer-perceptron-network neural-network numpy python sentiment-classification

Last synced: 05 May 2026

https://github.com/vladd12/machine-learning-practice

Репозиторий для учебной практики и различных курсов, связанных с ИИ

json keras-tensorflow machine-learning numpy pandas pickle tensorflow2 torch xml

Last synced: 05 May 2026

https://github.com/bhaskarkulshrestha/rock-vs-mine-prediction

A machine learning project to predict whether an object detected underwater is a rock or a mine using Logistic Regression.

colab-notebook logistic-regression machine-learning numpy pandas python sckit-learn supervised-learning

Last synced: 05 May 2026

https://github.com/burhanali2211/marksanalyzer

This is a simple CLI-based tool that helps analyze student marks by calculating key statistics such as the highest and lowest scores.

analysis analyzer arrays marks numpy

Last synced: 05 May 2026

https://github.com/iamwatchdogs/retail-insights-from-superstore-data

An simple Data Analysis Project for finding insight within the Super store data set

conda-environment data-analytics jupyter-notebook matplotlib numpy pandas python

Last synced: 05 May 2026

https://github.com/sahu-sakshi/covid-19

A comprehensive data analysis project for COVID-19 pandemic data, featuring automated data collection, feature engineering, visualization tools, and exploratory analysis notebooks.

jupyter numpy pandas python3

Last synced: 05 May 2026

https://github.com/vc-nithesh944/pandas_numpy

Practice repository for NumPy & Pandas with scripts, notebooks, and notes for future revision

numpy pandas

Last synced: 05 May 2026

https://github.com/kbunwong/stock-price-prediction-using-lstm

This project is about predicting stock prices with more accuracy using LSTM algorithm. For this project we have fetched real-time data from yfinance library.

google-colab jupyter-notebook keras-tensorflow lstm-model machine-learning matplotlib numpy pandas python visualization yfinance

Last synced: 05 May 2026

https://github.com/nabilshadman/multiprocessing-time-series-data-simulation

A simulation using multiprocessing to generate 10,000 time dependent samples from an initial dataset of 20 samples

multiprocessing numpy pandas scipy simulation statistics time-series

Last synced: 05 May 2026

https://github.com/sowmi0304/predictive-modeling-titanic-survival-prediction

Machine learning project to predict passenger survival on the Titanic using classification algorithms, data preprocessing, and model evaluation techniques.

machine-learning matplotlib numpy pandas python seaborn

Last synced: 05 May 2026

https://github.com/lruizap/tomography-with-algebraic-topology

This project aims to analyze images using algebraic topology. The goal is to extract structural information from images through mathematical concepts such as connected components and loops.

gudhi jupyter-notebook matplotlib numpy opencv python tomography

Last synced: 05 May 2026

https://github.com/hitthecodelabs/petalanalyticsstreamlit

Web application developed with Streamlit that predicts the Iris flower type based on its physical features

matplotlib model numpy pickle python scikit-learn sklearn streamlit

Last synced: 05 May 2026

https://github.com/lohiyah/dataset_summariser

A Python tool for generating concise summaries of datasets using Pandas and NumPy for analysis, and Matplotlib and Seaborn for insightful visualizations. Ideal for quick exploratory data analysis (EDA).

matplotlib numpy pandas seaborn

Last synced: 05 May 2026

https://github.com/yenk/human-genomic-meta-analysis

Meta analysis of genomic studies to better understand if telomeres in the human genome are impacted by age from wild animals data.

linear-regression machine-learning matplotlib numpy scipy

Last synced: 05 May 2026

https://github.com/mohitsai/boston-housing-data-analysis

Data Analysis Project for the City of Boston Government for insights into effect of property rennovations and remodelling on housing availability in the city

data-analysis data-science matplotlib numpy pandas python

Last synced: 05 May 2026

https://github.com/tapas-gope/diwali-sales-analysis

Analyze Diwali Sales Data to improve customer experience and sales.

exploratory-data-analysis matplotlib-pyplot numpy pandas python seaborn

Last synced: 05 May 2026

https://github.com/chdl17/marvelvsdc

This GitHub repository contains Python and Tableau for analyzing Marvel Vs DC superheros.

excel jupyter-notebook numpy pandas python tableau-public

Last synced: 05 May 2026

https://github.com/anushkundu/crime-pattern-analysis

Analyzing Crime Patterns in Montgomery County, USA: An Inclusive Study Based on NIBRS Data (2016-2022)

data-analysis data-visualization descriptive-statistics matplotlib numpy pandas python seaborn

Last synced: 05 May 2026

https://github.com/zuhairzia/customer-segmentation

📖 About Customer Segmentation using KMeans clustering to analyze demographics, income, and spending. Helps businesses with targeted marketing and customer insights.

joblib matplotlib numpy pandas scikit-learn seaborn

Last synced: 05 May 2026

https://github.com/jonad/quora_data_analysis

Exploratory data analysis and feature engineering on Quora dataset.

exploratory-data-analysis feature-extraction numpy pandas python3 quora-dataset

Last synced: 05 May 2026

https://github.com/abdelrahman-amen/neural_style_transfer

This project applies neural style transfer to create new images by merging the content of one image with the artistic style of another.

neuralstyletransfer numpy os python streamlit tensorflow

Last synced: 05 May 2026

https://github.com/sahaninilesh/future_ds_01_social-media-sentiments-analysis

The Social Media Sentiments Analysis Dataset captures a vibrant tapestry of emotions, trends, and interactions across various social media platforms. This dataset provides a snapshot of user-generated content, encompassing text, timestamps, hashtags, countries, likes, and retweets. Each entry unveils unique stories—moments of surprise, excitement,

jupyter matplotlib numpy pandas python

Last synced: 05 May 2026