An open API service indexing awesome lists of open source software.

NumPy

NumPy is an open source library for the Python programming language, adding support for large, multidimensional arrays, and matrices, along with a large collection of high-level mathematical functions to operate on these arrays.

https://github.com/beyzabasarir/spotify-youtube-eda

Exploratory Data Analysis of global songs' performance metrics on Spotify and YouTube with Python, using Jupyter Notebook.

eda exploratory-data-analysis jupyter-notebook matplotlib numpy pandas python seaborn

Last synced: 30 Apr 2026

https://github.com/johnrobertsantos/exploratory-data-analysis-using-python

This project represents the culmination of my Python learning journey.

jupyter matplotlib numpy pandas python seaborn

Last synced: 30 Apr 2026

https://github.com/badranalyst/e-commerce-customer-analysis-data-science-foundations-case-study

This case study explores e-commerce customer data through data exploration, pre-processing, and splitting. It includes model building and training to analyze customer behavior. Python libraries like Pandas, NumPy, Matplotlib, and Seaborn are used for the analysis and model development.

data-analysis data-science dataset eda exploratory-data-analysis machine-learning matplotlib ml model-building model-training numpy pandas pre-processing python seaborn

Last synced: 01 May 2026

https://github.com/ismaelvr1999/bioinspiredalgorithms

This repository contains implementations of various bioinspired algorithms for optimization and machine learning.

machine-learning matplotlib numpy python tensorflow

Last synced: 01 May 2026

https://github.com/hansalemaos/a_pandas_ex_image_tools

Uses pandas/numpy/numexpr for operations on pictures - very fast

colors image locate numexpr numpy opencv pictures python

Last synced: 01 May 2026

https://github.com/paulo-santos-ds/analise_dados_musicais

Dashboard para compreender as preferências musicais dos ouvintes, tendências regionais e o impacto de artistas específicos.

numpy pandas powerbi python

Last synced: 01 May 2026

https://github.com/offthetab/nn_homework

Практические работы по нейронным сетям МИРЭА 3 курс.

gensim ignite matplotlib neural-network nltk numpy pandas scipy-stats seaborn torch torchvision

Last synced: 01 May 2026

https://github.com/aniketwithpython/fundamentals-of-scientific-computing

Fundamentals of Scientific Computing (MS1980) Notebook

jupyter matplotlib numpy pandas python

Last synced: 01 May 2026

https://github.com/muhammadadilnaeem/bcg-data-science-job-simulation-on-forage-august-2024

This repository contains all the tasks, code, and documentation completed during the BCG Data Science job simulation on The Forage platform. The simulation focused on analyzing customer churn, building predictive models, and presenting insights for a major utility company.

bcg customer-churn-prediction-with-machine-learning data data-science forage numpy pandas

Last synced: 01 May 2026

https://github.com/dpb44/exploring-the-intuition-of-neural-networks-on-a-classification-problem-using-only-numpy

Implementing a softmax-based neural network from scratch using NumPy to classify the Iris dataset, leveraging vectorization, gradient descent, and decision boundary visualization.

deep-learning neural-network numpy softmax-classifier

Last synced: 02 May 2026

https://github.com/andremenezesds/machine_learning_test

Testing among various Machine Learning models and parameters, in order to further study their behaviour for Classification, Regression and Clustering analysis.

classification-models clustering-models machine-learning numpy pandas regression-models sklearn

Last synced: 02 May 2026

https://github.com/ksharma67/intel-stock-predication-wiith-eda

We are trying to design a model that can predict the price of stock using different methods and algorithms.

eda linear-regression machine-learning-algorithms matplotlib numpy pandas prediction python scaler seaborn skit-learn

Last synced: 02 May 2026

https://github.com/baranovserv/h-s-diagram

Парсер данных построения H-S диаграммы для веществ

diagrams json matplotlib numpy parser requests

Last synced: 02 May 2026

https://github.com/badranalyst/movie-correlation-analysis-in-python

This project analyzes movie data correlations using Python libraries like Pandas, NumPy, Seaborn, and Matplotlib. It examines relationships between attributes such as ratings, genres, and box office performance to uncover trends that inform recommendations and enhance understanding of movie success factors.

data data-analysis dataset jupyter jupyter-notebook matplotlib matplotlib-pyplot numpy pandas python seaborn

Last synced: 03 May 2026

https://github.com/bhavna-kale/cars-eda-project

Project analyzing used car market data to identify high-impact price drivers and depreciation curves, presented through an interactive web application.

data-analysis excel matplotlib numpy pandas python3 searborn streamlit

Last synced: 03 May 2026

https://github.com/alexbie98/bietorch

neural nets with numpy

autograd neural-nets numpy torch

Last synced: 03 May 2026

https://github.com/k-ilyas/ai-projects-handwritten-digit-recognition-titanic-survivor-analysis

AI Projects: Handwritten Digit Recognition & Titanic Survivor Analysis using scikit-learn

machine-learning numpy pandas sickit-learn

Last synced: 03 May 2026

https://github.com/sadmanca/imdb-scraper

Scrapes IMDb's movie database and outputs the data to CSV files.

beautifulsoup data-scraping imdb numpy pandas python requests

Last synced: 03 May 2026

https://github.com/akhil-krishnan-r/super_market_analysis

The growth of supermarkets in most populated cities are increasing and market competitions are also high. This dataset is one of the historical sales of supermarket company which has recorded in 3 different branches for 3 months data. Predictive data analytics methods are easy to apply with this dataset

data-visualization matplotlib numpy pandas seaborn

Last synced: 03 May 2026

https://github.com/kaustavmodak/business-aided-customer-feedback-assessment-system

A Streamlit-based sentiment analysis app that classifies customer reviews into Positive, Neutral, or Negative using a pre-trained ML mode

framework machine-learning matplotlib nlp nltk numpy pandas pickle regex scikit-learn seaborn sentiment-analysis streamlt tfidf-vectorizer

Last synced: 03 May 2026

https://github.com/jonad/finding_donors

Predicting income with UCI Census Income Dataset using supervised machine learning algorithms

numpy pandas scikit-learn scikitlearn-machine-learning

Last synced: 03 May 2026

https://github.com/zongxr/dcic2025-riverwaterprediction

闽江,福建省最大独流入海河流,流域降水丰沛。在新型电力系统和新阶段水利高质量发展的环境下,如何缓解流域防汛压力,提升水资源利用率是水电企业高质量发展的重中之重。目前,基于传统水文学原理已实现未来9小时甲级精度的洪水预报,但存在遇见期与预见精度上仍有提升空间,因此,通过结合人工智能技术实现流域入库流量的精准预测,对保障水库安全调度、防洪及清洁能源稳定供应具有重大战略意义。

lightgbm machine-learning numpy pandas sklearn time-series xgboost

Last synced: 03 May 2026

https://github.com/manikantasanjay/youtube_exploratory_data_analytics

This is a Mini Project made using Python Libraries like Numpy, Pandas, Matplotlib etc to perform Data Analytics on US Video and Comments dataset.

jupyter-notebook matplotlib numpy pandas plotly python seaborn wordcloud

Last synced: 03 May 2026

https://github.com/syed-m-nofel/python-data-science-fundamentals

Python notebooks for data manipulation (Pandas/NumPy) and API workflows – from basics to practical examples.

api beginner-friendly data-analysis data-science http-requests jupyter-notebook numpy pandas pandas-dataframe python tutorial

Last synced: 03 May 2026

https://github.com/ankitgmishra/machinelearning

Continuously deep diving in understanding & advancing my expertise in Machine Learning through ongoing education and hands on experience with practical learning.

artificial-intelligence data-analysis data-cleaning data-gathering machine-learning machinel-learning-algorithms matplotlib numpy pandas python seaborn

Last synced: 03 May 2026

https://github.com/asayda01/airport_operations_analytics

The project analyzes the performance of Hold Baggage Screening X-ray Machines, identifying trends in system load, bottlenecks, and operator efficiency. Using an interactive Streamlit app, users can explore data-driven insights to optimize airport screening processes and improve overall operational performance.

data-science dataanalysis datainsights datavisualization numpy pandas python streamlit

Last synced: 03 May 2026

https://github.com/schukark/intervalpylib

Interval solver of systems of equations to find the workspace area of a robot

krawczyk matplotlib numpy python3 robotics solving-algorithm sympy systems-of-equations visualization

Last synced: 03 May 2026

https://github.com/naritanara/tf-test

Mostres de ML i IA amb TensorFlow basades en els tutorials oficials per al PR

numpy python research-project tensorflow tensorflow-tutorials

Last synced: 03 May 2026

https://github.com/atchayaah/home-value-insights-kc

Data-driven project predicting King County housing prices using EDA, regression models, and ML techniques, developed as part of IBM’s Data Analysis with Python course on Coursera.

joblib matplotlib numpy pandas pickle python scikit-learn seaborn

Last synced: 03 May 2026

https://github.com/joao-inacio/numpy-essencial-para-desenvolvedores-python

O curso foi elaborado para desenvolvedores Python que desejam explorar os recursos poderosos da biblioteca NumPy. Através de aulas práticas, eu vou adquirir as habilidades necessárias para trabalhar com arrays multidimensionais, realizar cálculos científicos complexos e manipular dados com eficiência.

array array-manipulations array-methods jupyter-notebook numpy python python3

Last synced: 03 May 2026

https://github.com/fnfkkengine/website-performance-data-analysis-project

🌐 Analyze website performance data to gain insights on user engagement, traffic trends, and optimize your digital presence effectively.

analysis analytics dashboard data-analyst data-collection data-preprocessing data-visualization dax funnel-analysis jupyter-notebook matplotlib mysql numpy paid-search power-query project-management python storytelling

Last synced: 04 May 2026

https://github.com/egorumaev/2024-ab-tests-metrics

Анализ соответствия размера выборки и плановых значений метрик A/B-теста

latex monte-carlo-method numpy pandas statmodels

Last synced: 04 May 2026

https://github.com/anas436/ibm-cloud-data-analysis-of-cars

Exploratory Data Analysis of Cars in Python

ibmwatson matplotlib numpy pandas python3 seaborn

Last synced: 04 May 2026

https://github.com/dr-rompecabezas/medical-data-visualizer

[freeCodeCamp] Medical Data Viz Jupyter Notebook built with Matplotlib and Seaborn

jupyter-notebook matplotlib numpy pandas python seaborn

Last synced: 04 May 2026

https://github.com/bpkaur/whats-in-a-name

Exploring dataset of first names of babies born in the US in order to uncover interesting stories

data-analysis datacamp numpy pandas python3

Last synced: 04 May 2026

https://github.com/pomxark/smartdictor_0.1_nuitka_cleer

Распознание и озвучивание голосовым движком текста с экрана.

cv2 fuzzywuzzy kivy matplotlib-pyplot nuitka numpy pil pyautogui pyqt5 pytesseract pyttsx3 scipy silero simpleaudio torch

Last synced: 04 May 2026

https://github.com/gnikit/fileio

A template allowing to read multiple columns of tab or space delimited files. Similar functionality to numpy.loadtxt()

cpp file-io library numpy

Last synced: 04 May 2026

https://github.com/timothyjan/intro-machine-learning-polynomial-regression

We use NumPy to implement vectorized linear and polynomial regression models and compare their performance using separate training and test sets.

numpy

Last synced: 04 May 2026

https://github.com/jakeoeding/rippy

A Python seam carving implementation

numpy python3 seam-carving

Last synced: 04 May 2026

https://github.com/dmdlgg/calculadora_de_indicadores_estatisticos

Uma calculadora que retorna a média, mediana, desvio padrão, quartis e variância dos dados inseridos pelo usuário. Tecnologias utilizadas: Flet, Python e Numpy. (Ainda em andamento)

flet numpy pyhton

Last synced: 04 May 2026

https://github.com/ola-kaznowska/median-statistic

A program that calculates the median statistic. Program written in Python 3.13.1 with NumPy library during Machine Learning course.

mathematics numpy python python3

Last synced: 04 May 2026

https://github.com/arsalan692/python-telusko-playlist

A collection of beginner Python scripts and practice exercises from my early learning days, inspired by Telusko and other resources.

basic-programming bs4 matplotlib numpy oop pandas python requests selenium tkinter

Last synced: 04 May 2026

https://github.com/florianwoelki/salary_linear_regression

This project is for salary prediction using linear regression without any machine learning library.

jupyter linear-regression machine-learning matplotlib numpy pandas python regression

Last synced: 04 May 2026

https://github.com/sweta-kaundilya/python_for_data_analysis

Learning Python and all the relevant libraries in python for Data field.

cufflinks data-analysis data-science matplotlib numpy pandas plotly python seaborn

Last synced: 04 May 2026

https://github.com/jonad/titanic_exploratory_analysis

Exploratory Data Analysis on Titanic Dataset

jupyter-notebook numpy pandas python3

Last synced: 04 May 2026

https://github.com/charlenry/machine_learning_estimer_prix_voiture

Projet de Machine Learning pour estimer le prix d'une voiture d'occasion via une application

joblib jupyter-notebook matplotlib-pyplot numpy oop-python pandas plotly-express python sklearn-linear-model sklearn-linear-regression

Last synced: 04 May 2026

https://github.com/florasteve/ml-foundations-day1

Day-1 ML foundations focused on linear algebra: vectors, dot products, norms, angles, projections, and basic matrix operations—implemented in a Jupyter notebook with NumPy/Matplotlib, clear 2D visuals, a self-quiz, and a brief reflection. Emphasizes how these math primitives map to core ML/AI ideas (similarity, least squares, geometric transforms).

ai data-science education jupyter-notebook linear-algebra machine-learning matplotlib numpy python3 terminal-mac tutorial visualization vscode

Last synced: 04 May 2026

https://github.com/hyperplasma/olympic-visualization-analysis

Multidimensional analysis and visualization of Olympic medals, economy, and happiness index.

data-analysis data-visualization matplotlib numpy pandas python wordcloud

Last synced: 04 May 2026

https://github.com/secary/maths7027

Mathematical Foundations of Data Science

latex mathematics numpy pandas

Last synced: 04 May 2026

https://github.com/nordic96/lanedetection

Lane Detection Module using OpenCv and Python for educational purpose.

computer-vision lane-detection matplotlib-pyplot numpy opencv python3

Last synced: 04 May 2026

https://github.com/saifalibaig/crop-yield-prediction

🌾 A machine learning-based crop production prediction system using historical Indian agricultural data with advanced regression models and hyperparameter tuning.

eda feature-encoding feature-selection kaggle-dataset lasso-regression linear-regression matplotlib-pyplot numpy onehot-encoding pandas python3 ridge-regression seaborn simpleimputer xgboost-regression

Last synced: 04 May 2026

https://github.com/abhijeet-bhambere/exploratory_data_analysis

A collection of several projects for conducting EDA on datasets from various sources

exploratory-data-analysis matplotlib numpy pandas python

Last synced: 04 May 2026

https://github.com/pedasoft-consult/product_purchase_prediction

As a newly hired AI Engineer, your task is to predict customer behavior based on various features such as age, income, and gender. This exercise involves cleaning the data, training a decision tree model, and evaluating the model's performance to understand the key factors influencing customer purchasing decisions.

machine-learning matplotlib numpy pandas python seaborn sklearn

Last synced: 04 May 2026

https://github.com/shamiul5201/selfie_app_features_application

OpenCV project to build skills in image processing: features include Cartoonify and Pencil Sketch effects in a Jupyter Notebook, blemish removal in a Python script, and chroma keying for videos. Perfect for exploring creative and practical computer vision techniques.

computer-vision numpy opencv python

Last synced: 04 May 2026

https://github.com/0nom4d/308reedpipes

Mathematical EPITECH Project - This project computes a representation of a reedpipe.

coverage epitech epitech-project matplotlib-pyplot numpy pytest python scipy

Last synced: 04 May 2026

https://github.com/swarup-ranjan-mondal/chat-analyzer

A streamlit app to analyze whatsapp personal and group chats.

matplotlib numpy pandas python seaborn streamlit

Last synced: 04 May 2026

https://github.com/rtgrt5645/numpy-lab

🧮 Explore, manipulate, and visualize data with NumPy to enhance your Python skills in scientific computing and data analysis.

array-operations data-analysis data-science jupyter-notebook machine-learning numerical-computing numpy numpy-arrays numpy-library numpy-python python python3 scientific-computing

Last synced: 04 May 2026

https://github.com/jakubfr4czek/apartment-prices-analysis

This repository contains an analysis of apartments prices in Krakow. It was made for "Rachunek prawdopodobieństwa i statystyka" (probability and statistics) course i took at AGH University of Science.

agh agh-wi analysis csv jupyter jupyter-notebook krakow numpy pandas price-analysis price-prediction probability probability-statistics python statistics statistics-learning

Last synced: 04 May 2026

https://github.com/youssefyaser/scrape-the-imdb-site-for-the-top-250-movies

Web scraping the top 250 movies in IMDB site.

data-analysis numpy pandas python

Last synced: 04 May 2026

https://github.com/hanzopgp/randomkagglesolving

Trying to build models to resolve some interesting Kaggle datasets thanks to deep learning.

artificial-intelligence deep-learning jupyter-notebook machine-learning matplotlib numpy personal tensorflow

Last synced: 04 May 2026

https://github.com/dyrits/parite

OPENCLASSROOMS | PERFECTIONNEZ-VOUS EN PYTHON

matplotlib numpy pandas python

Last synced: 04 May 2026

https://github.com/chathumiamarasinghe/nn-training-model

A comprehensive project for training neural networks to solve real-world problems. This repository includes customizable code for building, training, and evaluating neural network architectures using popular deep learning frameworks.

jupyter-notebook matplotlib numpy phyton scikit-learn

Last synced: 04 May 2026

https://github.com/pabs-code/img-cartoonizer-using-opencv

A streamline app using 3 ways to cartoonized an image using OpenCV and Python.

bilateral-filtering color-quantization edge-detection edge-enhancement laplacian-edge-detection numpy opencv python

Last synced: 04 May 2026

https://github.com/asuquoaa/applied_data_science_with_python_study

Notes, codes and practicals on the Applied Data Science with Python specialization by UM

matplotlib numpy pandas seaborn

Last synced: 04 May 2026

https://github.com/siddhantborse/atmosviz

Atmos Viz is a Python-based project designed to analyze, visualize, and predict global temperature trends across various cities and countries using time-series analysis and advanced data science techniques. Leveraging historical climate data, this project integrates machine learning models, geospatial mapping, and interactive visualizations to unco

geopandas geospatial-analysis gis matplotlib numpy pandas plotly python scikit-learn seaborn shapefiles time timeseries-analysis timeseries-data

Last synced: 05 May 2026

https://github.com/lorenzorottigni/ml-missing-values

Machine Learning course of Piero Savastano 5: ColumnTransformer, SimpleImputer, numpy

column-transformer machine-learning missing-values numpy simple-imputer

Last synced: 05 May 2026

https://github.com/skorupcia/winequality-white

winequality-white.csv visualization

matplotlib numpy pandas seaborn sklearn

Last synced: 06 May 2026

https://github.com/yashpaneliya/bank-loan-default-analysis

Analyze and understand the driving factors (or driver variables) behind loan default, i.e. the variables which are strong indicators of default.

data-analysis loan-default-analysis matplotlib numpy pandas python

Last synced: 06 May 2026

https://github.com/5hraddha/optimize-oil-well-locations

In the quest for harnessing valuable energy resources, the OilyGiant mining company wants to expand its operations by discovering new oil well locations. To achieve this, a data-driven approach is adopted, leveraging geological exploration data from three distinct regions and employing techniques in data analysis and modeling.

linear-regression numpy pandas scikit-learn supervised-learning

Last synced: 06 May 2026

https://github.com/chopikus/raytracing-bench

Benchmarking ray tracers in Python, Java, Numpy, Cupy, Pypy

cupy java numpy pypy python raytracing

Last synced: 06 May 2026

https://github.com/coder5omkar/lendingclubcasestudy

Performed exploratory data analysis (EDA) on the loan dataset from the Lending Club Case Study to identify the key factors influencing loan defaults.

bivariate-analysis data-science exploratory-data-analysis matplotlib numpy pandas seaborn univariate-analysis

Last synced: 06 May 2026

https://github.com/deanlogan/data-analysis-course

Code created when completing the Data Analysis with Python Course on freecodecamp.org

course data-analysis numpy pandas python python3

Last synced: 06 May 2026

https://github.com/fazal-coder/capturing_img

python code for Creating blue color image

loop numpy opencv processing python scheduler threading

Last synced: 06 May 2026

https://github.com/cycle-sync-ai/student-score-analysis

A data-driven student performance analysis project using UCI dataset (396 students, 33 features). Implements machine learning models (K-means, PCA, Decision Tree, Random Forest, Linear Regression) to analyze academic patterns and predict student scores based on lifestyle, health, and study habits.

clustering clustering-algorithm decision-trees feature-engineering learning-management-system linear-regression machine-learning machine-learning-algorithms matplotlib numpy pandas pca pickle prediction prediction-algorithm scikit-learn score seaborn student

Last synced: 06 May 2026

https://github.com/naveen88112/genetic_algos_maze_solver

Genetic Algorithm Maze Solver This project utilizes a Genetic Algorithm (GA) to solve the optimization of a maze problem. It evolves the population of promising solutions to the best solution finding shortest path between start and end point using selection, crossover, and mutation.

deap-library genetic-algorithms matplotlib maze-solver numpy python

Last synced: 06 May 2026

https://github.com/albert-2002/robust-penalized-empirical-likelihood-estimation-method-for-linear-regression

Implementation for the paper titled: Robust Penalized Empirical Likelihood Estimation Method for Linear Regression

linear-regression machine-learning numpy pandas python regression regression-analysis statistics

Last synced: 06 May 2026

https://github.com/christiansassi/signal-image-and-video-project

Project developed by Pietro Bologna (@bolognapietro) and Christian Sassi for the Signal, Image and Video course.

image-processing mri-segmentation numpy opencv python

Last synced: 06 May 2026

https://github.com/kavayk29/text-summarization-fine-tuning-pegasus-model

This project uses the Pegasus model to generate concise, coherent text summaries. Fine-tuned on large datasets, it excels in abstractive summarization, ideal for automating tasks like summarizing articles and research papers.

datasets huggingface-transformers matplotlib nltk numpy pandas teansform tqdm

Last synced: 06 May 2026

https://github.com/timeless-residents/handson-tensorflow-numpy

A practical guide to learning and using TensorFlow and NumPy for data manipulation and machine learning

data-science machine-learning numpy tensorflow tutorial

Last synced: 06 May 2026

https://github.com/hansalemaos/cv2_collage

Create a collage from images with OpenCV

collage numpy opencv python

Last synced: 06 May 2026

https://github.com/lunanvarg/bid-data-cac22039

En este repositorio se vé mi TP Integrador del programa Codo a Codo: Big Data

colab data-studio-google deepnote excel matplotlib mysql numpy pandas-library python seaborn

Last synced: 06 May 2026