An open API service indexing awesome lists of open source software.

NumPy

NumPy is an open source library for the Python programming language, adding support for large, multidimensional arrays, and matrices, along with a large collection of high-level mathematical functions to operate on these arrays.

https://github.com/muhammadadilnaeem/bcg-data-science-job-simulation-on-forage-august-2024

This repository contains all the tasks, code, and documentation completed during the BCG Data Science job simulation on The Forage platform. The simulation focused on analyzing customer churn, building predictive models, and presenting insights for a major utility company.

bcg customer-churn-prediction-with-machine-learning data data-science forage numpy pandas

Last synced: 01 May 2026

https://github.com/rgandhi8/inventory-management-system

This project is an AI-driven Inventory Management System designed to optimize supply chain efficiency by leveraging machine learning techniques for demand forecasting. The system integrates ARIMA and ANNs to predict inventory needs, minimize shortages, and reduce holding costs also enhances decision-making by automating inventory planning.

dashboard inventory-management-system machine-learning numpy pandas python seaborn

Last synced: 08 Jun 2026

https://github.com/krook1024-uni-archive/kmeans

kmeans implementation for a university course

kmeans kmeans-clustering machine-learning numpy python

Last synced: 02 May 2026

https://github.com/camsai/jupyterlite

CAMSAI JupyterLite is a lightweight, browser-based environment tailored for AI-driven materials science research. It integrates CAMSAI tools, schemas, and workflows, enabling users to validate data, run simulations, and explore materials science applications without requiring local installation or setup.

chemistry jupyterlite materials materials-design materials-informatics materials-science matplotlib numpy pyodide python scipy

Last synced: 02 May 2026

https://github.com/andremenezesds/machine_learning_test

Testing among various Machine Learning models and parameters, in order to further study their behaviour for Classification, Regression and Clustering analysis.

classification-models clustering-models machine-learning numpy pandas regression-models sklearn

Last synced: 02 May 2026

https://github.com/ksharma67/intel-stock-predication-wiith-eda

We are trying to design a model that can predict the price of stock using different methods and algorithms.

eda linear-regression machine-learning-algorithms matplotlib numpy pandas prediction python scaler seaborn skit-learn

Last synced: 02 May 2026

https://github.com/baranovserv/h-s-diagram

Парсер данных построения H-S диаграммы для веществ

diagrams json matplotlib numpy parser requests

Last synced: 02 May 2026

https://github.com/ifte-13/digit-classification-with-mnist

MNIST Digit Classification: Comparative Analysis of Optimization Algorithms

matplotlib mnist-dataset numpy tensorflow

Last synced: 02 May 2026

https://github.com/hadjuse/spotify-recommendation-knn

Here is a personal project where i use KNN project to classify songs

data-science graphics interpretation knn-classification machine-learning matplotlib music numpy plotly spotify

Last synced: 03 May 2026

https://github.com/badranalyst/movie-correlation-analysis-in-python

This project analyzes movie data correlations using Python libraries like Pandas, NumPy, Seaborn, and Matplotlib. It examines relationships between attributes such as ratings, genres, and box office performance to uncover trends that inform recommendations and enhance understanding of movie success factors.

data data-analysis dataset jupyter jupyter-notebook matplotlib matplotlib-pyplot numpy pandas python seaborn

Last synced: 03 May 2026

https://github.com/bhavna-kale/cars-eda-project

Project analyzing used car market data to identify high-impact price drivers and depreciation curves, presented through an interactive web application.

data-analysis excel matplotlib numpy pandas python3 searborn streamlit

Last synced: 03 May 2026

https://github.com/raduldev/biopython

Biopython Record Work and few extra coding excercise

bioinformatics biopython gromacs jupyter-notebook matplotlib numpy pandas python3

Last synced: 03 May 2026

https://github.com/alexbie98/bietorch

neural nets with numpy

autograd neural-nets numpy torch

Last synced: 03 May 2026

https://github.com/fandredev/ml-my-guide

my own annotations about ML/DS using pandas, matplotlib, numpy, scikit learn

anaconda matplotlib numpy pandas plotly scikit-learn seaborn

Last synced: 03 May 2026

https://github.com/farhannaeem00/data-visualization-practice

Learning matplotlib fundamentals - seeking feedback and code reviews

jupyter matplotlib numpy pandas python

Last synced: 03 May 2026

https://github.com/mzayles/data-preparation.py

Curso Programação em Python para Data Science | Preparando e tratando dados de uma base de vendas com Pandas e NumPy.

numpy pandas python

Last synced: 03 May 2026

https://github.com/shreenidhi7700/auto_price_prediction

We are required to use Machine Learning algorithms to predict the price of cars with the available independent variables such as Horsepower, no-of-cylinders, engine-type, city_mpg, highway_mpg etc.

decision-tree-regression ensemble knn-regression linear-regression machine-learning-algorithms matplotlib-pyplot numpy pandas python3 random-forest-regression regression seaborn xgboost-regression

Last synced: 04 May 2026

https://github.com/vinicius999/icu-beds-forecast-covid-19

Predição da ocupação de leitos de UTI no Brasil devido a pandemia do COVID-19, usando Support Vector Regression (SVR)

covid-19 matplotlib numpy pandas python sklearn support-vector-regression

Last synced: 03 May 2026

https://github.com/furk4nbulut/uygulamalarla-makine-ogrenmesi-ve-derin-ogrenme-atolyesi

Bu repository, Manisa'da gerçekleştirilen BTK Akademi Uygulamalı Makine Öğrenmesi ve Derin Öğrenme Atölyesi'ne ait eğitim sürecini kapsamaktadır. Atölyede katılımcılar, ileri düzey makine öğrenmesi ve derin öğrenme teknikleriyle ilgili teorik ve pratik bilgiler edinmektedir.

matplotlib numpy pandas scikit-learn seaborn

Last synced: 03 May 2026

https://github.com/carol-neto/sprint-9-making-data-driven-business-decisions

In this project I evaluate hypotheses using ICE and RICE and perform A/B testing analysis.

abtesting dataanalysis hypothesis-testing matplotlib numpy pandas python scipy-stats seaborn

Last synced: 03 May 2026

https://github.com/muskanmi/data_analysis_python

Data analysis on students result dataset using python libraries.

boxplot countplots data-analysis numpy pandas pie-chart python3 seaborn

Last synced: 03 May 2026

https://github.com/lmizner/codecademy_product_defects

Practicing rules of probability, set theory, and distributions

cdf jupyter-notebook numpy pmf ppf python rvs scipy-stats

Last synced: 03 May 2026

https://github.com/coderooz/handwritten_digit_recognition

This project uses a Convolutional Neural Network (CNN) to recognize handwritten digits from the MNIST dataset. The model is built using TensorFlow and Keras.

colab handwritten-digit-recognition image-classification matplotlib neural-network numpy python tensorflow

Last synced: 03 May 2026

https://github.com/hansalemaos/np_zoom_in

Zoom in on a picture

image numpy python zoom

Last synced: 03 May 2026

https://github.com/nicolashaubricht/analise-dados-pib-per-capita

Este projeto tem como objetivo a análise de dados sobre as variações do PIB per capita em diversos países ao longo dos anos.

numpy pandas plotly python

Last synced: 03 May 2026

https://github.com/atchayaah/home-value-insights-kc

Data-driven project predicting King County housing prices using EDA, regression models, and ML techniques, developed as part of IBM’s Data Analysis with Python course on Coursera.

joblib matplotlib numpy pandas pickle python scikit-learn seaborn

Last synced: 03 May 2026

https://github.com/joao-inacio/numpy-essencial-para-desenvolvedores-python

O curso foi elaborado para desenvolvedores Python que desejam explorar os recursos poderosos da biblioteca NumPy. Através de aulas práticas, eu vou adquirir as habilidades necessárias para trabalhar com arrays multidimensionais, realizar cálculos científicos complexos e manipular dados com eficiência.

array array-manipulations array-methods jupyter-notebook numpy python python3

Last synced: 03 May 2026

https://github.com/grachale/convolution_image

The implementation of a function that takes an image (RGB or gray one) and a filter (kernel) and applies a convolution between them.

convolution image-processing numpy python

Last synced: 03 May 2026

https://github.com/akashparley/ipl-data-analysis

The IPL Data Analysis project focuses on extracting valuable insights from IPL match data using various data analytics techniques. By analyzing historical match outcomes, player performances, team comparisons, and venue statistics, the project visualizes trends and patterns through graphs like bar charts, line graphs, and scatter plots.

analytics data-visualization excel metplotlib numpy pandas python seaborn tableau-dashboards

Last synced: 04 May 2026

https://github.com/egorumaev/2024-ab-tests-metrics

Анализ соответствия размера выборки и плановых значений метрик A/B-теста

latex monte-carlo-method numpy pandas statmodels

Last synced: 04 May 2026

https://github.com/anas436/ibm-cloud-data-analysis-of-cars

Exploratory Data Analysis of Cars in Python

ibmwatson matplotlib numpy pandas python3 seaborn

Last synced: 04 May 2026

https://github.com/aymanmomin/facebook-ego-network-analysis-for-political-outreach-optimization

Analyzing Facebook ego networks to identify influential users and communities for optimizing political advertising. Leverages network science techniques (Louvain communities, centrality metrics, threshold models) to simulate influence spread.

community-detection course-project facebook-network-analysis louvain-algorithm louvain-community-detection matplotlib network-analysis network-graphs networkx-library numpy pandas python3 pyvisgraph research-paper scipy-stats sklearn-metrics tqdm

Last synced: 04 May 2026

https://github.com/pomxark/smartdictor_0.1_nuitka_cleer

Распознание и озвучивание голосовым движком текста с экрана.

cv2 fuzzywuzzy kivy matplotlib-pyplot nuitka numpy pil pyautogui pyqt5 pytesseract pyttsx3 scipy silero simpleaudio torch

Last synced: 04 May 2026

https://github.com/timothyjan/intro-machine-learning-polynomial-regression

We use NumPy to implement vectorized linear and polynomial regression models and compare their performance using separate training and test sets.

numpy

Last synced: 04 May 2026

https://github.com/dmdlgg/calculadora_de_indicadores_estatisticos

Uma calculadora que retorna a média, mediana, desvio padrão, quartis e variância dos dados inseridos pelo usuário. Tecnologias utilizadas: Flet, Python e Numpy. (Ainda em andamento)

flet numpy pyhton

Last synced: 04 May 2026

https://github.com/damisparks/become_data_analyst

Are you new to Data Analysis ? Here you will find simple notebook that will help through your journey. These are personal projects I work on and still working.

data data-analysis data-visualization matplotlib numpy pandas-tutorial

Last synced: 04 May 2026

https://github.com/mchenryspagg/investigate_a_dataset

This is a data analysis project that demonstrates the student's ability to use python data analysis libraries such as pandas, numpy and pyplot in matplotlib to investigate a dataset and answer specific questions from the dataset, thus demonstrating skills in data cleaning, data wrangling, and exploratory data analysis.

data-analysis datetime descriptive-analysis descriptive-statistics exploratory-data-analysis numpy pandas pyplot python visualization

Last synced: 04 May 2026

https://github.com/abhivur/graduate-income-forecaster

Contributors: Abdussalam Raheem, Chiara Su, and Joseph Botros

matplotlib numpy pandas python scikit-learn seaborn

Last synced: 04 May 2026

https://github.com/badranalyst/udemy-course-data-analysis-with-pandas-and-visualization-libraries

Explores Udemy course data using NumPy and Pandas for data manipulation. Provides insights into course topics, ratings, and pricing with visualizations created using Matplotlib, Seaborn, and Plotly.

data-science dataset eda exploratory-data-analysis matplotlib matplotlib-pyplot numpy pandas plotly python seaborn

Last synced: 16 Apr 2026

https://github.com/myahninsi/housing-price-prediction-ml

Final project for Big Data Visualization for Business Communications 01 (DSMM Group 1). Analyzes housing data, identifies key price factors, and builds predictive models using machine learning. Includes Power BI dashboards for interactive visualizations and Flask for deployment.

elastic-net lasso-regression linear-regression matplotlib numpy pandas ridge-regression scikit-learn seaborn

Last synced: 14 Apr 2026

https://github.com/linsanity03/particle-in-potential-well

Visualizing the Wavefunction and Probability density of a Particle in a 1D Infinite Potential Well

jupyter-notebook matplotlib-pyplot numpy probability-density schrodinger-equation wavefunction-visualisation

Last synced: 28 Apr 2026

https://github.com/nermeenkamal/image-processing-system

This Python application provides a graphical user interface (GUI) for performing various image processing operations. Users can upload, manipulate, and save images using a range of techniques, including grayscale conversion, binary conversion, histogram equalization, and more.

matplotlib numpy opencv python tkinter

Last synced: 01 May 2026

https://github.com/dmarks84/coursework_project_text-mining-spam-analysis

Project for University of Michigan Applied Data Science Specialization -- Performed NLP in order to build features of email messages; trained various classification models to help predict if a message was spam.

classification databases eda nlp numpy pandas python scikit seaborn sentiment-analysis statistics supervised-ml text-mining unsupervised-ml visualization

Last synced: 11 Apr 2026

https://github.com/separatrixxx/tmandcm_labs_3_sem

👓 Laboratory works for 3 semester of MAI on theoretical mechanics

matplotlib numpy python theoretical-mechanics

Last synced: 20 May 2026

https://github.com/NishuMehta/Netflix-Data-Analytics

Netflix Data: Cleaning, Analysis and Visualization

jupyter-notebook matplotlib-pyplot numpy pandas python python3 seaborn vscode

Last synced: 14 Mar 2025

https://github.com/HarshilChampaneri/Titanic_Survival_Prediction

I have Created a Machine Learning Model based on the Logistic Regression algorithm. This Model is capable of predicting whether the person will survive or not survive based on the given data of that particular person.This Model is highly accurate in predicting the result.

artificial-intelligence jupyter kaggle logistic-regression machine-learning matplotlib numpy pandas python seaborn titanic-kaggle titanic-survival-prediction

Last synced: 30 Dec 2025

https://github.com/jigyasag18/fake-news-prediction-project

The Fake News Prediction App Repository offers a machine learning project that focuses on identifying the authenticity of news articles as fake or real. It uses a dataset of 20,000 articles and employs methods such as TF-IDF vectorization and the Porter stemming algorithm, achieving around 97% classification accuracy with logistic regression model.

data datapreprocessing logistic-regression machine-learning machine-learning-algorithms numpy pandas prediction stemming vectorization

Last synced: 08 Jun 2026

https://github.com/saikiran-erukonda/drug-labels-side-effects-analysis--a-ml-approach

An end-to-end real-time project on pharmaceutical data, focused on exploring statistical modeling, hypothesis testing, machine learning, and business intelligence techniques.

datascience-machinelearning matplotlib-figures numpy pandas python seaborn

Last synced: 17 Feb 2026

https://github.com/pyfig/s21_data-science-bootcamp

School21 Bootcamp Data Science

data data-science numpy pandas python school21

Last synced: 26 Jun 2025

https://github.com/MuhammedMorad/Egypt-Premier-Fashion-Sales-Analysis

Egypt’s Premier Fashion Sales Analysis is a data-driven project that provides interactive sales insights using Streamlit, Power BI, and Pandas. It analyzes sales trends, customer behaviors, and product performance while offering real-time dashboards and downloadable reports. helping businesses make informed, data-driven decisions. 🚀

matplotlib numpy pandas plotly python seaborn streamlit

Last synced: 30 Dec 2025

https://github.com/anodicpassion/printing-accuracy-analyzer

This project provides a web-based application for analyzing the print quality of scanned documents by comparing them against a template. It identifies issues such as misalignment and blur, providing visual feedback and quantitative metrics.

numpy opencv-python print

Last synced: 20 May 2026

https://github.com/iankitnegi/datapy

Welcome to DATAPy! This repo contains my Python coding projects for data science, featuring data manipulation, analysis, and visualization. Stay tuned for updates and new content!

basic-programming matplotlib numpy pandas python seaborn

Last synced: 30 Apr 2026

https://github.com/ola-kaznowska/first-steps-in-numpy

My first steps in the NumPy library

mathematics numpy python python3

Last synced: 20 May 2026

https://github.com/elifftosunn/textdataclean

Kirli veri çekildiğinde ön işleme adımlarına gerek kalmadan model eğitimi için hazır hale getirmek amacıyla yapılan uygulamadır.

corpus deasciifier morphological-analysis ngram nltk numpy pandas sentence-embedding sentence-tokenizer stemmer stopwords string turkish turkish-sentence-tokenizer word-tokenizer

Last synced: 20 May 2026

https://github.com/kr1shnasomani/genagenet

Gender and Age recognition from face images

computer-vision deep-learning neural-network numpy opencv

Last synced: 21 Jul 2025

https://github.com/vishnu-vamshii/heart-disease-prediction-using-ml

This project presents an end-to-end data analysis and machine learning pipeline for predicting heart disease using a publicly available dataset. The project includes data exploration, visualization, and implementation of various machine learning models to predict the likelihood of heart disease based on a set of clinical attributes.

machine-learning matplotlib numpy pandas python seaborn sklearn

Last synced: 11 Apr 2026

https://github.com/eljandoubi/genre_classification

Create an ML pipeline for Genre Classification using MLflow.

hydra machine-learning mlflow numpy pandas pandas-profiling pytest scikit-learn scipy wandb

Last synced: 11 Apr 2026

https://github.com/kishankrishna1/spam-classifier

Developed a Machine Learning-based Spam Classifier using Multinomial Naive Bayes to identify and filter spam messages with high precision

matplotlib numpy pandas python scikit-learn seaborn

Last synced: 02 Apr 2026

https://github.com/jt4v4res/neural-networks-and-deep-learning

Some neural networks implementation with purpose of understanding the use of this learning algorithm and deep learning.

deep-learning machine-learning machine-learning-algorithms neural-networks numpy perceptron-learning-algorithm

Last synced: 16 Jan 2026

https://github.com/jbalooshie/movies-etl

Exercise working with movie datasets from Kaggle and Wikipedia. Python is used to extract, clean, and combine the data, and then it is loaded into a postgreSQL database.

data-analysis data-science jupyter-notebook numpy pandas postgresql postgresql-database python sqlalchemy

Last synced: 11 Apr 2026

https://github.com/davgiles/ML-Projects

This repository contains my projects from the Data Science & Business Analytics Post-Graduate Program through UT Austin.

eda matplotlib numpy pandas python scikit-learn scipy seaborn visualization xgboost

Last synced: 20 Jan 2026

https://github.com/makoczoro/credit-default-risk-analysis-eda

This repository contains the detailed EDA Analysis of Home Credit Group Dataset. The analysis aims to find demographic and financial factors associated with higher or lower default risks, providing actionable insights for risk mitigation and improved lending practices

bivariate-analysis correlation-analysis data-preprocessing exploratory-data-analysis exploratory-data-visualizations matplotlib numpy pandas seaborn univariate-analysis

Last synced: 20 May 2026

https://github.com/pzuehlke/ml-with-numpy

Basic ML algorithms using only NumPy

deep-learning machinelearning numpy

Last synced: 20 May 2026

https://github.com/genndy007/ad_python_labs

Analysis of Data Laboratory Works of Sikorsky KPI, 3 year 2 semester

jupyter-notebook matplotlib numpy pandas python scipy seaborn sklearn

Last synced: 14 Apr 2026

https://github.com/rahulraimau/credit-card-fraud-detection

This project develops a real-time credit card fraud detection system using the Kaggle Credit Card Fraud Detection Dataset, containing 284,807 transactions with 31 features (Time, Amount, V1-V28, Class).

imbalanced-learn- matplotlib numpy pandas python pytorch scikit-learn- tensorflow

Last synced: 08 Apr 2026

https://github.com/udayraj04/facialemotion_musicrecommendations

A web application that detects facial emotions via webcam and recommends music tracks that match the user’s mood in real-time. Built with Flask, TensorFlow/Keras, OpenCV, and a React frontend.

cnn flask machine-learning numpy opencv pil reactjs tensorflow

Last synced: 08 Apr 2026

https://github.com/9keystrokes/photorealistic-person-scene-composition

This computer vision project implements an advanced image processing pipeline to seamlessly integrate a person into any background scene with photorealistic results.

numpy opencv python rembg skimage

Last synced: 04 May 2026

https://github.com/sridharyadav07/ai--powered-task-management-system

An intelligent Task Management System that integrates Sentiment Analysis, Task Optimization, and Forecasting to streamline project and task handling. This AI-powered tool is designed to assist teams and project managers in making data-driven decisions by understanding emotional context, forecasting productivity, and optimizing workload distribution

arima flask joblib jupyter-notebook naive-bayes-classifier nltk numpy pandas pickle-file python randomforestregressor scikit-learn stopwords-removal streamlit tfidf-vectorizer

Last synced: 08 Apr 2026

https://github.com/s-h-i-v-i-s/bangalore-housing-price-prediction

A Machine Learning Project to predict Bangalore House Prices.

jupyter-notebook machine-learning matplotlib numpy pandas python pythonflask sklearn

Last synced: 08 Apr 2026

https://github.com/chea-young/financial-statistics-practice-using-python

파이썬을 활용한 금융통계실습 정리 repository (matplotlib, numpy, pandas, seaborn) 😀

matplotlib numpy pandas seaborn yfinance

Last synced: 11 May 2026

https://github.com/rahulchouhan1/car-theft-data-in-india

This project analyzes car theft incidents in India using Python and data visualization techniques. It provides insights into theft trends, frequently stolen car models, recovery statistics, and the impact of security measures like GPS and CCTV.

matplotlib numpy pandas python

Last synced: 13 Apr 2026

https://github.com/annygutierrez/ai-chatbot-python

Deep learning chatbot: Chatbot model to make predictions and chat back and forth with our user.

nltk numpy tensorflow tflearn

Last synced: 06 May 2026

https://github.com/shubhs27/snap-n-solve

A real-time Sudoku puzzle solver that uses computer vision and machine learning to detect, solve, and overlay solutions on Sudoku puzzles captured through your webcam.

keras numpy opencv python sudoku-scanner sudoku-solver tensorflow

Last synced: 07 Apr 2026

https://github.com/harrisonwelch/pythondatascience

Repo of code from the linked-in lesson "Python: Data Analysis"

data data-science matplotlib notes numpy python tutorial

Last synced: 12 Apr 2026

https://github.com/audy21/datacamp

Learning portfolio documenting my progress, while taking Data Analyst & Data Science certifications from DataCamp.

data-analysis data-science machine-learning matplotlib numpy pandas python scikit-learn seaborn

Last synced: 11 Apr 2026

https://github.com/okoge-kaz/deep_learning

O'REILY ゼロから作るDeep Learning

deep-learning neural-network numpy pandas perce python3

Last synced: 20 Apr 2026

https://github.com/hannah-aji/predictive-modeling-to-identify-high-risk-areas

Predictive Modeling to Identify High-Risk Areas for Targeted Interventions in Chicago, Illinois

machine-learning matplotlib numpy supervised-learning visualization

Last synced: 17 Jul 2025

https://github.com/sohamkul16/excel-sheet-comparator-by-python

The Excel Sheet Comparator project is a Python-based application designed to compare two Excel spreadsheets to identify differences between them. These differences are then highlighted in other downloadable .csv format.

html-css numpy pandas python

Last synced: 17 Jul 2025

https://github.com/amanyadav-07/customer-churn-prediction

Machine Learning project to predict customer churn using Logistic Regression, Random Forest, and XGBoost. Includes data preprocessing, feature engineering, SMOTE balancing, model training, evaluation, and business insights.

accuracy-metrics data-analysis data-visualization logistic-regression machine-learning matplotlib numpy pandas python3 random-forest-classifier seaborn sklearn xgboost-classifier

Last synced: 11 Apr 2026

https://github.com/dmarks84/coursework_capstone_spacex_predictions

Final Project for IBM Data Science Professional Certificate -- Applied all skills and methods utilized in the series of courses for this certification to predict the success of SpaceX landings; issued full report to stakeholders

api classification dash eda folium linear-algebra matplotlib mysql numpy pandas plotly probability python seaborn sql statistics supervised-ml technical-writing web-scraping

Last synced: 08 Apr 2026