An open API service indexing awesome lists of open source software.

NumPy

NumPy is an open source library for the Python programming language, adding support for large, multidimensional arrays, and matrices, along with a large collection of high-level mathematical functions to operate on these arrays.

https://github.com/omarsaad21/credit-train-data-science-project

This a full web application to predict the credit score of clients plus I did many visulizations to express many insights in chart

eda matplotlib ml numpy pandas python sklearn streamlit-webapp

Last synced: 09 Apr 2026

https://github.com/stephnna/my_image_classifier

A deep learning-based image classifier built with PyTorch, designed to identify various flower species using architectures like VGG16, ResNet101, and AlexNet. This project leverages transfer learning for improved performance, checkpointing for seamless training, and supports both CPU and GPU devices. Perfect for exploring deep learning models and i

matplotlib numpy python pytorch

Last synced: 11 May 2026

https://github.com/alphacrypto246/grape-quality-prediction

The Grape Quality Prediction project uses machine learning to predict the quality of grapes based on chemical properties like acidity, sugar content, and alcohol levels. It applies regression models to forecast the quality score, helping in wine production and quality assessment.

machine-learning numpy pandas scikit-learn scikitlearn-machine-learning

Last synced: 19 Apr 2026

https://github.com/crispengari/effects-of-covid19-on-trade

This repository visulises the effects of covid19 on trade within these years:

100daysofcode datascience datascience-machinelearning matplotlib matplotlib-pyplot numpy pandas python python3

Last synced: 02 May 2026

https://github.com/sutterseba/des-python

A simple DES implementation in Python

cryptography numpy python

Last synced: 15 Jun 2026

https://github.com/caioandrian/data-analyst-procon

Análise dos dados do Procon, período de 2013 à 2016.

dataset numpy pandas python

Last synced: 11 Apr 2026

https://github.com/shrutiii1109/diwali-sales-analysis-through-python

Data analysis project on Diwali sales using Python (Pandas, NumPy, Matplotlib, Seaborn). The goal is to analyze customer behavior, identify sales trends, and provide insights to improve marketing and business strategies.

data-analysis jupyer-notebook matplotlib numpy pandas python seaborn

Last synced: 30 Apr 2026

https://github.com/v41bh4vr4jput/data-analysis-with-python

This repository is a comprehensive collection of data analysis projects and tutorials using Python's most powerful libraries: NumPy, Pandas, Seaborn, and Matplotlib. It is designed to help you explore, clean, visualize, and analyze data efficiently.

api data data-analysis data-visualization matplotlib numpy pandas python sakila-db seaborn

Last synced: 09 Apr 2026

https://github.com/sahilmaurya28/youtube-data-analysis

YouTube Data Analysis using Python — uncovering trends, engagement patterns, and correlations between likes, comments, views, and categories to understand what drives content success.

analysis data-analysis data-visualization matplotlib-pyplot numpy pandas portfolio-project python seaborn youtube

Last synced: 13 Apr 2026

https://github.com/om-kanabar/sciencefair2025

This is my project for Chicago Public School's student science fair 2025.

chicago-public-schools matplotlib-pyplot neural-networks numpy python science-fair tensorflow

Last synced: 03 Nov 2025

https://github.com/emredemirbas/google-playstore-eda

Exploratory Data Analysis (EDA) of the Google Play Store dataset — examining trends in app ratings, categories, pricing, and user engagement using Python and LaTeX.

exploratory-data-analysis matplotlib numpy pandas python seaborn

Last synced: 03 May 2026

https://github.com/nskamaleshmani/exoseeker

🌌 Discover and analyze exoplanets with ExoSeeker, a tool designed for efficient world-hunting using Python and data visualization techniques.

exoplanet-transits exoplanets gradient-boosting machine-learning matplotlib nasa nasa-data nasa-spaceapps-challenge numpy pandas perceptron-neural-networks random-forest streamlit

Last synced: 03 Nov 2025

https://github.com/rajesh9943/web-scraping-analysis-of-top-us-company-revenue-growth-in-2023

Explore the landscape of US business growth in 2023 with our dynamic project, 'Web Scraping for US 2023 Revenue Growth.' Utilizing advanced web scraping techniques, we unveil insights into the top companies driving economic expansion.

cleaning-data data data-analysis data-visualization manipulation numpy pandas pre-fill

Last synced: 16 Aug 2025

https://github.com/shreedata/data-analysis-using-python-libraries-

The COVID-19 pandemic has significantly impacted India, necessitating a detailed analysis of the virus’s spread within the country. In this project, we explore an India-specific COVID-19 dataset, leveraging Python libraries such as Pandas, NumPy, Matplotlib, and Seaborn.

covid-19 data data-cleaning data-visualization datana kaggle-dataset matplotlib numpy pandas-python python3 pythonlibrarires scikit seaborn

Last synced: 28 Mar 2025

https://github.com/rahul-404/full_stack_data_science_masters

Welcome to the repository for the course "Full Stack Data Science Masters". This repository is designed to accompany the course and provide resources, exercises, and projects related to the study of data science techniques.

computer-vision data-science database deep-learning exploratory-data-analysis flask machine-learning natural-language-processing numpy pandas python statistics time-series visualization

Last synced: 10 Apr 2026

https://github.com/utkarsh251106/tracking-with-yolo

This project uses YOLOv8 and DeepSORT to detect and track children and adults in video streams. It assigns unique IDs, handles re-tracking after occlusions, and outputs an annotated video with labeled bounding boxes.

computer-vision deep-learning deepsort machine-learning numpy opencv python torch ultralytics yolov8

Last synced: 30 Jan 2026

https://github.com/ramyacp14/salesforecasting

Forecasts future sales for a retail company using time series analysis with Facebook Prophet. The project involves data preprocessing, exploratory data analysis (EDA), and forecasting with holiday effects considered.

data-preprocessing data-visualization exploratory-data-analysis fbprophet machine-learning matplotlib model-evaluation numpy pandas python seaborn time-series-forecasting

Last synced: 06 Apr 2026

https://github.com/RedInfinityPro/RedditBot

Rating: (7/10) This script collects, preprocesses, trains models, processes images, and handles files, handling data from Reddit, image processing, and file handling.

autocorrect bytesio concurrent nltk numpy openpyxl pandas pil praw random re requests secrets sklearn string tensorflow time urllib

Last synced: 30 Sep 2025

https://github.com/isk-daniar/-contrast-improvements-on-pil

Contrast improvements on PIL

numpy pillow python

Last synced: 10 Jun 2026

https://github.com/shibbir-ahmad24/a-data-driven-approach-to-food-security-and-supermarket-accessibility

A Data-Driven Approach to Food Security and Supermarket Accessibility

data-analysis matplotlib numpy pandas python3 seaborn

Last synced: 05 Apr 2025

https://github.com/nishthasharma-22/binary-black-hole-merger-gravitational-waves-simulation

Repository of all astronomy related projects, including: Gravitational Waves graph from binary black hole merger

astrophysics blackhole-merger gravity-simulator matplotlib numpy python scipy

Last synced: 05 May 2026

https://github.com/BiocPy/mopsy

Matrix operations

matrix numpy scipy

Last synced: 03 Oct 2025

https://github.com/thavinduushan/object-detection

Application developed for real-time object detection and counting utilizing COCO dataset

numpy opencv python

Last synced: 15 Apr 2026

https://github.com/debjyotisaha/data-analytics-projects-phase-2

Developed and showcased various data analytics projects, including data preprocessing, exploratory data analysis, and visualization. Utilized tools such as Python, Pandas, NumPy, and Matplotlib to derive actionable insights and demonstrate problem-solving capabilities.

data-analysis data-preprocessing eda matplotlib numpy pandas python seaborn

Last synced: 09 Apr 2026

https://github.com/apuravdivekar2032/real-estate-price-prediction

A real estate price prediction website developed using Python with Numpy and Pandas for data cleaning Matplotlib for data visualization and scikit-learn for model building, featuring a Flask server to handle HTTP requests and integrates a user-friendly UI built with HTML, CSS, and JavaScript

flask html-css-javascript jupyter-notebook matplotlib numpy pandas pycharm python3 sklearn vscode

Last synced: 20 Jan 2026

https://github.com/alejandrolara11/desafio_latam_introduccion_analisis_de_datos

Repositorio del curso "Introducción al Análisis de Datos" de Desafío Latam. Ejercicios prácticos realizados durante el curso, enfocados en análisis de datos con Python, Pandas, y visualización básica.

data-analysis data-science data-visualization matplotlib numpy pandas python seaborn statsmodels

Last synced: 29 Apr 2026

https://github.com/vedanty3/bulldozer-price-prediction

A machine learning project aiming to build a machine learning model which could predict the sales price of bulldozer.

andrew-ng-machine-learning ensemble-machine-learning gridsearchcv jupyter-notebook machine-learning matplotlib numpy pandas python randomforestregressor randomizedsearchcv scikit-learn ztm

Last synced: 05 Apr 2026

https://github.com/hawkharsh1/house-price-pridiction-model-using-ann

A deep learning-based regression model built using Artificial Neural Networks (ANN) in PyTorch to predict house prices from structured data. This project demonstrates the application of machine learning and deep learning techniques for solving real-world problems in the housing domain.

artificial-neural-networks deep-neural-networks machine-learning numpy pandas python3 pytorch scikit-learn

Last synced: 08 Apr 2026

https://github.com/jai0212/gpt-rnn-poetry-generator

A generative pre-trained transformer (GPT) using a recurrent neural network (RNN) to generate poetry with customizable length and creativity index.

ai csv-files gpt machine-learning neural-networks nlp-machine-learning numpy pandas-library poetry-generator rnn-tensorflow training-dataset

Last synced: 12 Aug 2025

https://github.com/mchenryspagg/wrangle-and-analyze-data

This project which is known as 'wrangle and analyze data' involves the wrangling of WeRateDogs twitter archive data from the period of 2015 to 2017

api data dataanalysis datacollection datawrangling datetime json numpy os pandas pil python requests tweepy-api visualization

Last synced: 09 Apr 2026

https://github.com/ijproject/calculate-absorption-rate

大気を構成する分子ごとの赤外線吸収割合を計算するプログラム。

numpy python

Last synced: 11 May 2026

https://github.com/shibam120302/dog_breed_prediction_app

The Dog Scanner app will identify your dog's breed reliably in just a few seconds! Besides taking a picture, you can also record a video or upload an image from your gallery. Got a mixed breed? No problem, the Dog Scanner app also recognizes mixed breeds!

keras numpy opencv python

Last synced: 06 Apr 2026

https://github.com/dwija12903/mentorness-internship

Developed and applied technical skills in areas such as programming languages, data analysis, and machine learning methodologies.

matplotlib numpy pandas python scikit-learn

Last synced: 08 Apr 2026

https://github.com/cyberlument/opencv-colormasking-red-

This mini project opencv is for educational purpose.

numpy opencv opencv-python pycharm vscode

Last synced: 20 Jan 2026

https://github.com/priyasingh26/financial_document-data_extraction

This project extracts key information from financial documents like invoices and receipts using text recognition. It processes images, classifies documents, and extracts data, which is then stored in a CSV file. The aim is to automate data collection from scanned documents, reducing manual work and increasing accuracy.

data-extraction numpy ocr pandas pillow preprocessing pytesseract-ocr python sklearn torch transformers

Last synced: 08 Apr 2026

https://github.com/caefleury/cis-ieee-difusion-model

Repositório teste para códigos do projeto de Modelo de Difusão do branch CIS-IEEE

docker docker-compose numpy python3 tensorflow

Last synced: 08 Apr 2026

https://github.com/prashhhant213/exploratory-data-analysis-for-multinational-retail-corporation

Analysis via CLT and Visualization on Multinational Retail Corporation's data to provide insights and recommendations to improve their userbase.

math matplotlib numpy pandas python scipy-stats seaborn stats

Last synced: 09 Apr 2026

https://github.com/manuethomas/traffic-accident-analysis-us

The project provides a comprehensive analysis of traffic accidents in the US from 2016-2023 aiming to identify key factors contributing to accidents. The analysis also focussed on finding features that could be used to develop a predictive model

exploratory-data-analysis feature-engineering feature-selection matpllotlib numpy pandas seaborn

Last synced: 20 Mar 2025

https://github.com/chawthinn/car-price-prediction-regression-ml

This project predicts used car prices using regression models including Linear, Ridge, Random Forest, XGBoost, and LightGBM. It covers preprocessing, EDA, model evaluation, hyperparameter tuning, and model persistence using Scikit-learn and related libraries.

car-price-prediction lgbmregressor linearregression numpy pandas python scikit-learn xgbregressor

Last synced: 08 Apr 2026

https://github.com/anand-sony/machine_learning

Machine Learning codes and concepts, including algorithms like K-means, PCA, and Linear Regression, with Python libraries (NumPy, Pandas, Matplotlib).

artificial-neural-networks kmeans-clustering knn linear-regression machine-learning matplotlib normalization numpy pandas pca python

Last synced: 30 Apr 2025

https://github.com/poziloi/-image_processing-

Методы и алгоритмы цифровой обработки изображений, задания

cv2 matplotlib numpy python

Last synced: 20 Jan 2026

https://github.com/lmizner/codecademy_life_expectancy

Calculate quartiles, quantiles, and the inter-quartile range (IQR) for a variable

histogram jupyter-notebook matplotlib-pyplot numpy pandas python quantiles quartiles

Last synced: 09 Apr 2026

https://github.com/babagata/praktikum-4-data-analysis

Data analysis for course "Physics laboratory IV"

matplotlib numpy pandas scipy

Last synced: 28 Apr 2026

https://github.com/michelenana/projet-7

PRODUISEZ UNE ETUDE DE MARCHE AVEC R OU PYTHON

acp boxplots cah kmeans matplotlib numpy pandas python scipy seaborn sklearn

Last synced: 08 Apr 2026

https://github.com/mukhtarmid/data-science

This repository is for the knowledge of data science.

datascience eda numpy pandas

Last synced: 10 May 2026

https://github.com/chaitanyac22/cross_platform_product_mapping_algorithm_for_products

This repository contains a product ID mapping solution using TF-IDF vectorizer for weighted text vectors, Facebook AI Similarity Search (FAISS) for coarse filtering with cosine similarity, and Levenshtein distance for refined matching against the Blinkit catalog. Achieved 11.45% match for Zepto and 11.48% for Instamart.

exploratory-data-analysis faiss levenshtein-distance nlp numpy pandas similarity-search tf-idf-vectorizer

Last synced: 20 Mar 2025

https://github.com/ansh2709/customer-segmentation-ml-project

Project segregates the customers on the basis of their spending score and annual income using K-Means Clustering that is a part of unsupervised learning

clustering-algorithm k-means-clustering machine-learning matplotlib-pyplot numpy pandas python unsupervised-machine-learning wcss

Last synced: 01 May 2026

https://github.com/asut00/python-piscine_42ai

Python Bootcamp: A one-week intensive course with 42AI at École 42, covering Python fundamentals, data manipulation, and introductory AI concepts.

matplotlib numpy pandas python

Last synced: 07 May 2026

https://github.com/haydencordeiro/terafeed

Terafeed - Addressing Zero Hunger in Africa (Sustainability Goal SDG 2)

javscript numpy pandas powerbi python scikit-learn tableau vuejs

Last synced: 08 Apr 2026

https://github.com/rafali25/perceptron-algorithm

A simple implementation of the Perceptron algorithm using numpy. This project demonstrates how to classify data points by iteratively updating weights and biases based on misclassified samples. Perfect for understanding the fundamentals of linear classification!

machine-learning numpy perceptron-neural-networks pycharm-ide python

Last synced: 09 Apr 2026

https://github.com/chandkund/image-classification-using-the-mnist-dataset

Image Classification using the MNIST dataset. This project leverages a Convolutional Neural Network (CNN) to recognize and classify handwritten digits with high accuracy. Includes data preprocessing, model architecture, and evaluation. Explore the code and results here!

computer-vision data-science machine-learning matplotlib numpy pandas python

Last synced: 08 Apr 2026

https://github.com/bearddan2000/python-web-3d-matplotlib-stem-graph

A demo of creating a 3d scatter parametric curve and line graph.

3d graph matplotlib numpy pandas python stem web

Last synced: 09 Apr 2026

https://github.com/allenvox/neural

Workspace for Neural Networks class

jupyter-notebook neural-networks numpy python pytorch tensorflow

Last synced: 02 Jan 2026

https://github.com/zborovskaanna/grosery_store_sales_analysis

Python data analysis project. Analysis of grocery store sales using visualizations and reporting in Tableau

data-analysis data-visualization matplotlib numpy pandas python seaborn tableau

Last synced: 08 Apr 2026

https://github.com/raghavendranhp/airbnb-data-analysis

The Airbnb Data Analysis project focuses on analyzing Airbnb data using MongoDB Atlas, Python scripting, data preprocessing, visualization, and interactive geospatial insights. We delve into the world of property management and tourism to uncover trends, pricing variations, and location-based analysis.

eda jupyter-notebook mongodb numpy pandas powerbi preprocessing

Last synced: 08 Apr 2026

https://github.com/5hraddha/zyfra-gold-recovery-prediction

Zyfra is a pioneering developer of efficiency solutions for heavy industries & is aiming to take help of machine learning to optimize the efficiency in Gold Ore processing

decisiontreeregressor dummyregressor linearregression numpy pandas randomforestregressor scipy seaborn smape supervised-learning

Last synced: 08 Apr 2026

https://github.com/abdelrahman-amen/attendance_system

An AI-driven system leveraging real-time face recognition to automate attendance, ensuring accuracy, scalability, and seamless CSV logging of names and timestamps.

cmake cv2 dlib face-recognition numpy python

Last synced: 20 May 2026

https://github.com/rakibhhridoy/covid19analysisindashboard-tableau

Covid19 dashboard analysis of world,north america,south east Asia and their characteristics upon pandemic. Some interesting statistics is shown by the data. The increase rate make effect on death and recover rate quite periodic. Simulating those changes make more interactive.

covid-19 dashboard data-processing dataviz numpy pandas python statistics tableau tableau-dashboards

Last synced: 06 May 2026

https://github.com/sultanazhari/factors-that-affect-vehicle-prices.

Crankshaft List Company want to know what the factors that affect to vehicles prices. As a data Analyst we're giving list of factor that affect with vehicle prices.

matplotlib-pyplot numpy python3 seaborn vehicles

Last synced: 11 May 2026

https://github.com/abhay-kanwasi/ml-learning

Discover a repository brimming with machine learning insights and projects. Dive into comprehensive documentation of ML concepts, algorithms, and techniques. Explore hands-on projects spanning sentiment analysis, image classification, recommendation systems, and more.

ml numpy pandas python recommender-system sklearn streamlit-webapp themoviedb-api

Last synced: 02 Apr 2026

https://github.com/hotequil/fish_classification

Identify the fish specie with Machine Learning.

classification fish keras numpy python tensorflow

Last synced: 27 Mar 2025

https://github.com/swat1563/recommendation-system

This repository features a recommendation system and analytics engine using datasets on users, organizations, contents, contacts, events, and recommendations. It includes data preprocessing, building a recommendation system, and creating visual reports with Power BI.

analytics data-analysis data-visualization engine kaggle numpy pandas powerbi powerbi-dashboards powerbi-desktop powerbi-reports python recommendation-engine recommendation-system recommender-systems scikit-learn scipy

Last synced: 07 Jan 2026

https://github.com/armandomtz05/optikit

Solution of the paraxial wave equations using different coordinates systems

fourier-transform holography numerical-methods numpy optics-code optics-simulation pillow

Last synced: 02 Feb 2026

https://github.com/aysh2603/twitter-sentiment-analysis

The Twitter Sentiment Analysis project employs Natural Language Processing (NLP) techniques to classify tweets into positive or negative sentiments. By analyzing the tone of tweets, this project provides insights into public sentiment on various topics.

hyperparameter-tuning nlp-machine-learning numpy pandas python3 scikit-learn

Last synced: 13 Jul 2025

https://github.com/danwild/bike-share-prediction

Predict bike sharing time-series with numpy for Deep Learning

artificial-intelligence artificial-neural-networks deep-learning neural-network numpy

Last synced: 16 May 2026

https://github.com/adnaen/water-quality-analysis

End-to-end Ground Water Quality Prediction app using Streamlit.

classification-model machine-learing matplotlib notebook numpy pandas plotly python sklearn streamlit

Last synced: 30 Dec 2025

https://github.com/vinit-source/csl7382-medical-image-clustering-assignment.py

The IPython notebook contains the questions as well as the related code. Only numpy has been used.

bioimage-analysis kmeans-clustering numpy slic spectral-clustering

Last synced: 22 May 2026

https://github.com/gitw1n/japandemographicsinsights

JapanDemographicsInsights is an ongoing project aimed at providing comprehensive insights into the demographic trends of Japan. The project is focused on analyzing and visualizing population data, age distribution, migration patterns, birth and death rates, and other related demographic factors that shape the current and future landscape of Japan.

indevelopment jypyternotebook numpy python3 scientific-visualization

Last synced: 11 May 2026

https://github.com/davityak03/sentence-paraphraser-checker-using-transformers

This Jupyter Notebook implements a tool to check whether two sentences are paraphrases by analyzing their semantic similarity using NLP techniques. It provides a similarity score and a binary decision to indicate if the sentences are paraphrases.

keras nlp nltk numpy python tensorflow tokenizer transformers

Last synced: 02 Jan 2026

https://github.com/rdvdev2/tf-test

Mostres de ML i IA amb TensorFlow basades en els tutorials oficials per al PR

numpy python research-project tensorflow tensorflow-tutorials

Last synced: 22 Mar 2025

https://github.com/mikma03/simulation_modeling

Simulation models using Python. Practical use of Python in real-world examples and additional resourses.

matplotlib numpy pandas portfolio python simulation stocks

Last synced: 08 May 2026

https://github.com/omraj0/covid19-data-analysis

Analysis of COVID-19 infection rates in various countries, correlating them with factors such as GDP per capita and social support.

covid-19 google-colab matplotlib numpy pandas python

Last synced: 19 Apr 2026

https://github.com/raghulrajn/machine-learning-d-r-y

This repository contains quick python scripts that are repeatedly used in EDA on dataset

data-science numpy pandas python

Last synced: 09 Apr 2026

https://github.com/yash-3-bit/human-activity-recognition-using-smartphone-data

Human Activity Recognition (HAR) Using Smartphone Data This project leverages smartphone sensor data to recognize human activities such as walking, running, sitting, and standing.

numpy pandas python scikitlearn-machine-learning seaborn

Last synced: 09 Apr 2026

https://github.com/pranjalshivhare06/medical-ensurance-charge-predictor

The Insurance Price Predictor is a machine learning project designed to predict insurance costs based on various input features. The project leverages four different algorithms, with XGBoost emerging as the most accurate and efficient model.

fastapi machine-learning numpy pandas xgboost-classifier

Last synced: 19 Apr 2026

https://github.com/anmamun0/data-analysis-home-cleaning-services

This repository contains the analysis and visualization of data from a home cleaning services dataset. The project provides valuable insights into revenue generation, customer trends, and regional performance, helping businesses make data-driven decisions.

matplotlib numpy pandas

Last synced: 05 Mar 2025

https://github.com/pabs-code/face-detection-using-haar-cascade-classifier

This is a Streamlit-based face detection application that uses the Haar Cascade classifier to detect faces in uploaded images.

face-detection haar-cascade-classifier numpy opencv python streamlit

Last synced: 08 Apr 2026

https://github.com/vyjayanthipolapragada/image_classifier_model_hotdog

Building an Image classifier model to train and test a dataset and classify the given images into hotdog and not-hotdog.

artificial-intelligence dataset image-classification image-processing machine-learning matplotlib neural-networks numpy pandas python pytorch tensor torchvision transfer-learning

Last synced: 08 Apr 2026