An open API service indexing awesome lists of open source software.

NumPy

NumPy is an open source library for the Python programming language that adds support for large, multidimensional arrays and matrices, along with a large collection of high-level mathematical functions to operate on these arrays.

https://github.com/harmanveer-2546/tweets-cleaning-with-python

Twitter is one of the most used data sources for data analysis, because its data is open and free to collect unless you subscribe to the paid version. It is also fairly simple to collect data from it.

filtering function-calling nlptk numpy os pandas punctuation python re removing-links tokenization tweet-cleaning twitter-data-analysis twitter-sentiment-analysis

Last synced: 10 Apr 2026

https://github.com/omdoshi13/pricing-of-laptops-using-ml

Data analysis, machine learning model training, and model evaluation and refinement for the Pricing of Laptops dataset.

data-analysis data-analysis-project datascience google-colab jupyter-notebook machine-learning matplotlib model-evaluation model-refinement numpy pandas python scikit-learn

Last synced: 09 Apr 2026

https://github.com/mandeep04sharma/ev-cars-analysis---python

This project dives deep into electric vehicle specifications and performance using Python. It involves data cleaning, filtering, statistical analysis, outlier detection, and hypothesis testing to uncover meaningful insights.

matplotlib numpy pandas python scipy

Last synced: 09 Apr 2026

https://github.com/alphacrypto246/air-quality-prediction

The Air Pollution Prediction project forecasts pollution levels using the Air Quality UCI dataset, including variables like CO, NOx, temperature, and humidity. It analyzes temporal and environmental factors to predict air quality trends.

machine-learning numpy pandas scikitlearn-machine-learning sckiit-learn

Last synced: 10 May 2026

https://github.com/lingumd/school_district_analysis

Updated test score data and school district analysis using Python.

averages binning jupyterlab numpy pandas pandas-loc python

Last synced: 13 Apr 2026

https://github.com/csengupta1101/rock-paper-scissor-game

Rock Paper Scissors game built with Python 3, using Jupyter Notebook as the IDE. Code file and README included.

game if-else-statements numpy python python3 random

Last synced: 11 May 2026

https://github.com/anjasfedo/data-analysis

Repo to Explore Data Analysis

data-analysis numpy

Last synced: 13 Apr 2026

https://github.com/elissorokin/data-analyst-portfolio

This is a repository where I demonstrate my skills, share projects, and track my progress in data analysis and Data Science.

ab-testing data data-analysis datalense matplotlib numpy pandas plotly portfolio postgresql python scipy seaborn sql statistical-analysis

Last synced: 09 Apr 2026

https://github.com/RedInfinityPro/CircuitPythonProjects

Rating: (4/10) This Python script simulates circuits using PySpice, importing libraries like NumPy and Matplotlib. It defines voltage divider, diode circuits, subcircuit definitions, DC sweep analysis, and simulates each circuit using the simulator.

matplotlib numpy os pyspice sys

Last synced: 30 Sep 2025

https://github.com/sudip-13/plant-disease-detection

Detects plant diseases from leaves using machine learning, enabling early intervention.

cloudinary css docker jwt-token keras matplotlib-pyplot mongodb nodejs numpy reactjs tensorflow tensorflow-serving

Last synced: 09 Apr 2026

https://github.com/vedanty3/supermarket-sales-data-analysis

This project uses data visualization techniques (with pandas and matplotlib) to explore different aspects of three months of supermarket sales data.

data-analysis data-science jupyter-notebook matplotlib numpy pandas python

Last synced: 08 May 2026

https://github.com/fazal-coder/data-science-business-analytics-tsf

I have completed two tasks in this repository: task 3 and task 4.

matplotlib numpy pandas python scipy

Last synced: 09 Apr 2026

https://github.com/sahaavi/color-detection

Color detection is the process of detecting the name of any color. For humans this is an extremely easy task, but for computers it is not straightforward. Human eyes and brains work together to translate light into color: light receptors in our eyes transmit the signal to the brain, which then recognizes the color. Since childhood, we have mapped certain lights to their color names. We will use a somewhat similar strategy to detect color names.

color-detection color-detector numpy opencv-python pandas

Last synced: 10 May 2026

https://github.com/yadavkaushal/datascience-e-commerce-shopping-details

This project analyzes customer purchase data, including details such as location, company, credit card usage, browser info, job roles, and purchase price. It explores patterns in payment methods, spending behavior, and online transactions. Using Pandas, Matplotlib, and Seaborn, we clean, analyze, and visualize key trends to derive actionable insights.

data datacleaning dataframe datapreprocessing dataset libraries matplotlib numpy pandas plots visulaization

Last synced: 06 May 2026

https://github.com/prowilliam/maze_generator

A maze generator with configurable dimensions that draws the solution of the maze as a red route and exports the result in PNG format.

numpy pillow python

Last synced: 28 Apr 2026

https://github.com/alex1iv/asr_ru_numbers

Automatic Speech Recognition (ASR) system for Russian digits

audio-processing librosa numpy speech-recognition tensorflow

Last synced: 13 Apr 2026

https://github.com/dulajkavinda/handwritten-recognition

🛢Handwritten Digit Recognition using OpenCV

added machinelearning numpy opencv python

Last synced: 09 Apr 2026

https://github.com/akku-1206/spotify_data_analysis

Spotify Tracks Analysis: this project performs an exploratory data analysis (EDA) on Spotify music data to uncover insights and correlations related to song features, popularity, and genre.

matplotlib numpy pandas python seaborn

Last synced: 13 Apr 2026

https://github.com/zvdy/movie_recommendation

Movie Recommendation Search Engine using Jupyter Notebooks, Pandas, NumPy, scikit-learn, and IPyWidgets

data-science jupyter-notebook machine-learning numpy pandas python scikit-learn

Last synced: 13 Apr 2026

https://github.com/paulo-santos-ds/machinelearninglogisticregression

Credit card fraud is a common form of fraud that affects millions of people around the world. It can have serious consequences, both for the victims and for the financial institutions that issue the credit cards.

math matplot numpy pandas python3 sklearn

Last synced: 13 Apr 2026

https://github.com/danish-jamil-01/image-dehazer

Remove the haze from the image using dark channel prior and guided filter

dark-channel-prior guided-filter guided-filtering ndimage numpy open-cv opencv opencv-python pillow python scipy skimage tkinter

Last synced: 09 Apr 2026

https://github.com/yuu-eguci/cognitive-services-trial

Try to play with Cognitive Services!! [Cognitive Services] [OpenCV] [Numpy]

cognitive-services dotenv numpy opencv-python pipenv python python3

Last synced: 05 Jul 2025

https://github.com/jessicahora/series_temporais

Time series analysis of Germany's energy consumption.

numpy pandas python time-series-analysis

Last synced: 07 Apr 2026

https://github.com/alphan26/optimal-logistics-locator

This is a project in which we estimate the biomass availability of places based on their index and determine the optimal preprocessing depot and biorefinery in Gujarat, India

numpy pandas python sckit-learn

Last synced: 04 May 2026

https://github.com/ondiekelijah/shadowcoach-formtracker

A computer vision-based workout form tracking application that uses AI to monitor exercise form, count repetitions, and provide real-time feedback.

mediapipe-pose numpy opencv-python

Last synced: 09 Apr 2025

https://github.com/asuquoaa/applied_data_science_with_python_study

Notes, code, and practicals on the Applied Data Science with Python specialization by UM

matplotlib numpy pandas seaborn

Last synced: 04 May 2026

https://github.com/saifalibaig/covid-19-infection-rate-analysis-using-python

Analysis of COVID-19 infection rates and the World Happiness Report to identify whether there is any relationship between infection rate and happiness

data-analysis data-visualization jupyter-notebook numpy pandas python3 sns

Last synced: 18 Apr 2026

https://github.com/bhanuprakashnani/scit-tasks

Tasks for the SCIT Workshop.

numpy python3

Last synced: 25 Apr 2026

https://github.com/vishal815/python-based-flight-data-scraping-automating-data-collection-for-analysis

This project is a Python-based web scraping script designed to extract flight details from the Yatra travel website. The extracted data includes flight pricing, origin, destination, departure and arrival times, and duration. The data is then saved into a CSV file for further analysis.

beautifulsoup flight-data-analysis flight-data-visualization numpy pandas pyhton pyplot scraping scraping-python seaborn selenium vishal-lazrus vishallazrus webscraping webscraping-projects

Last synced: 18 Apr 2026

https://github.com/max134/deepplate

A custom-trained YOLOv11-based license plate detector combined with PaddleOCR for accurate ANPR.

computer-vision cv2 deep-learning image-recognition licesnse-plate-recognition numpy object-detection ocr paddleocr real-time-processing sqlite3 ultralytics yolov11

Last synced: 13 Jul 2025

https://github.com/animesh-chourey/loan-classifier

Trained machine learning algorithms (Logistic Regression, KNN, SVM, Decision Tree) after performing visualization and preprocessing tasks on a loan dataset. Computed evaluation metrics such as F1-score, log loss, and Jaccard similarity score to assess the algorithms' performance.

decision-tree f1-score jaccard-similarity knn logistic-regression logloss matplotlib numpy pandas preprocessing-data svm

Last synced: 06 May 2026

https://github.com/itshyphen/mass-mailing-script

A simple mass mailing script that sends personalized emails to multiple addresses imported from a CSV file

numpy pandas python smtplib

Last synced: 13 Apr 2026

https://github.com/a-parida12/numericalmethods

Teaching Material for Workshop for Engineering Students

numpy ode pde python3

Last synced: 13 Apr 2026

https://github.com/jose-zothner-meyer/finance-individual-loans

A comprehensive analysis of a financial institution's loan portfolio using Python, focusing on data preprocessing, exploratory data analysis (EDA), and feature engineering.

credit data-science data-visualization eda exploratory-data-analysis finance jupyter-notebook matplotlib numpy pandas plotly python scipy seaborn

Last synced: 09 Apr 2026

https://github.com/jim-by/scrape_analysis_books

Scraping book data from the website books.toscrape.com and performing analysis on the collected data.

beautifulsoup json matplotlib numpy pandas python scipy scraping seaborn

Last synced: 09 Apr 2026

https://github.com/tnleite/loan-approval-prediction

This repository presents a predictive model for loan approval, focused on minimizing default risk. Using EDA and machine learning algorithms (Random Forest, XGBoost), we tune the threshold to maximize recall for defaulters, contributing to efficient risk management.

classification-algorithm data-science exploratory-data-analysis machine-learning-algorithms machine-learning-models matplotlib numpy scikit-learn scipy seaborn xgboost-classifier

Last synced: 13 Apr 2026

https://github.com/amir0234-afk/housing-price-prediction-dnn

Keras + NumPy baselines for housing price regression

keras numpy regression

Last synced: 13 Apr 2026

https://github.com/nischalkshaj/image-identification

This is a repository for AI image training for beginners.

express mongodb node numpy pillow python3 pytorch reactjs

Last synced: 09 Apr 2026

https://github.com/lorenzorottigni/ml-missing-values

Machine Learning course of Piero Savastano 5: ColumnTransformer, SimpleImputer, numpy

column-transformer machine-learning missing-values numpy simple-imputer

Last synced: 05 May 2026

https://github.com/sultanazhari/the-effect-of-weather-on-travel-frequency

The Zuber company needs to understand passenger preferences and the effect of external factors on trips. As data analysts, we provide actual data on what affects travel frequency.

matplotlib-pyplot numpy python3 scipy seaborn

Last synced: 03 Aug 2025

https://github.com/harmanveer-2546/house-price-prediction-

We have all experienced a time when we had to look for a new house to buy. The journey then involves a lot of fraud, negotiating deals, researching local areas, and so on. To deal with these kinds of issues, I prepared a machine learning based model, trained on the House Price Prediction Dataset.

catboost-classifier linear-regression machine-learning matplotlib numpy pandas python random-forest seaborn svm

Last synced: 09 Apr 2026

https://github.com/azaz9026/eda

Exploratory Data Analysis (EDA) refers to the method of studying and exploring data sets to understand their main characteristics, discover patterns, locate outliers, and identify relationships between variables. EDA is normally carried out as a preliminary step before undertaking more formal statistical analyses or modeling.

data-cleaning data-visualization encoding machine-learning matplotlib numpy pandas plotly python3 seaborn sklearn-library

Last synced: 15 Apr 2026

https://github.com/raeinlayeghpour/test-bench-for-signal-validation-simulated-3-phase-system

Automated Project - Test Bench for Signal Validation (Simulated 3-Phase System)

numpy pandas pycharm pytest python

Last synced: 13 Apr 2026

https://github.com/chandkund/wine-quality-prediction

This project predicts wine quality based on physicochemical properties using machine learning models. By leveraging Random Forest Classifier, Logistic Regression, and SVM, the goal is to classify wines into quality categories and uncover the key factors that influence wine quality.

logistic-regression matplotlib numpy pandas-python random-forest-classifier svm-classifier

Last synced: 01 May 2026

https://github.com/harmanveer-2546/credit-card-fraud-detection

The Credit Card Fraud Detection Problem involves modeling past credit card transactions with knowledge of the ones that turned out to be fraudulent. This model is then used to identify whether a new transaction is fraudulent or not. Our aim here is to detect 100% of the fraudulent transactions while minimizing the incorrect fraud classifications.

ann catboost-classifier eda lightgbm machine-learning matplotlib neural-network numpy pandas python random-forest seaborn xbg

Last synced: 13 Apr 2026

https://github.com/chmohit91/introduction-to-python

A comprehensive collection of Python programming resources, from basics to advanced concepts. Built for learners, by learners

matplotlib numpy pandas plotly python python-library python3 seaborn

Last synced: 13 Apr 2026

https://github.com/avs123a/image-edge-detection-demo

Demo of image edge detection using different operators, norms, and custom masks

demo edge-computing edge-detection image-analysis image-manipulation image-processing jupyter-notebook numpy python python27 python3

Last synced: 04 Feb 2026

https://github.com/n1k1f0rm/ml-predicts

A place where you can see, in a transparent way, how ML algorithms work

machine-learning ml numpy python

Last synced: 21 Apr 2026

https://github.com/kumartusha/Data_Scientist-AI-ML-_Training_Anudip_Foundation

Anudip Foundation Data Analytics Training 📊 A repository containing projects, assignments, and coursework from the Anudip Foundation Data Analytics training program. This includes work on: 🧹 Data Cleaning & Preprocessing 📉 Data Visualization & EDA 📊 Statistical Analysis

excel matplotlib numpy pandas powerbi python scipy sql

Last synced: 27 Sep 2025

https://github.com/falakjatin/face-recognition

This is my final-year engineering project. It might not work because it's not maintained.

haarcascade-frontalface numpy opencv pandas python3

Last synced: 13 Apr 2026

https://github.com/dbriane208/python-for-data-science

Machine Learning and Data Science repository. Love crafting Machine Learning models.

data-analysis data-science data-visualization machine-learning numpy pandas python seaborn

Last synced: 13 Apr 2026

https://github.com/ifte-13/early-stage-brain-stroke-detection

Predictive Analysis & Early Detection of Brain stroke using Machine Learning Algorithm

decision-tree imbalanced-learn knn matplotlib numpy pandas random-forest scikit-learn seaborn

Last synced: 06 Jul 2025

https://github.com/manishkumarpatel07/heartattack_risk_prediction

"Heart Attack Risk Prediction" uses machine learning to estimate the likelihood of a heart attack based on user-provided data like physical attributes, symptoms, and medical history. This system enables remote screening, identifying high-risk individuals, and easing medical system burdens by providing early, data-driven health risk assessments.

boruta knn-algorithm matplotlib numpy pandas python scikit-learn

Last synced: 09 Apr 2026

https://github.com/harmanveer2546/recognizing-digits

I used the MNIST dataset to implement a handwritten digit recognition app. To implement this, I use a special type of deep neural network called a Convolutional Neural Network. In the end, I also built a graphical user interface (GUI) where you can directly draw a digit and have it recognized straight away.

dense flatten gui keras maxpooling mnist numpy pandas pillow python sequential

Last synced: 13 Apr 2026

https://github.com/iv4n-ga6l/Simple-Image-Segmentation

Simple implementation of image segmentation using the Watershed algorithm

imagesegmentation matplotlib numpy opencv python watershed-algorithm

Last synced: 28 Apr 2025

https://github.com/burhanali2211/marksanalyzer

This is a simple CLI-based tool that helps analyze student marks by calculating key statistics such as the highest and lowest scores.

analysis analyzer arrays marks numpy

Last synced: 05 May 2026

https://github.com/iamwatchdogs/retail-insights-from-superstore-data

A simple data analysis project for finding insights within the Superstore data set

conda-environment data-analytics jupyter-notebook matplotlib numpy pandas python

Last synced: 05 May 2026

https://github.com/hajaarh/malaria_hematie_cnn

Convolutional neural network algorithm

numpy pandas python resnet-50 sklearn-library tensorflow vgg16

Last synced: 13 Apr 2026

https://github.com/vc-nithesh944/pandas_numpy

Practice repository for NumPy & Pandas with scripts, notebooks, and notes for future revision

numpy pandas

Last synced: 05 May 2026

https://github.com/estaheri7/dotawinpredictor

This project aims to predict the outcome of Dota 2 matches based on the heroes picked by both the Radiant and Dire teams

data-science dota2 machine-learning numpy pandas python random-forest sklearn support-vector-machines

Last synced: 09 Mar 2026

https://github.com/eny2930/ai-ml-projects

A project demonstrating various libraries and tools for applying machine learning algorithms. This repository includes code samples, datasets, and detailed explanations.

iris-dataset keras linear-regression logistic-regression matplotlib numpy pandas pytorch sckiit-learn seaborn sentiment-analysis

Last synced: 13 Apr 2026

https://github.com/mrmalik2512/catsvsdog.github.io

A CNN model integrated with a Flask backend. The model is trained on image data of dogs and cats and, integrated with a website, predicts whether a given image shows a dog or a cat

deep-learning numpy python scikit-learn tensorflow

Last synced: 09 Apr 2026

https://github.com/waghraj1699/codeclauseinternship_customer_segmentation_tool

Implementing customer segmentation using ML, covering its advantages, preprocessing, K-means clustering, and visualization.

data-preprocessing kmeans-clustering machine-learning numpy pandas visualization

Last synced: 08 May 2026

https://github.com/lmtreser/intro-data-science

Training track in Data Science with Python and Pandas

data-science matplotlib numpy pandas python seaborn

Last synced: 13 Apr 2026

https://github.com/sunilvarma-l/zomato-data-analysis-using-python

A Python-based project to analyze and visualize Zomato restaurant data. This project explores various aspects of restaurant data, such as customer reviews, restaurant ratings, locations, and cuisines, with the aim of uncovering insights and trends in the food industry.

googlecollab matplotlib numpy pandas python seaborn

Last synced: 09 Apr 2026

https://github.com/shwetapardhi/assignment-05-multiple-linear-regression-2

Prepare a prediction model for the profit of the 50_startups data. Apply transformations to get better predictions of profit and make a table containing the R^2 value for each prepared model. R&D Spend -- research and development spend in the past few years; Administration -- spend on administration in the past few years; Marketing Spend -- spend on marketing in t

collinearity-diagnostics cooks-distance correlation-analysis eda heteroscedasticity homoscedasticity leverage-score multi-linear-regression numpy ols-regression p-value pair-plot python r-square-values regress-exog residual-analysis smf statsmodels vif

Last synced: 09 May 2026

https://github.com/shwetapardhi/assignment-03-q2--hypothesis-testing

ANOVA F-test statistics. A hospital wants to determine whether there is any difference in the average Turn Around Time (TAT) of reports from the laboratories on its preferred list. They collected a random sample and recorded the TAT for reports from 4 laboratories. TAT is defined as the time from sample collection to report dispatch. Analyze the data and determine wheth

annova ftest hypothesis-testing numpy pandas python scipy significance-testing stats

Last synced: 11 Apr 2026

https://github.com/rezapace/machine-learning-sound-detection

Welcome to the Machine Learning Detection Sound project! This project harnesses the power of machine learning to analyze car sounds, enabling the detection of vehicles based on their audio signatures.

colab-notebook googlecolab jupyter-notebook machine-learning numpy pathlib python python3 sound source tensorflow voice voice-recognition

Last synced: 09 Apr 2026

https://github.com/ashar97/advanced-machine-learning

Advanced Machine Learning MOOC Specialisation Assignments

keras numpy python3 tensorflow

Last synced: 09 Apr 2026

https://github.com/bropenguin847/python-learning

Learning Python for Scientific Programming. Intended for beginners with some knowledge of Python.

array-manipulations convolution jupyter matlab matplotlib-pyplot numpy pandas-python pep8 python python3 scientific-programming scipy spyder-ide vscode

Last synced: 13 Apr 2026

https://github.com/sowmi0304/predictive-modeling-titanic-survival-prediction

Machine learning project to predict passenger survival on the Titanic using classification algorithms, data preprocessing, and model evaluation techniques.

machine-learning matplotlib numpy pandas python seaborn

Last synced: 05 May 2026

https://github.com/jupfu/beck-view-movie

Assemble mp4 video from list of images

multithreading numpy opencv python3 threadpool tqdm

Last synced: 10 May 2026

https://github.com/saksham-21/data_analysis-on-hindustan-times-

Performed an NLP-based analysis of the headlines, publication dates, and categories of the news. Also created an interactive dashboard in Tableau

beautifulsoup matplotlib numpy pandas tableau

Last synced: 05 May 2026

https://github.com/klima7/numpynet

Convolutional Neural Network written from scratch using numpy with API similar to tensorflow.

cnn convolution network nn numpy scratch tensorflow

Last synced: 24 Sep 2025

https://github.com/sadia-khan13/data-preprocessing

Welcome to the Data Preprocessing repository! This repository is dedicated to showcasing comprehensive resources and implementations related to data preprocessing using Python and Jupyter Notebook.

artificial-intelligence data-analysis data-mining data-preprocessing data-science jupyter-notebook matplotlib numpy pandas python seaborn-python sklearn

Last synced: 11 Apr 2026

https://github.com/trivediayush/accident-and-drowsiness-detection

Our college minor project is a comprehensive solution aimed at enhancing road safety. The project comprises two main components: an Accident Detection Model and a Real-Time Driver Drowsiness System.

accident-detection accidents-analysis college-project cvzone dlib-face-detection dlib-face-recognition face-detection face-recognition imutils minor-project numpy opencv pandas python python3 yolov8

Last synced: 09 Apr 2026

https://github.com/ot-code/coca-cola-stock-prediction

This repo compares four predictive models—Linear Regression, ARIMA, XGBoost, and LSTM—to forecast Coca‑Cola FEMSA stock closing prices using Python and five years of historical data.

arima csv linear-regression lstm-neural-networks mae matplotlib mse numpy pandas python r2 scikit-learn seaborn tensorflow-keras xgboost

Last synced: 13 Apr 2026

https://github.com/aritrakar/ai-projects

🤖A centralized collection of links to my AI projects roughly divided by topic.

numpy pandas python pytorch tensorflow

Last synced: 13 Apr 2026

https://github.com/darkdk123/handwashing-discovery-analysis

A Guided Project in a Boot camp to Analyse the Original Data used in the Discovery of Viruses & Hand Washing By Dr. Ignaz Semmelweis in Vienna General Hospital in the 1840s.

data-analysis data-science data-visualization matplotlib-pyplot numpy pandas plotly-python python seaborn-plots

Last synced: 09 Apr 2026

https://github.com/yenk/human-genomic-meta-analysis

Meta-analysis of genomic studies to better understand whether telomeres in the human genome are affected by age, using wild animal data.

linear-regression machine-learning matplotlib numpy scipy

Last synced: 05 May 2026

https://github.com/valmir-unicap/deteccao-de-mascaras-faciais

Experiment based on the paper: A Face Mask Detection Algorithm Based on YOLO

kaggle-dataset mobilenetv2 numpy opencv python torch yolo

Last synced: 12 Apr 2026

https://github.com/youssefali11997/free-style-coding

Scripts written as practice, to try new technology, or just to play with the code!

bash javascript numpy python3 ruby-on-rails scripts vuejs

Last synced: 12 Apr 2026