An open API service indexing awesome lists of open source software.

NumPy

NumPy is an open source library for the Python programming language that adds support for large, multidimensional arrays and matrices, along with a large collection of high-level mathematical functions that operate on these arrays.
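As a minimal sketch of what that description means in practice, the snippet below builds a small two-dimensional array and applies an element-wise function, an axis-wise reduction, and a matrix product. The variable names are illustrative only and are not taken from any project listed on this page.

```python
import numpy as np

# A 2-D array (matrix) with 2 rows and 3 columns.
a = np.array([[1.0, 2.0, 3.0],
              [4.0, 5.0, 6.0]])

# Element-wise math runs over the whole array without explicit loops.
roots = np.sqrt(a)

# Reductions can run along a chosen axis: axis=0 collapses the rows,
# leaving one mean per column.
column_means = a.mean(axis=0)

# Matrix product of a (2x3) with its transpose (3x2) gives a 2x2 matrix.
gram = a @ a.T

print(a.shape)
print(column_means)
print(gram)
```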

https://github.com/lmizner/codecademy_life_expectancy_gdp

Utilize understanding of data visualization to analyze and plot data about GDP and life expectancy

barplot jupyter-notebook lineplot matplotlib-pyplot numpy pandas python scatterplot seaborn

Last synced: 12 Apr 2026

https://github.com/thariniselvakumar/kidney-disease-prediction

This project predicts kidney disease using machine learning algorithms

machine-learning matplotlib numpy pandas scikit-learn seaborn

Last synced: 12 Apr 2026

https://github.com/ledsouza/python-powerbi

Develop a dashboard for financial market analysis using Python and Power BI

etl matplotlib numpy pandas powerbi python seaborn

Last synced: 12 Apr 2026

https://github.com/mnitin-reddy/summer-olympics-data-analysis-web-app

An interactive web app for exploring trends in Olympic Games history, analyzing overall medal tallies, country-wise performance, and athlete demographics. Built with Python and Streamlit, this app offers insights through visualizations and data-driven statistics.

dataanalytics matplotlib numpy pandas python seaborn streamlit

Last synced: 12 Apr 2026

https://github.com/jpcano/boston_housing

Predicting Boston Housing Prices using supervised Machine Learning algorithms

cross-validation machine-learning numpy pandas python regression-models scikit-learn

Last synced: 12 Apr 2026

https://github.com/themihirmathur/soiligator

Soiligator is an advanced machine learning project designed to optimize irrigation management by predicting whether irrigation is necessary based on environmental and soil-related data.

auc-score logistic-regression machine-learning matplotlib numpy pandas python random-forest-classifier roc-curve scikit-learn seaborn standardscaler support-vector-machine

Last synced: 12 Apr 2026

https://github.com/armahdavi/mlops

Productionizing ML Models using a variety of tools including FastAPI, Flask, Docker, AWS, GCP, TensorFlow Extended (TFX), and TF.js.

apache-beam docker fastapi logistic-regression mlops mlops-workflow numpy random-forest sklearn tensorflow-extended unicorn xgboost

Last synced: 12 Apr 2026

https://github.com/filip-kustura/python-covid-19-behaviors-analysis

Using Jupyter Notebook, this university project analyzes attitudes and behaviors related to the COVID-19 pandemic using a two-year survey from Imperial College London and YouGov research company. Utilizing Pandas, NumPy and Matplotlib, the data analysis focuses on three countries, exploring trends and insights throughout the pandemic.

covid-19 data-analysis data-visualization jupyter-notebook matplotlib numpy pandas python university-project

Last synced: 12 Apr 2026

https://github.com/nadahamdy217/movies-data-etl-using-python-gcp

Developed a comprehensive ETL pipeline for movie data using Python, Docker, and a GCP Pub/Sub emulator. Successfully processed and published the data in a local Docker environment, showcasing advanced data engineering skills.

analytics data data-engineering data-ingestion data-preparation data-preprocessing data-processing data-project docker etl etl-pipeline gcp matplotlib matplotlib-pyplot numpy pandas pubsub python scipy seaborn

Last synced: 06 Jan 2026

https://github.com/hansalemaos/group_by_continuous_sequence

Groups lists/NumPy arrays by continuous sequence

group list numpy python

Last synced: 19 Feb 2026
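One way to group values by continuous sequence with NumPy is sketched below. This is not the API of the package listed above; the function name `split_consecutive` is hypothetical, and the sketch assumes a one-dimensional sequence of integers in which "continuous" means each value is exactly one greater than the previous one.

```python
import numpy as np

def split_consecutive(values):
    """Split a 1-D sequence of integers into runs of consecutive values.

    Hypothetical helper for illustration; assumes a step of exactly 1
    between neighbours counts as continuous.
    """
    arr = np.asarray(values)
    if arr.size == 0:
        return []
    # A new run starts wherever the step between neighbours is not 1.
    breaks = np.flatnonzero(np.diff(arr) != 1) + 1
    # np.split cuts the array at those positions.
    return [run.tolist() for run in np.split(arr, breaks)]

groups = split_consecutive([1, 2, 3, 7, 8, 10, 15, 16, 17])
print(groups)  # [[1, 2, 3], [7, 8], [10], [15, 16, 17]]
```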

https://github.com/lorenzorottigni/ml-random

Machine Learning course of Piero Savastano 7: matplotlib, accuracy_score, precision_score, recall_score, f1_score, classification_report

machine-learning matplotlib metrics numpy

Last synced: 04 May 2026

https://github.com/mpoojithavigneswari/bangalore-house-price-prediction

This project involves creating a website that predicts Bangalore house prices with 94.65% accuracy using a machine learning algorithm.

data-analysis data-science flask-server machine-learning matplotlib numpy pandas python scikit-learn seaborn

Last synced: 12 Apr 2026

https://github.com/alisonmitchell/countries-of-the-world

Exploratory Data Analysis of US Government data from the World Factbook to investigate GDP ($ per capita), Literacy (%) and Phones (per 1000).

data-science machine-learning matplotlib numpy pandas python seaborn

Last synced: 12 Apr 2026

https://github.com/aleksandarbuk/machine-learning

The Machine Learning Library repository provides a collection of scripts and tools leveraging Scikit-Learn, Pandas, and NumPy for various machine learning tasks and data analysis.

matplotlib numpy python scikit-learn tensorflow

Last synced: 16 Apr 2026

https://github.com/22p31a0512/sentimental-analysis

Build a model to classify text as positive, negative, or neutral. Apply NLP techniques for preprocessing and machine learning for classification. Aim for accurate sentiment prediction on various text formats.

feature-engineering machine-learning natural-language-processing numpy pandas python supervised-learning text-classification text-processing tokenization

Last synced: 12 Apr 2026

https://github.com/ewakamoto/03.-python-projects

This is my collection of Python Projects

jupyter jupyter-notebook matplotlib numpy pandas python seaborn

Last synced: 12 Apr 2026

https://github.com/armahdavi/data_pipeline_analytics_statistics_ml_pm_psd_residential_qff

Data pipelines, processing code, statistical modelling, descriptive statistics, plot visualizations, and machine learning from Mahdavi & Siegel (2021) (Indoor Air). Project milestone: 2017 - 2020. Full-length article: https://onlinelibrary.wiley.com/doi/abs/10.1111/ina.12782

data-science data-visualization dust hvac indoor-air-quality jupyter-notebook machine-learning matplotlib-pyplot numpy pandas python scikit-learn scipy-stats spyder spyder-python-ide statistics

Last synced: 11 Apr 2026

https://github.com/vasulab/knightshock

Shock tube experiment planning and data analysis package.

cantera data-analysis matplotlib numpy shock-tube

Last synced: 18 Jul 2025

https://github.com/hariprasath-v/av-job-a-thon-november-2022

Build a machine learning/deep learning approach to forecast the total energy demand on an hourly basis for the next 3 years based on past trends.

exploratory-data-analysis kaggle lightgbm-regressor matplotlib numpy pandas python rmse-score seaborn sklearn statsmodels timeseries-forecasting

Last synced: 12 Apr 2026

https://github.com/akashkg03/feynn-labs-internship

This repository contains the projects I did as a Data Science intern with Feynn Labs.

data-science machine-learning matplotlib numpy pandas python scipy seaborn

Last synced: 07 Jan 2026

https://github.com/prarthana-singh/potato-disease-classification-using-deep-learning

🌱 Potato Disease Classification using Deep Learning – A deep learning-based model to detect and classify potato plant diseases using image processing and CNNs. Built with TensorFlow/Keras & Python.

agriculture cnn deep-learning image-classification keras machine-learning neural-network numpy pandas plant-disease-detection python3 streaming tensorflow

Last synced: 12 Apr 2026

https://github.com/Sadegh-Khedry/Pneumonia-Detection-Using-CNN

Classifying chest X-ray images into two categories: Pneumonia or Normal using a Convolutional Neural Network (CNN).

cnn convolutional-neural-networks cv2 deep-learning harris-corners joblib jupyter-notebook keras kmeans-clustering matplotlib numpy orb pandas python seaborn shutil sifit tensorflow

Last synced: 12 Apr 2026

https://github.com/yahiazakaria445/image-classification-using-vgg-16

Computer vision model for 10-class classification

matplotlib numpy opencv pandas sklearn tensorflow

Last synced: 12 Apr 2026

https://github.com/otuemre/housepricingml

A machine learning project predicting house prices using regression models. Covers data preprocessing, feature engineering, and model comparison to achieve accurate results. Developed for a Kaggle competition, focusing on effective ML workflows and model interpretability.

eda encoding evaluation-metrics kaggle-competition lightgbm-regressor machine-learning matplotlib-pyplot neural-networks numpy pandas preprocessing python ridge-regression scikit-learn seaborn tensorflow xgboost-regression

Last synced: 13 Apr 2026

https://github.com/reckonsys/datascience-docker

A docker image with numpy, scipy, nltk and pandas pre-installed with pip

docker docker-image dockerfile nltk numpy pandas scipy

Last synced: 10 Mar 2025

https://github.com/manikantasanjay/data_analysis_using_python_libraries_series

This Series contains Data Analysis projects performed on different Kaggle datasets and providing valuable insights into the data by making use of Python libraries.

exploratory-data-analysis kaggle-datasets machine-learning matplotlib numpy opendatasets pandas python visualization

Last synced: 12 Apr 2026

https://github.com/the-developer-306/fake-review-detector

This project is a machine learning-based review classification system that predicts whether a product review is GENUINE or FAKE. It preprocesses review text, analyzes sentiment, and uses numerical features like ratings and helpfulness to make predictions. The model is deployed via a Flask web application for user interaction.

classification flask logistic-regression machine-learning numpy pandas python renderdeploy scikit-learn sentiment-analysis

Last synced: 12 Apr 2026

https://github.com/andrewzhoumq123/sci_graph_api

Generates different types of detailed plots for scientific data

fastapi h5py json matplotlib numpy pandas scipy

Last synced: 12 Apr 2026

https://github.com/ahmedabdalkreem/animals_classification

This project classifies faces of cats, dogs, and wild animals using multiple models, including a CNN and pretrained models such as ResNet and MobileNet, and shows each model's result on the image as a heatmap that highlights the region the model relied on for its decision.

classification cnn heatmap matplotlib mobilenet numpy opencv pandas pretrained-models python restnet

Last synced: 12 Apr 2026

https://github.com/m-rishab/job-recruitment-prediction-and-hr-dashboard-using-plotly

This project's features make it ideal for dynamic HR dashboards, offering insights into candidate profiles and recruitment processes.

correlation-analysis flask kmeans-clustering numpy pandas plotly python scikit-learn seaborn standardscaler

Last synced: 12 Apr 2026

https://github.com/nakshjainsonigara/football-eda

This EDA offers a comprehensive exploration of football analytics, leveraging Python libraries to analyze players, club games, and various other aspects of the sport. Through statistical analysis and visualization, we uncover insights into player performance, club strategies, and league dynamics providing valuable insights for football enthusiasts.

dash exploratory-data-analysis matplotlib numpy pandas plotly python scipy seaborn statistics

Last synced: 07 Jan 2026

https://github.com/gregoritsch3/exercise_pandas_1

A Pandas exercise demonstrating the loading, cleaning and reorganization of data in .xlsx or .csv files, descriptive statistics, data visualization, statistical approximation of data with the normal distribution, etc. Libraries include Pandas, NumPy, SciPy, SymPy, Matplotlib.

data-cleaning data-visualization descriptive-statistics matplotlib numpy pandas scipy sympy

Last synced: 12 Apr 2026

https://github.com/abideen-olawuwo/logistic-regression-house-prediction

Using a logistic regression model to predict house prices

logistic-regression matplotlib numpy pandas python seaborn sklearn

Last synced: 12 Apr 2026

https://github.com/raghavendra317/brain-tumor-detection

A deep learning-based project for classifying brain MRI images into Glioma, Meningioma, No Tumor, and Pituitary Tumor using Convolutional Neural Networks (CNNs). This project aims to assist in early tumor detection, improving diagnostic accuracy and healthcare outcomes.

cnn-classification keras numpy opencv scikit-learn tensorflow

Last synced: 12 Apr 2026

https://github.com/sandipkushwaha20/brain_tumour_detector

This project utilizes deep learning to detect and classify brain tumors from MRI images. The primary objective is to determine whether a tumor is present and, if so, classify its type with high accuracy.

deeplearning keras numpy pandas python tensorflow

Last synced: 12 Apr 2026

https://github.com/d1ogocs/afinador-de-instrumentos

Development of a tuner that automatically adjusts to the musical instrument chosen by the user

butterworth-filter instrument-tuner matplotlib numpy pyaudio python scipy threading tkinter

Last synced: 07 Jan 2026

https://github.com/konynour/facedectection

This project demonstrates real-time face detection using OpenCV and a pre-trained SSD (Single Shot MultiBox Detector) model. The system captures video from a camera, processes the frames, detects faces, and displays the results with bounding boxes and confidence scores. This application showcases the use of deep learning models for object detection

cv2 jupyter-notebook lambda-functions matplotlib numpy opencv python3

Last synced: 12 Apr 2026

https://github.com/wisdom-osborn/data-analytics-course-online-

🔍 Data Analytics with Python — Hands-on Course Materials Jupyter notebooks, projects, and datasets based on the freeCodeCamp Data Analysis with Python certification. Learn NumPy, Pandas, data cleaning, and visualization through real-world examples

data data-analysis data-science data-visualization freecodecamp numpy pandas pandas-dataframe project python

Last synced: 19 Apr 2026

https://github.com/aqueeqazam/numpy-for-mathematical-computing

You're at the perfect spot if you're interested in developing your career in data science or machine learning, or if you just enjoy having fun with mathematical operations.

mathematics numpy numpy-arrays

Last synced: 19 Apr 2026

https://github.com/supriya811106/whatsapp-chat-analyzer-app

Analyze WhatsApp chats with Python, Streamlit, and data visualization. Explore messaging patterns, content trends, and emoji usage to uncover insights from your conversations.

analyzer-web-app chat-analytics chat-analyzer data-preprocessing data-visualization emojis machine-learning matplotlib natural-language-processing nltk numpy pandas plotly python3 seaborn sentiment-analysis streamlit-webapp text-analysis user-engagement

Last synced: 30 Dec 2025

https://github.com/supriya811106/twitter-sentiment-analysis

Analyzing the mood of tweets! We sort tweets on popular topics into positive, negative, or neutral categories to gauge public opinion. See what Twitter really thinks!

bernoulli-naive-bayes jupyter-notebook matplotlib nlp-machine-learning nltk numpy pandas python scikit-learn seaborn sentiment-analysis text-classification tfidf-vectorizer wordcloud

Last synced: 05 Apr 2026

https://github.com/shreyazh/respire

To deal with Respiratory Challenges

machine-learning matplotlib numpy pandas pillow python scikit-learn seaborn

Last synced: 12 Apr 2026

https://github.com/trissim/polystore

Framework-agnostic multi-backend storage abstraction for ML and scientific computing

backend data io jax ml multi-framework numpy pytorch scientific-computing storage tensorflow zarr

Last synced: 12 Apr 2026

https://github.com/marcelovilla/fire-congress-2019-tdf-talk

This repository contains the code used for the project presented in the "Fire activity on the Colombian Tropical Dry Forest: an environmental and social perspective" talk during the 8th International Fire Ecology and Management Congress.

earth-observation ecology fire-activity gdal latex matplotlib modis numpy pandas python remote-sensing requests seaborn tikz trmm xarray

Last synced: 12 Apr 2026

https://github.com/riya2624/heart-disease-diagnostic-analysis

Analyzed heart disease diagnostic data through an ETL process, ensuring data accuracy and readiness for analysis using Python (Pandas, NumPy, Matplotlib, Seaborn). Conducted comprehensive exploratory data analysis (EDA) to uncover trends related to heart disease rates by gender and age. Developed interactive dashboards with Power BI and Tableau

dashboard dax-query eda machine-learning matplotlib numpy pandas python seaborn

Last synced: 12 Apr 2026

https://github.com/azaz9026/machine_learning_algorithms

Machine learning algorithms are computational models that allow computers to understand patterns and forecast or make judgments based on data without the need for explicit programming. These algorithms form the foundation of modern artificial intelligence and are used in a wide range of applications, including image and speech recognition.

machine-learning machine-learning-algorithms numpy pandas python3 seaborn sklearn-library

Last synced: 12 Apr 2026

https://github.com/ankitmishralive/machinelearning

Continuously deep diving in understanding & advancing my expertise in Machine Learning through ongoing education and hands on experience with practical learning.

artificial-intelligence data-analysis data-cleaning data-gathering machine-learning machinel-learning-algorithms matplotlib numpy pandas python seaborn

Last synced: 22 Mar 2025

https://github.com/ben854719/wasde-revenue-in-2024

The project will use AI to analyze the revenue from the dataset of the World Agricultural Supply and Demands Estimates.

gemini langchain large-language-models matplotlib numpy pandas-python

Last synced: 12 Apr 2026

https://github.com/purcellcjp/credit-risk-classification

This project utilized Python and scikit-learn libraries to train and evaluate a Machine Learning model based on loan risk.

machine-learning numpy pandas-dataframe python scikit-learn

Last synced: 12 Apr 2026

https://github.com/anupreet02/deep-learning-challenge

The objective of this analysis is to develop a deep learning model capable of predicting whether a charity funded by Alphabet Soup is likely to be successful. The model is built using the charity dataset, which contains various features related to each charity, and is used to classify charities as successful or not based on these features.

numpy pandas sckit-learn tensorflow

Last synced: 12 Apr 2026

https://github.com/renukadhule/crop_production_python_analysis

This project analyzes crop production data in India from 1997 to 2015, identifying trends, seasonality, and top-performing states and crops. The insights aim to guide improvements in agricultural productivity and policy decisions.

matplotlib numpy pandas python

Last synced: 12 Apr 2026

https://github.com/samkazan/structural_discovery_of_macromolecules_data_analysis

This research project uses machine learning techniques and neural networks to uncover key factors that contribute to successful protein structure discovery using Python and R

classification clustering ipython-notebook jupyter-notebook keras-neural-networks keras-tensorflow machine-learning neural-network numpy python r rmarkdown scikit-learn scipy tensorflow

Last synced: 02 Feb 2026

https://github.com/dan-niles/neural-networks-from-scratch

A neural network implemented using only NumPy and Pandas for CS3631: Deep Neural Networks

neural-networks numpy pandas

Last synced: 05 May 2026

https://github.com/dahsie/machine_learning_from_scratch

This project aims to implement some basic machine learning techniques (e.g. MinMaxScaler, StandardScaler, TF-IDF, PCA, Logistic Regression, LDA, KNN, Naive Bayes Classifier) using only Python, NumPy and Pandas. This will enable me to hone my data science skills.

classification clustering data data-processing datascience machienlearning nlp nltk numpy pandas python regression

Last synced: 04 May 2026

https://github.com/devbigboy/php-crud-project

PHP-CRUD-Project

jupyter numpy pandas

Last synced: 20 Apr 2026

https://github.com/nilusche/feed_forward_network_py

Implementation of a neural network in python which can predict handwritten numbers from the MNIST dataset.

colab-notebook feed-forward-neural-networks neural-network numpy python

Last synced: 01 May 2026

https://github.com/asuquoaa/cdc_2017_immunization_data_analysis

This project analyzes data from the CDC's 2017 National Immunization Survey Public Use File (NISPUF17) to answer research questions about immunization, child health, and vaccine-related behaviors.

numpy

Last synced: 12 Apr 2026

https://github.com/testetson22/abelian_sandpiles

Simple App that implements, builds and tests an approximation of the Abelian sandpile model

github-actions numpy pytest

Last synced: 12 Apr 2026

https://github.com/manoj-kumar-c/data_analyst_projects

Live Data Analyst Projects.

excel numpy pandas powerbi

Last synced: 07 Feb 2026

https://github.com/chandkund/iris-classification

Iris Classification project using machine learning to classify iris flowers into Setosa, Versicolour, and Virginica species. Includes data exploration, model training with SVM, Logistic Regression, and Decision Tree, and performance evaluation. Contributions welcome! Python, Pandas, Scikit-learn, Seaborn, Matplotlib.

machine-learning matplotlib-pyplot numpy pandas python sklearn

Last synced: 21 Jan 2026

https://github.com/yosrak5/predictive_maintenance

End-to-end predictive analytics to predict hardware maintenance before it occurs, using data preprocessing and machine learning classification modeling (XGBoost, Random Forest)

data-factory data-perspective machine-learning-workbench matplotlib numpy pandas python random-forest seaborn xgboost

Last synced: 25 Feb 2026

https://github.com/shwetapardhi/assignment-1-q11--basic-statistics-level-1

Q11) Suppose we want to estimate the average weight of an adult male in Mexico. We draw a random sample of 2,000 men from a population of 3,000,000 men and weigh them. We find that the average person in our sample weighs 200 pounds, and the standard deviation of the sample is 30 pounds. Calculate the 94%, 98%, and 96% confidence intervals.

confidence-interval normalization numpy pandas python scipy stats

Last synced: 13 Apr 2026
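The question in the entry above can be worked through with a short sketch. It is not taken from the listed repository. It assumes the normal approximation is acceptable for a sample of 2,000 (so a z quantile is used rather than a t quantile) and ignores the finite-population correction, since the sample is a tiny fraction of the 3,000,000 men. The function name `mean_confidence_interval` is hypothetical.

```python
import math
from statistics import NormalDist

def mean_confidence_interval(sample_mean, sample_sd, n, confidence):
    """Two-sided normal-approximation interval for a population mean.

    Hypothetical helper for illustration only.
    """
    # Standard error of the sample mean.
    se = sample_sd / math.sqrt(n)
    # Quantile that leaves (1 - confidence) / 2 in each tail.
    z = NormalDist().inv_cdf(0.5 + confidence / 2)
    margin = z * se
    return sample_mean - margin, sample_mean + margin

# Values from the question: mean 200 lb, standard deviation 30 lb, n = 2000.
intervals = {
    level: mean_confidence_interval(200, 30, 2000, level)
    for level in (0.94, 0.96, 0.98)
}
for level, (low, high) in intervals.items():
    print(f"{level:.0%} CI: ({low:.2f}, {high:.2f}) pounds")
```

Because the standard error shrinks with the square root of the sample size, all three intervals stay within about two pounds of the sample mean, and they widen as the confidence level rises.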

https://github.com/daksh777/expressando

Expressando is a rock, paper, scissors game made using Python (TensorFlow, Keras and OpenCV)

keras keras-tensorflow machine-learning numpy opencv opencv-python tensorflow

Last synced: 13 Apr 2026

https://github.com/pedasoft-consult/house-price-prediction

This project is a machine learning-powered web application for predicting house prices based on various features such as area, bedrooms, bathrooms, and more. It uses Flask for the backend and HTML/CSS/JavaScript for the frontend.

flask joblib matplotlib numpy pandas r2 random-forest regression seaborn sklearn

Last synced: 13 Apr 2026

https://github.com/gustavo-venceslau/housing-price

A Machine Learning project to create a model to predict housing prices. The project is based on the example in chapter 2 of the book Hands-On Machine Learning with Scikit-Learn, Keras, and TensorFlow by Aurélien Géron.

conda machine-learning matplotlib numpy pandas python sklearn

Last synced: 13 Apr 2026

https://github.com/bachboy0/montecarlo

Python-based Monte Carlo simulations for statistical analysis and modeling.

monte-carlo-methods numpy python

Last synced: 21 Apr 2026

https://github.com/barkerbg001/framefusion

Python script that creates a video from a set of images and adds a random audio clip to each video

glob moviepy numpy opencv pygt5 python python-3 random video video-editor

Last synced: 16 Apr 2026

https://github.com/ayhem18/tinyad

A simple Python library to better understand automatic differentiation

automatic-differentiation mathematical-programming numpy python3

Last synced: 29 Apr 2026

https://github.com/jim-by/tweet-sentiment-analysis

Sentiment analysis of tweets using TextBlob for labeling and RandomForest for classification.

nltk nltk-tokenizer numpy pandas python random-forest-classifier sklearn textblob-sentiment-analysis

Last synced: 13 Apr 2026

https://github.com/ellennewerkla/apriori-analise_cesta_de_mercado

Análise de Cesta de Mercado (Market Basket Analysis) utilizando o algoritmo Apriori para identificar regras de associação entre produtos em um conjunto de transações de supermercado.

apriori-algorithm jupyternotebook marketbasketanalysis mathplotlib mba numpy pandas python3

Last synced: 13 Apr 2026

https://github.com/karanyeole/movie-rating-analysis

Analyzing movie ratings from various sources to gain insights into audience preferences, trends, and patterns in movie ratings. The project utilizes data analysis and visualization techniques to explore and interpret movie rating data, providing valuable information for movie enthusiasts, filmmakers, and industry professionals.

matplotlib numpy pandas python seaborn

Last synced: 13 Apr 2026

https://github.com/shahiakhilesh1304/fitbitcasestudy

This is a case study based on data retrieved from a Fitbit band, and we are making predictions about human behavior based on their mood.

case-study data-visualization fitbit jupyter-notebook numpy python3

Last synced: 13 Apr 2026

https://github.com/preethiphilip/prosper-loan-data-analysis

This project explores the Prosper loan dataset to uncover trends, borrower behaviors, and factors influencing loan performance. Using Python and data visualization techniques, the analysis examines key variables such as credit scores, loan status, income ranges, and debt-to-income ratios to gain insights into credit risk and lending patterns.

jupyter-notebook matplotlib numpy pandas python3 seaborn

Last synced: 13 Apr 2026

https://github.com/miss-mhv/udacityproject_dataviz

Data analysis of a bicycle rental system. The goal is to understand how trips are made according to age, time of year, day of the week, time of day

jupyter-notebook matplotlib numpy pandas python3 seaborn-plots

Last synced: 13 Apr 2026

https://github.com/sunilsm7/numpy-starter

IPython Notebook(s) demonstrating NumPy functionality.

numpy python

Last synced: 13 Apr 2026