An open API service indexing awesome lists of open source software.

NumPy

NumPy is an open source library for the Python programming language that adds support for large, multidimensional arrays and matrices, along with a large collection of high-level mathematical functions that operate on these arrays.
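
As a rough illustration of what that description means in practice, the sketch below builds a small two-dimensional array and applies a few array-level operations to it. The values are made up for the example and do not come from any repository listed here.

```python
import numpy as np

# A 2-D array (matrix) built from nested Python lists; values are illustrative.
a = np.array([[1.0, 2.0], [3.0, 4.0]])

# Element-wise arithmetic applies to every entry without an explicit loop.
doubled = a * 2

# High-level mathematical functions operate on whole arrays.
col_means = a.mean(axis=0)  # mean of each column
product = a @ a             # matrix product

print(doubled)
print(col_means)
print(product)
```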

https://github.com/AsuquoAA/CDC_2017_Immunization_Data_Analysis

This project analyzes data from the CDC's 2017 National Immunization Survey Public Use File (NISPUF17) to answer research questions about immunization, child health, and vaccine-related behaviors.

numpy

Last synced: 21 Jul 2025

https://github.com/2kabhishek/ml-primer

Basic Machine Learning & Data Science in Python 📈🐍

jupyter-notebook machine-learning matplotlib numpy pandas python

Last synced: 12 Apr 2026

https://github.com/rubada/machine-learning-with-ruba-dabbas

Advance your skills and start your career here, by taking the online courses on Intuidemy.

course learning machine machine-learning matplotlib matplotlib-pyplot models numpy pandas python scikit-learn

Last synced: 12 Apr 2026

https://github.com/shreyazh/respire

To deal with Respiratory Challenges

machine-learning matplotlib numpy pandas pillow python scikit-learn seaborn

Last synced: 12 Apr 2026

https://github.com/sudarshanc00/mnist-digit-classification

This project uses a machine learning model to classify handwritten digits (0-9) from the MNIST dataset, a popular collection of grayscale images for image classification benchmarking. It includes a Jupyter Notebook to train a neural network, enabling accurate recognition and classification of digits.

jupyter-notebook matplotlib numpy python tensorflow

Last synced: 12 Apr 2026

https://github.com/trissim/polystore

Framework-agnostic multi-backend storage abstraction for ML and scientific computing

backend data io jax ml multi-framework numpy pytorch scientific-computing storage tensorflow zarr

Last synced: 12 Apr 2026

https://github.com/azaz9026/machine_learning_algorithms

Machine learning algorithms are computational models that allow computers to understand patterns and forecast or make judgments based on data without the need for explicit programming. These algorithms form the foundation of modern artificial intelligence and are used in a wide range of applications, including image and speech recognition.

machine-learning machine-learning-algorithms numpy pandas python3 seaborn sklearn-library

Last synced: 12 Apr 2026

https://github.com/ankitmishralive/machinelearning

Continuously deepening my understanding of and expertise in Machine Learning through ongoing education and hands-on, practical experience.

artificial-intelligence data-analysis data-cleaning data-gathering machine-learning machinel-learning-algorithms matplotlib numpy pandas python seaborn

Last synced: 22 Mar 2025

https://github.com/liviuxyz-ctrl/financialportofoliosimulator

FinancialPortfolio is your financial planning ally, helping you manage portfolios, calculate future values, and visualize performance, all with user-friendly ease 🚀

matplotlib numpy python simulation

Last synced: 12 Apr 2026

https://github.com/ishtufinquant/portfolio-optimizer

Optimizes stock portfolios using simulation and real-world constraints (SLSQP).

backtesting finance numpy pandas portfolio-optimization python quant scipy yfinance

Last synced: 12 Apr 2026

https://github.com/ben854719/wasde-revenue-in-2024

The project will use AI to analyze revenue using the World Agricultural Supply and Demand Estimates (WASDE) dataset.

gemini langchain large-language-models matplotlib numpy pandas-python

Last synced: 12 Apr 2026

https://github.com/purcellcjp/credit-risk-classification

This project used Python and scikit-learn to train and evaluate a machine learning model based on loan risk.

machine-learning numpy pandas-dataframe python scikit-learn

Last synced: 12 Apr 2026

https://github.com/anupreet02/deep-learning-challenge

The objective of this analysis is to develop a deep learning model capable of predicting whether a charity funded by Alphabet Soup is likely to be successful. The model is built using the charity dataset, which contains various features related to each charity, and is used to classify charities as successful or not based on these features.

numpy pandas sckit-learn tensorflow

Last synced: 12 Apr 2026

https://github.com/samkazan/structural_discovery_of_macromolecules_data_analysis

This research project uses machine learning techniques and neural networks, in Python and R, to uncover key factors that contribute to successful protein structure discovery.

classification clustering ipython-notebook jupyter-notebook keras-neural-networks keras-tensorflow machine-learning neural-network numpy python r rmarkdown scikit-learn scipy tensorflow

Last synced: 02 Feb 2026

https://github.com/dahsie/machine_learning_from_scratch

This project aims to implement some basic machine learning techniques (e.g. MinMaxScaler, StandardScaler, TF-IDF, PCA, Logistic Regression, LDA, KNN, Naive Bayes Classifier) using only Python, NumPy, and pandas. This will enable me to hone my data science skills.

classification clustering data data-processing datascience machienlearning nlp nltk numpy pandas python regression

Last synced: 04 May 2026

https://github.com/nilusche/feed_forward_network_py

Implementation of a neural network in python which can predict handwritten numbers from the MNIST dataset.

colab-notebook feed-forward-neural-networks neural-network numpy python

Last synced: 01 May 2026

https://github.com/manoj-kumar-c/data_analyst_projects

Live Data Analyst Projects.

excel numpy pandas powerbi

Last synced: 07 Feb 2026

https://github.com/nambukeerthi/youtube_dataproject

Build a Streamlit App – A user-friendly interface. Use Google API – Fetch YouTube channel data. Store in SQL Database – Save channel details for easy access. Search & Join Tables – Allow users to query and view data

api numpy pandas python sql streamlit

Last synced: 09 Apr 2026

https://github.com/yosrak5/predictive_maintenance

End-to-end predictive analytics to predict hardware maintenance needs before failures occur, using data preprocessing and machine learning classification models (XGBoost, Random Forest).

data-factory data-perspective machine-learning-workbench matplotlib numpy pandas python random-forest seaborn xgboost

Last synced: 25 Feb 2026

https://github.com/shwetapardhi/assignment-1-q11--basic-statistics-level-1

Q11) Suppose we want to estimate the average weight of an adult male in Mexico. We draw a random sample of 2,000 men from a population of 3,000,000 men and weigh them. We find that the average person in our sample weighs 200 pounds, and the standard deviation of the sample is 30 pounds. Calculate the 94%, 96%, and 98% confidence intervals.

confidence-interval normalization numpy pandas python scipy stats

Last synced: 13 Apr 2026
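
The question in this entry can be worked through with a short sketch. It assumes a two-sided z-interval (normal approximation), which is reasonable for a sample of 2,000 but is not necessarily the method the repository uses; the function name `mean_confidence_interval` is hypothetical. The population of 3,000,000 is large enough relative to the sample that no finite-population correction is applied here.

```python
from math import sqrt
from statistics import NormalDist

def mean_confidence_interval(mean, sd, n, level):
    """Two-sided z-interval for a population mean (normal approximation)."""
    # Critical value z such that the central `level` probability lies within +/- z.
    z = NormalDist().inv_cdf(0.5 + level / 2)
    margin = z * sd / sqrt(n)  # z times the standard error of the mean
    return mean - margin, mean + margin

# Figures from the question: sample mean 200 lb, sample SD 30 lb, n = 2,000.
for level in (0.94, 0.96, 0.98):
    low, high = mean_confidence_interval(200, 30, 2000, level)
    print(f"{level:.0%} CI: ({low:.2f}, {high:.2f})")
```

A higher confidence level gives a larger critical value and therefore a wider interval around the same sample mean.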

https://github.com/murugavl/loan_approval_prediction

This project is a Loan Approval Prediction System that uses Machine Learning to determine whether a loan application should be approved or rejected based on various factors. It is deployed using Streamlit for an interactive user experience.

jupyter-notebook machine-learning numpy pandas python random-forest-classifier scikit-learn stremlit

Last synced: 13 Apr 2026

https://github.com/pedasoft-consult/house-price-prediction

This project is a machine learning-powered web application for predicting house prices based on various features such as area, bedrooms, bathrooms, and more. It uses Flask for the backend and HTML/CSS/JavaScript for the frontend.

flask joblib matplotlib numpy pandas r2 random-forest regression seaborn sklearn

Last synced: 13 Apr 2026

https://github.com/gustavo-venceslau/housing-price

A machine learning project to create a model that predicts housing prices. The project is based on the example in chapter 2 of the book Hands-On Machine Learning with Scikit-Learn, Keras, and TensorFlow by Aurélien Géron.

conda machine-learning matplotlib numpy pandas python sklearn

Last synced: 13 Apr 2026

https://github.com/bachboy0/montecarlo

Python-based Monte Carlo simulations for statistical analysis and modeling.

monte-carlo-methods numpy python

Last synced: 21 Apr 2026

https://github.com/ayhem18/tinyad

A simple Python library for better understanding automatic differentiation

automatic-differentiation mathematical-programming numpy python3

Last synced: 29 Apr 2026

https://github.com/hansalemaos/cinpy

C/C++ in Python for Dummies

c cpp ctypes numpy python

Last synced: 13 Apr 2026

https://github.com/companyakis/numpy

Python Numpy

data-science numpy python

Last synced: 04 Sep 2025

https://github.com/jim-by/tweet-sentiment-analysis

Sentiment analysis of tweets using TextBlob for labeling and RandomForest for classification.

nltk nltk-tokenizer numpy pandas python random-forest-classifier sklearn textblob-sentiment-analysis

Last synced: 13 Apr 2026

https://github.com/ellennewerkla/apriori-analise_cesta_de_mercado

Market Basket Analysis using the Apriori algorithm to identify association rules between products in a set of supermarket transactions.

apriori-algorithm jupyternotebook marketbasketanalysis mathplotlib mba numpy pandas python3

Last synced: 13 Apr 2026

https://github.com/williamfridh/umu_5tf078_paper_keyboard_tracker

Project exploring how a CNN alongside computer vision can be used to detect drawn keys and track finger input.

cnn-classification computer-vision keras-tensorflow numpy python tensorflow

Last synced: 07 Jan 2026

https://github.com/shahiakhilesh1304/fitbitcasestudy

This is a case study based on data retrieved from a Fitbit band, and we are making predictions about human behavior based on their mood.

case-study data-visualization fitbit jupyter-notebook numpy python3

Last synced: 13 Apr 2026

https://github.com/preethiphilip/prosper-loan-data-analysis

This project explores the Prosper loan dataset to uncover trends, borrower behaviors, and factors influencing loan performance. Using Python and data visualization techniques, the analysis examines key variables such as credit scores, loan status, income ranges, and debt-to-income ratios to gain insights into credit risk and lending patterns.

jupyter-notebook matplotlib numpy pandas python3 seaborn

Last synced: 13 Apr 2026

https://github.com/miss-mhv/udacityproject_dataviz

Data analysis of a bicycle rental system. The goal is to understand how trips are made according to age, time of year, day of the week, time of day

jupyter-notebook matplotlib numpy pandas python3 seaborn-plots

Last synced: 13 Apr 2026

https://github.com/rajsinha7/hand-gesture-recognition

This project serves as a foundation for integrating hand gesture controls into interactive applications like games, presentations, or robotic controls. It features real-time hand tracking with MediaPipe Hands and live visualization of hand landmarks and gesture labels.

mediapipe numpy opencv python

Last synced: 13 Apr 2026

https://github.com/alinenog/desenvolve_gb_2022

Grupo Boticário's Desenvolve 2022 training program in the data field

data-analysis data-science googlesheet machine-learning numpy pandas python

Last synced: 13 Apr 2026

https://github.com/allanotieno254/pinecone-index-creation-vector-store-database-

This project demonstrates how to create an index in Pinecone, a vector database optimized for similarity search and machine learning applications. The notebook covers setting up a Pinecone client, creating an index, and managing vector data efficiently.

index jupyter-notebook numpy pinecone python

Last synced: 13 Apr 2026

https://github.com/mike014/soundbot

This repository will contain the prototype of a chatbot for audio playback and anything else related to audio, in video games and beyond.

chatbot dsp nlp nlp-library nlp-machine-learning numpy python scipy sound-processing

Last synced: 13 Apr 2026

https://github.com/mahdibehoftadeh/polynomial-regression-co2-emissions

A simple machine learning project that fits a polynomial regression on a large dataset to predict a car's CO2 emissions from built-in features such as engine size and number of cylinders

machine-learning matplotlib numpy nural-network pandas polynomial-regression python scikit-learn

Last synced: 22 Feb 2026

https://github.com/debjyotisaha/web-application-projects

Web applications developed with the help of Python libraries and ML algorithms

algorithms cufflinks machine-learning matplotlib numpy pandas prediction python scikit-learn seaborn sklearn streamlit

Last synced: 13 Apr 2026

https://github.com/mehwishferoz/data-analysis-with-python-zero-to-pandas

This repository contains the Python code and projects I created while learning the Data Analysis with Python: Zero to Pandas course. The course covers essential topics such as data cleaning, analysis, and visualization using powerful Python libraries like Pandas, NumPy, Matplotlib, and Seaborn.

exploratory-data-analysis jovian matplotlib numpy pandas seaborn

Last synced: 10 May 2026

https://github.com/prateekrajsrivastav/financial-transition-classification

This project focuses on classifying financial transactions using machine learning techniques. By leveraging labeled data, the model aims to categorize transactions into predefined categories (e.g., "Food," "Transport," "Shopping," etc.).

matplotlib numpy pandas-python scikitlearn-machine-learning seaborn

Last synced: 07 Sep 2025

https://github.com/paulo-santos-ds/guia_de_precos_de_veiculos_com_machine_learn

Used-car price prediction system developed for the company Rusty Bargain

catboost lgbm numpy pandas pyplot python seaborn sklearn time

Last synced: 13 Apr 2026

https://github.com/samestrin/image-manipulation-api-digitalocean

A Python DigitalOcean App Platform based REST image manipulation API using Flask, NumPy, and OpenCV that runs in a Docker container.

api docker flask image image-processing numpy python python3 rest rest-api

Last synced: 13 Apr 2026

https://github.com/bbhargavpanchal/brain-tumor-segmentation-unet

U-Net based Convolutional Neural Network for brain tumor segmentation using MRI images. Implemented in Python with TensorFlow, OpenCV, and Scikit-learn.

keras-tensorflow numpy opencv pandas python scikit-learn sklearn tensorflow

Last synced: 13 Apr 2026

https://github.com/ksprugevics/magic-square

Some simple algorithm implementations for solving (ordinary) magic squares using NumPy

algorithms mathematics numpy python

Last synced: 21 Apr 2026

https://github.com/danhenriquex/data_science_and_machine_learning

AI, Data Science, Machine Learning, and Deep Learning with TensorFlow

deep-learning deep-neural-networks machine-learning matplotlib numpy pandas tensorflow

Last synced: 13 Apr 2026

https://github.com/chandkund/predicting-diabetes-onset

The "Predicting Diabetes Onset" project aims to build a machine learning model that predicts whether an individual has diabetes based on various health-related features. The dataset used for this project includes attributes related to medical history and physical measurements.

deep-learning numpy pandas python seaborn sklearn-library sklearn-metrics visualization

Last synced: 13 Apr 2026

https://github.com/armahdavi/data_analytics_statistics_plotting_pm_airborne_sampling

All code for the data processing pipelines, statistical modelling, descriptive statistics, and plot visualizations from the airborne phase of Mahdavi et al. (2021) (Environmental Pollution). Project milestone: 2018 - 2021

data-science data-visualization machine-learning matplotlib-pyplot numpy pandas python scikit-learn scipy-stats statistics

Last synced: 13 Apr 2026

https://github.com/anarya22/accenture-north-america-data-analytics-and-visualization-job-simulation-on-forage

Completed a simulation focused on advising a hypothetical social media client as a Data Analyst at Accenture. Cleaned, modelled and analyzed 7 datasets to uncover insights into content trends to inform strategic decisions. Prepared a PowerPoint deck and video presentation to communicate key insights for the client and internal stakeholders.

analyzing-visualization data-cleaning data-visualization numpy pandas powerbi powerpoint-presentations

Last synced: 09 May 2026

https://github.com/pratyush905/farecast-nyc-taxifare-predictor

Machine learning models to predict NYC taxi fares based on a given dataset

jupiter-notebook kaggle machine-learning matplotlib numpy python regression-models scikit-learn

Last synced: 13 Apr 2026

https://github.com/eesunmoon/aml_project

[Project] Analyzing Fitness Patterns & Building Predictive Models

matplotlib numpy pandas python3 recommender-system scikit-learn seaborn tensorflow

Last synced: 13 Apr 2026

https://github.com/dmarks84/coursework_project_text-mining-topic-modeling

Project for University of Michigan Applied Data Science Specialization -- Developed functions to score similarity between text passages.

data-modeling data-reporting data-visualization databases eda nlp numpy pandas python statistics text-mining

Last synced: 12 Apr 2026

https://github.com/caterinatasinato/python-projects

Projects I worked on as Trainee in Data Analytics at ProfessionAI

matplotlib numpy pandas python3

Last synced: 13 Apr 2026

https://github.com/evan-dg31/data-science

Exploratory Data Analysis (EDA), Predictive Modeling (Supervised and Unsupervised), Regression, Classification, Clustering

classification clustering data-analysis data-science data-visualization machine-learning matplotlib numpy pandas python regression-analysis seaborn

Last synced: 13 Apr 2026

https://github.com/joiceo/python

Projects and exercises in Python

eda machine-learning numpy pandas python seaborn sklearn

Last synced: 13 Apr 2026

https://github.com/raghavendranhp/youtube_data_harvesting

The "YouTube Data Analyzer" is a versatile tool for businesses and content creators, enabling them to gather, analyze, and harness valuable insights from multiple YouTube channels. With streamlined data collection, storage in MongoDB, migration to SQL, and a user-friendly Streamlit interface, it empowers users to make data-driven decisions

apiintegration data datacollection eda googleapi googleapiclient matplotlib mongodb mysql mysqlconnector numpy oops pandas pymongo python pythonoops sql sqlalchemy streamlit youtube-api

Last synced: 13 Apr 2026

https://github.com/kizman-23/supervised_models

Classical prediction of future data using models trained by labeled data

numpy pandas scikit-learn supervised-machine-learning

Last synced: 13 Apr 2026

https://github.com/venkat-0706/accenture-hackathon

Developing an e-commerce recommendation system involves utilizing technologies such as Python for programming, Pandas for data manipulation, SQL for database management, FastAPI for building APIs, PostgreSQL for data storage, and Docker for containerization.

alembic api docker fastapi machinelearningalgorithms matplotlib numpy postgresql pydantic python3 scipy seaborn sqlmodel

Last synced: 13 Apr 2026

https://github.com/badranalyst/tips-dataset-analysis-dashboard-with-streamlit-and-plotly

Interactive Streamlit dashboard analyzing the Seaborn 'tips' dataset, which records information on restaurant bills, including total bill amounts, tips, customer demographics (e.g., gender, smoking status), and dining details (e.g., day, time). Visualized with Plotly for insights into tipping patterns.

data-analysis data-analytics data-visualization dataset eda exploratory-data-analysis matplotlib matplotlib-pyplot numpy pandas plotly python seaborn streamlit

Last synced: 13 Apr 2026

https://github.com/annygutierrez/bank-ann

Basic Artificial Neural Network

keras matplotlib numpy pandas sklearn

Last synced: 13 Apr 2026

https://github.com/anujdutt9/language-model-using-rnn

Text Generation using basic RNN from scratch using Python and Numpy.

language-model numpy python rnn text-generation

Last synced: 08 May 2026

https://github.com/somyaaaaaa2004/travelling-sales-man-problem

This project is a soft computing-based solution to the Traveling Salesman Problem (TSP) using a Genetic Algorithm (GA), implemented entirely from scratch in Python on Google Colab. The goal of the TSP is to find the shortest possible route that visits each city exactly once and returns to the starting point. It's a classic NP-hard combinatorial

genetic-algorithm jupyter-notebook matplotlib-pyplot numpy pandas python travelling-salesman-problem

Last synced: 06 May 2026
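
The TSP objective stated in this entry, the length of a route that visits every city once and returns to the start, can be sketched as below. This only shows the cost function a search method would minimize; it is not the repository's genetic algorithm, and the function name `tour_length` and the city coordinates are hypothetical.

```python
from math import dist

def tour_length(cities, order):
    """Length of the closed tour that visits `cities` in `order` and returns to the start."""
    total = 0.0
    for i, idx in enumerate(order):
        nxt = order[(i + 1) % len(order)]  # wrap around to the starting city
        total += dist(cities[idx], cities[nxt])
    return total

# Hypothetical coordinates: the four corners of a unit square.
cities = [(0, 0), (1, 0), (1, 1), (0, 1)]

print(tour_length(cities, [0, 1, 2, 3]))  # walks the perimeter: 4.0
print(tour_length(cities, [0, 2, 1, 3]))  # uses both diagonals, so it is longer
```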

https://github.com/harmanveer-2546/eda-on-indian-railways

Indian Railways is a statutory body under the ownership of the Ministry of Railways of the Government of India that operates India's national railway system. As of 2023, it manages the fourth largest national railway system by size with a track length of 132,310 km, running track length of 106,493 km and route length of 68,584 km.

clean-data eda exploratory-data-analysis geometry geopandas indian-railways json linestring matplotlib numpy os pandas plotly python railway seaborn shapely train visualization

Last synced: 09 Apr 2026

https://github.com/harmanveer-2546/prediction-of-ticket-cancellation

The objective is to develop a model that accurately predicts whether users will cancel their tickets. Each cancellation incurs a fine for the ticket registration site from the passenger company.

datetime evaluation gridsearchcv labelencoder numpy pandas standardscaler stratified-k-fold train-test-split xgboost-model

Last synced: 04 May 2026

https://github.com/csengupta1101/rock-paper-scissor-game

Rock Paper Scissors game built with Python 3, using Jupyter Notebook as the IDE. Code file and README included.

game if-else-statements numpy python python3 random

Last synced: 11 May 2026

https://github.com/anjasfedo/data-analysis

Repo to Explore Data Analysis

data-analysis numpy

Last synced: 13 Apr 2026

https://github.com/prowilliam/maze_generator

A maze generator with configurable dimensions that draws the maze's solution as a red route and exports the result in PNG format

numpy pillow python

Last synced: 28 Apr 2026

https://github.com/harmanveer-2546/predicting-schizophrenia-disorder

The positive symptoms typical of schizophrenia – such as delusions, hallucinations or formal thought disorders – often first appear in an attenuated or transient form during the initial prodromal phase

boxplot decisiontreeregressor disorder linearregression matplotlib mean-squared-error numpy pairplot pandas prediction randomforestregressor schizophrenia seaborn train-test-split visualization

Last synced: 13 Apr 2026

https://github.com/shlok-nahar/mnist-cnn-classifier

This repository trains and evaluates three CNN models on MNIST, providing performance comparisons and 5 unique visualizations.

confusion-matrix graph heatmap-visualization json machine matplotlib mnist numpy precision-recall python receiver-operating-characteristic seaborn sklearn tensorflow

Last synced: 13 Apr 2026

https://github.com/zvdy/movie_recommendation

Movie Recommendation Search Engine using Jupyter Notebooks, pandas, NumPy, scikit-learn, and ipywidgets

data-science jupyter-notebook machine-learning numpy pandas python scikit-learn

Last synced: 13 Apr 2026

https://github.com/nur-alam39/course-lab-experiments

Lab work: code for the academic courses Digital Image Processing, Microprocessor, OOP, and Numerical Analysis

cpp lab lex matplotlib numpy oop opencv python

Last synced: 13 Apr 2026

https://github.com/jessicahora/series_temporais

Time series analysis of Germany's energy consumption.

numpy pandas python time-series-analysis

Last synced: 07 Apr 2026

https://github.com/naveen88112/clustering_customer_invoice_data

Customer Invoice Data Clustering: this project applies clustering methods to customer invoice data for segmentation analysis. It preprocesses data, normalizes features, and uses K-Means and DBSCAN to cluster customers according to spending habits and shared locations.

clustering data-preprocessing data-visualization numpy pandas python silhouette-score standardization

Last synced: 13 Apr 2026

https://github.com/mayankmittal29/algovision-statistical_methods_in_ai

Implementation of various machine learning algorithms from scratch, including Linear Regression, K-Nearest Neighbors, Decision Trees, and K-Means clustering. Also includes EDA on the data and implementations of the LSH, IVF, and SLIC algorithms, with evaluation metrics

decision-tree-classifier eda gradient-descent image-segmentation ivf knn-classification linear-regression lsh-implementation matplotlib-pyplot numpy pandas python3 seaborn sgd-optimizer sklearn slic-superpixel-algorithm

Last synced: 11 Apr 2026

https://github.com/saifalibaig/covid-19-infection-rate-analysis-using-python

Analysis of Covid-19 Infection rate and the world happiness report to identify if there is any relationship between infection rate and happiness

data-analysis data-visualization jupyter-notebook numpy pandas python3 sns

Last synced: 18 Apr 2026

https://github.com/bhanuprakashnani/scit-tasks

Tasks for the SCIT Workshop.

numpy python3

Last synced: 25 Apr 2026

https://github.com/vishal815/python-based-flight-data-scraping-automating-data-collection-for-analysis

This project is a Python-based web scraping script designed to extract flight details from the Yatra travel website. The extracted data includes flight pricing, origin, destination, departure and arrival times, and duration. The data is then saved into a CSV file for further analysis.

beautifulsoup flight-data-analysis flight-data-visualization numpy pandas pyhton pyplot scraping scraping-python seaborn selenium vishal-lazrus vishallazrus webscraping webscraping-projects

Last synced: 18 Apr 2026

https://github.com/animesh-chourey/loan-classifier

Trained machine learning algorithms (Logistic Regression, KNN, SVM, Decision Tree) after performing visualization and preprocessing tasks on a loan dataset. Computed evaluation metrics such as F1-score, log loss, and Jaccard similarity score to assess the algorithms' performance.

decision-tree f1-score jaccard-similarity knn logistic-regression logloss matplotlib numpy pandas preprocessing-data svm

Last synced: 06 May 2026

https://github.com/tatmorenno/python_para_data_science

Python for Data Science: introduction to the language and NumPy 🌎💻

jupyter-notebook numpy python

Last synced: 13 Apr 2026