An open API service indexing awesome lists of open source software.

NumPy

NumPy is an open source library for the Python programming language, adding support for large, multidimensional arrays and matrices, along with a large collection of high-level mathematical functions to operate on these arrays.
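As a minimal sketch of what that description means in practice, the snippet below builds a small two-dimensional array and applies a few of NumPy's mathematical functions to it. The variable names and sample values are illustrative assumptions, not taken from any repository listed here.

```python
import numpy as np

# A 2-D array (matrix) with 2 rows and 3 columns; values are made up.
a = np.array([[1.0, 2.0, 3.0],
              [4.0, 5.0, 6.0]])

# Element-wise math functions operate on the whole array at once.
roots = np.sqrt(a)

# Reductions along an axis: column means (axis=0) and row sums (axis=1).
col_means = a.mean(axis=0)
row_sums = a.sum(axis=1)

# Matrix product of a (2x3) with its transpose (3x2) gives a 2x2 matrix.
gram = a @ a.T
```

Operating on whole arrays this way avoids explicit Python loops, which is the main reason the projects below reach for NumPy.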

https://github.com/lorenzorottigni/ml-lending-club

Machine Learning Python bootcamp: random forest classifier on LendingClub dataset

ipynb machine-learning numpy pandas python random-forest-classifier scikit-learn seaborn

Last synced: 27 Oct 2025

https://github.com/thaisgarcia/analise-exploratoria

Answering 10 business questions from a dataset about a retail chain that sells various products in various US cities.

matplotlib numpy pandas

Last synced: 28 Oct 2025

https://github.com/romerorodriguezd/homography-with-opencv

Homography operations applied to insert images inside others regardless of the perspective.

computer-vision homography matrices-transformation numpy opencv opencv-python

Last synced: 28 Oct 2025

https://github.com/dmarks84/ind_project_obesity-multi-class-classification--kaggle

Independent Project - Kaggle Competition -- I worked on the obesity classification data set as part of a Kaggle Competition of the same name, scoring (for accuracy) above 0.9

classification correlation-analysis cross-validation data-modeling data-visualization dataframes eda gridsearchcv matplotlib multiclass-classification numpy pandas python seaborn sklearn statistics supervised-ml

Last synced: 28 Oct 2025

https://github.com/lintangwisesa/python_pillow_image_array

Convert 🖼 image to 📜 Numpy array using Pillow [Python]

image numpy pillow python

Last synced: 28 Oct 2025

https://github.com/sahraiidle/email-spam-detector

Email/SMS spam detector with a Flask UI/API, tuned ML models (TF‑IDF + SVM/LogReg/NB), and a ready-to-run web form plus JSON endpoint for predictions.

data machine-learning numpy pandas python randomforest scikit-learn spam-classifier spam-detection svm

Last synced: 24 Jan 2026

https://github.com/alejandro945/insurance-risk

This project aims to predict the risk of insurance claims using a dataset from Kaggle. The dataset consists of 26 columns and 205 rows, providing various features related to insurance risk. By analyzing this data, we seek to build predictive models that can help insurers assess the risk of claims.

data-analytics ipython-notebook numpy pandas python

Last synced: 06 Feb 2026

https://github.com/jaybfn/single-molecule-electrophysiology-data-analysis

Pynanopore serves as a sophisticated toolkit for analyzing single molecules, designed to interpret complex states within nanopore data automatically.

docker numpy pandas plotly pytest python3 streamlit streamlit-webapp

Last synced: 28 Jan 2026

https://github.com/shwetapardhi/assignment-03-q5--hypothesis-testing

Chi2 contingency independence test Q5. Fantaloons Sales managers commented that % of males versus females walking into the store differ based on day of the week. Analyze the data and determine whether there is evidence at 5 % significance level to support this hypothesis. Assume Null Hypothesis as Ho: Independence of categorical variables (% of

alternate-hypothesis chi2-contingency contingency-analysis hypothesis-testing null-hypothesis numpy pandas python scipy significance-testing stats

Last synced: 28 Jan 2026

https://github.com/drsartoriuss/portfolio

Study projects from the Yandex Practicum "Data Science Specialist+" course

catboost computer-vision data-science deeplearning mashinelearning matplotlib nlp-machine-learning numpy pandas scipy seaborn sklearn sql torch tsf

Last synced: 28 Jan 2026

https://github.com/tamk-kol/project_orbital_data_analysis

The goal of this project is to develop an automatic method to detect orbital maneuvers using machine learning.

matplotlib numpy pandas scikit-learn

Last synced: 30 Jan 2026

https://github.com/stavrosarm/mathematics_algorithms

Fundamental algorithms of Linear Algebra, Numerical Analysis and Calculus in Python and Julia.

calculus julia-language linear-algebra numerical-analysis numpy python

Last synced: 30 Jan 2026

https://github.com/keyurparalkar/knn-for-cifar-10

K-nearest neighbour algorithm for CIFAR-10 dataset

cifar10 knn numpy

Last synced: 07 Feb 2026

https://github.com/thkox/uni_2023-compression-and-object-removal

This project is a team assignment for the "Multimedia Systems" course, offered in the 6th semester of the 2023 academic year at the University of Piraeus, Department of Informatics.

2023 6th-semester colorama compression-algorithm department-of-informatics heapq lossless-compression-algorithm motion-compensation multimedia-systems numpy object-removal opencv-python pickle python scipy-stats university-of-piraeus

Last synced: 07 Feb 2026

https://github.com/nmerka/practicum_rep

Projects from the Data Analyst course (Yandex Practicum)

jupyter-notebook matplotlib numpy pandas plotly python scipy seaborn sql tableau

Last synced: 31 Jan 2026

https://github.com/chandkund/sentiment-analysis-on-movie-reviews

Project on Sentiment Analysis of Movie Reviews using K-means clustering. This project clusters movie reviews based on their sentiment scores to understand audience reactions. Includes data preprocessing, feature selection, model development, and visualization.

data-science eda jupyter-notebook machine-learning matplotlib numpy pandas preprocessing python sklearn

Last synced: 01 Feb 2026

https://github.com/bineet-ratna-shakya/data-science-salary-analysis

Analyzing a dataset containing salaries of data science professionals from 2020 to 2023.

data-analysis data-science data-visualization jupyter numpy pandas python

Last synced: 01 Feb 2026

https://github.com/rissh/titanicsurvivalpredictionusingml

Predicting Titanic passenger survival through machine learning. This project includes data preprocessing, exploratory data analysis, feature engineering, and model training using Python. 🚢

data data-analysis data-science data-visualization dataanalysis jupiter-notebook machine-learning machine-learning-algorithms machinelearning matplotlib numpy pandas prediction prediction-model python python3 seaborn tenserflow tflearn titanic

Last synced: 01 Feb 2026

https://github.com/aimin-nur/dataanalyst-cohortanalysis

A data analyst project to find out customers' purchasing patterns over a given period of time (cohort analysis).

cohort-analysis numpy pandas pytz

Last synced: 25 Jun 2025

https://github.com/jonathanp-code/stock-prediction-with-lstm-using-yfinance

This repository contains two machine learning models that predict future prices for Gold and Apple (AAPL) stocks using LSTM neural networks. The models fetch historical price data from Yahoo Finance, preprocess the data, and also generate predictions with buy/sell recommendations.

keras keras-neural-networks keras-tensorflow lstm lstm-neural-networks mashine-learning numpy python stock-market stock-price-prediction stocks tensorflow yfinance

Last synced: 25 Jun 2025

https://github.com/lmizner/codecademy_auto_evaluation

Summarized categorical variables in Python using numerical summary statistics.

categorical-data frequency jupyter-notebook median numpy pandas proportion python

Last synced: 25 Jun 2025

https://github.com/05afreen/indian-job-market-analysis

This repository focuses on 🔍 scraping, 🧹 cleaning, and 📊 visualizing job market data in India. It automates job listing collection from platforms like 🌐 Naukri, Indeed, and LinkedIn using 🐍 BeautifulSoup/Selenium. The scraped data is then 🛠️ cleaned and standardized, handling ❌ missing values, 🏷️ formatting inconsistencies, and 🔎 extracting

numpy pandas powerbi selenium

Last synced: 30 Dec 2025

https://github.com/tigureis/aula_deployment

This project not only demonstrates a machine learning pipeline but also serves as a template for deploying models into production. By leveraging the project's structure and code, you can streamline the process of deploying your own machine learning models.

numpy pandas pickle sklearn

Last synced: 30 Dec 2025

https://github.com/alexsolov28/ml_nir

Research project on the topic "Housing Price Forecasting"

jupyter-notebook matplotlib numpy pandas python seaborn

Last synced: 20 Jan 2026

https://github.com/nagipragalathan/python_tutorial_for_data-science

This repository is a comprehensive guide for learning data science using Python. It covers various essential libraries and tools commonly used in the field of data science, including Jupyter Notebook, Matplotlib, NumPy, Pandas, Scikit-learn, and PyTorch.

datascience datavisualization deeplearning jupyter jupyter-notebook learning-by-doing learningresources machinelearning matplotlib numpy opensource pandas python python-script python3 pytorch pytorch-implementation scikitlearn tutorial

Last synced: 30 Dec 2025

https://github.com/singhvishal003/email-spam-detector-model

Email Spam Detector model to detect spam messages.

numpy pandas sklearn

Last synced: 30 Dec 2025

https://github.com/shandilya1998/neurorobotics

Python tools for robotics, deep reinforcement learning and neuroscience research.

computational-neuroscience mujoco mujoco-py numpy openai-gym pytorch reinforcement-learning robotics

Last synced: 30 Dec 2025

https://github.com/tinaland101/credit-risk-classification

The purpose of this project is to build a credit risk classification model using machine learning techniques. This model helps identify the creditworthiness of borrowers based on historical lending data. Specifically, it uses a logistic regression model to predict whether a loan is healthy (0) or high-risk (1).

numpy pandas pathlib scikit-learn

Last synced: 24 Feb 2025

https://github.com/alirezasaharkhiz9/python-for-dataanalysis

The educational repository for data manipulation and analysis with the libraries numpy, pandas, and matplotlib.

matplotlib numpy pandas

Last synced: 24 Feb 2025

https://github.com/amiridk/statics-and-algebra-in-python

Simple and important statistics operations and famous statistical problems, plus algebra operations.

algebra itertools matplotlib numpy pandas python scipy seaborn static

Last synced: 24 Feb 2025

https://github.com/kingsley-ezenwaka/medical-data-visualizer

A data analysis project that investigates a dataset of anonymous patients' medical information, and explores the relationship between cardiac disease, body measurements, blood markers, and lifestyle choices.

analysis data matplotlib numpy pandas seaborn

Last synced: 24 Feb 2025

https://github.com/amiridk/machinelearning

machine-learning-notes

matplotlib-pyplot numpy pandas pyhton

Last synced: 24 Feb 2025

https://github.com/kingsley-ezenwaka/mean-var-std

This is a (mini) Python project, completed as part of the requirements for the Data Analysis with Python certification from freeCodeCamp.com.

numpy python

Last synced: 24 Feb 2025

https://github.com/fatimatuzzahromutmainnah/predictive-maintenance-for-industrial-equipment

This project focuses on Predictive Maintenance for industrial equipment using machine learning. The goal is to predict potential machine failures before they occur, enabling proactive maintenance and reducing downtime.

decision-trees digitalocean docker feature-selection flask logistic-regression matplotlib modelevaluation numpy pandas predictive-maintenance predictive-modeling seaborn sklearn-library

Last synced: 24 Feb 2025

https://github.com/swarup-ranjan-mondal/chat-analyzer

A streamlit app to analyze whatsapp personal and group chats.

matplotlib numpy pandas python seaborn streamlit

Last synced: 26 Oct 2025

https://github.com/urvee1810/air-quality-prediction-using-arima

Analyzed Beijing's air quality data using time series analysis and ARIMA modeling to forecast PM2.5 pollution levels. Identified seasonal patterns and correlations between pollutants, revealing insights into urban air quality trends and potential contributing factors.

arima-model dickey-fuller-test matplotlib numpy pandas python seaborn statistical-analysis time-series-analysis

Last synced: 24 Feb 2025

https://github.com/urvee1810/market_basket_analysis

A data mining project analyzing Instacart's 3 million grocery orders to uncover customer shopping patterns and product associations. Using market basket analysis and the Apriori algorithm, the project reveals key insights about shopping behavior, product combinations, and temporal patterns, providing valuable recommendations for retail strategy

apriori-algorithm data-mining data-visualization machine-learning market-basket-analysis matplotlib mlxtend numpy pandas python seaborn

Last synced: 24 Feb 2025

https://github.com/chenna7893/enhancing-customer-retention-a-machine-learning-approach-to-churn-prediction

Churn prediction using machine learning models like Random Forest, Decision Tree, and XGBoost involves identifying customers who are likely to stop using a company's services.

decision-tree-classifier matplotlib numpy pandas random-forest-classifier xgboost

Last synced: 24 Feb 2025

https://github.com/sadiyabhokare/window_switch

🖐️ Hand Gesture-Based Task View Navigator is a project that enables users to switch between desktop windows using simple hand gestures. This tool offers a touch-free way to manage open windows.

mediapipe numpy opencv-python

Last synced: 24 Feb 2025

https://github.com/lexiortiz/ibm-data-engineering-fundamentals

Notes, exercises, and projects from the IBM Data Engineering Fundamentals path via Verizon Skill Forward.

data-engineering numpy pandas postegresql python sql

Last synced: 24 Feb 2025

https://github.com/manalisbhavsar/stock-price-prediction

Stock Price Prediction model using Machine Learning and LSTM to forecast future stock prices based on historical data. Achieved a low error rate of 3.2% by leveraging moving averages and deep learning techniques, ensuring accurate predictions.

data-analysis deep-learning lstm machine-learning matplotlib numpy pandas python

Last synced: 24 Feb 2025

https://github.com/albert-2002/robust-penalized-empirical-likelihood-estimation-method-for-linear-regression

Implementation for the paper titled: Robust Penalized Empirical Likelihood Estimation Method for Linear Regression

linear-regression machine-learning numpy pandas python regression regression-analysis statistics

Last synced: 24 Feb 2025

https://github.com/karthik9273/electric-vehicle-market-segmentation

Analysis of the electric vehicle market in India using segmentation analysis, to come up with a feasible strategy for entering the market by targeting the segments most likely to use electric vehicles.

data-science machine-learning-algorithms matplotlib numpy numpy-arrays pandas seaborn statistics

Last synced: 19 Jul 2025

https://github.com/rahulchouhan1/air-quality-data-in-india

India Air Quality Analysis (2015-2024)

matplotlib numpy pandas python

Last synced: 24 Feb 2025

https://github.com/petridhsg/firewall-data-classification

A single implementation of a machine learning algorithm for a firewall data classification task

machine-learning matplotlib numpy python seaborn

Last synced: 16 Nov 2025

https://github.com/wisskirchenj/matrix-processor

Python app to process matrices using numpy

matrix-calculations numpy python3

Last synced: 12 Sep 2025

https://github.com/chandkund/personalized-meal-recommendation

This project focuses on building a personalized meal recommendation system by predicting cuisines based on the ingredients used. Leveraging NLP and a deep learning model, this system helps users discover meal options tailored to their preferences.

matplotlib nlp numpy pandas sklearn

Last synced: 29 Mar 2025

https://github.com/rohansoni45/movie-recommendation-system

This project is a Content-Based Recommender System that suggests movies to users based on their preferences and watched history. The system leverages cosine similarity to find and recommend movies similar to a selected title. It is built using Python and libraries like Pandas, NumPy, and Scikit-learn.

content-based-filtering cosine-similarity data-analysis data-science machine-learning numpy pandas python recommender-system render scikit-learn

Last synced: 05 Apr 2025

https://github.com/ugurcan222/a-different-approach--image-enhancement-with-imputation-and-regression-methods

This experimental work presents a different approach to increase the size and quality of an image by adding a blank pixel around each pixel in an image, enlarging the image, breaking it into parts, and generating these blank pixels by predicting them with models.

ai-image-upscaling computer-vision digital-image-processing gradient-boosting image-analysis image-enhancement image-enlargement image-interpolation image-processing imputation knn machine-learning numpy opencv pixel-prediction python randomforest regression-models super-resolution xgboost

Last synced: 17 Jan 2026

https://github.com/strawndri/python-ds-numpy

Introductory project on data science and Python, from the Alura platform.

alura data-science numpy python

Last synced: 12 Sep 2025

https://github.com/mani-prakash-n-r/book_recommendation_system_using_hybrid_technique

The Book Recommendation System uses collaborative and content filtering techniques to suggest books based on user preferences and content similarity.

csv html numpy pandas python sckiit-learn

Last synced: 24 Feb 2025

https://github.com/chdl17/credit-eda-casestudy

Credit EDA is a GitHub repository designed to provide educational materials and tools for exploring credit risk data. The repository contains notebooks, datasets, and code samples to help users learn and apply these concepts to credit data.

exploratory-data-analysis numpy pandas

Last synced: 03 Aug 2025

https://github.com/chdl17/imdb-assignment

This GitHub repository contains a project that performs an analysis of the IMDb movie rating dataset. The data is used to explore the relationships between various features and the movie rating. Visualizations are used to illustrate the results of the analysis. The code is organized into a Jupyter Notebook file and data files.

numpy pandas-dataframe python

Last synced: 24 Feb 2025

https://github.com/jonolav95/advent_of_code_2022

Using Python with Pandas and Numpy (www.adventofcode.com).

numpy pandas

Last synced: 05 Apr 2025

https://github.com/aqueeqazam/numpy-for-mathematical-computing

You're at the perfect spot if you're interested in developing your career in data science or machine learning, or if you just enjoy having fun with mathematical operations.

mathematics numpy numpy-arrays

Last synced: 04 Mar 2025

https://github.com/bishopce16/world_weather_analysis

The project’s objective is to improve a travel app to give customers a way to decide their travel destination and ideal hotel based on weather preferences.

apis citipy datetime gmaps json-traversal jupyter-notebook matplotlib numpy pandas python requests scipy

Last synced: 24 Feb 2025

https://github.com/vbhvsingh0/cdc_immunization

This project explores the relationships between different vaccines and sex, age, and other basic features in the data.

data-cleansing data-manipulation-with-pandas data-science numpy pandas-python python3

Last synced: 04 Mar 2025

https://github.com/brenderb/python-projects

General interest and school python projects I completed. Included are lab analysis files (with uncertainty analysis, fitting, and graphing), simulation files (visualization of electric+magnetic fields, etc), specific research project files (data extraction + processing for a building science lab, graphs for a medical physics paper) and misc files

csv fitting matlab matplotlib numpy os pandas pyplot python scipy uncertainty

Last synced: 05 Apr 2025

https://github.com/vbhvsingh0/deforestation_rainfall_correlation

The aim of this project is to check whether rainfall correlates with deforestation in the state of Pennsylvania, USA.

correlation-analysis data-science matplotlib-pyplot numpy pandas python3

Last synced: 04 Mar 2025

https://github.com/vbhvsingh0/fraudulent_transactions

A few models were developed based on Decision trees and Logistic Regression to categorize fraudulent transactions

credit-card-fraud data-science logistic-regression machine-learning numpy numpy-python pandas pandas-python python3 random-forest-classifier

Last synced: 04 Mar 2025

https://github.com/salvatoreamaddio/powerbi

This project combines both Python and PowerBI technologies. It cleans a dataset and imports it into a PowerBI file.

numpy pandas pandas-python powerbi python

Last synced: 21 Mar 2025

https://github.com/manojkp08/student-performance-analysis

The Student Performance Analyzer is your go-to solution for understanding and improving student performance. By blending the power of machine learning with interactive visualizations, this tool provides educators and learners with personalized insights into learning styles, performance gaps, and actionable improvements.

machine-learning numpy pandas python requests scikit-learn streamlit

Last synced: 30 Dec 2025

https://github.com/prarthana-singh/potato-disease-classification-using-deep-learning

🌱 Potato Disease Classification using Deep Learning – A deep learning-based model to detect and classify potato plant diseases using image processing and CNNs. Built with TensorFlow/Keras & Python.

agriculture cnn deep-learning image-classification keras machine-learning neural-network numpy pandas plant-disease-detection python3 streaming tensorflow

Last synced: 04 Mar 2025

https://github.com/blinksta1ker/co2-prediction

EDA and Supervised ML model using Regression to predict CO2 Emissions from Vehicles

jupyter-notebook machine-learning numpy pandas python regression seaborn sklearn supervised-learning

Last synced: 30 Dec 2025

https://github.com/supreme-gg-gg/beam-analysis

Fast, customisable, intuitive web application to simplify beam analysis. Certified by Bob the Builder.

civil-engineering matplotlib numpy python streamlit structural-analysis

Last synced: 30 Dec 2025

https://github.com/technologicat/orderfix

Reorder solutions of parametric studies to make continuous curves

cython eigenvalues numpy postprocessing python python2 python27 python3 python34

Last synced: 25 Jan 2026

https://github.com/alexsolov28/vkrb

Bachelor's final qualifying thesis: "A System for Analyzing Machine Learning Algorithms for Solving Classification Problems Using Pandas"

jupyter-notebook matplotlib numpy pandas python scikitlearn seaborn streamlit

Last synced: 11 Sep 2025

https://github.com/harmanveer-2546/motor-vehicle-accidents-in-india

As per the report, a total of 4,61,312 road accidents have been reported by States and Union Territories (UTs) during the calendar year 2022, which claimed 1,68,491 lives and caused injuries to 4,43,366 persons.

accidents accidents-analysis darkgrid data-analysis eda exploratory-data-analysis indian-roads inline matplotlib motor-vehicles numpy pandas review seaborn visualization

Last synced: 19 Jan 2026

https://github.com/mituskillologies/aiml-dypiemr-sep24

Programs conducted at DYPIEMR, Pune in training on AIML during September 2024.

artificial-intelligence data-analysis data-science machine-learning matplotlib neural-network numpy pandas python3

Last synced: 05 Apr 2025

https://github.com/harmanveer-2546/guide-to-regularization

Regularization is a crucial technique in machine learning that helps to prevent overfitting. Overfitting occurs when a model becomes too complex and learns the training data so well that it fails to generalize to new, unseen data.

generalization inline l1 l2 matplotlib numpy overfitting overfitting-prevention overfitting-reduced pandas regularization regularization-methods regularization-techniques regularization-to-avoid-overfitting seaborn

Last synced: 05 Apr 2025

https://github.com/chandkund/predicting-heart-disease

Welcome to the Heart Disease Prediction project! 🩺 This project focuses on developing a predictive model to assess heart disease risk based on health indicators like age, cholesterol levels, and blood pressure. By analyzing these features, we aim to create an effective tool for early diagnosis and heart disease prevention

machine-learning matplotlib numpy pandas python seaborn sklearn

Last synced: 18 Jan 2026

https://github.com/mariam-zaidi/bike-sharing-prediction-model

Build a multiple linear regression model for the prediction of demand for shared bikes.

linear-regression matplotlib numpy pandas rfe seaborn sklearn

Last synced: 24 Aug 2025

https://github.com/mchenryspagg/economic-variable-correlation-and-productivity-model

This report provides an in-depth analysis of Nigeria's economic performance over the past decade, spanning from 2014 to 2024.

correlation exchangerates inflation moneysupply numpy productivity python regression regression-analysis regression-models visualization

Last synced: 10 Sep 2025

https://github.com/Mike014/SoundBot

This repository will contain the prototype of a chatbot created for audio playback and for everything related to audio, in the world of video games and beyond.

chatbot dsp nlp nlp-library nlp-machine-learning numpy python scipy sound-processing

Last synced: 12 Sep 2025