An open API service indexing awesome lists of open source software.

NumPy

NumPy is an open source library for the Python programming language that adds support for large, multidimensional arrays and matrices, along with a large collection of high-level mathematical functions to operate on these arrays.

https://github.com/anvesham/machinelearning_models_using_matlab

Stepwise Multiple Linear Regression (with interactions) and Random Forest Regression for predicting the productivity of garment factory workers

bagging-ensemble bayesian-optimization decision-trees jupyter-notebook machine-learning matlab multiple-linear-regression-model numpy python random-forest-regression

Last synced: 11 Apr 2026

https://github.com/aarryasutar/credit_eda

This project focuses on cleaning and analyzing a loan application dataset to gain insights into the factors influencing loan defaults. Through systematic data cleaning, visualization, and merging with previous application data, it provides a robust foundation for further predictive modeling.

binning boxplot correlation-matrix data-cleaning data-splitting dataframe feature-engineering heatmap jupyter-notebook matplotlib numpy pandas python scikit-learn seaborn

Last synced: 13 Apr 2026

https://github.com/yash22222/sync-intern-s-ml-tasks

SYNC INTERN'S Machine Learning internship offers you the chance to enhance your skills through real-life example projects. The internship builds your knowledge of data and algorithms so that you understand how a machine learns.

bhpp boston-house-datasets boston-house-price-prediction boston-house-pricing data data-structures machine-learning machine-learning-algorithms numpy pandas sync-intern sync-interns

Last synced: 07 May 2026

https://github.com/mahnoorsheikh16/Credit-Card-Default-Prediction

This project focuses on predicting whether a customer will default on their credit card payment in the upcoming month. Utilizing historical transaction data and customer demographics, the project employs various machine learning algorithms to distinguish between risky and non-risky customers for better credit risk management.

chi-square-test encoding hiplot imblearn json knn-imputer matplotlib numpy pandas pca-analysis pillow plotly robust-scalar scipy seaborn sklearn smote streamlit ttest visualization

Last synced: 01 Mar 2025

https://github.com/dpgitaccount/file-handling-data-analytics-toolkits

This repo provides file handling, inventory management system, and data toolkit resources, which are essential for managing, storing, and manipulating data in various formats, such as JSON files. JSON (JavaScript Object Notation) is widely used for data exchange due to its simplicity and readability.

filehandling inventory-management json jupyter-notebook numpy os

Last synced: 13 Apr 2026

https://github.com/toby-sanders-101/mydigits

Everything needed to create an AI bot that can read images

deep-neural-networks image-classification matplotlib numpy opencv pyautogui python

Last synced: 08 May 2026

https://github.com/mardavsj/numpy-in-python

The fundamentals of the Python NumPy library.

analysis-tool data-manipulation numpy numpy-arrays numpy-matrix python

Last synced: 14 Apr 2026

https://github.com/rkarahul/machinelearning

This is for all my machine learning code

analysis computer-vision matplotlib mysql nlp numpy pandas python

Last synced: 11 Apr 2026

https://github.com/datarohit/date-fruit-classification

This is the Date Fruit dataset taken from Kaggle. The data serves a classification problem: using various features of the fruit, classify the fruit by type.

classification logisticregression numpy pandas python3 standard-scaler

Last synced: 13 Apr 2026

https://github.com/akashash01/logistic-regresion

In contrast to a linear regression model, this is used to examine the association between independent variables (categorical or continuous) and a binary dependent variable, which is a discrete outcome.

auc-roc-curve classification confusion-matrix data-science kolmogorov-smirnov machine-learning nonlinear-regression numpy pandas python supervised-learning vif woe-and-iv

Last synced: 08 May 2026

https://github.com/framebuffers/mindhunter

Wrappers for Pandas DataFrames that add quicker access to common statistical values, utilities and functionality.

data-analysis data-science numpy pandas python utilities-python

Last synced: 08 May 2026

https://github.com/idow09/ud810-problem-sets

Solutions to the problem sets of the Introduction to Computer Vision course at Udacity

computer-vision numpy opencv opencv-python problem-sets python solutions udacity

Last synced: 09 May 2026

https://github.com/avs123a/image-color-spaces-morphology-demo

Simple demo that shows extracting RGB and HSV channels and converting from RGB to HSV and back. Four main morphological operations are used: erosion, dilation, opening, and closing.

demo image-processing jupyter-notebook numpy opencv python3

Last synced: 05 May 2026

https://github.com/haloapping/sharing-tipis-tipis

#sharingtipistipis 📗 notes taken while learning machine learning, deep learning, and much more.

caring deep-learning machine-learning numpy pandas scikit sharing

Last synced: 09 May 2026

https://github.com/paulo-santos-ds/analise_corrida_taxi

This project aims to analyze data from a fictional ride-sharing app: finding patterns in the available information and understanding passenger preferences and the impact of external factors on rides.

math matplotlib numpy pandas python3 seaborn sql

Last synced: 09 May 2026

https://github.com/m-rishab/housing-price-prediction

The Housing Price Prediction Accuracy Improvement project is a data-driven initiative focused on enhancing the precision and reliability of housing price predictions. This project encompasses a multidisciplinary approach, combining data science, machine learning, and real estate insights to optimize the accuracy of forecasts in the housing market.

gridsearchcv hyperparameter-tuning matplotlib matplotlib-pyplot numpy pandas pickle python3 randomforestregressor scipy sklearn

Last synced: 11 Apr 2026

https://github.com/quentinwidlocher/cours-python-pandas

Jupyter Lab on data processing in Python

jupyter-lab matplotlib numpy pandas python

Last synced: 13 Apr 2026

https://github.com/ishutak/disease_prediction

An AI-powered disease prediction system that uses machine learning to predict diseases based on symptoms. The system employs an ensemble of models including Random Forest and Neural Networks to provide accurate predictions with confidence levels.

css3 htlm5 javascript jquery numpy pandas pytorch scikit-learn select2

Last synced: 11 Apr 2026

https://github.com/mohd-tahzeeb-khan/reverse_dcf-calculator-and-vistualization

I developed this dashboard for an internship application, in which I had to build it to showcase my skills and talent.

beautifulsoup bs4 dash dashboard dataframes flask numpy pandas plotly plotly-dash plotly-express python requests shares stock-market

Last synced: 14 Apr 2026

https://github.com/takkii/pylean

Data analysis ( 🐍 💎 📈 )

analayze matplotlib numpy pandas python scikit-learn

Last synced: 09 Sep 2025

https://github.com/mchenryspagg/prosper-loan-project

A data analysis project that uses data from a fictional loan company known as Prosper to perform exploratory data analysis, using univariate, bivariate and multivariate visualizations to produce insights that answer questions asked of the data.

datastorytelling exploratory-data-analysis exploratory-data-visualizations jupyter-notebook matplotlib-pyplot numpy pandas python seaborn-plots

Last synced: 13 Apr 2026

https://github.com/Krisseck/hki-traffic-predict

Keras neural network to predict traffic in Helsinki

avoindata helsinki keras numpy open-data python traffic

Last synced: 13 Jul 2025

https://github.com/k-ibadullaev/py-ssa-lib

Py-SSA-Lib: Python implementation of the multichannel singular spectrum analysis (MSSA) and singular spectrum analysis (SSA)

mssa numpy python singular-spectrum-analysis ssa time-series time-series-analysis time-series-forecasting

Last synced: 10 May 2026

https://github.com/codewitheshayoutube/corvit_python_data_analytics_course

I teach Python & Data Analytics at Corvit Systems. This repository contains all course materials, including code, assignments, projects, and resources. ⭐ Stay updated!

matplotlib numpy pandas python python3 scikitlearn-machine-learning seaborn

Last synced: 13 Apr 2026

https://github.com/Shriram-Vibhute/RealEstateForecast-Price-Prediction-System

This project aims to predict house prices. It involves several key stages, including data preprocessing, feature engineering, model selection, and evaluation. The goal is to develop a model that provides accurate and reliable price predictions based on the given features.

exploratory-data-analysis feature-engineering feature-transformation hyperparameter-tuning machine-learning matplotlib numpy pandas python random-forest sklearn

Last synced: 02 Apr 2025

https://github.com/huifer/python_doc

Python-related technical material

numpy pandas python requests

Last synced: 11 May 2026

https://github.com/ashraf-khabar/funcplot-plottingfunctionsmadeeasy

FuncPlot is a user-friendly app built with Kivy and KivyMD that allows you to easily plot and visualize mathematical functions on your desktop or mobile device. With its intuitive interface and support for multiple functions, FuncPlot makes it easy to explore and understand complex mathematical concepts.

kivy kivymd matplotlib numpy python

Last synced: 11 May 2026

https://github.com/martincastroalvarez/python-deep-learning-algorithms

Combinatorics implemented with numpy, matplotlib and scipy.

algebra combinatorics matplotlib numpy python scipy

Last synced: 13 Apr 2026

https://github.com/rhazra-003/sketcher

Simple Python code to make a sketch out of any given photo using OpenCV and NumPy

numpy opencv python3

Last synced: 11 May 2026

https://github.com/fayzi-dev/machin_learning

Machine learning algorithms in full (Linear Regression, Decision Tree, Random Forest, Neural Network, Logistic Regression, Support Vector Machine, Naive Bayes, Clustering, XGBoost, DBSCAN, KMeans)

algorithms artificial-neural-networks logistic-regression machine-learning matplotlib matplotlib-pyplot naive-bayes-classifier numpy pandas python python3 seaborn seaborn-plots sklearn sklearn-knn sklearn-library sklearn-linear-model sklearn-linear-regression sklearn-metrics sklearn-svm

Last synced: 09 Apr 2025

https://github.com/michaelfeil/lstm-dlnn2020

Character-LSTM implementation in NumPy

lstm nlp-machine-learning numpy

Last synced: 14 May 2026

https://github.com/anishshinde01/machine-learning-exercises

Python implementations of machine learning, statistics, and mathematical foundations.

linear-algebra machine-learning machine-learning-algorithms matplotlib numerical-analysis numpy python scikit-learn scipy statistics

Last synced: 11 Jun 2026

https://github.com/kirkalyn13/open-signal-report-generator

Script used to generate results/summary, including the trends of flagged provinces, from the raw Excel data file.

data-analysis data-science data-visualization matplotlib numpy pandas python

Last synced: 19 Jun 2026

https://github.com/alicankaya192/deep_learning_path

Deep learning roadmap from zero to advanced level. 11 modules: PDF + Python + Jupyter Notebook. Comparative implementations in NumPy · TensorFlow · PyTorch.

artificial-intelligence cnn data-science deep-learning gan jupyter-notebook lstm machine-learning neural-networks numpy python pytorch rnn tensorflow transformer

Last synced: 21 Jun 2026

https://github.com/rara-ch/data-analysis-portfolio

This repository stores my data analytics projects, showcasing my skills in SQL and Python.

data-analysis mathematics matplotlib numpy pandas portfolio probability python seaborn sql statistics

Last synced: 12 Mar 2025

https://github.com/ola-kaznowska/median-statistic

A program that calculates the median statistic. Written in Python 3.13.1 with the NumPy library during a machine learning course.

mathematics numpy python python3

Last synced: 04 May 2026

https://github.com/arsalan692/python-telusko-playlist

A collection of beginner Python scripts and practice exercises from my early learning days, inspired by Telusko and other resources.

basic-programming bs4 matplotlib numpy oop pandas python requests selenium tkinter

Last synced: 04 May 2026

https://github.com/florianwoelki/salary_linear_regression

This project is for salary prediction using linear regression without any machine learning library.

jupyter linear-regression machine-learning matplotlib numpy pandas python regression

Last synced: 04 May 2026

https://github.com/cheese23456/ai-based_stock_analysis_and_portfolio_optimisation

📈 Analyze stocks and optimize portfolios with AI-driven insights, technical indicators, and clear investment signals for informed decision-making.

ai asset-allocation financial-analysis huggingface-spaces numpy pandas portfolio-optimisation python scipy-optimize streamlit

Last synced: 04 May 2026

https://github.com/sweta-kaundilya/python_for_data_analysis

Learning Python and all the relevant Python libraries for the data field.

cufflinks data-analysis data-science matplotlib numpy pandas plotly python seaborn

Last synced: 04 May 2026

https://github.com/jonad/titanic_exploratory_analysis

Exploratory Data Analysis on Titanic Dataset

jupyter-notebook numpy pandas python3

Last synced: 04 May 2026

https://github.com/sugha22/data_analysis

Actively upskilling in data science with hands-on learning during a Green Internship focused on environmental sustainability. Used Pandas and NumPy for data preprocessing and cleaning, and created visual dashboards in Excel and Tableau. Gained experience in interpreting sustainability metrics and communicating insights through data storytelling and

data-cleaning data-collection data-storytelling data-transformation excel numpy pandas preprocessing sustainability-metrics tableau-dashboards visualization

Last synced: 04 May 2026

https://github.com/silbena/gibbssampler

Gibbs Sampler for a simple mortgage model.

gibbs-sampler gibbs-sampling numpy python

Last synced: 04 May 2026

https://github.com/saifalibaig/crop-yield-prediction

🌾 A machine learning-based crop production prediction system using historical Indian agricultural data with advanced regression models and hyperparameter tuning.

eda feature-encoding feature-selection kaggle-dataset lasso-regression linear-regression matplotlib-pyplot numpy onehot-encoding pandas python3 ridge-regression seaborn simpleimputer xgboost-regression

Last synced: 04 May 2026

https://github.com/ronitjariwala/-prodigy_ds_01

Prodigy InfoTech Data Science Internship Task-1

matplotlib numpy pandas python seaborn

Last synced: 04 May 2026

https://github.com/swarup-ranjan-mondal/chat-analyzer

A Streamlit app to analyze personal and group WhatsApp chats.

matplotlib numpy pandas python seaborn streamlit

Last synced: 04 May 2026

https://github.com/jakubfr4czek/apartment-prices-analysis

This repository contains an analysis of apartment prices in Krakow. It was made for the "Rachunek prawdopodobieństwa i statystyka" (probability and statistics) course I took at AGH University of Science.

agh agh-wi analysis csv jupyter jupyter-notebook krakow numpy pandas price-analysis price-prediction probability probability-statistics python statistics statistics-learning

Last synced: 04 May 2026

https://github.com/dyrits/parite

OPENCLASSROOMS | PERFECTIONNEZ-VOUS EN PYTHON (Improve Your Python Skills)

matplotlib numpy pandas python

Last synced: 04 May 2026

https://github.com/aqueeqazam/machine-learning-using-scikit

This repository contains all of the algorithms used to train the machine learning models using the Scikit library.

numpy scikit-learn

Last synced: 04 May 2026

https://github.com/asuquoaa/applied_data_science_with_python_study

Notes, code, and practicals on the Applied Data Science with Python specialization by UM

matplotlib numpy pandas seaborn

Last synced: 04 May 2026

https://github.com/srevenant/data-science-alpine

A Docker container for data science, using Alpine Linux and Python 3

alpine data numpy pandas python3 science scipy xgboost

Last synced: 05 May 2026

https://github.com/lin826/nanogpt-demo

Training and finetuning local GPTs.

gpt nanogpt numpy pytorch tqdm transformers

Last synced: 05 May 2026

https://github.com/burhanali2211/marksanalyzer

This is a simple CLI-based tool that helps analyze student marks by calculating key statistics such as the highest and lowest scores.

analysis analyzer arrays marks numpy

Last synced: 05 May 2026

https://github.com/iamwatchdogs/retail-insights-from-superstore-data

A simple data analysis project for finding insights in the Superstore dataset

conda-environment data-analytics jupyter-notebook matplotlib numpy pandas python

Last synced: 05 May 2026

https://github.com/sahu-sakshi/covid-19

A comprehensive data analysis project for COVID-19 pandemic data, featuring automated data collection, feature engineering, visualization tools, and exploratory analysis notebooks.

jupyter numpy pandas python3

Last synced: 05 May 2026

https://github.com/vc-nithesh944/pandas_numpy

Practice repository for NumPy & Pandas with scripts, notebooks, and notes for future revision

numpy pandas

Last synced: 05 May 2026

https://github.com/kbunwong/stock-price-prediction-using-lstm

This project is about predicting stock prices more accurately using an LSTM algorithm. For this project, we fetched real-time data from the yfinance library.

google-colab jupyter-notebook keras-tensorflow lstm-model machine-learning matplotlib numpy pandas python visualization yfinance

Last synced: 05 May 2026

https://github.com/elijahondiek/data-analysis-udacity

Udacity Data Analysis Nanodegree course resources

jupyter-notebook matplotlib numpy pandas python

Last synced: 05 May 2026

https://github.com/zafir100100/cancer-stage-prediction

This code predicts cancer data using various regression models, calculates their average R-squared scores, and prints the best model.

cross-validation data-analysis data-preprocessing decision-trees gradient-boosting linear-regression machine-learning-algorithms numpy pandas random-forest regression scikit-learn

Last synced: 05 May 2026

https://github.com/hitthecodelabs/petalanalyticsstreamlit

Web application developed with Streamlit that predicts the Iris flower type based on its physical features

matplotlib model numpy pickle python scikit-learn sklearn streamlit

Last synced: 05 May 2026

https://github.com/lohiyah/dataset_summariser

A Python tool for generating concise summaries of datasets using Pandas and NumPy for analysis, and Matplotlib and Seaborn for insightful visualizations. Ideal for quick exploratory data analysis (EDA).

matplotlib numpy pandas seaborn

Last synced: 05 May 2026

https://github.com/tapas-gope/diwali-sales-analysis

Analyze Diwali Sales Data to improve customer experience and sales.

exploratory-data-analysis matplotlib-pyplot numpy pandas python seaborn

Last synced: 05 May 2026

https://github.com/jeslipriya/expense-tracker

This is a smart expense tracking web app built with Flask that helps users manage and visualize their spending. It features Pandas and Numpy analysis to suggest ways to cut down on unnecessary expenses, with clean UI, graph visualizations, and secure login.

css flask-application full-stack html matplotlib numpy pandas

Last synced: 05 May 2026

https://github.com/chdl17/marvelvsdc

This GitHub repository contains Python and Tableau work for analyzing Marvel vs DC superheroes.

excel jupyter-notebook numpy pandas python tableau-public

Last synced: 05 May 2026

https://github.com/anushkundu/crime-pattern-analysis

Analyzing Crime Patterns in Montgomery County, USA: An Inclusive Study Based on NIBRS Data (2016-2022)

data-analysis data-visualization descriptive-statistics matplotlib numpy pandas python seaborn

Last synced: 05 May 2026

https://github.com/zuhairzia/customer-segmentation

Customer segmentation using KMeans clustering to analyze demographics, income, and spending. Helps businesses with targeted marketing and customer insights.

joblib matplotlib numpy pandas scikit-learn seaborn

Last synced: 05 May 2026

https://github.com/pyladiesams/python-oss-sprints-mar2024

Sprints to contribute to the most popular data and AI Python OSS libraries

conda dask geopandas jupyter matplotlib numpy pandas pymc sklearn

Last synced: 05 May 2026

https://github.com/rohra-mehak/sciencesync

System for processing personalized Google Scholar alerts, managing the resulting data, and providing ML-based clustering analysis

agglomerative-clustering clustering crossref-api customtkinter google-api google-scholar graph-api machine-learning numpy pandas python3 scientific-article-analysis scikit-learn sqlite3

Last synced: 05 May 2026

https://github.com/jonad/quora_data_analysis

Exploratory data analysis and feature engineering on Quora dataset.

exploratory-data-analysis feature-extraction numpy pandas python3 quora-dataset

Last synced: 05 May 2026

https://github.com/nandinimarepalli/ai_ml_internship_projects

Projects completed during my AI/ML and Data Expert internship, including EDA, machine learning models, and dashboard development using Python, pandas, scikit-learn, and visualization libraries.

matplotlib numpy pandas python scikit-learn seaborn

Last synced: 05 May 2026

https://github.com/ayushdutta-ai/face-recognition-model

A custom face recognition model built from scratch using Python, OpenCV, Pillow, and Face libraries, utilizing LBPH for efficient facial feature extraction and accurate identification.

face haar-cascade lbph-face-recognizer numpy opencv pillow

Last synced: 05 May 2026

https://github.com/tsungtsetu122/patternrecognition-gnn-node-classification

This project implements a Mixed Graph Neural Network (GNN) for semi-supervised multi-class node classification using the Facebook Large Page-Page Network Dataset. The goal of the project is to predict the class of each node (page) based on 128-dimensional feature vectors and the graph structure connecting the nodes.

evaluation-metrics gnn matplotlib networkx numpy optimization pandas python pytorch-geometric

Last synced: 05 May 2026

https://github.com/nkamilla/titanic-eda

Exploratory Data Analysis of the Titanic dataset using Python (Pandas, NumPy, Matplotlib). Includes data cleaning, visualizations, correlations, and key business insights.

data-analysis eda jupyter-notebook matplotlib numpy pandas python titanic-dataset

Last synced: 05 May 2026

https://github.com/nxhawk/mln

Machine learning is a branch of artificial intelligence (AI) and computer science which focuses on the use of data and algorithms to imitate the way that humans learn, gradually improving its accuracy.

decision-tree-classifier genetic-algorithm jupyter-notebook machine-learning-algorithms numpy pandas sklearn-library

Last synced: 06 May 2026

https://github.com/romanectatyana/fuel_norm

Practical data analytics case: calculating and optimizing fuel consumption norms for vehicles and heavy equipment. (Analytical project on determining fuel consumption norms for motor vehicles and special equipment.)

matplotlib numpy pandas plotly power-bi

Last synced: 06 May 2026

https://github.com/leavers/npblob

Transfer NumPy arrays between Python and JavaScript losslessly

array javascript matrix numpy python streaming typescript

Last synced: 06 May 2026

https://github.com/abdelrahman-amen/active_learning_with_different_query_strategies

This project explores the implementation of active learning techniques, focusing on various query strategies to optimize the selection of informative data points for model training. It aims to reduce the amount of labeled data required while improving model performance, especially in scenarios with limited labeled data.

activelearning cuda entropy kldivergence margin numpy python pyto uncertainty

Last synced: 06 May 2026

https://github.com/ianjure/simple-neural-network

Implementing a simple neural network using only NumPy.

deep-learning neural-network notebook numpy scratch-implementation

Last synced: 06 May 2026

https://github.com/frolov-andrey2405/mandelbrot-s-fractal

Mandelbrot's Fractal Constructions

numba numpy pygame python

Last synced: 06 May 2026

https://github.com/yashpaneliya/bank-loan-default-analysis

Analyze and understand the driving factors (or driver variables) behind loan default, i.e. the variables which are strong indicators of default.

data-analysis loan-default-analysis matplotlib numpy pandas python

Last synced: 06 May 2026

https://github.com/andrewsy1004/logistic-regression-spam-classifier

This project implements a spam email classifier using Logistic Regression.

numpy pandas scikit-learn

Last synced: 06 May 2026