An open API service indexing awesome lists of open source software.

NumPy

NumPy is an open source library for the Python programming language, adding support for large, multidimensional arrays, and matrices, along with a large collection of high-level mathematical functions to operate on these arrays.

https://github.com/abideen-olawuwo/gender-detection

Gender Detection app

cv cv2 numpy

Last synced: 14 May 2026

https://github.com/khushi130404/gradient_realm

Gradient_Realm is a Python project exploring regression techniques and optimization methods like Regularization, Batch, Stochastic, and Mini-batch Gradient Descent. It uses scikit-learn and custom implementations for hands-on learning.

batch-gradient-descent mini-batch-gradient-descent numpy stochastic-gradient-descent

Last synced: 14 Jun 2026

https://github.com/dino-kupinic/pocolocosolver

algorithmic solution with 3D visualization

algorithms numpy panda3d puzzle pytest python solver

Last synced: 19 Jun 2026

https://github.com/harshavardhanbommalata/knn_tutorial

K-Nearest Neighbors (KNN) is a simple yet powerful machine learning algorithm. Unlike models that learn parameters during training, KNN uses lazy learning—it stores the dataset and predicts by finding the closest neighbors, making decisions through majority voting.

machine-learning-algorithms mathematical-modelling matplotlib numpy pandas

Last synced: 20 Jun 2026

https://github.com/takana671/fluidcube

Particle Movement Using Curl Noise

curl-noise cython numpy panda3d python

Last synced: 22 Jun 2026

https://github.com/gerardomacedo/ml-from-scratch

Logistic regression from scratch (NumPy) with tests and CLI; compared vs scikit-learn

ci-cd from-scratch logistic-regresion machine-learning numpy python unittest

Last synced: 23 Jun 2026

https://github.com/no0bitah/pdf-highlight-extractor

A Python tool for extracting highlighted text from PDF files while preserving formatting attributes (headers, bold, italic) and removing unwanted line breaks and page breaks. Perfect for integrating with content management systems.

automation crm documentation-tool numpy opencv pdf pdf-document-processor pillow pymupdf pypdfium2 python3 scrapping

Last synced: 23 Jun 2026

https://github.com/swarup-ranjan-mondal/chat-analyzer

A streamlit app to analyze whatsapp personal and group chats.

matplotlib numpy pandas python seaborn streamlit

Last synced: 04 May 2026

https://github.com/charlenry/python_data_science

Mes notebooks de travaux pratiques sur Python pour la Data Science

analysis data dataframe jupyter kaggle matplotlib notebook numpy pandas pyplot python science seaborn visualisation

Last synced: 25 Jun 2026

https://github.com/jakubfr4czek/apartment-prices-analysis

This repository contains an analysis of apartments prices in Krakow. It was made for "Rachunek prawdopodobieństwa i statystyka" (probability and statistics) course i took at AGH University of Science.

agh agh-wi analysis csv jupyter jupyter-notebook krakow numpy pandas price-analysis price-prediction probability probability-statistics python statistics statistics-learning

Last synced: 04 May 2026

https://github.com/abhinav330/911-emergency-calls-analysis

This Python Notebook analyzes emergency call data from the '911.csv' dataset. It uses various data visualization techniques to explore and gain insights into the emergency call data, including the types of calls, reasons for calls, and call patterns over time.

data-analysis data-science data-visualization eda exploratory-data-analysis exploratory-data-visualizations numpy pandas python

Last synced: 09 Jun 2026

https://github.com/youssefyaser/scrape-the-imdb-site-for-the-top-250-movies

Web scraping the top 250 movies in IMDB site.

data-analysis numpy pandas python

Last synced: 04 May 2026

https://github.com/hanzopgp/randomkagglesolving

Trying to build models to resolve some interesting Kaggle datasets thanks to deep learning.

artificial-intelligence deep-learning jupyter-notebook machine-learning matplotlib numpy personal tensorflow

Last synced: 04 May 2026

https://github.com/abhishekyadav915/e-commerce-sales-analysis

E-Commerce Sales Analysis is a data analysis project that explores sales data from an e-commerce platform to uncover insights and trends. The analysis includes visualizing sales performance, customer behavior, and product trends to help optimize business strategies and improve customer satisfaction.

matplotlib-pyplot numpy pandas-library plotly python3

Last synced: 04 May 2026

https://github.com/aqueeqazam/machine-learning-using-scikit

This repository contains all of the algorithms used to train the machine learning models using the Scikit library.

numpy scikit-learn

Last synced: 04 May 2026

https://github.com/pabs-code/img-cartoonizer-using-opencv

A streamline app using 3 ways to cartoonized an image using OpenCV and Python.

bilateral-filtering color-quantization edge-detection edge-enhancement laplacian-edge-detection numpy opencv python

Last synced: 04 May 2026

https://github.com/asuquoaa/applied_data_science_with_python_study

Notes, codes and practicals on the Applied Data Science with Python specialization by UM

matplotlib numpy pandas seaborn

Last synced: 04 May 2026

https://github.com/dmkk01/mlp-python

Implementation of a multilayer perceptron using Pytorch and Numpy libraries

mlp numpy pytorch

Last synced: 05 May 2026

https://github.com/srevenant/data-science-alpine

A docker container for data science, using alpine linux and python3

alpine data numpy pandas python3 science scipy xgboost

Last synced: 05 May 2026

https://github.com/lorenzorottigni/ml-missing-values

Machine Learning course of Piero Savastano 5: ColumnTransformer, SimpleImputer, numpy

column-transformer machine-learning missing-values numpy simple-imputer

Last synced: 05 May 2026

https://github.com/lin826/nanogpt-demo

Training and finetuning local GPTs.

gpt nanogpt numpy pytorch tqdm transformers

Last synced: 05 May 2026

https://github.com/vladd12/machine-learning-practice

Репозиторий для учебной практики и различных курсов, связанных с ИИ

json keras-tensorflow machine-learning numpy pandas pickle tensorflow2 torch xml

Last synced: 05 May 2026

https://github.com/bhaskarkulshrestha/rock-vs-mine-prediction

A machine learning project to predict whether an object detected underwater is a rock or a mine using Logistic Regression.

colab-notebook logistic-regression machine-learning numpy pandas python sckit-learn supervised-learning

Last synced: 05 May 2026

https://github.com/burhanali2211/marksanalyzer

This is a simple CLI-based tool that helps analyze student marks by calculating key statistics such as the highest and lowest scores.

analysis analyzer arrays marks numpy

Last synced: 05 May 2026

https://github.com/iamwatchdogs/retail-insights-from-superstore-data

An simple Data Analysis Project for finding insight within the Super store data set

conda-environment data-analytics jupyter-notebook matplotlib numpy pandas python

Last synced: 05 May 2026

https://github.com/vc-nithesh944/pandas_numpy

Practice repository for NumPy & Pandas with scripts, notebooks, and notes for future revision

numpy pandas

Last synced: 05 May 2026

https://github.com/badranalyst/residential-unit-prices-data-analysis-application

Python-based analysis of residential unit prices, focusing on data cleaning, visualization, and exploratory data analysis (EDA). Key features include price distribution, and correlation analysis between factors like size, location, and pricing.

data-analysis data-visualization dataset matplotlib numpy pandas python seaborn

Last synced: 05 May 2026

https://github.com/nabilshadman/multiprocessing-time-series-data-simulation

A simulation using multiprocessing to generate 10,000 time dependent samples from an initial dataset of 20 samples

multiprocessing numpy pandas scipy simulation statistics time-series

Last synced: 05 May 2026

https://github.com/thomasa/python-intro

This is a Jupyter notebook slide show for a 2-hour crash intro to Python, NumPy, Matplotlib, and SciPy

matplotlib numpy python scipy tutorial

Last synced: 05 May 2026

https://github.com/13anush/python-libraries-

A collection of essential Python libraries—NumPy, Pandas, Matplotlib, and Seaborn—perfect for anyone starting out in data analysis.

data-analysis matplotlib numpy pandas python seaborn

Last synced: 05 May 2026

https://github.com/zafir100100/cancer-stage-prediction

This code predicts cancer data using various regression models, calculates their average R-squared scores, and prints the best model.

cross-validation data-analysis data-preprocessing decision-trees gradient-boosting linear-regression machine-learning-algorithms numpy pandas random-forest regression scikit-learn

Last synced: 05 May 2026

https://github.com/hitthecodelabs/petalanalyticsstreamlit

Web application developed with Streamlit that predicts the Iris flower type based on its physical features

matplotlib model numpy pickle python scikit-learn sklearn streamlit

Last synced: 05 May 2026

https://github.com/cicku/en.650.672

HW of EN.650.672

analytics data-analysis numpy pandas

Last synced: 05 May 2026

https://github.com/rayan-alam-uoit/space-x-launch-prediction-tool

A full-stack data science project analyzing SpaceX launch data using web scraping, REST APIs, SQL, interactive visualizations, and machine learning in Python to explore trends and predict launch success.

dash data-science folium jupyter-notebook machine-learning matplotlib numpy pandas plotly python seaborn sql sqlite

Last synced: 05 May 2026

https://github.com/jeslipriya/expense-tracker

This is a smart expense tracking web app built with Flask that helps users manage and visualize their spending. It features Pandas and Numpy analysis to suggest ways to cut down on unnecessary expenses, with clean UI, graph visualizations, and secure login.

css flask-application full-stack html matplotlib numpy pandas

Last synced: 05 May 2026

https://github.com/himanshigautam-rgb/python_libraries

This repository contains operations in Matplotlib for data visualization, NumPy for numerical computations, and Pandas for data manipulation and analysis. These libraries work together to efficiently process, analyze, and visualize data.

matplotlib numpy pandas python3

Last synced: 05 May 2026

https://github.com/batrick-swaistan/nalaiya_thiran

Nalaiya Thiran IBM Course Training

matplotlib numpy pandas-dataframe python

Last synced: 05 May 2026

https://github.com/theadeelahmed/mall-customers-segmention

This project segments mall customers based on income, age, and shopping score. Using clustering techniques, it identifies key customer groups for targeted marketing campaigns. Tools used: Pandas, Matplotlib, Seaborn, and Scikit-learn

matplotlib-pyplot numpy pandas python3 seaborn sklearn-clustering

Last synced: 05 May 2026

https://github.com/rohra-mehak/sciencesync

System for Personalized Google Scholar Alerts Processing and Data Management, and provision of ML based clustering analysis

agglomerative-clustering clustering crossref-api customtkinter google-api google-scholar graph-api machine-learning numpy pandas python3 scientific-article-analysis scikit-learn sqlite3

Last synced: 05 May 2026

https://github.com/aathira2001/python-basics

Basic Python concepts with examples

functions numpy pandas python variables

Last synced: 05 May 2026

https://github.com/jonad/quora_data_analysis

Exploratory data analysis and feature engineering on Quora dataset.

exploratory-data-analysis feature-extraction numpy pandas python3 quora-dataset

Last synced: 05 May 2026

https://github.com/abdelrahman-amen/neural_style_transfer

This project applies neural style transfer to create new images by merging the content of one image with the artistic style of another.

neuralstyletransfer numpy os python streamlit tensorflow

Last synced: 05 May 2026

https://github.com/sahaninilesh/future_ds_01_social-media-sentiments-analysis

The Social Media Sentiments Analysis Dataset captures a vibrant tapestry of emotions, trends, and interactions across various social media platforms. This dataset provides a snapshot of user-generated content, encompassing text, timestamps, hashtags, countries, likes, and retweets. Each entry unveils unique stories—moments of surprise, excitement,

jupyter matplotlib numpy pandas python

Last synced: 05 May 2026

https://github.com/amruta33/lead_scoring_case_study

An education company named X Education sells online courses to industry professionals. On any given day, many professionals who are interested in the courses land on their website and browse for courses.

bagging linear-regression numpy pandas python3

Last synced: 05 May 2026

https://github.com/waghraj1699/fraud-detection-project

A fraud detection system with machine learning will be able to detect risk based on your historical data

data-processing decision-tree-classifier machine-learning numpy pandas

Last synced: 05 May 2026

https://github.com/kalwalt/pybind-test

A simple example with pybind and numpy with C++ code

cplusplus numpy pybind11 python

Last synced: 05 May 2026

https://github.com/nxhawk/mln

Machine learning is a branch of artificial intelligence (AI) and computer science which focuses on the use of data and algorithms to imitate the way that humans learn, gradually improving its accuracy.

decision-tree-classifier genetic-algorithm jupyter-notebook machine-learning-algorithms numpy pandas sklearn-library

Last synced: 06 May 2026

https://github.com/romanectatyana/fuel_norm

Practical data analytics case: calculating and optimizing fuel consumption norms for vehicles and heavy equipment. (Аналітичний проєкт із визначення норм витрат палива для автотранспорту та спецтехніки.)

matplotlib numpy pandas plotly power-bi

Last synced: 06 May 2026

https://github.com/leavers/npblob

Transfer numpy array between Python and JavaScript losslessly

array javascript matrix numpy python streaming typescript

Last synced: 06 May 2026

https://github.com/shwetapardhi/assignment-1-q9_b--basic-statistics-level-1

Q9) Calculate Skewness, Kurtosis & draw inferences on the following data SP and Weight(WT) Use Q9_b.csv

kurtosis matplotlib-pyplot numpy pandas python seaborn skewness

Last synced: 06 May 2026

https://github.com/kashinathbiradar/bangalore-housing-price-prediction

The objective of the project is to create a machine learning model. We are doing a supervised learning and our aim is to do predictive analysis to predict housing price.

css deta doc html machine-learning matplotlib numpy pandas preprocessing seaborn sklearn sql

Last synced: 06 May 2026

https://github.com/abdelrahman-amen/active_learning_with_different_query_strategies

This project explores the implementation of active learning techniques, focusing on various query strategies to optimize the selection of informative data points for model training. It aims to reduce the amount of labeled data required while improving model performance, especially in scenarios with limited labeled data.

activelearning cuda entropy kldivergence margin numpy python pyto uncertainty

Last synced: 06 May 2026

https://github.com/frolov-andrey2405/mandelbrot-s-fractal

Mandelbrot's Fractal Constructions

numba numpy pygame python

Last synced: 06 May 2026

https://github.com/billgewrgoulas/recommendation-systems

Algorithms for joke rating prediction using the joke data-set from Kaggle.

algorithm clustering collaborative-filtering machine-learning numpy pandas recommender-system scikit-learn scypi

Last synced: 06 May 2026

https://github.com/mat-kubiak/manual-ml

DIY ML framework (using only NumPy!) + various demos

artificial-intelligence examples machine-learning ml ml-framework neural-networks numpy

Last synced: 06 May 2026

https://github.com/sankaran-s2001/us-traffic-accidents-analysis-python-eda

Exploratory data analysis of US traffic accidents from 2016-2023, analyzing patterns by time, location, weather, and severity using Python data science libraries.

data-analysis data-science data-visualization eda matplolib numpy pandas python

Last synced: 06 May 2026

https://github.com/erick957/saleprice-prediction-dataset-analysis-and-cleaning-advance-regression

🏠 Predict house prices using advanced regression techniques with this comprehensive analysis and cleaning project, from data loading to model deployment.

data-analysis data-science eda google-colab machine-learning numpy pandas python scikit-learn scikit-learn-python

Last synced: 06 May 2026

https://github.com/andrewsy1004/logistic-regression-spam-classifier

This project implements a spam email classifier using Logistic Regression.

numpy pandas scikit-learn

Last synced: 06 May 2026

https://github.com/abhinav330/customer-behavior-analysis-linear-regression

This repository explores customer behavior data for an NYC clothing company with both a mobile app and website. They want to understand which platform drives higher sales.

data-analysis data-science data-visualization eda exploratory-data-analysis jupyter jupyter-notebook linear-regression machine-learning machine-learning-algorithms machinelearning-python numpy pandas python regression-analysis

Last synced: 06 May 2026

https://github.com/mikma03/datascience_python_datacamp

DataScience with Python. Code and examples. Python libraries, including pandas, NumPy, Matplotlib, and many more.

data-analysis data-science datacamp datascience numpy pandas python

Last synced: 06 May 2026

https://github.com/kunalpisolkar24/gradientdescent

A Python implementation of gradient descent for linear regression, visualized with cost function history, parameter trajectory, and a 3D cost surface.

cost-function gradient-descent numpy visualization

Last synced: 06 May 2026

https://github.com/coder5omkar/lendingclubcasestudy

Performed exploratory data analysis (EDA) on the loan dataset from the Lending Club Case Study to identify the key factors influencing loan defaults.

bivariate-analysis data-science exploratory-data-analysis matplotlib numpy pandas seaborn univariate-analysis

Last synced: 06 May 2026

https://github.com/deanlogan/data-analysis-course

Code created when completing the Data Analysis with Python Course on freecodecamp.org

course data-analysis numpy pandas python python3

Last synced: 06 May 2026

https://github.com/paulo-santos-ds/llm

Um aplicativo web simples de tradução que converte texto do português para o inglês, construído com Streamlit e Google Translator

deeptranslate googletranslate numpy pyngrok python streamlit

Last synced: 06 May 2026

https://github.com/cycle-sync-ai/student-score-analysis

A data-driven student performance analysis project using UCI dataset (396 students, 33 features). Implements machine learning models (K-means, PCA, Decision Tree, Random Forest, Linear Regression) to analyze academic patterns and predict student scores based on lifestyle, health, and study habits.

clustering clustering-algorithm decision-trees feature-engineering learning-management-system linear-regression machine-learning machine-learning-algorithms matplotlib numpy pandas pca pickle prediction prediction-algorithm scikit-learn score seaborn student

Last synced: 06 May 2026

https://github.com/elmezianech/classifyreviews_nlp

Revolutionize customer feedback analysis with our NLP Insights Analyzer. Utilize cutting-edge text preprocessing to transform raw reviews into a machine-friendly format. Explore sentiment models, such as Logistic Regression and Naive Bayes, employing cross-validation for model robustness.

accuracy-score ai countvectorizer cross-validation joblib logistic-regression machine-learning ml multinomialnb naive-bayes-classifier nltk-library numpy pandas randomforestclassifier svc

Last synced: 06 May 2026

https://github.com/fbi555/fashion-mnist-cnn-tf

🧵 Classify fashion images using a compact CNN model in TensorFlow/Keras, designed for the Fashion-MNIST dataset with easy execution in Google Colab.

adam classification cnn colab-notebook computer-vision convolutional-neural-networks custom-training-loop deep-learning fashion-mnist gans image-classification interactive jupyter-notebook keras lowlevelapi mnist-dataset numpy regularization

Last synced: 06 May 2026

https://github.com/christiansassi/signal-image-and-video-project

Project developed by Pietro Bologna (@bolognapietro) and Christian Sassi for the Signal, Image and Video course.

image-processing mri-segmentation numpy opencv python

Last synced: 06 May 2026

https://github.com/williyam-m/company-registration-trends

Utilized Linear Regression from scikit-learn to predict future company registration trends.

flask matplotlib numpy pandas-python scikit-learn

Last synced: 06 May 2026

https://github.com/kavayk29/text-summarization-fine-tuning-pegasus-model

This project uses the Pegasus model to generate concise, coherent text summaries. Fine-tuned on large datasets, it excels in abstractive summarization, ideal for automating tasks like summarizing articles and research papers.

datasets huggingface-transformers matplotlib nltk numpy pandas teansform tqdm

Last synced: 06 May 2026

https://github.com/hansalemaos/cv2_collage

Create a collage from images with OpenCV

collage numpy opencv python

Last synced: 06 May 2026

https://github.com/sahilmate/gesturepainter

Virtual Painting Application

gesturepainter mediapi numpy opencv virtualpaint

Last synced: 06 May 2026

https://github.com/fatihilhan42/lane-line-detection-with-python

In this project you will detect lane lines in images using Python and OpenCV. OpenCV means "Open-Source Computer Vision", which is a package that has many useful tools for analyzing images.

image-processing line-detection numpy opencv python

Last synced: 06 May 2026

https://github.com/jayanthpotluri5513/heart-disease-prediction-using-ml

This project aims to develop a machine learning model for heart disease prediction, leveraging the potential of advanced algorithms and a comprehensive dataset.

machine-learning matplotlib numpy pandas python

Last synced: 06 May 2026

https://github.com/jai0212/course-selector-uoft

A program that suggests university courses based on your program and provides a custom GUI to customize your course plan. Created to help UofT students

beautifulsoup csv graphs graphs-algorithms networkx-graph numpy os parser pickle pillow plotly python tkinter-gui tkmacosx

Last synced: 06 May 2026

https://github.com/harmanveer-2546/student-performance-in-exam

Student performance analysis and prediction using datasets has become an essential component of modern education systems. With the increasing availability of data on student, schools and universities are using advanced analytics and machine learning algorithms to gain insights into student performance and predict future outcomes.

classification-report cluster dbscan dimensionality-reduction k-means-clustering labelling linear-regression logistic-regression numpy pandas pre-processing random-forest-classifier test-train-split unsupervised-machine-learning visualization

Last synced: 06 May 2026

https://github.com/jcbritobr/iris-python

Simple data analysis using iris dataset, pandas, numpy, matplotlib and seaborn.

data-science matplotlib numpy pandas python seaborn

Last synced: 06 May 2026

https://github.com/jonolav95/advent_of_code_2022

Using Python with Pandas and Numpy (www.adventofcode.com).

numpy pandas

Last synced: 06 May 2026

https://github.com/aleksibovellan/ai-2048-game

AI Heuristics in Python for solving the 2048 Game

2048 2048-game 2048-solver ai expectimax game heuristics numba numpy python python3

Last synced: 06 May 2026

https://github.com/kianaabrisham/svm-from-scratch

Linear SVM from scratch with hinge loss + decision boundaries

classification from-scratch fundamentals hinge-loss numpy optimization scikit-learn svm

Last synced: 07 May 2026

https://github.com/voorhs/ml-practice

ML algorithms implementation

aimasters lomonosov-msu machine-learning numpy python

Last synced: 07 May 2026