An open API service indexing awesome lists of open source software.

NumPy

NumPy is an open source library for the Python programming language, adding support for large, multidimensional arrays, and matrices, along with a large collection of high-level mathematical functions to operate on these arrays.

https://github.com/prajapatsur/data-analytics-1

Wanted to revise all the basic concepts of the data cleaning, transformation and analysis, so worked on this project. Took a little bit help of AI tool to implement some fresh concepts.

matplotlib numpy pandas sklearn

Last synced: 17 Apr 2026

https://github.com/hariprasath-v/hackerearth_transunion-data-science-analytics-hiring-challenge_2022

Machine learning model to classify the credit score based on people bank details and credit related information.

catboost exploratory-data-analysis klib machine-learning matplotlib numpy optuna pandas python seaborn sklearn

Last synced: 10 Apr 2026

https://github.com/renan-siqueira/reinforcement-learning-cart-pole

This repository provides implementations of a Q-learning agent to balance a pole on a cart.

artificial-intelligence gym-environment machine-learning numpy openai-gym pygame python qlearning-algorithm reinforcement-learning

Last synced: 17 Apr 2026

https://github.com/orliluq/inmersion-datos-python

Desarrollar modelos de machine learning para predecir la probabilidad de incumplimiento crediticio de los clientes, utilizando diferentes algoritmos de clasificación (Regresión Logística, Árboles de Decisión, Random Forest, Naive Bayes).

colab-notebook numpy pandas python scikit-learn

Last synced: 02 Apr 2026

https://github.com/chandkund/customer-segmentation-using-k-means-clustering

Implemented K-Means Clustering to segment customers based on purchasing behavior, enabling targeted marketing strategies. Analyzed data, optimized clusters using the Elbow Method, and derived insights to enhance customer engagement and retention.

data-science kmeans-clustering machine-learning matplotlib numpy pandas python seaborn sklearn

Last synced: 10 Apr 2026

https://github.com/nickklos10/compressive-strenght-prediction

This project predicts concrete compressive strength using a neural network regression model built with Keras.

jupyter-notebook keras matplotlib numpy pandas python scikit-learn

Last synced: 10 Apr 2026

https://github.com/thekartikeyamishra/data-preprocessor

A Google Colab module for interactive data preprocessing. Handles missing values, categorical encoding (One-Hot, Label), and numerical scaling (Standard, MinMax). Outputs a cleaned dataset

ipywidgets numpy pandas python scikit-learn

Last synced: 08 May 2026

https://github.com/lmizner/codecademy_nba_trends

Investigate whether there is an association between two variables

chi2-contingency jupyter-notebook matplotlib numpy pandas pearson-correlation pyplot python seaborn

Last synced: 17 Apr 2026

https://github.com/hansalemaos/npfastsortcpp

Sorts int/float 4x faster than numpy by using parallel_sort from C++ (Windows only)

cpp fast numpy python sort

Last synced: 17 Apr 2026

https://github.com/leandro-bertoluzzi/parchis-probabilities

Small weekend project to play around with numpy and help me taking decissions while playing parchís

matplotlib numpy python

Last synced: 08 May 2026

https://github.com/smartpottech/smartpot-datagenerator

SmartPot-DataGenerator simulates environmental and plant data (temperature, humidity, luminosity, pH, TDS, soil moisture) for hydroponic garden management. This data is used for testing, analysis, and visualization in the SmartPot web portal, helping optimize plant care without actual hardware sensors.

data-model matplotlib numpy python simulated

Last synced: 17 Apr 2026

https://github.com/mugambi645/classify-song-genre-from-audio-data

Classify song genres using supervised learning models & PCA for dimensiality reduction

decision-tree-classifier logistic-regression numpy pandas pca-analysis

Last synced: 17 Apr 2026

https://github.com/rosieoh/emergency_dataanalysis

오픈데이터분석-응급의료체계 방안 정책 제안 데이터 분석

ipython matplotlib numpy pandas python scikit-learn scipy

Last synced: 04 Apr 2026

https://github.com/sunnyrao07/water-quality-analysis

A machine learning project that predicts water potability based on chemical and physical attributes, using models like Logistic Regression, Random Forest, and XGBoost.

data-cleaning label-encoding logistic-regression matplotlib model-evaluation numpy pandas pyhton random-forest sckiit-learn seaborn smote standard-scaler xgboost

Last synced: 08 May 2026

https://github.com/gregoritsch3/project_python_jobs_data_analysis

A multi-library Data Analysis Python project that draws useful insights from existing job data. It provides useful exploratory insight about job postings in the Data sector, such as the distribution of remote/non-remote jobs, jobs with/without healthcare, top paying and most demanded data roles, and more.

matplotlib numpy pandas python scipy seaborn

Last synced: 04 Apr 2026

https://github.com/khushi130404/gradient_realm

Gradient_Realm is a Python project exploring regression techniques and optimization methods like Regularization, Batch, Stochastic, and Mini-batch Gradient Descent. It uses scikit-learn and custom implementations for hands-on learning.

batch-gradient-descent mini-batch-gradient-descent numpy stochastic-gradient-descent

Last synced: 14 Jun 2026

https://github.com/daniel-brai/alx-t-x-udacity-data-analytics-nanodegree

A 3-month nanodegree going through the intricacies of data analysis with Python and its various libraries such Pandas, Numpy, matplotlib, seaborn and others

matplotlib numpy pandas python3

Last synced: 08 May 2026

https://github.com/lorenzorottigni/ml-houses

Machine Learning python bootcamp: linear regression on houses model

ipynb linear-regression machine-learning numpy pandas python scikit-learn seaborn

Last synced: 10 Apr 2026

https://github.com/prcharan592/social-media-sentiment-analysis

Social media sentiment analysis using tweets involves analyzing tweet data to determine public sentiment (positive, negative, or neutral) using natural language processing (NLP) and machine learning techniques.

data-visualization machine-learning matplotlib nlp nltk numpy pandas python3 sentiment-analysis spacy tweets

Last synced: 04 Apr 2026

https://github.com/gsaini/streamlit-ui-getting-started

Streamlit is an open-source Python framework for data scientists and AI/ML engineers to deliver dynamic data apps with only a few lines of code. Build and deploy powerful data apps in minutes. Let's get started!

codecov github-actions numpy pandas python streamlit unittest

Last synced: 14 Jun 2026

https://github.com/lexxai/goit_python_ds_hw_01

GoIT, Python Data Science, Homework number 1.

data-science google-colab jupyter-nootebook numpy numpy-arrays python

Last synced: 18 Apr 2026

https://github.com/charlenry/python_math_machine_learning

Mes notebooks de travaux pratiques sur Python, NumPy, SimPy, SciPy, Matplotlib, Plotly, Seaborn et les Mathématiques pour le Machine Learning

algebra derivatives functions jupyter linear matplotlib matrix numpy plotly probabilities pyplot python pytorch scipy seaborn sklearn statistics sympy tensorflow time

Last synced: 25 Jun 2026

https://github.com/arnoldchrisoduor1/potato_disease

Trained a CNN to detect the presence and the current stage of Blight disease in potatoes, WebApp in FastAPI and React

fastapi keras numpy react serving-tensors tensorflow

Last synced: 04 Apr 2026

https://github.com/mnitin-reddy/a-b-testing-and-regression-analysis-for-ad-performance-optimization

Analyzed the performance of Facebook and AdWords ads using A/B testing and regression analysis to identify trends, correlations, and cost-effectiveness. Key insights included distribution of clicks and conversions, monthly trends, and cost-per-conversion analysis to optimize ROI.

abtesting data-science hypothesis-testing machine-learning matplotlib numpy pandas scikit-learn scipy seaborn statsmodels

Last synced: 04 Apr 2026

https://github.com/yashsonaar/machine-learning-tasks

This repository has machine learning tasks which include classification, recommendation system, fraud detection system

classification jupyter-notebook machine-learning numpy pandas prediction python scikit-learn testing

Last synced: 04 Apr 2026

https://github.com/yasinefeee/parkspotter_pretest_enviroment

The ParkSpotter project is designed to detect the occupancy status of parking spots in a simulation environment. Using a toy model, a camera system, and a machine learning model, this system identifies whether a parking space is EMPTY or NOT EMPTY in real-time.

ai-systems classifier computer-vision cv2 numpy opencv parking-spot-detection parking-spots python sckit-image sckit-learn simulation-environment svm svm-classifier

Last synced: 18 Apr 2026

https://github.com/jsnevt/regressao_linear

Regressão linear simples - Qual a velocidade se levou x distância para parar ?

linear-regression matplotlib numpy pandas python

Last synced: 18 Apr 2026

https://github.com/nova-land/kalmanfilter-python

Kalman Filter in PyTorch and Numpy

kalman-filter numpy python pytorch

Last synced: 08 May 2026

https://github.com/stffns/snapvec

Fast compressed ANN search via randomized Hadamard transform + Lloyd-Max quantization. Pure NumPy.

ann embeddings hadamard numpy quantization rag vector-search

Last synced: 07 Apr 2026

https://github.com/oguzhansarigol/Expected-Goals-xG-Data-Analysis

We will analyze in our code the expected goal locations of football players based on which areas of the field, at what times, with which feet, using which parts of their bodies, and from which angles and distances they are most likely to score.

data-analysis-python expected-goals numpy pandas-python python

Last synced: 20 Aug 2025

https://github.com/ben854719/nvidia---gen-ai-

The goal of the project is to analyze and predict with the use of AI for the revenue of Nvidia

matplotlib numpy pandas python pytorch

Last synced: 05 Apr 2026

https://github.com/lexxai/goit_python_ds_hw_04

Модуль 4. Класифікація та оцінка роботи моделі. Лінійна регресія: перенавчання та регуляризація

lasso-regression linear-regression numpy pandas python red regression ridge-regression scikit-learn

Last synced: 05 Apr 2026

https://github.com/pnguyen215/pycelize

Pycelize is a Flask application designed for processing Excel and CSV files. It provides RESTful APIs for common data operations including extraction, normalization, mapping, SQL generation, and file binding.

excel lib numpy openpyxl pandas py-app pycelize python-service python3

Last synced: 17 Jun 2026

https://github.com/ahmaruff/kmeans-img-compress

Simple python image compression library using K-Means clustering

color-quantization image-compression image-processing kmeans kmeans-clustering numpy python

Last synced: 18 Apr 2026

https://github.com/rahuls-1106/dataspark

DataSpark is a powerful analytics project transforming raw retail data into actionable insights for Global Electronics. By leveraging Python, SQL, and interactive visualizations, it uncovers trends in customer behavior, sales performance, and product popularity, driving smarter business decisions and boosting growth.

data-visualization jupyter-notebook matplotlib numpy pandas-dataframe powerbi python seaborn sql

Last synced: 18 Apr 2026

https://github.com/md-emranhossen/deep-learning-hub

Welcome to Deep Learning Playground, a personal repository for exploring and experimenting with deep learning concepts, architectures, and techniques. This repo serves as a hands-on space for implementing and understanding deep learning models, from fundamental neural networks to advanced architectures.

cnn-model deep-learning deeplearning gan generative-adversarial-network neaural-network numpy pandas pthon tensorflow

Last synced: 18 Apr 2026

https://github.com/sanchariii/health-assist-1

Health Assist web app uses the Streamlit framework to help users identify potential diseases. Simply input your symptoms, and the system will predict whether you have diabetes, heart disease, or Parkinson's disease. The system is designed to be quick and convenient, promoting early detection and timely medical intervention.

numpy pandas pickle streamlit-webapp svm-model

Last synced: 13 Aug 2025

https://github.com/autokmg/ai-foundations

Practice exercises related to the Foundations of Artificial Intelligence university course. The code examples are designed to help students gain a deeper understanding of these topics and to prepare for exams and assignments. The repository is open to contributions from other students.

numpy python

Last synced: 18 Apr 2026

https://github.com/shaulamarquez/data-job-market

This is a Data Job Market analysis

analysis matplotlib numpy python seaborn

Last synced: 09 May 2026

https://github.com/codetun/wie-act-challenge-ai

Our AI model, integrated with the Defend Her extension, uses advanced natural language processing (NLP) to predict and identify offensive or harmful language in real-time 🧠💬🚫. By continuously analyzing content, it ensures accurate detection of bad words, enhancing the extension's ability to protect users and promote a safer online environment 🌐

colab csv jupiter-notebook kaggle nltk numpy pandas

Last synced: 19 Apr 2026

https://github.com/costopoulos/ntua-dsp

:signal_strength: NTUA ECE Digital Signal Processing Course Source Codes and Reports

dsp filters fourier-transform librosa numpy pywt scipy short-time-signal-analysis stft

Last synced: 19 Apr 2026

https://github.com/mariacarolinass/programacao-linear

Minimizando problemas matemáticos com Python e AMPL.

ampl numpy programacao-linear ufrn

Last synced: 19 Apr 2026

https://github.com/crazy-dot/zomato-data-analysis

This project analyzes 50k Bengaluru restaurants from Zomato, focusing on 17 features like location and ratings. It cleans, explores, and visualizes data to improve services. Key visualizations include delivery, booking, location, and cost. The goal is to provide insights for better customer experiences.

data-cleaning-and-preprocessing data-manipulation-with-pandas inferential-statistics kaggle-dataset numpy pandas-python python zomato-data-analysis

Last synced: 19 Apr 2026

https://github.com/micmurawski/cloud-array

cloud-array is an open-source Python library for storing and streaming large Numpy Arrays on local file systems and major cloud proviers CDNs.

aws azure big-data bigarray blob-storage cloud data-structures digitalocean-spaces gcp gcp-cloud-storage ibm-cloud-object-storage numpy s3 stream-processing streaming zadara

Last synced: 12 Aug 2025

https://github.com/paulo-santos-ds/previsao_preco_carros

O objetivo é realizar predições em um aplicativo de vendas de carros. A qualidade e velocidade de predições é crucial para o negócio. O projeto utiliza dados históricos, especificações técnicas, versões de acabamento e preços.

lightgbm math matplotlib numpy pandas python time

Last synced: 20 Apr 2026

https://github.com/csengupta1101/career-survey---data-scientist

This repository holds an in depth analysis of Scrapped data from naukri.com. which showcases in in demand skills and opportunities in Data science across the subcontinent.

analysis career data-science matplotlib naukri numpy pandas python3

Last synced: 07 May 2026

https://github.com/muhammadmoeezkhan/euclidean-distances

Python Program That Calculates Euclidean Distances Via 5 Algorithms

numpy pandas-python scipy timeit

Last synced: 20 Apr 2026

https://github.com/who-else-but-arjun/pyspyce-cs

Circuit Simulator for R, L, C components and AC/DC voltage and current sources built using python as a part of EE204 circuit theory course project.

circuit-simulator numpy python streamlit sympy

Last synced: 20 Apr 2026

https://github.com/aymen016/leetcode-problems

This repo is designed to enhance your problem-solving skills through daily coding challenges, complete with clear explanations and well-structured solutions. 🎯

coding decision-making leetcode-python leetcode-solutions numpy problem-solving python

Last synced: 09 May 2026

https://github.com/sameerjain/ai-snake-game

Implemented a Q-learning agent to play and learn the classic Snake game, leveraging PyTorch for Neural Network. Visualized using Matplotlib, showcasing agent's evolving performance over time. Built UI using Pygame to visualize Snake, including food, and current score.

anaconda conda matplotlib neural-network numpy pygame python3 pytorch q-learning

Last synced: 09 May 2026

https://github.com/prashhhant213/data_analysis_and_visualization-_for_streaming_platform

Data Analysis and Visualization for streaming platform to provide insights and recommendations to improve their userbase.

colab-notebook data datavisualization matplotlib numpy pandas python seaborn

Last synced: 20 Apr 2026

https://github.com/rphadtare/python_by_example

This repository consist sample programs to learn and understand core concepts

duckdb numpy pandas python-os python3

Last synced: 27 Jun 2026

https://github.com/dino-kupinic/pocolocosolver

algorithmic solution with 3D visualization

algorithms numpy panda3d puzzle pytest python solver

Last synced: 19 Jun 2026

https://github.com/carmoreno/aviones-aldeas

:airplane: Final project o Complexity and Optimisation course :house_with_garden:

algorithms numpy pyqt4 scipy

Last synced: 21 Apr 2026

https://github.com/rtmigo/skifts_py

Search for the most relevant documents containing words from a query. Uses Scikit-learn and Numpy

cosine-similarity information-retrieval numpy python scikit-learn text-mining tf-idf

Last synced: 19 Aug 2025

https://github.com/jayemscript/lab-to-code

A complete Python learning roadmap for scientists and researchers — covering data science, biology, chemistry, physics, and mathematics with curated libraries, tools, and resources.

bioinformatics chemistry data-science jupyter-notebook machine-learning mathematics numpy pandas physics python research roadmap scientific-computing scikit-learn

Last synced: 19 Jun 2026

https://github.com/tonoy30/emotion-mask

A pytorch model for add face mask based on detected emotion.

emotion-detection numpy opencv-python python pytorch

Last synced: 09 May 2026

https://github.com/jbayardo/aa-tp2

Q-Learning for Four in a row

matplotlib numpy python q-learning

Last synced: 24 Apr 2026

https://github.com/pawpaw2022/proj03-math-painting

An app that lets the user provide the start coordinates of geometrical shapes such as squares and rectangles, their dimensions, and their colors, and the program produces an image file canvas with all the geometrical shapes drawn in it. (Click the link below to see the demo)

numpy pillow python

Last synced: 24 Apr 2026

https://github.com/nightmachinery/mlp_from_scratch

A simple MLP using only numpy. The computational graph is not automatically tracked, but the network supports arbitrarily many batch axes.

deep-learning deep-neural-networks deeplearning educational mlp mlp-classifier mlp-networks numpy

Last synced: 25 Apr 2026

https://github.com/jagkagd/np-xarr

Perform a numpy array transformation intuitively by giving simple patterns.

numpy python

Last synced: 25 Apr 2026

https://github.com/harshavardhanbommalata/knn_tutorial

K-Nearest Neighbors (KNN) is a simple yet powerful machine learning algorithm. Unlike models that learn parameters during training, KNN uses lazy learning—it stores the dataset and predicts by finding the closest neighbors, making decisions through majority voting.

machine-learning-algorithms mathematical-modelling matplotlib numpy pandas

Last synced: 20 Jun 2026

https://github.com/abhisingam/brickbreaker

A Terminal-Based game made in Python that emphasises upon the use of OOPs concepts. Python | NumPy | Game Development

game-development numpy python3

Last synced: 26 Apr 2026

https://github.com/hinanohart/dostosim

Dostoevskian agent-based simulation: seven literary archetypes as a unified reward decomposition. Anti-optimisation, dignity-as-utility, ideological cascade, gambler-ruin — pre-registered H1-H4 with bootstrap CI / BH-FDR / Cliff's delta.

abm agent-based-modeling behavioral-economics bootstrap-ci computational-social-science dostoevsky literature networkx numpy python reproducible-research

Last synced: 27 Apr 2026

https://github.com/huggon1/ml-algorithm-implementations

Educational implementations for ML, DL, LLM blocks, ViT, and CUDA.

cuda machine-learning numpy pytorch vision-transformer

Last synced: 28 Jun 2026

https://github.com/ydon96/versions-unveiled-the-evolution-of-nsa

Detection of Alzheimer's disease through the implementation of improved NSA algorithms

nsa numpy pandas python3 sklearn

Last synced: 27 Apr 2026

https://github.com/saptarshi11/diamlp---diabetes-classification-with-multilayer-perceptron

Diabetes Classification using Multilayer Perceptron (MLP) is a binary classification problem where the goal is to predict whether a patient has diabetes (positive class) or not (negative class) based on a set of biomedical features or risk factors.

artificial-neural-networks binaryclassification machine-learning mlp-classifier numpy seaborn

Last synced: 27 Apr 2026

https://github.com/lesterevse/numvision

Artificial Intelligence for recognizing numbers

artificial-intelligence numpy

Last synced: 27 Apr 2026

https://github.com/moonshallow5/food_vision_mini

A program which can detect if an image contains either a pizza, sushi or steak: Has now been deployed on Flutter :)

huggingface matplotlib numpy pytorch

Last synced: 09 May 2026

https://github.com/harmanveer-2546/covid-19-chances-detection

The main goal of this project is to build a machine learning model that, given a Covid-19 patient's current symptom, status, and medical history, will predict whether the patient is in high risk or not.

covid-19 covid-data logistic-regression matplotlib mlp-classifier numpy pandas python random-forest seaborn sklearn-library xgboost

Last synced: 10 Apr 2026

https://github.com/vasukalariya/neural-network-from-scratch

It's a simple Feedforward Neural Network implemented just using Numpy

artificial-neural-networks classification mnist neural-network numpy

Last synced: 28 Apr 2026

https://github.com/ronverse17/automate-data-cleaning

This project automates messy data cleaning tasks - like fixing column names, filling missing values, and spotting outliers, so analysts and data scientists can spend more time on insights, not preprocessing.

numpy pandas pipeline python

Last synced: 28 Apr 2026

https://github.com/dwade-eng/amazon-product-recommender-prototype-

This project is a content-based product recommendation engine inspired by Amazon's "Customers who viewed this item also viewed" feature. It uses a dataset of product metadata and user interactions to suggest similar items based on product titles, brands, and categories using TF-IDF vectorization and cosine similarity.

html numpy pandas python3 scikit-learn

Last synced: 28 Apr 2026