An open API service indexing awesome lists of open source software.

NumPy

NumPy is an open source library for the Python programming language, adding support for large, multidimensional arrays and matrices, along with a large collection of high-level mathematical functions to operate on these arrays.

https://github.com/abhiram-epuru/ai-powered-movie-recommender

A Movie Recommendation System using the MovieLens dataset, leveraging collaborative filtering techniques such as KNN (k-Nearest Neighbors) and Singular Value Decomposition (SVD). The project demonstrates a personalized recommendation model capable of suggesting movies based on user preferences and interaction data.

matplotlib numpy pandas python sckiit-learn seaborn

Last synced: 30 Apr 2026

https://github.com/beyzabasarir/spotify-youtube-eda

Exploratory Data Analysis of global songs' performance metrics on Spotify and YouTube with Python, using Jupyter Notebook.

eda exploratory-data-analysis jupyter-notebook matplotlib numpy pandas python seaborn

Last synced: 30 Apr 2026

https://github.com/maximlevchenko/python-course-convolution-orb-detector-data-processing-2d-game

This repository contains a collection of projects developed during a university Python course. Each project explores different aspects of data processing, analysis, and computer vision.

2dgame bomberman-game computer-vision convolution harris-corner-detector image-processing keypoint-detection numpy pandas python statistical-analysis

Last synced: 30 Apr 2026

https://github.com/mogankumar/bird-flock-simulation-with-pyspark

A PySpark-based bird flock simulation modeling alignment, cohesion, and separation forces with a leader guiding the flock. Efficiently handles large-scale simulations and generates a visualized GIF of the flock’s movement.

argparse matplotlib numpy pyspark python3

Last synced: 01 May 2026

https://github.com/sanjurajveer/market_basket_analysis

Finding patterns in purchased items through market basket analysis

apriori-algorithm association-rules numpy pandas python seaborn

Last synced: 09 May 2026

https://github.com/niteshchawla/netflix-case-study

Analyze the data and generate insights that could help Netflix in deciding which types of shows/movies to produce and how it can grow the business in different countries

matplotlib numpy pandas python seaborn

Last synced: 09 Apr 2026

https://github.com/hghyhghy/notebook

Repo consists of NumPy, Matplotlib, and PyMongo aggregation

jupyter-notebook matplotlib numpy pymongo python-3

Last synced: 01 May 2026

https://github.com/ahmedsamir45/demograghic-data-analyzer

Answers some questions about the data

demographic-data-analyzer numpy pandas

Last synced: 01 May 2026

https://github.com/mohhasan1/machine-learning

This repository is where I learn and store information about various machine learning libraries, such as Pandas, scikit-learn, and others. It serves as a personal reference for exploring and experimenting with different ML tools and techniques.

machine-learning matplotlib numpy pandas seaborn sklearn

Last synced: 01 May 2026

https://github.com/abdoomohamedd/python-data-analysis-projects

A collection of data analysis projects that I have worked on. Each project involves cleaning data, performing exploratory data analysis (EDA), and creating visualizations to extract insights. The projects utilize various Python libraries, including pandas, numpy, matp

data-analysis data-analysis-python data-cleaning data-visualization matplotlib matplotlib-pyplot numpy pandas python

Last synced: 01 May 2026

https://github.com/emmanueltobi/ai-academy-university

Code samples from the Intel Nervana AI Academy University held in FUTA

ai classification-algorithm dataset intel numpy regression regression-models sklearn

Last synced: 01 May 2026

https://github.com/satyamtripathi8/tools_for_data_science

Introduction to Data Science Tools (Python)

data-analysis-python matplotlib-pyplot numpy pandas

Last synced: 01 May 2026

https://github.com/dhruvv1402/spam-detection-python-

This project is a Spam Detection System built using Python. It classifies SMS messages as spam or ham (not spam) using machine learning techniques.

countvectorizer kaggle-dataset nlp-machine-learning nltk numpy pandas python scikit-learn supervised-machine-learning tf-idf

Last synced: 01 May 2026

https://github.com/dpb44/exploring-the-intuition-of-neural-networks-on-a-classification-problem-using-only-numpy

Implementing a softmax-based neural network from scratch using NumPy to classify the Iris dataset, leveraging vectorization, gradient descent, and decision boundary visualization.

deep-learning neural-network numpy softmax-classifier

Last synced: 02 May 2026

https://github.com/anker661/autograd-playground

A simple Numpy & Python based auto differentiation system that supports visualization before and after back-propagation.

auto-differentiation autograd computaion-graph jupyter-notebook numpy python3

Last synced: 09 May 2026

https://github.com/betiniakarandut/portfolio-project

Sukkar and Cornell Integral Approach to Estimate Static Bottom Hole Pressure (SBHP)

numpy pandas python3

Last synced: 09 May 2026

https://github.com/parham1998/linear_regression_pytorch

Implementation of a simple Linear Regression model with Numpy & PyTorch library

backpropagation gradient-descent linear-regression loss-functions numpy python pytorch

Last synced: 02 May 2026

https://github.com/bharath-tars/streamlit_diabsynth

Predictive Health Analytics for Diabetic Risk Assessment and Personalized Reporting WebApp using Streamlit

deeplearning mahine-learning numpy onrender-deploy pandas seaborn streamlit

Last synced: 02 May 2026

https://github.com/alexgustafsson/word-frequencies

Data and tools to compile word frequencies, trigrams and more for use with NLP, spelling correction etc.

language nlp nltk numpy python python3 sklearn spelling-correction trigram

Last synced: 10 May 2026

https://github.com/harsha-yuvaraj/data-compression-and-decompression-tool

A specialized lossless file compression and decompression tool designed mainly for plain text, including programming files.

data-structures huffman-compression-algorithm lossless-compression-algorithm numpy python tkinter

Last synced: 15 Apr 2026

https://github.com/subhrajit77/ai-ml

My personal AI/ML projects repository

jupyter numpy pandas python

Last synced: 09 Mar 2026

https://github.com/vuanhtuan1012/data-scientist-with-python

Notes on career track "Data Scientist with Python" at DataCamp

importing-data matlab numpy pandas python3 sqlalchemy

Last synced: 09 Apr 2026

https://github.com/hadjuse/spotify-recommendation-knn

A personal project where I use KNN to classify songs

data-science graphics interpretation knn-classification machine-learning matplotlib music numpy plotly spotify

Last synced: 03 May 2026

https://github.com/helenaden/data-science-fundamentals

This project delves into fundamental data science concepts using Python libraries like NumPy and Pandas

data-analysis datascience datasets datavisualization datawrangling heatmap numpy pandas patterns python

Last synced: 03 May 2026

https://github.com/isinghabhishek/data_analysis_with_python

Introduction to Data Analysis covering the basics of Python, Numpy, Pandas, Data Visualization, and Exploratory Data Analysis.

data-visualization exploratory-data-analysis numpy pandas python

Last synced: 03 May 2026

https://github.com/alphacrypto246/air-quality-prediction

The Air Pollution Prediction project forecasts pollution levels using the Air Quality UCI dataset, including variables like CO, NOx, temperature, and humidity. It analyzes temporal and environmental factors to predict air quality trends.

machine-learning numpy pandas scikitlearn-machine-learning sckiit-learn

Last synced: 10 May 2026

https://github.com/layonj3000/libras-recognition

This project is a Brazilian Sign Language (LIBRAS) recognition system that utilizes computer vision and machine learning techniques.

computer-vision libras mediapipe numpy opencv python random-forest-classifier

Last synced: 03 May 2026

https://github.com/sukanyadutta52/topic_modeling

What are the most pressing concerns regarding ‘Climate Change’ among tweeters according to Topic Modeling?

climate-change gensim matplotlib nltk numpy pandas pyldavis regular-expression spacy

Last synced: 09 Apr 2026

https://github.com/shailikadakia/yoga-pose-accuracy-detector

Real-time yoga pose accuracy detector using MediaPipe and OpenCV. Classifies poses from webcam input with KNN and provides feedback on duration and accuracy, built with Python and scikit-learn.

computer-vision fitness health-tech knn machine-learning mediapipe numpy opencv pandas pose-estimation python real-time-detection yoga

Last synced: 09 Apr 2026

https://github.com/hansalemaos/a_pandas_ex_fastloc

Up to 25 times faster than df.loc by combining np.select and numexpr.evaluate (works with utf-8)

dfloc fast iloc loc numexpr numpy pandas python

Last synced: 03 May 2026

https://github.com/prathicashettym/machinelearningalgorithms

Contains mini projects and implementation using ML algorithms

jupyter-notebook linear-regression matplotlib numpy pandas panfas python seaborn

Last synced: 03 May 2026

https://github.com/shreenidhi7700/auto_price_prediction

We are required to use Machine Learning algorithms to predict the price of cars with the available independent variables such as Horsepower, no-of-cylinders, engine-type, city_mpg, highway_mpg etc.

decision-tree-regression ensemble knn-regression linear-regression machine-learning-algorithms matplotlib-pyplot numpy pandas python3 random-forest-regression regression seaborn xgboost-regression

Last synced: 04 May 2026

https://github.com/furk4nbulut/uygulamalarla-makine-ogrenmesi-ve-derin-ogrenme-atolyesi

This repository covers the training process of the BTK Akademi Applied Machine Learning and Deep Learning Workshop held in Manisa. In the workshop, participants gain theoretical and practical knowledge of advanced machine learning and deep learning techniques.

matplotlib numpy pandas scikit-learn seaborn

Last synced: 03 May 2026

https://github.com/zongxr/dcic2025-riverwaterprediction

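The Min River (Minjiang) is the largest river in Fujian Province that flows independently into the sea, and its basin receives abundant precipitation. With the new-type power system and a new stage of high-quality water-resources development, easing flood-control pressure in the basin and improving water-resource utilization is a top priority for the high-quality development of hydropower enterprises. Flood forecasting with Class A accuracy for the next 9 hours has already been achieved using traditional hydrological principles, but there is still room to improve both lead time and forecast accuracy. Combining artificial intelligence techniques to accurately predict reservoir inflow in the basin is therefore of great strategic significance for safe reservoir operation, flood control, and a stable supply of clean energy.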

lightgbm machine-learning numpy pandas sklearn time-series xgboost

Last synced: 03 May 2026

https://github.com/manikantasanjay/youtube_exploratory_data_analytics

This is a Mini Project made using Python Libraries like Numpy, Pandas, Matplotlib etc to perform Data Analytics on US Video and Comments dataset.

jupyter-notebook matplotlib numpy pandas plotly python seaborn wordcloud

Last synced: 03 May 2026

https://github.com/chandkund/loan-eligibility-prediction

This project is designed to predict the eligibility of loan applicants based on various factors such as income, credit history, and marital status. By analyzing historical loan application data, the model helps to determine whether a loan application should be approved or not.

data-analysis data-science data-visualization machine-learning-algorithms matplotlib numpy pandas python seaborn

Last synced: 09 Apr 2026

https://github.com/alvii147/discretetimelib

Python library for analysis of discrete time signals and systems

numpy pandas scipy signals-and-systems sympy

Last synced: 03 May 2026

https://github.com/hansalemaos/np_zoom_in

Zoom in on a picture

image numpy python zoom

Last synced: 03 May 2026

https://github.com/nicolashaubricht/analise-dados-pib-per-capita

This project aims to analyze data on variations in GDP per capita across various countries over the years.

numpy pandas plotly python

Last synced: 03 May 2026

https://github.com/atchayaah/home-value-insights-kc

Data-driven project predicting King County housing prices using EDA, regression models, and ML techniques, developed as part of IBM’s Data Analysis with Python course on Coursera.

joblib matplotlib numpy pandas pickle python scikit-learn seaborn

Last synced: 03 May 2026

https://github.com/dawoudtormos/rain_predicition_ml

This repository contains a machine learning project for rain prediction using historical meteorological data. The goal is to predict whether it will rain the next day based on weather measurements like atmospheric water vapor, cloud formation, and past rainfall records.

ai machine-learning machine-learning-algorithms matplotlib numpy seaborn sklearn

Last synced: 10 May 2026

https://github.com/sanchariii/health-assist-1

Health Assist web app uses the Streamlit framework to help users identify potential diseases. Simply input your symptoms, and the system will predict whether you have diabetes, heart disease, or Parkinson's disease. The system is designed to be quick and convenient, promoting early detection and timely medical intervention.

numpy pandas pickle streamlit-webapp svm-model

Last synced: 13 Aug 2025

https://github.com/jupfu/beck-view-movie

Assemble mp4 video from list of images

multithreading numpy opencv python3 threadpool tqdm

Last synced: 10 May 2026

https://github.com/aymanmomin/facebook-ego-network-analysis-for-political-outreach-optimization

Analyzing Facebook ego networks to identify influential users and communities for optimizing political advertising. Leverages network science techniques (Louvain communities, centrality metrics, threshold models) to simulate influence spread.

community-detection course-project facebook-network-analysis louvain-algorithm louvain-community-detection matplotlib network-analysis network-graphs networkx-library numpy pandas python3 pyvisgraph research-paper scipy-stats sklearn-metrics tqdm

Last synced: 04 May 2026

https://github.com/pomxark/smartdictor_0.1_nuitka_cleer

Recognizes text on the screen and reads it aloud using a speech engine.

cv2 fuzzywuzzy kivy matplotlib-pyplot nuitka numpy pil pyautogui pyqt5 pytesseract pyttsx3 scipy silero simpleaudio torch

Last synced: 04 May 2026

https://github.com/gnikit/fileio

A template allowing to read multiple columns of tab or space delimited files. Similar functionality to numpy.loadtxt()

cpp file-io library numpy

Last synced: 04 May 2026

https://github.com/damisparks/become_data_analyst

Are you new to data analysis? Here you will find simple notebooks that will help you through your journey. These are personal projects that I have worked on and am still working on.

data data-analysis data-visualization matplotlib numpy pandas-tutorial

Last synced: 04 May 2026

https://github.com/RedInfinityPro/AI_projects

Rating (7/10) The code is a chatbot that interacts with users based on article content. However, issues include incorrect sorting, typos, incorrect indentation, and lack of explanations for each part, potentially affecting the bot's robustness and understanding of context.

newspaper nltk numpy python random sklearn string warnings

Last synced: 30 Sep 2025

https://github.com/vikpires/ds_tips-dataset

Individual project from the Avanti 2024.2 data science bootcamp, aimed at analyzing and observing patterns in the "Tips" dataset.

data-analysis data-science data-visualization exploratory-data-analysis jupyter-notebook matplotlib numpy pandas python seaborn tips

Last synced: 17 Sep 2025

https://github.com/jvictor011/big-five-personality-test

Using machine learning to perform clustering for a personality-trait system

matplotlib numpy os pandas python seaborn

Last synced: 07 May 2025

https://github.com/mzayles/iris-seaborn_visualizacao-dados

💻📊✅ Python Programming for Data Science course | Charts using Seaborn's sales dataset and the Matplotlib library.

matplotlib numpy pandas python seaborn

Last synced: 24 Mar 2025

https://github.com/jeslipriya/emotion-detection

This project is an enhanced real-time facial emotion detection system that uses deep learning (OpenCV + mini-XCEPTION) to accurately recognize human emotions through a webcam or image input. It features advanced preprocessing, temporal smoothing, and a clean visual interface for smooth and stable emotion analysis.

7-emotions deep-learning emotion-detection fer2013 numpy opencv opencv-python python tensorflow

Last synced: 03 Jul 2025

https://github.com/shibam120302/indian_sign_language_translator

This repository contains the code used to create an Indian Sign Language Translator that satisfies the following criteria: near-real-time application, background independence, and illumination independence.

imageai keras numpy opencv pillow tensorflow

Last synced: 12 Apr 2026

https://github.com/allenvox/statistics

Workspace for applied problems of probability theory & mathematical statistics & modelling class

acf autocorrelation distribution k-means k-means-clustering maths matplotlib numpy probability-theory python sklearn statistics

Last synced: 17 Apr 2026

https://github.com/byteshiva/getmaplocation

Automates dummy-data generation from an open data source: parses international addresses, uses Geopy to automate map locations from latitude and longitude, caches Geopy map-location calls with sqlite3, and uses Pandas to parse and convert CSV to JSON.

csv-parser geopy json numpy pandas pandas-dataframe parser pickle python3 sqlite3

Last synced: 18 Apr 2026

https://github.com/nazwright/daria

Real-time fraud detection architecture powered by AWS Kinesis, KaggleHub, and SMOTE-augmented data — the foundation of DARIA, the Detection And Risk-Intelligence Agent.

aws evm fraud fraud-detection-using-machine-learning kaggle kinesis machine-learning math numpy pandas python random web3

Last synced: 12 Apr 2026

https://github.com/sayan520/titanic-data-insights

Conducting data analysis on Kaggle's Titanic: Machine Learning from Disaster dataset using essential data wrangling, exploratory data analysis (EDA), and visualization techniques to uncover insights, identify patterns, and explore factors influencing passenger survival.

jupyter-notebook kaggle matplotlib numpy pandas python scikit-learn seaborn

Last synced: 12 Apr 2026

https://github.com/urvee1810/market_basket_analysis

A data mining project analyzing Instacart's 3 million grocery orders to uncover customer shopping patterns and product associations. Using market basket analysis and the Apriori algorithm, the project reveals key insights about shopping behavior, product combinations, and temporal patterns, providing valuable recommendations for retail strategy

apriori-algorithm data-mining data-visualization machine-learning market-basket-analysis matplotlib mlxtend numpy pandas python seaborn

Last synced: 12 Apr 2026

https://github.com/rohitinu6/image_cartoonify

Cartoonify an image using opencv-python

image-processing matplotlib numpy opencv opencv-python python

Last synced: 12 Apr 2026

https://github.com/micmurawski/cloud-array

cloud-array is an open-source Python library for storing and streaming large NumPy arrays on local file systems and major cloud providers' CDNs.

aws azure big-data bigarray blob-storage cloud data-structures digitalocean-spaces gcp gcp-cloud-storage ibm-cloud-object-storage numpy s3 stream-processing streaming zadara

Last synced: 12 Aug 2025

https://github.com/techcsispit/algorithmic-trading

This project focuses on algorithmic training using Python in Jupyter Notebooks. It covers various data structures and algorithms, providing a hands-on approach to solving algorithmic challenges. The notebooks are designed to be interactive and educational for learners of all levels.

matplotlib numpy pandas python sklearn

Last synced: 04 Sep 2025

https://github.com/hrolive/fundamentals-of-accelerated-data-science

How to use GPU-accelerated tools to conduct data science faster, leading to more scalable, reliable, and cost-effective results.

accelerated-computing bokeh cudf cugraph cuml cupy dask data-science jupyter numpy pandas python rapids xgboost

Last synced: 12 Apr 2026

https://github.com/nicolas-giacomelli/modelo-polinomial-api-streamlit

Code focused on training a model that predicts salary from tenure at the company and level within the company, made available through an API with a Streamlit front end. To calculate, enter the tenure and level to get the matching salary.

api fastapi joblib machine-learning matplotlib numpy pandas pingouin pydantic scikit-learn seaborn streamlit uvicorn

Last synced: 12 Apr 2026

https://github.com/shahsuvarli/election-voters-data-analysis-pandas

Educational project analyzing Azerbaijan voter demographics with pandas, focusing on data cleaning, grouping, and visualization.

cleaning data grouping matplotlib numpy pandas python visualization

Last synced: 12 Apr 2026

https://github.com/id-andyyy/alfahack

📈💰 Investment propensity prediction model

catboost hackathon-project jupyter lightgbm numpy optuna pandas python scikit-learn

Last synced: 12 Apr 2026

https://github.com/alwaysvivek/next-word-prediction

🔮 Predicts the next word in a text sequence using either an N-gram statistical model or an LSTM-based neural network.

argparse laplace-smoothing machine-learning neural-network ngrams nlp nltk numpy python3 tensorflow

Last synced: 12 Apr 2026

https://github.com/offthetab/matcha_code_generation

Fine-tuning the MatCha model for better Python Matplotlib code generation from a given chart.

fine-tuning huggingface matcha matplotlib numpy pandas pil sklearn torch

Last synced: 20 Jan 2026

https://github.com/emhash/machine-learning-and-ai

Personal tasks and code for Machine Learning and Artificial Intelligence. Practice code and ML/AI projects will be shared here.

ai datasets ipynb jupyter-notebook keras linear-regression logistic-regression machine-learning machine-learning-algorithms numpy opencv pandas python seaborn sklearn tensorflow

Last synced: 05 Jan 2026

https://github.com/armahdavi/mlops

Productionizing ML models using a variety of tools including FastAPI, Flask, Docker, AWS, GCP, TensorFlow Extended (TFX), and TF.js.

apache-beam docker fastapi logistic-regression mlops mlops-workflow numpy random-forest sklearn tensorflow-extended unicorn xgboost

Last synced: 12 Apr 2026

https://github.com/abdullahashfaqvirk/SMS-Spam-Detection

A machine learning application designed to classify SMS messages as spam or non-spam, offering real-time analysis to identify potentially harmful content.

css3 docker flask html5 javascript matplotlib nltk numpy pandas python scikit-learn seaborn tailwindcss xgboost

Last synced: 16 Aug 2025

https://github.com/iadtya/hate-text-classification

Training an RNN (LSTM) on hate speech with TensorFlow and Keras

fastapi keras lstm nltk numpy pandas tenserflow

Last synced: 12 Apr 2026

https://github.com/jillmpla/pythonxmldatabase

A Python & XML database application for a book collection.

database matplotlib numpy pandas pandas-dataframe python xml

Last synced: 12 Apr 2026

https://github.com/lorenzorottigni/ml-random

Machine Learning course of Piero Savastano 7: matplotlib, accuracy_score, precision_score, recall_score, f1_score, classification_report

machine-learning matplotlib metrics numpy

Last synced: 04 May 2026

https://github.com/alisonmitchell/countries-of-the-world

Exploratory Data Analysis of US Government data from the World Factbook to investigate GDP ($ per capita), Literacy (%) and Phones (per 1000).

data-science machine-learning matplotlib numpy pandas python seaborn

Last synced: 12 Apr 2026

https://github.com/akashkg03/facial-expression-image-classification

This notebook builds a facial expression image classifier that categorizes facial expressions into one of seven emotions: anger, disgust, fear, happiness, sadness, surprise, and neutral.

jupiter-notebook numpy pandas python

Last synced: 28 Apr 2026

https://github.com/saopicc/ddfacet-kernels

Modular implementation of the DDFacet Gridding Kernels

numpy radio-astronomy

Last synced: 26 Jan 2026

https://github.com/sothulthorn/stock-prediction-portal

The Stock Prediction Portal is a full-stack web application that demonstrates how to use machine learning models to analyze stock market data and forecast future trends.

axios django django-rest-framework javascript keras matplotlib numpy pandas python react tensorflow yfinance

Last synced: 02 Apr 2026

https://github.com/pthmhatre/smart-attendance-system-using-face-recognition

This project aims to create a smart attendance system using face recognition technology. The system captures and recognizes faces in real-time and marks attendance automatically, providing a seamless and efficient way to manage attendance records.

adafruit computervision css face-recognition iot iot-application javascript mysql numpy opencv php pickle python raspeberry-pi rfid

Last synced: 12 Apr 2026

https://github.com/Sadegh-Khedry/Pneumonia-Detection-Using-CNN

Classifying chest X-ray images into two categories: Pneumonia or Normal using a Convolutional Neural Network (CNN).

cnn convolutional-neural-networks cv2 deep-learning harris-corners joblib jupyter-notebook keras kmeans-clustering matplotlib numpy orb pandas python seaborn shutil sifit tensorflow

Last synced: 12 Apr 2026