An open API service indexing awesome lists of open source software.

NumPy

NumPy is an open source library for the Python programming language, adding support for large, multidimensional arrays, and matrices, along with a large collection of high-level mathematical functions to operate on these arrays.

https://github.com/5hraddha/sda-megaline-the-best-plan

Megaline is a telecom operator and it offers its clients two prepaid plans, Surf and Ultimate. The commercial department wants to know which of the plans brings in more revenue in order to adjust the advertising budget.

data-visualization hypothesis-testing matplotlib numpy pandas scipy seaborn statistical-data-analysis

Last synced: 16 Apr 2026

https://github.com/hygull/trypandas

A repository containing Python pandas and other data science related sources and other resources used to work with pandas, numpy, matplotlib etc.

matplotlib numpy pandas python3

Last synced: 16 Apr 2026

https://github.com/urvee1810/air-quality-prediction-using-arima

Analyzed Beijing's air quality data using time series analysis and ARIMA modeling to forecast PM2.5 pollution levels. Identified seasonal patterns and correlations between pollutants, revealing insights into urban air quality trends and potential contributing factors.

arima-model dickey-fuller-test matplotlib numpy pandas python seaborn statistical-analysis time-series-analysis

Last synced: 16 Apr 2026

https://github.com/armahdavi/data_pipeline_analytics_statistics_ml_pm_psd_residential_qff

Sharing all the data pipelines and processing codes, statistical modellings, descriptive statistics, plot visualizations, and machine learning from Mahdavi & Siegel (2021) (Indoor Air) Project Miestone: 2017 - 2020 Full-length article: https://onlinelibrary.wiley.com/doi/abs/10.1111/ina.12782

data-science data-visualization dust hvac indoor-air-quality jupyter-notebook machine-learning matplotlib-pyplot numpy pandas python scikit-learn scipy-stats spyder spyder-python-ide statistics

Last synced: 11 Apr 2026

https://github.com/akashkg03/facial-expression-image-classification

This notebook involves to build a facial expression image classifier which categorizes facial expressions into one of seven emotions: anger, disgust, fear, happiness, sadness, surprise, and neutral.

jupiter-notebook numpy pandas python

Last synced: 28 Apr 2026

https://github.com/costrouc/xnd-notebooks

A collections of notebooks that demonstrate xnd, ndtypes, and gumath features

binder gumath ndtypes notebooks numba numpy xnd

Last synced: 16 Apr 2026

https://github.com/constraintautomaton/client_pgu1

The client part of the PgU1 project it. It command the robot and do the video processing

numpy opencv paramiko pyqt5 python3 raspberry-pi-3

Last synced: 12 Apr 2026

https://github.com/magnusrodseth/disaster-tweets

📚 Assignments in the course IT3212 - Data Driven Software at NTNU. Our task is to classify whether a tweet is related to a disaster or not.

adaboost jupyter-notebook logistic-regression numpy pandas python python3 random-forest support-vector-machines xgboost

Last synced: 16 Apr 2026

https://github.com/ashwin331133/liver_disease_detection

This dataset consists of 416 liver patient records and 167 non-liver patient records collected from North East of Andhra Pradesh, India. And The main objective of this project is to use classification algorithms to detect liver patients from healthy individuals.

data-visualization machine-learning numpy pandas python

Last synced: 16 Apr 2026

https://github.com/parthds02/customer-segmentation-with-kmeans-clustering

Analyze customer behavior using Python and KMeans Clustering on transactional data. Features RFM analysis, data cleaning, clustering insights, and actionable visualizations to support business decision-making.

data-analysis data-visualization feature-engineering kmeans-clustering numpy pandas vscode

Last synced: 11 May 2026

https://github.com/daksh777/expressando

Expressando is a rock, paper, scissors game made using Python (TensorFlow, Keras and OpenCV)

keras keras-tensorflow machine-learning numpy opencv opencv-python tensorflow

Last synced: 13 Apr 2026

https://github.com/alisonmitchell/countries-of-the-world

Exploratory Data Analysis of US Government data from the World Factbook to investigate GDP ($ per capita), Literacy (%) and Phones (per 1000).

data-science machine-learning matplotlib numpy pandas python seaborn

Last synced: 12 Apr 2026

https://github.com/bachboy0/montecarlo

Python-based Monte Carlo simulations for statistical analysis and modeling.

monte-carlo-methods numpy python

Last synced: 21 Apr 2026

https://github.com/barkerbg001/framefusion

Python script that creates a video from a set of images and adds a random audio clip to each video

glob moviepy numpy opencv pygt5 python python-3 random video video-editor

Last synced: 16 Apr 2026

https://github.com/khaymanii/calories-burnt-prediction-model

This model was built using Python and XGBoost Regression algorithm

matplotlib numpy pandas python scikit-learn

Last synced: 16 Apr 2026

https://github.com/abhishekpsonawane07/loan-repayment-prediction-ml

Loan Approval Prediction uses machine learning to analyze factors like income and credit history, automating and improving the loan approval process. The project ensures accurate, efficient, and user-friendly predictions with a Gradio interface.

artificial-neural-networks confusion-matrix decision-trees gradio-interface k-nearest-neighbours loan-dataset loan-prediction-analysis loan-repayment logistic-regression numpy pandas python random-forest sckiit-learn seaborn support-vector-machines

Last synced: 16 Apr 2026

https://github.com/dina-hosny/investigate-a-dataset-project

Investigate a Dataset project - Udacity Data Analysis Professional Track. In this project, I used the Python libraries NumPy, Pandas and Matplotlib to analyze a dataset and communicate my findings about it.

csv data-analysis jupyter-notebook matplotlib numpy pandas python

Last synced: 16 Apr 2026

https://github.com/mugambi645/exploring-ebay-car-sales-data

Exploring ebay car sales dataset

car-sales data-analysis numpy pandas

Last synced: 16 Apr 2026

https://github.com/moonmoonsamal/meesho_order_financial_analysis

Generating insights from Meesho sales data (Oct-Nov)

cleaning-dataset eda numpy pandas visualization

Last synced: 16 Apr 2026

https://github.com/omogbolahan94/google-play-store-app-analysis

Data exploration, data cleaning, interactive data visualisation and insight presentation

matplotlib numpy pandas plotly

Last synced: 11 May 2026

https://github.com/shreeparab1890/indian-cricketer-classifier

This notebook is trying to bulia a model which will predict a Indian Cricketer based on the given image. In this project we have handled 8 Indian Cricketers and build a model to classify the given image between this 8 Cricketers.

image-classification matplotlib numpy opencv pandas python random-forest-classifier scikit-learn sklearn streamlit

Last synced: 01 Apr 2026

https://github.com/mpoojithavigneswari/bangalore-house-price-prediction

This project involves creating a website that predicts Bangalore house prices with 94.65% accuracy using a machine learning algorithm.

data-analysis data-science flask-server machine-learning matplotlib numpy pandas python scikit-learn seaborn

Last synced: 12 Apr 2026

https://github.com/nilusche/feed_forward_network_py

Implementation of a neural network in python which can predict handwritten numbers from the MNIST dataset.

colab-notebook feed-forward-neural-networks neural-network numpy python

Last synced: 01 May 2026

https://github.com/rahulchouhan1/spotify-most-popular-songs-data-analysis

🎵 Spotify Songs Analysis using Pandas

matplotlib numpy pandas

Last synced: 16 Apr 2026

https://github.com/lorenzorottigni/ml-random

Machine Learning course of Piero Savastano 7: matplotlib, accuracy_score, precision_score, recall_score, f1_score, classification_report

machine-learning matplotlib metrics numpy

Last synced: 04 May 2026

https://github.com/marcow2812/zuse-projekt

Python-basierte Software zur Projektion von 3D-Objekten auf einen Würfel

augmented-reality numpy opencv-contrib python

Last synced: 16 Apr 2026

https://github.com/littie28/hazy-frames

Hierarchical coordinate frames - crystal clear transforms

3d coordinate-systems geometry numpy python reference-frames scipy transformations

Last synced: 18 Apr 2026

https://github.com/ayhem18/tinyad

A simple Python library to better understand automatic Differentiation

automatic-differentiation mathematical-programming numpy python3

Last synced: 29 Apr 2026

https://github.com/zuhairzia/customer-segmentation

📖 About Customer Segmentation using KMeans clustering to analyze demographics, income, and spending. Helps businesses with targeted marketing and customer insights.

joblib matplotlib numpy pandas scikit-learn seaborn

Last synced: 05 May 2026

https://github.com/voidwalker7/mnist

MNIST Digit Detector Model using only Numpy

from-scratch-in-python mnist numpy python

Last synced: 05 May 2026

https://github.com/pramodyasahan/cifar-10-object-recognition

This repository contains a TensorFlow-based image classification model designed to classify images from the CIFAR-10 dataset. The CIFAR-10 dataset comprises 60,000 32x32 color images in 10 classes, with 6,000 images per class.

convolutional-neural-networks numpy python tensorflow

Last synced: 05 May 2026

https://github.com/rohra-mehak/sciencesync

System for Personalized Google Scholar Alerts Processing and Data Management, and provision of ML based clustering analysis

agglomerative-clustering clustering crossref-api customtkinter google-api google-scholar graph-api machine-learning numpy pandas python3 scientific-article-analysis scikit-learn sqlite3

Last synced: 05 May 2026

https://github.com/dineshdhamodharan24/industrial-copper-modeling

It seems like you have a project that involves modeling industrial copper data using Python and several libraries such as pandas, numpy, and scikit-learn. This is a common and practical approach, as these libraries are widely used for data manipulation, analysis, and machine learning tasks.

bussiness-solution numpy pandas pickle pickle-file seaborn sklearn-library streamlit

Last synced: 13 Apr 2026

https://github.com/jonad/quora_data_analysis

Exploratory data analysis and feature engineering on Quora dataset.

exploratory-data-analysis feature-extraction numpy pandas python3 quora-dataset

Last synced: 05 May 2026

https://github.com/abdelrahman-amen/neural_style_transfer

This project applies neural style transfer to create new images by merging the content of one image with the artistic style of another.

neuralstyletransfer numpy os python streamlit tensorflow

Last synced: 05 May 2026

https://github.com/sahaninilesh/future_ds_01_social-media-sentiments-analysis

The Social Media Sentiments Analysis Dataset captures a vibrant tapestry of emotions, trends, and interactions across various social media platforms. This dataset provides a snapshot of user-generated content, encompassing text, timestamps, hashtags, countries, likes, and retweets. Each entry unveils unique stories—moments of surprise, excitement,

jupyter matplotlib numpy pandas python

Last synced: 05 May 2026

https://github.com/amruta33/lead_scoring_case_study

An education company named X Education sells online courses to industry professionals. On any given day, many professionals who are interested in the courses land on their website and browse for courses.

bagging linear-regression numpy pandas python3

Last synced: 05 May 2026

https://github.com/hansalemaos/cinpy

C/C++ in Python for Dummies

c cpp ctypes numpy python

Last synced: 13 Apr 2026

https://github.com/supernovasatsangi23/modifying-biomarker-gene-identification-for-effective-cancer-categorization

A project that focuses on implementing a hybrid approach that modifies the identification of biomarker genes for better categorization of cancer. The methodology is a fusion of MRMR filter method for feature selection, steady state genetic algorithm and a MLP classifier.

dataset deep-learning deep-neural-networks feature-selection genetic-algorithm machine-learning machine-learning-algorithms mlp-classifier mrmr neural-network numpy pandas-dataframe python python3 scikit-learn scikit-learn-python tkinter-gui tkinter-python

Last synced: 05 May 2026

https://github.com/lorenzorottigni/ml-universities

Machine Learning python bootcamp: K mean clustering with public/private universities dataset

k-mean-clustering machine-learning numpy pandas python scikit-learn seaborn

Last synced: 05 Apr 2026

https://github.com/hms75/movie_rating_analysis

A movie rating analysis which identifies trends amongst a dataset of 5000 movies.

data-analysis data-visualization matplotlib-pyplot numpy pandas python

Last synced: 05 May 2026

https://github.com/tfiers/unitlib

📐 Physical units for NumPy arrays \ ⏱ Fast • Simple • High voltage

numpy python scientific-computing unitlib units

Last synced: 13 May 2026

https://github.com/companyakis/numpy

Python Numpy

data-science numpy python

Last synced: 04 Sep 2025

https://github.com/vikkiezdev/ai-global-index-analysis

This project analyzes the AI readiness of 62 countries using key indicators like government strategy, commercial activity, research, development, and infrastructure. Through data cleaning, EDA, and visualization, it identifies key drivers of AI adoption and competitiveness.

cleaning-data correlation-analysis eda matplotlib numpy pandas python3 seaborn statistical-analysis

Last synced: 06 May 2026

https://github.com/andydevs/optipy

Optimization algorithms written in python

numpy optimization optimization-algorithms

Last synced: 13 May 2026

https://github.com/leavers/npblob

Transfer numpy array between Python and JavaScript losslessly

array javascript matrix numpy python streaming typescript

Last synced: 06 May 2026

https://github.com/shwetapardhi/assignment-1-q9_b--basic-statistics-level-1

Q9) Calculate Skewness, Kurtosis & draw inferences on the following data SP and Weight(WT) Use Q9_b.csv

kurtosis matplotlib-pyplot numpy pandas python seaborn skewness

Last synced: 06 May 2026

https://github.com/kashinathbiradar/bangalore-housing-price-prediction

The objective of the project is to create a machine learning model. We are doing a supervised learning and our aim is to do predictive analysis to predict housing price.

css deta doc html machine-learning matplotlib numpy pandas preprocessing seaborn sklearn sql

Last synced: 06 May 2026

https://github.com/abdelrahman-amen/active_learning_with_different_query_strategies

This project explores the implementation of active learning techniques, focusing on various query strategies to optimize the selection of informative data points for model training. It aims to reduce the amount of labeled data required while improving model performance, especially in scenarios with limited labeled data.

activelearning cuda entropy kldivergence margin numpy python pyto uncertainty

Last synced: 06 May 2026

https://github.com/alphacrypto246/stock-price-movement-prediction

A project leveraging Polynomial Regression to predict stock price movements based on historical data. Includes data preprocessing, feature engineering, visualization, and model evaluation to provide insights for informed trading decisions.

machine-learning numpy pandas polynomial-regression scikit-learn yfinance

Last synced: 13 Apr 2026

https://github.com/ianjure/simple-neural-network

Implementing a simple neural network using only Numpy.

deep-learning neural-network notebook numpy scratch-implementation

Last synced: 06 May 2026

https://github.com/iama02/vet-medicine-analysis

Analysis of 50,000+ veterinary medicine records with Pandas and NumPy, highlighting dosage forms, manufacturers, indications, and shelf life trends.

numpy pandas python

Last synced: 06 May 2026

https://github.com/skorupcia/winequality-white

winequality-white.csv visualization

matplotlib numpy pandas seaborn sklearn

Last synced: 06 May 2026

https://github.com/yashpaneliya/bank-loan-default-analysis

Analyze and understand the driving factors (or driver variables) behind loan default, i.e. the variables which are strong indicators of default.

data-analysis loan-default-analysis matplotlib numpy pandas python

Last synced: 06 May 2026

https://github.com/mat-kubiak/manual-ml

DIY ML framework (using only NumPy!) + various demos

artificial-intelligence examples machine-learning ml ml-framework neural-networks numpy

Last synced: 06 May 2026

https://github.com/sankaran-s2001/us-traffic-accidents-analysis-python-eda

Exploratory data analysis of US traffic accidents from 2016-2023, analyzing patterns by time, location, weather, and severity using Python data science libraries.

data-analysis data-science data-visualization eda matplolib numpy pandas python

Last synced: 06 May 2026

https://github.com/erick957/saleprice-prediction-dataset-analysis-and-cleaning-advance-regression

🏠 Predict house prices using advanced regression techniques with this comprehensive analysis and cleaning project, from data loading to model deployment.

data-analysis data-science eda google-colab machine-learning numpy pandas python scikit-learn scikit-learn-python

Last synced: 06 May 2026

https://github.com/andrewsy1004/logistic-regression-spam-classifier

This project implements a spam email classifier using Logistic Regression.

numpy pandas scikit-learn

Last synced: 06 May 2026

https://github.com/abhinav330/customer-behavior-analysis-linear-regression

This repository explores customer behavior data for an NYC clothing company with both a mobile app and website. They want to understand which platform drives higher sales.

data-analysis data-science data-visualization eda exploratory-data-analysis jupyter jupyter-notebook linear-regression machine-learning machine-learning-algorithms machinelearning-python numpy pandas python regression-analysis

Last synced: 06 May 2026

https://github.com/jim-by/tweet-sentiment-analysis

Sentiment analysis of tweets using TextBlob for labeling and RandomForest for classification.

nltk nltk-tokenizer numpy pandas python random-forest-classifier sklearn textblob-sentiment-analysis

Last synced: 13 Apr 2026

https://github.com/dahsie/machine_learning_from_scratch

This project aims to implement some machine learning basic techniques(e.g. MinMaxScaler, StandardScaler, TD-IDF, PCA, Logistic Regression, LDA, KNN, Naive Bayes Classifier) using only pyton, numpy and pandas. This will enable me to have hone my data scientist skills

classification clustering data data-processing datascience machienlearning nlp nltk numpy pandas python regression

Last synced: 04 May 2026

https://github.com/vasu7052/recognizing-handwritten-digits

This is a machine learning project created in Python using Neural Networks and Supervised Learning Algorithms.

machine-learning machine-learning-algorithms numpy python scikit-learn

Last synced: 13 Apr 2026

https://github.com/dan-niles/neural-networks-from-scratch

A neural network implemented using only NumPy and Pandas for CS3631: Deep Neural Networks

neural-networks numpy pandas

Last synced: 05 May 2026

https://github.com/frolov-andrey2405/prime-numbers-and-polar-coordinate-system

Prime numbers and polar coordinate system

matplotlib numba numpy python

Last synced: 01 May 2026

https://github.com/samkazan/structural_discovery_of_macromolecules_data_analysis

This research project uses machine learning techniques and neural network to uncover key factors that contribute to successful protein structure discovery using Python and R

classification clustering ipython-notebook jupyter-notebook keras-neural-networks keras-tensorflow machine-learning neural-network numpy python r rmarkdown scikit-learn scipy tensorflow

Last synced: 02 Feb 2026

https://github.com/mikma03/datascience_python_datacamp

DataScience with Python. Code and examples. Python libraries, including pandas, NumPy, Matplotlib, and many more.

data-analysis data-science datacamp datascience numpy pandas python

Last synced: 06 May 2026

https://github.com/debjyotisaha/web-application-projects

Web Applications developed with the help Python libraries and ML algorithms

algorithms cufflinks machine-learning matplotlib numpy pandas prediction python scikit-learn seaborn sklearn streamlit

Last synced: 13 Apr 2026

https://github.com/mehwishferoz/data-analysis-with-python-zero-to-pandas

This repository contains the Python code and projects I created while learning the Data Analysis with Python: Zero to Pandas course. The course covers essential topics such as data cleaning, analysis, and visualization using powerful Python libraries like Pandas, NumPy, Matplotlib, and Seaborn.

exploratory-data-analysis jovian matplotlib numpy pandas seaborn

Last synced: 10 May 2026

https://github.com/prateekrajsrivastav/financial-transition-classification

This project focuses on classifying financial transactions using machine learning techniques. By leveraging labeled data, the model aims to categorize transactions into predefined categories (e.g., "Food," "Transport," "Shopping," etc.).

matplotlib numpy pandas-python scikitlearn-machine-learning seaborn

Last synced: 07 Sep 2025

https://github.com/saurabh274/aerofit-statistics-and-probability

The market research team at AeroFit wants to identify the characteristics of the target audience for each type of treadmill offered by the company, to provide a better recommendation of the treadmills to the new customers. The team decides to investigate whether there are differences across the product with respect to customer characteristics.

matplotlib-pyplot numpy pandas probability python seaborn statistics

Last synced: 08 May 2026

https://github.com/kunalpisolkar24/gradientdescent

A Python implementation of gradient descent for linear regression, visualized with cost function history, parameter trajectory, and a 3D cost surface.

cost-function gradient-descent numpy visualization

Last synced: 06 May 2026

https://github.com/alinenog/desenvolve_gb_2022

Formação Desenvolve 2022 do Grupo Boticário na área de dados

data-analysis data-science googlesheet machine-learning numpy pandas python

Last synced: 13 Apr 2026

https://github.com/kartikeya443/handgesture_vol_control

This OpenCV project enables remote control of sound playback using wireless communication. It lets users adjust the master volume of the central device using finger gestures, enhancing convenience and user experience.

mediapipe numpy opencv pycaw wireless-communication

Last synced: 13 Apr 2026

https://github.com/paulo-santos-ds/guia_de_precos_de_veiculos_com_machine_learn

Sistema de predição de preços de carros usados desenvolvido para a Empresa Rusty Bargain

catboost lgbm numpy pandas pyplot python seaborn sklearn time

Last synced: 13 Apr 2026

https://github.com/cezlul/analyse-ventes-immobilier

Solution ML d'analyse immobilière parisienne : classification automatique appartements vs commerces (K-means, 91%) et prédiction prix (régression, R²=0.98) sur 26K transactions. Valorise portefeuille 169M€ avec recommandations stratégiques data-driven.

data-analysis jupyter-notebook machine-learning matplotlib numpy pandas python sklearn

Last synced: 13 Apr 2026

https://github.com/lexiortiz/ibm-data-engineering-fundamentals

Notes, exercises, and projects from the IBM Data Engineering Fundamentals path via Verizon Skill Forward.

data-engineering numpy pandas postegresql python sql

Last synced: 13 Apr 2026

https://github.com/renukadhule/crop_production_python_analysis

This project analyzes crop production data in India from 1997 to 2015, identifying trends, seasonality, and top-performing states and crops. The insights aim to guide improvements in agricultural productivity and policy decisions.

matplotlib numpy pandas python

Last synced: 12 Apr 2026