An open API service indexing awesome lists of open source software.

NumPy

NumPy is an open source library for the Python programming language, adding support for large, multidimensional arrays, and matrices, along with a large collection of high-level mathematical functions to operate on these arrays.

https://github.com/sahaninilesh/future_ds_01_social-media-sentiments-analysis

The Social Media Sentiments Analysis Dataset captures a vibrant tapestry of emotions, trends, and interactions across various social media platforms. This dataset provides a snapshot of user-generated content, encompassing text, timestamps, hashtags, countries, likes, and retweets. Each entry unveils unique stories—moments of surprise, excitement,

jupyter matplotlib numpy pandas python

Last synced: 05 May 2026

https://github.com/hafidaso/predicting-industrial-machine-downtime-level-3

This project aims to develop a predictive model using machine learning techniques to forecast machine failures based on historical operational data.

imbalanced-learning numpy pandas python scikit-learn seaborn xgboost

Last synced: 16 Apr 2026

https://github.com/mramshaw/intro-to-ml

Intro to Machine Learning - Pattern Recognition for Fun and Profit

machine-learning matplotlib ml numpy pandas pip pip3 python scikit-learn scipy seaborn seaborn-plots sklearn statsmodels tensorflow weka

Last synced: 11 Apr 2026

https://github.com/sasanka14/water_quality_predictions

Water Quality Prediction - College Project 🌊💧 Predicts water potability (safe/unsafe) using ML models like XGBoost & Random Forest. Features data preprocessing, feature importance, model evaluation, and visualizations. Built with Python, Pandas, Scikit-learn & Seaborn for analysis. 🚀

anaconda jupyter-notebook matplotlib numpy pandas python scikit-learn seaborn xgboost

Last synced: 16 Apr 2026

https://github.com/manuelfay/numpydeeplearning

Deep Learning Framework built entirely using numpy

deep-learning deep-learning-framework minigrad numpy pytorch

Last synced: 06 May 2026

https://github.com/anujdutt9/reinforcement_learning

Reinforcement Learning using Numpy and PyTorch.

numpy python3 pytorch reinforcement-learning

Last synced: 16 Apr 2026

https://github.com/3rd-son/movie-streaming-service-analysis

Exploratory Data Analysis of the Streaming Services like Neflix, Hulu, Disney+ etc

data-analysis exploratory-data-analysis jupyter-notebook matplotlib numpy pandas plotly python seaborn

Last synced: 18 Apr 2026

https://github.com/lorenzorottigni/ml-advertising

Machine Learning python bootcamp: logistic regression on advertising dataset

ipynb logistic-regression machine-learning numpy pandas python scikit-learn seaborn

Last synced: 16 Apr 2026

https://github.com/williyam-m/movie_booking_app

A Django-based movie booking app featuring an admin dashboard with CRUD functionality for movies, theatres, screens, shows, and bookings. Includes an ML-powered movie recommendation system.

django machine-learning numpy orm pandas python sckiit-learn sqlite3

Last synced: 16 Apr 2026

https://github.com/pramodyasahan/health-insurance-cost-prediction

This project focuses on predicting health insurance costs using a polynomial regression model. By employing machine learning techniques in Python, the project aims to accurately estimate insurance costs based on various personal attributes. The model takes into account several features including age, sex, BMI, number of children, smoking status etc

machine-learning matplotlib numpy pandas python3 scikit-learn

Last synced: 16 Apr 2026

https://github.com/supernovasatsangi23/modifying-biomarker-gene-identification-for-effective-cancer-categorization

A project that focuses on implementing a hybrid approach that modifies the identification of biomarker genes for better categorization of cancer. The methodology is a fusion of MRMR filter method for feature selection, steady state genetic algorithm and a MLP classifier.

dataset deep-learning deep-neural-networks feature-selection genetic-algorithm machine-learning machine-learning-algorithms mlp-classifier mrmr neural-network numpy pandas-dataframe python python3 scikit-learn scikit-learn-python tkinter-gui tkinter-python

Last synced: 05 May 2026

https://github.com/5hraddha/sda-megaline-the-best-plan

Megaline is a telecom operator and it offers its clients two prepaid plans, Surf and Ultimate. The commercial department wants to know which of the plans brings in more revenue in order to adjust the advertising budget.

data-visualization hypothesis-testing matplotlib numpy pandas scipy seaborn statistical-data-analysis

Last synced: 16 Apr 2026

https://github.com/hygull/trypandas

A repository containing Python pandas and other data science related sources and other resources used to work with pandas, numpy, matplotlib etc.

matplotlib numpy pandas python3

Last synced: 16 Apr 2026

https://github.com/urvee1810/air-quality-prediction-using-arima

Analyzed Beijing's air quality data using time series analysis and ARIMA modeling to forecast PM2.5 pollution levels. Identified seasonal patterns and correlations between pollutants, revealing insights into urban air quality trends and potential contributing factors.

arima-model dickey-fuller-test matplotlib numpy pandas python seaborn statistical-analysis time-series-analysis

Last synced: 16 Apr 2026

https://github.com/thiti-dev/attendance-management-system

A quick casual project just only for a demonstration in the discrete mathematics course

numpy opencv pandas pillow tkinter

Last synced: 16 Apr 2026

https://github.com/aadityasikder/neural-network-from-scratch-

This notebook demonstrates a neural network implementation using NumPy, without TensorFlow or PyTorch. Trained on the MNIST dataset, it features an architecture with input layer (784 neurons), two hidden layers (132 and 40 neurons), and an output layer (10 neurons) with sigmoid activation.

backward-propagation evaluation forward-propagation mnist-dataset neural-network neural-networks-from-scratch numpy training

Last synced: 16 Apr 2026

https://github.com/hairymax/python-for-science-lecture-notes

Tutorial material on the scientific Python ecosystem in Russian

matplotlib numpy python scipy sympy

Last synced: 12 Apr 2026

https://github.com/tebogoyungmercykay/mathematical_modelling_in_python

Python is an interpreted, object-oriented, high-level programming language with dynamic semantics. Its high-level built in data structures, combined with dynamic typing and dynamic binding, make it very attractive for Rapid Application Development, as well as for use as a scripting or glue language to connect existing components together. Python's simple, easy to learn syntax emphasizes readability and therefore reduces the cost of program maintenance. Python supports modules and packages, which encourages program modularity and code reuse. The Python interpreter and the extensive standard library are available in source or binary form without charge for all major platforms, and can be freely distributed.

clock counter differential-equations equations-and-plotting growth-factor is-sorted lamda-expressions linearization logistic-difference-equations mathplotlib numpy pell-recursive polyfit python recursion rieman-sums sympy

Last synced: 14 Mar 2025

https://github.com/costrouc/xnd-notebooks

A collections of notebooks that demonstrate xnd, ndtypes, and gumath features

binder gumath ndtypes notebooks numba numpy xnd

Last synced: 16 Apr 2026

https://github.com/rahul-shrivastav/digit-recognition-nn

Web application that recognizes handwritten digits using a deep learning model

deep-learning flask numpy python reactjs tensorflow

Last synced: 11 Apr 2026

https://github.com/adarshpheonix2810/fake-job-post-detection

This project focuses on detecting fake job posts using machine learning. Fake job advertisements are often created to scam individuals by stealing personal information or money.

data-analysis deep-learning joblib machine-learning nlp-machine-learning numpy pandas python scikit-learn tkinter

Last synced: 12 Apr 2026

https://github.com/elhanarinc/deeplearning

Ceng 783 Deep Learning Assignments

ipython-notebook jupyter-notebook numpy python2

Last synced: 06 May 2026

https://github.com/magnusrodseth/disaster-tweets

📚 Assignments in the course IT3212 - Data Driven Software at NTNU. Our task is to classify whether a tweet is related to a disaster or not.

adaboost jupyter-notebook logistic-regression numpy pandas python python3 random-forest support-vector-machines xgboost

Last synced: 16 Apr 2026

https://github.com/ejw-data/tableau-songs

Brief analysis of songs from 1890 - 2015 using pandas and Tableau

numpy pandas python songs tableau

Last synced: 16 Apr 2026

https://github.com/ashwin331133/liver_disease_detection

This dataset consists of 416 liver patient records and 167 non-liver patient records collected from North East of Andhra Pradesh, India. And The main objective of this project is to use classification algorithms to detect liver patients from healthy individuals.

data-visualization machine-learning numpy pandas python

Last synced: 16 Apr 2026

https://github.com/nxhawk/mln

Machine learning is a branch of artificial intelligence (AI) and computer science which focuses on the use of data and algorithms to imitate the way that humans learn, gradually improving its accuracy.

decision-tree-classifier genetic-algorithm jupyter-notebook machine-learning-algorithms numpy pandas sklearn-library

Last synced: 06 May 2026

https://github.com/jvictor011/analise_de_dados_pokemon

Este projeto se concentra na análise dos dados da Pokedex, que incluem informações sobre vários Pokémon. O código em Python realiza várias etapas de análise exploratória de dados (AED) e gera visualizações para entender melhor as características dos Pokémon.

matplotlib numpy pandas python scipy

Last synced: 07 May 2025

https://github.com/vikkiezdev/ai-global-index-analysis

This project analyzes the AI readiness of 62 countries using key indicators like government strategy, commercial activity, research, development, and infrastructure. Through data cleaning, EDA, and visualization, it identifies key drivers of AI adoption and competitiveness.

cleaning-data correlation-analysis eda matplotlib numpy pandas python3 seaborn statistical-analysis

Last synced: 06 May 2026

https://github.com/leavers/npblob

Transfer numpy array between Python and JavaScript losslessly

array javascript matrix numpy python streaming typescript

Last synced: 06 May 2026

https://github.com/bzubs/mlzero

Implementation of widely used ML algorithms in vanilla python

machine-learning machine-learning-algorithms machinelearning numpy python statistics

Last synced: 03 Mar 2026

https://github.com/iamkushvanth/marketing-campaign-analysis

Data analysis comparing the performance of Facebook and AdWords ad campaigns in 2019, Using Statical Methods such as A/B test focusing on clicks, conversions, cost-effectiveness, and seasonal trends.

ab-testing hypothesis-testing linear-regression matplotlib-pyplot numpy pandas python regression-analysis scikit-learn scipy stastistics

Last synced: 07 Apr 2026

https://github.com/abdelrahman-amen/active_learning_with_different_query_strategies

This project explores the implementation of active learning techniques, focusing on various query strategies to optimize the selection of informative data points for model training. It aims to reduce the amount of labeled data required while improving model performance, especially in scenarios with limited labeled data.

activelearning cuda entropy kldivergence margin numpy python pyto uncertainty

Last synced: 06 May 2026

https://github.com/macktb/tetris

Tetris game

list-comprehension numpy

Last synced: 16 Apr 2026

https://github.com/kartmaan/plotly-intervals

Group values from a Pandas Series according to given intervals and represent them graphically with Plotly

bar-chart grouping grouping-plots intervals intervals-chart numpy numpy-arrays orca pandas pandas-series percentile pie-chart plot plotly plotting values

Last synced: 10 May 2026

https://github.com/khaymanii/calories-burnt-prediction-model

This model was built using Python and XGBoost Regression algorithm

matplotlib numpy pandas python scikit-learn

Last synced: 16 Apr 2026

https://github.com/anas436/student-performance-analysis

In this project I have constructed a Machine Learning System which will analyis students performance with about their academic records. Note that, this project will work with any students recods which you want to provide.

data-analysis jupyter-notebook matplotlib numpy pandas python3 seaborn

Last synced: 16 Apr 2026

https://github.com/abhishekpsonawane07/loan-repayment-prediction-ml

Loan Approval Prediction uses machine learning to analyze factors like income and credit history, automating and improving the loan approval process. The project ensures accurate, efficient, and user-friendly predictions with a Gradio interface.

artificial-neural-networks confusion-matrix decision-trees gradio-interface k-nearest-neighbours loan-dataset loan-prediction-analysis loan-repayment logistic-regression numpy pandas python random-forest sckiit-learn seaborn support-vector-machines

Last synced: 16 Apr 2026

https://github.com/thekartikeyamishra/customer-retention-predictor

The Customer Retention Predictor is a Python-based tool designed to help businesses predict customer churn using historical data. This project is particularly beneficial for small businesses and MSMEs in India, allowing them to identify customers at risk of leaving and take proactive measures to retain them.

joblib machine-learning numpy pandas python scikit-learn tinker

Last synced: 16 Apr 2026

https://github.com/dina-hosny/investigate-a-dataset-project

Investigate a Dataset project - Udacity Data Analysis Professional Track. In this project, I used the Python libraries NumPy, Pandas and Matplotlib to analyze a dataset and communicate my findings about it.

csv data-analysis jupyter-notebook matplotlib numpy pandas python

Last synced: 16 Apr 2026

https://github.com/shiqilu77/machinlearning-lol_gaming

a supervised model that predicts the match winning rate in LOL game based on the lineups of both sides. Implemented data processing by multiple python API including sklearn, NumPy, pandas, and by feature encoding, kNN, Synthetic Minority Oversampling Technique, etc

league-of-legends machine-learning numpy supervised-learning

Last synced: 16 Apr 2026

https://github.com/iankitnegi/python_projects

Data Analyst Toolkit: A comprehensive collection of Python scripts and notebooks designed for data analysis tasks. Features data cleaning, visualization, statistical analysis, and machine learning models. Ideal for analysts seeking efficient, reproducible workflows.

matplotlib numpy pandas python seaborn

Last synced: 20 Apr 2026

https://github.com/mugambi645/exploring-ebay-car-sales-data

Exploring ebay car sales dataset

car-sales data-analysis numpy pandas

Last synced: 16 Apr 2026

https://github.com/archish27/pythontutorial

Python Programming Tutorial for new geeks who want to learn python from scratch to deal with various applications

matplotlib numpy pandas pygame python python-2 python-3 scikit-learn soup

Last synced: 01 Apr 2026

https://github.com/moonmoonsamal/meesho_order_financial_analysis

Generating insights from Meesho sales data (Oct-Nov)

cleaning-dataset eda numpy pandas visualization

Last synced: 16 Apr 2026

https://github.com/ianjure/simple-neural-network

Implementing a simple neural network using only Numpy.

deep-learning neural-network notebook numpy scratch-implementation

Last synced: 06 May 2026

https://github.com/amirkh777/data-science-yandex-practicum

My projects from the Yandex Practicum Data Science course.

data-science matplotlib numpy pandas python scikit-learn scipy seaborn

Last synced: 12 Apr 2026

https://github.com/sahiltiwariiii/dssp

Predicting student math scores ! This project utilizes advanced machine learning techniques and MLOps tools like DVC and MLflow to predict a student's math score based on various factors such as gender, race/ethnicity, parental level of education, lunch type, test preparation course, writing etc

docker dotenv dvc flask machine-learning mlflow mlops mysql mysql-connector-python numpy pandas pymysql python python-dotenv scikit-learn seaborn sklearn-library statistics streamlit

Last synced: 27 Mar 2026

https://github.com/vishal-verma-96/pre-owned-car-price-prediction-using-streamlit-app

Capstone Project by skill Academy- Exploratory Analysis, Visualization and Prediction of Used Car Prices. Deploying the highest-scoring model with Streamlit web app

data-analysis data-science jupyter-notebook machine-learning machine-learning-algorithms matplotlib numpy pandas python3 regression-algorithms scikit-learn seaborn streamlit

Last synced: 11 Apr 2026

https://github.com/billgewrgoulas/recommendation-systems

Algorithms for joke rating prediction using the joke data-set from Kaggle.

algorithm clustering collaborative-filtering machine-learning numpy pandas recommender-system scikit-learn scypi

Last synced: 06 May 2026

https://github.com/mat-kubiak/manual-ml

DIY ML framework (using only NumPy!) + various demos

artificial-intelligence examples machine-learning ml ml-framework neural-networks numpy

Last synced: 06 May 2026

https://github.com/leftcoastnerdgirl/supervised_learning

This project demonstrates supervised machine learning using scikit-learn.

classification-reports confusion-matrix jupyter-notebook numpy pandas-python pathlib scikit-learn sklearn

Last synced: 16 Apr 2026

https://github.com/michaelgermini/python-3d-programming-book

Complete Python & 3D Programming Book: 30 chapters, 120+ examples, real-time ray tracing, ML graphics, OpenGL, NumPy

3d-graphics computer-graphics educational game-development graphics-programming machine-learning numpy opengl python ray-tracing tutorial

Last synced: 16 Apr 2026

https://github.com/laurencewarne/eratosthenes

Random implementations of the sieve of Eratosthenes

algorithm eratosthenes numpy python3

Last synced: 16 Apr 2026

https://github.com/lorenzorottigni/ml-titanic

Machine Learning python bootcamp: logistic regression on titanic dataset

ipynb logistic-regression machine-learning numpy pandas python scikit-learn seaborn

Last synced: 12 Apr 2026

https://github.com/littie28/hazy-frames

Hierarchical coordinate frames - crystal clear transforms

3d coordinate-systems geometry numpy python reference-frames scipy transformations

Last synced: 18 Apr 2026

https://github.com/shriansh8619/eda_customer_behavior

This project analyzes Nielsen transaction data using Python to uncover sales trends, customer preferences, and purchasing patterns. It provides insights to optimize inventory, create targeted promotions, and improve store performance. The goal is to help supermarkets enhance strategies and boost profitability based on data-driven insights

matplotlib numpy pandas python seaborn

Last synced: 16 Apr 2026

https://github.com/abhinav330/customer-behavior-analysis-linear-regression

This repository explores customer behavior data for an NYC clothing company with both a mobile app and website. They want to understand which platform drives higher sales.

data-analysis data-science data-visualization eda exploratory-data-analysis jupyter jupyter-notebook linear-regression machine-learning machine-learning-algorithms machinelearning-python numpy pandas python regression-analysis

Last synced: 06 May 2026

https://github.com/dr-rompecabezas/page-view-time-series-visualizer

A data science project built as part of the freeCodeCamp curriculum.

matplotlib numpy pandas python seaborn

Last synced: 16 Apr 2026

https://github.com/jay6430/python-for-data-science

This Repository includes all the Python concepts that are necessary for Data science.

csv data-visualization datascience-machinelearning html-css ipynb-jupyter-notebook matplotlib numpy pandas python3 xml

Last synced: 04 Mar 2026

https://github.com/mikma03/datascience_python_datacamp

DataScience with Python. Code and examples. Python libraries, including pandas, NumPy, Matplotlib, and many more.

data-analysis data-science datacamp datascience numpy pandas python

Last synced: 06 May 2026

https://github.com/shubhansu-kr/int213-python-programming

Python Programming | Semester 3 | Python | OOPS | Tkinter | Pandas | Numpy

lpu numpy oops-in-python pandas python sem3 tkinter-gui

Last synced: 10 May 2026

https://github.com/klihe/astar-visualization

I am trying to implement *a path finding algorithm and create a visualization of it with Python and numpy library

algorithm astar-algorithm numpy pathfinding pygame python visualization

Last synced: 06 Apr 2025

https://github.com/pthmhatre/smart-attendance-system-using-face-recognition

This project aims to create a smart attendance system using face recognition technology. The system captures and recognizes faces in real-time and marks attendance automatically, providing a seamless and efficient way to manage attendance records.

adafruit computervision css face-recognition iot iot-application javascript mysql numpy opencv php pickle python raspeberry-pi rfid

Last synced: 12 Apr 2026

https://github.com/rnuv/artificial-neural-network

A Simple 2 layered Artificial Neural Network library made from scratch in Python and NumPy. Fitted with a feed forward method and backpropagation.

neural-network numpy python tensorflow

Last synced: 18 Mar 2025

https://github.com/sundanc/besthousefinder

A Streamlit web application that helps users find the best house based on their preferences.

numpy pandas prediction python streamlit

Last synced: 17 Apr 2026

https://github.com/priyanshul28/ml_eda_regression_energyconsumptionforecasting

An EDA and Machine Learning Time-Series Regression Forecasting exercise on the PMJE Energy Consumption dataset demonstrating time-series analysis and the use of Time-Series Split, XGBoost, etc. The model is optimized using hyperparameter tuning through GridSearchCV. A Rob Mulla guided exercise.

forcasting machine-learning numpy pandas scikit-learn time-series-analysis

Last synced: 17 Apr 2026

https://github.com/pymivn/vinastats

Khám phá các sự thật ở Việt Nam với thống kê và số liệu từ tổng cục thống kê Việt Nam

numpy pandas statistics vietnam

Last synced: 02 Apr 2026

https://github.com/bilalm04/autonomous-lane-detection

An autonomous lane detection system leveraging OpenCV to process images and videos, accurately identifying lanes and providing key visualizations.

artificial-intelligence computer-vision lane-detection machine-learning matplotlib numpy opencv python

Last synced: 20 Jan 2026