An open API service indexing awesome lists of open source software.

NumPy

NumPy is an open source library for the Python programming language that adds support for large, multidimensional arrays and matrices, along with a large collection of high-level mathematical functions to operate on these arrays.

https://github.com/manik2000/ml-algorithms

Own NumPy-based implementations of some ML algorithms.

machine-learning-algorithms numpy own-implementation

Last synced: 25 Mar 2025

https://github.com/noluyorabi/bravais-pearson-interactive

Bravais-Pearson Interactive – A web app for visualizing Pearson correlation. Generate datasets, analyze correlations, and explore regression lines, r², and p-values interactively.

data-visualization interactive-analysis matplotlib numpy pearson-correlation scipy seaborn statistics streamlit

Last synced: 25 Mar 2025

https://github.com/05afreen/indian-job-market-analysis

This repository focuses on 🔍 scraping, 🧹 cleaning, and 📊 visualizing job market data in India. It automates job listing collection from platforms like 🌐 Naukri, Indeed, and LinkedIn using 🐍 BeautifulSoup/Selenium. The scraped data is then 🛠️ cleaned and standardized, handling ❌ missing values, 🏷️ formatting inconsistencies, and 🔎 extracting

numpy pandas powerbi selenium

Last synced: 30 Dec 2025

https://github.com/mayankmittal29/tensortinker_statistical_methods_in_ai

This repository contains implementations of various machine learning algorithms from scratch, including Multi-Layer Perceptron (MLP), Gaussian Mixture Models (GMM), Principal Component Analysis (PCA), Autoencoders, and Variational Autoencoders.

autoencoder-mnist cupy gmm-clustering image-segmentation matplotlib-pyplot mlp-classifier mlp-regressor mnist-dataset numpy pandas pca python3 pytorch roc-auc seaborn torch variational-autoencoder

Last synced: 30 Dec 2025

https://github.com/batthulavinay/football-match-data-analysis

This project focuses on analyzing football match data to uncover insights, trends, and performance metrics. The analysis is conducted using Python in a Jupyter Notebook environment.

datacleaning datavisualization exploratory-data-analysis jupyter-notebook matplolib numpy pandas seaborn

Last synced: 13 Apr 2025

https://github.com/batthulavinay/ev-population

This repository contains a Jupyter Notebook focused on analyzing Electric Vehicle (EV) population data. The notebook includes data visualizations, exploratory analysis, and key insights.

data-science datacleaning datapreprocessing datavisualization jupyter-notebook matplotlib numpy pandas seaborn

Last synced: 13 Apr 2025

https://github.com/batthulavinay/genz_datingapp-eda-and-ml

This project focuses on analyzing data from a GenZ Dating App to uncover insights, trends, and predictive models. The analysis is conducted using Python in a Jupyter Notebook environment.

classification datacleaning datapreprocessing datavisualization exploratory-data-analysis jupyter-notebook matplotlib numpy pandas regression-models scikit-learn seaborn

Last synced: 13 Apr 2025

https://github.com/alekiie/streamlit-dashboard

A dashboard that uses Streamlit charts to create intuitive, easy-to-understand data visualizations.

data-visualization matplotlib numpy pandas python3 streamlit

Last synced: 27 Nov 2025

https://github.com/rafali25/perceptron-algorithm

A simple implementation of the Perceptron algorithm using numpy. This project demonstrates how to classify data points by iteratively updating weights and biases based on misclassified samples. Perfect for understanding the fundamentals of linear classification!

machine-learning numpy perceptron-neural-networks pycharm-ide python

Last synced: 01 Apr 2025
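
The update rule that the description above mentions (adjusting weights and bias whenever a sample is misclassified) can be sketched as follows. This is a minimal illustration and not the repository's code: the function names `train_perceptron` and `predict`, the learning rate, the {-1, +1} label convention, and the toy data are all assumptions made for the example.

```python
import numpy as np

def train_perceptron(X, y, lr=1.0, epochs=100):
    """Classic perceptron rule; labels in y are assumed to be -1 or +1."""
    w = np.zeros(X.shape[1])  # one weight per feature
    b = 0.0                   # bias term
    for _ in range(epochs):
        errors = 0
        for xi, yi in zip(X, y):
            # A sample is misclassified (or on the boundary) when the
            # signed score yi * (w . xi + b) is not strictly positive.
            if yi * (xi @ w + b) <= 0:
                w += lr * yi * xi
                b += lr * yi
                errors += 1
        if errors == 0:  # a full pass with no mistakes: stop early
            break
    return w, b

def predict(X, w, b):
    """Return +1 for points on the non-negative side of the boundary, else -1."""
    return np.where(X @ w + b >= 0, 1, -1)

# Hypothetical, linearly separable toy data.
X = np.array([[2.0, 1.0], [3.0, 2.0], [-1.0, -1.5], [-2.0, -1.0]])
y = np.array([1, 1, -1, -1])

w, b = train_perceptron(X, y)
preds = predict(X, w, b)
```

The loop only terminates early when the data are linearly separable; otherwise it stops after `epochs` passes.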

https://github.com/moonmoonsamal/data-driven-google-ads-for-listing-sites-analysis

Analyzed Google Ads performance to identify top channels, keywords, and geographical impact

cleaning-dataset eda manipulation modelling numpy visualization

Last synced: 25 Mar 2025

https://github.com/jkosla/neural_network_from_scratch_numpy

Neural Network From Scratch in Python | Build a simple neural network from scratch using pure Python and NumPy. Learn about forward propagation, backpropagation, and training with gradient descent. Accompanies my Medium article.

ai aritificial-intelligence medium nerual-networks numpy python3 tutorial

Last synced: 02 Apr 2025

https://github.com/aleksandarbuk/machine-learning

The Machine Learning Library repository provides a collection of scripts and tools leveraging Scikit-Learn, Pandas, and NumPy for various machine learning tasks and data analysis.

matplotlib numpy python scikit-learn tensorflow

Last synced: 24 Mar 2025

https://github.com/bunu23/image-classification

This repository contains a notebook implementing a Convolutional Neural Network for multi-class image classification using transfer learning with a pre-trained ResNet-50 model. Covers dataset handling, model architecture customization, training, evaluation, fine-tuning, and external image prediction.

keras matplotlib numpy pil python tensorflow

Last synced: 31 Dec 2025

https://github.com/pinedah/sleep-data-analysis-exercise

Analysis of a medical dataset on sleep, exploring duration, quality, and related factors. Includes data cleaning, EDA, and visualizations with Python (pandas, numpy, matplotlib, seaborn, scipy).

data-analysis data-science escom numpy pandas python school-project scipy

Last synced: 31 Dec 2025

https://github.com/nirmaldeepponnada/codeclauseinternshipproject1

This project involves Customer Segmentation using K-Means clustering to group customers based on Recency, Frequency, and Monetary (RFM) analysis from the Online Retail dataset. It also performs Sentiment Analysis on Amazon Product Reviews using Natural Language Processing techniques & Logistic Regression to classify reviews as positive or negative.

kmeans logistic-regression numpy pandas python3 regular-expressions scikit-learn tf-idf-vectorizer

Last synced: 31 Dec 2025

https://github.com/arjunan-k/netflix_analysis

Exploratory Data Analysis on Netflix Movies & TV Shows.

matplotlib numpy pandas plotly seaborn

Last synced: 26 Feb 2025

https://github.com/rakesh3596/tutedude_projects

Consists of 3 major projects: (1) Covid19 Analysis, (2) Rainfall Prediction, (3) Tumor Detection.

csv-files matplotlib numpy pandas plotly python sklearn-library

Last synced: 20 Feb 2025

https://github.com/waghraj1699/codeclauseinternship_crop_disease_identification

Crop disease detection app that predicts whether a given crop image shows disease or not.

cnn deep-learning machine-learning numpy pandas transfer-learning

Last synced: 28 Mar 2025

https://github.com/vasulab/knightshock

Shock tube experiment planning and data analysis package.

cantera data-analysis matplotlib numpy shock-tube

Last synced: 18 Jul 2025

https://github.com/perpendicooler/elementary-research-for-steamboat-willie-s-store-in-poland

Elementary research for a company opening a store in a city, using Gurobi and PuLP optimization.

christofides-algorithm gurobipy numpy pandas pulp python3 scikit-learn travelling-salesman-problem

Last synced: 31 Dec 2025

https://github.com/yevhenbk/arc-length-calculator

This app calculates the arc length of a given function over the interval [0, 4] using numerical integration

numpy python scipy

Last synced: 29 Mar 2025
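
For a differentiable function f, the arc length over [0, 4] is the integral of sqrt(1 + f'(x)^2) from 0 to 4. The sketch below approximates that integral with a composite trapezoidal rule. It is an illustration only and not the app's implementation: the function `arc_length`, the step count, and the example curve f(x) = x^(3/2) are assumptions chosen because that curve has a closed-form length to compare against.

```python
import numpy as np

def arc_length(df, a, b, n=10_000):
    """Approximate the arc length of a curve on [a, b].

    df is the derivative f'(x), supplied by the caller (an assumption of
    this sketch); the integrand is sqrt(1 + f'(x)^2).
    """
    x = np.linspace(a, b, n + 1)
    g = np.sqrt(1.0 + df(x) ** 2)
    h = (b - a) / n
    # Composite trapezoidal rule: endpoints get half weight.
    return h * (g.sum() - 0.5 * (g[0] + g[-1]))

# Hypothetical example curve: f(x) = x**1.5, so f'(x) = 1.5 * sqrt(x).
df = lambda x: 1.5 * np.sqrt(x)

length = arc_length(df, 0.0, 4.0)

# Closed form for this particular curve: (8/27) * (10**1.5 - 1).
exact = (8.0 / 27.0) * (10.0 ** 1.5 - 1.0)
```

Comparing `length` with `exact` gives a quick check on the step count; a library routine such as an adaptive quadrature could replace the hand-written rule.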

https://github.com/montanaz0r/suicide-rate-analysis

Testing the significance of the correlation between the suicide rate and the number of psychiatrists and psychologists working in the mental health sector.

analysis correlation data data-analysis data-science jupyter-notebook jupyter-notebooks matplotlib numpy pandas psychology python python-3 seaborn statistics suicide-rate

Last synced: 01 Apr 2025

https://github.com/dineshdhamodharan24/singapore_flat_resale_

This project focuses on developing a machine learning model to predict the resale values of apartments in Singapore. The goal is to create a user-friendly online application that enables users to obtain accurate predictions for the resale values of specific properties.

data-analysis flat json numpy pandas pickle project python streamlit

Last synced: 23 Apr 2025

https://github.com/andres29ac/website_heatmaprisk

Map built with Streamlit and Folium to show the distribution of crime in each district of Trujillo using heat maps.

folium-maps heatmap-visualization numpy pandas ploty python stadistics streamlit

Last synced: 24 Feb 2025

https://github.com/alejoduarte23/bs6472

Serviceability assessment as per BS 6472 for human exposure to vibration in buildings (vibration sources other than blasting).

matplotlib numpy pydantic scipy

Last synced: 07 Jan 2026

https://github.com/aka-sh11/chat-bot

Build your own A.I. chatbot.

ai chatbot numpy python pytorch

Last synced: 28 Mar 2025

https://github.com/reckonsys/datascience-docker

A docker image with numpy, scipy, nltk and pandas pre-installed with pip

docker docker-image dockerfile nltk numpy pandas scipy

Last synced: 10 Mar 2025

https://github.com/tim-w-james/python-data-reference-code

Python reference code for data visualization, machine learning and SQL

data-science jupyter-lab numpy python

Last synced: 09 Apr 2025

https://github.com/dvamishkilapa/xolm2_py

Ported Xolm2 VSU

mathplotlib numpy pyqt5

Last synced: 23 Oct 2025

https://github.com/yesenvidnath/facerecognito-pythonfacialrecognitionapplication

A web application that detects faces from image links supplied in an Excel file. The project uses PyTorch and Flask.

facent flask numpy python

Last synced: 31 Dec 2025

https://github.com/yash22222/analysis-of-amcat-aspiring-minds

Analyzing employment outcomes for engineering graduates based on a provided dataset. Specifically, it aims to verify a claim regarding the salary range for fresh graduates in certain engineering roles and investigate potential relationships between gender and specialization preferences.

amcat aspiring-minds charts computer-engineering data-analytics data-vizualisation dataset graph machine-learning matplotlib numpy pandas python seaborn

Last synced: 16 Oct 2025

https://github.com/shekharkram/project

A collection of data analytics projects showcasing skills in data cleaning, exploration, visualization, and basic SQL queries. Designed to demonstrate entry-level data analyst competencies using real-world datasets and tools.

datacleaning excel jupyter-notebook mysql numpy pandas postgresql python sql

Last synced: 24 Dec 2025

https://github.com/celalaygar/python-experiences

How to use numpy, map, response, opencv, and pandas on Python 3+. Contains many simple projects to help everybody understand how to use Python.

cv2 numpy opencv pandas pandas-dataframe python response

Last synced: 24 Feb 2025

https://github.com/soumyapro/heart-disease-prediction

This project is aimed at predicting the likelihood of coronary heart disease (CHD) in individuals over the next ten years using Logistic Regression.

logistic-regression numpy pandas simpleimputer sklearn

Last synced: 21 Jan 2026

https://github.com/saurabh9136/data-analysis_using_pandas-numpy

A beginner-friendly repository exploring data analysis using NumPy and Pandas. Covers fundamental operations, data manipulation, and real-world dataset analysis.

data-analysis-python numpy pandas python scipy

Last synced: 05 Apr 2025

https://github.com/nakshjainsonigara/football-eda

This EDA offers a comprehensive exploration of football analytics, leveraging Python libraries to analyze players, club games, and various other aspects of the sport. Through statistical analysis and visualization, we uncover insights into player performance, club strategies, and league dynamics providing valuable insights for football enthusiasts.

dash exploratory-data-analysis matplotlib numpy pandas plotly python scipy seaborn statistics

Last synced: 07 Jan 2026

https://github.com/manorohith-03/attendance-system-using-face-recognition

The system leverages the Python programming language and several powerful libraries for image processing and face recognition. The ultimate goal is to provide a more efficient, accurate, and user-friendly attendance tracking solution.

face-recognition numpy opencv pandas python-3

Last synced: 12 May 2025

https://github.com/ieCecchetti/Python_ML_DL_examples

A variety of Machine Learning and Deep Learning scripts in Python. Some theoretical background is included in the README.

bayes-classifier bayesian-statistics deep-learning kernel machine-learning matplotlib neural-network numpy pandas pca pca-analysis python scikitlearn-machine-learning scipy shi

Last synced: 10 Mar 2025

https://github.com/satyamvyas04/masteringpython

MasteringPython features programs that reflect my growing proficiency and ongoing journey with the Python programming language.

jupyter-notebook machine-learning ml numpy openpyxl python3 tkinter

Last synced: 01 Apr 2025

https://github.com/hari7261/playwithdata-python

A repository where I have collected many data science and machine learning questions and their solutions. I hope you will find something better here than on some other platforms. Thank you, and happy exploring!

data-analysis data-science data-science-learning machienlearning matplotlib matplotlib-python ml numpy numpy-arrays numpy-library pandas pandas-dataframe pandas-library python python-script sklearn

Last synced: 31 Dec 2025

https://github.com/adarshpheonix2810/fake-job-post-detection

This project focuses on detecting fake job posts using machine learning. Fake job advertisements are often created to scam individuals by stealing personal information or money.

data-analysis deep-learning joblib machine-learning nlp-machine-learning numpy pandas python scikit-learn tkinter

Last synced: 31 Dec 2025

https://github.com/sultanazhari/customer-habit-analysis-model

Megaline company wants to develop a model that can analyze consumer behavior and recommend one of Megaline's two new plans: Smart or Ultra. In this classification task, we need to develop a model that is able to choose the right package

accuracy-score decision-tree-classifier logistic-regression matplotlib-pyplot numpy pandas python3 random-forest-classifier seaborn train-test-using-sklearn

Last synced: 31 Dec 2025

https://github.com/sa001gar/password-manager-on-the-top-of-streamlit

Secure Password Manager is a Python-based application developed by Sagar Kundu, designed to provide users with a secure and convenient way to manage their passwords. With features like user authentication, password generation, and secure storage using bcrypt and MongoDB, it ensures the safety of sensitive information while offering an intuitive use

bcrypt gui-application mongodb numpy numpy-library pandas-dataframe pandas-library pandas-python password-generator password-manager pymongo python python3 streamlit-application streamlit-webapp urllib

Last synced: 01 Apr 2025

https://github.com/sanjanahombal/study-on-sentiment-analysis

This project explores the optimal combination of Bag-of-Words and TF-IDF vectorization with Naive Bayes and SVM for sentiment analysis. It evaluates performance using accuracy, precision, recall, and F1-score, addressing ethical concerns like data privacy and bias to improve sentiment classification in real-world applications.

bag-of-words confusionmatrix googlecollab gridsearch-crossvalidation matplotlib-pyplot naive-bayes-classifier numpy pandas seaborn sklearn svm-classifier tfidf-vectorizer

Last synced: 07 Jan 2026

https://github.com/rakesh3596/apple-phone-sales-analysis

Apple Phone Sales Analysis of an Apple CSV file using Python tools like Pandas, Matplotlib, NumPy, and Plotly.

csv-files csv-reader matplotlib numpy pandas plotly python

Last synced: 18 Aug 2025

https://github.com/rosacarla/projetodio_covid19

The project develops models with Python and Machine Learning to predict the evolution of COVID-19 in Brazil.

etl machine-learning numpy pandas python

Last synced: 24 Feb 2025

https://github.com/manishkumarpatel07/heartattack_risk_prediction

"Heart Attack Risk Prediction" uses machine learning to estimate the likelihood of a heart attack based on user-provided data like physical attributes, symptoms, and medical history. This system enables remote screening, identifying high-risk individuals, and easing medical system burdens by providing early, data-driven health risk assessments.

boruta knn-algorithm matplotlib numpy pandas python scikit-learn

Last synced: 31 Dec 2025

https://github.com/omogbolahan94/lego-analysis

Analysis of LEGO data from 1949 to 2021.

matplotlib merge numpy pandas

Last synced: 09 Apr 2025

https://github.com/biocpy/mopsy

Matrix operations

matrix numpy scipy

Last synced: 23 Nov 2025

https://github.com/hansalemaos/ffmpeg_stream_to_numpy

ffmpeg stream to numpy arrays

cv2 ffmpeg numpy python stream video

Last synced: 28 Nov 2025

https://github.com/bbergerud/tenops

Creating a common interface for numpy, pytorch, tensorflow, etc.

numpy pytorch tensorflow

Last synced: 20 Jan 2026

https://github.com/samaalharbi2/100-days-of-code

This repository is a collection of 100 Python projects built as part of the "100 Days of Code: Python Developer Challenge" course.

data-science game-development machine-learning numpy pandas python udemy

Last synced: 05 Apr 2025

https://github.com/eesunmoon/genai_cor-recom

[Project] Outfit Coordination Recommender System using KoAlpaca

data-analysis fine-tuning generative-ai huggingface keyword llm numpy pandas python python3 selenium

Last synced: 30 Dec 2025

https://github.com/ankit21111/carpredict

This project predicts car prices using machine learning models, including Simple and Multiple Linear Regression. It covers data acquisition, feature selection, and optimization techniques like Ridge Regression. The best model, Multiple Linear Regression, achieved an R² score of 0.84. Check out the full analysis in the repository!

data-analysis data-visualization matplotlib numpy pandas pyhton scipy seaborn sklearn

Last synced: 13 Mar 2025

https://github.com/theoldvalyria/analysis_of_orders

Intermediate project for the "Data Analyst" course (KARPOV.COURSES)

jupyter-notebook matplotlib numpy pandas python seaborn

Last synced: 10 Apr 2025

https://github.com/sejalmankar1012/customer_churn

The goal of this project was to develop a machine learning model that predicts customer churn based on historical customer data.

dataset deep-learning deployment machine-learning model-building modeling numpy optimization pandas seaborn streamlit tensorflow

Last synced: 31 Dec 2025

https://github.com/mairagalvao/steak_risk_data

A data analysis based on correlations and comparisons among the samples.

database matplotlib numpy pandas python3 sql

Last synced: 26 Jan 2026

https://github.com/chenshuliu/python-guide-to-numpy

This repository is dedicated to explaining the functionalities of Numpy in Python

data-structures numpy python

Last synced: 18 Jan 2026

https://github.com/arv-anshul/campusx-graphs

Contains all the graphs provided by CampusX to learn Statistics and Algebra, with some added features.

algebra campusx-dsmp data-science matrix numpy statistics streamlit vizualisation

Last synced: 14 May 2025

https://github.com/supriya811106/twitter-sentiment-analysis

Analyzing the mood of tweets! We sort tweets on popular topics into positive, negative, or neutral categories to gauge public opinion. See what Twitter really thinks!

bernoulli-naive-bayes jupyter-notebook matplotlib nlp-machine-learning nltk numpy pandas python scikit-learn seaborn sentiment-analysis text-classification tfidf-vectorizer wordcloud

Last synced: 30 Dec 2025

https://github.com/22p31a0512/sentimental-analysis

Build a model to classify text as positive, negative, or neutral. Apply NLP techniques for preprocessing and machine learning for classification. Aim for accurate sentiment prediction on various text formats.

feature-engineering machine-learning natural-language-processing numpy pandas python supervised-learning text-classification text-processing tokenization

Last synced: 23 Apr 2025

https://github.com/matteopolak/chatterbox

RNN-based chat message generator.

numpy pandas python rnn tensorflow text-generation

Last synced: 12 May 2025

https://github.com/sharif-minhaz/rag-system

Ingest and vectorize content upon publication, store the embeddings, then retrieve and augment user queries with context to generate high-quality responses.

faiss flask mysql2 nodejs numpy rag react transformers

Last synced: 14 Jul 2025

https://github.com/ianisdev/lab_ms_ai_programing_for_ai

Lab work for Programming for AI

kafka numpy pandas python

Last synced: 21 Mar 2025

https://github.com/mr-chang95/twitter_datawrangling

Twitter Data Wrangling for Udacity's Data Analyst Nanodegree Program

data-visualization data-wrangling dogs matplotlib numpy pandas python twitter

Last synced: 30 Dec 2025

https://github.com/hajaarh/health_state_classifier

This project aims to analyze a dataset containing various indicators of health, well-being, and daily life in order to classify individuals into three categories according to their state of health.

numpy pandas python seaborn sklearn

Last synced: 03 Mar 2025

https://github.com/mr-chang95/datascience_airbnb

Data Science Project for Udacity's Data Scientist Program. Using Python in Jupyter Notebook.

airbnb data-analysis data-science data-visualization jupyter-notebook numpy pandas python sklearn

Last synced: 30 Dec 2025

https://github.com/udhaya2823/dataspark-illuminating-insights-for-global-electronics

✨DataSpark✨ is a powerful analytics project transforming raw retail data into actionable insights for Global Electronics. By leveraging Python, SQL, and interactive visualizations, it uncovers trends in customer behavior, sales performance, and product popularity, driving smarter business decisions and boosting growth.

data-science data-visualization database-management datacleaning exploratory-data-analysis matplotlib numpy pandas powerbi python seaborn sql version-control

Last synced: 17 Jul 2025

https://github.com/Mike014/SoundBot

This repository will contain the prototype of a chatbot created for audio playback, or anything related to audio in video games and beyond.

chatbot dsp nlp nlp-library nlp-machine-learning numpy python scipy sound-processing

Last synced: 12 Sep 2025

https://github.com/mchenryspagg/economic-variable-correlation-and-productivity-model

This report provides an in-depth analysis of Nigeria's economic performance over the past decade, spanning from 2014 to 2024.

correlation exchangerates inflation moneysupply numpy productivity python regression regression-analysis regression-models visualization

Last synced: 10 Sep 2025

https://github.com/mihirh19/uber-analysis

Uber's 2016 dataset analysis offers insightful information about the company's operations and user behaviour. Patterns and trends can be discovered by looking at variables like trip time, distance travelled, and pickup/drop-off locations. Uber may use this information to detect high-demand locations, increase overall efficiency, and optimise driver

matplotlib numpy opendatasets pandas seaborn

Last synced: 11 Aug 2025

https://github.com/mariam-zaidi/bike-sharing-prediction-model

Build a multiple linear regression model for the prediction of demand for shared bikes.

linear-regression matplotlib numpy pandas rfe seaborn sklearn

Last synced: 24 Aug 2025

https://github.com/chandkund/predicting-heart-disease

Welcome to the Heart Disease Prediction project! 🩺 This project focuses on developing a predictive model to assess heart disease risk based on health indicators like age, cholesterol levels, and blood pressure. By analyzing these features, we aim to create an effective tool for early diagnosis and heart disease prevention

machine-learning matplotlib numpy pandas python seaborn sklearn

Last synced: 18 Jan 2026