An open API service indexing awesome lists of open source software.

NumPy

NumPy is an open source library for the Python programming language, adding support for large, multidimensional arrays, and matrices, along with a large collection of high-level mathematical functions to operate on these arrays.

https://github.com/xkomil/datasciencesummerstudy

I want to document in this repository all my studying data science topics

data-science data-visualization machine-learning meteostat numpy pandas seaborn sklearn streamlit-webapp

Last synced: 19 Oct 2025

https://github.com/bishopce16/movies-etl

The goal of this project was to create an automated pipeline for a company Amazing Prime that is hosting a hackathon.

json numpy pandas pgadmin4 psycopg2 python regular-expressions sql sqlalchemy time

Last synced: 30 Dec 2025

https://github.com/mgitrov/lifespan-x-population-status

A machine learning project aiming to predict animals' lifespan and population status.

bs4 data-science machine-learning matplotlib numpy pandas python regular-expressions requests seaborn sklearn

Last synced: 30 Dec 2025

https://github.com/adijo/multilayeredperceptron

An implementation of a multi-layered perceptron.

deep-learning machine-learning numpy

Last synced: 27 Mar 2025

https://github.com/raduldev/ml-projects

End To End Machine Learning Project guided by Krish Naik from Ineuron.

catboost dill flask-application numpy pandas python scikit-learn xgboost

Last synced: 30 Dec 2025

https://github.com/beyzabasarir/northwind-traders-analysis

Northwind dataset analysis using PostgreSQL, Python, and Power BI. Focused on sales, customers, shipping, and performance insights.

dashboard data-analysis data-visualization jupyter-notebook matplotlib numpy pandas postgresql powerbi python seaborn

Last synced: 30 Dec 2025

https://github.com/asuquoaa/energy-_gdp-_and_scimago_rankings_analysis-2006-2015

Analysis of top 15 countries in energy, renewables, and GDP using three global datasets.

data-cleaning numpy pandas

Last synced: 12 Sep 2025

https://github.com/hellenilda/ml-housing-price-prediction

Projeto simples para fazer previsão de preços de casas usando Regressão Linear.

artificial-intelligence machine-learning matplotlib numpy python sklearn streamlit

Last synced: 30 Dec 2025

https://github.com/linsanity03/algorithmic_trading

Trading strategy based on the intersection of short-term and long-term moving averages.

jupyter-notebook matplotlib-pyplot numpy pandas

Last synced: 23 Jun 2025

https://github.com/jpgiant/training_project

Analyzing whether there is a difference between the average death ages of left handers and right handers using Bayesian Conditional Probability Theorem.

bayesian-statistics data-analysis data-visualization numpy pandas-dataframe python

Last synced: 28 Oct 2025

https://github.com/murugavl/flower-prediction

Flower Prediction is a machine learning project that uses the Iris dataset to classify iris flowers into three species: Setosa, Versicolor, and Virginica. The project includes data analysis, model training with various algorithms, and deployment via a Flask web application for user-friendly predictions.

flask machine-learning matplotlib numpy pandas python scikit-learn seaborn

Last synced: 30 Dec 2025

https://github.com/gnikit/fileio

A template allowing to read multiple columns of tab or space delimited files. Similar functionality to numpy.loadtxt()

cpp file-io library numpy

Last synced: 27 Mar 2025

https://github.com/chubi-x/sort-it-image-classifier

Sort-it is an image classification application that classifies images of cats, dogs, bicycles, cars, and motorcycles.

image-classification machine-learning numpy pandas python streamlit tensorflow

Last synced: 30 Dec 2025

https://github.com/saivarun2611/potato_disease_classifier

This project is a Potato Disease Classifier that uses a Convolutional Neural Network (CNN) to classify potato leaves as healthy, suffering from early blight, or late blight. The model helps in early detection of diseases, which can be crucial in maintaining crop health and yield.

cnn keras matplotlib numpy pandas python3 tensorflow

Last synced: 23 Feb 2025

https://github.com/synthstellar/data-preprocessing-with-python

A data preprocessing repository focused on cleaning, transforming, and preparing datasets for machine learning tasks. It includes functions for handling missing values, scaling, encoding, and feature engineering for improved model performance.

data data-cleaning feature-engineering machine-learning numpy pandas preprocessing python scikit-learn

Last synced: 27 Mar 2025

https://github.com/tharushi11/titanic-data-analysis

This repository contains a simple prediction model and survival predictions based on the Titanic dataset from Kaggle, along with detailed visualizations using Power BI.

dax machine-learning numpy pandas powerbi sklearn titanic-survival-prediction visualization

Last synced: 27 Mar 2025

https://github.com/kevin-rsj/real-estate-investments

Sistema de scoring que clasifica ciudades francesas para inversión en segundas viviendas según perfil de riesgo(alto, moderado y bajo). Evalúa ratios clave en áreas como demanda, disponibilidad, infraestructura, demografía y precios.

bigquery data-analytics looker-studio numpy pandas python sklearn-library sql visualization

Last synced: 30 Dec 2025

https://github.com/danish-jamil-01/image-dehazer

Remove the haze from the image using dark channel prior and guided filter

dark-channel-prior guided-filter guided-filtering ndimage numpy open-cv opencv opencv-python pillow python scipy skimage tkinter

Last synced: 30 Dec 2025

https://github.com/andreazoccatelli/light_permanova

A lightweight implementation of PERMANOVA based on Euclidean distance from centroid

computervision numpy permanova pytorch statistics tabular-data

Last synced: 24 Dec 2025

https://github.com/kalwalt/pybind-test

A simple example with pybind and numpy with C++ code

cplusplus numpy pybind11 python

Last synced: 29 Dec 2025

https://github.com/sabbir45ali/cam-board

This is Python based project which is a virtual board and we write there using our finger and clear it and color changing.

numpy pandas python3

Last synced: 21 Feb 2025

https://github.com/tahir1072a/titanic-veri-analizi

Kaggle Titanic veri seti kullanılarak NumPy ve Pandas ile yapılan veri analizi projesi.

data-science kaggle numpy pandas python titanic-survival-prediction

Last synced: 30 Dec 2025

https://github.com/redinfinitypro/ai_projects

Rating (7/10) The code is a chatbot that interacts with users based on article content. However, issues include incorrect sorting, typos, incorrect indentation, and lack of explanations for each part, potentially affecting the bot's robustness and understanding of context.

newspaper nltk numpy python random sklearn string warnings

Last synced: 20 Jul 2025

https://github.com/rohansoni45/ipl-win-predictor

A machine learning model that predicts the outcome of IPL matches based on historical data and player performance. Built using Python and libraries like Pandas, NumPy, and scikit-learn. This project demonstrates the use of classification algorithms to predict match outcomes.

classification data-science ipl machine-learning model-evaluation numpy pandas python sklearn sports-analytics

Last synced: 28 Mar 2025

https://github.com/shefreenkaur/web-scraping-and-word-frequencies

This project analyzes word frequencies in BC Legislative documents using Stanford CoreNLP and Python. The program extracts text from PDF documents, processes it using natural language processing techniques, and generates a comprehensive word frequency analysis.

analytics chromedriver easyocr nlp numpy pandas pymupdf python selenium stanfordcorenlp webscraping wordfrequency

Last synced: 28 Mar 2025

https://github.com/id-andyyy/alfahack

📈💰 Investment propensity prediction model

catboost hackathon-project jupyter lightgbm numpy optuna pandas python scikit-learn

Last synced: 30 Dec 2025

https://github.com/ondiekelijah/data-analysis-udacity

Udacity Data Analysis Nano Degree course resources

jupyter-notebook matplotlib numpy pandas python

Last synced: 17 Mar 2025

https://github.com/orliluq/inmersion-datos-python

Desarrollar modelos de machine learning para predecir la probabilidad de incumplimiento crediticio de los clientes, utilizando diferentes algoritmos de clasificación (Regresión Logística, Árboles de Decisión, Random Forest, Naive Bayes).

colab-notebook numpy pandas python scikit-learn

Last synced: 30 Dec 2025

https://github.com/paulinhok14/csgo-datascience-project

📊 Analysis of CS:GO grenade usage patterns and their impact on match outcomes using data science and statistical methods.

matplotlib mlflow numpy python scikit-learn scipy seaborn

Last synced: 30 Dec 2025

https://github.com/abidhasanrafi/pharma-sales-analytics

A Streamlit-powered web application for analyzing pharmaceutical sales performance across teams, products, and territories.

matplotlib numpy pandas plotly sales-analysis scikit-learn seaborn streamlit

Last synced: 30 Dec 2025

https://github.com/jolivas51565/streamlit-hotkeys

Streamlit Hotkeys adds app-wide keyboard shortcuts to Streamlit apps, with edge-triggered events and a simple Python API for Ctrl/Cmd/Alt/Shift bindings. 🐙

base64 browser-game cmdk game-development hotkeys keybindings keyboard-input numpy pillow pixel-art png-encoding python realtime shortcuts streamlit streamlit-component streamlit-hotkeys ui

Last synced: 11 Sep 2025

https://github.com/volf52/deep-neural-net

A simple deep neural net class written to work with Numpy and Cupy

binarized-neural-networks binary-neural-networks bnn cupy deep-learning deep-neural-networks mnist numpy python python3

Last synced: 26 Mar 2025

https://github.com/jsnevt/regressao_logistica

Cálculos de probabilidades sobre uma eleição

matplotlib numpy pandas-dataframe python

Last synced: 26 Mar 2025

https://github.com/vigneshvaranasi/breast_cancer_detection

This project employs machine learning, focusing on Logistic Regression, to detect breast cancer using tumor-related features. The dataset is preprocessed, and the model achieves 100% accuracy on the test set. The goal is to gain insights into breast cancer factors and provide an effective detection solution.

jupyter machine-learning matplotlib numpy pandas python scikit-learn

Last synced: 30 Dec 2025

https://github.com/aksoni07/movie-recommendation

A hybrid movie recommendation system designed to deliver personalized and accurate suggestions by combining user preferences, item attributes, and collaborative patterns, ensuring a seamless and engaging experience.

clustering content-based-filtering data-analysis embeddings jupyter-notebook numpy ollaborative-filtering pandas personalization python recommendation-systems scikit-learn user-item-interactions

Last synced: 30 Dec 2025

https://github.com/varakanthamsaihruthikreddy975/champo_carpets_case_analysis_python

the champo carpets case study Analysis in python, this is still under development this will be a more comprehensive analysis than previous one in R. I have the complete version of code in R refer to this link - ( https://github.com/VarakanthamSaiHruthikReddy975/Champo_Carpets_Case_Study_Analysis)

case-analysis jupyter-notebook matplotlib matplotlib-pyplot numpy pandas python3 seaborn

Last synced: 26 Mar 2025

https://github.com/zalkklop/lvsm

Official code for "LVSM: A Large View Synthesis Model with Minimal 3D Inductive Bias" (ICLR 2025 Oral)

attention-mechanisms deep-learning github-config large-view-synthesis matplotlib multi-label-classification nltk numpy one-vs-rest python seaborn sql tensorflow transformers

Last synced: 09 Apr 2025

https://github.com/iv4n-ga6l/titanic-survival-prediction

a predictive model in Python that answers the following question: “using data relating to passengers (name, age, gender, socio-economic class, etc.), who has the greatest chance to survive?"

matplotlib numpy pandas python seaborn sklearn

Last synced: 26 Mar 2025

https://github.com/whiteshadow1234/guessthenumber

This is the python code for teaching high school students python

math numpy python3 random teaching-materials turtle

Last synced: 22 Feb 2025

https://github.com/shlok-nahar/mnist-cnn-classifier

This repository trains and evaluates three CNN models on MNIST, providing performance comparisons and 5 unique visualizations.

confusion-matrix graph heatmap-visualization json machine matplotlib mnist numpy precision-recall python receiver-operating-characteristic seaborn sklearn tensorflow

Last synced: 30 Dec 2025

https://github.com/tshhaider/revolutionizing-gold-rate-forecasting-for-small-businesses-with-machine-learning

The "Gold Price Prediction" project focuses on predicting the prices of gold using machine learning techniques. By leveraging popular Python libraries such as NumPy, Pandas, Scikit-learn (sklearn), Matplotlib, Seaborn, Random Forest Regressor, and others, this project provides a comprehensive solution for accurate price estimation.

data-science google-colab-notebook jupyter-notebook machine-learning matplotlib numpy pandas-dataframe python seaborn sklearn

Last synced: 15 Oct 2025

https://github.com/erikbrinkman/homeotopy

A python library for computing homeomorphisms between some common continuous spaces

homeomorphism numpy

Last synced: 10 Apr 2025

https://github.com/13anush/python-libraries-

A collection of essential Python libraries—NumPy, Pandas, Matplotlib, and Seaborn—perfect for anyone starting out in data analysis.

data-analysis matplotlib numpy pandas python seaborn

Last synced: 09 Apr 2025

https://github.com/professornova/ann-scratch

This repository implements a simple Artificial Neural Network (ANN) from scratch using only NumPy.

artificial-intelligence neural-network numpy scratch

Last synced: 03 Apr 2025

https://github.com/kavayk29/speech-recognition-using-tdnn-and-data-augmentation

Developed a speech recognition system using TDNN, preprocessing audio, extracting MFCC features, and training the model. Fine-tuning with augmented data (19,000 rows) improved accuracy from 9% to 80% training and 40% validation. Data augmentation proved crucial for enhancing model performance and generalization. Still working to increase the acc.

deep-learning keras-tensorflow numpy os pandas tdnn tensorflow

Last synced: 01 Sep 2025

https://github.com/vaishnavis03/finlatics_ml_program

This repository contains the .ipynb files for 3 datasets, along with a PPT for each. The datasets included are Facebook Marketplace Data, Sales Prediction Data, and Wine Quality data.

correlation data-analysis data-science data-visualization knn linear-regression machine-learning matplotlib numpy pandas random-forest-classifier scikit-learn

Last synced: 17 Mar 2025

https://github.com/abolfazl-younesi/levy-walk

This repository contains a Python implementation of the Lévy Walk simulation, a random walk model with a power-law step length distribution. The code simulates multiple walkers with boundary conditions and visualizes their trajectories for different values of the power-law exponent.

levy-walks matplotlib numpy python random-walk

Last synced: 12 Dec 2025

https://github.com/nova-land/kalmanfilter-python

Kalman Filter in PyTorch and Numpy

kalman-filter numpy python pytorch

Last synced: 21 Mar 2025

https://github.com/nickklos10/league-of-legends-match-predictor

Predicting League of Legends Match Outcomes Using Logistic Regression and PyTorch.

jupyter-notebook matplotlib numpy pandas pytorch scikit-learn seaborn

Last synced: 30 Dec 2025

https://github.com/rahulsm20/insurance-data

A data analytics project dealing with risk assessment and it's effects in health insurance.

data-analysis data-analytics machine-learning matplotlib numpy pandas python scikit-learn

Last synced: 24 Feb 2025

https://github.com/djpetti/pyunits

Unit-aware Python with dimensional analysis support.

dimensional-analysis numpy python36 python37 python38 scientific-computing unit-conversions

Last synced: 17 Jan 2026

https://github.com/david20002/pycamp

![pycamp logo](assets/pycamp.png)pycamp is a command-line tool that fetches random Bandcamp albums from your chosen genre. 🎸 With two modes and support for any genre tag, it offers a quick way to discover new music. 💻

audio automation bandcamp beginner-friendly browser-automation command-line-tool machine-learning music-discovery numpy numpy-arrays numpy-library numpy-neural-network pandas pandas-library pandas-python playwright random tutorial

Last synced: 15 Jun 2025

https://github.com/rahulsm20/storedata

A data analysis project aimed at analyzing the sales data of the super store and providing useful insight into customer preferences.

data-analysis matplotlib numpy pandas python streamlit

Last synced: 24 Feb 2025

https://github.com/kahleryasla/signal-convolution

implementation of signal convolution in Python using both a custom function and NumPy's built-in convolve function. Also includes a function for recording and playing audio using the sounddevice library.

audio-recorder convolution matplotlib numpy python signal-processing sounddevice

Last synced: 02 Mar 2025

https://github.com/aqueeqazam/student-placements-using-machine-learning-and-flask

Through the use of Flask and machine learning, I show how to implement the student placement prediction model.

flask machine-learning numpy pandas

Last synced: 14 Jul 2025

https://github.com/mnitin-reddy/a-b-testing-and-regression-analysis-for-ad-performance-optimization

Analyzed the performance of Facebook and AdWords ads using A/B testing and regression analysis to identify trends, correlations, and cost-effectiveness. Key insights included distribution of clicks and conversions, monthly trends, and cost-per-conversion analysis to optimize ROI.

abtesting data-science hypothesis-testing machine-learning matplotlib numpy pandas scikit-learn scipy seaborn statsmodels

Last synced: 30 Dec 2025

https://github.com/ledsouza/python-powerbi

Desenvolver uma dashboard para análise do mercado financeiro utilizando Python e Power BI

etl matplotlib numpy pandas powerbi python seaborn

Last synced: 30 Dec 2025

https://github.com/tasninanika/k-means-clustering

An interactive and insightful customer segmentation project using K-Means Clustering.

matplotlib numpy pandas plotly python3 scikit-learn seaborn

Last synced: 30 Dec 2025

https://github.com/lmizner/grokking_data_science

Coding practice for basic data science interview questions in Python

data-science numpy pandas python scikit-learn

Last synced: 30 Dec 2025

https://github.com/praju-1/data_science_projects

It contains the necessary code, datasets, and documentation to understand, replicate, and build upon the project's findings and methodologies.

data-science datasets deep-learning exploratory-data-analysis machine-learning matplotlib numpy pandas predictive-analytics python seaborn sklearn statistics visualization

Last synced: 30 Dec 2025

https://github.com/lyubov0406/data_analyst_portfolio

В репозитории собраны пет-проекты, демонстрирующие мои навыки в аналитике данных

data-analysis matplotlib numpy pandas portfolio python scipy seaborn sql tableau visualization

Last synced: 30 Dec 2025

https://github.com/nikhilfuke1/movies-recommendation-system-python-project-

This project is a simple movie recommendation system built using Python. It suggests movies based on the similarity of titles and a predefined similarity score matrix.

difflib numpy pandas python sklearn

Last synced: 30 Dec 2025

https://github.com/chuvalniy/tulia

Contains self-implemented Machine Learning algorithms using only numpy.

boosting from-scratch knn learning logistic-regression machine ml numpy python random-forest regression sklearn testing xgboost

Last synced: 03 Apr 2025

https://github.com/darkdk123/handwashing-discovery-analysis

A Guided Project in a Boot camp to Analyse the Original Data used in the Discovery of Viruses & Hand Washing By Dr. Ignaz Semmelweis in Vienna General Hospital in the 1840s.

data-analysis data-science data-visualization matplotlib-pyplot numpy pandas plotly-python python seaborn-plots

Last synced: 30 Dec 2025

https://github.com/hoyirul/flask-starter-fs

Flask-Starter-FS is a concise full-stack starter kit for Flask web development, offering quick setup and pre-configured settings for seamless integration of front-end and back-end components. Accelerate your full-stack web application development with Flask-Starter-FS.

flask machine-learning numpy pandas python sklearn

Last synced: 24 Jun 2025

https://github.com/mohamed51152/algorithmic-trading-bot

The project aims to evaluate the predictive performance of different machine learning (ML) algorithms for Bitcoin trading. The proposed trading strategy integrates key technical indicators, including the Relative Strength Index (RSI), Simple and Exponential Moving Averages, and the Moving Average Convergence Divergence (MACD).

lightgbm machine-learning matplotlib mlflow numpy optuna pandas pickle random-forest scikit-learn scipy seaborn statsmodels xgboost

Last synced: 10 Apr 2025

https://github.com/yashsonaar/machine-learning-tasks

This repository has machine learning tasks which include classification, recommendation system, fraud detection system

classification jupyter-notebook machine-learning numpy pandas prediction python scikit-learn testing

Last synced: 30 Dec 2025

https://github.com/aminkhavari78/-maternal-health-risk-data-set

In this part I'm working on Maternal Health Risk Prediction

data-science linearsvc machine-learning matplotlib numpy pandas scatter-plot sklearn

Last synced: 24 Jun 2025

https://github.com/wondermongering/nonlineardatagen

A flexible Python framework for generating, fitting, and visualizing noisy nonlinear data. Perfect for educational purposes, algorithm testing, and demonstrating statistical concepts. Includes tools for various noise models, custom function fitting, robust error metrics, and publication-quality visualizations

curve-fitting data-simulation data-visualization education educational-tool interpolation mathematical-modelling matplotlib noise-modeling numpy regression-analysis scientific-computing scipy signal-processing statistics teaching-resource time-series-analysis

Last synced: 22 Aug 2025

https://github.com/harishkumaarhk/algorithmic-trading-bot

The-project-aims-to-evaluate-the-predictive-performance-of-different-machine-learning-(ML)-algorithms-for-Bitcoin-trading.-The-proposed-trading-strategy-integrates-key-technical-indicators,-including-the-Relative-Strength-Index-(RSI),-Simple-and-Exponential-Moving-Averages,-and-the-Moving-Average-Convergence-Divergence-(MACD).

lightgbm machine-learning matplotlib mlflow numpy optuna pandas pickle random-forest scikit-learn scipy seaborn statsmodels xgboost

Last synced: 10 Apr 2025