An open API service indexing awesome lists of open source software.

NumPy

NumPy is an open source library for the Python programming language, adding support for large, multidimensional arrays and matrices, along with a large collection of high-level mathematical functions to operate on these arrays.

https://github.com/glbter/data-analysis-py

Data analysis labs, made 50/50 with Oleh Tonkoshkur.

numpy pandas python3 scipy-stats

Last synced: 12 Jun 2026

https://github.com/vuanhtuan1012/data-scientist-with-python

Notes on career track "Data Scientist with Python" at DataCamp

importing-data matlab numpy pandas python3 sqlalchemy

Last synced: 09 Apr 2026

https://github.com/nas-research/knowledge-model

Our knowledge system systematically ingests, processes, and indexes open-access life science publications. It supports internal research by providing precise question-answering and efficient retrieval from a continuously updated repository of scientific literature

accelerate aws boto3 dataingestion keras lifesciences llama llama3 llm numpy pymupdf pytorch researchsupport sqlalchemy tensorflow textextraction

Last synced: 10 Apr 2026

https://github.com/ledsouza/medphys-bi

Application developed for data management and analysis for the Medical Physics Service in Nuclear Medicine at HCPA

aws boto3 cloud-computing cloud-run docker dotenv gcp mongodb numpy os pandas plotly pymongo python s3 streamlit terraform vitrinedev yaml

Last synced: 10 Apr 2026

https://github.com/elam-parithi/singapore_flatprice_predicting

Flat price prediction with machine learning tools and Python.

matplotlib numpy pandas scikit-learn seaborn

Last synced: 10 Apr 2026

https://github.com/ankitsharma-tech/classification-of-arrhythmia-using-ecg-data

A machine learning project to detect and classify arrhythmias from ECG signals using Python, scikit-learn, and TensorFlow. Includes data preprocessing, model training, and evaluation.

arrhythmia biomedical-signal-processing cardiology classification deep-learning ecg ecg-classification healthcare machine-learning mit-bih-dataset numpy python scikit-learn scipy signal-processing tensorflow time-series-analysis

Last synced: 07 Apr 2026

https://github.com/jpgiant/nyc_energy_prediction

Comprehensive code for predicting energy usage in NYC using machine learning algorithms.

data-analysis data-science data-visualization folium jupyter-notebook machine-learning matplotlib numpy pandas python seaborn sklearn

Last synced: 10 Apr 2026

https://github.com/izhaan0/predict-marks-based-on-study-hours

Student Marks Predictor is a machine learning project that predicts a student’s exam scores based on the number of study hours. It uses Linear Regression to learn the relationship between study hours and marks, and provides both command-line and interactive Streamlit web interfaces for prediction and visualization.

data-visualization joblib jupyter-notebook machine-learning machine-learning-algorithms matplotlib-pyplot numpy pandas pandas-dataframe pickle python scikit-learn seaborn

Last synced: 05 Apr 2026

https://github.com/hanannazri/predictive-customer-churn-modelling-with-price-sensitivity-insights-

As a part of BCG Data Science Project, I developed a predictive churn-risk model for XYZ energy utility by engineering price, consumption and contract features and training XGBoost and Random Forest models to identify customers most likely to churn at current prices.

bcgx data-visualization exploratory-data-analysis feature-engineering jupyter jupyter-notebook model-training-and-evaluation numpy pandas random-forest scikit-learn xgboost

Last synced: 19 Aug 2025

https://github.com/erikaax08/numerical_methods

This repository hosts the project developed for the Numerical Methods class at the Benemérita Universidad Autónoma de Puebla (BUAP)

django maths matplotlib numerical-methods numpy plotly

Last synced: 19 Aug 2025

https://github.com/mjshubham21/ny_yellow_taxi_python_da_project

A data analysis project on New York Yellow Taxi data (February 2025) using Python and its analytics libraries: NumPy, Matplotlib, pandas, and Seaborn.

data-analysis jupyter-notebook matplotlib numpy pandas python seaborn

Last synced: 04 May 2026

https://github.com/cyberoctane29/epa-air-quality-aqi-analysis

This project involved analyzing air quality data from the EPA, focusing on the Air Quality Index (AQI). I used Python data structures like dictionaries and sets to manage and process the data, simulating real-world data analysis to assess pollution levels and their health implications.

data-analysis numpy pandas python statistics

Last synced: 10 Apr 2026

https://github.com/liblaf/array-typing

🧮 Type-safe array operations for NumPy, JAX, and PyTorch

array code-quality documentation jax numpy python static-analysis torch type-hints typing

Last synced: 19 Aug 2025

https://github.com/rtmigo/skifts_py

Search for the most relevant documents containing words from a query. Uses scikit-learn and NumPy.

cosine-similarity information-retrieval numpy python scikit-learn text-mining tf-idf

Last synced: 19 Aug 2025

https://github.com/csengupta1101/career-survey---data-scientist

This repository holds an in-depth analysis of data scraped from naukri.com, showcasing in-demand skills and opportunities in data science across the subcontinent.

analysis career data-science matplotlib naukri numpy pandas python3

Last synced: 07 May 2026

https://github.com/oguzhansarigol/Expected-Goals-xG-Data-Analysis

Our code analyzes expected goals for football players: from which areas of the field, at what times, with which foot, with which part of the body, and from which angles and distances they are most likely to score.

data-analysis-python expected-goals numpy pandas-python python

Last synced: 20 Aug 2025

https://github.com/lorenzorottigni/ml-houses

Machine Learning python bootcamp: linear regression on houses model

ipynb linear-regression machine-learning numpy pandas python scikit-learn seaborn

Last synced: 10 Apr 2026

https://github.com/alimiheb/morphotype-web-app

A modern web application that analyzes body morphotypes (Ectomorph, Mesomorph, Endomorph) using MediaPipe AI and provides personalized fitness and nutrition plans.

body-tracking flask mediapipe numpy opencv pandas pillow python3

Last synced: 10 Apr 2026

https://github.com/yalai92/cava_imp_exp_analysis

Analysis, visualization, preprocessing and clustering of global sparkling wine trade (2017–2024) using Python in Colab and ML to reveal trends and country profiles.

clustering colab-notebook data-science data-visualization machine-learning matplotlib numpy pandas preprocessing python sklearn tableau-public

Last synced: 25 Jan 2026

https://github.com/hadson0/dip-candy-classification

Candy classification project using digital image processing. Includes dataset creation, model development, and evaluation tools with TensorFlow, OpenCV, and Albumentations. Complete setup and instructions provided for building and training the model.

albumentations bounding-boxes cnn computer-vision cvat image-classification image-processing keras machine-learning matplotlib numpy opencv python seaborn sklearn tensorflow

Last synced: 29 Sep 2025

https://github.com/Marvin-VW/3dengine_python

3D engine written in Python using OpenCV and NumPy

3dengine numpy opencv python

Last synced: 29 Sep 2025

https://github.com/sejalmankar1012/customer_churn

The goal of this project was to develop a machine learning model that predicts customer churn based on historical customer data.

dataset deep-learning deployment machine-learning model-building modeling numpy optimization pandas seaborn streamlit tensorflow

Last synced: 05 Apr 2026

https://github.com/niteshchawla/delhivery-featureengineering

The Data team builds intelligence and capabilities using this data that helps them to widen the gap between the quality, efficiency, and profitability of their business versus their competitors.

feature-engineering hypothesis-testing matplotlib-pyplot numpy pandas-python scipy-stats seaborn

Last synced: 14 May 2026

https://github.com/beyzabasarir/northwind-traders-analysis

Northwind dataset analysis using PostgreSQL, Python, and Power BI. Focused on sales, customers, shipping, and performance insights.

dashboard data-analysis data-visualization jupyter-notebook matplotlib numpy pandas postgresql powerbi python seaborn

Last synced: 10 Apr 2026

https://github.com/dhaase-de/dito

Yet another toolbox for the daily work with OpenCV under Python.

numpy opencv python

Last synced: 22 Aug 2025

https://github.com/gaurav-0211/seaborn-for-data-visualization

This project aims to cover different plotting methods using seaborn for data visualization.

data-visualization jupyter-notebook matplotlib numpy pandas-dataframe seaborn

Last synced: 20 Apr 2026

https://github.com/ahmedabdalkreem/covid-19

In this project we build a complete application in three parts: the first part builds a deep learning model using a CNN, the second builds a simple mobile app using Flutter, and the last connects the mobile app to the model using a Flask API.

cnn dart deep-learning flask-api flutter matplotlib mobile-app numpy pandas python tensorflow

Last synced: 10 Apr 2026

https://github.com/marknature/machine-learning-intern

Machine Learning tasks involving the Titanic Dataset and Breast Cancer Wisconsin (Diagnostic) dataset

data-analysis github jupiter-notebook machine-learning matplotlib numpy pandas python scikit-learn sklearn

Last synced: 10 Apr 2026

https://github.com/balakrish001/python-course

The Python course provides a thorough foundation in Python programming, starting with core concepts and progressing to OOP. It covers essential libraries like NumPy and Pandas for data manipulation and analysis. You'll also learn how to send emails using Python's SMTP library and interact with APIs for real-world applications.

api numpy oops-in-python pandas problem-solving python smtp

Last synced: 10 Apr 2026

https://github.com/aymen016/cosmic-mystery-challenge-2912

"Explore the depths of space and unravel cosmic mysteries in the year 2912 with our Cosmic Mystery Challenge repository. Dive into data science adventures as you predict the fate of passengers aboard the Spaceship Titanic after a collision with a spacetime anomaly. Join us in reshaping history and saving lives across the universe!"

kaggle matplotlib matplotlib-pyplot numpy pandas pandas-dataframe python scikit-learn scikitlearn-machine-learning seaborn

Last synced: 10 Apr 2026

https://github.com/prince-pastakiya/human-resources-tableau-project

👥 Interactive Tableau dashboard for HR analytics — includes workforce overview, demographics, income analysis, and detailed employee records with full filtering.

chatgpt data-analysis data-visualization human-resources numpy python python-faker tableau-dashboards tableau-public

Last synced: 18 Apr 2026

https://github.com/kwasi-dankwa/game-sales

This project investigates why Nintendo’s global market share fluctuated between 2010–2016, identifies top-performing genres and regions, and builds predictive models to forecast sales and regional performance. It culminates in specific, actionable recommendations to grow market share.

business-intelligence gaming juptyernotebook-language matplotlib numpy pandas postgresql python3

Last synced: 09 Apr 2026

https://github.com/yousefmostafa125/image-processing

An interactive image processing web app built with Streamlit and OpenCV. The app allows users to upload an image, apply various processing steps, and download the results individually or as a ZIP file.

numpy opencv-python python streamlit

Last synced: 03 May 2026

https://github.com/sc0v0ne/blueflix

Simple application that recommends movies and TV shows

csv k-means k-means-clustering kaggle movies numpy pandas python recommends-movies sklearn streamlit

Last synced: 10 Apr 2026

https://github.com/iv4n-ga6l/simple-image-segmentation

Simple implementation of image segmentation using the Watershed algorithm

imagesegmentation matplotlib numpy opencv python watershed-algorithm

Last synced: 13 May 2026

https://github.com/sawaira-iqbal/data-visualization-project-on-car-sales-data

Explore Sales Data Visualization with Interactive Charts & Insights! 📊 Uncover trends and patterns to drive smarter automotive decisions.

bivariate-analysis data-science data-visualization interactive-visualizations matplotlib multivariate-analysis numpy plotly python seaborn univariate-analysis

Last synced: 12 Apr 2026

https://github.com/clchinkc/zombie

Personal project, Python, NumPy, Matplotlib, Pygame, Scikit-learn, TensorFlow, Docker

algorithms data-analysis docker machine-learning matplotlib numpy pygame python sklearn tensorflow zombie-simulation

Last synced: 05 Apr 2026

https://github.com/ondiekelijah/data-analysis-udacity

Udacity Data Analysis Nano Degree course resources

jupyter-notebook matplotlib numpy pandas python

Last synced: 17 Mar 2025

https://github.com/mays-m/deeplearning

In deep learning, neural networks are built and trained on data so that they produce better predictions than before.

anaconda artificial-intelligence conda-environment matplotlib neural-networks numpy pandas python3 tensorflow tensorflow-gpu transortfow-

Last synced: 11 Apr 2026

https://github.com/zenithclown/agine

Library for finding the nearest neighbor or checking whether two points on Earth have a direct line of sight.

bearing bearing-calculations distance-calculation euclidean-distance geometry haversine manhattan-distance nearest-neighbor-search numpy pandas rasterio shapely

Last synced: 28 Jan 2026

https://github.com/tigureis/data-preparation-from-kickstarter-campaigns

Kickstarter Data Prep: A hands-on guide to basic data cleaning and transformation.

data-cleaning data-construction data-integration data-science data-selection numpy pandas

Last synced: 19 Apr 2026

https://github.com/sasivakulrithwik/059_iva_assignment-1

Image processing and object detection using OpenCV, including color detection, smoothing, edge detection, and face detection with Haar cascades. 🚀

ipython-notebook numpy opencv python3

Last synced: 10 Apr 2026

https://github.com/shubhammittal-data/hr_dashboard_tableau

An interactive HR Analytics Dashboard built using Tableau. Provides insights into workforce demographics, hiring trends, salary analysis, and employee records for data-driven decision-making.

chatgpt4 data data-analysis data-visualization drawio-tools faker-generator hr-analytics hr-analytics-dashboard human-resources numpy python tableau tableau-public

Last synced: 17 May 2026

https://github.com/yessasvini23/pytorch-bootcamp-on-ai-by-opencv

This OpenCV course is highly recommended for individuals seeking an introductory experience in Computer Vision and AI, and who wish to establish a strong foundation for their career development.

cnn-classification computer-vision cv2-library deeplearning-ai numpy opencv pytorch rnn-tensorflow

Last synced: 13 Apr 2026

https://github.com/jhonatanthiago/projeto_analise_de_dados_com_python

The company has hired you to analyze data on more than 800 thousand customers, most of whom are inactive, meaning they have already cancelled the service. The goal is to understand the main reasons for these cancellations and identify the most effective actions to reduce that number.

jupyter-notebook nbformat numpy openxyl pandas plotly python

Last synced: 10 Apr 2026

https://github.com/thiagohrcosta/machinelearning-temperature

A Small Machine Learning application leveraging Scikit-Learn and statistical learning to extract knowledge from data without explicit programming.

machine-learning numpy pandas python3 scikit-learn

Last synced: 08 Apr 2026

https://github.com/nirmit27/credit-risk-modeling

Classification and regression models for predicting the level of risk associated with extending credit to a borrower and the basic EPS amount respectively.

credit-risk credit-risk-analysis machine-learning ml ml-model ml-models modelling numpy pandas python python3 sckiit-learn

Last synced: 29 Apr 2026

https://github.com/abolfazl-younesi/levy-walk

This repository contains a Python implementation of the Lévy Walk simulation, a random walk model with a power-law step length distribution. The code simulates multiple walkers with boundary conditions and visualizes their trajectories for different values of the power-law exponent.

levy-walks matplotlib numpy python random-walk

Last synced: 02 May 2026

https://github.com/teslaproduuction/data_analiz_al

Forecasting the aluminum price using the ARIMAX model

arimax csv django html-css-javascript numpy pandas python rss scipy sklearn sqlite yfinance-library

Last synced: 10 Apr 2026

https://github.com/tralahm/datascience

Getting started with data science, numerical analysis, and scientific computing

hmmlearn ipynb jupyterlab keras numpy pandas python3 scipy tensorflow

Last synced: 10 Apr 2026

https://github.com/sujal2004/stock-analysis-project

Stock Market Analysis and Prediction is a project on technical analysis, visualization, and prediction using data provided by Google Finance.

jupyter-notebook matplotlib numpy pandas python stock-market

Last synced: 10 Apr 2026

https://github.com/saro0307/unemployment-analysis

This data science project delves into unemployment trends, using data analysis and machine learning to identify key factors and predict future joblessness rates, aiding policymakers and businesses in informed decision-making.

data-science datavisualization machine-learning matplotlib numpy pandas python seaborn skit-learn

Last synced: 10 Apr 2026

https://github.com/screamman-ssh/video-camera-to-ascii

Convert a video file or camera capture to display as text.

ascii camera-to-text computer numpy opencv pillow python python-3 video-to-text

Last synced: 10 Apr 2026

https://github.com/kostadinlambov/algorithmic-trading-bot

The project aims to evaluate the predictive performance of different machine learning (ML) algorithms for Bitcoin trading. The proposed trading strategy integrates key technical indicators, including the Relative Strength Index (RSI), Simple and Exponential Moving Averages, and the Moving Average Convergence Divergence (MACD).

lightgbm machine-learning matplotlib mlflow numpy optuna pandas pickle random-forest scikit-learn scipy seaborn statsmodels xgboost

Last synced: 05 Apr 2026

https://github.com/fatimaafzaal/car_price_prediction

Trains Random Forest and Gradient Boosting models to predict car prices based on user inputs for various car attributes, evaluating models and making predictions using the best-performing model.

car-price-prediction ensemble-learning gradient-boosting matplotlib numpy pandas random-forest regression regression-models scikit-learn seaborn

Last synced: 09 Apr 2026

https://github.com/ahmedabdalkreem/machine_learning_with_cv

This project extracts features from shapes to use as input to a machine learning model.

chaincode computer-vision contour-detection feature-extraction machine-learning matplotlib numpy open-cv pandas python sklearn svc-model

Last synced: 10 Apr 2026

https://github.com/offthetab/statistics_homework

Practical assignments in statistics, MIREA, 3rd year.

numpy pandas python statsmodels

Last synced: 05 Apr 2026

https://github.com/roaajadaa/text-classification

Classify documents using a custom naïve Bayes model

matplotlib nlp nltk numpy pandas python sklearn

Last synced: 10 Apr 2026

https://github.com/lorenzorottigni/ml-iris-svm

Machine Learning python bootcamp: Support Vector Machines on iris flower dataset

ipynb machine-learning numpy pandas python scikit-learn seaborn support-vector-machines

Last synced: 10 Apr 2026

https://github.com/prajakta1321/authencheck

Amdocs Gen AI Graduate Hackathon 2024-25: a comprehensive fact-checking and misinformation detection system that leverages cutting-edge AI models and multiple news sources to verify information circulating on social media.

api bert-fine-tuning flask-application matplotlib ngrok-server nlp nlp-machine-learning numpy pandas python3 scikit-learn seaborn wandb

Last synced: 05 Apr 2026

https://github.com/gaball1/alzheimer-prediction-ai

Alzheimer Prediction is a machine learning-based project designed to predict the likelihood of Alzheimer’s disease using clinical and imaging data. The project features an interactive web application built with Streamlit, enabling users to input key data, view predictions, and explore visualizations.

ai data-mining gradient-boosting jupyter-notebook logistic-regression logistic-regression-algorithm machine-learning matplotlib-python models numpy pandas python random-forest-classifier streamlit xgboost-classifier

Last synced: 11 Apr 2026

https://github.com/maximlevchenko/pandas-numpy-hippocampus-analysis

This project aims to analyze the differences in hippocampus size between two groups: "Unaffect" and "Affected" using pandas, numpy and matplotlib

jypyternotebook matplotlib numpy pandas pyreadr seaborn statsmodels

Last synced: 16 Apr 2026

https://github.com/bharath-tars/deeptumor

DeepTumor is a project aimed at developing a deep learning model with over 1 million parameters using TensorFlow, with an automatic reporting feature.

keras machine-learning numpy pandas pyhton tensorflow webdevelopment

Last synced: 10 Apr 2026

https://github.com/theanujsinha01/seoul-bike-sharing-demand-prediction

This project predicts the demand for bike-sharing services in Seoul using machine learning algorithms. It utilizes historical data and environmental factors to provide accurate predictions that can aid in resource optimization and planning.

data-cleaning data-science exploratory-data-analysis feature-engineering jupyter-notebook machine-learning matplotlib model numpy pandas seaborn

Last synced: 10 Apr 2026

https://github.com/adi3042/credit-card-fault-detection

🔍💳 Secure Your Finances! Detect anomalies and safeguard transactions with our Credit Card Fault Detection system. Dive into cutting-edge classification techniques to identify fraud and protect financial data. Your journey to secure payments starts here! 🚨🔒 FraudDetectionTech

classification credit-card css datetime fault-detection flask functools html ipykernel jupyternotebooks machine-learning numpy pandas python3 readme scikit-learn setuptools venv

Last synced: 03 Apr 2026

https://github.com/paulo-santos-ds/previsao_total_corridas_taxi

A predictive model was developed specifically to attract more drivers during peak hours. It predicts the number of taxi orders for the next hour.

matplotlib-pyplot numpy pandas python seaborn sklearn statsmodels

Last synced: 10 Apr 2026

https://github.com/pramodyasahan/car-safe-predictor

This repository contains a machine learning project that applies the K-Nearest Neighbors (KNN) classification algorithm to predict car safety ratings. The project uses a dataset of cars, with features such as buying price, maintenance cost, number of doors, persons, lug boot size, and safety.

classification k-nearest-neighbours machine-learning numpy pandas scikit-learn

Last synced: 10 Apr 2026

https://github.com/psavarmattas/covid19-dashboard

The goal of this project is not to build just another dashboard for Covid-19, but to focus on collaborative plot ideas and a mobile-friendly UI/UX.

appseed bootstrap covid covid-19 css dashboard django docker html javascript json numpy plotly python travis-ci

Last synced: 10 Apr 2026

https://github.com/rakibhhridoy/appliedmachinelearninghousing-regression

Let's take the Housing dataset, which contains information about different houses in Boston. This data was originally part of the UCI Machine Learning Repository and has since been removed. We can also access this data from the scikit-learn library. The objective is to predict house prices using the given features.

deep-learning housing-market housing-prices machine-learning numpy pandas python real-estate regression scikit-learn

Last synced: 05 Apr 2026

https://github.com/lorenzorottigni/ml-ecommerce

Machine Learning python bootcamp: linear regression on ecommerce dataset

ipynb linear-regression machine-learning numpy pandas python scikit-learn seaborn

Last synced: 07 Apr 2026

https://github.com/anuppm9917/super-store-sales-analysis-power-bi-project

Driven to know which products, regions, categories, and customer segments a company should target or avoid, I searched for and selected an appropriate dataset on Kaggle that matches a standard superstore's requirements.

data data-analysis data-visualization datacleansing excel exploratory-data-analysis jupyter-notebook numpy pandas plotly powerbi python3

Last synced: 10 Apr 2026

https://github.com/ngangawairimu/automating_farming_analysis

This project automates farming in Maji Ndogo, analyzing soil fertility, climate, and geography to recommend optimal crop locations. Python-based data analysis uncovers patterns guiding farming practices.

matplotlib-pyplot numpy pandas python seaborn sqlite

Last synced: 10 Apr 2026

https://github.com/rokmark0/auto-feature-engineering

🐙 Automated feature engineering toolkit that creates, selects, and transforms features for machine learning on tabular data, boosting model accuracy and speeding workflows.

auto-feature-engineering automl aws-ec2 dask data-science explainable-ml feature-engineering featuretools machine-learning mahinelearning numpy pandas pandas-library python r sagemaker supervised-learning unsupervised-learning

Last synced: 10 Apr 2026

https://github.com/amardeepsingh-06/smart_attendance_system

The Smart Attendance System is designed to automate the attendance process using facial recognition. Traditional methods of taking attendance can be time-consuming and prone to errors. This project aims to streamline the process by leveraging computer vision technology, allowing for efficient, contactless, and accurate attendance tracking.

asgiref django mysql numpy opencv pillow-library python sqlparse

Last synced: 05 Apr 2026