Data Science
Data science is an interdisciplinary field that uses scientific methods, processes, algorithms, and systems to extract knowledge from structured and unstructured data. Data scientists perform data analysis and preparation, and their findings inform high-level decisions in many organizations.
- GitHub: https://github.com/topics/data-science
- Wikipedia: https://en.wikipedia.org/wiki/Data_science
- Related Topics: data-analysis, data-mining, machine-learning, big-data, data-visualization
- Aliases: datasciences, data-science-project, data-science-algorithm
- Last updated: 2026-07-02 00:07:38 UTC
https://github.com/bcgov/bcgroundwater
An R package to facilitate analysis and visualization of groundwater data from the British Columbia groundwater observation well network
Last synced: 20 Jul 2025
https://github.com/dimzachar/datatalksclub-projects
Streamlit-Powered DataTalksClub Project Analyzer: Interactive Insights at Your Fingertips
data-science gpt machine-learning openai python streamlit vizualisation
Last synced: 18 Jul 2025
https://github.com/alastairrushworth/tdf
Tour de France winners and stages data
data-science dataframe exploratory-data-analysis rstats tdf tour-de-france
Last synced: 13 Apr 2025
https://github.com/ancatmara/data-science-nlp
NLP Section of the Data Science course, NRU HSE
classification clustering data-analysis data-science dimensionality-reduction embeddings fnn language-models morphological-analysis natural-language-processing nlp python regex russian-nlp syntactic-parsing topic-modelling tutorials
Last synced: 11 Jul 2025
https://github.com/brianruizy/2019-microsoft-iot-hackathon
1st place winner | Bump.IT - Pothole detection and mapping using data science methods of analysis, mobile phone telemetry, and computer vision, deployed through Azure.
computer-vision data-science geocoding internet-of-things pothole-detection
Last synced: 19 Mar 2025
https://github.com/blazingdb/welcome_to_blazingsql_notebooks
RAPIDS data science. No setup required.
blazingsql blazingsql-notebooks dask data-science data-visualization demos gpu jupyterlab machine-learning notebooks rapids sql
Last synced: 12 Apr 2025
https://github.com/benedekrozemberczki/FSCNMF
An implementation of "Fusing Structure and Content via Non-negative Matrix Factorization for Embedding Information Networks".
analytics artifical-intelligence data-mining data-science deepwalk embedding factorization-machine fscnmf graph-embedding graph2vec information-network machine-learning network-embedding nmf node-embedding node2vec pca regularization word2vec word2vec-model
Last synced: 17 Apr 2025
https://github.com/alessandrocorradini/university-of-michigan-applied-data-science-with-python-specialization
Repository for the Applied Data Science with Python Specialization from University of Michigan on Coursera
coursera coursera-specialization data-science machine-learning mooc moocs
Last synced: 07 Sep 2025
https://github.com/tushar2704/sql-portfolio
Collection of personal SQL projects and queries I've worked on, showcasing my skills and expertise in database management, data analysis, and data manipulation using SQL.
data data-analytics data-science dataanalysis datamanipulation machine-learning mysql postgresql sql streamlit-tushar2704 tushar2704
Last synced: 07 May 2025
https://github.com/OGFris/GoStats
GoStats is a Go library for mathematical statistics, mostly used in ML domains; it covers most statistical measure functions.
data-science go golang gostats machine-learning math mathematics mit-license statistical-measures statistics stats
Last synced: 14 Mar 2025
https://github.com/amine-smahi/r-learning-journey
Some of the projects I made when starting to learn R for Data Science at university
afc cpa data-cleaning data-integration data-science datascience r r-language
Last synced: 18 Mar 2025
https://github.com/aengl/cocoon-demo
Cocoon: a flow-based workflow automation, data mining and visual analytics tool.
brushing cocoon data-mining data-science data-visualization dataflow flow-based-modeling flow-based-programming interactive-visualisations node-js reactjs visual-analytics workflow-automation
Last synced: 03 Apr 2025
https://github.com/rubydamodar/the-ultimate-pandas-bootcamp
Welcome to the Pandas for Data Science repository! This course is designed to take you from beginner to proficient in using Pandas, the powerful data manipulation library in Python. Whether you're just starting your data science journey or looking to sharpen your skills, this repository contains all the resources
beginner-friendly csv-data data-analysis data-cleaning data-manipulation data-science data-visualization dataframe exploratory-data-analysis jupyter-notebook machine-learning matplotlib numpy pandas python python-pandas series statistical-analysis time-series titanic-dataset
Last synced: 19 Apr 2025
https://github.com/sinanuozdemir/oreilly-transformers-nlp-mlops
Session on MLOps
data-science deep-learning machine-learning mlops natural-language-processing nlp python transformers
Last synced: 24 Feb 2025
https://github.com/systamental/cryptodatapy
CryptoDataPy is a Python library that makes it easy to build high-quality data pipelines for the analysis of cryptoassets
alternative-data cryptoassets data-science etl-pipeline market-data on-chain-data pandas python
Last synced: 06 Aug 2025
https://github.com/stefen-taime/car-price-predictor
Predicting Car Prices with FastAPI, Streamlit, MLflow, Kafka, and Debezium: A Practical Demonstration
data data-science dataanalysis-projects engineering machine-learning mlops predictive-modeling
Last synced: 04 Aug 2025
https://github.com/gyrdym/ml_preprocessing
Implementation of popular data preprocessing algorithms for Machine learning
data-preprocessing data-science machine-learning machine-learning-algorithms onehot-encoder ordinal-encoder
Last synced: 21 Mar 2025
https://github.com/alro10/roadmap-data-scientist
The basic roadmap to become a data scientist
analytics cognitive-courses data-science data-scientist docker ibm ibm-cloud kubernetes machine-learning python python3 roadmap roadmap-ds sql
Last synced: 07 Mar 2026
https://github.com/tjpalanca/facebook-news-analysis
Analysis of Facebook News in the Philippines
analysis data data-science facebook news philippines
Last synced: 07 Mar 2026
https://github.com/PySloth/pysloth
A Python Package for Probabilistic Prediction
data-analysis data-science machine-learning python statistics
Last synced: 11 May 2025
https://github.com/Akai01/caretForecast
Conformal Time Series Forecasting Using State-of-the-Art Machine Learning Algorithms
caret conformal-prediction data-science econometrics forecast forecasting forecasting-models machine-learning macroeconometrics microeconometrics r time-series time-series-forcasting time-series-prediction
Last synced: 11 May 2025
https://github.com/fbruzzesi/sklearn-smithy
Toolkit to forge scikit-learn compatible estimators
cli data-science machine-learning python scikit-learn webui
Last synced: 16 Sep 2025
https://github.com/mdh266/nycbuildingenergyuse
Creating Regression Models Of Building Emissions On Google Cloud
bokeh data-science energy-efficiency exploratory-data-analysis google-app-engine missing-data missing-values outlier-detection outlier-removal regression regression-models scikit-learn xgboost
Last synced: 30 Jul 2025
https://github.com/eshikashah/skillship-internship-data-science-projects
Utilized this lockdown to do something productive. SkillShip Foundation provided an internship opportunity, and here's the outcome: the projects I made in these 2 months.
classification data-science internship machine-learning regression
Last synced: 28 Jul 2025
https://github.com/icaropires/pdf2dataset
Converts a whole subdirectory with a big (or small) volume of PDF documents to a dataset (pandas DataFrame) with error tracking and choice of features
data-science distributed-computing distributed-systems ocr pandas-dataframe parallel parquet pdf pdf2image pdftotext pyarrow pytesseract pytesseract-ocr python python3 ray tesseract tesseract-ocr
Last synced: 13 Apr 2025
https://github.com/idanpa/jupad
Python Notepad
calculator data-science ipython jupyter
Last synced: 26 Jul 2025
https://github.com/UtrechtUniversity/iBridges
A wrapper around the python-irodsclient to allow for easy interaction with iRODS servers.
data-analysis data-engineering data-science datascience irods-client
Last synced: 29 Jun 2025
https://github.com/ahmetfurkandemir/online-istanbul-applied-data-science-102-bootcamp
Online Istanbul Applied Data Science 102 Bootcamp (Start: 15 August, Finish: 7 November)
bootcamp data-science deep-learning kodluyoruz machine-learning
Last synced: 15 Apr 2025
https://github.com/polyaxon/cli
Polyaxon Core Client & CLI to streamline MLOps
data-science dataops deep-learning hyperparameter-optimization kubernetes machine-learning ml mlops pytorch scikit-learn tensorflow workflows
Last synced: 25 Aug 2025
https://github.com/alvertogit/bigdata_docker
Big Data Docker Data Science Spark Spark4 Hadoop HDFS Scala Python Artificial Intelligence Machine Learning Jupyter Lab Notebook
big-data data-science docker jupyter-lab jupyter-notebook machine-learning python scala spark spark4
Last synced: 10 Mar 2026
https://github.com/psyplot/psy-view
An ncview-like GUI with psyplot
data-science gui netcdf psyplot visualization
Last synced: 20 Aug 2025
https://github.com/pachyderm/neon-workshop
A Pachyderm deep learning tutorial for conference workshops
containers data-engineering data-pipelines data-science deep-learning docker kubernetes machine-learning python
Last synced: 02 Mar 2026
https://github.com/squey/squey
Squey is a visualization software designed to interactively explore and understand large amounts of tabular data (this is the read-only mirror of https://gitlab.com/squey/squey)
cybersecurity data-analysis data-science data-visualization exploratory-data-visualizations parallel-coordinates parquet parquet-files parquet-viewer pcap timeseries timeseries-analysis visualization
Last synced: 08 Mar 2025
https://github.com/mohammed-majid/ml_roadmap
Comprehensive Machine Learning Roadmap
algorithms data-science deep-learning machine-learning roadmap
Last synced: 06 Mar 2025
https://github.com/maastrichtlawtech/case-law-explorer
A network analysis software platform for analyzing Dutch and European court decisions.
case-law data-science network-analysis
Last synced: 23 Jan 2026
https://github.com/milos-agathon/crisp-topographical-map-with-r
In this repo, I'll show you how to programmatically access satellite imagery from several APIs to create a crisp topographic map of Italy. We will use a single interface to query the data without even downloading raster data to your local drive. For a tutorial please visit https://milospopovic.net/crisp-topography-map-with-r/
data-science data-visualization gis maps r satellite-imagery topography
Last synced: 04 Apr 2026
https://github.com/sominw/kaggle
Data Analysis using datasets from Kaggle
data-analysis data-science data-science-portfolio exploratory-data-analysis ipython-notebook jupyter-notebook kaggle-competition machine-learning machine-learning-algorithms
Last synced: 28 Oct 2025
https://github.com/cusyio/python4datascience
Teaching materials for the cusy training courses on Python-based data science workflows: https://cusy.io/en/seminars
data-science datascience dvc git ipython numpy pandas python
Last synced: 05 Sep 2025
https://github.com/zakroum-hicham/football-analysis-cv
This repository contains a computer vision/machine learning football project that uses YOLO for object detection, K-means for pixel segmentation, and perspective transformation to analyze player movements in football videos.
ai computer-vision data-science football-analytics kmeans-clustering machine-learning opencv yolov8
Last synced: 26 Mar 2025
https://github.com/osl-pocs/skdata
Python tools for data analysis
data data-analysis data-science open-data python
Last synced: 23 Feb 2026
https://github.com/vsimkus/torch-reparametrised-mixture-distribution
PyTorch implementation of the mixture distribution family with implicit reparametrisation gradients.
data-science gradients machine-learning mixture-distributions mixture-model mixture-of-gaussians pytorch variational-inference
Last synced: 10 Oct 2025
https://github.com/mdh266/NYCBuildingEnergyUse
Creating Regression Models Of Building Emissions On Google Cloud
bokeh data-science energy-efficiency exploratory-data-analysis google-app-engine missing-data missing-values outlier-detection outlier-removal regression regression-models scikit-learn xgboost
Last synced: 07 May 2025
https://github.com/neemiasbsilva/regression-in-cnns-applied-to-plant-leaf-count
Regression in Convolutional Neural Network applied to Plant Leaf Count
cnns computer-vision convolutional-neural-networks count cvppp data-engineering data-science dataset deep-learning deep-learning-api deep-neural-networks inception-resnet-v2 nasnet-models plant-leaf-counting plant-phenotypes plant-phenotyping regression resnet-50 tensorflow xception-model
Last synced: 11 Apr 2025
https://github.com/tjmahr/polypoly
Helper functions for orthogonal polynomials in R
Last synced: 30 Apr 2025
https://github.com/iBridges-for-iRODS/iBridges
A wrapper around the python-irodsclient to allow for easy interaction with iRODS servers.
data-analysis data-engineering data-science datascience irods-client
Last synced: 14 Jul 2025
https://github.com/mad-lab-fau/tpcp
Pipeline and Dataset helpers for complex algorithm evaluation.
algorithms biosignals data-management data-science machine-learning python
Last synced: 04 Feb 2026
https://github.com/csinva/data-viz-utils
Functions for easily making publication-quality figures with matplotlib.
big-data data-analysis data-science data-visualization eda legend matplotlib python python3 scatterplot time-series
Last synced: 05 May 2025
https://github.com/Grasia/WikiChron
Data visualization tool for the evolution of wikis
analyzer data-analysis data-science data-visualization datascience dump evolution graphs history history-dump mediawiki-wikis plot research-tool time-series visualization web-service wiki wikia wikimedia wikis
Last synced: 03 Apr 2025
https://github.com/mindsetlib/insolver
Low-code machine learning library specialized for insurance tasks: prepare data, build models, and deploy to production.
auto-ml automated-machine-learning automl bayesian-optimization data-science elyra elyra-community feature-engineering hyperparameter-optimization insurance insurance-claims insurance-company insurance-scoring insurance-team low-code machine-learning
Last synced: 14 Dec 2025
https://github.com/danlessa/coursera-xarray
Repository for the "Climate Geospatial Analysis with Python and Xarray" project on Coursera
climate-science course-project coursera data-science geospatial-analysis xarray
Last synced: 22 Jun 2025
https://github.com/viveckh/new-ml-data-science-framework-tutorials-by-ej
Internet's Most Popular Tutorials on Fresh-off-the-shelf ML & Data Science Technologies, Authored by Yours Truly.
crash-course data-science directed-acyclic-graph facebookai hiplot hiplot-tutorial ibm-qiskit machine-learning metaflow metaflow-tutorial netflix python python-library qiskit-tutorial qiskit-workshop-materials quantum-computing quantum-programming tutorials
Last synced: 19 Mar 2025
https://github.com/khiopsml/khiops-python
The Python library of the Khiops AutoML suite
auto-feature-engineering automatic-machine-learning automl data-science machine-learning python supervised-learning unsupervised-learning
Last synced: 01 Jul 2026
https://github.com/raamana/missingdata
Missing data handling: visualize and impute
biostatistics data-science dirty-data epidemiology imputation machine-learning missing-data missing-values neuroscience visualization
Last synced: 13 Apr 2025
https://github.com/trflorian/sentiment-analysis-viz
Real-time visualization of sentiment analysis on text input
customtkinter data-science huggingface opencv-python python sentiment-analysis tokenizer transformers
Last synced: 08 Jul 2025
https://github.com/ozguraslank/flexml
Easy-to-use and flexible AutoML library for Python
automl data-science machine-learning python scikit-learn
Last synced: 18 Jul 2025
https://github.com/umuthopeyildirim/flatironopensource
Flatiron School lessons for graduated students.
computer-science data-science javascript python ruby web-development
Last synced: 07 Mar 2026
https://github.com/yufree/datadown
Data analysis [remainder of Chinese title unrecoverable]
bookdown chinese-simplified data-science r statistics
Last synced: 18 Mar 2025
https://github.com/mtpatter/mlflow-tutorial
Fully reproducible, Dockerized, step-by-step tutorial on training and serving a simple sklearn classifier model using MLflow. Detailed blog post published on Towards Data Science.
data-science machine-learning mlflow mlflow-docker mlops tutorial
Last synced: 04 May 2025
https://github.com/deysuman/machinelearningstocks
Using Python and scikit-learn to make stock predictions
data-science deep-learning deysuman finance historical-stock-fundamentals india machine-learning machine-learning-algorithms made-with-love math-with-python python python3 science scikit-learn sklearn stock-analysis stock-prediction yahoo-finance
Last synced: 01 May 2025
https://github.com/robinlovelace/opengeohub2023
Content for lecture at OpenGeoHub 2023 on spatial data and the tidyverse
course data-science opengeohub osgeo practical r reproducible summer-school tidy-data
Last synced: 20 Mar 2025
https://github.com/duo-labs/datasci-ctf
A capture-the-flag exercise based on data analysis challenges
Last synced: 30 Apr 2025
https://github.com/riveryio/rivery_cli
Rivery CLI
data-pipeline data-pipelines data-science database database-management dataops dataops-platform dwh dwh-team elt etl rivery
Last synced: 11 Jul 2025
https://github.com/rezapace/komputasi-big-data
This repository contains materials and practical exercises for learning Python in the context of Big Data Computation. The focus is on analyzing and processing large datasets using various tools and techniques.
ai big data data-science git-reza gunadarma gundar komputasi-big-data
Last synced: 28 Sep 2025
https://github.com/MindSetLib/Insolver
Low-code machine learning library specialized for insurance tasks: prepare data, build models, and deploy to production.
auto-ml automated-machine-learning automl bayesian-optimization data-science elyra elyra-community feature-engineering hyperparameter-optimization insurance insurance-claims insurance-company insurance-scoring insurance-team low-code machine-learning
Last synced: 20 Jul 2025
https://github.com/csinva/cookiecutter-ml-research
A logical, reasonably standardized, but flexible project structure for conducting ML research
ai artificial-intelligence classification data-science machine-learning ml ml-tooling modeling natural-language-processing nlp python regression research statistics tabular-data template
Last synced: 01 Mar 2026
https://github.com/redhat-na-ssa/demo-ai-gitops-catalog
A catalog for demos in GitOps on OpenShift
ai data-science gitops kustomization kustomize machine-learning openshift
Last synced: 29 Jan 2026
https://github.com/sunnynguyen-ai/fraud-detection-system
Real-time fraud detection system using ensemble ML models, featuring streaming data processing, explainable AI with SHAP, and production-ready deployment with FastAPI and Docker.
data-science docker ensemble-models fastapi feature-engineering fraud-detection machine-learning mlops production-ml python random-forest real-time-ml shap streamlit xgboost
Last synced: 04 May 2026
https://github.com/pforemski/gouda
Golang Utilities for Data Analysis
clustering data-analysis data-science dbscan golang interpolate kdtree kmeans machine-learning
Last synced: 28 Jan 2026
https://github.com/ndleah/people-data-analysis
Employee analysis #DWD
analytics data-analysis data-science historical-data materialized-view slow-changing-dimension sql
Last synced: 05 Mar 2026
https://github.com/idlab-discover/rustiflow
Flow feature extraction tool built in Rust using eBPF
data-science dataset-generation ebpf-programs feature-extraction machine-learning network-analysis network-monitoring network-security packet-analyser packet-capture pcap rust throughput-performance traffic-analysis
Last synced: 02 Feb 2026
https://github.com/dataship/frame
A DataFrame for Javascript
data-frame data-science javascript statistics
Last synced: 08 Jul 2025
https://github.com/mdh266/textclassificationapp
Building and Deploying A Serverless Text Classification Web App
data-science docker document-classification fastapi imbalanced-data imbalanced-learning machine-learning naive-bayes natural-language-processing nlp nltk scikit-learn support-vector-machine text-classification
Last synced: 30 Jul 2025
https://github.com/gyrdym/ml_dataframe
A way to store and manipulate data
data-science dataframe datascience dataset toy-dataset toy-datasets
Last synced: 21 Mar 2025
https://github.com/dagshub/3d-model-datasets
Open-source 3D Model datasets
codepeak codepeak2022 data-science dataset dvc hacktoberfest hacktoberfest-2022 hacktoberfest-2023 hacktoberfest2022 hacktoberfest22 machine-learning
Last synced: 02 Jan 2026
https://github.com/scicloj/tablecloth.time
Tools for the processing and manipulation of time-series data in Clojure.
clojure data-processing data-science dataset scicloj tablecloth time-series
Last synced: 14 Apr 2025
https://github.com/madsjulia/biguq.jl
Bayesian Information Gap Decision Theory
bayesian bayesian-data-analysis data-driven data-science decision-analysis decision-making decision-model decision-support decision-theory experimental-design high-performance-computing information-gap information-theory julia mads model-analysis model-driven predictive-analysis uncertainty-quantification
Last synced: 22 Apr 2025
https://github.com/lettier/interactive-simple-linear-regression
A PureScript, browser-based implementation of simple linear regression.
ai artificial-intelligence data-science frontend functional functional-programming gradient-descent halogen linear-regression machine-learning machine-learning-algorithms nueral-networks press-statistic purescript purescript-halogen regression statistics web-development
Last synced: 03 Feb 2026
https://github.com/kklemon/keras-loves-torchtext
Make Torchtext work with Keras.
data-science deep-learning keras natural-language-processing pytorch tensorflow torchtext
Last synced: 24 Sep 2025
https://github.com/991o2o9/smart-cardiologist
Intelligent Python service with FastAPI for real-time heart disease predictions using machine learning. Features AI-assisted consultations, user authentication, analysis history, RESTful API, and comprehensive error handling. Secure and scalable solution for healthcare applications.
api artificial-intelligence data-science fastapi healthcare healthcare-technology heart-disease machine-learning medical-ai medical-diagnosis prediction predictive-analytics pydantic python rest-api scikit-learn swagger uvicorn
Last synced: 30 Aug 2025
https://github.com/amitkaps/multidim
Visualising Multi Dimensional Data
data-science data-visualization grammar python r visualization
Last synced: 01 Mar 2026
https://github.com/astarte-platform/astarte_flow
Build data processing pipelines with Astarte Flow.
ai container containers data-science docker elixir iot kubernetes lua pipelines realtime
Last synced: 07 May 2025
https://github.com/balavenkatesh3322/deep-learning-notebook
A collection of deep learning notebooks for learning and practicing.
data-science deep-learning juypter-notebook juypterhub machine-learning neural-network notebook python3 pytorch tensorflow
Last synced: 22 Apr 2025
https://github.com/nirala96/bangalore-house-prediction-app
Predicts home prices of Bangalore. Used Flutter, Flask and Jupyter Notebook.
data-science datacleaning exploratory-data-analysis flask-api flutter jupyter-notebook linear-regression python
Last synced: 10 Mar 2026
https://github.com/jdvelasq/courses
Support material for courses, Facultad de Minas, Universidad Nacional de Colombia
analytics big-data big-data-analytics data-science training-materials
Last synced: 23 Aug 2025
https://github.com/facultyai/faculty
A Python library for interacting with the Faculty platform
data-science faculty-platform python
Last synced: 14 Apr 2025
https://github.com/mkearney/tfse
Useful R functions for various things
data-science functions mkearney-r-package r-language rstats utility
Last synced: 12 Apr 2025
https://github.com/simbafl/interview-notes
Python notes
data-science hadoop hive machine-learning python spark
Last synced: 08 Apr 2025
https://github.com/getindata/quickstart-ml-blueprints
Data science project development best practices and state of the art open-source tooling forged into a set of solved ML use cases to serve as blueprints for efficient prototyping.
Last synced: 09 Apr 2025
https://github.com/devinterview-io/recommendation-systems-interview-questions
Recommendation Systems interview questions and answers to help you prepare for your next machine learning and data science interview in 2024.
ai-interview-questions coding-interview-questions coding-interviews data-science data-science-interview data-science-interview-questions data-scientist-interview interview-practice interview-preparation machine-learning machine-learning-and-data-science machine-learning-interview machine-learning-interview-questions recommendation-systems recommendation-systems-interview-questions recommendation-systems-questions recommendation-systems-tech-interview software-engineer-interview technical-interview-questions
Last synced: 08 Jan 2026
https://github.com/wibeasley/ranalysisskeleton
Files and settings commonly used in analysis projects with R
Last synced: 06 Oct 2025
https://github.com/ajayarunachalam/gui-pandas-ai
GUIPandasAI - Integrating generative AI capabilities into Pandas as a web interface, along with keyword-based data analysis services
ai chatgpt data data-analysis data-analytics data-science generative-ai gpt-3 gpt-4 llm pandas python streamlit web-app
Last synced: 06 Jul 2025
https://github.com/kennethleungty/data-centric-ai-competition
Code for a top 5% finish in the Data-Centric AI Competition organized by Andrew Ng and DeepLearning.AI
ai andrew-ng data-centric data-centric-ai data-science deep-learning machine-learning
Last synced: 09 Jul 2025
https://github.com/brakmic/data-science-for-losers
Articles on Data Science, Jupyter, and Pandas
data-science jupyter machine-learning python
Last synced: 23 Apr 2025
https://github.com/mauroluzzatto/explainy
explainy is a Python library for generating machine learning model explanations for humans
data-science explanation machine-learning machine-learning-explainability python scikit-learn
Last synced: 07 Oct 2025
https://github.com/shimantorahman/empulse
Value-driven and cost-sensitive analysis for scikit-learn
cost-sensitive cost-sensitive-learning data-science machine-learning profit-driven profit-driven-analytics python scikit-learn sklearn value-driven value-driven-analytics
Last synced: 12 Oct 2025
https://github.com/picterra/picterra-python
Picterra Python API Client
data-science earth-observation geospatial-analysis geospatial-intelligence machine-learning
Last synced: 14 Jan 2026
https://github.com/devinterview-io/probability-interview-questions
Probability interview questions and answers to help you prepare for your next machine learning and data science interview in 2025.
ai-interview-questions coding-interview-questions coding-interviews data-science data-science-interview data-science-interview-questions data-scientist-interview interview-practice interview-preparation machine-learning machine-learning-and-data-science machine-learning-interview machine-learning-interview-questions probability probability-interview-questions probability-questions probability-tech-interview software-engineer-interview technical-interview-questions
Last synced: 12 Feb 2026