Data Science
Data science is an inter-disciplinary field that uses scientific methods, processes, algorithms, and systems to extract knowledge from structured and unstructured data. Data scientists perform data analysis and preparation, and their findings inform high-level decisions in many organizations.
- GitHub: https://github.com/topics/data-science
- Wikipedia: https://en.wikipedia.org/wiki/Data_science
- Related Topics: data-analysis, data-mining, machine-learning, big-data, data-visualization,
- Aliases: datasciences, data-science-project, data-science-algorithm,
- Last updated: 2026-07-02 00:07:38 UTC
- JSON Representation
https://github.com/staircase-dev/piso
Pandas Interval Set Operations: providing methods for set operations, analytics, lookups and joins on pandas' Interval, IntervalArray and IntervalIndex
data-analysis data-science data-structures interval interval-arithmetic interval-set pandas set set-operations set-theory
Last synced: 20 Aug 2025
https://github.com/williambdean/conjugate
Bayesian Conjugate Models in Python
bayesian-inference data-science probability-distribution python statistical-analysis statistics
Last synced: 09 Apr 2026
https://github.com/bastianolea/prensa_chile
Web scraping y análisis de texto sobre un corpus de texto de noticias de la prensa chilena
chile data-science datascience r textanalysis textmining
Last synced: 18 Jun 2025
https://github.com/codingforentrepreneurs/try-pandas
In this series, we're going to learn the fundamentals of the popular Python data science tool called Pandas.
data-analysis data-science deepnote jupyter nba-api nba-stats notebook pandas python python-pandas
Last synced: 18 Jan 2026
https://github.com/gabrieldim/house-price-prediction-data-science
Data Analysis & Visualization - Predict the future price of houses
analysis data-science house-price-prediction prediction visualization
Last synced: 10 Jul 2025
https://github.com/jongheepark/bayesiansocialscience
사회과학자를 위한 데이터과학 방법론 (코드 저장소)
bayesian change-point data-science network social-science textbook
Last synced: 18 Apr 2025
https://github.com/bukson/nancorrmp
Parallel correlation calculation of big numpy arrays or pandas dataframes with NaNs and infs.
correlation correlation-matrices data-science machine-learning multiprocessing numpy pandas python
Last synced: 16 Aug 2025
https://github.com/cataseven/statistics-graph-chart-card
A highly customizable, smooth, and advanced graph card. Shows historical sensor data with dynamic trend colors, statistics (min, max, avg), and more. A great alternative to the default history graph and sensor cards.
analysis analytics bar-chart chart data data-analysis data-science data-visualization graph graphics histogram historical-data history home-assistant statistical-analysis statistics
Last synced: 12 Apr 2026
https://github.com/sanjinkurelic/casebasedreasoning
Find missing values in data set using Euclid distance, normalization and calculating information value, weight of evidence
case-based-reasoning csv data-science influence information-value machine-learning numpy pandas python3 weight-of-evidence
Last synced: 20 Jun 2025
https://github.com/nuhmanpk/Webtrench
A powerful and easy-to-use web scrapper for collecting data from the web. Supports scraping of images, text, videos, meta data, and more. Ideal for machine learning and deep learning engineers. Download and extract data with just one line of code
audio-datasets data data-collection data-science dataset-generation deep-learning image-data-generator machine-learning python scarper text-datasets
Last synced: 08 Jul 2025
https://github.com/soodoku/data-science
Lecture Slides for Introduction to Data Science
data-science statistical-learning
Last synced: 11 Jan 2026
https://github.com/blmoore/blackspot
Shiny app exploring Edinburgh traffic collision data
Last synced: 05 Jul 2025
https://github.com/ritvik19/data-science-from-scratch
Implementation of various data science techniques and research papers
artificial-neural-networks classification computer-vision convolutional-neural-network data-science deep-learning generative-adversarial-network machine-learning natural-language-processing natural-language-understanding recurrent-neural-networks regression transfer-learning transformer
Last synced: 10 Apr 2025
https://github.com/intuit/metriks
Python package of commonly used metrics for evaluating information retrieval models.
data-science information-retrieval metrics python36
Last synced: 21 Sep 2025
https://github.com/sam-92/telegram-energy-api
The CleanEnergyBot is a Telegram bot providing real-time electricity usage, CO2 forecasts, and energy-saving tips in Ireland, using data from EirGrid and GPT-3 analysis. It helps users make eco-friendly energy choices by comparing emissions data with EU standards.
data-science data-visualization digitaltwins energy-data iot iot-application llm openai smart-grids smart-home telegram-bot telegram-bot-api
Last synced: 17 Jun 2025
https://github.com/deepraj1729/data-science-end-to-end
A Respository to get you job ready as a Data Scientist
apis aws big-data data-science data-visualization dbms deep-learning deployment django ec2-instance exploratory-data-analysis feature-engineering flask machine-learning neural-networks python3 statistics
Last synced: 01 May 2025
https://github.com/phuijse/pythonbook
Computación Científica con Python
data-science jupyter machine-learning numerical-computation python scientific-computing
Last synced: 12 Apr 2025
https://github.com/data-centric-ai-community/nist-crc-2023
NIST Collaborative Research Cycle on Synthetic Data. Learn about Synthetic Data week by week!
ctgan data-analysis data-science deeplearning deidentification gans generative-adversarial-network machine-learning privacy-enhancing-technologies python synthetic-data synthetic-dataset-generation
Last synced: 23 Apr 2026
https://github.com/nuhmanpk/webtrench
A powerful and easy-to-use web scrapper for collecting data from the web. Supports scraping of images, text, videos, meta data, and more. Ideal for machine learning and deep learning engineers. Download and extract data with just one line of code
audio-datasets data data-collection data-science dataset-generation deep-learning image-data-generator machine-learning python scarper text-datasets
Last synced: 21 Mar 2025
https://github.com/erfaniaa/financial-indexes-correlation
Analyze financial data correlations
algo-trading crypto cryptocurrency data-science finance financial-analysis statistics
Last synced: 22 Mar 2025
https://github.com/edrewitz/wxdata
A Python package of end-to-end weather data clients & raw data clients with VPN/PROXY support, data processors that decode variable keys from GRIB format into a plain-language format & various tools for assisting Python automated workflows, querying meteorological datasets and filling gaps in meteorological data.
automation data data-clients data-engineering data-engineering-pipeline data-processing data-processing-pipelines data-science meteorology meteorology-library python weather-data
Last synced: 23 May 2026
https://github.com/fusedio/fused-mcp
Fused MCP Agents: Setting up MCP Servers for Data Scientists
data-science fused mcp python udf
Last synced: 10 Aug 2025
https://github.com/chalk-ai/examples
Curated examples and patterns for using Chalk. Use these to build your feature pipelines.
chalk data data-science ml ml-ops pipeline python
Last synced: 17 Jan 2026
https://github.com/city-of-helsinki/mlops-template
Generic repository template for small scale MLOps
data-science datascience machine-learning machinelearning mlops python
Last synced: 13 May 2025
https://github.com/tlverse/causalglm
Interpretable and model-robust causal inference for heterogeneous treatment effects using generalized linear working models with targeted machine-learning
causal-inference causal-learning data-science generalized-linear-models heterogeneous-treatment-effects high-dimensional-inference interpretable-machine-learning machine-learning marginal-structural-models nonparametric-statistics projection r r-package relative-risk-regression robust-statistics semiparametric-estimation statistics targeted-learning treatment-effects working-model
Last synced: 19 Feb 2026
https://github.com/bgroenks96/normalizing-flows
Implementations of normalizing flows using python and tensorflow
data-science machine-learning machine-learning-algorithms normalizing-flows
Last synced: 09 Mar 2026
https://github.com/medoidai/skrobot
skrobot is a Python module for designing, running and tracking Machine Learning experiments / tasks. It is built on top of scikit-learn framework.
artificial-intelligence data-science feature-engineering feature-selection hyperparameter-tuning machine-learning model-evaluation model-selection model-training model-tuning open-source predictive-modelling python scikit-learn
Last synced: 02 Aug 2025
https://github.com/flrs/predicting_the_wind
Data Science in Wind Resource Assessment Tutorial at PyData San Diego, March 2020
binder binder-ready data-science gis jupyter-notebook presentation renewables tutorial wind wind-energy wind-energy-analytics wind-estimate wind-resource-assessment
Last synced: 02 Sep 2025
https://github.com/dorukkarinca/keras-buoy
Keras wrapper that autosaves what ModelCheckpoint cannot.
autosave checkpointing colab colab-automation colab-notebook colaboratory data-science keras machine-learning
Last synced: 04 Oct 2025
https://github.com/RConsortium/r-collaboration
Open Collaboration, Data Registry, and Use Cases Developed by the R Community
data-analysis-in-r data-analytics data-science r
Last synced: 20 Jul 2025
https://github.com/pgebert/bike-sharing-dataset
Analysis and model development for the Kaggle Bike Sharing Dataset.
bike-sharing-dataset bikesharing data-science jupyter kaggle python
Last synced: 09 Oct 2025
https://github.com/lucko515/ml_tutor
Machine Learning Tutor Python library
classification clustering data-science education jupyter-notebook machine-learning python regression tutorials
Last synced: 08 Aug 2025
https://github.com/incubated-geek-cc/Text-To-Speech-App
A Fusion of OCR Technology (Tesseract.js) & Web Speech API. Standalone, portable and works offline.
data-science javascript machine-learning ocr ocr-recognition tesseract tesseract-ocr tesseract-ocr-api tesseractjs webapp
Last synced: 16 Apr 2025
https://github.com/sashakolpakov/dire-jax
DImensionality REduction in JAX
cpu data-science data-visualization dimensionality-reduction embeddings gpu jax machine-learning pca random-projection tpu tsne umap vector-embeddings
Last synced: 04 Oct 2025
https://github.com/tomasonjo/bitcoin-to-neo4jdash
Project that listens to bitcoin websocket API for new transactions and stores them to Neo4j to be analyzed
bitcoin dashboard data data-science graph graphdatabase neo4j python websocket
Last synced: 03 Jul 2025
https://github.com/koonimaru/omniplot
Statistical analysis, clustering and visualinzing scientific data with hassle free
data-science matplotlib numpy pandas python
Last synced: 15 Apr 2025
https://github.com/gagandeepb/frames-beam
Accessing Postgres in a data frame in Haskell
data-science database postgres
Last synced: 20 Aug 2025
https://github.com/pbrdng/learningalgebraicvarieties.jl
Learning Algebraic Varieties from Samples
algebraic-geometry data-science
Last synced: 11 Nov 2025
https://github.com/jaimevalero/push-kaggle-dataset
Github action to upload datasets to kaggle
automation data-science github-actions kaggle kaggle-datasets
Last synced: 18 Jan 2026
https://github.com/nfmcclure/datascience350
Notes for Data Science 350 Class
bayesian-methods data-science genetic-algorithm hypothesis-testing linear-regression neural-network python r teaching-materials
Last synced: 05 May 2025
https://github.com/thechymera/behaviopy
Behavioral data analysis and plotting in Python.
animal-behavior biomedical data-science foss multimodality plotting
Last synced: 19 Apr 2025
https://github.com/devinterview-io/python-ml-interview-questions
🟣 Python ML interview questions and answers to help you prepare for your next machine learning and data science interview in 2024.
ai-interview-questions coding-interview-questions coding-interviews data-science data-science-interview data-science-interview-questions data-scientist-interview interview-practice interview-preparation machine-learning machine-learning-and-data-science machine-learning-interview machine-learning-interview-questions python-ml python-ml-interview-questions python-ml-questions python-ml-tech-interview software-engineer-interview technical-interview-questions
Last synced: 05 May 2025
https://github.com/github/mlops
Use GitHub to facilitate automation, collaboration and reproducibility in your machine learning workflows.
actions cicd data-science devops-tools machine-learning mlops pages primer primer-design
Last synced: 04 Oct 2025
https://github.com/isala404/speculo
Realtime face detection and recognition using deep learning
data-science face-recognition faces footages opencv python3 reactjs speculo surveillance tensorflow typescript
Last synced: 31 Aug 2025
https://github.com/curso-r/zen-do-r
Um livro sobre programação para não-programadores.
Last synced: 26 Oct 2025
https://github.com/jameslamb/talks
Conference talks, meetup talks, and misc. writing
conference-talk data-science machine-learning open-source presentations python r
Last synced: 06 Sep 2025
https://github.com/laactechnology/foxcross
AsyncIO serving for data science models
async data-science dataframe http machine-learning pandas python pytorch rest-api scikit-learn serving
Last synced: 14 Mar 2026
https://github.com/notesjor/corpusexplorer2.0
Korpuslinguistik war noch nie so einfach...
big-data cleaning-data cooccurrence corpus-linguistics corpus-processing data-minig data-mining data-science datajournalism journalism linguistics natural-language-processing natural-language-understanding nlp sdk tagger text-analysis text-mining text-processing visualization
Last synced: 17 Jan 2026
https://github.com/primaryobjects/fashion
The Fashion-MNIST dataset and machine learning models.
artificial-intelligence artificial-neural-networks classification data-science dataset fashion fashion-mnist image-classification image-recognition machine-learning mnist r supervised-learning support-vector-machines svm xgboost
Last synced: 26 Mar 2025
https://github.com/dhaitz/data-science-links
A curated list of links to great data science articles, videos, ...
agile ai artificial-intelligence career-advice data-science data-scientists machine-learning
Last synced: 12 Jun 2025
https://github.com/mrankitgupta/kaggle-pandas-solved-exercises
I'm sharing my Kaggle Pandas Course - Exercise complete solution notebook which I have solved while undertaking this course.
66daysofdata ankitgupta ankittalks data-analysis data-science data-structures data-visualization datascience kaggle kaggle-notebook kaggle-notebooks mrankitgupta pandas pandas-dataframe pandas-library pandas-python pandas-tutorial python python-library python3
Last synced: 22 Apr 2025
https://github.com/microsoft/autobrewml
With AutoBrewML Framework the time it takes to get production-ready ML models with great ease and efficiency highly accelerates.
anomaly-detection azure-automl cleansing-data data-science datavisualization machine-learning microsoft nlp-machine-learning responsible-ml sampling-strategies text-analysis text-classification text-summarization
Last synced: 12 Mar 2026
https://github.com/incubated-geek-cc/text-to-speech-app
A Fusion of OCR Technology (Tesseract.js) & Web Speech API. Standalone, portable and works offline.
data-science javascript machine-learning ocr ocr-recognition tesseract tesseract-ocr tesseract-ocr-api tesseractjs webapp
Last synced: 03 Mar 2026
https://github.com/aromanro/machinelearning
From linear regression towards neural networks...
adagrad adam-optimizer backpropagation data-analysis data-science generalized-linear-models gradient-descent linear-regression logistic-regression machine-learning machine-learning-algorithms multilayer-perceptron-network nadam nesterov-accelerated-sgd nesterov-momentum neural-network rmsprop
Last synced: 16 Mar 2025
https://github.com/hoangsonww/north-carolina-household-analysis
🏠 This repository contains data analysis scripts for the 2022 American Community Survey (ACS) focusing on individuals aged 25 and over in North Carolina, based on 75,340 observations. This repository offers valuable insights into demographic and economic patterns across North Carolina's urban areas.
confidence-interval confidence-score data data-analysis data-analytics data-science data-visualization ggplot2 hypothesis-testing hypothesis-tests north-carolina r r-language r-programming stata
Last synced: 11 Apr 2025
https://github.com/yandexdataschool/idao-2019-muon-id
Problem for IDAO 2019 on LHCb Muon Identification
competitive-programming data-science high-energy-physics lhcb machine-learning
Last synced: 10 Apr 2025
https://github.com/pysiakk/genetictree
Constructing decision trees with genetic algorithm with a scikit-learn inspired API
classification data-science evolutionary-algorithm genetic genetic-algorithm genetic-programming genetictree machine-learning python python-library scikit-learn tree
Last synced: 13 Apr 2025
https://github.com/chalmerlowe/machine_learning
A gentle introduction to machine learning: data handling, linear regression, naive bayes, clustering
data data-science linear-regression machine-learning nearest-neighbors python scikit-learn
Last synced: 10 Apr 2025
https://github.com/ahmedosamamath/statistics-basics
A comprehensive guide to applying statistical techniques in machine learning, including data preprocessing, model development, evaluation metrics, and real-world applications. This repository provides beginner-to-advanced insights into the statistical foundations of machine learning.
artificial-intelligence data-analysis data-science machine-learning statistics
Last synced: 12 Apr 2025
https://github.com/code2k13/feed-visualizer
Feed Visualizer creates interactive visualizations by clustering RSS/Atom feed items based on semantic similarity. Feed Visualizer also attempts to automatically predict the labels for each cluster. This application will create a "semantic summary" of a website's contents by scanning its RSS/Atom feed, allowing for easy discovery and navigation to topics of interest. Feed Visualizer creates interactive visualizations in the form of static HTML and JS files, which may be edited and sent to a server.
artificial-intelligence atom data-science data-visualization machine-learning no-code python rss semantic-similarity visualization
Last synced: 06 May 2025
https://github.com/gabrieldim/a1on-webscraping-pandas-data-science
Learning WebScraping using Pandas in python. - Data Science
data data-science pandas sciecne web-scraping
Last synced: 10 Jul 2025
https://github.com/timetoai/timediffusion
Unified Framework for Multiple Time Series Tasks
data-science deep-learning deep-neural-networks framework machine-learning machine-learning-algorithms multi-task multi-task-architecture multiple-tasks open-source pypi-package python python3 pytorch pytorch-implementation time-series time-series-forecasting time-series-imputation time-series-prediction time-series-simulation
Last synced: 10 Apr 2025
https://github.com/florents-tselai/pandas-sets
Set-oriented Operations in Pandas
data-science pandas set-operations sets
Last synced: 11 Apr 2025
https://github.com/brpy/ml-books
A list of freely available Machine Learning related books.
books data-science free freely machine-learning statistics
Last synced: 20 Jan 2026
https://github.com/humburg/reportmd
Create multi-page HTML reports in R
data-science r rmarkdown rstudio
Last synced: 20 Mar 2025
https://github.com/smathot/eeg_eyetracking_parser
Python routines for parsing of combined EEG and eye-tracking data
data data-science eeg eye eye-tracking mne pupillometry python
Last synced: 10 Apr 2025
https://github.com/goplus/pandas
Flexible and powerful data analysis / manipulation library for Go+, providing labeled data structures similar to R data.frame objects, statistical functions, and much more
data-analysis data-science data-tech go golang gop goplus pandas scientific-computing
Last synced: 30 Apr 2025
https://github.com/jmshea/foundations-of-data-science-with-python
Interactive flashcards and quizzes, as well as additional tutorials, animations, and code, for "Foundations of Data Science with Python" by John M. Shea
data-science data-visualization probability statistics statistics-course
Last synced: 13 Apr 2025
https://github.com/dayyass/latent-semantic-analysis
Pipeline for training LSA models using Scikit-Learn.
data-science hacktoberfest latent-semantic-analysis lsa machine-learning natural-language-processing nlp pipeline python topic-modeling
Last synced: 13 Apr 2025
https://github.com/bbva/data-refinery
Data transformation
data data-science datascience etl etl-pipeline machine-learning
Last synced: 21 Jun 2025
https://github.com/jphall663/diabetes_use_case
Sample use case for Xavier AI in Healthcare conference: https://www.xavierhealth.org/ai-summit-day2/
data-mining data-science explainable-ml healthcare iml interpretability interpretable-machine-learning interpretable-ml machine-learning machine-learning-interpretability python transparency xai xgboost
Last synced: 19 Jul 2025
https://github.com/dachosen1/data-science-guides-
Collections of data science guides
cheatsheet data-science machine-learning programming-language
Last synced: 24 Jan 2026
https://github.com/frictionlessdata/datapackage-java
A Java library for working with Frictionless Data Data Packages.
data-science datapackage datapackage-java frictionlessdata java java-8 java8 json-schema
Last synced: 16 Mar 2026
https://github.com/redhat-na-ssa/flyingthings
Computer vision demo
ai artificial-intelligence computer-vision data-science openshift tekton yolo yolov5
Last synced: 30 Jul 2025
https://github.com/Ritvik19/Data-Science-From-Scratch
Implementation of various data science techniques and research papers
artificial-neural-networks classification computer-vision convolutional-neural-network data-science deep-learning generative-adversarial-network machine-learning natural-language-processing natural-language-understanding recurrent-neural-networks regression transfer-learning transformer
Last synced: 02 Nov 2025
https://github.com/mainakrepositor/diabetes-prediction-system
Predict Diabetes and its possibility of occurrence from the pathological lab reports on your own.
data-science dataset-generation decision-tree-classifier diabetes-prediction multipage-application randomforestclassifier slider-component streamlit-webapp visualization webapplication
Last synced: 02 Aug 2025
https://github.com/facultyai/boltzmannclean
Fill missing values in Pandas DataFrames using Restricted Boltzmann Machines
data-cleaning data-science dataframe pandas restricted-boltzmann-machine
Last synced: 27 Jun 2025
https://github.com/mrtkp9993/deeplearningexamples
Deep learning examples with Python and Tensorflow & Keras.
artificial-intelligence classification colab convolutional-neural-networks data-science deep-learning deep-learning-example deep-neural-networks deeplearningexamples dense-neural-network keras machine-learning python python3 regression tensorflow
Last synced: 07 May 2025
https://github.com/deeptiman/go-batch
A Simple Batch Processing library in Go
batch-processing batch-reader concurrency concurrent-programming data-science go-library go-modules golang golang-channel golang-concurrency golang-library golang-tools parallel-computing parallel-processing parallel-programming supply-chain-data-science workers
Last synced: 03 Jul 2025
https://github.com/capnion/ghostpii_client
This repository contains the Python library for interacting with Capnion's private computation API. Together this library and the API make up Ghost PII.
analytics data-science encryption privacy-enhancing-technologies
Last synced: 21 Feb 2026
https://github.com/kennethleungty/Anomaly-Detection-Pipeline-Kedro
Anomaly Detection Pipeline with Isolation Forest model and Kedro framework
anomaly anomaly-detection credit-card credit-card-fraud data-science data-science-pipeline financial financial-data fraud fraud-detection kedro machine-learning machine-learning-pipeline ml mlops pipelines quantumblack
Last synced: 24 Mar 2025
https://github.com/mainakrepositor/covidview
A complete COVID-19 tracker cum dashboard website made by me.
bootstrap covid-19 covid19-data covid19-tracker css3 data-science javascript-framework netlify reactjs redux tailwindcss template webdevelopment
Last synced: 02 May 2025
https://github.com/mainakrepositor/brs
Recommend books using Machine Learning Techniques
Last synced: 19 Jun 2025
https://github.com/asavinov/lambdo
Feature engineering and machine learning: together at last!
data-analysis data-mining data-science feature-engineering forecasting forecasting-models machine-learning time-series
Last synced: 27 Mar 2025
https://github.com/wearepal/ethicml
Package for evaluating the performance of methods which aim to increase fairness, accountability and/or transparency
algorithmic-fairness computer-vision data-science ethical-artificial-intelligence ethical-data-science fairness fairness-ai fairness-assessment fairness-awareness-model fairness-comparison fairness-ml machine-bias machine-learning pytorch responsible-ai toolkit
Last synced: 21 Aug 2025
https://github.com/codingforentrepreneurs/serverless-python-workflow-with-aws-lambda
A tutorial to setup and deploy a simple Serverless Python workflow with REST API endpoints in AWS Lambda.
aws aws-lambda data-science etl etl-pipeline python serverless webscraping
Last synced: 18 Jan 2026
https://github.com/anuraganalog/datacamp
My Solutions to Datacamp projects and courses(datacamp-exercises)
analysis business courses data-science datacamp datacamp-exercises datacamp-projects datacamp-python datacamp-slides finance jupyter-notebook learning machine r solutions sql statistics tableau theory
Last synced: 29 Aug 2025
https://github.com/sap-samples/btp-data-to-value-workshop
This repo contains a dataset, exercises, and sample code for an end-to-end SAP BTP data-to-value bootcamp covering SAP HANA Cloud, SAP Data Warehouse Cloud, SAP Data Intelligence Cloud, and SAP Analytics Cloud.
advanced-analytics analytics data-management data-orchestration data-science data-to-value machine-learning predictive-planning sample sample-code sap-analytics-cloud sap-btp sap-data-intelligence-cloud sap-data-warehouse-cloud sap-hana-cloud workshop
Last synced: 13 Apr 2025
https://github.com/amkrajewski/nimcso
nim Composition Space Optimization is a high-performance tool leveraging metaprogramming to implement several methods for selecting components (data dimensions) in compositional datasets, as to optimize the data availability and density for applications such as machine learning.
data-analysis data-optimization data-science materials-informatics metaprogramming nim nim-lang
Last synced: 09 Apr 2025
https://github.com/devinterview-io/cnn-interview-questions
🟣 CNN interview questions and answers to help you prepare for your next machine learning and data science interview in 2024.
ai-interview-questions cnn cnn-interview-questions cnn-questions cnn-tech-interview coding-interview-questions coding-interviews data-science data-science-interview data-science-interview-questions data-scientist-interview interview-practice interview-preparation machine-learning machine-learning-and-data-science machine-learning-interview machine-learning-interview-questions software-engineer-interview technical-interview-questions
Last synced: 08 Jan 2026
https://github.com/kf5i/k3ai-core
K3ai-core is the core library for the GO installer. Go installer will replace the current bash installer
argo artificial-intelligence continuous-integration data-science golang k3s kubeflow kubernetes-deployment machine-learning machinelearning ml pipeline
Last synced: 14 Jan 2026
https://github.com/somdeep/Statball
Statball - Football soccer stats analyser from top 5 european leagues with data obtained by web scraping from Fbref and Statsbomb
csharp data-science data-scraping data-viz dotnet dotnet-core fbref football football-analytics football-data scouting-data scraping soccer soccer-analytics soccer-data statsbomb tableau visualizations
Last synced: 02 Apr 2025
https://github.com/a2i2/surround
Surround is a framework for building AI driven microservices in Python, https://surround.readthedocs.io/en/latest/
data-science machine-learning model-serving pipeline-framework python
Last synced: 14 Jan 2026
https://github.com/catdevnull/preciazo
analisis de precios en supermercados minoristas. en constante evolución https://preciazo.nulo.lol
data data-science price-tracker scraper supermarket
Last synced: 17 Mar 2025
https://github.com/mlops-ai/mlops
Open-source tool for tracking & monitoring machine learning models.
ai data-science machine-learning mlflow mlops neptune python
Last synced: 14 Jan 2026
https://github.com/asampat3090/open-datasets
Running list of Open Datasets
artificial-intelligence data data-science neural-network open-datasets open-source
Last synced: 26 Jan 2026
https://github.com/bessouat40/raglight
RAGLight is a lightweight and modular Python library for implementing Retrieval-Augmented Generation (RAG), Agentic RAG and RAT (Retrieval augmented thinking)..
agent agentic-ai agentic-rag agentic-workflow artificial-intelligence automation data-science embeddings framework huggingface inference llm lmstudio mistral-api mistralai ollama rag retrieval-augmented retrieval-augmented-generation vector-database
Last synced: 04 Mar 2026
https://github.com/kennethleungty/anomaly-detection-pipeline-kedro
Anomaly Detection Pipeline with Isolation Forest model and Kedro framework
anomaly anomaly-detection credit-card credit-card-fraud data-science data-science-pipeline financial financial-data fraud fraud-detection kedro machine-learning machine-learning-pipeline ml mlops pipelines quantumblack
Last synced: 12 Jul 2025
https://github.com/alenrajsp/tcxreader
tcxreader is a reader / parser for Garmin’s TCX file format. It also works well with missing data!
data-mining data-science python sports-analytics tcx tcx-parser
Last synced: 09 Apr 2025
https://github.com/jphall663/hc_ml
Slides, videos and other potentially useful artifacts from various presentations on responsible machine learning.
accountability data-mining data-science explainable-ai explainable-ml fairness fairness-ai fairness-ml fatml iml interpretability interpretable-ai interpretable-machine-learning interpretable-ml machine-learning machine-learning-interpretability transparency xai
Last synced: 27 Jan 2026