Data Science
Data science is an inter-disciplinary field that uses scientific methods, processes, algorithms, and systems to extract knowledge from structured and unstructured data. Data scientists perform data analysis and preparation, and their findings inform high-level decisions in many organizations.
- GitHub: https://github.com/topics/data-science
- Wikipedia: https://en.wikipedia.org/wiki/Data_science
- Related Topics: data-analysis, data-mining, machine-learning, big-data, data-visualization,
- Aliases: datasciences, data-science-project, data-science-algorithm,
- Last updated: 2026-07-03 00:07:42 UTC
- JSON Representation
https://github.com/devinterview-io/supervised-learning-interview-questions
๐ฃ Supervised Learning interview questions and answers to help you prepare for your next machine learning and data science interview in 2025.
ai-interview-questions coding-interview-questions coding-interviews data-science data-science-interview data-science-interview-questions data-scientist-interview interview-practice interview-preparation machine-learning machine-learning-and-data-science machine-learning-interview machine-learning-interview-questions software-engineer-interview supervised-learning supervised-learning-interview-questions supervised-learning-questions supervised-learning-tech-interview technical-interview-questions
Last synced: 09 Feb 2026
https://github.com/eliasdabbas/dash-aggrid-scales
Color scales (continuous and categorical) and bar charts for Dash-Ag-Grid
aggrid color-scales color-scheme data-science data-visualization html plotly-dash table
Last synced: 16 Mar 2026
https://github.com/elliotwutingfeng/twitter200m
Simple analysis of the Twitter 200M Data Dump of January 2023.
200m data-science haveibeenpwned leak osint twitter
Last synced: 16 Mar 2026
https://github.com/bodo-ai/pydough
Analytics DSL for Python
analytics artificial-intelligence big-data data-science defog defog-ai machine-learning pandas python sql text-to-analytics text-to-sql tpch
Last synced: 22 May 2026
https://github.com/devopscorner/nifi
Production Grade Nifi & Nifi Registry. Deploy for VM (Virtual Machine) with Terraform + Ansible, Helm & Helmfile for Kubernetes (EKS)
ansible data-science data-structures docker docker-compose dockerhub ecr eks eks-cluster etl kubernetes machine-learning ml mlops nifi nifi-registry terraform vpn vpn-client
Last synced: 08 Sep 2025
https://github.com/canagnos/mcp
Tools for Measuring Classification Performance for R, Python and Spark
artificial-intelligence classification data-mining data-science machine-learning machine-learning-algorithms
Last synced: 28 Apr 2025
https://github.com/bsomps/OpenGeoPlotter
A PyQt5 app catered to the exploration industry for visualizing geologic drill hole data with features like cross-sections, simple 3D views, strip logs, scatter plots, and downhole line plots. Includes data transformation techniques like factor analysis, desurveying, and alpha-beta conversion.
cross-sections data-science drilling exploration geology geoscience pyqt5 python strip-logs
Last synced: 05 Mar 2025
https://github.com/zenml-io/template-starter
A template for a starter project for ZenML
cookiecutter copier-template data-science machine-learning mlops zenml
Last synced: 14 Apr 2025
https://github.com/ashishbamania/tutorials-on-artificial-intelligence
A collection of AI tutorials from Dr. Ashish Bamania
agentic-ai ai ai-agents artificial-intelligence crewai data-science langchain llama machine-learning ml rag retreival-augmented-generation software-engineering
Last synced: 13 May 2025
https://github.com/doarakko/kagoole
Search kaggle competitions and solutions based on data and predict type, evaluation metric, etc.
artificial-intelligence data-science heroku kaggle kaggle-competition kaggle-solution machine-learning webapp
Last synced: 17 Oct 2025
https://github.com/anshchoudhary/xgmodel
This repository contains code to predict the Expected Goals (xG) from shots in football using various machine learning models.
data-science football-analytics football-data machine-learning machine-learning-algorithms
Last synced: 10 Apr 2025
https://github.com/shwetajoshi601/world-bank-data-analysis
An Exploratory Data Analysis on the World Bank Dataset.
analysis data-science eda python3 world-bank-api worldbank
Last synced: 02 Aug 2025
https://github.com/chongyasong/youml
YouML: A Machine Learning Toolkit
ai artificial-intelligence big-data data-mining data-science machine-learning matplotlib numpy pandas python scikit-learn scipy
Last synced: 11 Apr 2025
https://github.com/mathewroy/ynabr
Analyze and visualize your You Need A Budget (YNAB) data. YNAB meets R programming language.
api data-analysis data-science data-visualization r ynab ynab-api
Last synced: 30 Jul 2025
https://github.com/takuti/anompy
A Python library for anomaly detection
anomaly-detection data-science forecasting machine-learning python
Last synced: 15 Apr 2025
https://github.com/hoangsonww/standard-deviation-calculator
๐ This repository contains a Standard Deviation Calculator implemented in C++. It provides an efficient algorithm for calculating the statistical standard deviation of a dataset, making it a valuable tool for students, researchers, and analysts seeking a reliable method for data analysis.
algorithms cplusplus cpp data data-analysis data-analytics data-science standard-deviation standard-deviation-calculator standard-deviations
Last synced: 22 Sep 2025
https://github.com/tomaztk/List_of_R_packages_for_Data_scientist
List of useful R packages for data scientists
data-science r r-language r-markdown r-package r-programming statistics
Last synced: 30 Jul 2025
https://github.com/h2oai/article-information-2019
Article for Special Edition of Information: Machine Learning with Python
data-science explainable-ai explainable-ml fairness-ai fairness-ml fairness-testing fatml iml interpretable-ai interpretable-machine-learning interpretable-ml machine-learning machine-learning-interpretability python xai
Last synced: 07 Apr 2025
https://github.com/emptymalei/audiorepr
A python package to represent data using musical notes.
audiolization data data-audiolization data-science
Last synced: 12 Oct 2025
https://github.com/bdist/bdist-workspace
This repository provides containerized applications and microservices for the Information Systems and Databases Course @ Instituto Superior Tรฉcnico
data-engineering data-science docker jupyter jupyterlab notebook postgres postgresql python sql sqlite
Last synced: 09 Apr 2026
https://github.com/dmedri/roaster
R - Fetch, build and deploy.
build-tool data-science rstats statistical-analysis statistics virtual-environments
Last synced: 30 Jul 2025
https://github.com/sdpython/mlstatpy
Mathematics, Algorithmic, Data-Science, Teaching Materials
algorithms data-science mathematics python3 teaching-materials
Last synced: 23 Jun 2025
https://github.com/jhrcook/tidy-tuesday
#TidyTuesday to practice data analysis in R
data-analysis data-science r regression-models rlang tidytuesday tidytuesday-challenge tidyverse
Last synced: 28 Oct 2025
https://github.com/eyadsibai/machine-learning-docker-image
Data Science/Machine Learning Docker Image for CPU
data-science docker docker-image google-cloud machine-learning
Last synced: 30 Apr 2025
https://github.com/openbridge/ob_pysh-db
pysh-db - The Data Science Toolkit (DSK)
bash data-science mysql postgres python redshift sql
Last synced: 10 Apr 2025
https://github.com/sjcobb/ai-duet-3d
3D music animation + machine learning (in development)
3d-animation 3d-audio 3d-game artificial-intelligence browser-game data-science data-visualization game-development generative-music javascript machine-learning music music-bot music-composition music-theory music-visualizer neural-network web-development youtube-channel
Last synced: 28 Oct 2025
https://github.com/devinterview-io/reinforcement-learning-interview-questions
๐ฃ Reinforcement Learning interview questions and answers to help you prepare for your next machine learning and data science interview in 2024.
ai-interview-questions coding-interview-questions coding-interviews data-science data-science-interview data-science-interview-questions data-scientist-interview interview-practice interview-preparation machine-learning machine-learning-and-data-science machine-learning-interview machine-learning-interview-questions reinforcement-learning reinforcement-learning-interview-questions reinforcement-learning-questions reinforcement-learning-tech-interview software-engineer-interview technical-interview-questions
Last synced: 15 Jun 2025
https://github.com/faridrashidi/cnsplots
๐จ Toolkit for generating publication-quality plots for Cell, Nature and Science journals
data-science data-visualization plotting publication-quality python scientific-publications
Last synced: 06 Apr 2026
https://github.com/jimbrig/lossrx
An R package, plumber API, database, and Shiny App for Actuarial Loss Development and Reserving Workflows.
actuarial-science claims-data claims-reserving data-science insurance modelling property-casualty reserving rpackage rshiny rstats workflow
Last synced: 01 Jul 2025
https://github.com/alan-turing-institute/hds-discussiongroup
Repo of the Turing's Humanities & Data Science Discussion Group
data-science digital-humanities discussion-group
Last synced: 03 Mar 2026
https://github.com/martincastroalvarez/html2vec
Algorithm that converts an HTML to a vectorized object suitable for neural networks.
data-science html2vec natural-language-processing python web-scraping word2vec
Last synced: 11 Apr 2025
https://github.com/giswqs/notebook-share
A repo for sharing notebooks
data-science dataviz geospatial jupyter-notebook mapping notebook
Last synced: 07 May 2025
https://github.com/chaganti-reddy/evmarket-india
Electric Vehicle Market Segmentation Analysis in India
data-analysis data-science machine-learning market-segmentation pandas python
Last synced: 12 Apr 2025
https://github.com/alugowski/matrepr
Format matrices and tensors to HTML, string, and LaTeX, with Jupyter integration.
data-science data-visualization data-viz graphblas jupyter numpy numpy-matrix pytorch scipy sparse sparse-data sparse-matrices sparse-matrix sparse-representations tensor tensorflow torch
Last synced: 12 Apr 2025
https://github.com/xuri/excelize-py
Excelize is a Python port of Go Excelize library that allow you to write to and read from XLAM / XLSM / XLSX / XLTM / XLTX files.
calculation chart data-analysis data-science data-visualization ecma-376 excel excelize golang microsoft office ooxml pipy python spreadsheet visualization xlsm xlsx xlsxreader xlsxwriter
Last synced: 07 May 2025
https://github.com/zen-reportz/zen_dash
Simple, Fast, Scalable , production grade dashboard application . Right solution for team
dashboard data-analytics data-science fastapi flask python3 shiny streamlit
Last synced: 13 Apr 2025
https://github.com/fabsta/interesting_notebooks
A collection of Data Science Jupyter notebook (reference material)
data-science eda jupyter-notebook kaggle machine-learning python
Last synced: 03 Jul 2025
https://github.com/krypty/trefle
Trefle is a scikit-learn compatible estimator implementing the FuzzyCoCo algorithm that uses a cooperative coevolution algorithm to find and build interpretable fuzzy systems.
data-science deap evolutionary-algorithm fuzzy-logic interpretability machine-learning python scikit-learn
Last synced: 29 Oct 2025
https://github.com/sjcobb/webxr-threejs-midi-visualizer
WebXR, augmented reality MIDI data visualization, built with Three.js and Tone.js. See video: https://youtu.be/lIecCGtbqSM
3d aframe cannonjs data-science data-visualization depth-estimation game-development hit-detection javascript midi music-theory physics three threejs tone tonejs webvr webxr
Last synced: 12 Jul 2025
https://github.com/devinterview-io/bias-and-variance-interview-questions
๐ฃ Bias And Variance interview questions and answers to help you prepare for your next machine learning and data science interview in 2025.
ai-interview-questions bias-and-variance bias-and-variance-interview-questions bias-and-variance-questions bias-and-variance-tech-interview coding-interview-questions coding-interviews data-science data-science-interview data-science-interview-questions data-scientist-interview interview-practice interview-preparation machine-learning machine-learning-and-data-science machine-learning-interview machine-learning-interview-questions software-engineer-interview technical-interview-questions
Last synced: 08 Feb 2026
https://github.com/clojurecivitas/clojurecivitas.github.io
An open effort to structure learning resources with meaningful connections.
blog clay clojure data-science literate markdown notebooks
Last synced: 24 Jun 2025
https://github.com/anshumansinha3301/safetitude-project-finance
My works as an SDE @Safetitude
data-science data-structures dbms
Last synced: 13 Jul 2025
https://github.com/amirhosseinhonardoust/underwriting-decision-safety-lab
A decision-safety lab for loan approval: trains a baseline classifier, calibrates probabilities (ECE/Brier), sweeps confidence thresholds to build a coverage, quality frontier and outputs a defensible abstention policy (auto-decide vs review). Includes a Streamlit dashboard for report cards, triage UI, and data quality checks.
abstention calibration classification credit-risk data-quality data-science decision-policy loan-approval machine-learning mlops model-evaluation monitoring pandas reliability responsible-ai scikit-learn selective-classification streamlit uncertainty underwriting
Last synced: 10 Jun 2026
https://github.com/opengeos/qgis-leafmap-plugin
A QGIS plugin for leafmap
data-science geospatial leafmap python qgis qgis-plugin
Last synced: 30 Jan 2026
https://github.com/trafficgcn/osmnx_adjacency_matrix_for_graph_convolutional_networks
Creating an Adjacency Matrix Using the Dijkstra Algorithm for Graph Convolutional Networksย GCNs
adjacency-matrix data-science dijkstra dijkstra-algorithm gcn graph graph-algorithms graph-convolutional-networks matrix metrla open-street-map optimal-route osm osmnx python traffic traffic-analysis traffic-congestion
Last synced: 27 Oct 2025
https://github.com/correia-jpv/fucking-awesome-datascience
๐ An awesome Data Science repository to learn and apply for real world problems. With repository starsโญ and forks๐ด
analytics awesome awesome-list data-mining data-science data-scientists data-visualization deep-learning hacktoberfest machine-learning science
Last synced: 27 Apr 2025
https://github.com/tristanbilot/airflow-rbac-roles-cli
A tool to create Airflow RBAC roles with dag-level permissions from cli.
airflow cloud-composer data-engineering data-science gcp permissions pipeline rbac-roles
Last synced: 25 Oct 2025
https://github.com/flbulgarelli/recursos-python
Spanish resources for learning Python
data-science education http imperative-programming object-oriented-programming python testing
Last synced: 30 Oct 2025
https://github.com/tomaztk/list_of_r_packages_for_data_scientist
List of useful R packages for data scientists
data-science r r-language r-markdown r-package r-programming statistics
Last synced: 16 May 2025
https://github.com/devinterview-io/tensorflow-interview-questions
๐ฃ Tensorflow interview questions and answers to help you prepare for your next machine learning and data science interview in 2025.
ai-interview-questions coding-interview-questions coding-interviews data-science data-science-interview data-science-interview-questions data-scientist-interview interview-practice interview-preparation machine-learning machine-learning-and-data-science machine-learning-interview machine-learning-interview-questions software-engineer-interview technical-interview-questions tensorflow tensorflow-interview-questions tensorflow-questions tensorflow-tech-interview
Last synced: 04 Jul 2025
https://github.com/seandavi/machinelearningintro
Machine learning use cases for teaching
data-science machine-learning r rstats teaching-materials tutorial
Last synced: 05 Apr 2025
https://github.com/dovolopor-research/data-science-research-toolbox
๐งฐ ๆฐๆฎ็งๅญฆ็ง็ ๅทฅๅ ท็ฎฑ
data-science data-science-research data-science-resourses research-resources research-tool visualization
Last synced: 05 Jan 2026
https://github.com/srohit0/datasciencegraphalgorithms
Selected Graph Algorithms
astar astar-algorithm astar-pathfinding astar-search cpp data-science datascience depth-first-search dfs-algorithm dijkstra-algorithm dijkstra-shortest-path graph graph-algorithms graph-theory kosaraju kruskal-algorithm prim-algorithm strongly-connected-components topological-sort transpose
Last synced: 15 Apr 2025
https://github.com/ma7555/kerasgen
A Keras/Tensorflow compatible image data generator for TripletLoss
data-generation data-generator data-generators data-science keras keras-tensorflow tensorflow triplet triplet-loss triplet-neural-network
Last synced: 11 Mar 2025
https://github.com/techshot25/healthcare
Insurance cost predictor
bayesian-regression data-analysis data-science linear-regression machine-learning polynomial-regression random-forest-regression
Last synced: 24 Apr 2025
https://github.com/oceannetworkscanada/api-python-client
Provides easy access to ONC data in Python
api data-science ocean-sciences onc python
Last synced: 20 Jul 2025
https://github.com/aaaastark/top-big-data-scientist-questions-for-interview
Top Big Tech Data Science Questions
ai alibaba amazon apple computer-science computer-vision data-engineer data-science deep-learning facebook google ibm intel interview-questions machine-learning netflix nvidia orcale spacex tesla
Last synced: 04 Feb 2026
https://github.com/matteocargnelutti/maguire-lab-seizure-detection-webapp
๐ง Maguire Lab's Deep Learning Seizure Detection WebApp.
data-science eeg-signals-processing neuroscience
Last synced: 21 Apr 2025
https://github.com/inphyt/imdb_sentiment_analysis_bert
BERT Sentiment Classification on the IMDb Large Movie Review Dataset.
bert bert-model data-mining data-mining-algorithms data-mining-python data-science machine-learning machine-learning-algorithms natural-language-processing nlp nlp-machine-learning scikit-learn sentiment-analysis sentiment-classification spacy spacy-models spacy-nlp
Last synced: 30 Apr 2025
https://github.com/subugoe/scholcomm_analytics
Scholarly Communication Analytics with R Blog
bibliometrics data-science distill library rstats scholarly-communication-analytics
Last synced: 24 Feb 2026
https://github.com/mindful-ai-assistants/hackapucsp-2024
๐ HackaPUCSP 2024 - - Data Science and AI Hackathon - Pontifical Catholic University of Sรฃo Paulo
automation data-science design github-actions hackathon-project oneness-consciousness package-manager programming pucsp pytest python3 unittest
Last synced: 11 Jul 2025
https://github.com/lungben/tableio.jl
A glue package for reading and writing tabular data. It aims to provide a uniform api for reading and writing tabular data from and to multiple sources.
arrow csv data data-science database dataframe dataframes excel jdf json-format parquet postgresql sqlite zip
Last synced: 12 Oct 2025
https://github.com/khadkarajesh/internship-preparation-kit
Repository consist the technical and behavioural questions asked by french tech companies for internship
algorithm algorithms coding-interviews codinggame data-science data-structures data-structures-and-algorithms french hacktoberfest hacktoberfest-accepted hacktoberfest2022 internship interview interview-practice interview-preparation interview-questions interview-test leetcode python software-engineering
Last synced: 15 Jul 2025
https://github.com/qpwedev/blockchain-network-visualizer
Blockchain Network Visualizer for TON.
blockchain data-science network ton toncoin
Last synced: 14 Mar 2025
https://github.com/jose-jaen/airbnb
Airbnb price prediction using Machine Learning and Deep Learning
ai algorithms bayes bayesian-optimization bayesian-statistics data-science deep-learning deployment econometrics machine-learning python streamlit xai
Last synced: 15 Apr 2025
https://github.com/csfelix/csfelix.github.io
๐ฑ My Personal Portfolio ๐ฑ
css data-science data-science-competition data-science-portfolio data-science-projects html javascript js portfolio
Last synced: 05 Aug 2025
https://github.com/adivarma27/pyab
Python package for Bayesian & Frequentist A/B Testing
ab-testing bayesian-statistics data-science frequentist-statistics hypothesis-testing marketing statistical-methods statistical-tests
Last synced: 14 Jan 2026
https://github.com/lambdaclass/data_etudes
LambdaClass statistics, machine learning and data science etudes
data-science notebook probability statistics
Last synced: 09 Apr 2025
https://github.com/aruizeac/alexandria
The Alexandria Project is an open-source platform where people can share their knowledge through books, podcasts, docs and videos.
alexandria data-science donation ebooks go golang grpc http kafka knowledge knowledge-sharing library microservice podcasts python societies streaming videos webservice
Last synced: 11 Mar 2026
https://github.com/arv-anshul/yt-watch-history
Analyse your YouTube watch history using Data Science, ML and NLP.
data-science docker docker-compose fastapi ml mlflow mlops mongodb nlp pydantic python3 streamlit youtube-api
Last synced: 22 Apr 2025
https://github.com/mrsaeeddev/ai-interview-questions
๐ค Real-World AI Interview Questions for You!
ai algorithms artificial-intelligence career data-science hacktoberfest hacktoberfest2020 interview interview-questions machine-learning resume
Last synced: 09 Aug 2025
https://github.com/rbhatia46/python-for-data-science
This repository contains iPython notebooks to get you started with sufficient amount of Python you need to learn to get started with your Data Science Journey.
data-science python-basics python3
Last synced: 03 Sep 2025
https://github.com/numeract/rflow
Flexible R Pipelines with Caching
cache data-science pipeline r rflow
Last synced: 28 May 2026
https://github.com/networks-learning/discussion-complexity
Code for "On the Complexity of Opinions and Online Discussions", WSDM 2019
complexity data-science discussion online-discussions opinion-mining paper wsdm
Last synced: 10 Aug 2025
https://github.com/amey-thakur/depression_detection_using_tweets
Twitter Depression Detection
amey ameythakur computer-engineering data-science depression-detection depression-detector machine-learning megasatish nlp project python supervised-machine-learning
Last synced: 29 Aug 2025
https://github.com/lucadibello/it-salary-analysis
๐ฐ Analysis of Salaries in IT Roles: DevOps, Cyber Security, and AI
ai cybersecurity data-science devops jupyter-notebook salary-analysis
Last synced: 03 Jul 2025
https://github.com/mertguvencli/keyword-extractor
This project aims to find "what are the trending techs on Data Science jobs?" using NER.
data-science machine-learning ner nlp python spacy
Last synced: 10 Sep 2025
https://github.com/dalageo/ml-titanicshipwreck
Exploring the World's Most Renowned Shipwreck ๐ข
data-science decision-tree-classifier logistic-regression machine-learning python random-forest-classifier scikit-learn stacking-ensemble titanic-dataset xgboost-classifier
Last synced: 04 Sep 2025
https://github.com/kennethleungty/english-premier-league-var-analysis
Analyzing Video Assistant Referee (VAR) decisions in the English Premier League (2019 - 2021)
data-analysis data-analytics data-science english-premier-league football soccer var
Last synced: 27 Aug 2025
https://github.com/VaibhavAbhimanyooHiwase/Risk_Calculation_using_Backward_Elimination_Algorithm_in_Life_Insurance
Implementation of backward elimination algorithm used for dimensionality reduction for improving the performance of risk calculation in life insurance industry.
alpha-value backward-elimination data-mining-algorithms data-science insurance kaggle-life-insurance life-insurance multiple-linear-regression p-value random-forest risk-analysis risk-assessment risk-calculations risk-modelling risk-models statistical-analysis statistical-data statistical-learning statistical-models statistics
Last synced: 29 Jul 2025
https://github.com/dwhitena/ai-classroom
Code examples for the live, online AI Classroom training:
ai artificial-intelligence data-science machine-learning python pytorch tensorflow
Last synced: 07 Mar 2026
https://github.com/alvarobartt/ea-associate-ds
Electronic Arts (EA) NLP Assignment for: Associate Data Scientist
data-science electronic-arts nlp recruitment-task
Last synced: 12 Apr 2025
https://github.com/devinterview-io/llmops-interview-questions
๐ฃ LLMOps interview questions and answers to help you prepare for your next machine learning and data science interview in 2025.
ai-interview-questions coding-interview-questions coding-interviews data-science data-science-interview data-science-interview-questions data-scientist-interview interview-practice interview-preparation llmops llmops-interview-questions llmops-questions llmops-tech-interview machine-learning machine-learning-and-data-science machine-learning-interview machine-learning-interview-questions software-engineer-interview technical-interview-questions
Last synced: 16 Feb 2026
https://github.com/thomasnield/oreilly_kotlin_for_data_science
Notes, slides, and contents for the O'Reilly videos using Kotlin for Data Science
data-engineering data-science etl kotlin oreilly statistics
Last synced: 27 Mar 2025
https://github.com/l480/rewe-price-data
๐ช Daily updated prices of all items from the German supermarket chain REWE as CSV (including EAN, grammage, product image etc.)
csv data-science ean inflation prices rewe shrinkflation supermarket
Last synced: 11 Jan 2026
https://github.com/yangfa-zhang/lunax
Lunax is a machine learning framework specifically designed for the processing and analysis of tabular data.
data-analysis data-science lunax machine-learning tabular-data
Last synced: 14 Dec 2025
https://github.com/florents-tselai/sqlite-for-data-scientists
Notebooks and supporting files for SQLite for Data Scientists Online Live Training, on OReilly Learning Platform
data-science learning sql sqlite3 training-materials
Last synced: 11 Apr 2025
https://github.com/dhhruv/stock-price-prediction
A deep learning project in which the model was trained using LSTM layers and Tata Stock prices were predicted and compared with thier actual values.
algorithm cli college-project data data-science dataset deep-learning jupyter jupyter-notebook lstm machine-learning prediction science shell stock-price-prediction tata-beverages terminal
Last synced: 03 May 2025
https://github.com/klarna-incubator/mleko
Simplify and accelerate your machine learning development with mleko. Designed with modularity and customization in mind, it seamlessly integrates into your existing workflows. Its robust caching system optimizes performance, taking you from data ingestion to finalized models with unparalleled efficiency.
artificial-intelligence data-science machine-learning pipeline python vaex
Last synced: 11 Apr 2025
https://github.com/ttitcombe/constituencymap
Python code to generate political maps
brexit choropleth choropleth-map data-science election-data map political-science politics united-kingdom visualization
Last synced: 11 Apr 2025
https://github.com/edaaydinea/op1-prediction-of-the-different-progressive-levels-of-alzheimer-s-disease
This is an optional model development project on a real dataset related to predicting the different progressive levels of Alzheimerโs disease (AD).
alzheimer-disease-prediction anova-test catboost-classifier chi-square-test data-science deep-neural-networks keras-neural-networks lightgbm-classifier logistic-regression machine-learning multi-layer-perceptron-classifier neural-networks random-forest-classifier tensorflow xgboost-classifier
Last synced: 11 Apr 2025
https://github.com/jeonghunyoon/machine-learning-lecture-notes
Lecture notes and codes for machine learning
data-science decision-tree deep-learning lecture-notes linear-algebra linear-regression lsa machine-learning naive-bayes-classifier statistics
Last synced: 10 Apr 2025
https://github.com/hourout/linora
Simple and efficient tools for data science.
data-analysis data-mining data-science hyperparameter-optimization lightgbm machine-learning python xgboost
Last synced: 04 Apr 2025
https://github.com/alexioannides/notes-and-demos
Study notes and demos.
data-engineering data-science ml-engineering mlops python
Last synced: 29 Oct 2025
https://github.com/alro10/twitter-sentiment-live
Sentiment analysis for tweets written in Portuguese-Brazil
dash dash-app dash-plotly dashboards data-science plotly portuguese-brazilian python3 sentiment-analysis tweepy tweets vader-sentiment-analysis
Last synced: 17 Jun 2025
https://github.com/rbhatia46/data-preprocessing-template
This repository includes all the Data Preprocessing required before using a dataset on a Machine Learning Model. Please refer README on how to use.
data-preprocessing data-science machine-learning python
Last synced: 11 Apr 2025
https://github.com/hsins/mpl-tc-fonts
๐น๐ผ A package to solve the problem of "Tofu" in your matplotlib plots whenever you're trying to use Traditional Chinese characters in labels or texts.
cjk-characters data-science matplotlib
Last synced: 29 Oct 2025