Data Science
Data science is an inter-disciplinary field that uses scientific methods, processes, algorithms, and systems to extract knowledge from structured and unstructured data. Data scientists perform data analysis and preparation, and their findings inform high-level decisions in many organizations.
- GitHub: https://github.com/topics/data-science
- Wikipedia: https://en.wikipedia.org/wiki/Data_science
- Related Topics: data-analysis, data-mining, machine-learning, big-data, data-visualization,
- Aliases: datasciences, data-science-project, data-science-algorithm,
- Last updated: 2026-07-01 00:07:28 UTC
- JSON Representation
https://github.com/andrea-ballatore/open-geo-data-education
Open Geospatial Datasets for GIS Education: This is a repository of open geospatial datasets to be used in an educational context. I created these files over years of teaching Geographic Data Science and GIS. All original datasets are freely available online with open data licenses (see the dataset attribution for details). All the datasets in this repository have been selected, cleaned, harmonised, and repackaged for GIS exercises in a higher-education context. This is a pretty time-intensive process that other educators can hopefully avoid by using these versions.
data-science geojson geospatial-data geospatial-datasets gis gis-data gis-education tsv
Last synced: 15 Mar 2025
https://github.com/XpressAI/xircuits
Simple visual programming environment for jupyterlab
data-science jupyterlab python
Last synced: 25 Oct 2025
https://github.com/ncfrey/resources
A Highly Opinionated List of Open Source Materials Informatics Resources
data-science getting-started materials-informatics materials-science resources tutorials
Last synced: 17 Mar 2026
https://github.com/ahmedfgad/arithmeticencodingpython
Data Compression using Arithmetic Encoding in Python
arithmetic-coding data-compression data-science entropy-coding lossless-compression-algorithm python
Last synced: 29 Jun 2025
https://github.com/Invictify/Jupter-Notebook-REST-API
Run your jupyter notebooks as a REST API endpoint. This isn't a jupyter server but rather just a way to run your notebooks as a REST API Endpoint.
data-science data-science-pipelines docker dockerfile fastapi jupyter python rest-api
Last synced: 15 Mar 2025
https://github.com/ekramasif/basic-machine-learning
This is a repo of basic Machine Learning what I learn. More to go...
ann artficial-neural-network artificial-intelligence bert-embeddings bert-model blstm collaborate data-science deep-learning embeddings keras lstm machine-learning natural-language-processing neural-network nlp pandas python seaborn tensorflow
Last synced: 15 Mar 2025
https://github.com/woz-u/DS-Student-Resources
Data Science Student Companion Notebooks and Data Lake
data-analysis data-science data-visualization machine-learning nosql python r sql statistics
Last synced: 20 Jul 2025
https://github.com/great-expectations/great_expectations_action
A GitHub Action that makes it easy to use Great Expectations to validate your data pipelines in your CI workflows.
actions continuous-integration data-integrity data-quality data-science mlops
Last synced: 07 Apr 2025
https://github.com/bramvanroy/spacy_conll
Pipeline component for spaCy (and other spaCy-wrapped parsers such as spacy-stanza and spacy-udpipe) that adds CoNLL-U properties to a Doc and its sentences and tokens. Can also be used as a command-line tool.
conll conll-u data-science machine-learning natural-language-processing nlp pandas parser python spacy spacy-extension spacy-pipeline stanford-machine-learning stanford-nlp stanza udpipe
Last synced: 13 Apr 2025
https://github.com/visgl/deck.gl-data
Data for the data visualization library deck.gl examples (https://uber.github.io/deck.gl/#/)
data data-science data-visualization uber
Last synced: 12 Jun 2025
https://github.com/manumerous/vpselector
Visual Pandas Selector: Visualize and interactively select time-series data
data-science data-visualization pandas python selector
Last synced: 26 Mar 2025
https://github.com/gagolews/datawranglingpy
Minimalist Data Wrangling with Python (Open-Access Textbook)
data-analysis data-science data-visualisation data-wrangling jupyter machine-learning matplotlib modelling numpy pandas python python3 scikit-learn scipy scipy-stats seaborn statistics
Last synced: 09 Apr 2025
https://github.com/y-bar/ml-based-anomaly-detection
Spectral Residual
anomaly-detection data-science python
Last synced: 04 Apr 2026
https://github.com/idling-mind/flowfunc
A web-based node editor component for plotly dash
dash data-science data-visualization dataviz flowchart nodeeditor plotly plotly-dash plotting python
Last synced: 01 Feb 2026
https://github.com/produvia/ai-platform
An open-source platform for automating tasks using machine learning models
artificial-intelligence automation data-science deep-learning java keras-models machine-learning model-zoo neural-networks python pytorch-models r task tasks tensorflow-models
Last synced: 30 Sep 2025
https://github.com/imdeepmind/neuralpy
NeuralPy: A Keras like deep learning library works on top of PyTorch
data-science deep-learning keras library machine-learning neural-network neuralpy neuralpy-torch python pytorch
Last synced: 13 Aug 2025
https://github.com/tirthajyoti/synthetic-data-gen
Various methods for generating synthetic data for data science and ML
classification data data-science machine-learning python regression symbolic-computation time-series
Last synced: 30 Apr 2025
https://github.com/mainakrepositor/datasets
A bunch of some 200 datasets. You can call it mini-kaggle :)
csv data data-science database datasets image-files mini-kaggle ml nlp-machine-learning tsv
Last synced: 01 Mar 2025
https://github.com/fneum/data-science-for-esm
data-science energy energy-data energy-system-modelling
Last synced: 05 Apr 2025
https://github.com/rodrigo-arenas/pyworkforce
Standard tools for workforce management, queuing, scheduling, rostering and optimization problems.
begginer-friendly data-science erlangc investigation-of-operation investigations-search looking-for-contributors operations-research optimization ortools python schedule scheduling-algorithms up-for-grabs workforce workforce-management
Last synced: 09 Apr 2025
https://github.com/piquette/qtrn
A cli tool to streamline financial markets data analysis :wrench:
cli data data-science finance go golang options quotes scraper stock stock-analysis stock-market
Last synced: 15 May 2025
https://github.com/thoughtspile/hippotable
๐ฉ๐ปโ๐ฌ๐ Lightweight data analysis in your browser
csv dashboard data-analysis data-science javascript table visualization
Last synced: 06 Oct 2025
https://github.com/flintml/flintml
ML infrastructure for teams that just want to get sh*t done.
data-science deltalake jupyter machine-learning mlops polars
Last synced: 18 Jan 2026
https://github.com/trainingbypackt/applied-deep-learning-with-python
Applied Deep Learning with Python, published by Packt
data-science deep-learning machine-learning python
Last synced: 10 Apr 2025
https://github.com/ploomber/soorgeon
Convert monolithic Jupyter notebooks ๐ into maintainable Ploomber pipelines. ๐
data-engineering data-science jupyter jupyter-notebooks machine-learning mlops workflow
Last synced: 10 Apr 2025
https://github.com/weiji14/zen3geo
The ๐ data science library you've been waiting for~
analysis-ready-data cloud-native cloud-optimized-geotiff composition data-science datapipe earth-observation foss4g geospatial machine-learning-ready-data stac torch torchdata zarr zen
Last synced: 13 Sep 2025
https://github.com/kennethleungty/generative-ai-pharmacist
Generative AI Pharmacist (For Demo Purposes Only)
ai ai-pharmacist artificial-intelligence chatgpt data-science deep-learning generative-ai generative-ai-pharmacist generative-art healthcare machine-learning pharmacist pharmacy
Last synced: 23 Sep 2025
https://github.com/vatshayan/final-year-disease-prediction-project
Final Year Project Diseases Prediction System through Machine Learning. Disease Prediction system with code and documents
btech btech-project btechfinalyear btechproject college-project data-science disease disease-prediction final final-project final-year-project finalyearproject finalyearprojects machine-learning machine-learning-algorithms machinelearning prediction python sem8
Last synced: 21 Mar 2025
https://github.com/frjnn/bhtsne
Parallel Barnes-Hut t-SNE implementation written in Rust.
barnes-hut bhtsne data-science data-visualization dimensionality-reduction machine-learning rust similarity-measures
Last synced: 27 Jul 2025
https://github.com/polymathorg/dataframe
DataFrame in Pharo - tabular data structures for data analysis
data-analysis data-frame data-science data-visualization gsoc hacktoberfest pharo pharo-smalltalk smalltalk statistics tabular-data
Last synced: 05 Apr 2025
https://github.com/siddhujetty/Product-analytics-insights-collection
My Solutions to "A Collection of Data Science Take-Home Challenges" by Giulio Palombo.
data-science machine-learning r-programming solutions take-home-test
Last synced: 29 Jul 2025
https://github.com/PolyMathOrg/DataFrame
DataFrame in Pharo - tabular data structures for data analysis
data-analysis data-frame data-science data-visualization gsoc hacktoberfest pharo pharo-smalltalk smalltalk statistics tabular-data
Last synced: 11 May 2025
https://github.com/5agado/conversation-analyzer
Analyzer and statistics generator for text-based conversations. Includes Facebook scraper and parser
data-science facebook quantified-self scraper
Last synced: 16 Apr 2025
https://github.com/cannlytics/cannabis-data-science
๐ Cannabis Data Science repository powered by ๐ฅ Cannlytics. ๐งโ๐ Meetup, code, and advance cannabis science ๐งช. Join the fun!
cannabis cannabis-api cannabis-data data-science python statustics
Last synced: 19 Jun 2026
https://github.com/psyplot/psyplot
Python package for interactive data visualization
cartopy climate data-science earth-science earth-system-model interactive matplotlib models netcdf python regression visualization
Last synced: 17 Mar 2025
https://github.com/ndleah/8-week-sql-challenge
#8WeekSQLChallenge by Danny Ma.
data-analysis data-science sql
Last synced: 25 Oct 2025
https://github.com/cloud-cv/evalai-starters
How to create a challenge on EvalAI?
agent ai cv data-science data-science-competition environments evalai get-started getting-started ml reinforcement-learning rl
Last synced: 13 Apr 2025
https://github.com/ECSIM/pem-dataset1
Proton Exchange Membrane (PEM) Fuel Cell Dataset
activation-procedure chemistry data data-science dataset electrochemistry energy fuel-cell impedance mea nafion open-science open-source pem physics polarization power proton-exchange-membrane science science-research
Last synced: 07 May 2025
https://github.com/urbslab/streamline
Simple Transparent End-To-End Automated Machine Learning Pipeline for Supervised Learning in Tabular Binary Classification Data
automl-pipeline binary-classification data-science data-visualization feature-selection imputation machine-learning model-application statistical-analysis supervised-learning
Last synced: 12 Jul 2025
https://github.com/wurmlab/oswitch
Provides access to complex Bioinformatics software (even BioLinux!) in just one command.
bioinformatics data-science docker virtualization
Last synced: 10 Apr 2025
https://github.com/dominodatalab/domino-research
Projects developed by Domino's R&D team
data-science mlflow mlops python sagemaker
Last synced: 11 Apr 2025
https://github.com/kianweelee/Edator
A python package that performs exploratory data analysis for users. Additionally, it generates 3 types of output files (cleaned CSV, plots and a text report).
data-analysis data-science exploratory-data-analysis
Last synced: 08 May 2025
https://github.com/jonrau1/SyntheticSun
SyntheticSun is a defense-in-depth security automation and monitoring framework which utilizes threat intelligence, machine learning, managed AWS security services and, serverless technologies to continuously prevent, detect and respond to threats.
anomaly-detection automation aws aws-security aws-serverless data-science data-visualization elasticsearch geolocation guardduty incident-response kibana machine-learning misp sagemaker security-automation security-tools serverless threat-detection threat-intelligence
Last synced: 12 Jul 2025
https://github.com/nbarrowman/vtree
An R package for calculating and drawing variable trees
data-science data-visualization exploratory-data-analysis r statistics
Last synced: 11 Oct 2025
https://github.com/ibm/kafka-streaming-click-analysis
Use Kafka and Apache Spark streaming to perform click stream analytics
apache-spark clickstream data-science ibm-data-science-experience ibmcode jupyter-notebook kafka spark structured-streaming
Last synced: 03 Oct 2025
https://github.com/scicloj/wolframite
An interface between Clojure and Wolfram Language (the language of Mathematica)
clojure data-science mathematica wolfram-language
Last synced: 05 Apr 2025
https://github.com/aws-samples/aws-fargate-with-rstudio-open-source
This project delivers AWS CDK Python code to provision serverless infrastructure in AWS Cloud to run Open Source RStudio Server and Shiny.
amazon-athena amazon-ecr amazon-ecs amazon-efs amazon-route53 amazon-s3 amazon-ses amazon-vpc aws-cdk aws-codepipeline aws-datasync aws-fargate-application aws-kms aws-lambda aws-secrets-manager aws-wafv2 data-science rstudio-server shiny-apps
Last synced: 29 Jul 2025
https://github.com/Vitruves/nail-parquet
Fast parquet command line tool with many functions, nailed it!
cli command-line-tool data-science database-management parquet parquet-format xlsx
Last synced: 18 Mar 2026
https://github.com/PetoLau/petolau.github.io
Blog about time series data mining in R.
artificial-intelligence blog data-analysis data-mining data-science data-visualization forecasting machine-learning r time-series time-series-analysis time-series-clustering time-series-data-mining time-series-forecasting time-series-prediction
Last synced: 26 Apr 2025
https://github.com/capitalone/dataCompareR
dataCompareR is an R package that allows users to compare two datasets and view a report on the similarities and differences.
compare-data data data-analysis data-science r
Last synced: 30 Jul 2025
https://github.com/devinterview-io/sql-interview-questions
๐ฃ SQL interview questions and answers to help you prepare for your next machine learning and data science interview in 2025.
ai-interview-questions coding-interview-questions coding-interviews data-science data-science-interview data-science-interview-questions data-scientist-interview interview-practice interview-preparation machine-learning machine-learning-and-data-science machine-learning-interview machine-learning-interview-questions software-engineer-interview sql sql-interview-questions sql-questions sql-tech-interview technical-interview-questions
Last synced: 24 Aug 2025
https://github.com/reymond-group/faerun-python
A python module for generating interactive views of chemical spaces.
chemical-spaces chemistry data-science data-visualization plotting python
Last synced: 06 Mar 2026
https://github.com/anovos/anovos
Anovos - An Open Source Library for Scalable feature engineering Using Apache-Spark
bigdata data-science feature-engineering feature-recommendation machine-learning pyspark python scale transformation visualization
Last synced: 08 Apr 2026
https://github.com/aiwithqasim/Free-Artificial-Intelligence-Resources
Welcome, to this Open Source Repository regarding FREE ARTIFICIAL INTELLIGENCE RESOURCE. Get Benefit from the free resources mention & kindly five STAR & FORK this so that it can get maximum Fame so that Everyone can take advantage.
ai article artificial-intelligence artificial-neural-networks blog data-science datascientist deep-learning freeresources hacktoberfest hecktoberfest2021 jobs machine-learning machine-learning-algorithms natural-language-processing nlp project python3 youtube
Last synced: 01 Apr 2025
https://github.com/devinterview-io/data-scientist-interview-questions
๐ฃ Data Scientist interview questions and answers to help you prepare for your next machine learning and data science interview in 2024.
ai-interview-questions coding-interview-questions coding-interviews data-science data-science-interview data-science-interview-questions data-scientist data-scientist-interview data-scientist-interview-questions data-scientist-questions data-scientist-tech-interview interview-practice interview-preparation machine-learning machine-learning-and-data-science machine-learning-interview machine-learning-interview-questions software-engineer-interview technical-interview-questions
Last synced: 11 Jan 2026
https://github.com/glemaitre/pyparis-2018-sklearn
PyParis tutorial on machine learning using scikit-learn
data-science machine-learn pandas scikit-learn
Last synced: 09 Oct 2025
https://diegoinacio.github.io/machine-learning-notebooks/
๐ค An authorial set of fundamental python recipes on Machine Learning and Artificial Intelligence.
algorithms artificial-intelligence data-science deep-learning machine-learning machine-learning-algorithms machine-learning-notebooks mathematics natural-language-processing python
Last synced: 20 Nov 2025
https://github.com/xiaodaigh/jlboost.jl
A 100%-Julia implementation of Gradient-Boosting Regression Tree algorithms
catboost data-science gbdt gbrt lightgbm machine-learning tree tree-boosting-algorithms xgboost
Last synced: 17 Jan 2026
https://github.com/Thomas-George-T/Thomas-George-T
Readme for my :octocat: Profile
data-engineer data-science github github-profile icons machine-learning profile-readme readme svg svg-icons
Last synced: 15 Mar 2025
https://github.com/oneoffcoder/books
A collection of online books for data science, computer science and coding!
books coder computer-science data-science docker java python r scikit-learn scratch software software-development software-engineering spark sphinx tutorials
Last synced: 06 Apr 2025
https://github.com/felipenoris/math-server-docker
The ideal multi-user Data Science server with Jupyterhub and RStudio, ready for Python, R and Julia languages.
data-science docker julia julia-language jupyter jupyter-kernels jupyterhub jupyterlab latex python rstudio-servers shiny-server
Last synced: 21 Sep 2025
https://github.com/erfaniaa/crypto-trading-strategy-backtester
Easy-to-use cryptocurrency trading strategy simulator and backtester
backtesting backtesting-trading-strategies binance bitcoin crypto cryptocurrency data-science dataset dataset-generation machine-learning python quantitative-finance quantitative-trading simulation time-series trading trading-strategies
Last synced: 17 Mar 2025
https://github.com/holgerbrandl/kalasim
Discrete Event Simulator
agent-based-modeling data-science discrete-event-simulation optimization process-modeling simulation visulization
Last synced: 02 Jan 2026
https://github.com/nishkarshraj/automation-using-shell-scripts
Development Automation using Shell Scripting.
anacron at automation automation-framework backup bash-script cron crontab data-science data-structures development linux scenarios scheduler shell shell-scripts sorting-algorithms
Last synced: 15 Apr 2025
https://github.com/balavenkatesh3322/model_deployment
A collection of model deployment library and technique.
aws azure caffe data-science deep-learning keras machine-learning model model-deployment model-server model-serving mxnet neural-network pytorch serving serving-pytorch-models serving-recommendation serving-tensors tensorflow
Last synced: 22 Apr 2025
https://github.com/iterative/dataset-registry
Dataset registry DVC project
data-science dataset dvc example machine-learning registry
Last synced: 18 Jun 2025
https://github.com/bcgov/bcmaps
An R package of map layers for British Columbia
data-science env r r-package rstats
Last synced: 04 Apr 2025
https://github.com/grailbio/bio
Bioinformatic infrastructure libraries
bioinformatics data-science golang
Last synced: 19 Apr 2025
https://github.com/tomasonjo/graphs-network-science
Accompanying repository for my book about Graph Data Science
algorithms data-science graph graph-algorithms machine-learning
Last synced: 30 Apr 2025
https://github.com/umessen/fhir-pyrate
FHIR-PYrate is a package that provides a high-level API to query FHIR Servers for bundles of resources and return the structured information as pandas DataFrames. It can also be used to filter resources using RegEx and SpaCy and download DICOM studies and series.
data-science fhir fhirpath healthcare pyrate python ship ukessen ume
Last synced: 03 Feb 2026
https://github.com/maartengr/projects
Data Science Portfolio
data-science jupyter-notebook machine-learning nlp portfolio python pytorch reinforcement-learning
Last synced: 22 Mar 2025
https://github.com/robertmartin8/udemyml
Templates, code and notes for Kirill Eremenko's Machine Learning course
data-science machine-learning python r tutorial udemy udemy-machine-learning
Last synced: 30 Apr 2025
https://github.com/andrewtavis/kwx
BERT, LDA, and TFIDF based keyword extraction in Python
bert data-analysis data-science data-visualization keyword-extraction latent-dirichlet-allocation lda machine-learning multilingual natural-language-processing nlp open-source python python3 text-analysis text-classification text-mining tfidf topic-modeling unsupervised-learning
Last synced: 14 Apr 2025
https://github.com/verynifty/RolodETH
A Rolodex for popular Ethereum chain address.
data-science ethereum ethereum-blockchain
Last synced: 12 May 2025
https://github.com/hsbc/tslumen
A library for Time Series EDA (exploratory data analysis)
analysis data-analysis data-science data-visualization eda exploratory-data-analysis exploratory-data-visualizations pandas profiling python time-series time-series-analysis time-series-eda time-series-profiling timeseries timeseries-analysis timeseries-eda
Last synced: 10 Mar 2026
https://github.com/davidrpugh/pybea
Python package for downloading data from the Bureau of Economic Analysis (BEA) data API.
data-science economics python-3
Last synced: 17 Aug 2025
https://github.com/montanaz0r/bayesian-statistics-the-fun-way
Solutions and workflow for the Bayesian Statistics The Fun Way book in Python
bayesian-data-analysis bayesian-statistics data-science jupyter-notebook numpy pandas probability python scipy statistics
Last synced: 25 Jun 2025
https://github.com/shenxiangzhuang/pythondataanalysis
The data and code that used in my book.
data-science python3 webcrawler
Last synced: 08 Aug 2025
https://github.com/uc-r/Advanced-R
Advanced Analytics with R training material delivered in a 2 day format
data-science educational-materials r training-materials workshop-materials
Last synced: 04 May 2025
https://github.com/lesander/netflix-viewing-activity
:tv: Download your Netflix account viewing activity in JSON or CSV.
chrome-extension csv data-science javascript js json netflix netflix-api
Last synced: 15 Mar 2026
https://github.com/yusufcinarci/data-science-projects
In this repo, there are (beginner-upper) level projects in the field of data science. I will host these projects that I have done in this field every day in this repo. With the hope that it will be useful to those who are interested in the field of data science like me and will just start...
data-analysis data-science data-science-projects jupyter jupyter-notebook python
Last synced: 14 Mar 2026
https://github.com/aiwithqasim/free-artificial-intelligence-resources
Welcome, to this Open Source Repository regarding FREE ARTIFICIAL INTELLIGENCE RESOURCE. Get Benefit from the free resources mention & kindly five STAR & FORK this so that it can get maximum Fame so that Everyone can take advantage.
ai article artificial-intelligence artificial-neural-networks blog data-science datascientist deep-learning freeresources hacktoberfest hecktoberfest2021 jobs machine-learning machine-learning-algorithms natural-language-processing nlp project python3 youtube
Last synced: 17 Mar 2025
https://github.com/jianzhnie/autotabular
Automatic machine learning for tabular data. โก๐ฅโก
automl catboost data-science deep-learning feature-engineering hpo lightgbm machine-learning pytorch-lightning scikit-learn structured-data tabular-data xgboost
Last synced: 06 Sep 2025
https://github.com/data-centric-ai/dcbench
A benchmark of data-centric tasks from across the machine learning lifecycle.
Last synced: 27 Mar 2025
https://github.com/pbower/minarrow
Apache Arrow and Polars compatible, Rust-first columnar data library for real-time and systems workloads
arrow data-science dataengineering polars rust
Last synced: 17 May 2026
https://github.com/devparihar5/complete-data-science-roadmap
Complete Roadmap For Data Science
ai big-data data-analysis data-engineering data-science deep-learning machine-learning mathematics natural-language-processing neural-network python r-programming roadmap statistical-analysis statistics
Last synced: 30 Apr 2025
https://github.com/paris-saclay-cds/ramp-workflow
Toolkit for building predictive workflows on top of pydata (pandas, scikit-learn, pytorch, keras, etc.).
data-challenge data-science python ramp
Last synced: 14 Dec 2025
https://github.com/castlelemongrab/parlance
A minimum-dependency ECMAScript client library and CLI tool for Parler โ a "free speech" social network that accepts real money to buy "influence" points to boost organic non-advertising content
data-science datascience datascraping disinformation es7 hatespeech javascript law-enforcement misinformation node nodejs osint parlance parler social-media social-networks speech twitter
Last synced: 15 Apr 2025
https://github.com/kiprotect/data-privacy-for-data-scientists
A workshop on data privacy methods for data scientists.
anonymization data-privacy data-science education jupyter-notebooks tutorial
Last synced: 26 Jun 2025
https://github.com/mdeff/ntds_2019
Material for the EPFL master course "A Network Tour of Data Science", edition 2019.
data-science education epfl graph-neural-networks graphs network-science
Last synced: 01 Aug 2025
https://github.com/chuongmep/aps-toolkit
An Libray Unlock BIM Data With Autodesk Platform Services
acc ai aps aps-toolkit autodesk-docs autodesk-forge automation big-data bim bim360 data data-analyst data-science database forge llm
Last synced: 14 Apr 2025
https://github.com/shenxiangzhuang/PythonDataAnalysis
The data and code that used in my book.
data-science python3 webcrawler
Last synced: 26 Mar 2025
https://github.com/cannlytics/cannlytics
๐ฅ Cannlytics = cannabis + analytics. Data pipelines, user interfaces, and the best statistics in the game. Made with โค๏ธ
cannabis cannabis-api cannabis-app cannabis-data cannabis-scripts cannabis-strains cannabis-variety cannabisapp data-mining data-science django firebase machine-learning metrc nlp python strain-data terpene-profile terpenes
Last synced: 19 Jun 2026
https://github.com/gesiscss/css_methods_python
A full course of self-explanatory and freely available materials on CSS methods
data-science jupyter-notebook python
Last synced: 15 Mar 2025
https://github.com/brubinstein/diffpriv
Easy differential privacy in R
data-science differential-privacy diffpriv machine-learning r r-package statistics
Last synced: 22 Oct 2025
https://github.com/argilla-io/biome-text
Custom Natural Language Processing with big and small models ๐ฒ๐ฑ
allennlp data-science natural-language-processing nlp pytorch
Last synced: 07 Oct 2025
https://github.com/devsgnr/breadroll
breadroll ๐ฅ is a simple lightweight library for data processing operations written in Typescript and powered by Bun.
bun csv csv-parser data-engineering data-science data-transformation eda exploratory-data-analysis tsv tsv-parser
Last synced: 11 Oct 2025
https://github.com/meteostat/weather-stations
A list of public weather stations everyone can edit and share.
climate data-science json meteostat weather weather-stations
Last synced: 20 Jul 2025
https://github.com/jozi/iranian-developers-in-telegram
Curated List of Persian Groups and Channels for Iranian Developers in Telegram
android-studio data-mining data-science deep-learning delphi iran iranian machine-learning news python sql-server telegram text-mining web-mining
Last synced: 26 Apr 2026
https://github.com/mitre/menelaus
Online and batch-based concept and data drift detection algorithms to monitor and maintain ML performance.
concept-drift data-drift data-science drift-detection machine-learning statistics
Last synced: 16 Mar 2026