Ecosyste.ms: Awesome
An open API service indexing awesome lists of open source software.
Data Science
Data science is an inter-disciplinary field that uses scientific methods, processes, algorithms, and systems to extract knowledge from structured and unstructured data. Data scientists perform data analysis and preparation, and their findings inform high-level decisions in many organizations.
- GitHub: https://github.com/topics/data-science
- Wikipedia: https://en.wikipedia.org/wiki/Data_science
- Related Topics: data-analysis, data-mining, machine-learning, big-data, data-visualization,
- Aliases: datasciences, data-science-project, data-science-algorithm,
- Last updated: 2024-11-11 00:06:40 UTC
- JSON Representation
https://github.com/aws-samples/aws-ml-jp
SageMakerで機械学習モデルを構築、学習、デプロイする方法が学べるNotebookと教材集
aws data-science deep-learning jupyter-notebook machine-learning mlops sagemaker
Last synced: 08 Nov 2024
https://github.com/arabacibahadir/sup-res
A great companion for finding key support and resistance levels on financial charts, cryptocurrencies.
algotrade analysis binance binance-api bitcoin cryptocurrency data-science finance pandas pinescript python stock telegram telegram-bot tradingview
Last synced: 27 Oct 2024
https://github.com/apache/incubator-liminal
Apache Liminals goal is to operationalise the machine learning process, allowing data scientists to quickly transition from a successful experiment to an automated pipeline of model training, validation, deployment and inference in production. Liminal provides a Domain Specific Language to build ML workflows on top of Apache Airflow.
ai airflow big-data data-science machine-learning ml workflows
Last synced: 01 Oct 2024
https://github.com/egenn/rtemis
Advanced Machine Learning and Visualization
data-science data-visualization machine-learning machine-learning-library r rstats visualization
Last synced: 25 Oct 2024
https://github.com/ayush1997/YouTube-Like-predictor
YouTube Like Count Predictions using Machine Learning
data-analysis data-science machine-learning predictive-analysis random-forest visualization youtube-api
Last synced: 07 Aug 2024
https://github.com/ropensci/tarchetypes
Archetypes for targets and pipelines
data-science high-performance-computing peer-reviewed pipeline r r-package r-targetopia reproducibility rstats targets workflow
Last synced: 08 Nov 2024
https://github.com/durgeshsamariya/data-science-machine-learning-project-with-source-code
Data Science and Machine Learning projects with source code.
artificial-intelligence awesome awesome-list data-science data-science-projects machine-learning machine-learning-projects python
Last synced: 08 Nov 2024
https://github.com/dlab-berkeley/R-Fundamentals-Legacy
D-Lab's 12 hour introduction to R Fundamentals. Learn how to create variables and functions, manipulate data frames, make visualizations, use control flow structures, and more, using R in RStudio.
automation data-science data-visualization data-wrangling r
Last synced: 11 Nov 2024
https://github.com/kwmsmith/scipy-2017-cython-tutorial
Material for the SciPy 2017 Cython tutorial
c c-plus-plus cython data-science docker machine-learning notebook performance python
Last synced: 14 Oct 2024
https://github.com/h2oai/wave-apps
Sample AI Apps built with H2O Wave.
data-science h2oai hacktoberfest low-code machine-learning python3
Last synced: 06 Nov 2024
https://github.com/jupyterhub/repo2docker-action
A GitHub action to build data science environment images with repo2docker and push them to registries.
actions binder data-science datascience docker jupyter jupyter-notebook repo2docker repo2docker-action
Last synced: 08 Nov 2024
https://github.com/rk2900/drsa
Deep Recurrent Survival Analysis, an auto-regressive deep model for time-to-event data analysis with censorship handling. An implementation of our AAAI 2019 paper and a benchmark for several (Python) implemented survival analysis methods.
data-science deep-learning machine-learning survival-analysis
Last synced: 07 Nov 2024
https://rivasiker.github.io/ggHoriPlot/
A user-friendly, highly customizable R package for building horizon plots in ggplot2
data-science data-visualization ggplot2 horizon-plots r r-package
Last synced: 02 Aug 2024
https://github.com/dogoncouch/logdissect
CLI utility and Python module for analyzing log files and other data.
cli command-line data-analysis data-science forensic-analysis forensics json library log-analysis log-parser module parser parsing parsing-library python-library python-module python-modules security syslog
Last synced: 28 Oct 2024
https://github.com/rivasiker/ggHoriPlot
A user-friendly, highly customizable R package for building horizon plots in ggplot2
data-science data-visualization ggplot2 horizon-plots r r-package
Last synced: 02 Aug 2024
https://github.com/picnicml/doddle-model
:cake: doddle-model: machine learning in Scala.
breeze data-science doddle-model machine-learning scala
Last synced: 04 Aug 2024
https://github.com/hamelsmu/seq2seq_tutorial
Code For Medium Article "How To Create Data Products That Are Magical Using Sequence-to-Sequence Models"
data-science deep-learning deeplearning keras keras-tutorials machine-learning medium-article nlp-machine-learning rnn-encoder-decoder seq2seq-tutorial sequence-to-sequence
Last synced: 27 Oct 2024
https://github.com/hamelsmu/Seq2Seq_Tutorial
Code For Medium Article "How To Create Data Products That Are Magical Using Sequence-to-Sequence Models"
data-science deep-learning deeplearning keras keras-tutorials machine-learning medium-article nlp-machine-learning rnn-encoder-decoder seq2seq-tutorial sequence-to-sequence
Last synced: 29 Oct 2024
https://github.com/DataHaskell/dh-core
Functional data science
data-analysis data-mining data-science dataframes datahaskell datasets machine-learning numerical-methods
Last synced: 30 Oct 2024
https://github.com/lynxkite/lynxkite
The complete graph data science platform
complex-networks data-science graph-algorithms graph-visualization hacktoberfest machine-learning
Last synced: 08 Nov 2024
https://github.com/minerva-ml/steppy
Lightweight, Python library for fast and reproducible experimentation :microscope:
data-science deep-learning image-processing machine-learning minimal-interface nlp open-source pipeline python python-library python3 reproducibility reproducible-research steppy steppy-library steppy-toolkit steps
Last synced: 30 Oct 2024
https://github.com/genular/pandora
PANDORA - Predictive Analytics aNd Data Oriented Research Applications :computer:
bioinformatics biomarkers clinical-data clustering data-integration data-mining data-science data-visualization drug-discovery genomic-data-analysis machine-learning microbiome pandora predictive-analytics systems-biology transcriptomics tsne umap unsupervised-machine-learning
Last synced: 04 Nov 2024
https://github.com/neptune-ml/steppy
Lightweight, Python library for fast and reproducible experimentation :microscope:
data-science deep-learning image-processing machine-learning minimal-interface nlp open-source pipeline python python-library python3 reproducibility reproducible-research steppy steppy-library steppy-toolkit steps
Last synced: 28 Aug 2024
https://github.com/yizhe-ang/k-means-explorable
An Explorable Explainer of K-Means Clustering
ai clustering data-science explorable explorable-explanations javascript machine-learning svelte
Last synced: 02 Nov 2024
https://github.com/gzuidhof/zarr.js
Javascript implementation of Zarr
array data-science gehlenborglab javascript typescript zarr
Last synced: 30 Oct 2024
https://github.com/ray-project/xgboost_ray
Distributed XGBoost on Ray
dask data-science kaggle machine-learning modin xgboost
Last synced: 03 Aug 2024
https://github.com/EnvironmentOntology/envo
A community-driven ontology for the representation of environments
data-management data-science earth-science ecoinformatics ecology environment esip obofoundry ontology planetary-science semantics sustainable-development-goals
Last synced: 04 Nov 2024
https://github.com/gtkcyber/griffon-vm
Griffon Data Science Virtual Machine
apache-drill apache-spark big-data data-science database elasticsearch hadoop jupyter-notebook mysql node-js python r ruby scala virtual-machine
Last synced: 12 Oct 2024
https://github.com/rodrigo-arenas/portfolio
Personal website, Data scientist portfolio template
beginner-friendly create-react-app css3 data-science data-science-portfolio data-science-projects developer-portfolio good-first-issue javascript material-ui personal-website portfolio portfolio-template portfolio-website react react-portfolio reactjs up-for-grabs web-template
Last synced: 30 Oct 2024
https://github.com/ing-bank/probatus
Validation (like Recursive Feature Elimination for SHAP) of (multiclass) classifiers & regressors and data used to develop them.
binary-classifiers data-analysis data-science feature-elimination machine-learning multi-class-classification recursive-feature-elimination regressors shap statistics tree-model
Last synced: 08 Nov 2024
https://github.com/jacobgil/confidenceinterval
The long missing library for python confidence intervals
data-science machine-learning metrics statistics
Last synced: 30 Oct 2024
https://github.com/mlr-org/mlr3pipelines
Dataflow Programming for Machine Learning in R
bagging data-science dataflow-programming ensemble-learning machine-learning mlr3 pipelines preprocessing r r-package stacking
Last synced: 10 Oct 2024
https://github.com/morganjwilliams/pyrolite
A set of tools for getting the most from your geochemical data.
chemistry data-science geochemical-data geochemistry geoscience pyrolite ternary-diagrams
Last synced: 25 Oct 2024
https://github.com/kraina-ai/srai
Spatial Representations for Artificial Intelligence
artificial-intelligence data-science geo geospatial machine-learning python spatial spatial-analysis srai
Last synced: 10 Nov 2024
https://github.com/ahmedkhemiri95/PDFs-TextExtract
Multiple and Large PDF Documents Text Extraction.
data-science extract-text parser pdf pdf-document pdf-processing pdfminer pdfs pdfs-textextract pypdf2 python text-analytics
Last synced: 04 Nov 2024
https://github.com/njtierney/rmd4sci
Rmarkdown for Scientists
book bookdown data-science r rmarkdown rstats science
Last synced: 27 Oct 2024
https://github.com/ModelChimp/modelchimp
Experiment tracking for machine and deep learning projects
ai artificial-intelligence data-science deep-learning experiment machine-learning ml model-management platform tool
Last synced: 27 Oct 2024
https://github.com/machine-learning-apps/ml-template-azure
Template for getting started with automated ML Ops on Azure Machine Learning
aml azure azure-machine-learning data-science machine-learning machine-learning-lifecycle mlops
Last synced: 02 Nov 2024
https://github.com/RamiKrispin/Introduction-to-Docker
(WIP) Getting started with Docker - An introduction to Docker with data science and engineering applications
data-engineering data-science docker dockerfile
Last synced: 25 Oct 2024
https://github.com/safe-graph/UGFraud
An Unsupervised Graph-based Toolbox for Fraud Detection
anomaly-detection data-science fraud-detection fraud-prevention graph-algorithms machine-learning opensource outlier-detection security-tools spam-detection toolbox
Last synced: 03 Aug 2024
https://github.com/safe-graph/ugfraud
An Unsupervised Graph-based Toolbox for Fraud Detection
anomaly-detection data-science fraud-detection fraud-prevention graph-algorithms machine-learning opensource outlier-detection security-tools spam-detection toolbox
Last synced: 11 Nov 2024
https://github.com/tirendazacademy/pandas-tutorial
Jupyter Notebooks and Data Sets for Pandas Library
data data-analysis data-preprocessing data-science machine-learning pandas pandas-dataframe pandas-datareader pandas-library pandas-python pandas-series pandas-tricks-for-data-manipulation pandas-tutorial python
Last synced: 08 Nov 2024
https://github.com/csinva/hierarchical-dnn-interpretations
Using / reproducing ACD from the paper "Hierarchical interpretations for neural network predictions" 🧠 (ICLR 2019)
acd ai artificial-intelligence convolutional-neural-networks data-science deep-learning deep-neural-networks explainability explainable-ai feature-importance iclr interpretability interpretation jupyter-notebook machine-learning ml neural-network python pytorch statistics
Last synced: 31 Oct 2024
https://github.com/scitime/scitime
Training time estimation for scikit-learn algorithms
data-science machine-learning python scikit-learn timer
Last synced: 01 Nov 2024
https://github.com/tpoisot/ScientificComputingForTheRestOfUs
Introduction to Scientific Computing 🦊
best-practices data-science educational-resources julia machine-learning reproducible-documents scientific-computing
Last synced: 02 Aug 2024
https://github.com/tpoisot/scientificcomputingfortherestofus
Introduction to Scientific Computing 🦊
best-practices data-science educational-resources julia machine-learning reproducible-documents scientific-computing
Last synced: 30 Oct 2024
https://github.com/suji04/normalizednerd
Codes for the videos of my YouTube channel
data-science machine-learning python tutorial youtube
Last synced: 10 Nov 2024
https://github.com/laura-rieger/deep-explanation-penalization
Code for using CDEP from the paper "Interpretations are useful: penalizing explanations to align neural networks with prior knowledge" https://arxiv.org/abs/1909.13584
ai artificial-intelligence cdep convolutional-neural-network data-science deep-learning explainability explainable-ai fairness fairness-ml feature-importance interpretability interpretable-deep-learning jupyter-notebook machine-learning ml neural-network python pytorch recurrent-neural-network
Last synced: 03 Aug 2024
https://github.com/storopoli/ciencia-de-dados
Disciplina de Ciências de Dados da UNINOVE
aprendizagem-de-maquina aprendizagem-profunda ciencia-de-dados data-science deeplearning machinelearning matplotlib pandas python pytorch scikit-learn tensorflow
Last synced: 27 Oct 2024
https://github.com/romanmichaelpaolucci/AI_Stock_Trading
Design pattern for critical stages in the development process of an AI Stock Trading Bot
artificial-intelligence data-science machine-learning neural-network python trading trading-algorithms trading-bot trading-strategies
Last synced: 07 Nov 2024
https://github.com/aydinnyunus/wallet-tracker
Detect real scammers with Wallet-Tracker CLI from anywhere.
bitcoin blueteam btc cli dashboard data-science database docker docker-compose eth ethereum golang graph hacking neo4j neodash visualization websocket
Last synced: 11 Nov 2024
https://github.com/scrapinghub/python-simhash
An efficient simhash implementation for python
Last synced: 10 Nov 2024
https://github.com/adijo/data-science-prep
Problems from https://datascienceprep.com/
data-science data-science-interview datascience interview-prep machine-learning machine-learning-interview machinelearning probability statistics
Last synced: 08 Nov 2024
https://github.com/vkoul/Econ-Data-Science
Articles/ Journals and Videos related to Economics:chart_with_upwards_trend: and Data Science :bar_chart:
casual-inference data-science econometrics economics economist machine-learning social-sciences
Last synced: 02 Aug 2024
https://github.com/winvector/pyvtreat
vtreat is a data frame processor/conditioner that prepares real-world data for predictive modeling in a statistically sound manner. Distributed under a BSD-3-Clause license.
data-science machine-learning pydata python
Last synced: 07 Nov 2024
https://github.com/autoviml/deep_autoviml
Build tensorflow keras model pipelines in a single line of code. Now with mlflow tracking. Created by Ram Seshadri. Collaborators welcome. Permission granted upon request.
autokeras automl data-science deep-learning gcp keras machine-learning mlflow mljar pycaret python tensorflow tensorflow2 tpot
Last synced: 10 Oct 2024
https://github.com/mszell/introdatasci
Course materials for: Introduction to Data Science and Programming
course-materials crash-course data-science network-analysis pandas-python programming programming-courses python teaching-materials
Last synced: 05 Nov 2024
https://github.com/jadianes/spark-r-notebooks
R on Apache Spark (SparkR) tutorials for Big Data analysis and Machine Learning as IPython / Jupyter notebooks
big-data bigdata data-analysis data-science exploratory-data-analysis jupyter jupyter-notebook notebook r sparkr
Last synced: 09 Nov 2024
https://github.com/napjon/krisk
Statistical Interactive Visualization with pandas+Jupyter integration on top of Echarts.
dashboard data-science data-visualization echarts interactive-charts jupyter-notebook python
Last synced: 31 Oct 2024
https://github.com/jakekandell/nba-predict
Predicts Daily NBA Games Using a Logistic Regression Model
data-science logistic-regression model nba nba-analytics nba-prediction nba-stats pandas prediction predictive-modeling python python3 scikit-learn
Last synced: 07 Nov 2024
https://github.com/autoviml/pandas_dq
Find data quality issues and clean your data in a single line of code with a Scikit-Learn compatible Transformer.
data data-science dataquality dataqualitycheck machine-learning pandas python scikit-learn
Last synced: 31 Oct 2024
https://github.com/yandexdataschool/roc_comparison
The fast version of DeLong's method for computing the covariance of unadjusted AUC.
Last synced: 06 Nov 2024
https://github.com/diffusionkinetics/open
DiffusionKinetics open-source monorepo
Last synced: 11 Nov 2024
https://github.com/materialsproject/matbench
Matbench: Benchmarks for materials science property prediction
benchmark chemistry condensed-matter data-science machine-learning machine-learning-algorithms materials-science physics
Last synced: 11 Nov 2024
https://github.com/CertifaiAI/classifai
:fire: One of the most comprehensive open-source data annotation platform.
annotation annotation-tool big-data computervision data-annotation data-collection data-science deep-learning labelling machine-learning
Last synced: 03 Aug 2024
https://github.com/WinVector/pyvtreat
vtreat is a data frame processor/conditioner that prepares real-world data for predictive modeling in a statistically sound manner. Distributed under a BSD-3-Clause license.
data-science machine-learning pydata python
Last synced: 05 Aug 2024
https://github.com/mybridge/learn-machine-learning
Learn to Build a Machine Learning Application from Top Articles
computer-vision data-science deep-learning machine-learning neural-networks
Last synced: 07 Nov 2024
https://github.com/dltk/models
DLTK Model Zoo
cnn data-science deep-learning deep-neural-networks dltk dltk-model-zoo machine-learning medical medical-image-processing medical-imaging models pre-trained
Last synced: 09 Nov 2024
https://github.com/winvector/data_algebra
Codd method-chained SQL generator and Pandas data processing in Python.
data-analysis data-science pandas python
Last synced: 07 Nov 2024
https://github.com/medtagger/MedTagger
A collaborative framework for annotating medical datasets using crowdsourcing.
crowdsourcing data-science data-validation deep-learning labeling medical-imaging
Last synced: 03 Aug 2024
https://github.com/LankyCyril/pyvenn
Python module for plotting Venn diagrams of 2..6 sets
data-science matplotlib matplotlib-venn venn venn-diagram venndiagram visualization
Last synced: 03 Aug 2024
https://github.com/ColtAllen/btyd
Buy Till You Die and Customer Lifetime Value statistical models in Python.
bayesian buy-til-you-die customer-lifetime-value data-science python
Last synced: 02 Aug 2024
https://github.com/sravb/algorithmic-trading
Algorithmic trading using machine learning.
algorithmic-trading data-mining data-science decision-tree google-finance machine-learning matplotlib pandas python scikit-learn scipy stock
Last synced: 07 Nov 2024
https://github.com/ujjwalkarn/xda
R package for exploratory data analysis
data-analysis data-science exploratory-data-analysis r
Last synced: 11 Nov 2024
https://github.com/alexandervnikitin/tsgm
Generation and evaluation of synthetic time series datasets (also, augmentations, visualizations, a collection of popular datasets)
augmentations data-augmentation data-science datasets deep-learning generative-model keras machine-learning python synthetic-data synthetic-time-series tensorflow2 time-series vae
Last synced: 13 Oct 2024
https://github.com/solegalli/hyperparameter-optimization
Code repository for the online course Hyperparameter Optimization for Machine Learning
data-science hyperopt hyperparameter-optimization machine-learning optuna python scikit-optimize
Last synced: 30 Oct 2024
https://github.com/JovianHQ/jovian-py
Collaboration platform for data science projects & Jupyter notebooks
data-science deep-learning jupyter-notebook machine-learning ml
Last synced: 11 Oct 2024
https://github.com/jovianhq/jovian-py
Collaboration platform for data science projects & Jupyter notebooks
data-science deep-learning jupyter-notebook machine-learning ml
Last synced: 02 Nov 2024
https://github.com/barrust/pyprobables
Probabilistic data structures in python http://pyprobables.readthedocs.io/en/latest/index.html
bitarray bloom-filter count-mean-min-sketch count-mean-sketch count-min-sketch counting-bloom-filter counting-cuckoo-filter cuckoo-filter data-analysis data-mining data-science data-structures datastructures heavy-hitters probabilistic-programming probability python quotient-filter stream-threshold
Last synced: 30 Oct 2024
https://github.com/nicohlr/ipychart
The power of Chart.js with Python
charting-library chartjs charts data data-analysis data-science data-visualization ipywidgets javascript-es6 jupyter jupyter-notebook notebook python
Last synced: 06 Nov 2024
https://github.com/lawmurray/Birch
A probabilistic programming language that combines automatic differentiation, automatic marginalization, and automatic conditioning within Monte Carlo methods.
autodiff bayesian bayesian-inference bayesian-methods bayesian-statistics data-science machine-learning machine-learning-algorithms machine-learning-projects monte-carlo-methods monte-carlo-sampling probabilistic-programming-languages statistics
Last synced: 30 Oct 2024
https://github.com/streamnative/pulsar-spark
Spark Connector to read and write with Pulsar
apache-pulsar apache-spark batch-processing data-processing data-science flink spark spark-sql stream-processing structured-streaming
Last synced: 12 Oct 2024
https://github.com/lsys/forestplot
A Python package to make publication-ready but customizable coefficient plots.
coefficientplot data-science data-visualization dataviz forestplot matplotlib python visualization
Last synced: 02 Nov 2024
https://github.com/innat/ML-Resource
A concise resource repository for machine learning
data-analysis data-science deep-learning kaggle machine-learning python spark
Last synced: 02 Aug 2024
https://github.com/scrapinghub/mdr
A python library detect and extract listing data from HTML page.
Last synced: 10 Nov 2024
https://github.com/imsanjoykb/data-science-regular-bootcamp
Regular practice on Data Science, Machien Learning, Deep Learning, Solving ML Project problem, Analytical Issue. Regular boost up my knowledge. The goal is to help learner with learning resource on Data Science filed.
artificial-intelligence data-analysis data-science data-science-notebook data-science-projects data-visualization database-connection deep-learning etl-pipeline etl-process feature-engineering machine-learning mysql-database neural-network numpy pandas postgresql python python-automation sqlite
Last synced: 12 Oct 2024
https://github.com/nicholasmamo/multiplex-plot
Multiplex: visualizations that tell stories—A Python library to create and annotate beautiful network graph visualizations, text visualizations and more.
data-science data-visualisation graph-visualization graphs information-retrieval matplotlib natural-language-processing network-visualization python text-mining text-visualisation text-visualization visualisation visualizations viz vizualisation
Last synced: 31 Oct 2024
https://github.com/takuti/flurs
:ocean: FluRS: A Python library for streaming recommendation algorithms
data-science factorization-machines machine-learning matrix-factorization python recommender-system
Last synced: 07 Nov 2024
https://github.com/NicholasMamo/multiplex-plot
Multiplex: visualizations that tell stories—A Python library to create and annotate beautiful network graph visualizations, text visualizations and more.
data-science data-visualisation graph-visualization graphs information-retrieval matplotlib natural-language-processing network-visualization python text-mining text-visualisation text-visualization visualisation visualizations viz vizualisation
Last synced: 07 Aug 2024
https://github.com/clipperhouse/jargon
Tokenizers and lemmatizers for Go
data-science go lemmatizer nlp tokenizer
Last synced: 30 Oct 2024
https://github.com/benthecoder/ml-blogs-that-are-worth-reading
Blogs on Machine Learning and Deep learning
ai artificial-intelligence data-science deep-learning machine-learning ml
Last synced: 02 Nov 2024
https://github.com/alexioannides/pymc-example-project
Example PyMC3 project for performing Bayesian data analysis using a probabilistic programming approach to machine learning.
bayesian-data-analysis bayesian-inference data-science machine-learning numpy pandas probabilistic-programming pymc3 python scikit-learn
Last synced: 27 Oct 2024
https://github.com/olow304/data-science-machine-learning
The overall objective of this toolkit is to provide and offer a free collection of data analysis and machine learning that is specifically suited for doing data science. Its purpose is to get you started in a matter of minutes. You can run this collections either in Jupyter notebook or python alone.
all best-practices cheatsheet cheatsheets data-science data-science-toolkit deep-learning jupyter-notebook machine-learning machine-learning-algorithms machine-learning-tutorials matplotlib mindmap numpy pandas popular-posts python roadmap sklearn toolkit
Last synced: 10 Oct 2024
https://github.com/thomasnield/oreilly_reactive_python_for_data
Resources for the O'Reilly online video "Reactive Python for Data"
data-science database python reactivex rxpy sqlalchemy tweepy twitter
Last synced: 30 Oct 2024
https://github.com/georgian-io/pyoats
Quick and Easy Time Series Outlier Detection
anomaly anomaly-detection data-science deep-learning machine-learning time-series timeseries
Last synced: 30 Oct 2024
https://github.com/formlio/forml
ForML - A development framework and MLOps platform for the lifecycle management of data science projects
ai data-science machine-learning ml mlops portability python reproducibility
Last synced: 03 Aug 2024
https://github.com/ome/ngff
Next-generation file format (NGFF) specifications for storing bioimaging data in the cloud.
bioimaging cloud data-science file-formats spec
Last synced: 03 Aug 2024
https://github.com/dssg/MLforPublicPolicy
Class resources for CAPP 30254 (Machine Learning for Public Policy)
data-science machine-learning public-policy
Last synced: 27 Oct 2024
https://github.com/senderle/topic-modeling-tool
A point-and-click tool for creating and analyzing topic models produced by MALLET.
data-science digital-humanities mallet text-analytics topic-modeling
Last synced: 02 Aug 2024