Ecosyste.ms: Awesome
An open API service indexing awesome lists of open source software.
Data Science
Data science is an inter-disciplinary field that uses scientific methods, processes, algorithms, and systems to extract knowledge from structured and unstructured data. Data scientists perform data analysis and preparation, and their findings inform high-level decisions in many organizations.
- GitHub: https://github.com/topics/data-science
- Wikipedia: https://en.wikipedia.org/wiki/Data_science
- Related Topics: data-analysis, data-mining, machine-learning, big-data, data-visualization
- Aliases: datasciences, data-science-project, data-science-algorithm
- Last updated: 2024-12-25 00:06:44 UTC
https://github.com/sicara/sicarator
Instant Setup & Best Quality for Data Projects!
data-science generator machine-learning python
Last synced: 28 Dec 2024
https://github.com/t04glovern/selfie2anime
Anime2Selfie Backend Services - Lambda, Queue, API Gateway and traffic processing
aws aws-lambda data-science selfie2anime serverless
Last synced: 19 Dec 2024
https://github.com/salvatorera/tutorial
Tutorials on machine learning, artificial intelligence, and data science with mathematical explanations and reusable code (in Python and R)
artificial-intelligence bioinformatics biology computer-vision convolutional-neural-networks data-science deep-learning graph image machine-learning natural-language-processing nlp python r streamlit streamlit-webapp tutorial tutorials vision-transformer
Last synced: 28 Dec 2024
https://github.com/milaan9/deep_learning_algorithms_from_scratch
This repository explores the variety of techniques and algorithms commonly used in deep learning and their implementation in MATLAB and Python
adversarial-machine-learning autoencoders cnn-classification data-science deep-learning deep-learning-matlab deep-learning-python deep-learning-pytorch image-captioning image-processing linear-regression logistic-regression neural-networks object-detection rnn-pytorch tutor-milaan9
Last synced: 24 Dec 2024
https://github.com/capeprivacy/cape-dataframes
Privacy transformations on Spark and Pandas dataframes backed by a simple policy language.
collaboration data-science hacktoberfest machine-learning pandas policy privacy python spark
Last synced: 14 Nov 2024
https://github.com/Oxen-AI/Oxen
Oxen.ai's core Rust library, server, and CLI
artificial-intelligence data-science database machine-learning version-control
Last synced: 09 Dec 2024
https://github.com/brendanhasz/probflow
A Python package for building Bayesian models with TensorFlow or PyTorch
bayesian bayesian-inference bayesian-methods bayesian-neural-networks bayesian-statistics data-science machine-learning python pytorch statistics tensorflow
Last synced: 24 Dec 2024
https://github.com/kdr-aus/ogma
Scripting language focused on processing tabular data.
data-science language rust scripting-language table-data
Last synced: 30 Oct 2024
https://github.com/learnbyexample/py_resources
Collection of Python learning resources
curated-list data-science learning machine-learning python resources scientific-computing
Last synced: 28 Dec 2024
https://github.com/youssefhosni/my-medium-articles-friendly-links
Friendly links to all of my Medium articles
data-science deep-learning machine-learning python
Last synced: 26 Dec 2024
https://github.com/curiousily/machine-learning-from-scratch
Succinct Machine Learning algorithm implementations from scratch in Python, solving real-world problems (Notebooks and Book). Examples of Logistic Regression, Linear Regression, Decision Trees, K-means clustering, Sentiment Analysis, Recommender Systems, Neural Networks and Reinforcement Learning.
artificial-intelligence book classification data-science machine-learning machine-learning-algorithms neural-networks notebook recommender-systems regression reinforcement-learning sentiment-analysis
Last synced: 26 Dec 2024
https://github.com/fedora-infra/fedmsg
Federated Messaging with ZeroMQ
data-science fedora-project message-bus python zeromq
Last synced: 25 Dec 2024
https://github.com/maxheld83/ghactions
GitHub actions for R and accompanying R package
cicd continous-delivery continous-integration data-science devops github github-actions rstats setup
Last synced: 31 Oct 2024
https://learnbyexample.github.io/py_resources/
Collection of Python learning resources
curated-list data-science learning machine-learning python resources scientific-computing
Last synced: 02 Nov 2024
https://github.com/denizyuret/autograd.jl
Julia port of the Python autograd package.
autograd automatic-differentiation data-science deep-learning knet machine-learning neural-networks
Last synced: 03 Dec 2024
https://github.com/pydatablog/python-for-data-science
A blog for data analytics using data science technologies
Last synced: 19 Dec 2024
https://github.com/apachecn/ds-ai-tech-notes
:book: [Translation] Data science and artificial intelligence technical notes
ai data-science matplotlib notes numpy python sklearn
Last synced: 18 Dec 2024
https://github.com/tirthajyoti/ds-with-pysimplegui
Data science and machine learning GUI programs/desktop apps with the PySimpleGUI package
analytics application artificial-intelligence data-science desktop-app gui machine-learning python windows
Last synced: 19 Dec 2024
https://github.com/rsokl/learning_python
Source material for Python Like You Mean It
data-science educational numpy numpy-tutorial python python-tutorial textbook tutorial
Last synced: 28 Dec 2024
https://github.com/dlab-berkeley/Python-Fundamentals-Legacy
D-Lab's 12-hour introduction to Python. Learn how to create variables and functions, use control flow structures, use libraries, import data, and more, using Python and Jupyter Notebooks.
data-science introduction-to-python jupyter python
Last synced: 11 Nov 2024
https://github.com/hugohadfield/kalmangrad
Automated, smooth, N'th order derivatives of non-uniformly sampled time series data
data-science derivatives kalman-filter signal-processing smoothing
Last synced: 23 Oct 2024
https://github.com/wainberg/ryp
R inside Python
bioinformatics data-science python python-to-r r r-to-python rstats statistics
Last synced: 27 Dec 2024
https://github.com/lamastex/scalable-data-science
Scalable Data Science: course sets in big data using Apache Spark on Databricks, and their mathematical, statistical and computational foundations using SageMath.
apache-spark data-science databricks scala
Last synced: 23 Dec 2024
https://github.com/vanderschaarlab/hyperimpute
A framework for prototyping and benchmarking imputation methods
data-science imputation imputation-algorithm machine-learning machine-learning-prerequisites preprocessing-data python scikit-learn
Last synced: 24 Dec 2024
https://github.com/Automunge/AutoMunge
Tabular feature encoding pipelines for machine learning with options for string parsing, missing data infill, and stochastic perturbations.
Last synced: 27 Oct 2024
https://github.com/unnati-xyz/scalable-data-science-platform
Content for architecting a data science platform for products using Luigi, Spark & Flask.
data-engineer data-pipeline data-science luigi machine-learning rest-api spark
Last synced: 27 Nov 2024
https://github.com/google/starthinker
Reference framework for building data workflows provided by Google. Accelerates authentication, logging, scheduling, and deployment of solutions using GCP. To borrow a tagline: "The framework for professionals with deadlines."
airflow app-engine automation bigquery cloud-functions cm360 colab-notebook data-science django dv360 google-ads google-analytics logger python scheduler ui workflows
Last synced: 29 Sep 2024
https://github.com/solegalli/machine-learning-imbalanced-data
Code repository for the online course Machine Learning with Imbalanced Data
data-science imbalanced-classification imbalanced-data imbalanced-learning machine-learning python
Last synced: 22 Dec 2024
https://github.com/ahammadmejbah/machine-learning-book-collections
Machine learning is the study and development of data-driven strategies to enhance task performance. It is a part of AI.
data-science deep-learning machine-learning
Last synced: 11 Nov 2024
https://github.com/anthony-wang/BestPractices
Things that you should (and should not) do in your Materials Informatics research.
best-practices common-pitfalls data-science example-code interactive-notebooks jupyter jupyter-notebooks machine-learning materials-informatics materials-science neural-networks python
Last synced: 13 Nov 2024
https://github.com/robb/rbbjson
Flexible JSON traversal for rapid prototyping.
data-science json jsonpath prototyping swift
Last synced: 27 Oct 2024
https://github.com/davendw49/k2
Code and datasets for paper "K2: A Foundation Language Model for Geoscience Knowledge Understanding and Utilization" in WSDM-2024
ai4science data-science geoai geoscience kg large-language-models llm
Last synced: 02 Nov 2024
https://github.com/younes-charfaoui/daily-coding-problem
Series of problems 💯 and solutions ✅ asked by the Daily Coding Problem 👨‍🎓 website.
airbnb amazon apple cisco coding-interviews coding-problems coursera data-science dropbox facebook google ibm linkedin microsoft mozila netflix nvidia python twitter youtube
Last synced: 27 Oct 2024
https://github.com/hamelsmu/docker_tutorial
Code and helper scripts for article on Medium "How Docker Can Help You Become A More Effective Data Scientist"
data-science docker docker-tutorial medium medium-article
Last synced: 27 Oct 2024
https://github.com/phillipdupuis/dtale-desktop
Build a data visualization dashboard with simple snippets of Python code
data-analysis data-science data-visualization fastapi pandas python react typescript visualization
Last synced: 27 Dec 2024
https://github.com/risenw/datasist
A Python library for easy data analysis, visualization, exploration and modeling
data-analysis data-science data-visualization feature-engineering machine-learning python-3
Last synced: 29 Dec 2024
https://github.com/pyscaffold/pyscaffoldext-dsproject
💫 PyScaffold extension for data-science projects
data-science pyscaffold pyscaffold-extension python
Last synced: 29 Dec 2024
https://github.com/anthdm/ml-email-clustering
Email clustering with machine learning
clustering data-science machine-learning scikit-learn
Last synced: 19 Nov 2024
https://github.com/curiousily/Machine-Learning-from-Scratch
Succinct Machine Learning algorithm implementations from scratch in Python, solving real-world problems (Notebooks and Book). Examples of Logistic Regression, Linear Regression, Decision Trees, K-means clustering, Sentiment Analysis, Recommender Systems, Neural Networks and Reinforcement Learning.
artificial-intelligence book classification data-science machine-learning machine-learning-algorithms neural-networks notebook recommender-systems regression reinforcement-learning sentiment-analysis
Last synced: 27 Nov 2024
https://github.com/jgoerner/beyond-jupyter
🐍💻📊 All material from the PyCon.DE 2018 Talk "Beyond Jupyter Notebooks - Building your own data science platform with Python & Docker" (incl. Slides, Video, Udemy MOOC & other References)
airflow apache apistar data-science docker docker-compose jupyter jupyter-notebook minio postgres superset
Last synced: 27 Oct 2024
https://github.com/tvdboom/atom
Automated Tool for Optimized Modelling
automl dagshub data-exploration data-pipeline data-science interactive-visualizations machine-learning mlflow model-predictions modelling python scikit-learn shap visualization
Last synced: 29 Dec 2024
https://github.com/minerva-ml/open-solution-toxic-comments
Open solution to the Toxic Comment Classification Challenge
challenge competition data-science deep-learning ensemble-model kaggle kaggle-competition machine-learning neptune nlp pipeline prediction python python3
Last synced: 27 Nov 2024
https://github.com/saezlab/decoupler-py
Python package to perform enrichment analysis from omics data.
bioinformatics data-science enrichment enrichment-analysis numba python single-cell spatial-transcriptomics transcriptomics
Last synced: 12 Nov 2024
https://github.com/thebabylonai/babylog
A lightweight logger for machine learning teams to log images and predictions in production.
computer-vision cvops data-science logger logging-library machine-learning ml mlops python python3
Last synced: 25 Dec 2024
https://github.com/astronomer/airflow-provider-great-expectations
Great Expectations Airflow operator
airflow airflow-operators airflow-providers data-quality data-science data-testing
Last synced: 25 Dec 2024
https://github.com/raptor-ml/raptor
Transform your Pythonic research into an artifact that engineers can deploy easily.
ai-infra data-engineering data-science dataops feature-engineering feature-extraction feature-platform featurestore kubeflow kubernetes machine-learning ml mlops model-deployment production raptor raptor-ml reactive-ml
Last synced: 24 Dec 2024
https://github.com/tirthajyoti/Interactive_Machine_Learning
IPython widgets, interactive plots, interactive machine learning
analytics animation classification data-science interactive jupyter-notebook machine-learning python regression scikit-learn statistics supervised-learning
Last synced: 27 Nov 2024
https://github.com/heidelbergcement/hcrystalball
A library that unifies the API for the most commonly used libraries and modeling techniques for time-series forecasting in the Python ecosystem.
cross-validation data-science fbprophet model-selection pmdarima sarimax sklearn sklearn-api sklearn-compatible sklearn-library sktime statsmodels tbats time-series time-series-forecasting transformer wrapper
Last synced: 28 Dec 2024
https://github.com/tirthajyoti/interactive_machine_learning
IPython widgets, interactive plots, interactive machine learning
analytics animation classification data-science interactive jupyter-notebook machine-learning python regression scikit-learn statistics supervised-learning
Last synced: 20 Dec 2024
https://github.com/safreita1/TIGER
Python toolbox to evaluate graph vulnerability and robustness (CIKM 2021)
adversarial-attacks attack cascading-failures data-mining data-science defense diffusion epidemics graph graph-attack graph-mining machine-learning netshield network-attack networks robustness simulation vulnerability
Last synced: 12 Nov 2024
https://github.com/oxinabox/datadeps.jl
reproducible data setup for reproducible science
data data-science open-science
Last synced: 20 Nov 2024
https://github.com/tariqdaouda/mariana
The Cutest Deep Learning Framework which is also a wonderful Declarative Language
artificial-intelligence artificial-neural-networks data-science deep-learning deep-learning-algorithms deep-learning-library deep-neural-networks deeplearning machine-learning machine-learning-algorithms machinelearning python theano
Last synced: 09 Dec 2024
https://github.com/h2oai/wave-apps
Sample AI Apps built with H2O Wave.
data-science h2oai hacktoberfest low-code machine-learning python3
Last synced: 28 Dec 2024
https://github.com/benmarwick/ctv-archaeology
CRAN Task View: Archaeological Science
archaeological-science archaeology cran-task data-science r task-view
Last synced: 25 Dec 2024
https://github.com/whitews/flowkit
A Python toolkit for flow cytometry analysis supporting GatingML and FlowJo workspaces
cytometry data-science fcs fcs-files flow-cytometry flow-cytometry-analysis flowjo gatingml immunology python
Last synced: 22 Dec 2024
https://github.com/emilhvitfeldt/r-text-data
List of textual data sources to be used for text mining in R
data-science nlp rstats text-analysis text-analytics-in-r text-mining tidytext
Last synced: 18 Dec 2024
https://github.com/jazzdotdev/jazz
The Scripting Engine that Combines Speed, Safety, and Simplicity
actix android chromeos crypto cryptography data-science database development-environment embeddable jazz jinja2 linux lua markdown rust scraping scripting web witness
Last synced: 20 Nov 2024
https://github.com/TheDatumOrg/TSB-UAD
An End-to-End Benchmark Suite for Univariate Time-Series Anomaly Detection
anomaly-detection anomaly-detection-algorithm benchmark data-mining data-science datasets python python3 time-series time-series-analysis
Last synced: 30 Oct 2024
https://github.com/EmilHvitfeldt/R-text-data
List of textual data sources to be used for text mining in R
data-science nlp rstats text-analysis text-analytics-in-r text-mining tidytext
Last synced: 22 Nov 2024
https://github.com/dongjunlee/quantified-self
Self-knowledge through numbers
chatbot data-science fitbit github kino machine-learning personal-assistant quantified-self rescuetime self-tracking slack-bot todoist toggl trello
Last synced: 08 Nov 2024
https://github.com/voila-dashboards/voici
Voici turns any Jupyter Notebook into a static web application
dashboards data-science emscripten jupyter jupyterlite voila-dashboard wasm
Last synced: 29 Dec 2024
https://github.com/mybridge/learn-python
Python Top 45 Articles of 2017
algorithm data-science machine-learning python python3
Last synced: 07 Nov 2024
https://github.com/google/applied-machine-learning-intensive
Applied Machine Learning Intensive
data-science machine-learning python3 sklearn tensorflow tensorflow-examples tensorflow-tutorials
Last synced: 26 Sep 2024
https://github.com/DongjunLee/quantified-self
Self-knowledge through numbers
chatbot data-science fitbit github kino machine-learning personal-assistant quantified-self rescuetime self-tracking slack-bot todoist toggl trello
Last synced: 06 Nov 2024
https://rivasiker.github.io/ggHoriPlot/
A user-friendly, highly customizable R package for building horizon plots in ggplot2
data-science data-visualization ggplot2 horizon-plots r r-package
Last synced: 13 Nov 2024
https://github.com/rivasiker/ggHoriPlot
A user-friendly, highly customizable R package for building horizon plots in ggplot2
data-science data-visualization ggplot2 horizon-plots r r-package
Last synced: 12 Nov 2024
https://github.com/aws-samples/aws-ml-jp
A collection of notebooks and teaching materials for learning how to build, train, and deploy machine learning models with SageMaker
aws data-science deep-learning jupyter-notebook machine-learning mlops sagemaker
Last synced: 08 Nov 2024
https://github.com/alexandervnikitin/tsgm
Generation and evaluation of synthetic time series datasets (also, augmentations, visualizations, a collection of popular datasets)
augmentations data-augmentation data-science datasets deep-learning generative-model keras machine-learning python synthetic-data synthetic-time-series tensorflow2 time-series vae
Last synced: 25 Dec 2024
https://github.com/apache/incubator-liminal
Apache Liminal's goal is to operationalise the machine learning process, allowing data scientists to quickly transition from a successful experiment to an automated pipeline of model training, validation, deployment and inference in production. Liminal provides a Domain Specific Language to build ML workflows on top of Apache Airflow.
ai airflow big-data data-science machine-learning ml workflows
Last synced: 01 Oct 2024
https://github.com/arabacibahadir/sup-res
A great companion for finding key support and resistance levels on financial and cryptocurrency charts.
algotrade analysis binance binance-api bitcoin cryptocurrency data-science finance pandas pinescript python stock telegram telegram-bot tradingview
Last synced: 27 Oct 2024
https://github.com/ayush1997/YouTube-Like-predictor
YouTube Like Count Predictions using Machine Learning
data-analysis data-science machine-learning predictive-analysis random-forest visualization youtube-api
Last synced: 27 Nov 2024
https://github.com/egenn/rtemis
Advanced Machine Learning and Visualization
data-science data-visualization machine-learning machine-learning-library r rstats visualization
Last synced: 29 Dec 2024
https://github.com/ropensci/tarchetypes
Archetypes for targets and pipelines
data-science high-performance-computing peer-reviewed pipeline r r-package r-targetopia reproducibility rstats targets workflow
Last synced: 22 Dec 2024
https://github.com/dlab-berkeley/R-Fundamentals-Legacy
D-Lab's 12-hour introduction to R Fundamentals. Learn how to create variables and functions, manipulate data frames, make visualizations, use control flow structures, and more, using R in RStudio.
automation data-science data-visualization data-wrangling r
Last synced: 11 Nov 2024
https://github.com/jialuechen/pytca
Python Library for Transaction Cost Analysis (TCA)
algorithmic-trading best-execution data-science gpu high-frequency-data jax limit-order-book order-book price-impact propagator python quantitative-finance reinforcement-learning statistics
Last synced: 28 Dec 2024
https://github.com/durgeshsamariya/data-science-machine-learning-project-with-source-code
Data Science and Machine Learning projects with source code.
artificial-intelligence awesome awesome-list data-science data-science-projects machine-learning machine-learning-projects python
Last synced: 08 Nov 2024
https://github.com/mlr-org/mlr3pipelines
Dataflow Programming for Machine Learning in R
bagging data-science dataflow-programming ensemble-learning machine-learning mlr3 pipelines preprocessing r r-package stacking
Last synced: 28 Dec 2024
https://github.com/kwmsmith/scipy-2017-cython-tutorial
Material for the SciPy 2017 Cython tutorial
c c-plus-plus cython data-science docker machine-learning notebook performance python
Last synced: 14 Oct 2024
https://github.com/jupyterhub/repo2docker-action
A GitHub action to build data science environment images with repo2docker and push them to registries.
actions binder data-science datascience docker jupyter jupyter-notebook repo2docker repo2docker-action
Last synced: 23 Dec 2024
https://github.com/rk2900/drsa
Deep Recurrent Survival Analysis, an auto-regressive deep model for time-to-event data analysis with censoring handling. An implementation of our AAAI 2019 paper and a benchmark for several (Python) implemented survival analysis methods.
data-science deep-learning machine-learning survival-analysis
Last synced: 07 Nov 2024
https://github.com/rodrigo-arenas/portfolio
Personal website, Data scientist portfolio template
beginner-friendly create-react-app css3 data-science data-science-portfolio data-science-projects developer-portfolio good-first-issue javascript material-ui personal-website portfolio portfolio-template portfolio-website react react-portfolio reactjs up-for-grabs web-template
Last synced: 27 Dec 2024
https://github.com/celebi-pkg/flight-analysis
Python package to scrape flight data from Google Flights and analyze prices. Can determine the optimal flight from date, place, and price.
data-science google pandas planes prediction price-tracker python
Last synced: 28 Dec 2024
https://github.com/hamelsmu/Seq2Seq_Tutorial
Code For Medium Article "How To Create Data Products That Are Magical Using Sequence-to-Sequence Models"
data-science deep-learning deeplearning keras keras-tutorials machine-learning medium-article nlp-machine-learning rnn-encoder-decoder seq2seq-tutorial sequence-to-sequence
Last synced: 29 Oct 2024
https://github.com/gzuidhof/zarr.js
JavaScript implementation of Zarr
array data-science gehlenborglab javascript typescript zarr
Last synced: 29 Dec 2024
https://github.com/hamelsmu/seq2seq_tutorial
Code For Medium Article "How To Create Data Products That Are Magical Using Sequence-to-Sequence Models"
data-science deep-learning deeplearning keras keras-tutorials machine-learning medium-article nlp-machine-learning rnn-encoder-decoder seq2seq-tutorial sequence-to-sequence
Last synced: 27 Oct 2024
https://github.com/dogoncouch/logdissect
CLI utility and Python module for analyzing log files and other data.
cli command-line data-analysis data-science forensic-analysis forensics json library log-analysis log-parser module parser parsing parsing-library python-library python-module python-modules security syslog
Last synced: 25 Dec 2024
https://github.com/pablormier/yabox
Yet another black-box optimization library for Python
algorithms black-box black-box-optimization data-science differential-evolution evolutionary-algorithms minimization optimization parallel python python3 stochastic-algorithms
Last synced: 24 Dec 2024
https://github.com/picnicml/doddle-model
:cake: doddle-model: machine learning in Scala.
breeze data-science doddle-model machine-learning scala
Last synced: 18 Nov 2024
https://github.com/doubleml/doubleml-for-r
DoubleML - Double Machine Learning in R
causal-inference data-science double-machine-learning econometrics machine-learning mlr3 r statistics
Last synced: 23 Dec 2024
https://github.com/hhhrrrttt222111/ds_and_ml_projects
Data Science & Machine Learning projects and tutorials in Python from beginner to advanced level.
data-science data-visualization hacktoberfest keras keras-tensorflow knn-classification linear-regression logistic-regression machine-learning machine-learning-algorithms matplotlib naive-bayes-classifier opencv python scikit-learn seaborn tensorflow
Last synced: 14 Dec 2024
https://github.com/mmbazel/springboard-datasciencetrack-student
Springboard Program: Data Science Career Track - NLP
capstone data-science data-wrangling datasciencedreamjob dsdj mikikobazeley nlp python springboard
Last synced: 18 Nov 2024
https://github.com/DataHaskell/dh-core
Functional data science
data-analysis data-mining data-science dataframes datahaskell datasets machine-learning numerical-methods
Last synced: 30 Oct 2024
https://github.com/lynxkite/lynxkite
The complete graph data science platform
complex-networks data-science graph-algorithms graph-visualization hacktoberfest machine-learning
Last synced: 08 Nov 2024
https://github.com/minerva-ml/steppy
Lightweight Python library for fast and reproducible experimentation :microscope:
data-science deep-learning image-processing machine-learning minimal-interface nlp open-source pipeline python python-library python3 reproducibility reproducible-research steppy steppy-library steppy-toolkit steps
Last synced: 30 Oct 2024
https://github.com/genular/pandora
PANDORA - Predictive Analytics aNd Data Oriented Research Applications :computer:
bioinformatics biomarkers clinical-data clustering data-integration data-mining data-science data-visualization drug-discovery genomic-data-analysis machine-learning microbiome pandora predictive-analytics systems-biology transcriptomics tsne umap unsupervised-machine-learning
Last synced: 04 Nov 2024
https://github.com/yizhe-ang/k-means-explorable
An Explorable Explainer of K-Means Clustering
ai clustering data-science explorable explorable-explanations javascript machine-learning svelte
Last synced: 16 Nov 2024
https://github.com/ray-project/xgboost_ray
Distributed XGBoost on Ray
dask data-science kaggle machine-learning modin xgboost
Last synced: 25 Dec 2024
https://github.com/jacobgil/confidenceinterval
The long-missing library for Python confidence intervals
data-science machine-learning metrics statistics
Last synced: 23 Dec 2024
https://github.com/EnvironmentOntology/envo
A community-driven ontology for the representation of environments
data-management data-science earth-science ecoinformatics ecology environment esip obofoundry ontology planetary-science semantics sustainable-development-goals
Last synced: 04 Nov 2024
https://github.com/ing-bank/probatus
Validation (like Recursive Feature Elimination for SHAP) of (multiclass) classifiers & regressors and data used to develop them.
binary-classifiers data-analysis data-science feature-elimination machine-learning multi-class-classification recursive-feature-elimination regressors shap statistics tree-model
Last synced: 28 Dec 2024