Ecosyste.ms: Awesome
An open API service indexing awesome lists of open source software.
Data Science
Data science is an inter-disciplinary field that uses scientific methods, processes, algorithms, and systems to extract knowledge from structured and unstructured data. Data scientists perform data analysis and preparation, and their findings inform high-level decisions in many organizations.
- GitHub: https://github.com/topics/data-science
- Wikipedia: https://en.wikipedia.org/wiki/Data_science
- Related Topics: data-analysis, data-mining, machine-learning, big-data, data-visualization,
- Aliases: datasciences, data-science-project, data-science-algorithm,
- Last updated: 2024-07-29 13:36:33 UTC
- JSON Representation
https://github.com/activeloopai/deeplake
Database for AI. Store Vectors, Images, Texts, Videos, etc. Use with LLMs/LangChain. Store, query, version, & visualize any AI data. Stream data in real-time to PyTorch/TensorFlow. https://activeloop.ai
ai computer-vision cv data-science data-version-control datalake datasets deep-learning image-processing langchain large-language-models llm machine-learning ml mlops python pytorch tensorflow vector-database vector-search
Last synced: 31 Jul 2024
https://github.com/drivendataorg/cookiecutter-data-science
A logical, reasonably standardized, but flexible project structure for doing and sharing data science work.
ai cookiecutter cookiecutter-data-science cookiecutter-template data-science machine-learning
Last synced: 01 Aug 2024
https://github.com/drivendata/cookiecutter-data-science
A logical, reasonably standardized, but flexible project structure for doing and sharing data science work.
ai cookiecutter cookiecutter-data-science cookiecutter-template data-science machine-learning
Last synced: 02 Aug 2024
https://github.com/Netflix/metaflow
:rocket: Build and manage real-life ML, AI, and data science projects with ease!
ai aws azure data-science datascience gcp high-performance-computing kubernetes machine-learning ml ml-infrastructure ml-platform mlops model-management productivity python r r-package reproducible-research rstats
Last synced: 30 Jul 2024
https://github.com/alan-turing-institute/sktime
A unified framework for machine learning with time series
data-mining data-science forecasting hacktoberfest machine-learning scikit-learn time-series time-series-analysis time-series-classification time-series-regression
Last synced: 05 Aug 2024
https://github.com/sktime/sktime
A unified framework for machine learning with time series
data-mining data-science forecasting hacktoberfest machine-learning scikit-learn time-series time-series-analysis time-series-classification time-series-regression
Last synced: 31 Jul 2024
https://github.com/mrdbourke/machine-learning-roadmap
A roadmap connecting many of the most important concepts in machine learning, how to learn them and what tools to use to perform them.
data data-science deep-learning machine-learning
Last synced: 31 Jul 2024
https://github.com/mage-ai/mage-ai
🧙 Build, run, and manage data pipelines for integrating and transforming data.
artificial-intelligence data data-engineering data-integration data-pipelines data-science dbt elt etl machine-learning orchestration pipeline pipelines python reverse-etl spark sql transformation
Last synced: 01 Aug 2024
https://github.com/rapidsai/cudf
cuDF - GPU DataFrame Library
arrow cpp cuda cudf dask data-analysis data-science dataframe gpu pandas pydata python rapids
Last synced: 31 Jul 2024
https://github.com/autogluon/autogluon
Fast and Accurate ML in 3 Lines of Code
autogluon automated-machine-learning automl computer-vision data-science deep-learning ensemble-learning forecasting gluon hyperparameter-optimization machine-learning natural-language-processing object-detection python pytorch scikit-learn structured-data tabular-data time-series transfer-learning
Last synced: 31 Jul 2024
https://github.com/awslabs/autogluon
Fast and Accurate ML in 3 Lines of Code
autogluon automated-machine-learning automl computer-vision data-science deep-learning ensemble-learning forecasting gluon hyperparameter-optimization machine-learning natural-language-processing object-detection python pytorch scikit-learn structured-data tabular-data time-series transfer-learning
Last synced: 27 Aug 2024
https://github.com/rasbt/python-machine-learning-book-2nd-edition
The "Python Machine Learning (2nd edition)" book code repository and info resource
data-science deep-learning machine-learning python scikit-learn tensorflow
Last synced: 30 Jul 2024
https://github.com/Featuretools/featuretools
An open source python library for automated feature engineering
automated-feature-engineering automated-machine-learning automl data-science feature-engineering machine-learning python scikit-learn
Last synced: 31 Jul 2024
https://github.com/alteryx/featuretools
An open source python library for automated feature engineering
automated-feature-engineering automated-machine-learning automl data-science feature-engineering machine-learning python scikit-learn
Last synced: 30 Jul 2024
https://github.com/firmai/industry-machine-learning
A curated list of applied machine learning and data science notebooks and libraries across different industries (by @firmai)
data-science datascience example firmai jupyter-notebook machine-learning practical-machine-learning python
Last synced: 31 Jul 2024
https://github.com/unit8co/darts
A python library for user-friendly forecasting and anomaly detection on time series.
anomaly-detection data-science deep-learning forecasting machine-learning python time-series
Last synced: 31 Jul 2024
https://github.com/py-why/dowhy
DoWhy is a Python library for causal inference that supports explicit modeling and testing of causal assumptions. DoWhy is based on a unified language for causal inference, combining causal graphical models and potential outcomes frameworks.
bayesian-networks causal-inference causal-machine-learning causal-models causality data-science do-calculus graphical-models machine-learning python3 treatment-effects
Last synced: 31 Jul 2024
https://github.com/microsoft/dowhy
DoWhy is a Python library for causal inference that supports explicit modeling and testing of causal assumptions. DoWhy is based on a unified language for causal inference, combining causal graphical models and potential outcomes frameworks.
bayesian-networks causal-inference causal-machine-learning causal-models causality data-science do-calculus graphical-models machine-learning python3 treatment-effects
Last synced: 28 Aug 2024
https://github.com/scikit-learn-contrib/imbalanced-learn
A Python Package to Tackle the Curse of Imbalanced Datasets in Machine Learning
data-analysis data-science machine-learning python statistics
Last synced: 30 Jul 2024
https://github.com/h2oai/h2o-3
H2O is an Open Source, Distributed, Fast & Scalable Machine Learning Platform: Deep Learning, Gradient Boosting (GBM) & XGBoost, Random Forest, Generalized Linear Modeling (GLM with Elastic Net), K-Means, PCA, Generalized Additive Models (GAM), RuleFit, Support Vector Machine (SVM), Stacked Ensembles, Automatic Machine Learning (AutoML), etc.
automl big-data data-science deep-learning distributed ensemble-learning gbm gpu h2o h2o-automl hadoop java machine-learning naive-bayes opensource pca python r random-forest spark
Last synced: 30 Jul 2024
https://github.com/jwilber/roughViz
Reusable JavaScript library for creating sketchy/hand-drawn styled charts in the browser.
charting-library d3v5 dashboard data-science data-visualization visualization
Last synced: 31 Jul 2024
https://github.com/voxel51/fiftyone
The open-source tool for building high-quality datasets and computer vision models
active-learning artificial-intelligence computer-vision data-centric-ai data-cleaning data-curation data-quality data-science deep-learning developer-tools image-classification machine-learning object-detection python unstructured-data vector-search visualization
Last synced: 31 Jul 2024
https://github.com/mahmoud/boltons
🔩 Like builtins, but boltons. 250+ constructs, recipes, and snippets which extend (and rely on nothing but) the Python standard library. Nothing like Michael Bolton.
cache data-science data-structures file json python queue recursive standard-library statistics utilities
Last synced: 30 Jul 2024
https://github.com/rushter/data-science-blogs
A curated list of data science blogs
Last synced: 31 Jul 2024
https://github.com/afshinea/stanford-cs-230-deep-learning
VIP cheatsheets for Stanford's CS 230 Deep Learning
cheatsheet convolutional-neural-networks data-science deep-learning recurrent-neural-networks
Last synced: 31 Jul 2024
https://github.com/nteract/nteract
📘 The interactive computing suite for you! ✨
data-science desktop-application ipython jupyter jupyter-notebook monorepo notebook nteract react react-components repl zeromq
Last synced: 30 Jul 2024
https://github.com/Visualize-ML/Book3_Elements-of-Mathematics
Book_3_《数学要素》 | 鸢尾花书:从加减乘除到机器学习;上架;欢迎继续纠错,纠错多的同学还会有赠书!
data-science linear-algebra machine-learning mathematics matrix
Last synced: 02 Aug 2024
https://github.com/pachyderm/pachyderm
Data-Centric Pipelines and Data Versioning
analytics big-data containers data-analysis data-science distributed-systems docker go kubernetes pachyderm
Last synced: 30 Jul 2024
https://github.com/rhiever/Data-Analysis-and-Machine-Learning-Projects
Repository of teaching materials, code, and data for my data analysis and machine learning projects.
data-analysis data-science evolutionary-algorithm ipython-notebook machine-learning python
Last synced: 31 Jul 2024
https://github.com/haifengl/smile
Statistical Machine Intelligence & Learning Engine
classification clustering computer-algebra-system data-science dataframe deep-learning genetic-algorithm graph interpolation linear-algebra machine-learning manifold-learning multidimensional-scaling nearest-neighbor-search nlp regression statistics visualization wavelet
Last synced: 30 Jul 2024
https://github.com/dair-ai/ML-Course-Notes
🎓 Sharing machine learning course / lecture notes.
ai data-science deep-learning machine-learning natural-language-processing
Last synced: 01 Aug 2024
https://github.com/skypilot-org/skypilot
SkyPilot: Run LLMs, AI, and Batch jobs on any cloud. Get maximum savings, highest GPU availability, and managed execution—all with a simple interface.
cloud-computing cloud-management cost-management cost-optimization data-science deep-learning distributed-training finops gpu hyperparameter-tuning job-queue job-scheduler llm-serving llm-training machine-learning ml-infrastructure ml-platform multicloud spot-instances tpu
Last synced: 31 Jul 2024
https://github.com/snorkel-team/snorkel
A system for quickly generating training data with weak supervision
ai data-augmentation data-science data-slicing labeling machine-learning python snorkel training-data weak-supervision
Last synced: 30 Jul 2024
https://github.com/HazyResearch/snorkel
A system for quickly generating training data with weak supervision
ai data-augmentation data-science data-slicing labeling machine-learning python snorkel training-data weak-supervision
Last synced: 05 Aug 2024
https://github.com/growthbook/growthbook
Open Source Feature Flagging and A/B Testing Platform
ab-testing abtest abtesting analytics bigquery clickhouse continuous-delivery data-analysis data-engineering data-science experimentation feature-flagging feature-flags mixpanel redshift remote-config snowflake split-testing statistics
Last synced: 31 Jul 2024
https://github.com/airbnb/knowledge-repo
A next-generation curated knowledge sharing platform for data scientists and other technical professions.
data data-analysis data-science knowledge
Last synced: 31 Jul 2024
https://github.com/feast-dev/feast
The Open Source Feature Store for Machine Learning
big-data data-engineering data-quality data-science feature-store features machine-learning ml mlops python
Last synced: 31 Jul 2024
https://github.com/ujjwalkarn/DataSciencePython
common data analysis and machine learning tasks using python
data-science data-scientists python python-tutorial
Last synced: 30 Jul 2024
https://github.com/lux-org/lux
Automatically visualize your pandas dataframe via a single print! 📊 💡
data-science exploratory-data-analysis jupyter pandas python visualization visualization-tools
Last synced: 31 Jul 2024
https://github.com/geekywrites/datascience
This repository is a compilation of free resources for learning Data Science.
artificial-intelligence computer-vision data-science datascienceproject deeplearning machine-learning machine-learning-algorithms natural-language-processing neural-networks
Last synced: 31 Jul 2024
https://github.com/aimhubio/aim
Aim 💫 — An easy-to-use & supercharged open-source experiment tracker.
ai data-science data-visualization experiment-tracking machine-learning metadata metadata-tracking ml mlflow mlops prompt-engineering python pytorch tensorboard tensorflow visualization
Last synced: 31 Jul 2024
https://github.com/microsoft/SynapseML
Simple and Distributed Machine Learning
ai apache-spark azure big-data cognitive-services data-science databricks deep-learning http lightgbm machine-learning microsoft ml model-deployment onnx opencv pyspark scala spark synapse
Last synced: 30 Jul 2024
https://microsoft.github.io/SynapseML/
Simple and Distributed Machine Learning
ai apache-spark azure big-data cognitive-services data-science databricks deep-learning http lightgbm machine-learning microsoft ml model-deployment onnx opencv pyspark scala spark synapse
Last synced: 02 Aug 2024
https://github.com/fastai/course-v3
The 3rd edition of course.fast.ai
data-science deep-learning fastai machine-learning machine-learning-courses mooc pytorch
Last synced: 31 Jul 2024
https://github.com/flyteorg/flyte
Scalable and flexible workflow orchestration platform that seamlessly unifies data, ML and analytics stacks.
data data-analysis data-science dataops declarative fine-tuning flyte golang grpc kubernetes kubernetes-operator llm machine-learning mlops orchestration-engine production production-grade python scale workflow
Last synced: 31 Jul 2024
https://github.com/rasbt/mlxtend
A library of extension and helper modules for Python's data analysis and machine learning libraries.
association-rules data-mining data-science machine-learning python supervised-learning unsupervised-learning
Last synced: 30 Jul 2024
https://github.com/blei-lab/edward
A probabilistic programming language in TensorFlow. Deep generative models, variational inference.
bayesian-methods data-science deep-learning machine-learning neural-networks probabilistic-programming statistics tensorflow
Last synced: 31 Jul 2024
https://github.com/online-ml/river
🌊 Online machine learning in Python
concept-drift data-science incremental-learning machine-learning online-learning online-machine-learning online-statistics python real-time-processing stream-processing streaming streaming-data
Last synced: 31 Jul 2024
https://github.com/opensource9ja/danfojs
Danfo.js is an open source, JavaScript library providing high performance, intuitive, and easy to use data structures for manipulating and processing structured data.
danfojs data-analysis data-analytics data-manipulation data-science dataframe javascript pandas plotting-charts stream-data stream-processing table tensorflow tensors
Last synced: 02 Aug 2024
https://github.com/javascriptdata/danfojs
Danfo.js is an open source, JavaScript library providing high performance, intuitive, and easy to use data structures for manipulating and processing structured data.
danfojs data-analysis data-analytics data-manipulation data-science dataframe javascript pandas plotting-charts stream-data stream-processing table tensorflow tensors
Last synced: 31 Jul 2024
https://github.com/aaronwangy/Data-Science-Cheatsheet
A helpful 5-page machine learning cheatsheet to assist with exam reviews, interview prep, and anything in-between.
cheatsheet data-science machine-learning
Last synced: 01 Aug 2024
https://github.com/evidentlyai/evidently
Evaluate and monitor ML models from validation to production. Join our Discord: https://discord.com/invite/xZjKRaNp8b
data-drift data-science hacktoberfest html-report jupyter-notebook machine-learning machine-learning-operations mlops model-monitoring pandas-dataframe production-machine-learning
Last synced: 31 Jul 2024
https://github.com/biolab/orange3
🍊 :bar_chart: :bulb: Orange: Interactive data analysis
classification clustering data-mining data-science data-visualization decision-trees machine-learning numpy orange orange3 pandas plotting python random-forest regression scikit-learn scipy visual-programming visualization
Last synced: 31 Jul 2024
https://github.com/man-group/dtale
Visualizer for pandas data structures
data-analysis data-science data-visualization flask ipython jupyter-notebook pandas plotly-dash python27 python3 react react-virtualized visualization xarray
Last synced: 31 Jul 2024
https://github.com/pymupdf/PyMuPDF
PyMuPDF is a high performance Python library for data extraction, analysis, conversion & manipulation of PDF (and other) documents.
data-science epub extract-data font mupdf ocr pdf pdf-documents pymupdf python table-extraction tesseract text-processing text-shaping xps
Last synced: 01 Aug 2024
https://github.com/Nyandwi/machine_learning_complete
A comprehensive machine learning repository containing 30+ notebooks on different concepts, algorithms and techniques.
computer-vision data-analysis data-science data-visualization datascience deep-learning keras machine-learning matplotlib neural-networks nlp numpy open-source pandas python scikit-learn seaborn tensorflow
Last synced: 01 Aug 2024
https://github.com/okfn-brasil/serenata-de-amor
🕵 Artificial Intelligence for social control of public administration | **This repository does not receive frequent updates. Check out the README**
artificial-intelligence civic-tech data-science machine-learning open-data politics
Last synced: 30 Jul 2024
https://github.com/goq/telegram-list
List of telegram groups, channels & bots // Список интересных групп, каналов и ботов телеграма // Список чатов для программистов
bot coding community data-science data-science-club deep-learning devops devops-teams frontend hacker-news linux machine-learning microsoft news programming programming-languages smm telegram telegram-group theory
Last synced: 31 Jul 2024
https://github.com/BoltzmannEntropy/interviews.ai
It is my belief that you, the postgraduate students and job-seekers for whom the book is primarily meant will benefit from reading it; however, it is my hope that even the most experienced researchers will find it fascinating as well.
artificial-intelligence autograd bayesian-statistics convolutional-neural-networks data-science deep-learning ensemble-learning feature-extraction graduate-school information-theory interview-preparation jax jobs logistic-regression loss-functions machine-learning python pytorch pytorch-tutorial
Last synced: 31 Jul 2024
https://github.com/FluxML/Flux.jl
Relax! Flux is the ML library that doesn't make you tensor
data-science deep-learning flux machine-learning neural-networks the-human-brain
Last synced: 31 Jul 2024
https://github.com/makcedward/nlpaug
Data augmentation for NLP
adversarial-attacks adversarial-example ai artificial-intelligence augmentation data-science machine-learning ml natural-language-processing nlp
Last synced: 01 Aug 2024
https://github.com/awslabs/gluonts
Probabilistic time series modeling in Python
artificial-intelligence aws data-science deep-learning forecasting machine-learning mxnet neural-networks pytorch sagemaker time-series time-series-forecasting time-series-prediction timeseries torch
Last synced: 30 Jul 2024
https://github.com/awslabs/gluon-ts
Probabilistic time series modeling in Python
artificial-intelligence aws data-science deep-learning forecasting machine-learning mxnet neural-networks pytorch sagemaker time-series time-series-forecasting time-series-prediction timeseries torch
Last synced: 05 Aug 2024
https://github.com/dsgiitr/d2l-pytorch
This project reproduces the book Dive Into Deep Learning (https://d2l.ai/), adapting the code from MXNet into PyTorch.
book computer-vision d2l data-science deep-learning dive-into-deep-learning mxnet nlp pytorch pytorch-implmention
Last synced: 31 Jul 2024
https://github.com/open-metadata/OpenMetadata
Open Standard for Metadata. A Single place to Discover, Collaborate and Get your data right.
data-catalog data-collaboration data-contracts data-discovery data-governance data-lineage data-observability data-profiling data-quality data-quality-checks data-science data-validation datacatalog datadiscovery dataengineering dataquality dbt metadata metadata-management snowflake
Last synced: 31 Jul 2024
https://github.com/datawhalechina/competition-baseline
数据挖掘、计算机视觉、自然语言处理、推荐系统竞赛知识、代码、思路
data-competition data-science deep-learning kaggle
Last synced: 01 Aug 2024
https://github.com/tensorflow/probability
Probabilistic reasoning and statistical analysis in TensorFlow
bayesian-methods data-science deep-learning machine-learning neural-networks probabilistic-programming statistics tensorflow
Last synced: 30 Jul 2024
https://github.com/alandefreitas/matplotplusplus
Matplot++: A C++ Graphics Library for Data Visualization 📊🗾
charting-library charts contour-plots data-analysis data-science data-visualization graphics graphics-library graphs matplot plot-categories plots polar-plots scientific-computing scientific-visualization visualization
Last synced: 31 Jul 2024
https://github.com/louisfb01/start-machine-learning
A complete guide to start and improve in machine learning (ML), artificial intelligence (AI) in 2024 without ANY background in the field and stay up-to-date with the latest news and state-of-the-art techniques!
artificial-intelligence cheat-sheets course coursera coursera-machine-learning data-science deep-learning learn-to-code learning learning-python linear-algebra machine-learning neural-networks practice probability-statistics read-articles tutorial tutorials youtube youtube-playlist
Last synced: 31 Jul 2024
https://github.com/hill-a/stable-baselines
A fork of OpenAI Baselines, implementations of reinforcement learning algorithms
baselines data-science gym machine-learning openai python reinforcement-learning reinforcement-learning-algorithms toolbox
Last synced: 31 Jul 2024
https://github.com/polakowo/vectorbt
Find your trading edge, using the fastest engine for backtesting, algorithmic trading, and research.
algorithmic-trading algorithmic-traiding backtesting cryptocurrency data-science data-visualization finance machine-learning portfolio-optimization quantitative-analysis quantitative-finance time-series trading trading-strategies
Last synced: 31 Jul 2024
https://github.com/orchest/orchest
Build data pipelines, the easy way 🛠️
airflow cloud dag data-pipelines data-science deployment docker etl etl-pipeline ide jupyter jupyterlab kubernetes machine-learning notebooks orchest pipelines python self-hosted
Last synced: 01 Aug 2024
https://github.com/khuyentran1401/Data-science
Collection of useful data science topics along with articles, videos, and code
articles artificial-intelligence data-analysis data-science data-visualization machine-learning natural-language-processing python scraping time-series
Last synced: 31 Jul 2024
https://github.com/Azure/MachineLearningNotebooks
Python notebooks with ML and deep learning examples with Azure Machine Learning Python SDK | Microsoft
azure azure-machine-learning azure-ml azureml data-science deep-learning machine-learning notebook
Last synced: 31 Jul 2024
https://github.com/faridrashidi/kaggle-solutions
🏅 Collection of Kaggle Solutions and Ideas 🏅
awesome competition data-mining data-science kaggle machine-learning solutions
Last synced: 31 Jul 2024
https://github.com/iterative/cml
♾️ CML - Continuous Machine Learning | CI/CD for ML
bitbucket-pipelines ci ci-cd cicd cli continuous-delivery continuous-integration data-science developer-tools github-actions gitlab-ci hacktoberfest machine-learning
Last synced: 30 Jul 2024
https://github.com/nteract/hydrogen
:atom: Run code interactively, inspect data, and plot. All the power of Jupyter kernels, inside your favorite text editor.
atom data-science hydrogen ipython jupyter jupyter-kernels nteract repl
Last synced: 31 Jul 2024
https://github.com/marimo-team/marimo
A reactive notebook for Python — run reproducible experiments, execute as a script, deploy as an app, and version with git.
artificial-intelligence data-science data-visualization developer-tools machine-learning notebooks pipeline python reactive web-app
Last synced: 31 Jul 2024
https://github.com/microsoft/FLAML
A fast library for AutoML and tuning. Join our Discord: https://discord.gg/Cppx2vSPVP.
automated-machine-learning automl classification data-science deep-learning finetuning hyperparam hyperparameter-optimization jupyter-notebook machine-learning natural-language-generation natural-language-processing python random-forest regression scikit-learn tabular-data timeseries-forecasting tuning
Last synced: 31 Jul 2024
https://github.com/aws/aws-sdk-pandas
pandas on AWS - Easy integration with Athena, Glue, Redshift, Timestream, Neptune, OpenSearch, QuickSight, Chime, CloudWatchLogs, DynamoDB, EMR, SecretManager, PostgreSQL, MySQL, SQLServer and S3 (Parquet, CSV, JSON and EXCEL).
amazon-athena amazon-sagemaker-notebook apache-arrow apache-parquet athena aws aws-glue aws-lambda data-engineering data-science emr etl glue-catalog lambda modin mysql pandas python ray redshift
Last synced: 01 Aug 2024
https://github.com/gopherdata/gophernotes
The Go kernel for Jupyter notebooks and nteract.
artificial-intelligence data-science go golang gophernotes jupyter jupyter-notebook kernel machine-learning nteract numerical-methods zeromq
Last synced: 31 Jul 2024
https://github.com/mljar/mercury
Convert Jupyter Notebooks to Web Apps
data-science data-visualization jupyter jupyter-lab jupyter-notebook mercury mljar-mercury notebook notebook-application notebook-jupyter notebook-publish notebook-web notebooks-jupyter python
Last synced: 30 Jul 2024
https://github.com/jdb78/pytorch-forecasting
Time series forecasting with PyTorch
ai artifical-intelligense data-science deep-learning forecasting gpu machine-learning neural-networks pandas python pytorch pytorch-lightning temporal timeseries timeseries-forecasting uncertainty
Last synced: 01 Aug 2024
https://github.com/zenml-io/zenml
ZenML 🙏: Build portable, production-ready MLOps pipelines. https://zenml.io.
ai automl data-science deep-learning devops-tools hacktoberfest llm llmops machine-learning metadata-tracking ml mlops pipelines production-ready pytorch tensorflow workflow zenml
Last synced: 01 Aug 2024
https://github.com/justmarkham/scikit-learn-videos
Jupyter notebooks from the scikit-learn video series
data-science jupyter-notebook machine-learning python scikit-learn tutorial
Last synced: 01 Aug 2024
https://github.com/Nixtla/statsforecast
Lightning ⚡️ fast forecasting with statistical and econometric models.
arima automl baselines data-science econometrics ets exponential-smoothing fbprophet forecasting machine-learning mstl naive neuralprophet predictions prophet python seasonal-naive statistics theta time-series
Last synced: 31 Jul 2024
https://github.com/hemansnation/God-Level-AI
A collection of scientific methods, processes, algorithms, and systems to build stories & models.
computer-vision data-engineering data-science data-structures-and-algorithms data-system-design data-visualization datastructures deep-learning machine-learning matplotlib mlops natural-language-processing numpy pandas python pytorch scikit-learn statistics tableau
Last synced: 31 Jul 2024
https://github.com/jtablesaw/tablesaw
Java dataframe and visualization library
chart data-analysis data-frame data-science data-visualization dataframe high-performance java java-dataframe machine-learning plotly plotting statistical-analysis statistics visualization
Last synced: 01 Aug 2024
https://github.com/polyaxon/polyaxon
MLOps Tools For Managing & Orchestrating The Machine Learning LifeCycle
artificial-intelligence caffe data-science deep-learning hyperparameter-optimization jupyter jupyterlab k8s keras kubernetes machine-learning ml mlops mxnet notebook pipelines pytorch reinforcement-learning tensorflow workflow
Last synced: 31 Jul 2024
https://github.com/spotify/chartify
Python library that makes it easy for data scientists to create charts.
bokeh data-science plots plotting python visualization
Last synced: 30 Jul 2024
https://github.com/kubeflow/pipelines
Machine Learning Pipelines for Kubeflow
data-science kubeflow kubeflow-pipelines kubernetes machine-learning mlops pipeline
Last synced: 30 Jul 2024
https://github.com/chiphuyen/python-is-cool
Cool Python features for machine learning that I used to be too afraid to use. Will be updated as I have more time / learn more.
advanced-python data-science machine-learning python-tutorials python3
Last synced: 30 Jul 2024
https://github.com/evidence-dev/evidence
Business intelligence as code: build fast, interactive data visualizations in pure SQL and markdown
analytics business-intelligence dashboard data-engineering data-science data-visualization dbt duckdb exploratory-data-analysis finance open-source self-hosted sql statistics svelte tailwindcss webassembly
Last synced: 31 Jul 2024
https://github.com/ploomber/ploomber
The fastest ⚡️ way to build data pipelines. Develop iteratively, deploy anywhere. ☁️
data-engineering data-science jupyter jupyter-notebooks machine-learning mlops notebooks papermill pipelines pycharm vscode workflow
Last synced: 01 Aug 2024
https://github.com/jpmorganchase/python-training
Python training for business analysts and traders
banking binder binder-ready data-science finance jpmorgan jpmorganchase jupyter jupyterlab python
Last synced: 01 Aug 2024
https://github.com/fastai/course-nlp
A Code-First Introduction to NLP course
data-science machine-learning nlp python
Last synced: 31 Jul 2024
https://github.com/deepchecks/deepchecks
Deepchecks: Tests for Continuous Validation of ML Models & Data. Deepchecks is a holistic open-source solution for all of your AI & ML validation needs, enabling to thoroughly test your data and models from research to production.
data-drift data-science data-validation deep-learning html-report jupyter-notebook machine-learning ml mlops model-monitoring model-validation pandas-dataframe python pytorch
Last synced: 31 Jul 2024
https://github.com/microsoft/tensorwatch
Debugging, monitoring and visualization for Python Machine Learning and Data Science
ai data-science debug debugging debugging-tool deep-learning deeplearning explainable-ai explainable-ml jupyter jupyter-notebook machine-learning machinelearning model-visualization monitoring python reinforcement-learning saliency
Last synced: 31 Jul 2024
https://github.com/ml-tooling/ml-workspace
🛠 All-in-one web-based IDE specialized for machine learning and data science.
anaconda data-analysis data-science data-visualization deep-learning docker gpu jupyter jupyter-lab jupyter-notebook kubernetes machine-learning neural-networks nlp python pytorch r scikit-learn tensorflow vscode
Last synced: 30 Jul 2024