Ecosyste.ms: Awesome
An open API service indexing awesome lists of open source software.
Data Science
Data science is an inter-disciplinary field that uses scientific methods, processes, algorithms, and systems to extract knowledge from structured and unstructured data. Data scientists perform data analysis and preparation, and their findings inform high-level decisions in many organizations.
- GitHub: https://github.com/topics/data-science
- Wikipedia: https://en.wikipedia.org/wiki/Data_science
- Related Topics: data-analysis, data-mining, machine-learning, big-data, data-visualization,
- Aliases: datasciences, data-science-project, data-science-algorithm,
- Last updated: 2024-11-09 00:06:40 UTC
- JSON Representation
https://github.com/hill-a/stable-baselines
A fork of OpenAI Baselines, implementations of reinforcement learning algorithms
baselines data-science gym machine-learning openai python reinforcement-learning reinforcement-learning-algorithms toolbox
Last synced: 30 Oct 2024
https://github.com/louisfb01/start-machine-learning
A complete guide to start and improve in machine learning (ML), artificial intelligence (AI) in 2024 without ANY background in the field and stay up-to-date with the latest news and state-of-the-art techniques!
artificial-intelligence cheat-sheets course coursera coursera-machine-learning data-science deep-learning learn-to-code learning learning-python linear-algebra machine-learning neural-networks practice probability-statistics read-articles tutorial tutorials youtube youtube-playlist
Last synced: 11 Oct 2024
https://github.com/Azure/MachineLearningNotebooks
Python notebooks with ML and deep learning examples with Azure Machine Learning Python SDK | Microsoft
azure azure-machine-learning azure-ml azureml data-science deep-learning machine-learning notebook
Last synced: 30 Oct 2024
https://github.com/khuyentran1401/Data-science
Collection of useful data science topics along with articles, videos, and code
articles artificial-intelligence data-analysis data-science data-visualization machine-learning natural-language-processing python scraping time-series
Last synced: 30 Oct 2024
https://github.com/khuyentran1401/data-science
Collection of useful data science topics along with articles, videos, and code
articles artificial-intelligence data-analysis data-science data-visualization machine-learning natural-language-processing python scraping time-series
Last synced: 15 Oct 2024
https://github.com/faridrashidi/kaggle-solutions
🏅 Collection of Kaggle Solutions and Ideas 🏅
awesome competition data-mining data-science kaggle machine-learning solutions
Last synced: 25 Sep 2024
https://github.com/orchest/orchest
Build data pipelines, the easy way 🛠️
airflow cloud dag data-pipelines data-science deployment docker etl etl-pipeline ide jupyter jupyterlab kubernetes machine-learning notebooks orchest pipelines python self-hosted
Last synced: 11 Oct 2024
https://github.com/iterative/cml
♾️ CML - Continuous Machine Learning | CI/CD for ML
bitbucket-pipelines ci ci-cd cicd cli continuous-delivery continuous-integration data-science developer-tools github-actions gitlab-ci hacktoberfest machine-learning
Last synced: 29 Oct 2024
https://github.com/azure/machinelearningnotebooks
Python notebooks with ML and deep learning examples with Azure Machine Learning Python SDK | Microsoft
azure azure-machine-learning azure-ml azureml data-science deep-learning machine-learning notebook
Last synced: 29 Oct 2024
https://github.com/nteract/hydrogen
:atom: Run code interactively, inspect data, and plot. All the power of Jupyter kernels, inside your favorite text editor.
atom data-science hydrogen ipython jupyter jupyter-kernels nteract repl
Last synced: 11 Oct 2024
https://github.com/marimo-team/marimo
A reactive notebook for Python — run reproducible experiments, execute as a script, deploy as an app, and version with git.
artificial-intelligence data-science data-visualization developer-tools machine-learning notebooks pipeline python reactive web-app
Last synced: 15 Oct 2024
https://github.com/microsoft/flaml
A fast library for AutoML and tuning. Join our Discord: https://discord.gg/Cppx2vSPVP.
automated-machine-learning automl classification data-science deep-learning finetuning hyperparam hyperparameter-optimization jupyter-notebook machine-learning natural-language-generation natural-language-processing python random-forest regression scikit-learn tabular-data timeseries-forecasting tuning
Last synced: 28 Oct 2024
https://github.com/microsoft/FLAML
A fast library for AutoML and tuning. Join our Discord: https://discord.gg/Cppx2vSPVP.
automated-machine-learning automl classification data-science deep-learning finetuning hyperparam hyperparameter-optimization jupyter-notebook machine-learning natural-language-generation natural-language-processing python random-forest regression scikit-learn tabular-data timeseries-forecasting tuning
Last synced: 27 Oct 2024
https://github.com/sktime/pytorch-forecasting
Time series forecasting with PyTorch
ai artifical-intelligense data-science deep-learning forecasting gpu machine-learning neural-networks pandas python pytorch pytorch-lightning temporal timeseries timeseries-forecasting uncertainty
Last synced: 09 Oct 2024
https://github.com/aws/aws-sdk-pandas
pandas on AWS - Easy integration with Athena, Glue, Redshift, Timestream, Neptune, OpenSearch, QuickSight, Chime, CloudWatchLogs, DynamoDB, EMR, SecretManager, PostgreSQL, MySQL, SQLServer and S3 (Parquet, CSV, JSON and EXCEL).
amazon-athena amazon-sagemaker-notebook apache-arrow apache-parquet athena aws aws-glue aws-lambda data-engineering data-science emr etl glue-catalog lambda modin mysql pandas python ray redshift
Last synced: 07 Oct 2024
https://github.com/gopherdata/gophernotes
The Go kernel for Jupyter notebooks and nteract.
artificial-intelligence data-science go golang gophernotes jupyter jupyter-notebook kernel machine-learning nteract numerical-methods zeromq
Last synced: 11 Oct 2024
https://github.com/mljar/mercury
Convert Jupyter Notebooks to Web Apps
data-science data-visualization jupyter jupyter-lab jupyter-notebook mercury mljar-mercury notebook notebook-application notebook-jupyter notebook-publish notebook-web notebooks-jupyter python
Last synced: 11 Oct 2024
https://github.com/lancedb/lance
Modern columnar data format for ML and LLMs implemented in Rust. Convert from parquet in 2 lines of code for 100x faster random access, vector index, and data versioning. Compatible with Pandas, DuckDB, Polars, Pyarrow, with more integrations coming..
apache-arrow computer-vision data-analysis data-analytics data-centric data-format data-science dataops deep-learning duckdb embeddings llms machine-learning mlops python rust
Last synced: 29 Oct 2024
https://github.com/justmarkham/scikit-learn-videos
Jupyter notebooks from the scikit-learn video series
data-science jupyter-notebook machine-learning python scikit-learn tutorial
Last synced: 13 Oct 2024
https://github.com/TDAmeritrade/stumpy
STUMPY is a powerful and scalable Python library for modern time series analysis
anomaly-detection dask data-science matrix-profile motif-discovery numba pattern-matching pydata python time-series-analysis time-series-data-mining time-series-segmentation
Last synced: 30 Oct 2024
https://github.com/tdameritrade/stumpy
STUMPY is a powerful and scalable Python library for modern time series analysis
anomaly-detection dask data-science matrix-profile motif-discovery numba pattern-matching pydata python time-series-analysis time-series-data-mining time-series-segmentation
Last synced: 29 Oct 2024
https://github.com/zenml-io/zenml
ZenML 🙏: Build portable, production-ready MLOps pipelines. https://zenml.io.
ai automl data-science deep-learning devops-tools hacktoberfest llm llmops machine-learning metadata-tracking ml mlops pipelines production-ready pytorch tensorflow workflow zenml
Last synced: 14 Oct 2024
https://github.com/khanhnamle1994/cracking-the-data-science-interview
A Collection of Cheatsheets, Books, Questions, and Portfolio For DS/ML Interview Prep
concepts data-journalism data-portfolio data-science data-wrangling deep-learning downloadable-cheatsheets machine-learning python statistics
Last synced: 15 Oct 2024
https://github.com/nixtla/statsforecast
Lightning ⚡️ fast forecasting with statistical and econometric models.
arima automl baselines data-science econometrics ets exponential-smoothing fbprophet forecasting machine-learning mstl naive neuralprophet predictions prophet python seasonal-naive statistics theta time-series
Last synced: 29 Oct 2024
https://github.com/Nixtla/statsforecast
Lightning ⚡️ fast forecasting with statistical and econometric models.
arima automl baselines data-science econometrics ets exponential-smoothing fbprophet forecasting machine-learning mstl naive neuralprophet predictions prophet python seasonal-naive statistics theta time-series
Last synced: 31 Oct 2024
https://github.com/deepchecks/deepchecks
Deepchecks: Tests for Continuous Validation of ML Models & Data. Deepchecks is a holistic open-source solution for all of your AI & ML validation needs, enabling to thoroughly test your data and models from research to production.
data-drift data-science data-validation deep-learning html-report jupyter-notebook machine-learning ml mlops model-monitoring model-validation pandas-dataframe python pytorch
Last synced: 29 Oct 2024
https://github.com/jtablesaw/tablesaw
Java dataframe and visualization library
chart data-analysis data-frame data-science data-visualization dataframe high-performance java java-dataframe machine-learning plotly plotting statistical-analysis statistics visualization
Last synced: 29 Oct 2024
https://github.com/hemansnation/God-Level-AI
A collection of scientific methods, processes, algorithms, and systems to build stories & models.
computer-vision data-engineering data-science data-structures-and-algorithms data-system-design data-visualization datastructures deep-learning machine-learning matplotlib mlops natural-language-processing numpy pandas python pytorch scikit-learn statistics tableau
Last synced: 31 Oct 2024
https://github.com/fastai/fastpages
An easy to use blogging platform, with enhanced support for Jupyter Notebooks.
actions data-science fastai github-pages jekyll jekyll-blog jupyter jupyter-lab jupyter-notebooks literate-programming nbdev python visualization visualization-tools
Last synced: 27 Sep 2024
https://github.com/hemansnation/god-level-ai
A collection of scientific methods, processes, algorithms, and systems to build stories & models.
computer-vision data-engineering data-science data-structures-and-algorithms data-system-design data-visualization datastructures deep-learning machine-learning matplotlib mlops natural-language-processing numpy pandas python pytorch scikit-learn statistics tableau
Last synced: 10 Oct 2024
https://github.com/chiphuyen/python-is-cool
Cool Python features for machine learning that I used to be too afraid to use. Will be updated as I have more time / learn more.
advanced-python data-science machine-learning python-tutorials python3
Last synced: 14 Oct 2024
https://github.com/ploomber/ploomber
The fastest ⚡️ way to build data pipelines. Develop iteratively, deploy anywhere. ☁️
data-engineering data-science jupyter jupyter-notebooks machine-learning mlops notebooks papermill pipelines pycharm vscode workflow
Last synced: 29 Oct 2024
https://github.com/polyaxon/polyaxon
MLOps Tools For Managing & Orchestrating The Machine Learning LifeCycle
artificial-intelligence caffe data-science deep-learning hyperparameter-optimization jupyter jupyterlab k8s keras kubernetes machine-learning ml mlops mxnet notebook pipelines pytorch reinforcement-learning tensorflow workflow
Last synced: 13 Oct 2024
https://github.com/spotify/chartify
Python library that makes it easy for data scientists to create charts.
bokeh data-science plots plotting python visualization
Last synced: 15 Oct 2024
https://github.com/kubeflow/pipelines
Machine Learning Pipelines for Kubeflow
data-science kubeflow kubeflow-pipelines kubernetes machine-learning mlops pipeline
Last synced: 28 Oct 2024
https://github.com/evidence-dev/evidence
Business intelligence as code: build fast, interactive data visualizations in pure SQL and markdown
analytics business-intelligence dashboard data-engineering data-science data-visualization dbt duckdb exploratory-data-analysis finance open-source self-hosted sql statistics svelte tailwindcss webassembly
Last synced: 10 Oct 2024
https://github.com/fastai/course-nlp
A Code-First Introduction to NLP course
data-science machine-learning nlp python
Last synced: 15 Oct 2024
https://github.com/jonkrohn/ml-foundations
Machine Learning Foundations: Linear Algebra, Calculus, Statistics & Computer Science
calculus computer-science data-science data-structures jupyter-notebook linear-algebra machine-learning mathematics numpy probability python pytorch statistics tensorflow
Last synced: 15 Oct 2024
https://github.com/ml-tooling/ml-workspace
🛠 All-in-one web-based IDE specialized for machine learning and data science.
anaconda data-analysis data-science data-visualization deep-learning docker gpu jupyter jupyter-lab jupyter-notebook kubernetes machine-learning neural-networks nlp python pytorch r scikit-learn tensorflow vscode
Last synced: 10 Oct 2024
https://github.com/microsoft/tensorwatch
Debugging, monitoring and visualization for Python Machine Learning and Data Science
ai data-science debug debugging debugging-tool deep-learning deeplearning explainable-ai explainable-ml jupyter jupyter-notebook machine-learning machinelearning model-visualization monitoring python reinforcement-learning saliency
Last synced: 11 Oct 2024
https://github.com/databricks/koalas
Koalas: pandas API on Apache Spark
big-data data-science dataframe mlflow pandas pydata spark
Last synced: 12 Oct 2024
https://github.com/youssefHosni/Data-Science-Interview-Questions-Answers
Curated list of data science interview questions and answers
data-science deep-learning interview-questions machine-learning python
Last synced: 07 Nov 2024
https://github.com/Moataz-Elmesmary/Data-Science-Roadmap
Data Science Roadmap from A to Z
big-data chatgpt cheatsheet cv-template data-analysis data-engineering data-science data-visualization deep-learning interview-questions linear-algebra llms machine-learning mathematics neural-network nlp probability python sql statistics
Last synced: 29 Oct 2024
https://github.com/eto-ai/lance
Modern columnar data format for ML and LLMs implemented in Rust. Convert from parquet in 2 lines of code for 100x faster random access, vector index, and data versioning. Compatible with Pandas, DuckDB, Polars, Pyarrow, with more integrations coming..
apache-arrow computer-vision data-analysis data-analytics data-centric data-format data-science dataops deep-learning duckdb embeddings llms machine-learning mlops python rust
Last synced: 02 Aug 2024
https://github.com/moataz-elmesmary/data-science-roadmap
Data Science Roadmap from A to Z
big-data chatgpt cheatsheet cv-template data-analysis data-engineering data-science data-visualization deep-learning interview-questions linear-algebra llms machine-learning mathematics neural-network nlp probability python sql statistics
Last synced: 15 Oct 2024
https://github.com/alibaba/GraphScope
🔨 🍇 💻 🚀 GraphScope: A One-Stop Large-Scale Graph Computing System from Alibaba | 一站式图计算系统
analytics big-data data-science graph graph-analytics graph-computation graph-computing graph-data graph-neural-networks gremlin
Last synced: 06 Nov 2024
https://github.com/youssefhosni/data-science-interview-questions-answers
Curated list of data science interview questions and answers
data-science deep-learning interview-questions machine-learning python
Last synced: 14 Oct 2024
https://github.com/opengeos/leafmap
A Python package for interactive mapping and geospatial analysis with minimal coding in a Jupyter environment
data-science dataviz folium geoparquet geopython geospatial geospatial-analysis gis ipyleaflet jupyter jupyter-notebook leafmap mapping plotly python solara streamlit streamlit-webapp whiteboxtools
Last synced: 31 Oct 2024
https://github.com/jonkrohn/ML-foundations
Machine Learning Foundations: Linear Algebra, Calculus, Statistics & Computer Science
calculus computer-science data-science data-structures jupyter-notebook linear-algebra machine-learning mathematics numpy probability python pytorch statistics tensorflow
Last synced: 02 Aug 2024
https://github.com/ethen8181/machine-learning
:earth_americas: machine learning tutorials (mainly in Python3)
data-science deep-learning jupyter-notebook machine-learning python python3
Last synced: 15 Oct 2024
https://github.com/gee-community/geemap
A Python package for interactive geospatial analysis and visualization with Google Earth Engine.
colab data-science dataviz earth-engine earthengine folium geospatial gis google-earth-engine image-processing ipyleaflet ipywidgets jupyter jupyter-notebook landsat mapping python remote-sensing streamlit streamlit-webapp
Last synced: 13 Oct 2024
https://github.com/giswqs/geemap
A Python package for interactive geospatial analysis and visualization with Google Earth Engine.
colab data-science dataviz earth-engine earthengine folium geospatial gis google-earth-engine image-processing ipyleaflet ipywidgets jupyter jupyter-notebook landsat mapping python remote-sensing streamlit streamlit-webapp
Last synced: 10 Aug 2024
https://github.com/spark-notebook/spark-notebook
Interactive and Reactive Data Science using Scala and Spark.
apache-spark data-science notebook reactive scala spark
Last synced: 12 Oct 2024
https://github.com/andypetrella/spark-notebook
Interactive and Reactive Data Science using Scala and Spark.
apache-spark data-science notebook reactive scala spark
Last synced: 12 Oct 2024
https://github.com/antonycourtney/tad
A desktop application for viewing and analyzing tabular data
csv data-analysis data-science database desktop-application duckdb parquet-viewer pivot-tables pivots tabular-data
Last synced: 28 Oct 2024
https://github.com/alibaba/graphscope
🔨 🍇 💻 🚀 GraphScope: A One-Stop Large-Scale Graph Computing System from Alibaba | 一站式图计算系统
analytics big-data data-science graph graph-analytics graph-computation graph-computing graph-data graph-neural-networks gremlin
Last synced: 29 Oct 2024
https://github.com/ndleah/python-mini-project
🙌 Welcome open-source Python mini-project contributions!
data-analysis data-science data-visualization newbie-code newbie-friendly python python-mini-projects python-programming-exercises python-project-beginner
Last synced: 09 Oct 2024
https://github.com/aksnzhy/xlearn
High performance, easy-to-use, and scalable machine learning (ML) package, including linear model (LR), factorization machines (FM), and field-aware factorization machines (FFM) for Python and CLI interface.
data-analysis data-science factorization-machines ffm fm machine-learning statistics
Last synced: 15 Oct 2024
https://github.com/nidhaloff/igel
a delightful machine learning tool that allows you to train, test, and use models without writing code
artificial-intelligence automation automl automl-experiments data-analysis data-science hacktoberfest hacktoberfest2021 machine-learning machine-learning-algorithms machine-learning-library machinelearning neural-network neural-networks preprocessing scikit-learn scikitlearn-machine-learning sklearn
Last synced: 13 Oct 2024
https://github.com/tirthajyoti/machine-learning-with-python
Practice and tutorial-style notebooks covering wide variety of machine learning techniques
artificial-intelligence classification clustering data-science decision-trees deep-learning dimensionality-reduction flask k-nearest-neighbours machine-learning matplotlib naive-bayes neural-network numpy pandas pytest random-forest regression scikit-learn statistics
Last synced: 10 Oct 2024
https://github.com/tirthajyoti/Machine-Learning-with-Python
Practice and tutorial-style notebooks covering wide variety of machine learning techniques
artificial-intelligence classification clustering data-science decision-trees deep-learning dimensionality-reduction flask k-nearest-neighbours machine-learning matplotlib naive-bayes neural-network numpy pandas pytest random-forest regression scikit-learn statistics
Last synced: 07 Aug 2024
https://github.com/shogun-toolbox/shogun
Shōgun
artificial-intelligence c-plus-plus cmake data-science machine-learning swig
Last synced: 14 Oct 2024
https://github.com/bfortuner/ml-glossary
Machine learning glossary
cheatsheets data-science deep-learning deep-learning-tutorial machine-learning neural-network
Last synced: 14 Oct 2024
https://github.com/GokuMohandas/mlops-course
Learn how to design, develop, deploy and iterate on production-grade ML applications.
data-engineering data-quality data-science deep-learning distributed-ml llms machine-learning mlops natural-language-processing python pytorch ray
Last synced: 30 Oct 2024
https://github.com/stellargraph/stellargraph
StellarGraph - Machine Learning on Graphs
data-science deep-learning gcn geometric-deep-learning graph-analysis graph-convolutional-networks graph-data graph-machine-learning graph-neural-networks graphs heterogeneous-networks interpretability link-prediction machine-learning machine-learning-algorithms networkx python saliency-map stellargraph-library
Last synced: 14 Oct 2024
https://github.com/gokumohandas/mlops-course
Learn how to design, develop, deploy and iterate on production-grade ML applications.
data-engineering data-quality data-science deep-learning distributed-ml llms machine-learning mlops natural-language-processing python pytorch ray
Last synced: 15 Oct 2024
https://github.com/mljar/mljar-supervised
Python package for AutoML on Tabular Data with Feature Engineering, Hyper-Parameters Tuning, Explanations and Automatic Documentation
automated-machine-learning automatic-machine-learning automl catboost data-science decision-tree ensemble feature-engineering hyper-parameters hyperparameter-optimization lightgbm machine-learning mljar models-tuning neural-network random-forest scikit-learn shap tuning-algorithm xgboost
Last synced: 13 Oct 2024
https://github.com/tirthajyoti/Data-science-best-resources
Carefully curated resource links for data science in one place
analytics api artificial-intelligence aws cheatsheet data-science data-wrangling database deep-learning linux machine-learning neural-network online-course python r reinforcement-learning scikit-learn sql statistics visualization
Last synced: 07 Nov 2024
https://github.com/giswqs/leafmap
A Python package for interactive mapping and geospatial analysis with minimal coding in a Jupyter environment
data-science dataviz folium geoparquet geopython geospatial geospatial-analysis gis ipyleaflet jupyter jupyter-notebook leafmap mapping plotly python streamlit streamlit-webapp whiteboxtools
Last synced: 11 Oct 2024
https://github.com/mrdbourke/zero-to-mastery-ml
All course materials for the Zero to Mastery Machine Learning and Data Science course.
data-science deep-learning machine-learning
Last synced: 15 Oct 2024
https://github.com/determined-ai/determined
Determined is an open-source machine learning platform that simplifies distributed training, hyperparameter tuning, experiment tracking, and resource management. Works with PyTorch and TensorFlow.
data-science deep-learning distributed-training hyperparameter-optimization hyperparameter-search hyperparameter-tuning keras kubernetes machine-learning ml-infrastructure ml-platform mlops pytorch tensorflow
Last synced: 29 Oct 2024
https://github.com/dotnet/interactive
.NET Interactive combines the power of .NET with many other languages to create notebooks, REPLs, and embedded coding experiences. Share code, explore data, write, and learn across your apps in ways you couldn't before.
csharp data-science dotnet-interactive fsharp interactive-programming jupyter notebooks polyglot polyglot-dev powershell
Last synced: 07 Oct 2024
https://github.com/tirthajyoti/data-science-best-resources
Carefully curated resource links for data science in one place
analytics api artificial-intelligence aws cheatsheet data-science data-wrangling database deep-learning linux machine-learning neural-network online-course python r reinforcement-learning scikit-learn sql statistics visualization
Last synced: 10 Oct 2024
https://github.com/fbdesignpro/sweetviz
Visualize and compare datasets, target values and associations, with one line of code.
data-analysis data-exploration data-profiling data-science data-visualization eda exploration exploratory-data-analysis machine-learning pandas pandas-dataframe python statistics
Last synced: 15 Oct 2024
https://github.com/parrt/dtreeviz
A python library for decision tree visualization and model interpretation.
data-science decision-trees machine-learning model-interpretation python random-forest scikit-learn visualization xgboost
Last synced: 13 Oct 2024
https://github.com/libffcv/ffcv
FFCV: Fast Forward Computer Vision (and other ML workloads!)
data-science machine-learning pytorch
Last synced: 29 Oct 2024
https://github.com/datafold/data-diff
Compare tables within or across databases
data data-diffing data-engineering data-quality data-quality-monitoring data-science database databricks-sql dataengineering dataquality dbt mysql oracle-database postgres postgresql python rdbms snowflake sql trino
Last synced: 29 Oct 2024
https://github.com/rasbt/deep-learning-book
Repository for "Introduction to Artificial Neural Networks and Deep Learning: A Practical Guide with Applications in Python"
artificial-intelligence data-science deep-learning machine-learning neural-network python pytorch tensorflow
Last synced: 15 Oct 2024
https://github.com/teamhg-memex/eli5
A library for debugging/inspecting machine learning classifiers and explaining their predictions
crfsuite data-science explanation inspection lightgbm machine-learning nlp python scikit-learn xgboost
Last synced: 10 Oct 2024
https://github.com/quadratichq/quadratic
Quadratic | Data Science Spreadsheet with Python & SQL
data data-analysis data-engineering data-science etl python quadratic spreadsheet sql wasm webgl
Last synced: 15 Oct 2024
https://matheusfacure.github.io/python-causality-handbook/
Causal Inference for the Brave and True. A light-hearted yet rigorous approach to learning about impact estimation and causality.
causal-inference causality data-science econometrics harmless-econometrics impact-estimation python
Last synced: 05 Nov 2024
https://github.com/TeamHG-Memex/eli5
A library for debugging/inspecting machine learning classifiers and explaining their predictions
crfsuite data-science explanation inspection lightgbm machine-learning nlp python scikit-learn xgboost
Last synced: 02 Aug 2024
https://github.com/rbhatia46/data-science-interview-resources
A repository listing out the potential sources which will help you in preparing for a Data Science/Machine Learning interview. New resources added frequently.
artificial-intelligence data-science data-science-interview interview-questions interview-resources learning-resources machine-learning machine-learning-interview
Last synced: 15 Oct 2024
https://github.com/rbhatia46/Data-Science-Interview-Resources
A repository listing out the potential sources which will help you in preparing for a Data Science/Machine Learning interview. New resources added frequently.
artificial-intelligence data-science data-science-interview interview-questions interview-resources learning-resources machine-learning machine-learning-interview
Last synced: 07 Nov 2024
https://github.com/afshinea/stanford-cs-221-artificial-intelligence
VIP cheatsheets for Stanford's CS 221 Artificial Intelligence
a-star artificial-intelligence bayesian-networks cheatsheet constraint-satisfaction-problem data-science markov-decision-processes
Last synced: 15 Oct 2024
https://github.com/ashishpatel26/andrew-ng-notes
This is Andrew NG Coursera Handwritten Notes.
andrew-ng andrew-ng-course andrew-ng-machine-learning andrewng coursera coursera-machine-learning data-science deep-learning deep-neural-networks dl machine-learning ml neural-network neural-networks numpy pandas python pytorch reinforcement-learning
Last synced: 14 Oct 2024
https://github.com/whylabs/whylogs
An open-source data logging library for machine learning models and data pipelines. 📚 Provides visibility into data quality & model performance over time. 🛡️ Supports privacy-preserving data collection, ensuring safety & robustness. 📈
ai-pipelines analytics approximate-statistics calculate-statistics constraints data-constraints data-pipeline data-quality data-science dataops dataset logging machine-learning ml-pipelines mlops model-performance python statistical-properties
Last synced: 15 Oct 2024
https://github.com/justinzm/gopup
数据接口:百度、谷歌、头条、微博指数,宏观数据,利率数据,货币汇率,千里马、独角兽公司,新闻联播文字稿,影视票房数据,高校名单,疫情数据…
covid19-data data data-analysis data-science datasets economic-data gopup index-data python
Last synced: 15 Oct 2024
https://github.com/hosseinmoein/DataFrame
C++ DataFrame for statistical, Financial, and ML analysis -- in modern C++ using native types and contiguous memory storage
ai cpp data-analysis data-science dataframe financial-data-analysis financial-engineering heterogeneous-data large-data machine-learning multidimensional-data numerical-analysis pandas polars statistical statistical-analysis tensor tensorboard trading-algorithms trading-strategies
Last synced: 26 Oct 2024
https://github.com/yunabe/lgo
Interactive Go programming with Jupyter
data-science go golang jupyter-notebook jupyter-notebook-kernel machine-learning repl
Last synced: 09 Oct 2024
https://github.com/matheusfacure/python-causality-handbook
Causal Inference for the Brave and True. A light-hearted yet rigorous approach to learning about impact estimation and causality.
causal-inference causality data-science econometrics harmless-econometrics impact-estimation python
Last synced: 15 Oct 2024
https://github.com/reiinakano/scikit-plot
An intuitive library to add plotting functionality to scikit-learn objects.
data-science machine-learning plot plotting scikit-learn visualization
Last synced: 29 Oct 2024
https://github.com/jayinai/data-science-question-answer
A repo for data science related questions and answers
data-science deep-learning machine-learning reinforcement-learning sql statistics system
Last synced: 14 Oct 2024
https://github.com/tirthajyoti/papers-literature-ml-dl-rl-ai
Highly cited and useful papers related to machine learning, deep learning, AI, game theory, reinforcement learning
artificial-intelligence data-mining data-science deep-learning game-theory hardware learning-theory literature machine-learning machine-learning-algorithms neural-network paper pattern-recognition reinforcement-learning silicon statistical-learning statistics
Last synced: 31 Oct 2024
https://github.com/MilesCranmer/PySR
High-Performance Symbolic Regression in Python and Julia
algorithm automl data-science distributed-systems equation-discovery evolutionary-algorithms explainable-ai genetic-algorithm interpretable-ml julia machine-learning python scikit-learn symbolic symbolic-regression
Last synced: 30 Oct 2024
https://github.com/visualize-ml/book7_visualizations-for-machine-learning
Book_7_《机器学习》 | 鸢尾花书:从加减乘除到机器学习;欢迎批评指正
baysian data-science linear-algebra machine-learning machine-learning-algorithms matrix
Last synced: 15 Oct 2024
https://github.com/tirthajyoti/Papers-Literature-ML-DL-RL-AI
Highly cited and useful papers related to machine learning, deep learning, AI, game theory, reinforcement learning
artificial-intelligence data-mining data-science deep-learning game-theory hardware learning-theory literature machine-learning machine-learning-algorithms neural-network paper pattern-recognition reinforcement-learning silicon statistical-learning statistics
Last synced: 30 Oct 2024
https://github.com/CamDavidsonPilon/lifelines
Survival analysis in Python
cox-regression data-science maximum-likelihood python reliability-analysis statistics survival-analysis
Last synced: 30 Oct 2024
https://github.com/camdavidsonpilon/lifelines
Survival analysis in Python
cox-regression data-science maximum-likelihood python reliability-analysis statistics survival-analysis
Last synced: 29 Oct 2024