{"id":18946675,"url":"https://github.com/groda/big_data","last_synced_at":"2025-04-06T20:10:20.215Z","repository":{"id":45664152,"uuid":"204785265","full_name":"groda/big_data","owner":"groda","description":"Tutorials on Big Data essentials: Hadoop, MapReduce, Spark. Explore a variety of tutorials and demonstrations on Big Data technologies, primarily in the form of Jupyter notebooks. Most notebooks are self-contained and live—ready to run with a click.","archived":false,"fork":false,"pushed_at":"2025-01-02T13:58:01.000Z","size":54389,"stargazers_count":73,"open_issues_count":0,"forks_count":26,"subscribers_count":2,"default_branch":"master","last_synced_at":"2025-03-30T19:05:42.150Z","etag":null,"topics":["apache-sedona","apache-spark","big-data","bigdata","bigtop","docker","gutenberg-ebooks","hadoop","hadoop-cluster","hadoop-hdfs","hadoop-mapreduce","jupyter-notebook","mapreduce","mapreduce-bash","mrjob","pyspark","spark","spark-sql","testdfsio"],"latest_commit_sha":null,"homepage":"","language":"Jupyter Notebook","has_issues":true,"has_wiki":null,"has_pages":null,"mirror_url":null,"source_name":null,"license":"mit","status":null,"scm":"git","pull_requests_enabled":true,"icon_url":"https://github.com/groda.png","metadata":{"files":{"readme":"README.md","changelog":null,"contributing":null,"funding":null,"license":"LICENSE","code_of_conduct":null,"threat_model":null,"audit":null,"citation":null,"codeowners":null,"security":null,"support":null,"governance":null,"roadmap":null,"authors":null,"dei":null,"publiccode":null,"codemeta":null}},"created_at":"2019-08-27T20:29:46.000Z","updated_at":"2025-03-22T17:34:18.000Z","dependencies_parsed_at":"2023-01-24T12:17:01.216Z","dependency_job_id":"4fc4ad6b-d35b-4fa3-b655-62c8e18b0655","html_url":"https://github.com/groda/big_data","commit_stats":null,"previous_names":[],"tags_count":0,"template":false,"template_full_name":null,"repository_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/groda%2Fbig_data","tags_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/groda%2Fbig_data/tags","releases_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/groda%2Fbig_data/releases","manifests_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/groda%2Fbig_data/manifests","owner_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/owners/groda","download_url":"https://codeload.github.com/groda/big_data/tar.gz/refs/heads/master","host":{"name":"GitHub","url":"https://github.com","kind":"github","repositories_count":247543591,"owners_count":20955865,"icon_url":"https://github.com/github.png","version":null,"created_at":"2022-05-30T11:31:42.601Z","updated_at":"2022-07-04T15:15:14.044Z","host_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub","repositories_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories","repository_names_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repository_names","owners_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/owners"}},"keywords":["apache-sedona","apache-spark","big-data","bigdata","bigtop","docker","gutenberg-ebooks","hadoop","hadoop-cluster","hadoop-hdfs","hadoop-mapreduce","jupyter-notebook","mapreduce","mapreduce-bash","mrjob","pyspark","spark","spark-sql","testdfsio"],"created_at":"2024-11-08T13:07:38.988Z","updated_at":"2025-04-06T20:10:20.170Z","avatar_url":"https://github.com/groda.png","language":"Jupyter Notebook","readme":"![big_data](https://socialify.git.ci/groda/big_data/image?description=1\u0026font=Inter\u0026language=1\u0026name=1\u0026owner=1\u0026pattern=Diagonal%20Stripes\u0026stargazers=1\u0026forks=1\u0026theme=Light)\n\n# Big Data for beginners\n\nExplore a variety of tutorials and interactive demonstrations focused on Big Data technologies like Hadoop, Spark, and more, primarily presented in the format of Jupyter notebooks. Most notebooks are self-contained, with instructions for installing all required services. They can be run on Google Colab or in a virtual Ubuntu machine/container.\n\n## Setting Up Hadoop: Single-Node Configuration\n  - **[Hadoop_Setting_up_a_Single_Node_Cluster.ipynb](Hadoop_Setting_up_a_Single_Node_Cluster.ipynb)** \u003ca target=\"_blank\" href=\"https://colab.research.google.com/github/groda/big_data/blob/master/Hadoop_Setting_up_a_Single_Node_Cluster.ipynb\"\u003e\n  \u003cimg src=\"https://colab.research.google.com/assets/colab-badge.svg\" alt=\"Open In Colab\"/\u003e\u003c/a\u003e \u003ca target=\"_blank\" href=\"https://nbviewer.org/github/groda/big_data/blob/master/Hadoop_Setting_up_a_Single_Node_Cluster.ipynb\"\u003e\u003cimg src=\"https://raw.githubusercontent.com/jupyter/design/master/logos/Badges/nbviewer_badge.svg\" alt=\"Render in nbviewer\"\u003e\u003c/a\u003e Set up a single-node Hadoop cluster on Google Colab and run some basic HDFS and MapReduce examples \n  - **[Hadoop_single_node_cluster_setup_Python.ipynb](Hadoop_single_node_cluster_setup_Python.ipynb)** \u003ca target=\"_blank\" href=\"https://colab.research.google.com/github/groda/big_data/blob/master/Hadoop_single_node_cluster_setup_Python.ipynb\"\u003e\u003cimg src=\"https://colab.research.google.com/assets/colab-badge.svg\" alt=\"Open In Colab\"/\u003e\u003c/a\u003e \u003ca target=\"_blank\" href=\"https://nbviewer.org/github/groda/big_data/blob/master/Hadoop_single_node_cluster_setup_Python.ipynb\"\u003e\u003cimg src=\"https://raw.githubusercontent.com/jupyter/design/master/logos/Badges/nbviewer_badge.svg\" alt=\"Render in nbviewer\"\u003e\u003c/a\u003e Set up a single-node Hadoop cluster on Google Colab using Python ![recently updated](https://github.com/groda/big_data/blob/master/updated.gif?raw=true)\n - **[Hadoop_minicluster.ipynb](Hadoop_minicluster.ipynb)** \u003ca target=\"_blank\" href=\"https://colab.research.google.com/github/groda/big_data/blob/master/Hadoop_minicluster.ipynb\"\u003e\u003cimg src=\"https://colab.research.google.com/assets/colab-badge.svg\" alt=\"Open In Colab\"/\u003e\u003c/a\u003e \u003ca target=\"_blank\" href=\"https://nbviewer.org/github/groda/big_data/blob/master/Hadoop_minicluster.ipynb\"\u003e\u003cimg src=\"https://raw.githubusercontent.com/jupyter/design/master/logos/Badges/nbviewer_badge.svg\" alt=\"Render in nbviewer\"\u003e\u003c/a\u003e Deploy a test Hadoop Cluster with a single command and no need for configuration. ![recently updated](https://github.com/groda/big_data/blob/master/updated.gif?raw=true)\n   \n## Running Apache Spark in Standalone Mode\n  - **[Hadoop_Setting_up_Spark_Standalone_on_Google_Colab.ipynb](Hadoop_Setting_up_Spark_Standalone_on_Google_Colab.ipynb)** \u003ca target=\"_blank\" href=\"https://colab.research.google.com/github/groda/big_data/blob/master/Hadoop_Setting_up_Spark_Standalone_on_Google_Colab.ipynb\"\u003e\u003cimg src=\"https://colab.research.google.com/assets/colab-badge.svg\" alt=\"Open In Colab\"/\u003e\u003c/a\u003e \u003ca target=\"_blank\" href=\"https://nbviewer.org/github/groda/big_data/blob/master/Hadoop_Setting_up_Spark_Standalone_on_Google_Colab.ipynb\"\u003e\u003cimg src=\"https://raw.githubusercontent.com/jupyter/design/master/logos/Badges/nbviewer_badge.svg\" alt=\"Render in nbviewer\"\u003e\u003c/a\u003e Set up a single-node Spark server on Google Colab and estimate „π“ with a Montecarlo method\n  - **[Setting_up_Spark_Standalone_on_Google_Colab_BigtopEdition.ipynb](Setting_up_Spark_Standalone_on_Google_Colab_BigtopEdition.ipynb)** \u003ca target=\"_blank\" href=\"https://colab.research.google.com/github/groda/big_data/blob/master/Setting_up_Spark_Standalone_on_Google_Colab_BigtopEdition.ipynb\"\u003e\u003cimg src=\"https://colab.research.google.com/assets/colab-badge.svg\" alt=\"Open In Colab\"/\u003e\u003c/a\u003e \u003ca target=\"_blank\" href=\"https://nbviewer.org/github/groda/big_data/blob/master/Setting_up_Spark_Standalone_on_Google_Colab_BigtopEdition.ipynb\"\u003e\u003cimg src=\"https://raw.githubusercontent.com/jupyter/design/master/logos/Badges/nbviewer_badge.svg\" alt=\"Render in nbviewer\"\u003e\u003c/a\u003e Set up a single-node Spark server on Google Colab using the Bigtop distribution and utilities, estimate „π“ with a Montecarlo method and run another Java ML example. ![recently updated](https://github.com/groda/big_data/blob/master/updated.gif?raw=true)\n  - **[Run_Spark_on_Google_Colab.ipynb](Run_Spark_on_Google_Colab.ipynb)** \u003ca target=\"_blank\" href=\"https://colab.research.google.com/github/groda/big_data/blob/master/Run_Spark_on_Google_Colab.ipynb\"\u003e\u003cimg src=\"https://colab.research.google.com/assets/colab-badge.svg\" alt=\"Open In Colab\"/\u003e\u003c/a\u003e \u003ca target=\"_blank\" href=\"https://nbviewer.org/github/groda/big_data/blob/master/Run_Spark_on_Google_Colab.ipynb\"\u003e\u003cimg src=\"https://raw.githubusercontent.com/jupyter/design/master/logos/Badges/nbviewer_badge.svg\" alt=\"Render in nbviewer\"\u003e\u003c/a\u003e Set up a single-node standalone Spark server on Google Colab including Web UI and History Server - compact version ![recently updated](https://github.com/groda/big_data/blob/master/updated.gif?raw=true)\n  - **[Spark_Standalone_Architecture_on_Google_Colab.ipynb](Spark_Standalone_Architecture_on_Google_Colab.ipynb)** \u003ca target=\"_blank\" href=\"https://colab.research.google.com/github/groda/big_data/blob/master/Spark_Standalone_Architecture_on_Google_Colab.ipynb\"\u003e\u003cimg src=\"https://colab.research.google.com/assets/colab-badge.svg\" alt=\"Open In Colab\"/\u003e\u003c/a\u003e \u003ca target=\"_blank\" href=\"https://nbviewer.org/github/groda/big_data/blob/master/Spark_Standalone_Architecture_on_Google_Colab.ipynb\"\u003e\u003cimg src=\"https://raw.githubusercontent.com/jupyter/design/master/logos/Badges/nbviewer_badge.svg\" alt=\"Render in nbviewer\"\u003e\u003c/a\u003e Explore the Spark architecture through the immersive experience of deploying a standalone setup. ![recently updated](https://github.com/groda/big_data/blob/master/updated.gif?raw=true)\n\n## MapReduce Tutorials\n- **[MapReduce_Primer_HelloWorld.ipynb](MapReduce_Primer_HelloWorld.ipynb)** \u003ca target=\"_blank\" href=\"https://colab.research.google.com/github/groda/big_data/blob/master/MapReduce_Primer_HelloWorld.ipynb\"\u003e\u003cimg src=\"https://colab.research.google.com/assets/colab-badge.svg\" alt=\"Open In Colab\"/\u003e\u003c/a\u003e \u003ca target=\"_blank\" href=\"https://nbviewer.org/github/groda/big_data/blob/master/MapReduce_Primer_HelloWorld.ipynb\"\u003e\u003cimg src=\"https://raw.githubusercontent.com/jupyter/design/master/logos/Badges/nbviewer_badge.svg\" alt=\"Render in nbviewer\"\u003e\u003c/a\u003e A MapReduce Primer with “Hello, World!” ![recently updated](https://github.com/groda/big_data/blob/master/updated.gif?raw=true)\n- **[MapReduce_Primer_HelloWorld_bash.ipynb](MapReduce_Primer_HelloWorld_bash.ipynb)** \u003ca target=\"_blank\" href=\"https://colab.research.google.com/github/groda/big_data/blob/master/MapReduce_Primer_HelloWorld.ipynb\"\u003e\u003cimg src=\"https://colab.research.google.com/assets/colab-badge.svg\" alt=\"Open In Colab\"/\u003e\u003c/a\u003e \u003ca target=\"_blank\" href=\"https://nbviewer.org/github/groda/big_data/blob/master/MapReduce_Primer_HelloWorld_bash.ipynb\"\u003e\u003cimg src=\"https://raw.githubusercontent.com/jupyter/design/master/logos/Badges/nbviewer_badge.svg\" alt=\"Render in nbviewer\"\u003e\u003c/a\u003e A MapReduce Primer with “Hello, World! in Bash with just a few lines of code” ![recently updated](https://github.com/groda/big_data/blob/master/updated.gif?raw=true)\n- **[mapreduce_with_bash.ipynb](mapreduce_with_bash.ipynb)** An introduction to MapReduce using MapReduce Streaming and bash to create mapper and reducer\n- **[simplest_mapreduce_bash_wordcount.ipynb](simplest_mapreduce_bash_wordcount.ipynb)** A very basic MapReduce wordcount example\n- **[mrjob_wordcount.ipynb](mrjob_wordcount.ipynb)** A simple MapReduce job with mrjob\n- **[Hadoop_spilling.ipynb](Hadoop_spilling.ipynb)** Hadoop spilling explained\n\n## PySpark Tutorials\n- **[PySpark_On_Google_Colab.ipynb](PySpark_On_Google_Colab.ipynb)** \u003ca target=\"_blank\" href=\"https://colab.research.google.com/github/groda/big_data/blob/master/PySparkOnColab.ipynb\"\u003e\u003cimg src=\"https://colab.research.google.com/assets/colab-badge.svg\" alt=\"Open In Colab\"/\u003e\u003c/a\u003e \u003ca target=\"_blank\" href=\"https://nbviewer.org/github/groda/big_data/blob/master/PySpark_On_Google_Colab.ipynb\"\u003e\u003cimg src=\"https://raw.githubusercontent.com/jupyter/design/master/logos/Badges/nbviewer_badge.svg\" alt=\"Render in nbviewer\"\u003e\u003c/a\u003e Explore the inner workings of PySpark on Google Colab ![recently updated](https://github.com/groda/big_data/blob/master/updated.gif?raw=true)\n- **[PySpark_miscellanea.ipynb](PySpark_miscellanea.ipynb)** \u003ca target=\"_blank\" href=\"https://colab.research.google.com/github/groda/big_data/blob/master/PySpark_miscellanea.ipynb\"\u003e\u003cimg src=\"https://colab.research.google.com/assets/colab-badge.svg\" alt=\"Open In Colab\"/\u003e\u003c/a\u003e \u003ca target=\"_blank\" href=\"https://nbviewer.org/github/groda/big_data/blob/master/PySpark_miscellanea.ipynb\"\u003e\u003cimg src=\"https://raw.githubusercontent.com/jupyter/design/master/logos/Badges/nbviewer_badge.svg\" alt=\"Render in nbviewer\"\u003e\u003c/a\u003e Tips, tricks, and insights related to PySpark. ![recently updated](https://github.com/groda/big_data/blob/master/updated.gif?raw=true)\n- **[demoSparkSQLPython.ipynb](demoSparkSQLPython.ipynb)** Pyspark basic demo \n- **[ngrams_with_pyspark.ipynb](ngrams_with_pyspark.ipynb)** \u003ca target=\"_blank\" href=\"https://colab.research.google.com/github/groda/big_data/blob/master/ngrams_with_pyspark.ipynb\"\u003e\u003cimg src=\"https://colab.research.google.com/assets/colab-badge.svg\" alt=\"Open In Colab\"/\u003e\u003c/a\u003e \u003ca target=\"_blank\" href=\"https://nbviewer.org/github/groda/big_data/blob/master/ngrams_with_pyspark.ipynb\"\u003e\u003cimg src=\"https://raw.githubusercontent.com/jupyter/design/master/logos/Badges/nbviewer_badge.svg\" alt=\"Render in nbviewer\"\u003e\u003c/a\u003e Basic example of n-grams extraction with PySpark ![recently updated](https://github.com/groda/big_data/blob/master/updated.gif?raw=true)\n- **[generate_data_with_Faker.ipynb](generate_data_with_Faker.ipynb)** \u003ca target=\"_blank\" href=\"https://colab.research.google.com/github/groda/big_data/blob/master/generate_data_with_Faker.ipynb\"\u003e\u003cimg src=\"https://colab.research.google.com/assets/colab-badge.svg\" alt=\"Open In Colab\"/\u003e\u003c/a\u003e \u003ca target=\"_blank\" href=\"https://nbviewer.org/github/groda/big_data/blob/master/generate_data_with_Faker.ipynb\"\u003e\u003cimg src=\"https://raw.githubusercontent.com/jupyter/design/master/logos/Badges/nbviewer_badge.svg\" alt=\"Render in nbviewer\"\u003e\u003c/a\u003e Data Generation and Aggregation with Python's Faker Library and PySpark ![recently updated](https://github.com/groda/big_data/blob/master/updated.gif?raw=true)\n- **[Encoding+dataframe+columns.ipynb](Encoding+dataframe+columns.ipynb)**  \u003ca target=\"_blank\" href=\"https://colab.research.google.com/github/groda/big_data/blob/master/Encoding+dataframe+columns.ipynb\"\u003e\u003cimg src=\"https://colab.research.google.com/assets/colab-badge.svg\" alt=\"Open In Colab\"/\u003e\u003c/a\u003e \u003ca target=\"_blank\" href=\"https://nbviewer.org/github/groda/big_data/blob/master/Encoding+dataframe+columns.ipynb\"\u003e\u003cimg src=\"https://raw.githubusercontent.com/jupyter/design/master/logos/Badges/nbviewer_badge.svg\" alt=\"Render in nbviewer\"\u003e\u003c/a\u003e DataFrame Column Encoding with PySpark and Parquet Format ![recently updated](https://github.com/groda/big_data/blob/master/updated.gif?raw=true)\n- **[Apache_Sedona_with_PySpark.ipynb](Apache_Sedona_with_PySpark.ipynb)**  \u003ca target=\"_blank\" href=\"https://colab.research.google.com/github/groda/big_data/blob/master/Apache_Sedona_with_PySpark.ipynb\"\u003e\u003cimg src=\"https://colab.research.google.com/assets/colab-badge.svg\" alt=\"Open In Colab\"/\u003e\u003c/a\u003e \u003ca target=\"_blank\" href=\"https://nbviewer.org/github/groda/big_data/blob/master/Apache_Sedona_with_PySpark.ipynb\"\u003e\u003cimg src=\"https://raw.githubusercontent.com/jupyter/design/master/logos/Badges/nbviewer_badge.svg\" alt=\"Render in nbviewer\"\u003e\u003c/a\u003e Apache Sedona™ is a high-performance cluster computing system for processing large-scale spatial data, extending the capabilities of Apache Spark for advanced geospatial analytics. Run a basic example with PySpark on Google Colab ![recently updated](https://github.com/groda/big_data/blob/master/updated.gif?raw=true)\n  \n## Miscellaneous Tutorials\n- **[GutenbergBooks.ipynb](GutenbergBooks.ipynb)** \u003ca target=\"_blank\" href=\"https://colab.research.google.com/github/groda/big_data/blob/master/GutenbergBooks.ipynb\"\u003e\u003cimg src=\"https://colab.research.google.com/assets/colab-badge.svg\" alt=\"Open In Colab\"/\u003e\u003c/a\u003e \u003ca target=\"_blank\" href=\"https://nbviewer.org/github/groda/big_data/blob/master/GutenbergBooks.ipynb\"\u003e\u003cimg src=\"https://raw.githubusercontent.com/jupyter/design/master/logos/Badges/nbviewer_badge.svg\" alt=\"Render in nbviewer\"\u003e\u003c/a\u003e Explore and download books from the Gutenberg books collection.  ![recently updated](https://github.com/groda/big_data/blob/master/updated.gif?raw=true) \n- **[TestDFSio.ipynb](TestDFSio.ipynb)** Demo of TestDFSio for benchmarking Hadoop clusters\n- **[Unicode.ipynb](Unicode.ipynb)** \u003ca target=\"_blank\" href=\"https://colab.research.google.com/github/groda/big_data/blob/master/Unicode.ipynb\"\u003e\u003cimg src=\"https://colab.research.google.com/assets/colab-badge.svg\" alt=\"Open In Colab\"/\u003e\u003c/a\u003e \u003ca target=\"_blank\" href=\"https://nbviewer.org/github/groda/big_data/blob/master/Unicode.ipynb\"\u003e\u003cimg src=\"https://raw.githubusercontent.com/jupyter/design/master/logos/Badges/nbviewer_badge.svg\" alt=\"Render in nbviewer\"\u003e\u003c/a\u003e [![live on Binder](https://mybinder.org/badge_logo.svg)](https://mybinder.org/v2/gh/groda/big_data/master?filepath=Unicode.ipynb) Exploring Unicode categories ![recently updated](https://github.com/groda/big_data/blob/master/updated.gif?raw=true) \n- **[polynomial_regression.ipynb](polynomial_regression.ipynb)**  \u003ca target=\"_blank\" href=\"https://colab.research.google.com/github/groda/big_data/blob/master/polynomial_regression.ipynb\"\u003e\u003cimg src=\"https://colab.research.google.com/assets/colab-badge.svg\" alt=\"Open In Colab\"/\u003e\u003c/a\u003e \u003ca target=\"_blank\" href=\"https://nbviewer.org/github/groda/big_data/blob/master/polynomial_regression.ipynb\"\u003e\u003cimg src=\"https://raw.githubusercontent.com/jupyter/design/master/logos/Badges/nbviewer_badge.svg\" alt=\"Render in nbviewer\"\u003e\u003c/a\u003e Worked out example of polynomial regression with numpy and matplotlib ![recently updated](https://github.com/groda/big_data/blob/master/updated.gif?raw=true)\n- **[downloadSpark.ipynb](downloadSpark.ipynb)** \u003ca target=\"_blank\" href=\"https://colab.research.google.com/github/groda/big_data/blob/master/downloadSpark.ipynb\"\u003e\u003cimg src=\"https://colab.research.google.com/assets/colab-badge.svg\" alt=\"Open In Colab\"/\u003e\u003c/a\u003e \u003ca target=\"_blank\" href=\"https://nbviewer.org/github/groda/big_data/blob/master/downloadSpark.ipynb\"\u003e\u003cimg src=\"https://raw.githubusercontent.com/jupyter/design/master/logos/Badges/nbviewer_badge.svg\" alt=\"Render in nbviewer\"\u003e\u003c/a\u003e How to download and verify the Spark distribution ![recently updated](https://github.com/groda/big_data/blob/master/new3.gif?raw=true)\n\n\n## Virtualization and Cloud Automation \n  - **[docker_for_beginners.md](docker_for_beginners.md)** Docker for beginners: an introduction to the world of containers\n  - **[Terraform for beginners.md](terraform_for_beginners.md)** Getting started with Terraform\n  - **[Terraform in 5 minutes](Terraform%20in%205%20minutes.md)** A short introduction to Terraform, the powerful and popular tool for infrastructure provisioning and management ![recently updated](https://github.com/groda/big_data/blob/master/updated.gif?raw=true)\n\n## Big Data Learning Pathways\n- **[online_resources.md](online_resources.md)** Online resources for learning Big Data\n\n# About this repository\n\n## Notebooks Testing and CI\n\nMost executable Jupyter notebooks are tested on an Ubuntu virtual machine through a GitHub automated workflow. The log file for successful executions is named: [action_log.txt](https://github.com/groda/big_data/blob/master/action_log.txt) (see also: [Google Colab vs. GitHub Ubuntu Runner](Google_Colab_vs_GitHub_ubuntu_runner.ipynb) \u003ca target=\"_blank\" href=\"https://colab.research.google.com/github/groda/big_data/blob/master/Google_Colab_vs_GitHub_ubuntu_runner.ipynb\"\u003e\u003cimg src=\"https://colab.research.google.com/assets/colab-badge.svg\" alt=\"Open In Colab\"/\u003e\u003c/a\u003e \u003ca target=\"_blank\" href=\"https://nbviewer.org/github/groda/big_data/blob/master/Google_Colab_vs_GitHub_ubuntu_runner.ipynb\"\u003e\u003cimg src=\"https://raw.githubusercontent.com/jupyter/design/master/logos/Badges/nbviewer_badge.svg\" alt=\"Render in nbviewer\"\u003e\u003c/a\u003e).\n\nCurrent status: \n - [![Run Notebooks on Ubuntu](https://github.com/groda/big_data/actions/workflows/run-notebooks.yml/badge.svg)](https://github.com/groda/big_data/actions/workflows/run-notebooks.yml)\n - [![Run One Notebook on Ubuntu](https://github.com/groda/big_data/actions/workflows/run-one-notebook.yml/badge.svg)](https://github.com/groda/big_data/actions/workflows/run-one-notebook.yml)\n\nThe Github workflow is a starting point for what is known as _Continuous Integration_ (CI) in DevOps/Platform Engineering circles.\n\n\n","funding_links":[],"categories":[],"sub_categories":[],"project_url":"https://awesome.ecosyste.ms/api/v1/projects/github.com%2Fgroda%2Fbig_data","html_url":"https://awesome.ecosyste.ms/projects/github.com%2Fgroda%2Fbig_data","lists_url":"https://awesome.ecosyste.ms/api/v1/projects/github.com%2Fgroda%2Fbig_data/lists"}