{"id":20113474,"url":"https://github.com/cloudacademy/ca-webinars-mastering-machine-learning","last_synced_at":"2026-02-28T03:37:43.151Z","repository":{"id":77791420,"uuid":"408502968","full_name":"cloudacademy/ca-webinars-mastering-machine-learning","owner":"cloudacademy","description":"Companion for the Cloud Academy Webinar Series \"Mastering Machine Learning\"","archived":false,"fork":false,"pushed_at":"2022-01-02T09:18:18.000Z","size":2152,"stargazers_count":11,"open_issues_count":0,"forks_count":6,"subscribers_count":2,"default_branch":"main","last_synced_at":"2025-05-06T12:37:52.193Z","etag":null,"topics":[],"latest_commit_sha":null,"homepage":null,"language":"Jupyter Notebook","has_issues":true,"has_wiki":null,"has_pages":null,"mirror_url":null,"source_name":null,"license":null,"status":null,"scm":"git","pull_requests_enabled":true,"icon_url":"https://github.com/cloudacademy.png","metadata":{"files":{"readme":"README.md","changelog":null,"contributing":null,"funding":null,"license":null,"code_of_conduct":null,"threat_model":null,"audit":null,"citation":null,"codeowners":null,"security":null,"support":null,"governance":null,"roadmap":null,"authors":null,"dei":null,"publiccode":null,"codemeta":null}},"created_at":"2021-09-20T15:42:39.000Z","updated_at":"2022-01-02T09:18:20.000Z","dependencies_parsed_at":"2023-03-12T02:04:49.349Z","dependency_job_id":null,"html_url":"https://github.com/cloudacademy/ca-webinars-mastering-machine-learning","commit_stats":null,"previous_names":[],"tags_count":0,"template":false,"template_full_name":null,"purl":"pkg:github/cloudacademy/ca-webinars-mastering-machine-learning","repository_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/cloudacademy%2Fca-webinars-mastering-machine-learning","tags_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/cloudacademy%2Fca-webinars-mastering-machine-learning/tags","releases_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/cloudacademy%2Fca-webinars-mastering-machine-learning/releases","manifests_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/cloudacademy%2Fca-webinars-mastering-machine-learning/manifests","owner_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/owners/cloudacademy","download_url":"https://codeload.github.com/cloudacademy/ca-webinars-mastering-machine-learning/tar.gz/refs/heads/main","sbom_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/cloudacademy%2Fca-webinars-mastering-machine-learning/sbom","scorecard":null,"host":{"name":"GitHub","url":"https://github.com","kind":"github","repositories_count":286080680,"owners_count":29924177,"icon_url":"https://github.com/github.png","version":null,"created_at":"2022-05-30T11:31:42.601Z","updated_at":"2026-02-27T19:37:42.220Z","status":"online","status_checked_at":"2026-02-28T02:00:07.010Z","response_time":90,"last_error":null,"robots_txt_status":"success","robots_txt_updated_at":"2025-07-24T06:49:26.215Z","robots_txt_url":"https://github.com/robots.txt","online":true,"can_crawl_api":true,"host_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub","repositories_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories","repository_names_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repository_names","owners_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/owners"}},"keywords":[],"created_at":"2024-11-13T18:24:32.237Z","updated_at":"2026-02-28T03:37:43.104Z","avatar_url":"https://github.com/cloudacademy.png","language":"Jupyter Notebook","funding_links":[],"categories":[],"sub_categories":[],"readme":"# Cloud Academy Machine Learning Webinar Series\n\n## Speaker info\nThis webinar is held by Andrea Giussani, Data Scientist at Cloud Academy.\nYou can reach him out wither at [:email:](andrea.giussani@cloudacademy.com) or on [Linkedin](https://it.linkedin.com/in/andrea-giussani-764816148?trk=public_profile_samename_mini-profile_title), and you can follow him on his [:rocket: blog](https://andreagiussani.github.io/the-long-beard-blog/).\n\n## Repository Structure\nYou will find:\n - a `data` folder containing the data used in this series;\n - a folder called `episode_01` containing the material related to the first episode entitled _Imputing Missing Values with scikit-learn_. The recorded session can be found [here](https://www.youtube.com/watch?v=FJLwxmdGOuU).\n  - a folder called `episode_02` containing the material related to the second episode entitled _Build and run a Machine Learning Pipeline_. The recorded session can be found [here](https://www.youtube.com/watch?v=qZ0dJhsxYB4).\n  - a folder called `episode_03` containing the material related to the second episode entitled _Be a data Science Master and Commander with XGBoost_. The recorded session can be found [here](https://www.youtube.com/watch?v=kAys5FhjUCA).\n  - a folder called `episode_04` containing the material related to the second episode entitled _Build a Sentiment Analysis Pipeline_. The recorded session can be found [here](https://www.youtube.com/watch?v=t9iDfG0dsE4).\n  - Episode 5 can be watched [here](https://www.youtube.com/watch?v=eMvY3MRmBNg).\n\n## Setting Up the Google Colab Environment\n\nThe Google Colab is a product from Google Research which allows\n\u003e anybody to write and execute arbitrary python code through the browser, and is especially well suited to machine learning, data analysis and education. More technically, Colab is a hosted Jupyter notebook service that requires no setup to use, while providing free access to computing resources including GPUs.\n\nFor more information, please visit the following [link](https://research.google.com/colaboratory/faq.html).\n\nHere, we provide a short tutorial on how to upload the data on that environemnt via Google drive, and then use the Google colab to run your analysis.\nPlease, note that we assume you have a google account to access to this Google product.\n\n### 1. Open a Google Colab session\n\nFrom your favourite browser, open a new colab notebook via the following [link](https://colab.research.google.com)\n\n### 2. Mount your drive on colab\n\nWe assume you have placed the data into the `My Drive` folder. If so, connecting your drive to a google machine is pretty easy, using the python google library: just run the following code snippet\n\n```python\nfrom google.colab import drive\ndrive.mount('/content/drive')\n```\nAfter an authorisation check, you will be able to interact with your drive content either from the file browser side panel (easier) or using command-line utilities.\n\n### 3. Clone the GitHub Repository on your Drive\nI suggest to create a folder inside your drive. For example, call it `ca.webinars`. Then, in any colab notebook cell, type the following commands\n```python\n%cd '/content/drive/My Drive/ca.webinars'\n```\nand then clone the following repository:\n```python\n!git clone https://github.com/cloudacademy/ca-webinars-mastering-machine-learning.git\n```\n\n### 4. Open the template .ipynb file from your drive\nNow, you have to navigate inside the Google Drive folder where the repo has been cloned. Once there, you just need to open, say, the file `'Episode 1 - Imputing Missing Values with scikit-learn.ipynb'` with Google Colab. And that's it! :smile:\n\n### 5. Let's get our hands dirty\nJust run the following snippet to put the raw data into a `pandas` dataframe:\n```python\nimport pandas as pd\ndf = pd.read_csv('/content/drive/My Drive/\u003cPATH_TO_FILE\u003e/\u003cFILENAME\u003e.csv')\n```\nAre you ready? Let us get started!\n","project_url":"https://awesome.ecosyste.ms/api/v1/projects/github.com%2Fcloudacademy%2Fca-webinars-mastering-machine-learning","html_url":"https://awesome.ecosyste.ms/projects/github.com%2Fcloudacademy%2Fca-webinars-mastering-machine-learning","lists_url":"https://awesome.ecosyste.ms/api/v1/projects/github.com%2Fcloudacademy%2Fca-webinars-mastering-machine-learning/lists"}