{"id":20113481,"url":"https://github.com/cloudacademy/ca-pandas-webinars","last_synced_at":"2025-05-06T12:30:33.854Z","repository":{"id":77791426,"uuid":"332278585","full_name":"cloudacademy/ca-pandas-webinars","owner":"cloudacademy","description":"This is the companion repository of the Cloud Academy Webinar Series on Pandas.","archived":false,"fork":false,"pushed_at":"2021-05-06T17:08:34.000Z","size":276,"stargazers_count":23,"open_issues_count":0,"forks_count":23,"subscribers_count":3,"default_branch":"master","last_synced_at":"2025-04-09T12:21:14.740Z","etag":null,"topics":[],"latest_commit_sha":null,"homepage":null,"language":"Jupyter Notebook","has_issues":true,"has_wiki":null,"has_pages":null,"mirror_url":null,"source_name":null,"license":null,"status":null,"scm":"git","pull_requests_enabled":true,"icon_url":"https://github.com/cloudacademy.png","metadata":{"files":{"readme":"README.md","changelog":null,"contributing":null,"funding":null,"license":null,"code_of_conduct":null,"threat_model":null,"audit":null,"citation":null,"codeowners":null,"security":null,"support":null,"governance":null,"roadmap":null,"authors":null,"dei":null,"publiccode":null,"codemeta":null}},"created_at":"2021-01-23T18:19:52.000Z","updated_at":"2023-11-20T12:10:20.000Z","dependencies_parsed_at":"2023-04-30T04:36:20.060Z","dependency_job_id":null,"html_url":"https://github.com/cloudacademy/ca-pandas-webinars","commit_stats":null,"previous_names":[],"tags_count":0,"template":false,"template_full_name":null,"repository_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/cloudacademy%2Fca-pandas-webinars","tags_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/cloudacademy%2Fca-pandas-webinars/tags","releases_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/cloudacademy%2Fca-pandas-webinars/releases","manifests_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/cloudacademy%2Fca-pandas-webinars/manifests","owner_url":"https://repos.e
cosyste.ms/api/v1/hosts/GitHub/owners/cloudacademy","download_url":"https://codeload.github.com/cloudacademy/ca-pandas-webinars/tar.gz/refs/heads/master","host":{"name":"GitHub","url":"https://github.com","kind":"github","repositories_count":252683372,"owners_count":21788026,"icon_url":"https://github.com/github.png","version":null,"created_at":"2022-05-30T11:31:42.601Z","updated_at":"2022-07-04T15:15:14.044Z","host_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub","repositories_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories","repository_names_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repository_names","owners_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/owners"}},"keywords":[],"created_at":"2024-11-13T18:24:33.873Z","updated_at":"2025-05-06T12:30:33.843Z","avatar_url":"https://github.com/cloudacademy.png","language":"Jupyter Notebook","funding_links":[],"categories":[],"sub_categories":[],"readme":"# Cloud Academy Pandas Webinar Series\n\n## Speaker info\nThis webinar series is presented by Andrea Giussani, Data Scientist at Cloud Academy.\nYou can reach him either by [:email:](andrea.giussani@cloudacademy.com) or on [LinkedIn](https://it.linkedin.com/in/andrea-giussani-764816148?trk=public_profile_samename_mini-profile_title), and you can follow him on his [:rocket: blog](https://andreagiussani.github.io/the-long-beard-blog/).\n\n## Repository Structure\nYou will find:\n - a `data` folder containing the data used in this series;\n - a folder called `part-01` containing the material related to the first episode entitled _How to filter your data source_. The recording of the webinar is available [here](https://cloudacademy.com/webinars/how-to-filter-your-data-source/);\n - a folder called `part-02` containing the material related to the second episode entitled _How to manage and manipulate your data_. 
The recording of the webinar is available [here](https://cloudacademy.com/webinars/how-to-manage-and-manipulate-your-data/);\n - a folder called `part-03` containing the material related to the third episode entitled _How to transform your data_. The recording of the webinar is available [here](https://cloudacademy.com/webinars/how-to-transform-your-data/);\n - a folder called `part-04` containing the material related to the fourth episode entitled _How to deal with time series data_. The recording of the webinar is available [here](https://cloudacademy.com/webinars/how-to-deal-with-time-series-data/);\n - a folder called `part-05` containing the material related to the fifth episode entitled _How to make your code base more efficient_.\n\n## Setting Up the Google Colab Environment\n\nGoogle Colab is a product from Google Research that allows\n\u003e anybody to write and execute arbitrary python code through the browser, and is especially well suited to machine learning, data analysis and education. More technically, Colab is a hosted Jupyter notebook service that requires no setup to use, while providing free access to computing resources including GPUs.\n\nFor more information, please visit the following [link](https://research.google.com/colaboratory/faq.html).\n\nHere, we provide a short tutorial on how to upload the data to that environment via Google Drive, and then use Google Colab to run your analysis.\nPlease note that we assume you have a Google account, which is required to access this product.\n\n### 1. Open a Google Colab session\n\nFrom your favourite browser, open a new Colab notebook via the following [link](https://colab.research.google.com).\n\n### 2. Mount your Drive on Colab\n\nWe assume you have placed the data into the `My Drive` folder. 
If so, connecting your Drive to the Colab machine is straightforward with the `google.colab` Python library: just run the following code snippet:\n\n```python\nfrom google.colab import drive\ndrive.mount('/content/drive')\n```\nAfter an authorisation check, you will be able to interact with your Drive content either from the file browser side panel (easier) or using command-line utilities.\n\n### 3. Clone the GitHub Repository on your Drive\nI suggest creating a folder inside your Drive. For example, call it `ca.webinars`. Then, in any Colab notebook cell, run the following command to move into that folder:\n```python\n%cd '/content/drive/My Drive/ca.webinars'\n```\nand then clone the following repository:\n```python\n!git clone https://github.com/cloudacademy/ca-pandas-webinars.git\n```\n\n### 4. Open the template .ipynb file from your Drive\nNow navigate to the Google Drive folder where the repo has been cloned. Once there, you just need to open, say, the file `'[PANDAS] Part 1 - How to Filter your Data Source.ipynb'` with Google Colab. And that's it! :smile:\n\n### 5. Let's get our hands dirty\nJust run the following snippet to load the raw data into a `pandas` DataFrame:\n```python\nimport pandas as pd\ndf = pd.read_csv('/content/drive/My Drive/\u003cPATH_TO_FILE\u003e/\u003cFILENAME\u003e.csv')\n```\nAre you ready? Let us get started!\n","project_url":"https://awesome.ecosyste.ms/api/v1/projects/github.com%2Fcloudacademy%2Fca-pandas-webinars","html_url":"https://awesome.ecosyste.ms/projects/github.com%2Fcloudacademy%2Fca-pandas-webinars","lists_url":"https://awesome.ecosyste.ms/api/v1/projects/github.com%2Fcloudacademy%2Fca-pandas-webinars/lists"}