{"id":17465897,"url":"https://github.com/paladique/codespaces-etl-basic-demo","last_synced_at":"2026-05-01T21:03:42.752Z","repository":{"id":198300882,"uuid":"657439260","full_name":"paladique/codespaces-etl-basic-demo","owner":"paladique","description":"ETL with Jupyter Notebooks, Pandas, and Azure Cosmos DB","archived":false,"fork":false,"pushed_at":"2023-10-05T15:04:49.000Z","size":385,"stargazers_count":3,"open_issues_count":0,"forks_count":2,"subscribers_count":1,"default_branch":"main","last_synced_at":"2025-07-01T09:48:33.636Z","etag":null,"topics":["azure","azure-cosmos-db","codespaces","data-engineering","etl","pandas"],"latest_commit_sha":null,"homepage":"","language":"Jupyter Notebook","has_issues":true,"has_wiki":null,"has_pages":null,"mirror_url":null,"source_name":null,"license":"mit","status":null,"scm":"git","pull_requests_enabled":true,"icon_url":"https://github.com/paladique.png","metadata":{"files":{"readme":"README.md","changelog":null,"contributing":null,"funding":null,"license":"LICENSE","code_of_conduct":null,"threat_model":null,"audit":null,"citation":null,"codeowners":null,"security":null,"support":null,"governance":null,"roadmap":null,"authors":null,"dei":null,"publiccode":null,"codemeta":null}},"created_at":"2023-06-23T04:17:43.000Z","updated_at":"2025-05-05T14:54:56.000Z","dependencies_parsed_at":null,"dependency_job_id":"c2cb324b-c6e4-465a-bdb8-ea2e7982086a","html_url":"https://github.com/paladique/codespaces-etl-basic-demo","commit_stats":{"total_commits":5,"total_committers":1,"mean_commits":5.0,"dds":0.0,"last_synced_commit":"270b034422bfd525e9b94c028ecfa1052676375f"},"previous_names":["paladique/codespaces-etl-basic-demo"],"tags_count":0,"template":false,"template_full_name":null,"purl":"pkg:github/paladique/codespaces-etl-basic-demo","repository_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/paladique%2Fcodespaces-etl-basic-demo","tags_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/paladique%2Fcod
espaces-etl-basic-demo/tags","releases_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/paladique%2Fcodespaces-etl-basic-demo/releases","manifests_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/paladique%2Fcodespaces-etl-basic-demo/manifests","owner_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/owners/paladique","download_url":"https://codeload.github.com/paladique/codespaces-etl-basic-demo/tar.gz/refs/heads/main","sbom_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/paladique%2Fcodespaces-etl-basic-demo/sbom","scorecard":null,"host":{"name":"GitHub","url":"https://github.com","kind":"github","repositories_count":286080680,"owners_count":32512670,"icon_url":"https://github.com/github.png","version":null,"created_at":"2022-05-30T11:31:42.601Z","updated_at":"2026-04-30T13:12:12.517Z","status":"online","status_checked_at":"2026-05-01T02:00:05.856Z","response_time":64,"last_error":null,"robots_txt_status":"success","robots_txt_updated_at":"2025-07-24T06:49:26.215Z","robots_txt_url":"https://github.com/robots.txt","online":true,"can_crawl_api":true,"host_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub","repositories_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories","repository_names_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repository_names","owners_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/owners"}},"keywords":["azure","azure-cosmos-db","codespaces","data-engineering","etl","pandas"],"created_at":"2024-10-18T13:05:40.902Z","updated_at":"2026-05-01T21:03:42.714Z","avatar_url":"https://github.com/paladique.png","language":"Jupyter Notebook","funding_links":[],"categories":[],"sub_categories":[],"readme":"# ETL in GitHub Codespaces\n\n[![Open in GitHub Codespaces](https://github.com/codespaces/badge.svg)](https://codespaces.new/paladique/codespaces-etl-basic-demo)\n\n[Sign up for Azure](https://azure.microsoft.com/free/?WT.mc_id=academic-99884-jasmineg)\n\n🎓 Students get $100 
of credits with Azure for Students! No credit card required: [Sign up](https://aka.ms/azure4students)\n\n## Extract, Transform, and Load Operations with Python \u0026 Pandas library with Jupyter Notebooks and Azure Cosmos DB\n\nThis sample loads a CSV file as a Pandas DataFrame, filters the records to airports located in the United States, then converts the filtered data into JSON. A sample of the pared-down data is then loaded into Azure Cosmos DB.\n\n### Instructions \nConvert a filtered CSV file into JSON, then insert it into Azure Cosmos DB in minutes with GitHub Codespaces.\n\n1. [Create a Cosmos DB NoSQL account; you can stop after creating the resource](https://learn.microsoft.com/azure/cosmos-db/nosql/quickstart-portal?WT.mc_id=academic-99884-jasmineg)\n\n2. After the account is created in the Azure Portal, navigate to the resource (you can find it in your notifications).\n\n3. To the right of the resource overview, select `Keys`, then locate the `URI` and `PRIMARY KEY` secrets.\n\n    ![Azure cosmos db secrets](img/azcosmosdb_secrets.png)\n \n4. Set the copied values as secrets in your [Codespaces settings here](https://github.com/settings/codespaces). \n    **`URI` should be the `COSMOS_ENDPOINT` secret and `PRIMARY KEY` should be the `COSMOS_KEY` secret**\n     ![codespaces secrets](img/codespaces_secret_settings.png)\n\n5. Run the Notebook\n\n6. 
**[Clean up your Cosmos DB Account Resources after you're done!](https://learn.microsoft.com/en-us/cosmos-db/nosql/quickstart-portal?WT.mc_id=academic-99884-jasmineg#clean-up-resources)**\n\n## Learn More\n- [GitHub Codespaces]()\n- [Azure Cosmos DB](https://learn.microsoft.com/training/modules/explore-non-relational-data-stores-azure/?WT.mc_id=academic-99884-jasmineg)","project_url":"https://awesome.ecosyste.ms/api/v1/projects/github.com%2Fpaladique%2Fcodespaces-etl-basic-demo","html_url":"https://awesome.ecosyste.ms/projects/github.com%2Fpaladique%2Fcodespaces-etl-basic-demo","lists_url":"https://awesome.ecosyste.ms/api/v1/projects/github.com%2Fpaladique%2Fcodespaces-etl-basic-demo/lists"}