{"id":18903995,"url":"https://github.com/dudeperf3ct/11-cortex-deploy","last_synced_at":"2026-05-08T01:38:43.830Z","repository":{"id":191864841,"uuid":"447879163","full_name":"dudeperf3ct/11-cortex-deploy","owner":"dudeperf3ct","description":null,"archived":false,"fork":false,"pushed_at":"2022-01-15T17:16:04.000Z","size":29,"stargazers_count":1,"open_issues_count":0,"forks_count":0,"subscribers_count":2,"default_branch":"main","last_synced_at":"2024-12-31T10:17:49.854Z","etag":null,"topics":["aws-lambda","cortex","docker","fastapi","mlops","model-serving","transformers"],"latest_commit_sha":null,"homepage":"","language":"Python","has_issues":true,"has_wiki":null,"has_pages":null,"mirror_url":null,"source_name":null,"license":null,"status":null,"scm":"git","pull_requests_enabled":true,"icon_url":"https://github.com/dudeperf3ct.png","metadata":{"files":{"readme":"Readme.md","changelog":null,"contributing":null,"funding":null,"license":null,"code_of_conduct":null,"threat_model":null,"audit":null,"citation":null,"codeowners":null,"security":null,"support":null,"governance":null}},"created_at":"2022-01-14T07:28:56.000Z","updated_at":"2023-03-04T09:02:06.000Z","dependencies_parsed_at":"2023-09-01T07:12:33.627Z","dependency_job_id":"6e714834-9c87-46b2-9ba8-5ef3a0cc24fc","html_url":"https://github.com/dudeperf3ct/11-cortex-deploy","commit_stats":null,"previous_names":["dudeperf3ct/11-cortex-deploy"],"tags_count":0,"template":false,"template_full_name":null,"repository_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/dudeperf3ct%2F11-cortex-deploy","tags_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/dudeperf3ct%2F11-cortex-deploy/tags","releases_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/dudeperf3ct%2F11-cortex-deploy/releases","manifests_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/dudeperf3ct%2F11-cortex-deploy/manifests","owner_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/owners/dudeperf3ct","download_url":"https://codeload.github.com/dudeperf3ct/11-cortex-deploy/tar.gz/refs/heads/main","host":{"name":"GitHub","url":"https://github.com","kind":"github","repositories_count":239888526,"owners_count":19713692,"icon_url":"https://github.com/github.png","version":null,"created_at":"2022-05-30T11:31:42.601Z","updated_at":"2022-07-04T15:15:14.044Z","host_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub","repositories_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories","repository_names_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repository_names","owners_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/owners"}},"keywords":["aws-lambda","cortex","docker","fastapi","mlops","model-serving","transformers"],"created_at":"2024-11-08T09:07:07.567Z","updated_at":"2026-03-04T09:30:21.254Z","avatar_url":"https://github.com/dudeperf3ct.png","language":"Python","funding_links":[],"categories":[],"sub_categories":[],"readme":"# Cortex\n\n[Cortex](https://www.cortex.dev) is built to deploy, manage, and scale machine learning models in production for AWS. It provides features such as\n\n- Serverless workloads\n- Automated cluster management\n- CI/CD and observability integrations\n\nCortex supports 4 different ways to build scalable API :\n\n1. [Realtime](https://docs.cortex.dev/workloads/realtime/example): create APIs that respond to requests in real-time.\n2. [Async](https://docs.cortex.dev/workloads/async/example): create APIs that respond to requests asynchronously.\n3. [Batch](https://docs.cortex.dev/workloads/batch/example): create APIs that run distributed batch jobs.\n4. [Task](https://docs.cortex.dev/workloads/task/example): create APIs that run jobs on-demand.\n\nCortex requires only two configuration file to deploy the application. Cortex creates a cluster from `cluster.yaml` file including a s3 bucket and cloudwatch log group. The Cortex cluster runs on an EKS (Kubernetes) cluster in a dedicated VPC on your AWS account. Each individual API contains `cortex.yaml` to deploy different types of workloads.\n\nIn this exercise, transformers sentiment classifier application is deployed using Cortex two different APIs.\n\n1. [Realtime API](realtime/Readme.md)\n2. [Async API](async/Readme.md)\n\nCortex is super :rocket: With just 2 commands, 2 configuration files and right amount of patience, the application is deployed seamlessly without modifications to the application.\n\nFurther Readings:\n\nWe just barely scratch the surface by deploying a simple application. Cortex provides different features such `Traffic Splitter`, `Autoscaling`, `Update strategy`, `node groups`, etc. Cortex [cli](https://docs.cortex.dev/clients/cli) and [python client](https://docs.cortex.dev/clients/python) provides easy way to handle cortex updates/upgrades to deployments.\n\n- Cortex Documentation : [Docs](https://docs.cortex.dev/)\n","project_url":"https://awesome.ecosyste.ms/api/v1/projects/github.com%2Fdudeperf3ct%2F11-cortex-deploy","html_url":"https://awesome.ecosyste.ms/projects/github.com%2Fdudeperf3ct%2F11-cortex-deploy","lists_url":"https://awesome.ecosyste.ms/api/v1/projects/github.com%2Fdudeperf3ct%2F11-cortex-deploy/lists"}