An open API service indexing awesome lists of open source software.

Projects in Awesome Lists by AI-Hypercomputer

A curated list of projects in awesome lists by AI-Hypercomputer.

https://github.com/AI-Hypercomputer/maxtext

A simple, performant, and scalable JAX LLM!

gpt large-language-models llm

Last synced: 25 Sep 2025

https://github.com/ai-hypercomputer/jetstream

JetStream is a throughput- and memory-optimized engine for LLM inference on XLA devices, starting with TPUs (and GPUs in the future; PRs welcome).

gemma gpt gpu inference jax large-language-models llama llama2 llm llm-inference llmops mlops model-serving pytorch tpu transformer

Last synced: 23 Oct 2025

https://github.com/ai-hypercomputer/xpk

xpk (Accelerated Processing Kit, pronounced x-p-k) is a software tool that helps Cloud developers orchestrate training jobs on accelerators such as TPUs and GPUs on GKE.

gcloud gke tpu

Last synced: 23 Jan 2026

https://github.com/ai-hypercomputer/gpu-recipes

Recipes for reproducing training and serving benchmarks for large machine learning models using GPUs on Google Cloud.

benchmarks distributed-training google-cloud-platform gpu serving

Last synced: 25 Jun 2025

https://github.com/ai-hypercomputer/jetstream-pytorch

PyTorch/XLA integration with JetStream (https://github.com/google/JetStream) for LLM inference

attention batching gemma inference llama llama2 llm llm-inference model-serving pytorch tpu

Last synced: 27 Oct 2025

https://github.com/ai-hypercomputer/torchprime

torchprime is a reference model implementation for PyTorch on TPU.

Last synced: 02 Jul 2025

https://github.com/ai-hypercomputer/pathways-utils

A package of Pathways-on-Cloud utilities.

Last synced: 27 Jan 2026

https://github.com/ai-hypercomputer/resiliency

This project provides resilient training logic to improve the training efficiency of large-scale distributed workloads on Google Cloud.

Last synced: 14 Sep 2025