Projects in Awesome Lists by AI-Hypercomputer
A curated list of projects in awesome lists by AI-Hypercomputer .
https://github.com/ai-hypercomputer/maxtext
A simple, performant and scalable Jax LLM!
deepseek fine-tuning gemma2 gemma3 gpt jax large-language-models llama2 llama3 llama4 llm mistral mixtral sft
Last synced: 14 May 2025
https://github.com/AI-Hypercomputer/maxtext
A simple, performant and scalable Jax LLM!
Last synced: 25 Sep 2025
https://github.com/ai-hypercomputer/jetstream
JetStream is a throughput and memory optimized engine for LLM inference on XLA devices, starting with TPUs (and GPUs in future -- PRs welcome).
gemma gpt gpu inference jax large-language-models llama llama2 llm llm-inference llmops mlops model-serving pytorch tpu transformer
Last synced: 23 Oct 2025
https://github.com/AI-Hypercomputer/JetStream
JetStream is a throughput and memory optimized engine for LLM inference on XLA devices, starting with TPUs (and GPUs in future -- PRs welcome).
gemma gpt gpu inference jax large-language-models llama llama2 llm llm-inference llmops mlops model-serving pytorch tpu transformer
Last synced: 31 Mar 2025
https://github.com/ai-hypercomputer/xpk
xpk (Accelerated Processing Kit, pronounced x-p-k,) is a software tool to help Cloud developers to orchestrate training jobs on accelerators such as TPUs and GPUs on GKE.
Last synced: 23 Jan 2026
https://github.com/AI-Hypercomputer/xpk
xpk (Accelerated Processing Kit, pronounced x-p-k,) is a software tool to help Cloud developers to orchestrate training jobs on accelerators such as TPUs and GPUs on GKE.
Last synced: 31 Mar 2025
https://github.com/ai-hypercomputer/gpu-recipes
Recipes for reproducing training and serving benchmarks for large machine learning models using GPUs on Google Cloud.
benchmarks distributed-training google-cloud-platform gpu serving
Last synced: 25 Jun 2025
https://github.com/ai-hypercomputer/jetstream-pytorch
PyTorch/XLA integration with JetStream (https://github.com/google/JetStream) for LLM inference"
attention batching gemma inference llama llama2 llm llm-inference model-serving pytorch tpu
Last synced: 27 Oct 2025
https://github.com/ai-hypercomputer/torchprime
torchprime is a reference model implementation for PyTorch on TPU.
Last synced: 02 Jul 2025
https://github.com/ai-hypercomputer/pathways-utils
Package of Pathways-on-Cloud utilities
Last synced: 27 Jan 2026
https://github.com/ai-hypercomputer/resiliency
This project provides resilient training logic to improve the training efficiency of large-scale distributed workloads on Google Cloud.
Last synced: 14 Sep 2025