Projects in Awesome Lists by AI-Hypercomputer | Ecosyste.ms: Awesome

A curated list of projects in awesome lists by AI-Hypercomputer.

https://github.com/ai-hypercomputer/maxtext

A simple, performant and scalable Jax LLM!

deepseek fine-tuning gemma2 gemma3 gpt jax large-language-models llama2 llama3 llama4 llm mistral mixtral sft

Last synced: 14 May 2025

https://github.com/AI-Hypercomputer/maxtext

A simple, performant and scalable Jax LLM!

gpt large-language-models llm

Last synced: 25 Sep 2025

https://github.com/ai-hypercomputer/jetstream

JetStream is a throughput- and memory-optimized engine for LLM inference on XLA devices, starting with TPUs (and GPUs in the future -- PRs welcome).

gemma gpt gpu inference jax large-language-models llama llama2 llm llm-inference llmops mlops model-serving pytorch tpu transformer

Last synced: 23 Oct 2025

https://github.com/AI-Hypercomputer/JetStream

JetStream is a throughput- and memory-optimized engine for LLM inference on XLA devices, starting with TPUs (and GPUs in the future -- PRs welcome).

gemma gpt gpu inference jax large-language-models llama llama2 llm llm-inference llmops mlops model-serving pytorch tpu transformer

Last synced: 31 Mar 2025

https://github.com/ai-hypercomputer/maxdiffusion

Last synced: 13 Apr 2025

https://github.com/AI-Hypercomputer/xpk

xpk (Accelerated Processing Kit, pronounced x-p-k) is a software tool that helps Cloud developers orchestrate training jobs on accelerators such as TPUs and GPUs on GKE.

Last synced: 31 Mar 2025

https://github.com/ai-hypercomputer/xpk

xpk (Accelerated Processing Kit, pronounced x-p-k) is a software tool that helps Cloud developers orchestrate training jobs on accelerators such as TPUs and GPUs on GKE.

Last synced: 23 Jan 2026

https://github.com/ai-hypercomputer/gpu-recipes

Recipes for reproducing training and serving benchmarks for large machine learning models using GPUs on Google Cloud.

benchmarks distributed-training google-cloud-platform gpu serving

Last synced: 25 Jun 2025

https://github.com/ai-hypercomputer/jetstream-pytorch

PyTorch/XLA integration with JetStream (https://github.com/google/JetStream) for LLM inference

attention batching gemma inference llama llama2 llm llm-inference model-serving pytorch tpu

Last synced: 27 Oct 2025

https://github.com/ai-hypercomputer/torchprime

torchprime is a reference model implementation for PyTorch on TPU.

Last synced: 02 Jul 2025

https://github.com/ai-hypercomputer/pathways-utils

Package of Pathways-on-Cloud utilities

Last synced: 04 Apr 2026

https://github.com/ai-hypercomputer/cloud-accelerator-diagnostics

Last synced: 02 Jul 2025

https://github.com/ai-hypercomputer/kithara

Last synced: 02 Jul 2025

https://github.com/ai-hypercomputer/cloud-tpu-monitoring-debugging

Last synced: 02 Jul 2025

https://github.com/ai-hypercomputer/ml-goodput-measurement

Last synced: 14 Apr 2025

https://github.com/ai-hypercomputer/ray-tpu

Last synced: 02 Jul 2025

https://github.com/ai-hypercomputer/inference-benchmark

Last synced: 02 Jul 2025

https://github.com/AI-Hypercomputer/inference-benchmark

Last synced: 11 May 2025

https://github.com/ai-hypercomputer/cloud-diagnostics-xprof

Last synced: 02 Jul 2025

https://github.com/ai-hypercomputer/tpu-recipes

Last synced: 03 Sep 2025

https://github.com/ai-hypercomputer/resiliency

This project provides resilient training logic to improve the training efficiency of large-scale distributed workloads on Google Cloud.

Last synced: 14 Sep 2025

https://github.com/ai-hypercomputer/recml

Last synced: 14 Apr 2025

https://github.com/AI-Hypercomputer/RecML

Last synced: 14 Apr 2025

https://github.com/ai-hypercomputer/accelerator-microbenchmarks

Last synced: 02 Jul 2025

https://github.com/ai-hypercomputer/aotc

Last synced: 02 Jul 2025