An open API service indexing awesome lists of open source software.

Projects in Awesome Lists tagged with tpu

A curated list of projects in awesome lists tagged with tpu .

https://github.com/vllm-project/vllm

A high-throughput and memory-efficient inference and serving engine for LLMs

amd cuda deepseek gpt hpu inference inferentia llama llm llm-serving llmops mlops model-serving pytorch qwen rocm tpu trainium transformer xpu

Last synced: 22 Apr 2025

https://github.com/tensorflow/tensor2tensor

Library of deep learning models and datasets designed to make deep learning more accessible and accelerate ML research.

deep-learning machine-learning machine-translation reinforcement-learning tpu

Last synced: 24 Jan 2025

https://github.com/skypilot-org/skypilot

SkyPilot: Run AI and batch jobs on any infra (Kubernetes or 16+ clouds). Get unified execution, cost savings, and high GPU availability via a simple interface.

cloud-computing cloud-management cost-management cost-optimization data-science deep-learning distributed-training finops gpu hyperparameter-tuning job-queue job-scheduler llm-serving llm-training machine-learning ml-infrastructure ml-platform multicloud spot-instances tpu

Last synced: 22 Apr 2025

https://github.com/hollance/neural-engine

Everything we actually know about the Apple Neural Engine (ANE)

ane coreml ios iphone neural-engine neural-network tpu

Last synced: 26 Mar 2025

https://github.com/imcaspar/gpt2-ml

GPT2 for Multiple Languages, including pretrained models. GPT2 多语言支持, 15亿参数中文预训练模型

bert chinese colab gpt-2 nlp pretrained-models tensorflow text-generation tpu

Last synced: 07 Apr 2025

https://github.com/ayaka14732/tpu-starter

Everything you want to know about Google Cloud TPU

cloud-tpu deep-learning gcp google-cloud-platform jax machine-learning tpu

Last synced: 04 Apr 2025

https://github.com/ai-hypercomputer/jetstream

JetStream is a throughput and memory optimized engine for LLM inference on XLA devices, starting with TPUs (and GPUs in future -- PRs welcome).

gemma gpt gpu inference jax large-language-models llama llama2 llm llm-inference llmops mlops model-serving pytorch tpu transformer

Last synced: 08 Apr 2025

https://github.com/AI-Hypercomputer/JetStream

JetStream is a throughput and memory optimized engine for LLM inference on XLA devices, starting with TPUs (and GPUs in future -- PRs welcome).

gemma gpt gpu inference jax large-language-models llama llama2 llm llm-inference llmops mlops model-serving pytorch tpu transformer

Last synced: 31 Mar 2025

https://github.com/kohulan/decimer-image_transformer

DECIMER Image Transformer is a deep-learning-based tool designed for automated recognition of chemical structure images. Leveraging transformer architectures, the model converts chemical images into SMILES strings, enabling the digitization of chemical data from scanned documents, literature, and patents.

chemical-image-recognition decimer deep-learning image-data-mining python tensorflow tpu transformers

Last synced: 14 Apr 2025

https://github.com/embedeep/Free-TPU

Free TPU for FPGA with compiler supporting Pytorch/Caffe/Darknet/NCNN. An AI processor for using Xilinx FPGA to solve image classification, detection, and segmentation problem.

caffe cnn-accelerator darknet deep-learning fpga free hardware lstm npu npu-compiler pytorch rnn tpu zynq

Last synced: 21 Apr 2025

https://github.com/hhk7734/tensorflow-yolov4

YOLOv4 Implemented in Tensorflow 2.

coral edgetpu tensorflow tflite tpu yolov4

Last synced: 24 Jan 2025

https://github.com/rwightman/efficientnet-jax

EfficientNet, MobileNetV3, MobileNetV2, MixNet, etc in JAX w/ Flax Linen and Objax

efficientnet flax flax-linen jax mixnet mobilenetv2 mobilenetv3 objax tpu

Last synced: 14 Apr 2025

https://github.com/robotperf/benchmarks

Benchmarking suite to evaluate 🤖 robotics computing performance. Vendor-neutral. ⚪Grey-box and ⚫Black-box approaches.

acceleration benchmarking cpu fpga gpu performance robotics ros2 tpu

Last synced: 04 Dec 2024

https://github.com/AI-Hypercomputer/xpk

xpk (Accelerated Processing Kit, pronounced x-p-k,) is a software tool to help Cloud developers to orchestrate training jobs on accelerators such as TPUs and GPUs on GKE.

gcloud gke tpu

Last synced: 31 Mar 2025

https://github.com/ai-hypercomputer/xpk

xpk (Accelerated Processing Kit, pronounced x-p-k,) is a software tool to help Cloud developers to orchestrate training jobs on accelerators such as TPUs and GPUs on GKE.

gcloud gke tpu

Last synced: 05 Apr 2025

https://github.com/sayakpaul/funmatch-distillation

TF2 implementation of knowledge distillation using the "function matching" hypothesis from https://arxiv.org/abs/2106.05237.

bit-resnet image-classification keras knowledge-distillation tensorflow tpu transfer-learning vision

Last synced: 13 Jan 2025

https://github.com/PINTO0309/TPU-MobilenetSSD

Edge TPU Accelerator / Multi-TPU + MobileNet-SSD v2 + Python + Async + LattePandaAlpha/RaspberryPi3/LaptopPC

colaboratory google lattepanda mobilenetssd mobilenetv2 opencv python raspberrypi tensorflow-lite tensorflowlite tpu

Last synced: 07 Apr 2025

https://github.com/wmcnally/evopose2d

EvoPose2D is a two-stage human pose estimation model that was designed using neuroevolution. It achieves state-of-the-art accuracy on COCO.

deep-learning human-pose-estimation pose-estimation tensorflow tensorflow2 tpu

Last synced: 28 Nov 2024

https://github.com/pinto0309/tpu-mobilenetssd

Edge TPU Accelerator / Multi-TPU + MobileNet-SSD v2 + Python + Async + LattePandaAlpha/RaspberryPi3/LaptopPC

colaboratory google lattepanda mobilenetssd mobilenetv2 opencv python raspberrypi tensorflow-lite tensorflowlite tpu

Last synced: 08 Mar 2025

https://github.com/googlecloudplatform/ml-testing-accelerators

Testing framework for Deep Learning models (Tensorflow and PyTorch) on Google Cloud hardware accelerators (TPU and GPU)

gpu machine-learning testing-accelerators tpu

Last synced: 05 Apr 2025

https://github.com/gsarti/t5-flax-gcp

Tutorial to pretrain & fine-tune a 🤗 Flax T5 model on a TPUv3-8 with GCP

gcp huggingface seq2seq t5 text-to-text-transfer-transformer tpu tpu-vm transformers

Last synced: 31 Mar 2025

https://github.com/instadeepai/sebulba

🪐 The Sebulba architecture to scale reinforcement learning on Cloud TPUs in JAX

ai deep-learning hpc jax machine-learning podracer ppo reinforcement-learning sebulba tpu

Last synced: 30 Jan 2025

https://github.com/pinto0309/tpu-posenet

Edge TPU Accelerator / Multi-TPU / Multi-Model + Posenet/DeeplabV3/MobileNet-SSD + Python + Sync / Async + LaptopPC / RaspberryPi

opencv picamera posenet python raspberrypi tpu

Last synced: 09 Mar 2025

https://github.com/victordibia/tpudcgan

Train DCGAN with TPUs on Google Cloud

dcgan dcgan-tensorflow deep-learning gan machine-learning tpu

Last synced: 01 Jan 2025

https://github.com/soumik12345/point-cloud-segmentation

TF2 implementation of PointNet for segmenting point clouds

deep-learning keras point-cloud segmentation tensorflow2 tpu

Last synced: 07 Jan 2025

https://github.com/camenduru/stable-diffusion-diffusers-colab

🤗 HuggingFace Diffusers Flax TPU and PyTorch GPU for Colab

colab diffusers discord flax gdrive huggingface huggingface-diffusers pytorch tpu

Last synced: 22 Jan 2025

https://github.com/ai-hypercomputer/jetstream-pytorch

PyTorch/XLA integration with JetStream (https://github.com/google/JetStream) for LLM inference"

attention batching gemma inference llama llama2 llm llm-inference model-serving pytorch tpu

Last synced: 10 Jan 2025

https://github.com/sayakpaul/big_vision_experiments

Contains my experiments with the `big_vision` repo to train ViTs on ImageNet-1k.

computer-vision google-cloud image-recognition jax large-scale-pretraining tpu

Last synced: 13 Jan 2025

https://github.com/young-geng/tpu_pod_commander

TPU pod commander is a package for managing and launching jobs on Google Cloud TPU pods.

google-cloud tpu

Last synced: 07 Apr 2025

https://github.com/sophgo/tpu_compiler

cvitek ai compiler base on MLIR

compiler mlir tpu

Last synced: 20 Mar 2025

https://github.com/xadrianzetx/lanefinder

TPU accelerated traffic lane segmentation engine for your Raspberry Pi

coral deep-learning edge-tpu google-coral raspberry-pi semantic-segmentation tpu

Last synced: 22 Nov 2024

https://github.com/ashishpatel26/tpu_tf2

TPU use in single line in colab using tf2 package.

colab-notebook colaboratory deep-learning tensorflow2 tpu

Last synced: 19 Nov 2024

https://github.com/zackakil/edge-tpu-safe-bike

An application of realtime object-detection running on an Edge TPU for making cycling in busy cities a little less terrifying.

ml raspberry-pi raspberry-pi-camera tensorflow tflite tpu tpu-acceleration

Last synced: 13 Feb 2025

https://github.com/pedro-r-marques/tutorial-t5-fine-tune

Tutorial for text classification with fine tuning of a T5 model on TPUs.

colaboratory nlp-machine-learning t5-model tensorflow-tutorials tensorflow2 text-classification tpu

Last synced: 09 Feb 2025

https://github.com/trisongz/tpubar

Google Cloud TPU Utilization Bar for Training Models

tensorflow tpu tpus

Last synced: 13 Apr 2025

https://github.com/trisongz/tpu-vm-docker-containers

Docker File Templates to access TPUs within TPU VM in Containers

docker-compose docker-container google-cloud-platform machine-learning tpu

Last synced: 13 Apr 2025

https://github.com/seungjaelim/machine_learning_security

Individual Study in Computer Architecture and Systems Laboratory (CASYS) with Prof.Jaehyuk Huh in 2021 Summer

deepsniffer eyeriss fgsm-attack gemmini slalom steal-ml tpu

Last synced: 25 Mar 2025

https://github.com/ekzhang/archax

Experiments in multi-architecture parallelism for deep learning with JAX

cpu gpu jax machine-learning ml parallelism pipeline tpu

Last synced: 13 Mar 2025

https://github.com/cloudwiser/tensorflowliterpizerotpu

TensorFlow Lite & Coral TPU: C++ examples on Raspberry Pi Zero W

armv6 google-coral raspberry-pi-zero-w tensorflow-examples tensorflow-lite tpu

Last synced: 19 Feb 2025

https://github.com/apurva-modi/flower-classification

In this competition, we’re challenged to build a machine learning model that identifies the type of flowers in a dataset of images (for simplicity, we’re sticking to just over 100 types).

efficientnet-keras ensemble-model flower-classification kaggle-competition machine-learning multiclass-classification tensorflow tpu transfer-learning

Last synced: 02 Apr 2025

https://github.com/nicholaswilven/pegasus-tpu-trainer

Implementation to pretrain and finetune Transformer encoder-decoder (PEGASUS) using Tensorflow + TFRecords on TPU

nlp tensorflow tpu transformers

Last synced: 19 Dec 2024

https://github.com/alonfnt/tsnex

Minimal t-distributed stochastic neighbor embedding (t-SNE) implementation in JAX.

cpu dimensionality-reduction gpu jax t-sne tpu

Last synced: 18 Apr 2025

https://github.com/ayaka14732/bart-jax

JAX implementation of BART, aiming to demonstrate how Transformer-based models can be implemented using JAX and trained on Google Cloud TPUs

bart jax language-model natural-language-processing nlp tpu trans transformer

Last synced: 21 Mar 2025

https://github.com/pinto0309/edgetpu-bin

Prebuilt binary for EdgeTPU PythonAPI standalone installer.

aarch64 arm cross-compile edge-tpu installer tpu wheel x86-64

Last synced: 03 Apr 2025

https://github.com/sthysel/tpuparty

Tools and toys for working with the coral TPU

coral-tpu tensorflow tpu

Last synced: 13 Mar 2025

https://github.com/zackakil/6-oclock-helmet

A multi-use helmet that tells you what is behind you using machine learning. This is a continuation of the TPU bike project.

arduino automl computer-vision cycling edge-tpu electronics google-cloud machine-learning raspberry-pi tpu

Last synced: 02 Mar 2025

https://github.com/goruck/semantic-segmentation-server

Semantic segmentation served over grpc using Google Edge TPU.

gprc semantic-segmentation tpu

Last synced: 25 Feb 2025

https://github.com/riolaf05/cv-follow-camera

RaspberryPi Camera which follows objects using computer vision

camera computer-vision docker docker-compose opencv opencv-python raspberry-pi tpu

Last synced: 14 Mar 2025

https://github.com/dsseng/rust-tf-pluggabledevice

A reference TensorFlow PluggableDevice implementation, in Rust

ffi-bindings rust tensorflow tensorflow2 tpu tpu-acceleration

Last synced: 16 Mar 2025

https://github.com/0x7o/ae

Scalable code for training and fine-tuning language models on TPUs

large-language-models scaling tpu

Last synced: 10 Mar 2025