Projects in Awesome Lists tagged with tpu
A curated list of projects in awesome lists tagged with tpu.
https://github.com/vllm-project/vllm
A high-throughput and memory-efficient inference and serving engine for LLMs
amd cuda deepseek gpt hpu inference inferentia llama llm llm-serving llmops mlops model-serving pytorch qwen rocm tpu trainium transformer xpu
Last synced: 22 Apr 2025
https://github.com/tensorflow/tensor2tensor
Library of deep learning models and datasets designed to make deep learning more accessible and accelerate ML research.
deep-learning machine-learning machine-translation reinforcement-learning tpu
Last synced: 24 Jan 2025
https://github.com/skypilot-org/skypilot
SkyPilot: Run AI and batch jobs on any infra (Kubernetes or 16+ clouds). Get unified execution, cost savings, and high GPU availability via a simple interface.
cloud-computing cloud-management cost-management cost-optimization data-science deep-learning distributed-training finops gpu hyperparameter-tuning job-queue job-scheduler llm-serving llm-training machine-learning ml-infrastructure ml-platform multicloud spot-instances tpu
Last synced: 22 Apr 2025
https://github.com/tensorflow/adanet
Fast and flexible AutoML with learning guarantees.
automl deep-learning distributed-training ensemble gpu learning-theory machine-learning neural-architecture-search python tensorflow tpu
Last synced: 10 Apr 2025
https://github.com/hollance/neural-engine
Everything we actually know about the Apple Neural Engine (ANE)
ane coreml ios iphone neural-engine neural-network tpu
Last synced: 26 Mar 2025
https://github.com/imcaspar/gpt2-ml
GPT2 for Multiple Languages, including pretrained models. GPT2 multilingual support, with a 1.5-billion-parameter Chinese pretrained model
bert chinese colab gpt-2 nlp pretrained-models tensorflow text-generation tpu
Last synced: 07 Apr 2025
https://github.com/aphrodite-engine/aphrodite-engine
Large-scale LLM inference engine
api-rest cuda inference-engine inferentia intel lora machine-learning rocm speculative-decoding tpu
Last synced: 10 Apr 2025
https://github.com/pygmalionai/aphrodite-engine
Large-scale LLM inference engine
api-rest cuda inference-engine inferentia intel lora machine-learning rocm speculative-decoding tpu
Last synced: 02 Jan 2025
https://github.com/ayaka14732/tpu-starter
Everything you want to know about Google Cloud TPU
cloud-tpu deep-learning gcp google-cloud-platform jax machine-learning tpu
Last synced: 04 Apr 2025
https://github.com/jofrfu/tinytpu
Implementation of a Tensor Processing Unit for embedded systems and the IoT.
assembly embedded-systems fpga fpga-accelerator hardware-acceleration hardware-architectures hardware-description-language hardware-designs internet-of-things iot ip-core linux tensor tensorflow tpu verilog vhdl vivado xilinx zynq
Last synced: 07 Apr 2025
https://github.com/ai-hypercomputer/jetstream
JetStream is a throughput and memory optimized engine for LLM inference on XLA devices, starting with TPUs (and GPUs in future -- PRs welcome).
gemma gpt gpu inference jax large-language-models llama llama2 llm llm-inference llmops mlops model-serving pytorch tpu transformer
Last synced: 08 Apr 2025
https://github.com/AI-Hypercomputer/JetStream
JetStream is a throughput and memory optimized engine for LLM inference on XLA devices, starting with TPUs (and GPUs in future -- PRs welcome).
gemma gpt gpu inference jax large-language-models llama llama2 llm llm-inference llmops mlops model-serving pytorch tpu transformer
Last synced: 31 Mar 2025
https://github.com/tumaer/JAXFLUIDS
Differentiable Fluid Dynamics Package
automatic-differentiation cfd compressible-flows computational-fluid-dynamics cuda deep-learning fluid-dynamics gpu gpu-computing high-performance hpc jax jaxfluids machine-learning multi-phase-flows tpu turbulence
Last synced: 11 Feb 2025
https://github.com/kohulan/decimer-image_transformer
DECIMER Image Transformer is a deep-learning-based tool designed for automated recognition of chemical structure images. Leveraging transformer architectures, the model converts chemical images into SMILES strings, enabling the digitization of chemical data from scanned documents, literature, and patents.
chemical-image-recognition decimer deep-learning image-data-mining python tensorflow tpu transformers
Last synced: 14 Apr 2025
https://github.com/embedeep/Free-TPU
Free TPU for FPGA with a compiler supporting PyTorch/Caffe/Darknet/NCNN. An AI processor that uses a Xilinx FPGA to solve image classification, detection, and segmentation problems.
caffe cnn-accelerator darknet deep-learning fpga free hardware lstm npu npu-compiler pytorch rnn tpu zynq
Last synced: 21 Apr 2025
https://github.com/juliagpu/xla.jl
Julia on TPUs
deep-learning going-faster julia-language machine-learning peanut-butter tpu xla
Last synced: 23 Jan 2025
https://github.com/JuliaGPU/XLA.jl
Julia on TPUs
deep-learning going-faster julia-language machine-learning peanut-butter tpu xla
Last synced: 27 Mar 2025
https://github.com/hhk7734/tensorflow-yolov4
YOLOv4 Implemented in Tensorflow 2.
coral edgetpu tensorflow tflite tpu yolov4
Last synced: 24 Jan 2025
https://github.com/rwightman/efficientnet-jax
EfficientNet, MobileNetV3, MobileNetV2, MixNet, etc in JAX w/ Flax Linen and Objax
efficientnet flax flax-linen jax mixnet mobilenetv2 mobilenetv3 objax tpu
Last synced: 14 Apr 2025
https://github.com/robotperf/benchmarks
Benchmarking suite to evaluate 🤖 robotics computing performance. Vendor-neutral. ⚪Grey-box and ⚫Black-box approaches.
acceleration benchmarking cpu fpga gpu performance robotics ros2 tpu
Last synced: 04 Dec 2024
https://github.com/AI-Hypercomputer/xpk
xpk (Accelerated Processing Kit, pronounced x-p-k) is a software tool that helps Cloud developers orchestrate training jobs on accelerators such as TPUs and GPUs on GKE.
Last synced: 31 Mar 2025
https://github.com/ai-hypercomputer/xpk
xpk (Accelerated Processing Kit, pronounced x-p-k) is a software tool that helps Cloud developers orchestrate training jobs on accelerators such as TPUs and GPUs on GKE.
Last synced: 05 Apr 2025
https://github.com/sayakpaul/funmatch-distillation
TF2 implementation of knowledge distillation using the "function matching" hypothesis from https://arxiv.org/abs/2106.05237.
bit-resnet image-classification keras knowledge-distillation tensorflow tpu transfer-learning vision
Last synced: 13 Jan 2025
https://github.com/PINTO0309/TPU-MobilenetSSD
Edge TPU Accelerator / Multi-TPU + MobileNet-SSD v2 + Python + Async + LattePandaAlpha/RaspberryPi3/LaptopPC
colaboratory google lattepanda mobilenetssd mobilenetv2 opencv python raspberrypi tensorflow-lite tensorflowlite tpu
Last synced: 07 Apr 2025
https://github.com/wmcnally/evopose2d
EvoPose2D is a two-stage human pose estimation model that was designed using neuroevolution. It achieves state-of-the-art accuracy on COCO.
deep-learning human-pose-estimation pose-estimation tensorflow tensorflow2 tpu
Last synced: 28 Nov 2024
https://github.com/pinto0309/tpu-mobilenetssd
Edge TPU Accelerator / Multi-TPU + MobileNet-SSD v2 + Python + Async + LattePandaAlpha/RaspberryPi3/LaptopPC
colaboratory google lattepanda mobilenetssd mobilenetv2 opencv python raspberrypi tensorflow-lite tensorflowlite tpu
Last synced: 08 Mar 2025
https://github.com/rickiepark/deep-learning-with-python-2nd
Code repository for the book <Deep Learning Learned from the Creator of Keras, 2nd Edition>
cnn deep-learning gan image-augmentation image-classification image-segmentation image-style-transfer keras keras-tuner machine-translation mixed-precision multi-gpu neural-network rnn tensorflow text-classification text-generation time-series tpu transformer
Last synced: 10 Apr 2025
https://github.com/googlecloudplatform/ml-testing-accelerators
Testing framework for Deep Learning models (Tensorflow and PyTorch) on Google Cloud hardware accelerators (TPU and GPU)
gpu machine-learning testing-accelerators tpu
Last synced: 05 Apr 2025
https://github.com/gsarti/t5-flax-gcp
Tutorial to pretrain & fine-tune a 🤗 Flax T5 model on a TPUv3-8 with GCP
gcp huggingface seq2seq t5 text-to-text-transfer-transformer tpu tpu-vm transformers
Last synced: 31 Mar 2025
https://github.com/andreped/gradientaccumulator
🎯 Accumulated Gradients for TensorFlow 2
accumulated-batch-normalization accumulated-gradients adaptive-gradient-clipping batch-size deep-learning distributed-training float16 gpu gradient-accumulation hacktoberfest huggingface keras memory-constraints mixed-precision multi-gpu tensorflow tensorflow2 tf2 tpu
Last synced: 13 Apr 2025
https://github.com/instadeepai/sebulba
🪐 The Sebulba architecture to scale reinforcement learning on Cloud TPUs in JAX
ai deep-learning hpc jax machine-learning podracer ppo reinforcement-learning sebulba tpu
Last synced: 30 Jan 2025
https://github.com/pinto0309/tpu-posenet
Edge TPU Accelerator / Multi-TPU / Multi-Model + Posenet/DeeplabV3/MobileNet-SSD + Python + Sync / Async + LaptopPC / RaspberryPi
opencv picamera posenet python raspberrypi tpu
Last synced: 09 Mar 2025
https://github.com/victordibia/tpudcgan
Train DCGAN with TPUs on Google Cloud
dcgan dcgan-tensorflow deep-learning gan machine-learning tpu
Last synced: 01 Jan 2025
https://github.com/soumik12345/point-cloud-segmentation
TF2 implementation of PointNet for segmenting point clouds
deep-learning keras point-cloud segmentation tensorflow2 tpu
Last synced: 07 Jan 2025
https://github.com/noahgift/managed_ml_systems_and_iot
Managed Machine Learning Systems and Internet of Things Live Lesson
ai automl bigquery cpu deep-learning deeplense edge-computing fpga iot machine-learning managed ml movidius python safari sagemaker tpu tutorial
Last synced: 10 Feb 2025
https://github.com/camenduru/stable-diffusion-diffusers-colab
🤗 HuggingFace Diffusers Flax TPU and PyTorch GPU for Colab
colab diffusers discord flax gdrive huggingface huggingface-diffusers pytorch tpu
Last synced: 22 Jan 2025
https://github.com/ai-hypercomputer/jetstream-pytorch
PyTorch/XLA integration with JetStream (https://github.com/google/JetStream) for LLM inference
attention batching gemma inference llama llama2 llm llm-inference model-serving pytorch tpu
Last synced: 10 Jan 2025
https://github.com/eclipse-iofog/iofog.org
Website for Eclipse ioFog, a distributed Edge Compute Network (ECN) platform
eclipse-edge eclipse-iot edge edge-compute-network edge-computing edge-native gpu-acceleration iofog jetson-agx-xavier jetson-nano jetson-tx2 jetsontx2 kubernetes myriad neural-compute-stick neural-compute-stick-2 tpu tpu-acceleration vpu yaml
Last synced: 10 Mar 2025
https://github.com/sayakpaul/big_vision_experiments
Contains my experiments with the `big_vision` repo to train ViTs on ImageNet-1k.
computer-vision google-cloud image-recognition jax large-scale-pretraining tpu
Last synced: 13 Jan 2025
https://github.com/young-geng/tpu_pod_commander
TPU pod commander is a package for managing and launching jobs on Google Cloud TPU pods.
Last synced: 07 Apr 2025
https://github.com/Hoiy/berserker
Berserker - BERt chineSE woRd toKenizER
bert bert-chinese chinese-nlp chinese-word-segmentation nlp sequence-to-sequence state-of-the-art tensorflow tokenizer tpu
Last synced: 02 Apr 2025
https://github.com/tetutaro/object_detection_tflite
Object Detection using TensorFlow Lite
object-detection raspberrypi tensorflow-lite tpu yolov3 yolov4 yolov5
Last synced: 07 Apr 2025
https://github.com/xadrianzetx/lanefinder
TPU accelerated traffic lane segmentation engine for your Raspberry Pi
coral deep-learning edge-tpu google-coral raspberry-pi semantic-segmentation tpu
Last synced: 22 Nov 2024
https://github.com/ashishpatel26/tpu_tf2
Use a TPU in a single line in Colab with the tf2 package.
colab-notebook colaboratory deep-learning tensorflow2 tpu
Last synced: 19 Nov 2024
https://github.com/zackakil/edge-tpu-safe-bike
An application of realtime object-detection running on an Edge TPU for making cycling in busy cities a little less terrifying.
ml raspberry-pi raspberry-pi-camera tensorflow tflite tpu tpu-acceleration
Last synced: 13 Feb 2025
https://github.com/pedro-r-marques/tutorial-t5-fine-tune
Tutorial for text classification with fine tuning of a T5 model on TPUs.
colaboratory nlp-machine-learning t5-model tensorflow-tutorials tensorflow2 text-classification tpu
Last synced: 09 Feb 2025
https://github.com/trisongz/tpubar
Google Cloud TPU Utilization Bar for Training Models
Last synced: 13 Apr 2025
https://github.com/chrischrislolo/chaskets1
TPU Gaskets for Choc switches
3d-printing choc-mod choc-switches gasket mod switch-mod switch-models tpu
Last synced: 16 Mar 2025
https://github.com/hyeonsangjeon/colab-tensorflow-tpu-example
colab-tensorflow-tpu-example
bert colab colab-notebook confusion-matrix histogram kobert nlp resolver-library sagemaker sentence sentence-embeddings sentence-transformers sentiment tpu transformers
Last synced: 17 Nov 2024
https://github.com/trisongz/tpu-vm-docker-containers
Docker File Templates to access TPUs within TPU VM in Containers
docker-compose docker-container google-cloud-platform machine-learning tpu
Last synced: 13 Apr 2025
https://github.com/seungjaelim/machine_learning_security
Individual Study in Computer Architecture and Systems Laboratory (CASYS) with Prof. Jaehyuk Huh in Summer 2021
deepsniffer eyeriss fgsm-attack gemmini slalom steal-ml tpu
Last synced: 25 Mar 2025
https://github.com/unhkd-dee/colab_tpu
Notebooks for colab that use the TPU
bert colab colab-notebook estimator gcp google ipynb ipython-notebook keras-tensorflow tpu
Last synced: 03 Jan 2025
https://github.com/balena-io-experimental/egde-tpu-web-streamer
Web streaming classification with the Google Edge TPU on the picamera and balenaFin
balena balena-io balenafin classification coral docker iot machine-learning picamera python raspberry-pi raspberrypi tpu tpu-acceleration
Last synced: 18 Apr 2025
https://github.com/balena-io-playground/egde-tpu-web-streamer
Web streaming classification with the Google Edge TPU on the picamera and balenaFin
balena balena-io balenafin classification coral docker iot machine-learning picamera python raspberry-pi raspberrypi tpu tpu-acceleration
Last synced: 20 Apr 2025
https://github.com/ekzhang/archax
Experiments in multi-architecture parallelism for deep learning with JAX
cpu gpu jax machine-learning ml parallelism pipeline tpu
Last synced: 13 Mar 2025
https://github.com/cloudwiser/tensorflowliterpizerotpu
TensorFlow Lite & Coral TPU: C++ examples on Raspberry Pi Zero W
armv6 google-coral raspberry-pi-zero-w tensorflow-examples tensorflow-lite tpu
Last synced: 19 Feb 2025
https://github.com/apurva-modi/flower-classification
In this competition, we’re challenged to build a machine learning model that identifies the type of flowers in a dataset of images (for simplicity, we’re sticking to just over 100 types).
efficientnet-keras ensemble-model flower-classification kaggle-competition machine-learning multiclass-classification tensorflow tpu transfer-learning
Last synced: 02 Apr 2025
https://github.com/nicholaswilven/pegasus-tpu-trainer
Implementation to pretrain and finetune Transformer encoder-decoder (PEGASUS) using Tensorflow + TFRecords on TPU
nlp tensorflow tpu transformers
Last synced: 19 Dec 2024
https://github.com/alonfnt/tsnex
Minimal t-distributed stochastic neighbor embedding (t-SNE) implementation in JAX.
cpu dimensionality-reduction gpu jax t-sne tpu
Last synced: 18 Apr 2025
https://github.com/ayaka14732/bart-jax
JAX implementation of BART, aiming to demonstrate how Transformer-based models can be implemented using JAX and trained on Google Cloud TPUs
bart jax language-model natural-language-processing nlp tpu trans transformer
Last synced: 21 Mar 2025
https://github.com/tlatkowski/u-net-tpu
Tensorflow implementation of U-Net model with TPU Estimator support.
cnn convolutional-neural-networks deep-learning distributed-training encoder-decoder google-cloud-platform image-classification image-processing image-recognition image-segmentation tensorflow tensorflow-models tpu u-net unet unet-image-segmentation unet-model unet-tensorflow vision
Last synced: 04 Apr 2025
https://github.com/pinto0309/edgetpu-bin
Prebuilt binary for EdgeTPU PythonAPI standalone installer.
aarch64 arm cross-compile edge-tpu installer tpu wheel x86-64
Last synced: 03 Apr 2025
https://github.com/sthysel/tpuparty
Tools and toys for working with the coral TPU
Last synced: 13 Mar 2025
https://github.com/zackakil/6-oclock-helmet
A multi-use helmet that tells you what is behind you using machine learning. This is a continuation of the TPU bike project.
arduino automl computer-vision cycling edge-tpu electronics google-cloud machine-learning raspberry-pi tpu
Last synced: 02 Mar 2025
https://github.com/goruck/semantic-segmentation-server
Semantic segmentation served over grpc using Google Edge TPU.
gprc semantic-segmentation tpu
Last synced: 25 Feb 2025
https://github.com/riolaf05/cv-follow-camera
RaspberryPi Camera which follows objects using computer vision
camera computer-vision docker docker-compose opencv opencv-python raspberry-pi tpu
Last synced: 14 Mar 2025
https://github.com/dsseng/rust-tf-pluggabledevice
A reference TensorFlow PluggableDevice implementation, in Rust
ffi-bindings rust tensorflow tensorflow2 tpu tpu-acceleration
Last synced: 16 Mar 2025
https://github.com/0x7o/ae
Scalable code for training and fine-tuning language models on TPUs
large-language-models scaling tpu
Last synced: 10 Mar 2025