https://github.com/notshrirang/paligemma

A Vision Language Model implemented in PyTorch
gemma gemma-2b multimodal transformers vlm


# PaliGemma

PaliGemma is a Vision Language Model (VLM) released by Google.

Base Paper - https://arxiv.org/abs/2407.07726

## Architecture
![PaliGemma Architecture](https://github.com/user-attachments/assets/78f23d8a-d20c-43f5-b31b-e2c4438017cd)
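
As a rough companion to the diagram, here is a minimal sketch of two ideas described in the base paper: how many image tokens the vision encoder yields, and the prefix-LM attention pattern (full attention over image and prompt tokens, causal attention over the generated suffix). It is plain Python for illustration and is not taken from this repository's code. The function names `num_image_tokens` and `prefix_lm_mask` are hypothetical, and the default patch size of 14 is an assumption based on the SigLIP encoder used in the paper.

```python
from __future__ import annotations


def num_image_tokens(image_size: int, patch_size: int = 14) -> int:
    """Number of patch tokens a ViT-style encoder produces for a square image.

    patch_size=14 is an assumption (SigLIP So400m/14 as described in the paper).
    """
    if image_size % patch_size != 0:
        raise ValueError("image_size must be divisible by patch_size")
    per_side = image_size // patch_size
    return per_side * per_side


def prefix_lm_mask(num_image: int, num_prefix: int, num_suffix: int) -> list[list[bool]]:
    """Build a prefix-LM attention mask.

    mask[q][k] is True when query position q may attend to key position k.
    Sequence layout assumed here: [image tokens][prefix text][suffix text].
    """
    prefix_len = num_image + num_prefix
    total = prefix_len + num_suffix
    mask = []
    for q in range(total):
        row = []
        for k in range(total):
            # Image and prefix keys are visible to every query;
            # suffix keys are visible only causally (k <= q).
            row.append(k < prefix_len or k <= q)
        mask.append(row)
    return mask


if __name__ == "__main__":
    print(num_image_tokens(224))
    for row in prefix_lm_mask(num_image=2, num_prefix=1, num_suffix=2):
        print("".join("x" if allowed else "." for allowed in row))
```

In a PyTorch implementation the same pattern would usually be built as a boolean or additive tensor and passed to the attention layers; the nested-list form above only makes the structure easy to inspect.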