Ecosyste.ms: Awesome
An open API service indexing awesome lists of open source software.
https://github.com/tincans-ai/gazelle
Joint speech-language model - respond directly to audio!
https://github.com/tincans-ai/gazelle
audio llm multimodal speech
Last synced: about 2 months ago
JSON representation
Joint speech-language model - respond directly to audio!
- Host: GitHub
- URL: https://github.com/tincans-ai/gazelle
- Owner: tincans-ai
- License: apache-2.0
- Created: 2024-02-20T13:43:05.000Z (11 months ago)
- Default Branch: main
- Last Pushed: 2024-07-01T18:56:06.000Z (6 months ago)
- Last Synced: 2024-08-05T06:03:29.096Z (5 months ago)
- Topics: audio, llm, multimodal, speech
- Language: Python
- Homepage: https://tincans.ai
- Size: 3.69 MB
- Stars: 296
- Watchers: 12
- Forks: 26
- Open Issues: 1
-
Metadata Files:
- Readme: readme.md
- License: LICENSE
Awesome Lists containing this project
- awesome_ai_agents - Gazelle - Joint speech-language model - respond directly to audio! (Building / LLM Models)
- awesome_ai_agents - Gazelle - Joint speech-language model - respond directly to audio! (Building / LLM Models)
README
# Gazelle - Joint Speech Language Model
This repository contains modeling code for the Gazelle Joint Speech Language Model.
![gazelle wearing headphones, cartoon style](logo.webp)
For some more details, read our [blog post](https://tincans.ai/slm) or [update](https://tincans.ai/slm2).
This inference code is heavily derived from [Huggingface's Llava implementation](https://github.com/huggingface/transformers/blob/f7ef7cec6c6c162087421f36a17eabdbb223579d/src/transformers/models/llava/modeling_llava.py). It is not particularly optimized, but we welcome community contributions.
**Checkpoints**:
- v0.2: [huggingface](https://huggingface.co/tincans-ai/gazelle-v0.2)
- v0.2-dpo: [huggingface](https://huggingface.co/tincans-ai/gazelle-v0.2-dpo)
- v0.1: [huggingface](https://huggingface.co/tincans-ai/gazelle-v0.1)**Links**:
- Join us in [Discord](https://discord.gg/qyC5h3FSzU).
- Original [blog post](https://tincans.ai/slm)
- v0.1 [release notes](https://tincans.ai/slm2)## Disclaimer
We are backproppin' on a budget and these initial checkpoints are not very robust to many real-world considerations. We make no representation of warranty or liability.
The v0.2 model responds to jailbreaks and adversarial attacks. We do not suggest using this in production.
## License
This modeling code is licensed under Apache 2.0. The v0.2 release is derived from Mistral 7B and is released under the same Apache 2.0 license.
The v0.1 pretrained checkpoints derive from Llama 2, which is governed by the [Llama 2 license](https://ai.meta.com/llama/license/). You must agree to these terms if you use the pretrained weights!