An open API service indexing awesome lists of open source software.

https://github.com/ksm26/introducing-multimodal-llama-3.2


https://github.com/ksm26/introducing-multimodal-llama-3.2

Last synced: 6 months ago
JSON representation

Awesome Lists containing this project

README

          

# ๐Ÿฆ™ [Introducing Multimodal Llama 3.2](https://www.deeplearning.ai/short-courses/introducing-multimodal-llama-3-2/)

Welcome to the "Introducing Multimodal Llama 3.2" course! ๐Ÿš€ This course covers the latest advancements in the Llama model family, including multimodality, custom tool calling, and the new Llama Stack.

## ๐Ÿ“˜ Course Summary
This course explores the new capabilities of Llama 3.2, focusing on custom tool calling, multimodal prompting, and the Llama Stack for orchestration. Learn how the Llama family of open models, ranging from 1B to 405B parameters, is driving AI innovation, allowing developers to customize, fine-tune, or build new applications.

**What Youโ€™ll Learn:**
1. ๐Ÿง  **Llama 3.2 Features**: Learn about the new models, their training, key features, and how they integrate into the Llama family.
2. ๐Ÿ–ผ๏ธ **Multimodal Prompting**: Explore advanced image reasoning use cases such as understanding car dashboard errors, adding up receipts, grading math homework, and more.
3. ๐ŸŽฏ **Role-based Prompting**: Understand how Llama 3.1 and 3.2 use different rolesโ€”system, user, assistant, and ipythonโ€”and the prompt format that identifies these roles.
4. ๐Ÿ”ข **Tokenization**: Learn how Llama uses the tiktoken tokenizer with an expanded 128k vocabulary that improves encoding efficiency and supports seven non-English languages.
5. ๐Ÿ”ง **Tool Calling**: Learn how to prompt Llama to call both built-in and custom tools with examples for web search and solving math equations.
6. ๐Ÿ› ๏ธ **Llama Stack API**: Discover the Llama Stack API, a standardized interface for toolchain components like fine-tuning and synthetic data generation, enabling you to customize Llama models and build agentic applications.

## ๐Ÿ”‘ Key Points
- ๐Ÿ–ผ๏ธ **Multimodal Capabilities**: Leverage the image classification, vision reasoning, and tool use capabilities of Llama 3.2.
- ๐Ÿงฉ **Advanced Prompting Techniques**: Learn the details of prompting, tokenization, and tool calling in Llama 3.2.
- ๐Ÿ› ๏ธ **Llama Stack**: Gain knowledge of the Llama Stack, a standardized interface for building advanced AI applications on top of the Llama models.

## ๐Ÿ‘จโ€๐Ÿซ About the Instructor
- ๐Ÿ‘จโ€๐Ÿ’ป **Amit Sangani**: Senior Director of AI Partner Engineering at Meta, Amit is a key contributor to the Llama model development and will guide you through the advanced capabilities of Llama 3.2.

๐Ÿ”— To enroll in the course or for more information, visit ๐Ÿ“š [deeplearning.ai](https://www.deeplearning.ai/short-courses/).