Ecosyste.ms: Awesome
An open API service indexing awesome lists of open source software.
Awesome-Multimodal-Prompts
Prompts of GPT-4V & DALL-E3 to full utilize the multi-modal ability. GPT4V Prompts, DALL-E3 Prompts.
https://github.com/langgptai/Awesome-Multimodal-Prompts
- ChatGPT can now see, hear, and speak
- Awesome-Multimodal-Large-Language-Models - Multimodal-Large-Language-Models)
- The Dawn of LMMs: Preliminary Explorations with GPT-4V(ision)
- 试过GPT-4V后,微软写了个166页的测评报告,业内人士:高级用户必读 - 4V-zh.pdf)
- ChatGPT多模态解禁,网友玩疯!拍图即生代码,古卷手稿一眼识别,图表总结超6
- AnyMAL: An Efficient and Scalable Any-Modality Augmented Language Model - Modality Augmented Language Model (AnyMAL), a unified model that reasons over diverse input modality signals (i.e. text, image, video, audio, IMU motion sensor), and generates textual responses.
- DALL·E 3
- DALL_E_3_System_Card
- Prompt transformation makes ChatGPT OpenAI's covert moderator for DALL-E 3
- DALLE3 Gallery for October 2023: Share Your Creations
- 百万网友围观DALL-E 3新玩法!钢铁侠特斯拉皆“中招”,强迫症友好,博主分享提示词
- 用 DALLE3 画12页绘本制作全流程
- DALL·E 3辣眼图流出!OpenAI 22页报告揭秘:ChatGPT自动改写Prompt
- 45个 DALL-E 3 使用案例 (附提示词)
- DALLE-3 的紧箍咒
- 🌋 LLaVA: Large Language and Vision Assistant - liu/LLaVA)|[NeurIPS 2023 Oral] Visual Instruction Tuning: LLaVA (Large Language-and-Vision Assistant) built towards multimodal GPT-4 level capabilities.|-|
- CogVLM - of-the-art-level open visual language model.|CogVLM 是一个强大的开源视觉语言模型,利用视觉专家模块深度整合语言编码和视觉编码,在 14 项权威跨模态基准上取得了 SOTA 性能。目前仅支持英文,后续会提供中英双语版本支持,欢迎持续关注!|
- ![Star History Chart - history.com/#yzfly/Awesome-Multimodal-Prompts&Date)
Programming Languages
Keywords
instruction-tuning
2
multi-modality
2
chain-of-thought
1
in-context-learning
1
instruction-following
1
large-language-models
1
large-vision-language-model
1
large-vision-language-models
1
multimodal-chain-of-thought
1
multimodal-in-context-learning
1
multimodal-instruction-tuning
1
multimodal-large-language-models
1
visual-chain-of-thought
1
visual-in-context-learning
1
visual-instruction-tuning
1
chatbot
1
chatgpt
1
foundation-models
1
gpt-4
1
llama
1
llama-2
1
llama2
1
llava
1
multimodal
1
vision-language-model
1
visual-language-learning
1
cross-modality
1
language-model
1
multi-modal
1
pretrained-models
1
visual-language-models
1