Ecosyste.ms: Awesome

An open API service indexing awesome lists of open source software.

Awesome-Multimodal-Prompts

Prompts of GPT-4V & DALL-E3 to full utilize the multi-modal ability. GPT4V Prompts, DALL-E3 Prompts.
https://github.com/langgptai/Awesome-Multimodal-Prompts

ChatGPT can now see, hear, and speak
Awesome-Multimodal-Large-Language-Models - Multimodal-Large-Language-Models)
The Dawn of LMMs: Preliminary Explorations with GPT-4V(ision)
试过GPT-4V后，微软写了个166页的测评报告，业内人士：高级用户必读 - 4V-zh.pdf)
ChatGPT多模态解禁，网友玩疯！拍图即生代码，古卷手稿一眼识别，图表总结超6
AnyMAL: An Efficient and Scalable Any-Modality Augmented Language Model - Modality Augmented Language Model (AnyMAL), a unified model that reasons over diverse input modality signals (i.e. text, image, video, audio, IMU motion sensor), and generates textual responses.
DALL·E 3
DALL_E_3_System_Card
Prompt transformation makes ChatGPT OpenAI's covert moderator for DALL-E 3
DALLE3 Gallery for October 2023: Share Your Creations
百万网友围观DALL-E 3新玩法！钢铁侠特斯拉皆“中招”，强迫症友好，博主分享提示词
用 DALLE3 画12页绘本制作全流程
DALL·E 3辣眼图流出！OpenAI 22页报告揭秘：ChatGPT自动改写Prompt
45个 DALL-E 3 使用案例 (附提示词)
DALLE-3 的紧箍咒
🌋 LLaVA: Large Language and Vision Assistant - liu/LLaVA)|[NeurIPS 2023 Oral] Visual Instruction Tuning: LLaVA (Large Language-and-Vision Assistant) built towards multimodal GPT-4 level capabilities.|-|
CogVLM - of-the-art-level open visual language model.|CogVLM 是一个强大的开源视觉语言模型，利用视觉专家模块深度整合语言编码和视觉编码，在 14 项权威跨模态基准上取得了 SOTA 性能。目前仅支持英文，后续会提供中英双语版本支持，欢迎持续关注！|
![Star History Chart - history.com/#yzfly/Awesome-Multimodal-Prompts&Date)

Programming Languages

Python 2

Keywords

instruction-tuning 2 multi-modality 2 chain-of-thought 1 in-context-learning 1 instruction-following 1 large-language-models 1 large-vision-language-model 1 large-vision-language-models 1 multimodal-chain-of-thought 1 multimodal-in-context-learning 1 multimodal-instruction-tuning 1 multimodal-large-language-models 1 visual-chain-of-thought 1 visual-in-context-learning 1 visual-instruction-tuning 1 chatbot 1 chatgpt 1 foundation-models 1 gpt-4 1 llama 1 llama-2 1 llama2 1 llava 1 multimodal 1 vision-language-model 1 visual-language-learning 1 cross-modality 1 language-model 1 multi-modal 1 pretrained-models 1 visual-language-models 1