One Paper Explained in One Slide
- NExT-GPT: Any-to-Any Multimodal LLM [Paper]
- LLaVA-Plus: Learning to Use Tools for Creating Multimodal Agents [Paper]
- EASYTOOL: Enhancing LLM-based Agents with Concise Tool Instruction [Paper]
- MM-REACT: Prompting ChatGPT for Multimodal Reasoning and Action [Paper]