Multimodal AI

LLaVA Vision-Language Model - Conversational AI with Image Understanding

LLaVA is a vision-language model for conversational AI with image understanding. It combines a visual encoder with a language model to support image-based conversation and reasoning, and is suited to education, customer service, and similar scenarios.

LLaVA, Vision-Language, Conversational AI, Image Understanding

File Size

15.3 GB

Upload Date

2025-04-13

Downloads

16,800

Rating

4.7/5.0

Download Resources

By downloading this resource, you agree to our Terms of Service and Privacy Policy.

Flamingo is an advanced multimodal model for visual-language understanding. It can answer questions about images, describe visual content, and perform a range of vision-language tasks.

Flamingo, Visual-Language, Understanding Model

14.2 GB | 2025-02-07

Multimodal AI Model Resources - Joint Image-Text Understanding Model

Multimodal AI model resources for joint understanding of images and text. They support image captioning, visual question answering, and image-text retrieval for cross-modal AI applications.

Multimodal, Image Understanding, Text Understanding

15.4 GB | 2024-01-05

CoCa Multimodal Generative Model - Joint Image-Text Generation

CoCa is a multimodal generative model for joint image-text generation. It combines image encoding with text generation in a single model for efficient vision-language understanding and generation, and is suited to content creation and image editing.

CoCa, Multimodal, Image-Text

8.7 GB | 2025-04-11

Recommended Related Resources