福模

免费开源AI模型下载_本地AI工具资源平台

多模态AIMultimodal AI

ALIGN多模态AI模型 - 大规模图像文本对齐

ALIGN Multimodal AI Model - Large-Scale Image-Text Alignment

ALIGN多模态AI模型,利用大规模图像文本对进行对比学习。在多个视觉语言任务中取得了优异成果,支持图像检索和文本生成。

ALIGN multimodal AI model, utilizing large-scale image-text pairs for contrastive learning. Achieves excellent results in multiple vision-language tasks, supporting image retrieval and text generation.

ALIGN多模态图像文本对比学习ALIGNMultimodalImage-TextContrastive Learning

文件大小

5.6 GB

Upload Size

5.6 GB

上传日期

2024-12-15

Upload Date

2024-12-15

下载次数

12,700

Downloads

12,700

评分

4.7/5.0

Rating

4.7/5.0

下载资源 Download Resources

下载资源表示您同意我们的使用条款和隐私政策

By downloading this resource, you agree to our Terms of Service and Privacy Policy

相关资源推荐

LLaVA视觉语言模型 - 融合图像理解的对话AILLaVA Vision-Language Model - Conversational AI with Image Understanding

LLaVA视觉语言模型,融合图像理解的对话AI。将视觉编码器与语言模型相结合,支持图像相关的对话和推理,适用于教育、客户服务等场景。

LLaVA vision-language model, conversational AI with image understanding. Combines visual encoder with language model, supports image-related conversations and reasoning, suitable for educational, customer service and other scenarios.

LLaVA视觉语言对话AILLaVAVision-LanguageConversational AI
15.3 GB2025-04-13
多模态 AI 模型资源 - 图像文本联合理解模型Multimodal AI Model Resources - Joint Image-Text Understanding Model

多模态AI模型资源,实现图像与文本的联合理解。支持图像描述、视觉问答、图文检索等任务,为跨模态AI应用提供强大支持。

Multimodal AI model resources that enable joint understanding of images and text. Supports tasks such as image captioning, visual question answering, and image-text retrieval, providing strong support for cross-modal AI applications.

多模态图像理解文本理解MultimodalImage UnderstandingText Understanding
15.4 GB2024-01-05
PaLI视觉语言模型 - 端到端语言图像理解PaLI Vision-Language Model - End-to-End Language-Image Understanding

PaLI视觉语言模型,实现端到端语言图像理解。支持图像分类、视觉问答、图像描述等多种任务,具有统一的架构和优秀的性能。

PaLI vision-language model, achieving end-to-end language-image understanding. Supports multiple tasks including image classification, visual question answering, and image captioning, with a unified architecture and excellent performance.

视觉语言PaLI端到端Vision-LanguagePaLIEnd-to-End
18.9 GB2025-03-13