福模

Free Open-Source AI Model Downloads | Local AI Tool Resource Platform

Multimodal AI

CoCa Multimodal Generative Model - Joint Image-Text Generation

CoCa is a multimodal generative model for joint image-text generation. It combines image encoding with text generation to support efficient vision-language understanding and generation, and is suited to content creation and image editing.

CoCa, Multimodal, Image-Text, Content Generation

File Size

8.7 GB

Upload Date

2025-04-11

Downloads

11,200

Rating

4.4/5.0

Download Resources

By downloading this resource, you agree to our Terms of Service and Privacy Policy.

Related Resources

Flamingo Multimodal AI Model - Visual-Language Understanding

Flamingo is an advanced visual-language understanding model. It can answer questions about images, describe visual content, and perform a range of vision-language tasks.

Flamingo, Visual-Language, Understanding Model
14.2 GB | 2025-02-07
MUSE Multimodal AI Generation Model - High-Quality Text-to-Image Synthesis

MUSE is a Transformer-based system for high-quality text-to-image generation. It combines the strengths of diffusion models and Transformers.

MUSE, Multimodal, Text-to-Image
18.7 GB | 2025-02-03
PaLI Vision-Language Model - End-to-End Language-Image Understanding

PaLI is a vision-language model for end-to-end language-image understanding. It uses a unified architecture to support tasks including image classification, visual question answering, and image captioning, with strong performance across them.

Vision-Language, PaLI, End-to-End
18.9 GB | 2025-03-13