福模

Free Open-Source AI Model Downloads - Local AI Tool Resource Platform

Speech AI

HuBERT Speech Representation Learning Model - Unsupervised Speech Representation Learning

HuBERT (Hidden-Unit BERT) is an unsupervised (self-supervised) speech representation learning model proposed by Facebook. It learns speech representations through masked prediction of cluster labels: spans of the input audio are masked, and the model predicts a cluster assignment for each masked frame, with the cluster targets produced by an offline clustering step.
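As a rough illustration of the cluster-based masked prediction idea in the description above, here is a minimal sketch in plain Python. It is not the released model or its training code: the toy 2-D frames, the fixed centroids, and the helper names (`nearest_centroid`, `span_mask`, `masked_cross_entropy`) are all assumptions made for the example, and the encoder network that would normally produce the logits is left out.

```python
import math

def nearest_centroid(frame, centroids):
    """Return the index of the centroid closest to `frame` (squared Euclidean distance)."""
    dists = [sum((f - c) ** 2 for f, c in zip(frame, centroid)) for centroid in centroids]
    return dists.index(min(dists))

def span_mask(num_frames, span_start, span_len):
    """Boolean mask that is True for one contiguous span of frames."""
    return [span_start <= t < span_start + span_len for t in range(num_frames)]

def masked_cross_entropy(logits, targets, mask):
    """Average cross-entropy between logits and cluster targets, over masked frames only."""
    losses = []
    for frame_logits, target, is_masked in zip(logits, targets, mask):
        if not is_masked:
            continue  # unmasked frames do not contribute in this sketch
        peak = max(frame_logits)  # subtract the max for numerical stability
        log_norm = peak + math.log(sum(math.exp(z - peak) for z in frame_logits))
        losses.append(log_norm - frame_logits[target])
    return sum(losses) / len(losses)

# Hypothetical toy data: six 2-D "acoustic frames" and three fixed cluster centroids.
centroids = [[0.0, 0.0], [1.0, 1.0], [-1.0, 1.0]]
frames = [[0.1, -0.1], [0.9, 1.2], [1.1, 0.8], [-0.8, 1.1], [0.0, 0.2], [-1.2, 0.9]]

# Step 1: offline clustering assigns each frame a discrete target label.
targets = [nearest_centroid(frame, centroids) for frame in frames]

# Step 2: mask a contiguous span of frames (frames 1, 2, and 3 here).
mask = span_mask(len(frames), span_start=1, span_len=3)

# Step 3: score predictions on the masked frames. A real model would compute
# these logits from the masked input; uniform logits stand in as a placeholder.
uniform_logits = [[0.0] * len(centroids) for _ in frames]
loss = masked_cross_entropy(uniform_logits, targets, mask)
print(targets, mask, loss)
```

With uniform logits the loss equals the log of the number of clusters, which is a handy baseline: a trained predictor should score below it on the masked frames.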

HuBERT, Speech Representation, Unsupervised Learning, Speech Recognition

File Size

1.9 GB

Upload Date

2025-01-24

Downloads

13,400

Rating

4.6/5.0

Download Resources

By downloading this resource, you agree to our Terms of Service and Privacy Policy.

Recommended Related Resources

Voice Cloning Model Download - Clone a Voice from a 5-Second Audio Sample

Voice cloning model that can clone a voice from just a 5-second audio sample. It supports high-fidelity voice replication, suits applications such as dubbing, virtual streamers, and voice assistants, and includes a detailed training tutorial.

Voice Cloning, Voice Replication, TTS
3.2 GB | 2024-01-06
Whisper Speech Recognition Model - Multilingual Automatic Speech Recognition System

Whisper is a multilingual automatic speech recognition (ASR) system developed by OpenAI. It supports high-accuracy speech-to-text conversion in 99 languages, copes with varied accents, background noise, and specialized terminology, and is widely used for subtitle generation and voice assistants.

Whisper, Speech Recognition, Multilingual
2.8 GB | 2025-01-08
Voice Cloning Model Download - Clone Voices with Just 5 Seconds of Audio

Voice cloning model that needs only 5 seconds of audio to clone a voice. It supports high-fidelity voice replication, suits dubbing, virtual streamers, voice assistants, and similar applications, and includes a detailed training tutorial.

Voice Cloning, Voice Replication, TTS
3.2 GB | 2025-04-19