HuBERT Speech Representation Learning Model - Self-Supervised Speech Representation Learning
HuBERT (Hidden-Unit BERT) is a self-supervised speech representation learning model proposed by Facebook AI. It masks spans of the input and trains the model to predict the cluster assignments (hidden units) of the masked frames. The cluster targets come from offline k-means and are refined across training iterations, so later iterations learn from better targets.
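The cluster-target and masking idea in the description can be illustrated with a minimal sketch. This is not the released implementation: the function names (`kmeans_labels`, `span_mask`, `masked_cross_entropy`) are hypothetical, the frames are toy feature vectors rather than real acoustic features, and the `logits` argument stands in for the output of the actual encoder. It shows only the three ingredients: clustering frames into pseudo-labels, masking spans of frames, and scoring predictions on the masked frames.

```python
import math
import random


def kmeans_labels(frames, k, iters=10, seed=0):
    """Cluster frames (lists of floats) with plain k-means; return one label per frame."""
    rng = random.Random(seed)
    centroids = [list(f) for f in rng.sample(frames, k)]
    labels = [0] * len(frames)
    for _ in range(iters):
        # Assignment step: nearest centroid by squared Euclidean distance.
        for i, f in enumerate(frames):
            labels[i] = min(
                range(k),
                key=lambda c: sum((a - b) ** 2 for a, b in zip(f, centroids[c])),
            )
        # Update step: mean of the assigned frames (keep the old centroid if empty).
        for c in range(k):
            members = [f for f, lab in zip(frames, labels) if lab == c]
            if members:
                centroids[c] = [sum(col) / len(members) for col in zip(*members)]
    return labels


def span_mask(num_frames, span, num_spans, seed=0):
    """Mark frames covered by randomly placed spans (spans may overlap)."""
    rng = random.Random(seed)
    mask = [False] * num_frames
    for _ in range(num_spans):
        start = rng.randrange(0, num_frames - span + 1)
        for t in range(start, start + span):
            mask[t] = True
    return mask


def masked_cross_entropy(logits, targets, mask):
    """Mean cross-entropy between logits and cluster targets, over masked frames only."""
    total, count = 0.0, 0
    for row, y, m in zip(logits, targets, mask):
        if not m:
            continue
        mx = max(row)
        log_z = mx + math.log(sum(math.exp(v - mx) for v in row))
        total += log_z - row[y]
        count += 1
    return total / count if count else 0.0


# Toy usage: 20 two-dimensional frames, 3 clusters, two masked spans of 4 frames.
frames = [[float(i % 4), float(i // 10)] for i in range(20)]
targets = kmeans_labels(frames, k=3)
mask = span_mask(len(frames), span=4, num_spans=2)
# Placeholder predictions: uniform logits, as an untrained model might produce.
logits = [[0.0, 0.0, 0.0] for _ in frames]
loss = masked_cross_entropy(logits, targets, mask)
```

With uniform logits the loss equals the log of the number of clusters, which is a convenient sanity check. In an iterative setup, the clustering step would be rerun on features from a trained model to produce the next round of targets.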
File Size
1.9 GB
Upload Date
2025-01-24
Downloads
13,400
Rating
4.6/5.0
Related Resources
Voice cloning model that can clone a voice from a 5-second audio sample. Supports high-fidelity voice replication for dubbing, virtual streamers, voice assistants, and similar applications, and includes detailed training tutorials.
Whisper speech recognition model, a multilingual automatic speech recognition system developed by OpenAI. Supports high-accuracy speech-to-text in 99 languages, handles varied accents, background noise, and specialized terminology, and is widely used for subtitle generation and voice assistants.