Skip to content

大佬好,seed-vc 可以用来训练小语种吗, 比如阿语。 如果可以需要准备多少时长的语料? 谢谢 #171

New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Open
cillywill opened this issue Apr 22, 2025 · 1 comment

Comments

@cillywill
Copy link

No description provided.

@Plachtaa
Copy link
Owner

Plachtaa commented Apr 22, 2025

在做出训练或finetune的决定之前建议你检查以下事项:

  1. 当前release的模型是否在你的期望语种上表现不佳(如发音不准或口音过重)?鉴于V1模型训练数据囊括6种,V2模型训练数据囊括20+种语言,二者的预训练模型大多数情况下足以应对大多数语言;
  2. 训练数据音色是否足够diverse?这和zero shot表现密切相关,如不关心可忽略;
  3. 如果仅在单一语种上训练极大可能将导致在其它语言或跨语言任务上表现不佳
    如果你在检查完毕之后仍然希望微调或从零开始训练,建议准备至少100h+小时数据
    若微调,建议数据量和训练数据成正比,尽量不要超过2个epoch
    若从零开始,我将假设你经验丰富,这种情况下请自行探究

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants