We read every piece of feedback, and take your input very seriously.
To see all available qualifiers, see our documentation.
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
我这边使用一小段音频作为 source 音频 和 target 音频尝试歌声转化,感觉效果不太好,具体推理过程如下:
source 歌声音频、target 歌声音频](url)以及生成的歌声音频:https://pan.quark.cn/s/5b438a7ff8bd
python inference.py --source ./dataset/这世界那么多人.m4a \ --target ./dataset/像我这样的人.m4a \ --output ./reconstructed \ --diffusion-steps 30 \ --f0-condition True \ --length-adjust 1.0 \ --inference-cfg-rate 0.7 \ --semi-tone-shift 0 \ --checkpoint ./pth/DiT_seed_v2_uvit_whisper_base_f0_44k_bigvgan_pruned_ft_ema.pth \ --config ./configs/presets/config_dit_mel_seed_uvit_whisper_base_f0_44k.yml
The text was updated successfully, but these errors were encountered:
Sorry, something went wrong.
你的 source 音频和 target 音频都是有背景音乐的,歌声转换之前先用音乐分离模型把人声分离出来,这样效果会好很多
请问下有没有比较好的音乐分离模型
你的 source 音频和 target 音频都是有背景音乐的,歌声转换之前先用音乐分离模型把人声分离出来,这样效果会好很多 请问下有没有比较好的音乐分离模型
请问找到了吗
No branches or pull requests
我这边使用一小段音频作为 source 音频 和 target 音频尝试歌声转化,感觉效果不太好,具体推理过程如下:
The text was updated successfully, but these errors were encountered: