推理速度超级慢？！ #48

chaorenai · 2024-11-13T13:24:53Z

我用的是4090，但是GPU只用了一点点，效果很棒，但是推理速度太慢了。是之前使用rvc模型推理时间的100倍都不止……是我哪里用错了吗？

Plachtaa · 2024-11-13T13:38:45Z

如果你需要debug帮助，希望你可以提供：

使用的是哪个script
是否是singing voice conversion
torch.cuda.available()是否返回True
source & target音频文件

teressawang · 2024-11-14T13:41:25Z

我也是，推理速度才 2.58it/s ，用的是app.py 的推理脚本， T4 ，torch.cuda.available() 返回True

teressawang · 2024-11-14T13:41:30Z

Guessed Channel Layout for Input Stream #0.0 : stereo
Input #0, wav, from 'donnachen.wav':
Metadata:
encoder : Lavf58.45.100
Duration: 00:00:20.15, bitrate: 1411 kb/s
Stream #0:0: Audio: pcm_s16le ([1][0][0][0] / 0x0001), 44100 Hz, stereo, s16, 1411 kb/s

Plachtaa · 2024-11-14T20:23:42Z

我也是，推理速度才 2.58it/s ，用的是app.py 的推理脚本， T4 ，torch.cuda.available() 返回True

T4上这个速度是正常的，svc的模型本身参数也更多

Nuyoah111111 · 2024-11-15T08:23:48Z

我也是，推理速度才 2.58it/s ，用的是app.py 的推理脚本， T4 ，torch.cuda.available() 返回True

T4上这个速度是正常的，svc的模型本身参数也更多

我也是运行的app.py这个脚本我是v100的gpu 速度也很慢

zhixianjuli · 2024-11-20T08:35:13Z

You can uninstall torch torchvision torchaudio. And then install as
pip install torch torchvision torchaudio --index-url https://download.pytorch.org/whl/cu121

teressawang · 2024-11-20T09:21:44Z

You can uninstall torch torchvision torchaudio. And then install as pip install torch torchvision torchaudio --index-url https://download.pytorch.org/whl/cu121

not work for me ，but thx～

Bella-Tim · 2024-11-26T08:05:34Z

另外一个文本转语音的开源项目使用了融合cuda技术来实现推理加速，这个项目是否能够探索一下使用这个技术的可行性。

https://github.com/fishaudio/fish-speech/blob/main/docs/zh/inference.md
您可能希望使用 --compile 来融合 cuda 内核以实现更快的推理 (~30 个 token/秒 -> ~500 个 token/秒).
对应的, 如果你不打算使用加速, 你可以注释掉 --compile 参数.
@Plachtaa

Plachtaa · 2024-11-26T08:09:05Z

另外一个文本转语音的开源项目使用了融合cuda技术来实现推理加速，这个项目是否能够探索一下使用这个技术的可行性。

https://github.com/fishaudio/fish-speech/blob/main/docs/zh/inference.md 您可能希望使用 --compile 来融合 cuda 内核以实现更快的推理 (~30 个 token/秒 -> ~500 个 token/秒). 对应的, 如果你不打算使用加速, 你可以注释掉 --compile 参数. @Plachtaa

只有linux能用compile， windows不可以

Bella-Tim · 2024-11-26T08:11:41Z

那个项目我也在Windows上尝试了一下，但是在compile的时候报了语法错误，不过我目前还是认为是我的cuda环境配置问题引起的。

Plachtaa · 2024-11-26T08:13:46Z

那个项目我也在Windows上尝试了一下，但是在compile的时候报了语法错误，不过我目前还是认为是我的cuda环境配置问题引起的。

不是你的环境问题，是因为triton本身没有windows的GPU构筑，所以本质上是不可行的，详细请自己看torch.compile的文档

Plachtaa · 2024-11-29T05:10:52Z

现在增添了默认开启fp16推理，大概能提速一倍

Plachtaa added the enhancement New feature or request label Nov 29, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

推理速度超级慢？！ #48

推理速度超级慢？！ #48

chaorenai commented Nov 13, 2024

Plachtaa commented Nov 13, 2024

teressawang commented Nov 14, 2024

teressawang commented Nov 14, 2024

Plachtaa commented Nov 14, 2024 •

edited

Loading

Nuyoah111111 commented Nov 15, 2024

zhixianjuli commented Nov 20, 2024

teressawang commented Nov 20, 2024

Bella-Tim commented Nov 26, 2024

Plachtaa commented Nov 26, 2024

Bella-Tim commented Nov 26, 2024

Plachtaa commented Nov 26, 2024 •

edited

Loading

Plachtaa commented Nov 29, 2024

推理速度超级慢？！ #48

推理速度超级慢？！ #48

Comments

chaorenai commented Nov 13, 2024

Plachtaa commented Nov 13, 2024

teressawang commented Nov 14, 2024

teressawang commented Nov 14, 2024

Plachtaa commented Nov 14, 2024 • edited Loading

Nuyoah111111 commented Nov 15, 2024

zhixianjuli commented Nov 20, 2024

teressawang commented Nov 20, 2024

Bella-Tim commented Nov 26, 2024

Plachtaa commented Nov 26, 2024

Bella-Tim commented Nov 26, 2024

Plachtaa commented Nov 26, 2024 • edited Loading

Plachtaa commented Nov 29, 2024

Plachtaa commented Nov 14, 2024 •

edited

Loading

Plachtaa commented Nov 26, 2024 •

edited

Loading