CUDA版で「推論に失敗しました」というエラーが出る #611

kuroneko6423 · 2023-02-03T00:59:11Z

不具合の内容

起動しすぐにリクエストすると以下のエラーが出ました。

現象・ログ

WARNING: JPCommonLabel_insert_pause() in jpcommon_label.c: First mora should not be short pause.
INFO:     164.70.240.218:0 - "POST /audio_query?text='メッセージ非公開'&speaker=3 HTTP/1.1" 200 OK
2023-02-03T00:55:30.909944Z  WARN onnxruntime::onnxruntime: "CUDNN failure 1: CUDNN_STATUS_NOT_INITIALIZED ; GPU=0 ; hostname=voicevox ; expr=status_; "
2023-02-03T00:55:30.909983Z  WARN onnxruntime::onnxruntime: "Non-zero status code returned while running FusedConv node. Name:\'Conv_762\' Status Message: CUDNN failure 1: CUDNN_STATUS_NOT_INITIALIZED ; GPU=0 ; hostname=voicevox ; expr=status_; "
INFO:     164.70.240.218:0 - "POST /synthesis?speaker=3 HTTP/1.1" 500 Internal Server Error
ERROR:    Exception in ASGI application
Traceback (most recent call last):
  File "uvicorn/protocols/http/h11_impl.py", line 373, in run_asgi
  File "uvicorn/middleware/proxy_headers.py", line 75, in __call__
  File "fastapi/applications.py", line 208, in __call__
  File "starlette/applications.py", line 112, in __call__
  File "starlette/middleware/errors.py", line 181, in __call__
  File "starlette/middleware/errors.py", line 159, in __call__
  File "starlette/middleware/base.py", line 57, in __call__
  File "anyio/_backends/_asyncio.py", line 662, in __aexit__
  File "starlette/middleware/base.py", line 30, in coro
  File "starlette/middleware/cors.py", line 84, in __call__
  File "starlette/exceptions.py", line 82, in __call__
  File "starlette/exceptions.py", line 71, in __call__
  File "starlette/routing.py", line 656, in __call__
  File "starlette/routing.py", line 259, in handle
  File "starlette/routing.py", line 61, in app
  File "fastapi/routing.py", line 226, in app
  File "fastapi/routing.py", line 161, in run_endpoint_function
  File "starlette/concurrency.py", line 39, in run_in_threadpool
  File "anyio/to_thread.py", line 31, in run_sync
  File "anyio/_backends/_asyncio.py", line 937, in run_sync_in_worker_thread
  File "anyio/_backends/_asyncio.py", line 867, in run
  File "run.py", line 388, in synthesis
  File "voicevox_engine/synthesis_engine/synthesis_engine_base.py", line 242, in synthesis
  File "voicevox_engine/synthesis_engine/synthesis_engine.py", line 479, in _synthesis_impl
  File "voicevox_engine/synthesis_engine/core_wrapper.py", line 521, in decode_forward
Exception: 推論に失敗しました

再現手順

起動しすぐにリクエストすると以下のエラーが出ました。

期待動作

200で返され正常に読み上げの処理がされる。

VOICEVOXのバージョン

0.14.1

OSの種類/ディストリ/バージョン

Windows
macOS
Linux

OS: Ubuntu Server 22.04.1 LTS
GPU: RTX3060
RAM: 8GB

The text was updated successfully, but these errors were encountered:

0kq-github · 2023-02-03T11:16:14Z

Windows環境でも同様の現象が起きたので報告させていただきます。

ログ

ERROR:    Exception in ASGI application
Traceback (most recent call last):
  File "uvicorn\protocols\http\h11_impl.py", line 373, in run_asgi
  File "uvicorn\middleware\proxy_headers.py", line 75, in __call__
  File "fastapi\applications.py", line 208, in __call__
  File "starlette\applications.py", line 112, in __call__
  File "starlette\middleware\errors.py", line 181, in __call__
  File "starlette\middleware\errors.py", line 159, in __call__
  File "starlette\middleware\base.py", line 57, in __call__
  File "anyio\_backends\_asyncio.py", line 662, in __aexit__
  File "starlette\middleware\base.py", line 30, in coro
  File "starlette\middleware\cors.py", line 84, in __call__
  File "starlette\exceptions.py", line 82, in __call__
  File "starlette\exceptions.py", line 71, in __call__
  File "starlette\routing.py", line 656, in __call__
  File "starlette\routing.py", line 259, in handle
  File "starlette\routing.py", line 61, in app
  File "fastapi\routing.py", line 226, in app
  File "fastapi\routing.py", line 161, in run_endpoint_function
  File "starlette\concurrency.py", line 39, in run_in_threadpool
  File "anyio\to_thread.py", line 31, in run_sync
  File "anyio\_backends\_asyncio.py", line 937, in run_sync_in_worker_thread
  File "anyio\_backends\_asyncio.py", line 867, in run
  File "run.py", line 388, in synthesis
  File "voicevox_engine\synthesis_engine\synthesis_engine_base.py", line 242, in synthesis
  File "voicevox_engine\synthesis_engine\synthesis_engine.py", line 479, in _synthesis_impl
  File "voicevox_engine\synthesis_engine\core_wrapper.py", line 521, in decode_forward
Exception: 推論に失敗しました

OS: Windows10 Pro
GPU: Tesla P40
RAM: 32GB

Hiroshiba · 2023-02-03T11:40:07Z

@kuroneko6423 @0kq-github
起動後、どれくらい待つとエラーが発生しない感じでしょうか 👀

（待たないといけない仕様にし、それをREADMEや起動メッセージで案内するのが良いのかなと思っています。）

0kq-github · 2023-02-03T13:44:56Z

私の環境で10分から1時間ほど時間を置いてみましたが同じエラーが出てしまします。

qryxip · 2023-02-03T14:28:48Z

私のLinux環境でも同様ですね。というより今に至るまで0.14 CUDA版でのsynthesisに一度も成功していません。
「おま環」でなければLinux CUDA版が完全に壊れている可能性を考えつつも、調査はしていませんでした。

ちなみにですがonnxruntime::onnxruntime WARNはORT本体の"ERROR"にあたると思います。 (VOICEVOX/onnxruntime-rs#16)

(追記) 私の環境:

❯ uname -srvmpio
Linux 5.15.90-1-lts #1 SMP Tue, 24 Jan 2023 12:46:03 +0000 x86_64 unknown unknown GNU/Linux
❯ lsb_release -a
LSB Version:    n/a
Distributor ID: Arch
Description:    Arch Linux
Release:        rolling
Codename:       n/a
❯ nvidia-smi -L
GPU 0: NVIDIA GeForce RTX 3070 Laptop GPU (UUID: GPU-5876832d-66f9-86dd-44b2-24a4492aafe0)

kuroneko6423 · 2023-02-03T16:29:05Z

私の環境で10分から1時間ほど時間を置いてみましたが同じエラーが出てしまします。

同じくです

okaits · 2023-02-03T17:27:22Z

同じくです
~/.config/voicevoxを削除して設定し直したり、再インストールも試しましたが失敗しました。

環境:

Ubuntu 22.10 amd64
NVIDIA Driver 525.78.01
NVIDIA CUDA Toolkit 12.0
Linux 6.2.0-rc1

Hiroshiba · 2023-02-04T02:45:44Z

詳細なご報告ありがとうございます！！

正直なところ、原因は全く思い至っていません･･･。
@okaits さんは特に不思議で、nvidia-docker環境であれば動いたという報告も頂いています･･･。

CUDA版実行時に「Exception: 無効なmodel_indexです: 0」が出ることがある #585 (comment)

原因を探究したいのでよければご協力お願いします 🙇‍♂️

@kuroneko6423 さんはwindows環境とのことで、CUDA版onnxruntimeが想定するバージョンとcudnnのバージョンが一致していないというのが原因かもしれません。
その場合は、cudnnのバージョン8.5.0をダウンロードして頂き、onnxruntime.dllなどのdllファイルをrun.exeがあるディレクトリにコピー（既存の場合は上書き）すると解決するかもしれません･･･ 🙇‍♂️

現在バージョン0.14.2をビルド中です。これはCPUデバイス依存の最適化を施していたのを修正したものになり、もしかしたらCUDA版での不具合に関係があるかもしれません。
こちらにリリースされるので、お手数ですがまたお試し頂けると助かります 🙇‍♂️
https://github.com/VOICEVOX/voicevox_engine/releases/tag/0.14.2

kuroneko6423 · 2023-02-04T03:11:45Z

自分Linuxです()

Hiroshiba · 2023-02-04T03:14:07Z

あ！！本当ですね。。すみません、勝手に勘違いしました。。

0kq-github · 2023-02-04T03:38:12Z

0.14.2で試したところWindows環境で問題なく動作しました

kuroneko6423 · 2023-02-04T04:04:17Z

Linux(Ubuntu)でも無事動作いたしました！

Hiroshiba · 2023-02-04T08:36:30Z

おーーーー！！！！なるほどです、ご報告助かります！！

ではたぶん大丈夫になったということでいったんcloseしたいと思います。
また問題が起こったり、 @okaits さんの環境とかでうまくいかなかったりした場合はコメント頂けると助かります！

kuroneko6423 added the バグ label Feb 3, 2023

Hiroshiba changed the title ~~推論に失敗しました~~ 起動しすぐにリクエストすると「推論に失敗しました」というエラーが出る Feb 3, 2023

Hiroshiba changed the title ~~起動しすぐにリクエストすると「推論に失敗しました」というエラーが出る~~ CUDA版で「推論に失敗しました」というエラーが出る Feb 3, 2023

Hiroshiba mentioned this issue Feb 3, 2023

DirectML版の推論速度が遅いっぽい VOICEVOX/voicevox_core#422

Closed

3 tasks

Hiroshiba closed this as completed Feb 4, 2023

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

CUDA版で「推論に失敗しました」というエラーが出る #611

CUDA版で「推論に失敗しました」というエラーが出る #611

kuroneko6423 commented Feb 3, 2023 •

edited

Loading

0kq-github commented Feb 3, 2023

Hiroshiba commented Feb 3, 2023

0kq-github commented Feb 3, 2023 •

edited

Loading

qryxip commented Feb 3, 2023 •

edited

Loading

kuroneko6423 commented Feb 3, 2023

okaits commented Feb 3, 2023

Hiroshiba commented Feb 4, 2023 •

edited

Loading

kuroneko6423 commented Feb 4, 2023

Hiroshiba commented Feb 4, 2023

0kq-github commented Feb 4, 2023

kuroneko6423 commented Feb 4, 2023

Hiroshiba commented Feb 4, 2023

CUDA版で「推論に失敗しました」というエラーが出る #611

CUDA版で「推論に失敗しました」というエラーが出る #611

Comments

kuroneko6423 commented Feb 3, 2023 • edited Loading

不具合の内容

現象・ログ

再現手順

期待動作

VOICEVOXのバージョン

OSの種類/ディストリ/バージョン

0kq-github commented Feb 3, 2023

ログ

Hiroshiba commented Feb 3, 2023

0kq-github commented Feb 3, 2023 • edited Loading

qryxip commented Feb 3, 2023 • edited Loading

kuroneko6423 commented Feb 3, 2023

okaits commented Feb 3, 2023

Hiroshiba commented Feb 4, 2023 • edited Loading

kuroneko6423 commented Feb 4, 2023

Hiroshiba commented Feb 4, 2023

0kq-github commented Feb 4, 2023

kuroneko6423 commented Feb 4, 2023

Hiroshiba commented Feb 4, 2023

kuroneko6423 commented Feb 3, 2023 •

edited

Loading

0kq-github commented Feb 3, 2023 •

edited

Loading

qryxip commented Feb 3, 2023 •

edited

Loading

Hiroshiba commented Feb 4, 2023 •

edited

Loading