
Quantized prosody and phone features have different dimensions #3

Open
wcr369 opened this issue Apr 16, 2024 · 2 comments

Comments


wcr369 commented Apr 16, 2024

When running `vq_post_emb_a, vq_id_a, _, quantized, spk_embs_a = fa_decoder_v2(enc_out_a, prosody_a, eval_vq=False, vq=True)`, the quantized phone features are one element longer in the last dimension than the prosody features, which causes an error at `outs += out`. Is this a bug?


isMoJo commented Apr 24, 2024

I ran into the same problem. Looking for an answer~~


synthere commented Aug 15, 2024

I found that some files trigger this problem. Adding padding in `forward` fixed it for me:

+            # Right-pad prosody_feature along the time axis to match x.
+            # Create pads on the same device and with the same dtype so
+            # torch.cat does not fail on GPU or mixed-precision runs.
+            pads = torch.zeros(
+                [prosody_feature.shape[0], prosody_feature.shape[1],
+                 x.shape[-1] - prosody_feature.shape[-1]],
+                device=prosody_feature.device, dtype=prosody_feature.dtype,
+            )
+            prosody_feature = torch.cat([prosody_feature, pads], dim=2)

            x_timbre = x

            outs, qs, commit_loss, quantized_buf = self.quantize(
                x, prosody_feature, n_quantizers=n_quantizers
            )
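The fix above is just a right-pad along the last axis so the two feature tensors line up before quantization. A minimal standalone sketch of the same idea, using NumPy instead of PyTorch (shapes and names here are illustrative, not the repo's actual ones):

```python
import numpy as np

def pad_to_match(prosody, x):
    """Right-pad `prosody` with zeros along the last axis so its length
    matches `x`'s last axis. Assumes x is at least as long as prosody."""
    pad_len = x.shape[-1] - prosody.shape[-1]
    if pad_len > 0:
        pads = np.zeros(prosody.shape[:-1] + (pad_len,), dtype=prosody.dtype)
        prosody = np.concatenate([prosody, pads], axis=-1)
    return prosody

# Phone features one frame longer than prosody, as described in the issue.
x = np.ones((2, 80, 101))
prosody = np.ones((2, 80, 100))
padded = pad_to_match(prosody, x)
print(padded.shape)  # (2, 80, 101)
```

The trailing frame is zero-filled, which matches the workaround above; whether zero-padding (rather than truncating `x`, or fixing the upstream frame alignment) is the right long-term fix depends on how the mismatch arises.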
