
Is a pretrained model of Tacotron2 + LPCNet available? #1

Open
mrgloom opened this issue May 16, 2019 · 24 comments

Comments

@mrgloom

mrgloom commented May 16, 2019

Is a pretrained model of Tacotron2 + LPCNet available?

@MlWoo
Owner

MlWoo commented May 16, 2019

We have pretrained models of both, trained on a Mandarin dataset, but I think it would not be legal to release them personally without my company's permission. It is not hard to train the models on your own materials by following the steps in the README. I will rerun the training and synthesis procedures to make sure they are correct.

@sheepHavingPurpleLeaf

Hi @MlWoo, I have trained Tacotron2 for about 21,000 steps on a female Mandarin dataset and connected it to LPCNet. At 21,000 steps the loss is about 0.33 and the decoder is already aligned with the encoder. The output wav has really bad quality: a large portion of the sentence is silence, and I cannot even tell the speaker's gender from the voiced parts. How many steps did you train Tacotron to achieve good sound?

@MlWoo
Owner

MlWoo commented May 25, 2019

I have pointed out that the vocoder's quality is sensitive to the estimation of the pitch parameters. Maybe you could achieve it with 210,000 steps. We use different params to train Tacotron2, and I don't think they hold much meaningful info for you. Our loss is less than 0.1.

@sheepHavingPurpleLeaf

@MlWoo I waited for another 10k steps; the loss stays above 0.3. Any advice on Taco2 params? Thanks in advance.

@superhg2012

Hi @MlWoo, did you train T2 with a 16k Mandarin dataset?

@MlWoo
Owner

MlWoo commented May 31, 2019

@superhg2012 yes.

@superhg2012

@sheepHavingPurpleLeaf Which Tacotron repo did you use? Any better results?

@superhg2012

@MlWoo The audio processing parameters are not used when training T2 with .f32 feature files. I tried different hparams but can only reach a loss of 0.2. Did you adjust the T2 network params? Thanks in advance!
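For concreteness, loading such a .f32 feature file reduces to reading raw float32 values and reshaping by the frame width. A minimal sketch (the 55-value frame width is an assumption and must match the feature extractor's output dimension):

```python
import numpy as np

def load_f32_features(path, nb_features=55):
    """Load an LPCNet-style .f32 feature file as a (frames, nb_features) array.

    The file is assumed to hold raw little-endian float32 values with
    nb_features values per frame (55 here is an assumption; check the
    output dimension of your feature extractor).
    """
    data = np.fromfile(path, dtype=np.float32)
    if data.size % nb_features != 0:
        raise ValueError("file size is not a multiple of nb_features")
    return data.reshape(-1, nb_features)
```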

@MlWoo
Owner

MlWoo commented Jun 10, 2019

@superhg2012 That loss is measured in teacher-forcing mode.

@superhg2012

@MlWoo Thanks!! Between constant and scheduled teacher forcing, which mode do you prefer?

@MlWoo
Owner

MlWoo commented Jun 10, 2019

@superhg2012 constant mode.
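For readers unfamiliar with the two modes, a minimal sketch of the difference (the function name, default ratio, and decay schedule are illustrative assumptions, not values from this repo):

```python
def teacher_forcing_ratio(step, mode="constant",
                          ratio=1.0, decay_start=10000, decay_steps=40000):
    """Illustrative teacher-forcing schedule (names/values are assumptions).

    constant:  the decoder is fed ground-truth frames with a fixed
               probability `ratio` for the whole training run.
    scheduled: the ratio decays linearly to 0 after `decay_start` steps,
               so the decoder gradually relies on its own predictions.
    """
    if mode == "constant":
        return ratio
    if step < decay_start:
        return ratio
    progress = (step - decay_start) / decay_steps
    return max(0.0, ratio * (1.0 - progress))
```

Note that a loss reported under constant teacher forcing (as above) is not directly comparable to one from a scheduled run, since the decoder sees ground-truth inputs at every step.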

@superhg2012

Many thanks!!

@superhg2012

superhg2012 commented Jun 13, 2019

I trained T2 for 130k steps and the lowest loss value is 0.13, but the synthesized audio is still not as good as expected. Is some post-processing needed?

demo.zip

@MlWoo I think LPCNet is OK; the cause is the pitch parameters predicted by Tacotron2. Could you give some suggestions?

@sheepHavingPurpleLeaf

sheepHavingPurpleLeaf commented Jun 13, 2019

@superhg2012 Can you share your hparams? Are you training with pinyin or phonemes?

@MlWoo
Owner

MlWoo commented Jun 13, 2019

@estherxue Could you post your samples?

@superhg2012

@sheepHavingPurpleLeaf I used pinyin to train Tacotron2, and the parameters are the common ones.

@sheepHavingPurpleLeaf

@superhg2012 I have got similar results to yours. Did you train LPCNet with the English dataset provided in Mozilla's repo, or did you use your Mandarin dataset?

@superhg2012

@sheepHavingPurpleLeaf I used the same Mandarin dataset for both LPCNet and Tacotron2. The sound quality is almost the same while the loss is around 0.13–0.17.

@estherxue

Hi, here are my samples trained with Tacotron 2 + LPCNet.
tacotron2+lpcnet.zip

@superhg2012

superhg2012 commented Jun 13, 2019

@estherxue Hi, the samples sound good. I have several questions:
1. Are you using pinyin or phonemes to train?
2. Are you using the same dataset to train both T2 and LPCNet?
3. How many steps did it take to train the T2 part, and what was the final loss?
4. Are you training in GTA mode?

thanks in advance!!

@MlWoo
Owner

MlWoo commented Jun 13, 2019

@superhg2012 Our team (Xue is my colleague) does not use any other tricks to train Tacotron2.

  1. pinyin
  2. the same dataset
  3. 280k steps, if I remember correctly, and the loss is about 0.1. The learning-rate scheduling may not match the current T2 repo because that repo was updated recently.
  4. No GTA. If you want to use GTA mode, there is a lot of tricky work to be done (like rounding the audio to whole frames).
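The "rounding the audio to whole frames" step in point 4 can be sketched as trimming the waveform to an exact multiple of the vocoder's frame hop (the 160-sample hop, i.e. 10 ms at 16 kHz, is an assumption; match your vocoder's frame size):

```python
import numpy as np

def round_to_frames(audio, hop_size=160):
    """Trim audio so its length is an exact multiple of the frame hop.

    This keeps the waveform and the frame-level features aligned when
    building ground-truth-aligned (GTA) training pairs. hop_size=160
    corresponds to 10 ms frames at 16 kHz (an assumption; use your
    vocoder's actual frame size).
    """
    n_frames = len(audio) // hop_size
    return audio[: n_frames * hop_size]
```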

@superhg2012

@MlWoo Got it, thanks for the kind reply!!

@ajaysg-zz

How long does it take to synthesize on GPU and on CPU?

@estherxue

estherxue commented Oct 12, 2019 via email
