Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Future Work - Models #67

Open
AI-Guru opened this issue Apr 27, 2023 · 0 comments
Open

Future Work - Models #67

AI-Guru opened this issue Apr 27, 2023 · 0 comments

Comments

@AI-Guru
Copy link
Contributor

AI-Guru commented Apr 27, 2023

Hi!

I am very curious about the future work part of the paper.

There were a few suggestions in the paper. Let me talk about two.

1. Use perceptual losses.

You have just merged a PR that allows for loss customization. Which perceptual loss did you have in mind when you wrote the suggestion?

2. Using mel spectrograms instead of magnitude spectrograms as input.

dmae1d-ATC64-v2 Uses the magnitude spectrogram.

What would be a good mel feature extractor?

I sometimes ran into this one but I would like to know what you think about it:

encoder=MelE1d( # The encoder used, in this case a mel-spectrogram encoder
                in_channels=in_channels,
                channels=512,
                multipliers=[1, 1],
                factors=[2],
                num_blocks=[12],
                out_channels=32,
                mel_channels=80,
                mel_sample_rate=48000,
                mel_normalize_log=True,
                bottleneck=TanhBottleneck(),
            ),

I believe it extracts a lot of features, thus putting a strain on the GPU.

Curious what you have to say about 1 and 2.

Cheers,
Tristan

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

1 participant