When explaining Activation:
> Activation: The RoBERTa uses a GELU activation function. We can implement the GELU using a similar approach as dropout above with no input params. Candle tensors have an inbuilt module to perform this operation
After that it continues to say:
> Candle: In candle we can implement the dropout layer by just returning the input tensor
This looks like a copy-paste typo carried over from the previous section. It should probably say that in Candle we implement the activation layer by calling the `gelu` function on the tensor.
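For illustration, here is a minimal sketch of what the corrected sentence would describe, assuming `candle_core`'s `Tensor::gelu` method; the `Dropout` and `GeluActivation` struct names are hypothetical and not taken from the tutorial:

```rust
use candle_core::{Device, Result, Tensor};

// Hypothetical name: inference-time dropout is a no-op, as the tutorial says.
struct Dropout;

impl Dropout {
    fn forward(&self, xs: &Tensor) -> Result<Tensor> {
        // Simply return the input tensor unchanged.
        Ok(xs.clone())
    }
}

// Hypothetical name: the activation "layer" has no parameters and just
// calls the tensor's built-in gelu op.
struct GeluActivation;

impl GeluActivation {
    fn forward(&self, xs: &Tensor) -> Result<Tensor> {
        xs.gelu()
    }
}

fn main() -> Result<()> {
    let xs = Tensor::new(&[-1.0f32, 0.0, 1.0], &Device::Cpu)?;
    let ys = GeluActivation.forward(&Dropout.forward(&xs)?)?;
    println!("{ys}");
    Ok(())
}
```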