
Implementation of Distilbert and added Example of Fill Mask task for Distilbert and Roberta #200

Open
wants to merge 12 commits into master

Conversation

deveshjawla

No description provided.

@AbrJA

AbrJA commented Nov 13, 2024

Hi @deveshjawla I hope you are doing well,

There was an error running the tests; it's the same issue across all versions. In the load file, line 286 has this:

if !isnothing(m.pooler)
        get_state_dict(HGFDistilBertModel, m.pooler.layer.dense, state_dict, joinname(prefix, "pooler.dense"))
    end

But the attribute pooler is not defined for HGFDistilBertModel, because the line with this definition is commented out:

# pooler = DistilBertPooler(Layers.Dense(NNlib.tanh_fast, weight, bias))

I'm gonna try to solve it but maybe you already have the solution!

Best regards, I really appreciate your contribution
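
A minimal sketch of one way to make that serialization guard safe when the struct may simply not carry a pooler field at all (this is an editor's illustration of the failure mode, not the patch actually applied in the PR):

```julia
# Sketch: only serialize the pooler when the field both exists on the
# struct and is set. `hasproperty` avoids the undefined-field error that
# occurs when `pooler` is not part of the model at all (e.g. because its
# construction is commented out).
if hasproperty(m, :pooler) && !isnothing(m.pooler)
    get_state_dict(HGFDistilBertModel, m.pooler.layer.dense, state_dict,
                   joinname(prefix, "pooler.dense"))
end
```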

@deveshjawla
Author

deveshjawla commented Dec 9, 2024

> Hi @deveshjawla I hope you are doing well,
>
> There was an error running the tests; it's the same issue across all versions. In the load file, line 286 has this:
>
>     if !isnothing(m.pooler)
>         get_state_dict(HGFDistilBertModel, m.pooler.layer.dense, state_dict, joinname(prefix, "pooler.dense"))
>     end
>
> But the attribute pooler is not defined for HGFDistilBertModel, because the line with this definition is commented out:
>
>     # pooler = DistilBertPooler(Layers.Dense(NNlib.tanh_fast, weight, bias))
>
> I'm gonna try to solve it but maybe you already have the solution!
>
> Best regards, I really appreciate your contribution

Dear Abraham,

I hope you are doing well. Apologies for a very late response. I have been working on another project.

Thank you for bringing this to my attention. I was having problems running the HuggingFaceValidation script while implementing DistilBert. An EnvironmentException was raised where the Hugging Face checkpoint is loaded from Python, in the following code in Transformers.jl/example/HuggingFaceValidation/main.jl:

@info "Load configure file in Python"
global pyconfig = @tryrun begin
    cfg = hgf_trf.AutoConfig.from_pretrained(model_name, layer_norm_eps = 1e-9, layer_norm_epsilon = 1e-9)
    if cfg.model_type == "clip"
        if haskey(cfg, "text_config")
            cfg.text_config.layer_norm_eps = 1e-9
            cfg.text_config.layer_norm_epsilon = 1e-9
        end
        if haskey(cfg, "vision_config")
            cfg.vision_config.layer_norm_eps = 1e-9
            cfg.vision_config.layer_norm_epsilon = 1e-9
        end
    end
    cfg
end "Failed to load configure file in Python, probably unsupported"

So I removed the eps overrides, and my validation for all models worked fine, as shown below:
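
With the overrides removed, the Python-side load reduces to roughly the following (an editor's sketch of the simplification described above, not the exact patch; hgf_trf is the handle to the Python transformers package, as in main.jl):

```julia
@info "Load configure file in Python"
global pyconfig = @tryrun begin
    # Load the checkpoint's config as-is. Passing no layer_norm_eps /
    # layer_norm_epsilon overrides avoids the exception seen when some
    # model configs reject those keyword arguments.
    hgf_trf.AutoConfig.from_pretrained(model_name)
end "Failed to load configure file in Python, probably unsupported"
```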
[Screenshot: validation output for all models passing, 2024-12-09 5:53 PM]

Perhaps when you ran the validation, it caught errors that my build had not.

I had implemented the pooler layer with NextSentencePrediction in mind, but I think it is not implemented in the Hugging Face DistilBert anyway.

Please let me know if the tests pass now.

As for the other tasks, such as QA and sequence classification, I have put the code in the DistilBert implementation but have not successfully tested it yet; I intend to do so soon. Perhaps someone else will implement them sooner than I do, so I have commented them out.

In any case, please let me know. Thank you.

@deveshjawla
Author

Hi, I have fixed the error related to ForCausalLM, but there are failures which I don't understand.
Could you please let me know what might be causing the following?

```
Load: Log Test Failed at /Users/runner/work/Transformers.jl/Transformers.jl/test/huggingface/load.jl:34
  Expression: load_model(model_name, hgf_model_name, task_type; config = cfg, cache = false)
  Log Pattern: min_level = Logging.Debug
```

@chengchingwen
Owner

/test/huggingface/load.jl:34 tests whether there are parameters that exist in the model but are not found in the state_dict and are thus randomly initialized. You can set ENV["JULIA_DEBUG"] = "Transformers" before calling load_model in the REPL to see which parameter is being initialized.
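
In REPL terms, the suggestion above amounts to something like this (an editor's sketch; the load_model arguments are the placeholders from the failing test, not concrete values):

```julia
using Transformers

# Enable debug-level logging for the Transformers package so that the
# loading code reports, via @debug messages, which parameters are missing
# from the state_dict and are therefore randomly initialized.
ENV["JULIA_DEBUG"] = "Transformers"

model = load_model(model_name, hgf_model_name, task_type;
                   config = cfg, cache = false)
```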
