Install the tensorflow example requirements in docker #31428

amyeroberts · 2024-06-14T17:31:36Z

What does this PR do?

Unlike the pytorch examples here the docker file used to run the tensorflow examples doesn't install the requirements from the examples requirements file.

Recently, we had to pin the datasets version used for the examples #31417, but this wasn't propogated for tensorflow because of this omisison.

This means added requirements won't be included, and is currently causing failing tests on main: https://app.circleci.com/pipelines/github/huggingface/transformers/95698/workflows/52a112da-0d84-4569-8f69-ca180f4c7b2a/jobs/1260731

gante · 2024-06-14T17:35:16Z

@amyeroberts There are PT example failures too, possibly related (e.g see here, from this PR)

amyeroberts · 2024-06-14T17:39:01Z

@gante Thanks for flagging!

I think what's happening is that datasets is already installed with the pip install . command and so not updated to reflect the constraints here. I'll play about and see if I can figure out what's happening

Scratch all that. If we're installing from the library's setup.py then the datasets version should still be limited, and my understanding is that if we're installing with pip install . datasets shouldn't be installed at all. Which brings into question how the TF examples were running in the first place....

HuggingFaceDocBuilderDev · 2024-06-14T17:51:49Z

The docs for this PR live here. All of your documentation changes will be reflected on that endpoint. The docs are available until 30 days after the last update.

amyeroberts · 2024-06-14T17:55:45Z

@gante Are you sure the linked PR is from the latest main? If I look at recent runs for examples_torch under the installation step -- uv venv && uv pip install . && uv pip install -r examples/pytorch/_tests_requirements.txt -- then you see datasets downgraded to 2.19.2, whereas I don't see that for that PR.

ydshieh · 2024-06-14T18:29:56Z

It makes examples_tf_job equivalent examples_torch_job regarding + the logic makes sense to me.

Thank you!

ArthurZucker

I'll open a fix for both next week, but we should not allow such quick fixes anymore given that the process to update the docker image is super straigforward now

ArthurZucker · 2024-06-14T20:48:39Z

.circleci/create_circleci_config.py

@@ -326,7 +326,7 @@ def job_name(self):
    "examples_tensorflow",
    cache_name="tensorflow_examples",
    docker_image=[{"image":"huggingface/transformers-examples-tf"}],
-    install_steps=["uv venv && uv pip install ."],
+    install_steps=["uv venv && uv pip install . && uv pip install -r examples/tensorflow/_tests_requirements.txt"],


Mmmm again, that is not what we should do. The CI is gonna get slower because we just need the docker that is built to run this.

amyeroberts · 2024-06-17T09:45:42Z

should not allow such quick fixes anymore given that the process to update the docker image is super straigforward now

@ArthurZucker I think "not allow" is very strong here for fixing something which would essentially be required anyway: regardless of upstream changes to the docker images (and how easy that is to implement) the commands for setting up our examples runs were not consistent. So, either, we need to have inconsistent docker images for the different frameworks (harder to maintain) or the pytorch command would need to be updated to remove the requirements install

gante · 2024-06-17T10:37:25Z

@gante Are you sure the linked PR is from the latest main?

@amyeroberts possibly not, can't confirm 👼 In any case, rebasing to the latest main has sorted it, thanks 🙏

ArthurZucker · 2024-06-18T13:08:59Z

We already have different docker images for different frameworks, and docker image exist for example_tensorflow and example_torch. So I think we do need to update this but the idea is to make sure we don't install anything in the CIs

ydshieh · 2024-06-18T13:16:13Z

Make sense. Sorry I didn't think of this aspect. We can definitely take the requirements in examples into account in the docker image build time.

amyeroberts · 2024-06-18T14:59:40Z

We already have different docker images for different frameworks, and docker image exist for example_tensorflow and example_torch

Yes, I'd expect the docker images to be different. After all, we need one with a tensorflow environment and one for torch. Their overall setup should be consistent though, and the errors which were being experienced on the CI highlighted that they weren't (the pytorch one was reliant on the requirements installs, whereas TF wasn't).

Once the docker images have been updated, we can remove the installs here. And, I'm assuming the _test_requirements.txt files?

ydshieh · 2024-06-18T17:55:46Z

Once the docker images have been updated, we can remove the installs here.
yeah!

And, I'm assuming the _test_requirements.txt files?

I guess those are better to be kept even if we update the docker file, although I don't feel strong (as it means there are duplication and we have to keep them synced.).

Or better, if we can use _test_requirements.txt directly in the docker file (not to repeat the versions and only keep one source of information)

amyeroberts · 2024-06-18T18:36:16Z

Or better, if we can use _test_requirements.txt directly in the docker file (not to repeat the versions and only keep one source of information)

Yes please :)

ArthurZucker · 2024-06-18T18:55:50Z

transformers/setup.py

Lines 466 to 479 in 682f221

    
           extras["tests_torch"] = deps_list() 
        
           extras["tests_tf"] = deps_list() 
        
           extras["tests_flax"] = deps_list() 
        
           extras["tests_torch_and_tf"] = deps_list() 
        
           extras["tests_torch_and_flax"] = deps_list() 
        
           extras["tests_hub"] = deps_list() 
        
           extras["tests_pipelines_torch"] = deps_list() 
        
           extras["tests_pipelines_tf"] = deps_list() 
        
           extras["tests_onnx"] = deps_list() 
        
           extras["tests_examples_torch"] = deps_list() 
        
           extras["tests_examples_tf"] = deps_list() 
        
           extras["tests_custom_tokenizers"] = deps_list() 
        
           extras["tests_exotic_models"] = deps_list() 
        
           extras["consistency"] = deps_list()

should be what we are looking for! Either this or the requirement but yes everything should be in the docker build

Install the tensorflow example requirements in docker

e214ed6

amyeroberts requested a review from ydshieh June 14, 2024 17:31

ydshieh approved these changes Jun 14, 2024

View reviewed changes

amyeroberts merged commit 3d0bd86 into huggingface:main Jun 14, 2024
22 checks passed

amyeroberts deleted the fix-tensorflow-examples-docker branch June 14, 2024 18:35

ArthurZucker reviewed Jun 14, 2024

View reviewed changes

itazap pushed a commit that referenced this pull request Jun 17, 2024

Install the tensorflow example requirements in docker (#31428)

22a8f80

itazap pushed a commit that referenced this pull request Jun 17, 2024

Install the tensorflow example requirements in docker (#31428)

b1fdf54

itazap pushed a commit that referenced this pull request Jun 17, 2024

Install the tensorflow example requirements in docker (#31428)

0f1c20b

itazap pushed a commit that referenced this pull request Jun 18, 2024

Install the tensorflow example requirements in docker (#31428)

ef14243

itazap pushed a commit that referenced this pull request Jun 20, 2024

Install the tensorflow example requirements in docker (#31428)

752a91c

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Install the tensorflow example requirements in docker #31428

Install the tensorflow example requirements in docker #31428

amyeroberts commented Jun 14, 2024

gante commented Jun 14, 2024

amyeroberts commented Jun 14, 2024 •

edited

Loading

HuggingFaceDocBuilderDev commented Jun 14, 2024

amyeroberts commented Jun 14, 2024 •

edited

Loading

ydshieh commented Jun 14, 2024

ArthurZucker left a comment

ArthurZucker Jun 14, 2024

amyeroberts commented Jun 17, 2024

gante commented Jun 17, 2024 •

edited

Loading

ArthurZucker commented Jun 18, 2024

ydshieh commented Jun 18, 2024

amyeroberts commented Jun 18, 2024

ydshieh commented Jun 18, 2024

amyeroberts commented Jun 18, 2024

ArthurZucker commented Jun 18, 2024

Install the tensorflow example requirements in docker #31428

Install the tensorflow example requirements in docker #31428

Conversation

amyeroberts commented Jun 14, 2024

What does this PR do?

gante commented Jun 14, 2024

amyeroberts commented Jun 14, 2024 • edited Loading

HuggingFaceDocBuilderDev commented Jun 14, 2024

amyeroberts commented Jun 14, 2024 • edited Loading

ydshieh commented Jun 14, 2024

ArthurZucker left a comment

Choose a reason for hiding this comment

ArthurZucker Jun 14, 2024

Choose a reason for hiding this comment

amyeroberts commented Jun 17, 2024

gante commented Jun 17, 2024 • edited Loading

ArthurZucker commented Jun 18, 2024

ydshieh commented Jun 18, 2024

amyeroberts commented Jun 18, 2024

ydshieh commented Jun 18, 2024

amyeroberts commented Jun 18, 2024

ArthurZucker commented Jun 18, 2024

amyeroberts commented Jun 14, 2024 •

edited

Loading

amyeroberts commented Jun 14, 2024 •

edited

Loading

gante commented Jun 17, 2024 •

edited

Loading