We read every piece of feedback, and take your input very seriously.
To see all available qualifiers, see our documentation.
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
transformers==4.46.0.dev0
No response
examples
If the size of pixel_values is 384 x 384, below will throw an error.
pixel_values
vision_tower = SiglipVisionModel.from_pretrained("google/siglip-so400m-patch14-384") image_outputs = vision_tower(pixel_values, interpolate_pos_encoding=True)
Error:
embeddings = embeddings + self.interpolate_pos_encoding(embeddings, height, width) ~~~~~~~~~~~^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ TypeError: unsupported operand type(s) for +: 'Tensor' and 'Embedding'
Shouldn't raise an error even though pos_encoding doesn't need to be interpolated when interpolate_pos_encoding=True
interpolate_pos_encoding=True
The text was updated successfully, but these errors were encountered:
Hey! I think this is also related to #33965
Sorry, something went wrong.
Thanks for opening a PR seems right indeed! 🤗
Successfully merging a pull request may close this issue.
System Info
transformers==4.46.0.dev0
Who can help?
No response
Information
Tasks
examples
folder (such as GLUE/SQuAD, ...)Reproduction
If the size of
pixel_values
is 384 x 384, below will throw an error.Error:
Expected behavior
Shouldn't raise an error even though pos_encoding doesn't need to be interpolated when
interpolate_pos_encoding=True
The text was updated successfully, but these errors were encountered: