Widget for Visual Question Answering #16

Closed
gchhablani opened this issue Jun 19, 2021 · 2 comments


@gchhablani

This is a request for a widget/Inference API on the Hub for multimodal models on Visual Question Answering (VQA) tasks.

Input:

  • Image
  • Question

Output:

  • Answer

This should work with VisualBERT/LXMERT models. It might need Detectron or something similar to the Faster R-CNN model here: https://github.com/huggingface/transformers/tree/master/examples/research_projects/lxmert
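
To make the requested input/output shape concrete, here is a minimal sketch of what a request and response could look like. Everything in it is an assumption for illustration: the `{"inputs": {"image", "question"}}` payload layout, the list of `{"answer", "score"}` candidates in the response, and the helper names `build_vqa_payload` and `top_answer` are hypothetical, not an existing Hub API. It also leaves out the visual feature extraction step (Detectron / Faster R-CNN) that VisualBERT/LXMERT would need on the server side.

```python
import base64
import json
from dataclasses import dataclass


@dataclass
class VQAAnswer:
    """One candidate answer with its confidence score."""
    answer: str
    score: float


def build_vqa_payload(image_bytes: bytes, question: str) -> str:
    """Serialize one image and one question into a JSON request body.

    The payload layout is an assumption, not a documented API.
    """
    if not question.strip():
        raise ValueError("question must not be empty")
    # Raw image bytes are base64-encoded so they can travel inside JSON.
    encoded = base64.b64encode(image_bytes).decode("ascii")
    return json.dumps({"inputs": {"image": encoded, "question": question}})


def top_answer(response_body: str) -> VQAAnswer:
    """Pick the highest-scoring answer from an assumed response format:
    a JSON list of {"answer": str, "score": float} objects."""
    candidates = json.loads(response_body)
    if not candidates:
        raise ValueError("response contains no answers")
    best = max(candidates, key=lambda c: c["score"])
    return VQAAnswer(answer=best["answer"], score=float(best["score"]))
```

A widget would then only need to collect the image and the question, send the payload, and display the answer returned by `top_answer`.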

@LysandreJik transferred this issue from huggingface/huggingface_hub on Mar 16, 2022
@NielsRogge
Contributor

cc @mishig25

@mishig25
Collaborator

closed by #263


4 participants