Is it possible to add code for Visual QA on Bar Charts, Pie Charts and Visualizations? #2504

Closed
pratikkotian04 opened this issue May 5, 2022 · 4 comments
Labels: topic:reader, type:feature (New feature or request), wontfix (This will not be worked on)

Comments

@pratikkotian04

Chart question answering (CQA) is a newly proposed visual question answering (VQA) task where an algorithm must answer questions about data visualizations, e.g. bar charts, pie charts, and line graphs. CQA requires capabilities that natural-image VQA algorithms lack: fine-grained measurements, optical character recognition, and handling out-of-vocabulary words in both questions and answers. Without modifications, state-of-the-art VQA algorithms perform poorly on this task. Here, we propose a novel CQA algorithm called parallel recurrent fusion of image and language (PReFIL). PReFIL first learns bimodal embeddings by fusing question and image features and then intelligently aggregates these learned embeddings to answer the given question. Despite its simplicity, PReFIL greatly surpasses state-of-the-art systems and human baselines on both the FigureQA and DVQA datasets. Additionally, we demonstrate that PReFIL can be used to reconstruct tables by asking a series of questions about a chart.
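To make the "fuse, then aggregate" idea in the quoted abstract concrete, here is a rough sketch in plain Python. It is not the PReFIL authors' code and not a Haystack API. The function names `fuse` and `aggregate`, the toy dimensions, and the random weights are all hypothetical. Mean pooling is used only as a simple stand-in, since the abstract does not spell out how the embeddings are aggregated.

```python
# Illustrative sketch only: NOT the PReFIL implementation and NOT a Haystack API.
# All names, sizes, and weights below are hypothetical.
import random


def fuse(image_feats, question_vec, weights):
    """Append the question vector to every image location, then apply a
    shared linear map followed by ReLU (the same map at every location)."""
    fused = []
    for loc in image_feats:              # loc: C image features at one location
        joint = loc + question_vec       # concatenation, length C + Q
        fused.append([
            max(0.0, sum(w * x for w, x in zip(row, joint)))
            for row in weights           # one output value per weight row
        ])
    return fused


def aggregate(fused):
    """Mean-pool the fused embeddings over all locations.
    Assumption: a stand-in for the learned aggregation step."""
    n = len(fused)
    return [sum(col) / n for col in zip(*fused)]


rng = random.Random(0)
C, Q, D, N = 4, 3, 5, 6                  # image dim, question dim, output dim, locations
image_feats = [[rng.random() for _ in range(C)] for _ in range(N)]
question_vec = [rng.random() for _ in range(Q)]
weights = [[rng.uniform(-1, 1) for _ in range(C + Q)] for _ in range(D)]

fused = fuse(image_feats, question_vec, weights)
pooled = aggregate(fused)
print(len(fused), len(pooled))
```

In a real system the pooled vector would feed a classifier over candidate answers, and the weights would be learned rather than random.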

@julian-risch
Member

Linking the paper where the quote comes from: https://arxiv.org/abs/1908.01801

@julian-risch added the type:feature (New feature or request) label May 5, 2022
@julian-risch
Member

Hi @pratikkotian04, thanks for bringing up this topic. We are also quite interested in extending the question answering capabilities from text and tables to charts and other kinds of content. It's a larger topic, though, and right now I can't give you an estimate of when question answering on bar charts, pie charts and other visualizations will be supported by Haystack.

Did you have a look at the code accompanying the paper you just quoted? I found it here: https://github.com/kushalkafle/PREFIL. Since it dates from 2020 and we're already in 2022, there may be more recent research on the topic.

@pratikkotian04
Author

pratikkotian04 commented May 5, 2022

Hi @julian-risch, I am using Haystack for text and table question answering at my organization, and it would be helpful if I could continue using Haystack for chart question answering as well.

@julian-risch
Member

@pratikkotian04 We definitely have QA on charts on our list, yes. 👍 At the moment, we are working on an epic that will integrate image documents in addition to text documents into Haystack. In that context, we will implement an ImageRetriever and also an ImageToText node. Here is the link: #2418. Once this epic is done, we will be able to tackle QA on charts in Q3 2022. Stay tuned!

@masci added the wontfix (This will not be worked on) label Mar 12, 2024
@masci closed this as completed Mar 12, 2024
3 participants