The dataset and baseline model for the EMNLP 2021 paper Visual Goal-Step Inference using wikiHow.
The dataset is available here; it contains the following files:
- wiki_images.zip: all 770k images, each named {image_id}.png
- WikihowText_data.json: JSON Lines file of the textual annotation data; each line is a JSON object with the following structure:
{'file_id': article ID (str, e.g. '10000798'),
 'goal': textual goal (str, e.g. 'How to Hang an Ironing Board'),
 'goal_description': description of the goal (str),
 'category_hierarchy': (list of str),
 'methods': [{'name': 'Hanging an Ironing Board Over a Door',
              'steps': [{'headline': 'Find a door to hang your ironing board.',
                         'description': (str),
                         'img': URL of the image,
                         'img_license': 'Creative Commons',
                         'step_id': '10000798_0_0'},
                        ...
                       ]},
             ...
            ]}
Use the following code to read the JSON file (a sketch of iterating over the loaded articles is given after the file list):
import json

# Each line of WikihowText_data.json holds one article as a JSON object.
articles = []
for line in open("WikihowText_data.json", "r"):
    articles.append(json.loads(line))
- samples: this folder contains the train/test examples from the 3 sampling methods.
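As an illustration, here is a minimal sketch of walking through one loaded article, assuming the field names shown in the structure above (the print_article helper is hypothetical, not part of the released code):

def print_article(article):
    # Walk one article: goal -> methods -> steps.
    print(article["goal"])  # e.g. 'How to Hang an Ironing Board'
    for method in article["methods"]:
        print("  Method:", method["name"])
        for step in method["steps"]:
            print("    Step:", step["headline"], "| image:", step["img"])

print_article(articles[0])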
See this notebook for the baseline model.
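The notebook contains the actual baseline; purely to illustrate the VGSI multiple-choice format (an image of a step, several candidate goals, pick the best match), here is a hedged sketch in which image_emb and goal_embs are random placeholders rather than outputs of the trained encoders used in the paper:

import numpy as np

# Placeholder embeddings: in the real baseline these come from trained
# image and text encoders (see the notebook); here they are random.
rng = np.random.default_rng(0)
image_emb = rng.normal(size=512)        # embedding of the step image
goal_embs = rng.normal(size=(4, 512))   # embeddings of 4 candidate goals

def cosine(a, b):
    return a @ b / (np.linalg.norm(a) * np.linalg.norm(b))

# Pick the candidate goal whose embedding is most similar to the image.
scores = [cosine(image_emb, g) for g in goal_embs]
print("predicted goal index:", int(np.argmax(scores)))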
There is a wikiHow to Meme
board game that is similar to our VGSI task. In each round of the game, an image card depicting a step is shown, and each player selects the textual goal card that best matches the image. Here is the purchase link for the game. There is also a printable version.
Please cite our work if you find it useful. Thank you!
@article{yang2021visual,
  title={Visual Goal-Step Inference using wikiHow},
  author={Yang, Yue and Panagopoulou, Artemis and Lyu, Qing and Zhang, Li and Yatskar, Mark and Callison-Burch, Chris},
  journal={arXiv preprint arXiv:2104.05845},
  year={2021}
}