Trick Me If You Can: Human-in-the-Loop Generation of Adversarial Examples for Question Answering

Adversarial evaluation stress-tests a model’s understanding of natural language. Because past approaches expose superficial patterns, the resulting adversarial examples are limited in complexity and diversity. We propose human- in-the-loop adversarial generation, where human authors are guided to break models. We aid the authors with interpretations of model predictions through an interactive user interface. We apply this generation framework to...

Paper Fields

Paper Details

Title

DOI

doi.org/10.1162/tacl_a_00279

Published Date

Jul 26, 2019

Journal

Transactions of the Association for Computational Linguistics

Volume

7

Pages

387 - 401

Notes

To use the Note feature, you need to be logged in. Please

History