{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2024,8,7]],"date-time":"2024-08-07T23:35:01Z","timestamp":1723073701089},"reference-count":53,"publisher":"MIT Press - Journals","content-domain":{"domain":[],"crossmark-restriction":false},"short-container-title":["Transactions of the Association for Computational Linguistics"],"published-print":{"date-parts":[[2020,12]]},"abstract":" Understanding natural language questions entails the ability to break down a question into the requisite steps for computing its answer. In this work, we introduce a Question Decomposition Meaning Representation (QDMR) for questions. QDMR constitutes the ordered list of steps, expressed through natural language, that are necessary for answering a question. We develop a crowdsourcing pipeline, showing that quality QDMRs can be annotated at scale, and release the Break dataset, containing over 83K pairs of questions and their QDMRs. We demonstrate the utility of QDMR by showing that (a) it can be used to improve open-domain question answering on the HotpotQA dataset, (b) it can be deterministically converted to a pseudo-SQL formal language, which can alleviate annotation in semantic parsing applications. Last, we use Break to train a sequence-to-sequence model with copying that parses questions into QDMR structures, and show that it substantially outperforms several natural baselines. <\/jats:p>","DOI":"10.1162\/tacl_a_00309","type":"journal-article","created":{"date-parts":[[2020,4,17]],"date-time":"2020-04-17T16:51:34Z","timestamp":1587142294000},"page":"183-198","source":"Crossref","is-referenced-by-count":28,"title":["Break<\/scp> It Down: A Question Understanding Benchmark"],"prefix":"10.1162","volume":"8","author":[{"given":"Tomer","family":"Wolfson","sequence":"first","affiliation":[{"name":"Tel Aviv University"},{"name":"Allen Institute for AI."}]},{"given":"Mor","family":"Geva","sequence":"additional","affiliation":[{"name":"Tel Aviv University"},{"name":"Allen Institute for AI."}]},{"given":"Ankit","family":"Gupta","sequence":"additional","affiliation":[{"name":"Tel Aviv University."}]},{"given":"Matt","family":"Gardner","sequence":"additional","affiliation":[{"name":"Allen Institute for AI."}]},{"given":"Yoav","family":"Goldberg","sequence":"additional","affiliation":[{"name":"Bar-Ilan University"},{"name":"Allen Institute for AI."}]},{"given":"Daniel","family":"Deutch","sequence":"additional","affiliation":[{"name":"Tel Aviv University."}]},{"given":"Jonathan","family":"Berant","sequence":"additional","affiliation":[{"name":"Tel Aviv University"},{"name":"Allen Institute for AI."}]}],"member":"281","reference":[{"key":"bib1","volume-title":"Association for Computational Linguistics (ACL)","author":"Abend Omri","year":"2013"},{"key":"bib2","volume-title":"North American Association for Computational Linguistics (NAACL)","author":"Abujabal Abdalghani","year":"2019"},{"key":"bib3","volume-title":"Human Language Technology and North American Association for Computational Linguistics (HLT\/NAACL)","author":"Andreas Jacob","year":"2016"},{"key":"bib4","doi-asserted-by":"crossref","first-page":"29","DOI":"10.1017\/S135132490000005X","volume":"1","author":"Androutsopoulos Ion","year":"1995","journal-title":"Journal of Natural Language Engineering"},{"key":"bib5","first-page":"2425","volume-title":"International Conference on Computer Vision (ICCV)","author":"Antol Stanislaw","year":"2015"},{"key":"bib6","doi-asserted-by":"crossref","first-page":"374","DOI":"10.1109\/ICDE.2019.00041","volume-title":"2019 IEEE 35th International Conference on Data Engineering (ICDE)","author":"Baik Christopher","year":"2019"},{"key":"bib7","volume-title":"7th Linguistic Annotation Workshop and Interoperability with Discourse","author":"Banarescu Laura","year":"2013"},{"key":"bib8","first-page":"249","volume-title":"Proceedings of the 1974 ACM SIGFIDET (now SIGMOD) Workshop on Data Description, Access and Control","author":"Chamberlin Donald D.","year":"1974"},{"key":"bib9","volume-title":"Association for Computational Linguistics (ACL)","author":"Chen Danqi","year":"2017"},{"key":"bib10","volume-title":"Association for Computational Linguistics (ACL)","author":"Chen Jifan","year":"2019"},{"key":"bib11","author":"Cheng Jianpeng","year":"2018","journal-title":"ArXiv"},{"key":"bib12","volume-title":"Association for Computational Linguistics (ACL)","author":"Choi Eunsol","year":"2015"},{"key":"bib13","first-page":"18","volume-title":"Computational Natural Language Learning (CoNLL)","author":"Clarke James","year":"2010"},{"issue":"6","key":"bib14","doi-asserted-by":"crossref","first-page":"377","DOI":"10.1145\/362384.362685","volume":"13","author":"Codd Edgar F.","year":"1970","journal-title":"Communications of the ACM"},{"key":"bib15","doi-asserted-by":"crossref","first-page":"43","DOI":"10.3115\/1075812.1075823","volume-title":"Workshop on Human Language Technology","author":"Dahl Deborah A.","year":"1994"},{"key":"bib16","volume-title":"North American Association for Computational Linguistics (NAACL)","author":"Devlin Jacob","year":"2019"},{"key":"bib17","volume-title":"Human Language Technology and North American Association for Computational Linguistics (HLT\/NAACL)","author":"Dua Dheeru","year":"2019"},{"key":"bib18","volume-title":"Association for Computational Linguistics (ACL)","author":"FitzGerald Nicholas","year":"2018"},{"key":"bib19","unstructured":"Matt Gardner, Joel Grus, Mark Neumann, Oyvind Tafjord, Pradeep Dasigi, Nelson F. Liu, Matthew Peters, Michael Schmitz, and Luke S. Zettlemoyer. 2017. In AllenNLP: A deep semantic natural language processing platform, arXiv, abs\/1803.07640v2."},{"key":"bib20","volume-title":"Association for Computational Linguistics (ACL)","author":"Jiatao Gu","year":"2016"},{"key":"bib21","volume-title":"Association for Computational Linguistics (ACL)","author":"Guo Jiaqi","year":"2019"},{"key":"bib22","volume-title":"Empirical Methods in Natural Language Processing (EMNLP)","author":"Gupta Nitish","year":"2018"},{"issue":"2","key":"bib23","doi-asserted-by":"crossref","first-page":"100","DOI":"10.1109\/TSSC.1968.300136","volume":"4","author":"Hart Peter E.","year":"1968","journal-title":"IEEE Transactions on Systems Science and Cybernetics"},{"key":"bib24","volume-title":"Empirical Methods in Natural Language Processing (EMNLP)","author":"He Luheng","year":"2016"},{"key":"bib25","volume-title":"International Conference on Computer Vision (ICCV)","author":"Ronghang Hu","year":"2017"},{"key":"bib26","volume-title":"The IEEE Conference on Computer Vision and Pattern Recognition (CVPR)","author":"Hudson Drew A.","year":"2019"},{"key":"bib27","volume-title":"Association for Computational Linguistics (ACL)","author":"Iyer Srini","year":"2017"},{"key":"bib28","doi-asserted-by":"crossref","first-page":"1821","DOI":"10.18653\/v1\/P17-1167","volume-title":"Proceedings of the 55th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers)","author":"Iyyer Mohit","year":"2017"},{"key":"bib29","volume-title":"Association for Computational Linguistics (ACL)","author":"Jiang Yichen","year":"2019"},{"key":"bib30","volume-title":"Computer Vision and Pattern Recognition (CVPR)","author":"Johnson Justin","year":"2017"},{"key":"bib31","volume-title":"Empirical Methods in Natural Language Processing (EMNLP)","author":"Kwiatkowski Tom","year":"2013"},{"key":"bib32","doi-asserted-by":"publisher","DOI":"10.1162\/tacl_a_00276"},{"key":"bib33","volume-title":"International Conference on Management of Data, SIGMOD","author":"Li Fei","year":"2014"},{"key":"bib34","first-page":"pages 1051\u2013page","volume-title":"International Conference on Management of Data, SIGMOD","author":"Li Fei","year":"2014"},{"key":"bib35","doi-asserted-by":"publisher","DOI":"10.1162\/COLI_a_00127"},{"key":"bib36","volume-title":"North American Association for Computational Linguistics (NAACL)","author":"Michael Julian","year":"2018"},{"key":"bib37","volume-title":"Association for Computational Linguistics (ACL)","author":"Min Sewon","year":"2019"},{"key":"bib38","volume-title":"Association for Computational Linguistics (ACL)","author":"Min Sewon","year":"2019"},{"key":"bib39","volume-title":"Association for Computational Linguistics (ACL)","author":"Pasupat Panupong","year":"2015"},{"issue":"1","key":"bib40","doi-asserted-by":"crossref","first-page":"11","DOI":"10.1007\/BF00763644","volume":"13","author":"Pelletier Francis Jeffry","year":"1994","journal-title":"Topoi"},{"key":"bib41","first-page":"91","volume-title":"Proceedings of the Third DARPA Speech and Natural Language Workshop","author":"Price P. J.","year":"1990"},{"key":"bib42","first-page":"2590","volume-title":"Empirical Methods in Natural Language Processing (EMNLP)","author":"Qi Peng","year":"2019"},{"key":"bib43","volume-title":"Empirical Methods in Natural Language Processing (EMNLP)","author":"Rajpurkar Pranav","year":"2016"},{"key":"bib44","volume-title":"Association for Computational Linguistics (ACL)","author":"Reddy Siva","year":"2016"},{"key":"bib45","author":"Suhr Alane","year":"2019","journal-title":"Association for Computational Linguistics (ACL)"},{"key":"bib46","volume-title":"North American Association for Computational Linguistics (NAACL)","author":"Talmor Alon","year":"2018"},{"key":"bib47","doi-asserted-by":"publisher","DOI":"10.1162\/tacl_a_00021"},{"key":"bib48","doi-asserted-by":"crossref","first-page":"401","DOI":"10.1162\/tacl_a_00107","volume":"4","author":"Wei Xu","year":"2016","journal-title":"Transactions of the Association for Computational Linguistics"},{"key":"bib49","volume-title":"Empirical Methods in Natural Language Processing (EMNLP)","author":"Yang Zhilin","year":"2018"},{"key":"bib50","volume-title":"Association for Computational Linguistics (ACL)","author":"Yih Wen-tau","year":"2016"},{"key":"bib51","volume-title":"Empirical Methods in Natural Language Processing (EMNLP)","author":"Tao Yu","year":"2018"},{"key":"bib52","first-page":"1050","volume-title":"Association for the Advancement of Artificial Intelligence (AAAI)","author":"Zelle John M.","year":"1996"},{"key":"bib53","first-page":"658","volume-title":"Uncertainty in Artificial Intelligence (UAI)","author":"Zettlemoyer Luke","year":"2005"}],"container-title":["Transactions of the Association for Computational Linguistics"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/www.mitpressjournals.org\/doi\/pdf\/10.1162\/tacl_a_00309","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2021,3,12]],"date-time":"2021-03-12T21:39:35Z","timestamp":1615585175000},"score":1,"resource":{"primary":{"URL":"https:\/\/direct.mit.edu\/tacl\/article\/43549"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2020,12]]},"references-count":53,"alternative-id":["10.1162\/tacl_a_00309"],"URL":"https:\/\/doi.org\/10.1162\/tacl_a_00309","relation":{},"ISSN":["2307-387X"],"issn-type":[{"value":"2307-387X","type":"electronic"}],"subject":[],"published":{"date-parts":[[2020,12]]}}}