{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2025,4,4]],"date-time":"2025-04-04T12:11:46Z","timestamp":1743768706532,"version":"3.37.3"},"publisher-location":"New York, NY, USA","reference-count":31,"publisher":"ACM","funder":[{"DOI":"10.13039\/501100010785","name":"Canada First Research Excellence Fund","doi-asserted-by":"publisher","id":[{"id":"10.13039\/501100010785","id-type":"DOI","asserted-by":"publisher"}]},{"DOI":"10.13039\/100013020","name":"Compute Canada","doi-asserted-by":"publisher","id":[{"id":"10.13039\/100013020","id-type":"DOI","asserted-by":"publisher"}]},{"DOI":"10.13039\/501100000038","name":"Natural Sciences and Engineering Research Council of Canada","doi-asserted-by":"publisher","id":[{"id":"10.13039\/501100000038","id-type":"DOI","asserted-by":"publisher"}]},{"name":"Compute Ontario"}],"content-domain":{"domain":["dl.acm.org"],"crossmark-restriction":true},"short-container-title":[],"published-print":{"date-parts":[[2022,7,6]]},"DOI":"10.1145\/3477495.3531749","type":"proceedings-article","created":{"date-parts":[[2022,7,7]],"date-time":"2022-07-07T15:12:13Z","timestamp":1657206733000},"page":"3187-3197","update-policy":"https:\/\/doi.org\/10.1145\/crossmark-policy","source":"Crossref","is-referenced-by-count":10,"title":["Document Expansion Baselines and Learned Sparse Lexical Representations for MS MARCO V1 and V2"],"prefix":"10.1145","author":[{"given":"Xueguang","family":"Ma","sequence":"first","affiliation":[{"name":"University of Waterloo, Waterloo, ON, Canada"}]},{"given":"Ronak","family":"Pradeep","sequence":"additional","affiliation":[{"name":"University of Waterloo, Waterloo, ON, Canada"}]},{"given":"Rodrigo","family":"Nogueira","sequence":"additional","affiliation":[{"name":"University of Waterloo, Waterloo, ON, Canada"}]},{"given":"Jimmy","family":"Lin","sequence":"additional","affiliation":[{"name":"University of Waterloo, Waterloo, ON, 
Canada"}]}],"member":"320","published-online":{"date-parts":[[2022,7,7]]},"reference":[{"key":"e_1_3_2_1_1_1","doi-asserted-by":"publisher","DOI":"10.1145\/383952.383957"},{"key":"e_1_3_2_1_2_1","volume-title":"MS MARCO: A Human Generated MAchine Reading COmprehension Dataset. arXiv:1611.09268v3","author":"Bajaj Payal","year":"2018","unstructured":"Payal Bajaj, Daniel Campos, Nick Craswell, Li Deng, Jianfeng Gao, Xiaodong Liu, Rangan Majumder, Andrew McNamara, Bhaskar Mitra, Tri Nguyen, Mir Rosenberg, Xia Song, Alina Stoica, Saurabh Tiwary, and Tong Wang. 2018. MS MARCO: A Human Generated MAchine Reading COmprehension Dataset. arXiv:1611.09268v3 (2018)."},{"key":"e_1_3_2_1_3_1","volume-title":"Proceedings of the Twenty-Ninth Text REtrieval Conference Proceedings (TREC 2020)","author":"Craswell Nick","year":"2020","unstructured":"Nick Craswell, Bhaskar Mitra, Emine Yilmaz, and Daniel Campos. 2020. Overview of the TREC 2020 Deep Learning Track. In Proceedings of the Twenty-Ninth Text REtrieval Conference Proceedings (TREC 2020)."},{"volume-title":"Proceedings of the Twenty-Eighth Text REtrieval Conference Proceedings (TREC 2019)","author":"Craswell Nick","key":"e_1_3_2_1_4_1","unstructured":"
Nick Craswell, Bhaskar Mitra, Emine Yilmaz, Daniel Campos, and Ellen M. Voorhees. 2019. Overview of the TREC 2019 Deep Learning Track. In Proceedings of the Twenty-Eighth Text REtrieval Conference Proceedings (TREC 2019)."},{"key":"e_1_3_2_1_5_1","doi-asserted-by":"publisher","DOI":"10.1145\/3331184.3331303"},{"key":"e_1_3_2_1_6_1","volume-title":"Proceedings of the 2019 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies","volume":"1","author":"Devlin Jacob","year":"2019","unstructured":"Jacob Devlin, Ming-Wei Chang, Kenton Lee, and Kristina Toutanova. 2019. BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding. In Proceedings of the 2019 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, Volume 1 (Long and Short Papers). 
Minneapolis, Minnesota, 4171--4186."},{"key":"e_1_3_2_1_7_1","doi-asserted-by":"publisher","DOI":"10.1145\/32206.32212"},{"key":"e_1_3_2_1_8_1","doi-asserted-by":"publisher","DOI":"10.18653\/v1\/2021.naacl-main.241"},{"key":"e_1_3_2_1_9_1","doi-asserted-by":"publisher","DOI":"10.18653\/v1\/2020.emnlp-main.550"},{"key":"e_1_3_2_1_10_1","doi-asserted-by":"crossref","unstructured":"Quentin Lhoest, Albert Villanova del Moral, Yacine Jernite, Abhishek Thakur, Patrick von Platen, Suraj Patil, Julien Chaumond, Mariama Drame, Julien Plu, Lewis Tunstall, Joe Davison, Mario \u0160a\u0161ko, Gunjan Chhablani, Bhavitvya Malik, Simon Brandeis, Teven Le Scao, Victor Sanh, Canwen Xu, Nicolas Patry, Angelina McMillan-Major, Philipp Schmid, Sylvain Gugger, Cl\u00e9ment Delangue, Th\u00e9o Matussi\u00e8re, Lysandre Debut, Stas Bekman, Pierric Cistac, Thibault Goehringer, Victor Mustar, Fran\u00e7ois Lagunas, Alexander Rush, and Thomas Wolf. 2021. Datasets: A Community Library for Natural Language Processing. In Proceedings of the 2021 Conference on Empirical Methods in Natural Language Processing: System Demonstrations. 
Online and Punta Cana, Dominican Republic, 175--184.","DOI":"10.18653\/v1\/2021.emnlp-demo.21"},{"key":"e_1_3_2_1_11_1","volume-title":"A Proposed Conceptual Framework for a Representational Approach to Information Retrieval. arXiv:2110.01529","author":"Lin Jimmy","year":"2021","unstructured":"Jimmy Lin. 2021. A Proposed Conceptual Framework for a Representational Approach to Information Retrieval. arXiv:2110.01529 (2021)."},{"key":"e_1_3_2_1_12_1","volume-title":"COIL, and a Conceptual Framework for Information Retrieval Techniques. arXiv:2106.14807","author":"Lin Jimmy","year":"2021","unstructured":"Jimmy Lin and Xueguang Ma. 2021. A Few Brief Notes on DeepImpact, COIL, and a Conceptual Framework for Information Retrieval Techniques. arXiv:2106.14807 (2021)."},{"key":"e_1_3_2_1_13_1","doi-asserted-by":"publisher","DOI":"10.1145\/3404835.3463238"},{"volume-title":"2021 b. Pretrained Transformers for Text Ranking: BERT and Beyond","author":"Lin Jimmy","key":"e_1_3_2_1_14_1","unstructured":"Jimmy Lin, Rodrigo Nogueira, and Andrew Yates. 2021b. Pretrained Transformers for Text Ranking: BERT and Beyond. Morgan & Claypool Publishers."},{"key":"e_1_3_2_1_15_1","doi-asserted-by":"publisher","DOI":"10.18653\/v1\/2021.repl4nlp-1.17"},{"key":"e_1_3_2_1_16_1","doi-asserted-by":"publisher","DOI":"10.1145\/3404835.3463030"},{"key":"e_1_3_2_1_17_1","volume-title":"MS MARCO: A Human Generated MAchine Reading COmprehension Dataset. 
arXiv:1611.09268v1","author":"Nguyen Tri","year":"2016","unstructured":"Tri Nguyen, Mir Rosenberg, Xia Song, Jianfeng Gao, Saurabh Tiwary, Rangan Majumder, and Li Deng. 2016. MS MARCO: A Human Generated MAchine Reading COmprehension Dataset. arXiv:1611.09268v1 (2016)."},{"key":"e_1_3_2_1_18_1","volume-title":"Passage Re-ranking with BERT. arXiv:1901.04085","author":"Nogueira Rodrigo","year":"2019","unstructured":"Rodrigo Nogueira and Kyunghyun Cho. 2019. Passage Re-ranking with BERT. arXiv:1901.04085 (2019)."},{"key":"e_1_3_2_1_19_1","unstructured":"Rodrigo Nogueira and Jimmy Lin. 2019. From doc2query to docTTTTTquery."},{"key":"e_1_3_2_1_20_1","volume-title":"Document Expansion by Query Prediction. arXiv:1904.08375","author":"Nogueira Rodrigo","year":"2019","unstructured":"Rodrigo Nogueira, Wei Yang, Jimmy Lin, and Kyunghyun Cho. 2019. Document Expansion by Query Prediction. arXiv:1904.08375 (2019)."},{"key":"e_1_3_2_1_21_1","volume-title":"Proceedings of the Twenty-Ninth Text REtrieval Conference (TREC 2020)","author":"Pradeep Ronak","year":"2020","unstructured":"Ronak Pradeep, Xueguang Ma, Xinyu Zhang, Hang Cui, Ruizhou Xu, Rodrigo Nogueira, and Jimmy Lin. 
2020. H$_2$oloo at TREC 2020: When all you got is a hammer... Deep Learning, Health Misinformation, and Precision Medicine. In Proceedings of the Twenty-Ninth Text REtrieval Conference (TREC 2020)."},{"key":"e_1_3_2_1_22_1","volume-title":"The Expando-Mono-Duo Design Pattern for Text Ranking with Pretrained Sequence-to-Sequence Models. arXiv:2101.05667","author":"Pradeep Ronak","year":"2021","unstructured":"Ronak Pradeep, Rodrigo Nogueira, and Jimmy Lin. 2021. The Expando-Mono-Duo Design Pattern for Text Ranking with Pretrained Sequence-to-Sequence Models. arXiv:2101.05667 (2021)."},{"key":"e_1_3_2_1_23_1","volume-title":"Proceedings of the Twenty-Ninth Text REtrieval Conference (TREC 2020)","author":"Qiao Yixuan","year":"2020","unstructured":"Yixuan Qiao, Hao Chen, Liyu Cao, Liping Chen, Pengyong Li, Jun Wang, Peng Gao, Yuan Ni, and Guotong Xie. 2020. PASH at TREC 2020 Deep Learning Track: Dense Matching for Nested Ranking. In Proceedings of the Twenty-Ninth Text REtrieval Conference (TREC 2020)."},{"key":"e_1_3_2_1_24_1","first-page":"1","article-title":"Exploring the Limits of Transfer Learning with a Unified Text-to-Text Transformer","volume":"21","author":"Raffel Colin","year":"2020","unstructured":"
Colin Raffel, Noam Shazeer, Adam Roberts, Katherine Lee, Sharan Narang, Michael Matena, Yanqi Zhou, Wei Li, and Peter J. Liu. 2020. Exploring the Limits of Transfer Learning with a Unified Text-to-Text Transformer. Journal of Machine Learning Research, Vol. 21, 140 (2020), 1--67.","journal-title":"Journal of Machine Learning Research"},{"key":"e_1_3_2_1_25_1","doi-asserted-by":"publisher","DOI":"10.1145\/3477495.3531728"},{"volume-title":"Proceedings of the 2020 Conference on Empirical Methods in Natural Language Processing: System Demonstrations. Online, 38--45","author":"Wolf Thomas","key":"e_1_3_2_1_26_1","unstructured":"Thomas Wolf, Lysandre Debut, Victor Sanh, Julien Chaumond, Clement Delangue, Anthony Moi, Pierric Cistac, Tim Rault, R\u00e9mi Louf, Morgan Funtowicz, Joe Davison, Sam Shleifer, Patrick von Platen, Clara Ma, Yacine Jernite, Julien Plu, Canwen Xu, Teven Le Scao, Sylvain Gugger, Mariama Drame, Quentin Lhoest, and Alexander M. Rush. 2020. Transformers: State-of-the-Art Natural Language Processing. In Proceedings of the 2020 Conference on Empirical Methods in Natural Language Processing: System Demonstrations. Online, 38--45."},{"key":"e_1_3_2_1_27_1","volume-title":"Proceedings of the 9th International Conference on Learning Representations (ICLR 2021)","author":"Xiong Lee","year":"2021","unstructured":"Lee Xiong, Chenyan Xiong, Ye Li, Kwok-Fung Tang, Jialin Liu, Paul N. 
Bennett, Junaid Ahmed, and Arnold Overwijk. 2021. Approximate Nearest Neighbor Negative Contrastive Learning for Dense Text Retrieval. In Proceedings of the 9th International Conference on Learning Representations (ICLR 2021)."},{"key":"e_1_3_2_1_28_1","volume-title":"Proceedings of the Twenty-Eighth Text REtrieval Conference Proceedings (TREC 2019)","author":"Yan Ming","year":"2019","unstructured":"Ming Yan, Chenliang Li, Chen Wu, Bin Bi, Wei Wang, Jiangnan Xia, and Luo Si. 2019. IDST at TREC 2019 Deep Learning Track: Deep Cascade Ranking with Generation-based Document Expansion and Pre-trained Language Modeling. 
In Proceedings of the Twenty-Eighth Text REtrieval Conference Proceedings (TREC 2019)."},{"key":"e_1_3_2_1_29_1","doi-asserted-by":"publisher","DOI":"10.1145\/3077136.3080721"},{"key":"e_1_3_2_1_30_1","doi-asserted-by":"publisher","DOI":"10.1145\/3239571"},{"key":"e_1_3_2_1_31_1","doi-asserted-by":"publisher","DOI":"10.1145\/3331184.3331340"}],"event":{"name":"SIGIR '22: The 45th International ACM SIGIR Conference on Research and Development in Information Retrieval","sponsor":["SIGIR ACM Special Interest Group on Information Retrieval"],"location":"Madrid Spain","acronym":"SIGIR '22"},"container-title":["Proceedings of the 45th International ACM SIGIR Conference on Research and Development in Information Retrieval"],"original-title":[],"link":[{"URL":"https:\/\/dl.acm.org\/doi\/pdf\/10.1145\/3477495.3531749","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2023,7,6]],"date-time":"2023-07-06T13:20:02Z","timestamp":1688649602000},"score":1,"resource":{"primary":{"URL":"https:\/\/dl.acm.org\/doi\/10.1145\/3477495.3531749"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2022,7,6]]},"references-count":31,"alternative-id":["10.1145\/3477495.3531749","10.1145\/3477495"],"URL":"https:\/\/doi.org\/10.1145\/3477495.3531749","relation":{},"subject":[],"published":{"date-parts":[[2022,7,6]]},"assertion":[{"value":"2022-07-07","order":2,"name":"published","label":"Published","group":{"name":"publication_history","label":"Publication History"}}]}}