{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2024,10,23]],"date-time":"2024-10-23T00:25:06Z","timestamp":1729643106048,"version":"3.28.0"},"publisher-location":"New York, NY, USA","reference-count":63,"publisher":"ACM","content-domain":{"domain":["dl.acm.org"],"crossmark-restriction":true},"short-container-title":[],"published-print":{"date-parts":[[2021,10,26]]},"DOI":"10.1145\/3459637.3481950","type":"proceedings-article","created":{"date-parts":[[2021,11,15]],"date-time":"2021-11-15T10:53:43Z","timestamp":1636973623000},"page":"4173-4183","update-policy":"http:\/\/dx.doi.org\/10.1145\/crossmark-policy","source":"Crossref","is-referenced-by-count":0,"title":["SAUCE"],"prefix":"10.1145","author":[{"given":"Muntasir","family":"Wahed","sequence":"first","affiliation":[{"name":"Virginia Tech, Blacksburg, VA, USA"}]},{"given":"Daniel","family":"Gruhl","sequence":"additional","affiliation":[{"name":"IBM Research Almaden, San Jose, CA, USA"}]},{"given":"Alfredo","family":"Alba","sequence":"additional","affiliation":[{"name":"IBM Research Almaden, San Jose, CA, USA"}]},{"given":"Anna Lisa","family":"Gentile","sequence":"additional","affiliation":[{"name":"IBM Research Almaden, San Jose, CA, USA"}]},{"given":"Petar","family":"Ristoski","sequence":"additional","affiliation":[{"name":"eBay Inc, San Jose, CA, USA"}]},{"given":"Chad","family":"DeLuca","sequence":"additional","affiliation":[{"name":"IBM Research Almaden, San Jose, CA, USA"}]},{"given":"Steve","family":"Welch","sequence":"additional","affiliation":[{"name":"IBM Research Almaden, San Jose, CA, USA"}]},{"given":"Ismini","family":"Lourentzou","sequence":"additional","affiliation":[{"name":"Virginia Tech, Blacksburg, VA, USA"}]}],"member":"320","published-online":{"date-parts":[[2021,10,30]]},"reference":[{"volume-title":"Databases Theory and Applications, Renata Borovica-Gajic","author":"Abulaish Muhammad","key":"e_1_3_2_1_1_1","unstructured":"Muhammad Abulaish , Mohd Fazil , and Tarique Anwar . 2020. A Contextual Semantic-Based Approach for Domain-Centric Lexicon Expansion . In Databases Theory and Applications, Renata Borovica-Gajic , Jianzhong Qi, and Weiqing Wang (Eds.). Springer International Publishing , 216--224. Muhammad Abulaish, Mohd Fazil, and Tarique Anwar. 2020. A Contextual Semantic-Based Approach for Domain-Centric Lexicon Expansion. In Databases Theory and Applications, Renata Borovica-Gajic, Jianzhong Qi, and Weiqing Wang (Eds.). Springer International Publishing, 216--224."},{"key":"e_1_3_2_1_2_1","doi-asserted-by":"publisher","DOI":"10.1109\/ICDMW.2017.103"},{"key":"e_1_3_2_1_3_1","volume-title":"Finbert: Financial sentiment analysis with pre-trained language models. arXiv preprint arXiv:1908.10063","author":"Araci Dogu","year":"2019","unstructured":"Dogu Araci . 2019 . Finbert: Financial sentiment analysis with pre-trained language models. arXiv preprint arXiv:1908.10063 (2019). Dogu Araci. 2019. Finbert: Financial sentiment analysis with pre-trained language models. arXiv preprint arXiv:1908.10063 (2019)."},{"key":"e_1_3_2_1_4_1","unstructured":"Payal Bajaj Daniel Campos Nick Craswell Li Deng Jianfeng Gao Xiaodong Liu Rangan Majumder Andrew McNamara Bhaskar Mitra Tri Nguyen etal 2016. Ms marco: A human generated machine reading comprehension dataset. arXiv preprint arXiv:1611.09268 (2016). Payal Bajaj Daniel Campos Nick Craswell Li Deng Jianfeng Gao Xiaodong Liu Rangan Majumder Andrew McNamara Bhaskar Mitra Tri Nguyen et al. 2016. Ms marco: A human generated machine reading comprehension dataset. arXiv preprint arXiv:1611.09268 (2016)."},{"key":"e_1_3_2_1_5_1","doi-asserted-by":"publisher","DOI":"10.1007\/978-3-030-01258-8_13"},{"key":"e_1_3_2_1_6_1","first-page":"1877","article-title":"Language Models are Few-Shot Learners","volume":"33","author":"Brown Tom","year":"2020","unstructured":"Tom Brown , Benjamin Mann , Nick Ryder , Melanie Subbiah , Jared D Kaplan , Prafulla Dhariwal , Arvind Neelakantan , Pranav Shyam , Girish Sastry , Amanda Askell , Sandhini Agarwal , Ariel Herbert-Voss , Gretchen Krueger , Tom Henighan , Rewon Child , Aditya Ramesh , Daniel Ziegler , Jeffrey Wu , Clemens Winter , Chris Hesse , Mark Chen , Eric Sigler , Mateusz Litwin , Scott Gray , Benjamin Chess , Jack Clark , Christopher Berner , Sam McCandlish , Alec Radford , Ilya Sutskever , and Dario Amodei . 2020 . Language Models are Few-Shot Learners . In Advances in Neural Information Processing Systems (NIPS) , Vol. 33. 1877 -- 1901 . Tom Brown, Benjamin Mann, Nick Ryder, Melanie Subbiah, Jared D Kaplan, Prafulla Dhariwal, Arvind Neelakantan, Pranav Shyam, Girish Sastry, Amanda Askell, Sandhini Agarwal, Ariel Herbert-Voss, Gretchen Krueger, Tom Henighan, Rewon Child, Aditya Ramesh, Daniel Ziegler, Jeffrey Wu, Clemens Winter, Chris Hesse, Mark Chen, Eric Sigler, Mateusz Litwin, Scott Gray, Benjamin Chess, Jack Clark, Christopher Berner, Sam McCandlish, Alec Radford, Ilya Sutskever, and Dario Amodei. 2020. Language Models are Few-Shot Learners. In Advances in Neural Information Processing Systems (NIPS), Vol. 33. 1877--1901.","journal-title":"Advances in Neural Information Processing Systems (NIPS)"},{"key":"e_1_3_2_1_7_1","doi-asserted-by":"publisher","DOI":"10.5555\/2898607.2898816"},{"key":"e_1_3_2_1_8_1","doi-asserted-by":"publisher","DOI":"10.1145\/2537734.2537742"},{"key":"e_1_3_2_1_9_1","doi-asserted-by":"publisher","DOI":"10.1016\/j.websem.2018.09.001"},{"key":"e_1_3_2_1_10_1","doi-asserted-by":"publisher","DOI":"10.1145\/2835776.2835778"},{"key":"e_1_3_2_1_11_1","doi-asserted-by":"publisher","DOI":"10.4218\/etrij.17.0116.0074"},{"key":"e_1_3_2_1_12_1","doi-asserted-by":"publisher","DOI":"10.4218\/etrij.17.0116.0074"},{"key":"e_1_3_2_1_13_1","volume-title":"Proceedings of the 10th Conference of the Pacific Association for Computational Linguistics (PACLING)","volume":"6","author":"Curran James R","year":"2007","unstructured":"James R Curran , Tara Murphy , and Bernhard Scholz . 2007 . Minimising semantic drift with mutual exclusion bootstrapping . In Proceedings of the 10th Conference of the Pacific Association for Computational Linguistics (PACLING) , Vol. 6 . Citeseer, 172--180. James R Curran, Tara Murphy, and Bernhard Scholz. 2007. Minimising semantic drift with mutual exclusion bootstrapping. In Proceedings of the 10th Conference of the Pacific Association for Computational Linguistics (PACLING), Vol. 6. Citeseer, 172--180."},{"key":"e_1_3_2_1_14_1","volume-title":"Proceedings of the 2019 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies (NAACL-HLT)","volume":"1","author":"Devlin Jacob","year":"2019","unstructured":"Jacob Devlin , Ming-Wei Chang , Kenton Lee , and Kristina Toutanova . 2019 . BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding . In Proceedings of the 2019 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies (NAACL-HLT) , Volume 1 (Long and Short Papers). 4171--4186. Jacob Devlin, Ming-Wei Chang, Kenton Lee, and Kristina Toutanova. 2019. BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding. In Proceedings of the 2019 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies (NAACL-HLT), Volume 1 (Long and Short Papers). 4171--4186."},{"key":"e_1_3_2_1_15_1","doi-asserted-by":"publisher","DOI":"10.18653\/v1\/2020.coling-main.481"},{"key":"e_1_3_2_1_16_1","doi-asserted-by":"publisher","DOI":"10.5555\/1090483.1644538"},{"key":"e_1_3_2_1_17_1","doi-asserted-by":"publisher","DOI":"10.5555\/86197.86204"},{"key":"e_1_3_2_1_18_1","doi-asserted-by":"publisher","DOI":"10.5555\/2002736.2002798"},{"key":"e_1_3_2_1_19_1","doi-asserted-by":"publisher","DOI":"10.5555\/2002736.2002798"},{"key":"e_1_3_2_1_20_1","doi-asserted-by":"publisher","DOI":"10.1145\/2063576.2063629"},{"key":"e_1_3_2_1_21_1","doi-asserted-by":"publisher","DOI":"10.5555\/2976248.2976303"},{"key":"e_1_3_2_1_22_1","doi-asserted-by":"publisher","DOI":"10.1145\/3077136.3080789"},{"key":"e_1_3_2_1_23_1","volume-title":"Petar Ristoski, Linda Ha Kato, Chad Eric DeLuca, Steven R. Welch, Alfredo Alba, and Ismini Lourentzou.","author":"Gruhl Daniel","year":"2020","unstructured":"Daniel Gruhl , Anna Lisa Gentile , Petar Ristoski, Linda Ha Kato, Chad Eric DeLuca, Steven R. Welch, Alfredo Alba, and Ismini Lourentzou. 2020 . Corpus Expansion Using Lexical Signatures. US Patent App . 17\/238288. Daniel Gruhl, Anna Lisa Gentile, Petar Ristoski, Linda Ha Kato, Chad Eric DeLuca, Steven R. Welch, Alfredo Alba, and Ismini Lourentzou. 2020. Corpus Expansion Using Lexical Signatures. US Patent App. 17\/238288."},{"key":"e_1_3_2_1_24_1","volume-title":"Smith","author":"Gururangan Suchin","year":"2020","unstructured":"Suchin Gururangan , Ana Marasovi\u0107 , Swabha Swayamdipta , Kyle Lo , Iz Beltagy , Doug Downey , and Noah A . Smith . 2020 . Don't Stop Pretraining : Adapt Language Models to Domains and Tasks. In Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics (ACL). Association for Computational Linguistics , 8342--8360. Suchin Gururangan, Ana Marasovi\u0107, Swabha Swayamdipta, Kyle Lo, Iz Beltagy, Doug Downey, and Noah A. Smith. 2020. Don't Stop Pretraining: Adapt Language Models to Domains and Tasks. In Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics (ACL). Association for Computational Linguistics, 8342--8360."},{"key":"e_1_3_2_1_25_1","doi-asserted-by":"publisher","DOI":"10.5555\/944919.944968"},{"key":"e_1_3_2_1_26_1","doi-asserted-by":"publisher","DOI":"10.1145\/3196826"},{"key":"e_1_3_2_1_27_1","doi-asserted-by":"publisher","DOI":"10.1609\/aaai.v34i05.6293"},{"key":"e_1_3_2_1_28_1","doi-asserted-by":"publisher","DOI":"10.1145\/1963405.1963467"},{"key":"e_1_3_2_1_29_1","doi-asserted-by":"publisher","DOI":"10.1145\/3366423.3380284"},{"key":"e_1_3_2_1_30_1","doi-asserted-by":"publisher","DOI":"10.1145\/2505515.2505665"},{"key":"e_1_3_2_1_31_1","volume-title":"Billion-scale similarity search with GPUs. arXiv preprint arXiv:1702.08734","author":"Johnson Jeff","year":"2017","unstructured":"Jeff Johnson , Matthijs Douze , and Herv\u00e9 J\u00e9gou . 2017. Billion-scale similarity search with GPUs. arXiv preprint arXiv:1702.08734 ( 2017 ). Jeff Johnson, Matthijs Douze, and Herv\u00e9 J\u00e9gou. 2017. Billion-scale similarity search with GPUs. arXiv preprint arXiv:1702.08734 (2017)."},{"key":"e_1_3_2_1_32_1","doi-asserted-by":"publisher","DOI":"10.1109\/IALP48816.2019.9037567"},{"key":"e_1_3_2_1_33_1","doi-asserted-by":"publisher","DOI":"10.1109\/IALP48816.2019.9037567"},{"key":"e_1_3_2_1_34_1","doi-asserted-by":"publisher","DOI":"10.1007\/978-3-540-87599-4_38"},{"key":"e_1_3_2_1_35_1","volume-title":"Current Limitations of Language Models: What You Need is Retrieval. arXiv preprint arXiv:2009.06857","author":"Komatsuzaki Aran","year":"2020","unstructured":"Aran Komatsuzaki . 2020. Current Limitations of Language Models: What You Need is Retrieval. arXiv preprint arXiv:2009.06857 ( 2020 ). Aran Komatsuzaki. 2020. Current Limitations of Language Models: What You Need is Retrieval. arXiv preprint arXiv:2009.06857 (2020)."},{"key":"e_1_3_2_1_36_1","volume-title":"Unsupervised Machine Translation Using Monolingual Corpora Only. In International Conference on Learning Representations (ICLR).","author":"Lample Guillaume","year":"2018","unstructured":"Guillaume Lample , Alexis Conneau , Ludovic Denoyer , and Marc'Aurelio Ranzato . 2018 . Unsupervised Machine Translation Using Monolingual Corpora Only. In International Conference on Learning Representations (ICLR). Guillaume Lample, Alexis Conneau, Ludovic Denoyer, and Marc'Aurelio Ranzato. 2018. Unsupervised Machine Translation Using Monolingual Corpora Only. In International Conference on Learning Representations (ICLR)."},{"key":"e_1_3_2_1_37_1","doi-asserted-by":"crossref","first-page":"1234","DOI":"10.1093\/bioinformatics\/btz682","article-title":"BioBERT: a pre-trained biomedical language representation model for biomedical text mining","volume":"36","author":"Lee Jinhyuk","year":"2020","unstructured":"Jinhyuk Lee , Wonjin Yoon , Sungdong Kim , Donghyeon Kim , Sunkyu Kim , Chan Ho So , and Jaewoo Kang . 2020 . BioBERT: a pre-trained biomedical language representation model for biomedical text mining . Bioinformatics , Vol. 36 , 4 (2020), 1234 -- 1240 . Jinhyuk Lee, Wonjin Yoon, Sungdong Kim, Donghyeon Kim, Sunkyu Kim, Chan Ho So, and Jaewoo Kang. 2020. BioBERT: a pre-trained biomedical language representation model for biomedical text mining. Bioinformatics, Vol. 36, 4 (2020), 1234--1240.","journal-title":"Bioinformatics"},{"key":"e_1_3_2_1_38_1","volume-title":"Roberta: A robustly optimized bert pretraining approach. arXiv preprint arXiv:1907.11692","author":"Liu Yinhan","year":"2019","unstructured":"Yinhan Liu , Myle Ott , Naman Goyal , Jingfei Du , Mandar Joshi , Danqi Chen , Omer Levy , Mike Lewis , Luke Zettlemoyer , and Veselin Stoyanov . 2019 . Roberta: A robustly optimized bert pretraining approach. arXiv preprint arXiv:1907.11692 (2019). Yinhan Liu, Myle Ott, Naman Goyal, Jingfei Du, Mandar Joshi, Danqi Chen, Omer Levy, Mike Lewis, Luke Zettlemoyer, and Veselin Stoyanov. 2019. Roberta: A robustly optimized bert pretraining approach. arXiv preprint arXiv:1907.11692 (2019)."},{"key":"e_1_3_2_1_39_1","doi-asserted-by":"publisher","DOI":"10.18653\/v1\/D19-6109"},{"key":"e_1_3_2_1_40_1","volume-title":"Efficient and robust approximate nearest neighbor search using hierarchical navigable small world graphs","author":"Malkov Yury A","year":"2018","unstructured":"Yury A Malkov and Dmitry A Yashunin . 2018. Efficient and robust approximate nearest neighbor search using hierarchical navigable small world graphs . IEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI) ( 2018 ). Yury A Malkov and Dmitry A Yashunin. 2018. Efficient and robust approximate nearest neighbor search using hierarchical navigable small world graphs. IEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI) (2018)."},{"key":"e_1_3_2_1_41_1","doi-asserted-by":"publisher","DOI":"10.5555\/1394399"},{"key":"e_1_3_2_1_42_1","doi-asserted-by":"publisher","DOI":"10.18653\/v1\/N18-1049"},{"key":"e_1_3_2_1_43_1","doi-asserted-by":"publisher","DOI":"10.5555\/1699571.1699635"},{"key":"e_1_3_2_1_44_1","doi-asserted-by":"publisher","DOI":"10.1080\/14786440109462720"},{"key":"e_1_3_2_1_45_1","volume-title":"Proceedings of COLING 2014, the 25th International Conference on Computational Linguistics: Technical Papers. 1154--1164","author":"Qiu Xipeng","year":"2014","unstructured":"Xipeng Qiu , ChaoChao Huang , and Xuan-Jing Huang . 2014 . Automatic corpus expansion for Chinese word segmentation by exploiting the redundancy of web information . In Proceedings of COLING 2014, the 25th International Conference on Computational Linguistics: Technical Papers. 1154--1164 . Xipeng Qiu, ChaoChao Huang, and Xuan-Jing Huang. 2014. Automatic corpus expansion for Chinese word segmentation by exploiting the redundancy of web information. In Proceedings of COLING 2014, the 25th International Conference on Computational Linguistics: Technical Papers. 1154--1164."},{"volume-title":"Proceedings of the 2019 Conference on Empirical Methods in Natural Language Processing and the 9th International Joint Conference on Natural Language Processing (EMNLP-IJCNLP)","author":"Reimers Nils","key":"e_1_3_2_1_46_1","unstructured":"Nils Reimers and Iryna Gurevych . 2019. Sentence-BERT: Sentence Embeddings using Siamese BERT-Networks . In Proceedings of the 2019 Conference on Empirical Methods in Natural Language Processing and the 9th International Joint Conference on Natural Language Processing (EMNLP-IJCNLP) . Association for Computational Linguistics , 3982--3992. Nils Reimers and Iryna Gurevych. 2019. Sentence-BERT: Sentence Embeddings using Siamese BERT-Networks. In Proceedings of the 2019 Conference on Empirical Methods in Natural Language Processing and the 9th International Joint Conference on Natural Language Processing (EMNLP-IJCNLP). Association for Computational Linguistics, 3982--3992."},{"key":"e_1_3_2_1_47_1","volume-title":"Proceedings of the Tenth International Conference on Language Resources and Evaluation (LREC'16)","author":"Remus Steffen","year":"2016","unstructured":"Steffen Remus and Chris Biemann . 2016 a. Domain-specific corpus expansion with focused webcrawling . In Proceedings of the Tenth International Conference on Language Resources and Evaluation (LREC'16) . 3607--3611. Steffen Remus and Chris Biemann. 2016a. Domain-specific corpus expansion with focused webcrawling. In Proceedings of the Tenth International Conference on Language Resources and Evaluation (LREC'16). 3607--3611."},{"key":"e_1_3_2_1_48_1","volume-title":"Proceedings of the Tenth International Conference on Language Resources and Evaluation (LREC'16)","author":"Remus Steffen","year":"2016","unstructured":"Steffen Remus and Chris Biemann . 2016 b. Domain-specific corpus expansion with focused webcrawling . In Proceedings of the Tenth International Conference on Language Resources and Evaluation (LREC'16) . 3607--3611. Steffen Remus and Chris Biemann. 2016b. Domain-specific corpus expansion with focused webcrawling. In Proceedings of the Tenth International Conference on Language Resources and Evaluation (LREC'16). 3607--3611."},{"key":"e_1_3_2_1_49_1","volume-title":"The SMART system. Retrieval Results and Future Plans","author":"Salton G","year":"1971","unstructured":"G Salton . 1971. The SMART system. Retrieval Results and Future Plans ( 1971 ). G Salton. 1971. The SMART system. Retrieval Results and Future Plans (1971)."},{"key":"e_1_3_2_1_50_1","volume-title":"Workshop on Energy Efficient Machine Learning and Cognitive Computing at NeurIPS.","author":"Sanh Victor","year":"2019","unstructured":"Victor Sanh , Lysandre Debut , Julien Chaumond , and Thomas Wolf . 2019 . DistilBERT, a distilled version of BERT: smaller, faster, cheaper and lighter . In Workshop on Energy Efficient Machine Learning and Cognitive Computing at NeurIPS. Victor Sanh, Lysandre Debut, Julien Chaumond, and Thomas Wolf. 2019. DistilBERT, a distilled version of BERT: smaller, faster, cheaper and lighter. In Workshop on Energy Efficient Machine Learning and Cognitive Computing at NeurIPS."},{"key":"e_1_3_2_1_51_1","doi-asserted-by":"publisher","DOI":"10.1145\/1321440.1321585"},{"key":"e_1_3_2_1_52_1","doi-asserted-by":"publisher","DOI":"10.18653\/v1\/2020.emnlp-main.577"},{"key":"e_1_3_2_1_53_1","doi-asserted-by":"publisher","DOI":"10.1162\/089976698300017467"},{"key":"e_1_3_2_1_54_1","volume-title":"International Conference on Learning Representations (ICLR).","author":"Sharma Archit","year":"2019","unstructured":"Archit Sharma , Shixiang Gu , Sergey Levine , Vikash Kumar , and Karol Hausman . 2019 . Dynamics-aware unsupervised discovery of skills . In International Conference on Learning Representations (ICLR). Archit Sharma, Shixiang Gu, Sergey Levine, Vikash Kumar, and Karol Hausman. 2019. Dynamics-aware unsupervised discovery of skills. In International Conference on Learning Representations (ICLR)."},{"key":"e_1_3_2_1_55_1","doi-asserted-by":"publisher","DOI":"10.1145\/2736277.2741644"},{"key":"e_1_3_2_1_56_1","doi-asserted-by":"publisher","DOI":"10.1109\/ICDM.2008.145"},{"key":"e_1_3_2_1_57_1","doi-asserted-by":"publisher","DOI":"10.1145\/1553374.1553516"},{"key":"e_1_3_2_1_58_1","doi-asserted-by":"publisher","DOI":"10.18653\/v1\/2020.emnlp-demos.6"},{"key":"e_1_3_2_1_59_1","volume-title":"Approximate Nearest Neighbor Negative Contrastive Learning for Dense Text Retrieval. In International Conference on Learning Representations (ICLR).","author":"Xiong Lee","year":"2021","unstructured":"Lee Xiong , Chenyan Xiong , Ye Li , Kwok-Fung Tang , Jialin Liu , Paul N. Bennett , Junaid Ahmed , and Arnold Overwijk . 2021 . Approximate Nearest Neighbor Negative Contrastive Learning for Dense Text Retrieval. In International Conference on Learning Representations (ICLR). Lee Xiong, Chenyan Xiong, Ye Li, Kwok-Fung Tang, Jialin Liu, Paul N. Bennett, Junaid Ahmed, and Arnold Overwijk. 2021. Approximate Nearest Neighbor Negative Contrastive Learning for Dense Text Retrieval. In International Conference on Learning Representations (ICLR)."},{"key":"e_1_3_2_1_60_1","doi-asserted-by":"publisher","DOI":"10.1016\/j.neucom.2018.02.100"},{"key":"e_1_3_2_1_61_1","volume-title":"Simple applications of BERT for ad hoc document retrieval. arXiv preprint arXiv:1903.10972","author":"Yang Wei","year":"2019","unstructured":"Wei Yang , Haotian Zhang , and Jimmy Lin . 2019. Simple applications of BERT for ad hoc document retrieval. arXiv preprint arXiv:1903.10972 ( 2019 ). Wei Yang, Haotian Zhang, and Jimmy Lin. 2019. Simple applications of BERT for ad hoc document retrieval. arXiv preprint arXiv:1903.10972 (2019)."},{"key":"e_1_3_2_1_62_1","doi-asserted-by":"publisher","DOI":"10.1145\/3331184.3331359"},{"key":"e_1_3_2_1_63_1","doi-asserted-by":"publisher","DOI":"10.1145\/3409256.3409811"}],"event":{"name":"CIKM '21: The 30th ACM International Conference on Information and Knowledge Management","sponsor":["SIGWEB ACM Special Interest Group on Hypertext, Hypermedia, and Web","SIGIR ACM Special Interest Group on Information Retrieval"],"location":"Virtual Event Queensland Australia","acronym":"CIKM '21"},"container-title":["Proceedings of the 30th ACM International Conference on Information & Knowledge Management"],"original-title":[],"link":[{"URL":"https:\/\/dl.acm.org\/doi\/pdf\/10.1145\/3459637.3481950","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2023,11,12]],"date-time":"2023-11-12T12:09:14Z","timestamp":1699790954000},"score":1,"resource":{"primary":{"URL":"https:\/\/dl.acm.org\/doi\/10.1145\/3459637.3481950"}},"subtitle":["Truncated Sparse Document Signature Bit-Vectors for Fast Web-Scale Corpus Expansion"],"short-title":[],"issued":{"date-parts":[[2021,10,26]]},"references-count":63,"alternative-id":["10.1145\/3459637.3481950","10.1145\/3459637"],"URL":"https:\/\/doi.org\/10.1145\/3459637.3481950","relation":{},"subject":[],"published":{"date-parts":[[2021,10,26]]},"assertion":[{"value":"2021-10-30","order":2,"name":"published","label":"Published","group":{"name":"publication_history","label":"Publication History"}}]}}