default search action
Damai Dai
Person information
SPARQL queries
Refine list
refinements active!
zoomed in on ?? of ?? records
view refined list in
export refined list as
2020 – today
- 2024
- [c25]Damai Dai, Chengqi Deng, Chenggang Zhao, R. X. Xu, Huazuo Gao, Deli Chen, Jiashi Li, Wangding Zeng, Xingkai Yu, Y. Wu, Zhenda Xie, Y. K. Li, Panpan Huang, Fuli Luo, Chong Ruan, Zhifang Sui, Wenfeng Liang:
DeepSeekMoE: Towards Ultimate Expert Specialization in Mixture-of-Experts Language Models. ACL (1) 2024: 1280-1297 - [c24]Peiyi Wang, Lei Li, Zhihong Shao, Runxin Xu, Damai Dai, Yifei Li, Deli Chen, Yu Wu, Zhifang Sui:
Math-Shepherd: Verify and Reinforce LLMs Step-by-step without Human Annotations. ACL (1) 2024: 9426-9439 - [c23]Zihan Wang, Deli Chen, Damai Dai, Runxin Xu, Zhuoshu Li, Yu Wu:
Let the Expert Stick to His Last: Expert-Specialized Fine-Tuning for Sparse Architectural Large Language Models. EMNLP 2024: 784-801 - [c22]Qingxiu Dong, Lei Li, Damai Dai, Ce Zheng, Jingyuan Ma, Rui Li, Heming Xia, Jingjing Xu, Zhiyong Wu, Baobao Chang, Xu Sun, Zhifang Sui:
A Survey on In-context Learning. EMNLP 2024: 1107-1128 - [i33]Xiao Bi, Deli Chen, Guanting Chen, Shanhuang Chen, Damai Dai, Chengqi Deng, Honghui Ding, Kai Dong, Qiushi Du, Zhe Fu, Huazuo Gao, Kaige Gao, Wenjun Gao, Ruiqi Ge, Kang Guan, Daya Guo, Jianzhong Guo, Guangbo Hao, Zhewen Hao, Ying He, Wenjie Hu, Panpan Huang, Erhang Li, Guowei Li, Jiashi Li, Yao Li, Y. K. Li, Wenfeng Liang, Fangyun Lin, Alex X. Liu, Bo Liu, Wen Liu, Xiaodong Liu, Xin Liu, Yiyuan Liu, Haoyu Lu, Shanghao Lu, Fuli Luo, Shirong Ma, Xiaotao Nie, Tian Pei, Yishi Piao, Junjie Qiu, Hui Qu, Tongzheng Ren, Zehui Ren, Chong Ruan, Zhangli Sha, Zhihong Shao, Junxiao Song, Xuecheng Su, Jingxiang Sun, Yaofeng Sun, Minghui Tang, Bingxuan Wang, Peiyi Wang, Shiyu Wang, Yaohui Wang, Yongji Wang, Tong Wu, Y. Wu, Xin Xie, Zhenda Xie, Ziwei Xie, Yiliang Xiong, Hanwei Xu, R. X. Xu, Yanhong Xu, Dejian Yang, Yuxiang You, Shuiping Yu, Xingkai Yu, B. Zhang, Haowei Zhang, Lecong Zhang, Liyue Zhang, Mingchuan Zhang, Minghua Zhang, Wentao Zhang, Yichao Zhang, Chenggang Zhao, Yao Zhao, Shangyan Zhou, Shunfeng Zhou, Qihao Zhu, Yuheng Zou:
DeepSeek LLM: Scaling Open-Source Language Models with Longtermism. CoRR abs/2401.02954 (2024) - [i32]Fangwei Zhu, Damai Dai, Zhifang Sui:
Language Models Understand Numbers, at Least Partially. CoRR abs/2401.03735 (2024) - [i31]Damai Dai, Chengqi Deng, Chenggang Zhao, R. X. Xu, Huazuo Gao, Deli Chen, Jiashi Li, Wangding Zeng, Xingkai Yu, Y. Wu, Zhenda Xie, Y. K. Li, Panpan Huang, Fuli Luo, Chong Ruan, Zhifang Sui, Wenfeng Liang:
DeepSeekMoE: Towards Ultimate Expert Specialization in Mixture-of-Experts Language Models. CoRR abs/2401.06066 (2024) - [i30]Xiangdi Meng, Damai Dai, Weiyao Luo, Zhe Yang, Shaoxiang Wu, Xiaochen Wang, Peiyi Wang, Qingxiu Dong, Liang Chen, Zhifang Sui:
PeriodicLoRA: Breaking the Low-Rank Bottleneck in LoRA Optimization. CoRR abs/2402.16141 (2024) - [i29]Jingyuan Ma, Damai Dai, Zhifang Sui:
Large Language Models Are Unconscious of Unreasonability in Math Problems. CoRR abs/2403.19346 (2024) - [i28]DeepSeek-AI, Aixin Liu, Bei Feng, Bin Wang, Bingxuan Wang, Bo Liu, Chenggang Zhao, Chengqi Deng, Chong Ruan, Damai Dai, Daya Guo, Dejian Yang, Deli Chen, Dongjie Ji, Erhang Li, Fangyun Lin, Fuli Luo, Guangbo Hao, Guanting Chen, Guowei Li, Hao Zhang, Hanwei Xu, Hao Yang, Haowei Zhang, Honghui Ding, Huajian Xin, Huazuo Gao, Hui Li, Hui Qu, J. L. Cai, Jian Liang, Jianzhong Guo, Jiaqi Ni, Jiashi Li, Jin Chen, Jingyang Yuan, Junjie Qiu, Junxiao Song, Kai Dong, Kaige Gao, Kang Guan, Lean Wang, Lecong Zhang, Lei Xu, Leyi Xia, Liang Zhao, Liyue Zhang, Meng Li, Miaojun Wang, Mingchuan Zhang, Minghua Zhang, Minghui Tang, Mingming Li, Ning Tian, Panpan Huang, Peiyi Wang, Peng Zhang, Qihao Zhu, Qinyu Chen, Qiushi Du, R. J. Chen, R. L. Jin, Ruiqi Ge, Ruizhe Pan, Runxin Xu, Ruyi Chen, S. S. Li, Shanghao Lu, Shangyan Zhou, Shanhuang Chen, Shaoqing Wu, Shengfeng Ye, Shirong Ma, Shiyu Wang, Shuang Zhou, Shuiping Yu, Shunfeng Zhou, Size Zheng, Tao Wang, Tian Pei, Tian Yuan, Tianyu Sun, W. L. Xiao, Wangding Zeng, Wei An, Wen Liu, Wenfeng Liang, Wenjun Gao, Wentao Zhang, X. Q. Li, Xiangyue Jin, Xianzu Wang, Xiao Bi, Xiaodong Liu, Xiaohan Wang, Xiaojin Shen, Xiaokang Chen, Xiaosha Chen, Xiaotao Nie, Xiaowen Sun:
DeepSeek-V2: A Strong, Economical, and Efficient Mixture-of-Experts Language Model. CoRR abs/2405.04434 (2024) - [i27]Yudong Wang, Damai Dai, Zhifang Sui:
Exploring Activation Patterns of Parameters in Language Models. CoRR abs/2405.17799 (2024) - [i26]DeepSeek-AI, Qihao Zhu, Daya Guo, Zhihong Shao, Dejian Yang, Peiyi Wang, Runxin Xu, Y. Wu, Yukun Li, Huazuo Gao, Shirong Ma, Wangding Zeng, Xiao Bi, Zihui Gu, Hanwei Xu, Damai Dai, Kai Dong, Liyue Zhang, Yishi Piao, Zhibin Gou, Zhenda Xie, Zhewen Hao, Bingxuan Wang, Junxiao Song, Deli Chen, Xin Xie, Kang Guan, Yuxiang You, Aixin Liu, Qiushi Du, Wenjun Gao, Xuan Lu, Qinyu Chen, Yaohui Wang, Chengqi Deng, Jiashi Li, Chenggang Zhao, Chong Ruan, Fuli Luo, Wenfeng Liang:
DeepSeek-Coder-V2: Breaking the Barrier of Closed-Source Models in Code Intelligence. CoRR abs/2406.11931 (2024) - [i25]Zihan Wang, Deli Chen, Damai Dai, Runxin Xu, Zhuoshu Li, Y. Wu:
Let the Expert Stick to His Last: Expert-Specialized Fine-Tuning for Sparse Architectural Large Language Models. CoRR abs/2407.01906 (2024) - [i24]Lean Wang, Huazuo Gao, Chenggang Zhao, Xu Sun, Damai Dai:
Auxiliary-Loss-Free Load Balancing Strategy for Mixture-of-Experts. CoRR abs/2408.15664 (2024) - 2023
- [c21]Shaoxiang Wu, Damai Dai, Ziwei Qin, Tianyu Liu, Binghuai Lin, Yunbo Cao, Zhifang Sui:
Denoising Bottleneck with Mutual Information Maximization for Video Multimodal Fusion. ACL (1) 2023: 2231-2243 - [c20]Damai Dai, Yutao Sun, Li Dong, Yaru Hao, Shuming Ma, Zhifang Sui, Furu Wei:
Why Can GPT Learn In-Context? Language Models Secretly Perform Gradient Descent as Meta-Optimizers. ACL (Findings) 2023: 4005-4019 - [c19]Shoujie Tong, Heming Xia, Damai Dai, Runxin Xu, Tianyu Liu, Binghuai Lin, Yunbo Cao, Zhifang Sui:
Bi-Drop: Enhancing Fine-tuning Generalization via Synchronous sub-net Estimation and Optimization. EMNLP (Findings) 2023: 5214-5227 - [c18]Lean Wang, Lei Li, Damai Dai, Deli Chen, Hao Zhou, Fandong Meng, Jie Zhou, Xu Sun:
Label Words are Anchors: An Information Flow Perspective for Understanding In-Context Learning. EMNLP 2023: 9840-9855 - [c17]Zhe Yang, Damai Dai, Peiyi Wang, Zhifang Sui:
Not All Demonstration Examples are Equally Beneficial: Reweighting Demonstration Examples for In-Context Learning. EMNLP (Findings) 2023: 13209-13221 - [c16]Damai Dai, Jing Ren, Shuang Zeng, Baobao Chang, Zhifang Sui:
Coarse-to-Fine Entity Representations for Document-Level Relation Extraction. NLPCC (2) 2023: 185-197 - [c15]Damai Dai, Wenbin Jiang, Jiyuan Zhang, Yajuan Lyu, Zhifang Sui, Baobao Chang:
Mixture-of-Experts for Biomedical Question Answering. NLPCC (1) 2023: 604-615 - [c14]Damai Dai, Wenbin Jiang, Qingxiu Dong, Yajuan Lyu, Zhifang Sui:
Neural Knowledge Bank for Pretrained Transformers. NLPCC (2) 2023: 772-783 - [i23]Qingxiu Dong, Lei Li, Damai Dai, Ce Zheng, Zhiyong Wu, Baobao Chang, Xu Sun, Jingjing Xu, Lei Li, Zhifang Sui:
A Survey for In-context Learning. CoRR abs/2301.00234 (2023) - [i22]Lean Wang, Lei Li, Damai Dai, Deli Chen, Hao Zhou, Fandong Meng, Jie Zhou, Xu Sun:
Label Words are Anchors: An Information Flow Perspective for Understanding In-Context Learning. CoRR abs/2305.14160 (2023) - [i21]Shaoxiang Wu, Damai Dai, Ziwei Qin, Tianyu Liu, Binghuai Lin, Yunbo Cao, Zhifang Sui:
Denoising Bottleneck with Mutual Information Maximization for Video Multimodal Fusion. CoRR abs/2305.14652 (2023) - [i20]Shoujie Tong, Heming Xia, Damai Dai, Tianyu Liu, Binghuai Lin, Yunbo Cao, Zhifang Sui:
Bi-Drop: Generalizable Fine-tuning for Pre-trained Language Models via Adaptive Subnetwork Optimization. CoRR abs/2305.14760 (2023) - [i19]Zhe Yang, Damai Dai, Peiyi Wang, Zhifang Sui:
Not All Demonstration Examples are Equally Beneficial: Reweighting Demonstration Examples for In-Context Learning. CoRR abs/2310.08309 (2023) - [i18]Peiyi Wang, Lei Li, Zhihong Shao, R. X. Xu, Damai Dai, Yifei Li, Deli Chen, Y. Wu, Zhifang Sui:
Math-Shepherd: Verify and Reinforce LLMs Step-by-step without Human Annotations. CoRR abs/2312.08935 (2023) - 2022
- [c13]Peiyi Wang, Liang Chen, Tianyu Liu, Damai Dai, Yunbo Cao, Baobao Chang, Zhifang Sui:
Hierarchical Curriculum Learning for AMR Parsing. ACL (2) 2022: 333-339 - [c12]Damai Dai, Li Dong, Shuming Ma, Bo Zheng, Zhifang Sui, Baobao Chang, Furu Wei:
StableMoE: Stable Routing Strategy for Mixture of Experts. ACL (1) 2022: 7085-7095 - [c11]Damai Dai, Li Dong, Yaru Hao, Zhifang Sui, Baobao Chang, Furu Wei:
Knowledge Neurons in Pretrained Transformers. ACL (1) 2022: 8493-8502 - [c10]Qingxiu Dong, Damai Dai, Yifan Song, Jingjing Xu, Zhifang Sui, Lei Li:
Calibrating Factual Knowledge in Pretrained Language Models. EMNLP (Findings) 2022: 5937-5947 - [c9]Shoujie Tong, Qingxiu Dong, Damai Dai, Yifan Song, Tianyu Liu, Baobao Chang, Zhifang Sui:
Robust Fine-tuning via Perturbation and Interpolation from In-batch Instances. IJCAI 2022: 4397-4403 - [c8]Zewen Chi, Li Dong, Shaohan Huang, Damai Dai, Shuming Ma, Barun Patra, Saksham Singhal, Payal Bajaj, Xia Song, Xian-Ling Mao, Heyan Huang, Furu Wei:
On the Representation Collapse of Sparse Mixture of Experts. NeurIPS 2022 - [c7]Damai Dai, Hua Zheng, Zhifang Sui, Baobao Chang:
Plug-and-Play Module for Commonsense Reasoning in Machine Reading Comprehension. NLPCC (2) 2022: 29-41 - [i17]Damai Dai, Wenbin Jiang, Jiyuan Zhang, Weihua Peng, Yajuan Lyu, Zhifang Sui, Baobao Chang, Yong Zhu:
Mixture of Experts for Biomedical Question Answering. CoRR abs/2204.07469 (2022) - [i16]Damai Dai, Li Dong, Shuming Ma, Bo Zheng, Zhifang Sui, Baobao Chang, Furu Wei:
StableMoE: Stable Routing Strategy for Mixture of Experts. CoRR abs/2204.08396 (2022) - [i15]Zewen Chi, Li Dong, Shaohan Huang, Damai Dai, Shuming Ma, Barun Patra, Saksham Singhal, Payal Bajaj, Xia Song, Furu Wei:
On the Representation Collapse of Sparse Mixture of Experts. CoRR abs/2204.09179 (2022) - [i14]Shoujie Tong, Qingxiu Dong, Damai Dai, Yifan Song, Tianyu Liu, Baobao Chang, Zhifang Sui:
Robust Fine-tuning via Perturbation and Interpolation from In-batch Instances. CoRR abs/2205.00633 (2022) - [i13]Damai Dai, Wenbin Jiang, Qingxiu Dong, Yajuan Lyu, Qiaoqiao She, Zhifang Sui:
Neural Knowledge Bank for Pretrained Transformers. CoRR abs/2208.00399 (2022) - [i12]Qingxiu Dong, Damai Dai, Yifan Song, Jingjing Xu, Zhifang Sui, Lei Li:
Calibrating Factual Knowledge in Pretrained Language Models. CoRR abs/2210.03329 (2022) - [i11]Damai Dai, Yutao Sun, Li Dong, Yaru Hao, Zhifang Sui, Furu Wei:
Why Can GPT Learn In-Context? Language Models Secretly Perform Gradient Descent as Meta-Optimizers. CoRR abs/2212.10559 (2022) - 2021
- [c6]Peiyi Wang, Runxin Xu, Tianyu Liu, Damai Dai, Baobao Chang, Zhifang Sui:
Behind the Scenes: An Exploration of Trigger Biases Problem in Few-Shot Event Classification. CIKM 2021: 1969-1978 - [c5]Hua Zheng, Lei Li, Damai Dai, Deli Chen, Tianyu Liu, Xu Sun, Yang Liu:
Leveraging Word-Formation Knowledge for Chinese Word Sense Disambiguation. EMNLP (Findings) 2021: 918-923 - [c4]Hua Zheng, Damai Dai, Lei Li, Tianyu Liu, Zhifang Sui, Baobao Chang, Yang Liu:
Decompose, Fuse and Generate: A Formation-Informed Method for Chinese Definition Generation. NAACL-HLT 2021: 5524-5531 - [c3]Damai Dai, Hua Zheng, Fuli Luo, Pengcheng Yang, Tianyu Liu, Zhifang Sui, Baobao Chang:
Inductively Representing Out-of-Knowledge-Graph Entities by Optimal Estimation Under Translational Assumptions. RepL4NLP@ACL-IJCNLP 2021: 83-89 - [i10]Damai Dai, Hua Zheng, Zhifang Sui, Baobao Chang:
Incorporating Connections Beyond Knowledge Embeddings: A Plug-and-Play Module to Enhance Commonsense Reasoning in Machine Reading Comprehension. CoRR abs/2103.14443 (2021) - [i9]Damai Dai, Li Dong, Yaru Hao, Zhifang Sui, Furu Wei:
Knowledge Neurons in Pretrained Transformers. CoRR abs/2104.08696 (2021) - [i8]Peiyi Wang, Lianzhe Huang, Tianyu Liu, Damai Dai, Runxin Xu, Houfeng Wang, Baobao Chang, Zhifang Sui:
Explicit Interaction Network for Aspect Sentiment Triplet Extraction. CoRR abs/2106.11148 (2021) - [i7]Peiyi Wang, Runxin Xu, Tianyu Liu, Damai Dai, Baobao Chang, Zhifang Sui:
Behind the Scenes: An Exploration of Trigger Biases Problem in Few-Shot Event Classification. CoRR abs/2108.12844 (2021) - 2020
- [i6]Damai Dai, Hua Zheng, Fuli Luo, Pengcheng Yang, Baobao Chang, Zhifang Sui:
Inductively Representing Out-of-Knowledge-Graph Entities by Optimal Estimation Under Translational Assumptions. CoRR abs/2009.12765 (2020) - [i5]Damai Dai, Jing Ren, Shuang Zeng, Baobao Chang, Zhifang Sui:
Coarse-to-Fine Entity Representations for Document-level Relation Extraction. CoRR abs/2012.02507 (2020)
2010 – 2019
- 2019
- [c2]Shuming Ma, Lei Cui, Damai Dai, Furu Wei, Xu Sun:
LiveBot: Generating Live Video Comments Based on Visual and Textual Contexts. AAAI 2019: 6810-6817 - [c1]Fuli Luo, Damai Dai, Pengcheng Yang, Tianyu Liu, Baobao Chang, Zhifang Sui, Xu Sun:
Learning to Control the Fine-grained Sentiment for Story Ending Generation. ACL (1) 2019: 6020-6026 - 2018
- [i4]Damai Dai:
Live Video Comment Generation Based on Surrounding Frames and Live Comments. CoRR abs/1808.04091 (2018) - [i3]Wei Li, Xuancheng Ren, Damai Dai, Yunfang Wu, Houfeng Wang, Xu Sun:
Sememe Prediction: Learning Semantic Knowledge from Unstructured Textual Wiki Descriptions. CoRR abs/1808.05437 (2018) - [i2]Shuming Ma, Lei Cui, Damai Dai, Furu Wei, Xu Sun:
LiveBot: Generating Live Video Comments Based on Visual and Textual Contexts. CoRR abs/1809.04938 (2018) - 2017
- [i1]Lun Wang, Damai Dai, Jie Jiang, Tong Yang, Xiaoke Jiang, Zekun Cai, Yang Li, Xiaoming Li:
FISF: Better User Experience using Smaller Bandwidth for Panoramic Virtual Reality Video. CoRR abs/1704.06444 (2017)
Coauthor Index
manage site settings
To protect your privacy, all features that rely on external API calls from your browser are turned off by default. You need to opt-in for them to become active. All settings here will be stored as cookies with your web browser. For more information see our F.A.Q.
Unpaywalled article links
Add open access links from to the list of external document links (if available).
Privacy notice: By enabling the option above, your browser will contact the API of unpaywall.org to load hyperlinks to open access articles. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the Unpaywall privacy policy.
Archived links via Wayback Machine
For web page which are no longer available, try to retrieve content from the of the Internet Archive (if available).
Privacy notice: By enabling the option above, your browser will contact the API of archive.org to check for archived content of web pages that are no longer available. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the Internet Archive privacy policy.
Reference lists
Add a list of references from , , and to record detail pages.
load references from crossref.org and opencitations.net
Privacy notice: By enabling the option above, your browser will contact the APIs of crossref.org, opencitations.net, and semanticscholar.org to load article reference information. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the Crossref privacy policy and the OpenCitations privacy policy, as well as the AI2 Privacy Policy covering Semantic Scholar.
Citation data
Add a list of citing articles from and to record detail pages.
load citations from opencitations.net
Privacy notice: By enabling the option above, your browser will contact the API of opencitations.net and semanticscholar.org to load citation information. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the OpenCitations privacy policy as well as the AI2 Privacy Policy covering Semantic Scholar.
OpenAlex data
Load additional information about publications from .
Privacy notice: By enabling the option above, your browser will contact the API of openalex.org to load additional information. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the information given by OpenAlex.
last updated on 2024-11-15 19:34 CET by the dblp team
all metadata released as open data under CC0 1.0 license
see also: Terms of Use | Privacy Policy | Imprint