Mutual Information Maximization https://github.com/yanzhangnlp/IS-BERT