[2301.08810] Phoneme-Level BERT for Enhanced Prosody of Text-to-Speech with Grapheme Predictions