Chinese Polyphones with Pinyin
A benchmark dataset that consists of 99,000+ sentences for Chinese polyphone disambiguation.
Source: g2pM: A Neural Grapheme-to-Phoneme Conversion Package for Mandarin Chinese Based on a New Open Benchmark Dataset