Remix.run Logo
dirteater_ 2 hours ago

What are you doing for Chinese word segmentation/pinyin?

alder an hour ago | parent [-]

For segmentation and POS I rely on spaCy zh_core_web_sm, pinyin from pypinyin library. Also the small correction level on top. But I am not a Chinese language expert to judge if it really works and I'll rely on feedback from the users to improve it.