Hey mastodon, is there anything analogous to regex for CJK character sets? How do people, say, filter a Korean text based on the initial sound in each Hangul character? Or find all characters in a Chinese text that contain a certain radic…
- clacke repeated this.
@cdxiao I feel like this entire field is still rapidly evolving, but HanziJS (http://www.hanzijs.com/) may give you some options. Near as I can figure, the CJK Decomposition Data (http://cjkdecomp.codeplex.com/) can help br…
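For the Hangul half of the question, a rough sketch that needs no library: precomposed Hangul syllables sit in one Unicode block (U+AC00 to U+D7A3) laid out arithmetically, so the initial consonant falls out of integer division. The function names `initial_of` and `filter_by_initial` are made up for illustration and are not part of HanziJS or the CJK Decomposition Data. This does not cover the radical lookup for Chinese, which needs an external decomposition table.

```python
# The 19 initial consonants (choseong), in Unicode ordering,
# written here as compatibility jamo for readability.
CHOSEONG = "ㄱㄲㄴㄷㄸㄹㅁㅂㅃㅅㅆㅇㅈㅉㅊㅋㅌㅍㅎ"

HANGUL_BASE = 0xAC00       # first precomposed syllable, "가"
SYLLABLE_COUNT = 11172     # 19 initials * 21 vowels * 28 finals
PER_INITIAL = 21 * 28      # 588 syllables share each initial


def initial_of(ch):
    """Return the initial consonant of a precomposed Hangul syllable, else None."""
    offset = ord(ch) - HANGUL_BASE
    if 0 <= offset < SYLLABLE_COUNT:
        return CHOSEONG[offset // PER_INITIAL]
    return None


def filter_by_initial(text, initial):
    """Return the characters in text whose initial consonant equals `initial`."""
    return [ch for ch in text if initial_of(ch) == initial]
```

Something like `filter_by_initial("한글 가나 고기", "ㄱ")` then picks out the syllables that start with ㄱ and skips everything else, including spaces and non-Hangul characters. Text stored as decomposed jamo would need normalizing to NFC first.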