Öffentlich

https://sn.1w6.org/file/cdxiao-20170712T092803-2yckkt6.html

Hey mastodon, is there anything analogous to regex for CJK character sets? How do people, say, filter a Korean text based on the initial sound in each Hangul character? Or find all characters in a Chinese text that contain a certain radical?

Do people just use regular expressions plus some helper libraries that sort out the text encodings, and that contain some language-specific information??

#unicode #regex

Nachrichten, in denen dieser Anhang erscheint

cdxiao
Hey mastodon, is there anything analogous to regex for CJK character sets? How do people, say, filter a Korean text based on the initial sound in each Hangul character? Or find all characters in a Chinese text that contain a certain radic…

Wednesday, 12-Jul-17 05:13:14 UTC

1w6 uRPG ist ein Mikrobloggingdienst von Arne (Drak) Babenhauserheide. Es wird mit der Mikrobloggingsoftware StatusNet (Version 1.1.1-release) betrieben, die unter der GNU Affero General Public License erhältlich ist. The running version includes the patches from draketo.de/proj/statusnet-patches.

Alle Inhalte und Daten von 1w6 uRPG sind unter der Creative Commons Attribution 3.0 Lizenz verfügbar.

Öffentlich

https://sn.1w6.org/file/cdxiao-20170712T092803-2yckkt6.html

Nachrichten, in denen dieser Anhang erscheint