Commit Graph

10 Commits

Author | SHA1 | Message | Date
Ivan Skytte Jørgensen | eaa5137582 | Merge branch 'master' into tokenizer | 2018-05-22 15:53:18 +02:00
Ivan Skytte Jørgensen | e7337a6ebd | Log unicode data filename that failed to load | 2018-05-17 15:19:23 +02:00
Ivan Skytte Jørgensen | 9373a3485c | Load unicode_is_alphabetic.dat (copy-pasta error) | 2018-03-20 14:32:01 +01:00
Ivan Skytte Jørgensen | 4fb49d608c | Make unicode_is_ignorable.dat | 2018-02-26 17:44:30 +01:00
Ivan Skytte Jørgensen | 2e5ecde0c3 | Added unicode decomposition map for combining-mark removal | 2018-02-26 15:46:22 +01:00
Ivan Skytte Jørgensen | d4cf5f6455 | unicode: Preliminary commit, work-in-progress | 2018-02-06 17:06:35 +01:00
Ivan Skytte Jørgensen | a84144b0b1 | unicode: made table name more explicit (g_unicode_canonical_decomposition_map) | 2018-02-06 13:54:58 +01:00
Ivan Skytte Jørgensen | 75a1f7d7b6 | unicode: generate and load unicode_is_uppercase.dat and unicode_is_lowercase.dat | 2018-02-05 23:57:36 +01:00
Ivan Skytte Jørgensen | 0f748aaac4 | unicode: generate and load unicode_is_alphabetic.dat | 2018-02-05 23:50:01 +01:00
Ivan Skytte Jørgensen | f3d9b27440 | Preliminary commit on unicode update | 2018-02-02 18:01:59 +01:00