Conversation
|
Stefano,
I now realize what the problem is with the comma. The Tifinagh script has a separator character that I learned was used like a comma, so I mapped it to the Latin comma. Unfortunately, The languages that use Tifinagh script also use regular Latin punctuation marks, so I will need to map the special Tifinagh separator to something unique in the Latin script—perhaps the Latin comma with a double underscore (\u0333).
The other thing you mentioned, the possible problem with other characters, is a a font problem. Combinations of Tifinagh characters with combining characters aren’t displaying correctly in MS Excel. The underlying character values are correct, fortunately. I will fix the mapping of the Tifinagh separator and send you a new “tifinagh_generic.yml” file as well as test data.
Randy
…----------------------------------------
Randall K. Barry ᠷᠠᠨ᠋ᠳ᠋ᠠᠯᠯ ᠺ᠊ ᠪᠠᠷᠷᠶ
Email: ***@***.***> ***@***.***
Mobile: +1-703-244-1232
629 24th St. South
Arlington, VA 22202-2525 U.S.A.
From: Stefano Cossu ***@***.***>
Sent: Saturday, April 4, 2026 20:19
To: lcnetdev/scriptshifter ***@***.***>
Cc: Randall K. Barry ***@***.***>; Mention ***@***.***>
Subject: [lcnetdev/scriptshifter] Add Tifinagh languages and tests. (PR #292)
@RandyBarry <https://github.com/RandyBarry> see attached test results. There seems to be a problem transliterating the comma character, and maybe a couple of other characters that I can't tell right now if it's an incorrect test pair or an incorrect mapping.
tamashek.csv <https://github.com/user-attachments/files/26485324/tamashek.csv>
tamazight_moroccan.csv <https://github.com/user-attachments/files/26485325/tamazight_moroccan.csv>
tifinagh_generic.csv <https://github.com/user-attachments/files/26485326/tifinagh_generic.csv>
_____
You can view, comment on, or merge this pull request online at:
#292
Commit Summary
* 721a356 <721a356> Add Tifinagh languages and tests.
File Changes
(6 <https://github.com/lcnetdev/scriptshifter/pull/292/files> files)
* A scriptshifter/tables/data/tamashek.yml <https://github.com/lcnetdev/scriptshifter/pull/292/files#diff-e6383418c2f60e43b544c9fdf10b2a2875967c63d58688c5e02e681da246a769> (112)
* A scriptshifter/tables/data/tifinagh_generic.yml <https://github.com/lcnetdev/scriptshifter/pull/292/files#diff-72f7ae57cf635845e9a0d8e87cf19f8c1aaf1867134e9204ec19f909e7aa113e> (170)
* M scriptshifter/tables/index.yml <https://github.com/lcnetdev/scriptshifter/pull/292/files#diff-09f5531e9c25fa16448edb4eccda16f6bc0062c6da1ed21b4af3cfccbae1c991> (18)
* A test/data/script_samples/tamashek.csv <https://github.com/lcnetdev/scriptshifter/pull/292/files#diff-3c8487e4f754b4a86aa543bfc207e491831065943a4c64f0a87d9226153e3cec> (1)
* A test/data/script_samples/tamazight_moroccan.csv <https://github.com/lcnetdev/scriptshifter/pull/292/files#diff-446fd01bc3599a87a7f306d934979b6491c02980fdedd2767fddef9bea1ab7ad> (2)
* A test/data/script_samples/tifinagh_generic.csv <https://github.com/lcnetdev/scriptshifter/pull/292/files#diff-b215289bfb1e5752ee5d309d273c18d11bab80168a73016d186293a81ae66212> (5)
Patch Links:
* https://github.com/lcnetdev/scriptshifter/pull/292.patch
* https://github.com/lcnetdev/scriptshifter/pull/292.diff
—
Reply to this email directly, view it on GitHub <#292> , or unsubscribe <https://github.com/notifications/unsubscribe-auth/BA4HLGL4G7M4TYHO6U57NNT4UGRA3AVCNFSM6AAAAACXM5UTFGVHI2DSMVQWIX3LMV43ASLTON2WKOZUGIYDMMBXGU2TEMY> .
You are receiving this because you were mentioned.Message ID: ***@***.***>
|
|
I have updated the mappings and the tests. There are still a few outstanding issues: |
|
What are the outstanding issues in the mappings for Tifinagh script and/or the languages that use it? I don't see the before/after comparisons. |
|
Sorry, I had attached your test files instead of the reports. Here are the correct files: test_tamashek.log As you can see, some of the issues are related to the comma + combining underscore we discussed over email. E.g. "Imdanen, akken" transliterates "ⵉⵎⴷⴰⵏⴻⵏ, ⴰⴽⴽⴻⵏ" instead of "ⵉⵎⴷⴰⵏⴻⵏ⵰ ⴰⴽⴽⴻⵏ" because the lone comma is not mapped -- the comma followed by a combining underscore is. Conversely, "ⵉⵎⴷⴰⵏⴻⵏ⵰ ⴰⴽⴽⴻⵏ" transliterates into " Imdanen,̲ akken" instead of the expected "Imdanen, akken". |
@RandyBarry see attached test results. There seems to be a problem transliterating the comma character, and maybe a couple of other characters that I can't tell right now if it's an incorrect test pair or an incorrect mapping.
tamashek.csv
tamazight_moroccan.csv
tifinagh_generic.csv