-
-
Notifications
You must be signed in to change notification settings - Fork 23
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
The design_requirements for Dutch [nld] are misleading #118
Comments
Very interesting, thank you @moyogo! On 1) I agree... the use of the jdotless in the example likely stems from a designer centric view where that letter would be the component used to construct the ij with acute. As for 2) this is new to me. I was under the impression that the "technical limitation" should be circumvented when this is possible. So overall this should be an optional recommendation that also mentions the different styles/orthographies? |
Sorry for taking so long to get to this. I have a draft which I will push in a moment for your review. It is longer than what you proposed. Hopefully, it helps clarity. What I am still unsure about is this bit where I say:
We had some Dutch readers telling us they would prefer for <i><j> to get tracked and others would insist on keeping it a single unit. This:
says that it can “sometimes” behave like a single unit, hence my recommendation above. I can see three strategies font developers can take:
Each of these then requires a different solution when adding stress on the lange ij. The latter two strategies would work well for multilngual texts. |
I should have quoted the whole paragraph from Taalunie, Technische Handleiding, 2016 (it’s actually online: https://taalunie.org/feeds/download/technische-handleiding-2016-5dcab.pdf/Technische%20Handleiding/original):
The "sometimes" means it behaves like a single letter in some contexts (beginning of sentences and of proper nouns, or sorted like y in some reference works) and like two letters in others (sorted like i+j in most dictionaries).
Dissolving <ij> and <IJ> defeats their purpose, at least according to the Unicode paragraph quoted before. Fonts may provide the same tracking behaviour for <i><j> and <I><J> as a unit, it may be optional or by default, but either way it should be easy to enable or disable. Drawing with broad brushes, some Dutch speakers feel strongly that lange ij is a single letter with encoding issues and some other Dutch speakers feel more that it’s a letter combination with a special casing rule. Generalizing a bit, there is a Netherlands and Belgium divide on the issue. |
Thank you @moyogo, got it. Updated the design requirements one more time. Please, let me know if something does not sound right. I have read it too many times. |
It looks good to me. Thank you @MrBrezina. |
The design_requirements for Dutch [nld] are misleading:
hyperglot/lib/hyperglot/hyperglot.yaml
Lines 8189 to 8190 in d007599
The design requirements should say that "The <j> should lose its dot when combined with a combining acute for when the acute on j is not omitted on stressed lange ij, usually spelled íj but íj́ when possible. Generally, fonts should not add an acute that is not there in the text."
1. The characters used are confusing
The current design requirements say that <ij> U+0133 LATIN SMALL LIGATURE IJ is combined with U+0301 COMBINING ACUTE and that <ȷ> U+0237 LATIN SMALL LETTER DOTLESS J should have an acute when following <í> U+00ED LATIN SMALL LETTER I WITH ACUTE.
<ij> U+0069 U+006A should be used instead of <ij> is U+0133 LATIN SMALL LIGATURE IJ as the letter combination i+j U+0069 U+006A is generally used for the lange ij.
See for example Taalunie, Technische Handleiding: Regels voor de officiële spelling van het Nederlands, 2016, p. 19:
See also Unicode, chapter 7:
The ȷ U+0237 LATIN SMALL LETTER DOTLESS J is not used in Dutch, j U+006A LATIN SMALL LETTER J is.
2. Substitution of j after í breaks Dutch text
An automated substitution that replaces j by ȷ with acute after í breaks Dutch text.
The Taalunie stress marks spelling rule 5.1 says :
Which can be translated as:
So Dutch text can follow the official spelling rules and omit the acute on the j of stressed lange ij, like in the example provided.
Additionally, this spelling rule was standardized in the 1996 spelling and before that it was common to put the acute only on the first letter of digraphs composed of two different letters.
See for example Jan Renkema, Schrijfwijzer, 1987, p. 159.
Many Dutch speakers still write and many Dutch texts are written with pre-1996 rules.
They use níet instead of níét, góed instead of góéd, zíjn instead of zíj́n, a font should not make either look like they have an additional acute.
There is also the issue of foreign names in Dutch text, like Níjar or Szíj, which would be displayed incorrectly.
The text was updated successfully, but these errors were encountered: