|
Prev: How to wrap text in <p> tag if the text has no spaces and is very long
Next: IE compatible way of forcing footer to bottom of page?
From: Ben C on 9 Apr 2008 06:33 On 2008-04-03, Jukka K. Korpela <jkorpela(a)cs.tut.fi> wrote: > Scripsit Ben C: [...] >> In fact Unicode specifications do define a lot of stuff about line >> breaking > > Quite a lot indeed, with many oddities. > >> and most browsers either implement that or a simplification >> of it although as we know they aren't required to. > > I don't think any browser even tries to get close to implementing > Unicode line breaking rules or even a simplification of them. If you're writing a browser that has to display text in a variety of different scripts used for languages you aren't familiar with yourself, it's much easier to load in the Unicode breaking class tables and use the Unicode specification than it is to learn all those scripts and their different conventions.
From: Jukka K. Korpela on 9 Apr 2008 15:11
Scripsit Ben C: >> I don't think any browser even tries to get close to implementing >> Unicode line breaking rules or even a simplification of them. > > If you're writing a browser that has to display text in a variety of > different scripts used for languages you aren't familiar with > yourself, it's much easier to load in the Unicode breaking class > tables and use the Unicode specification than it is to learn all > those scripts and their different conventions. Quite right, so it's somewhat surprising that browsers don't do that. One reason to this might be that Unicode line breaking rules haven't been stable. Moreover, they are fairly complex. (And applying them indiscriminately leads to poor results, like breaking after "-" even in a context like "the temperature is -8 degrees".) -- Jukka K. Korpela ("Yucca") http://www.cs.tut.fi/~jkorpela/ |