German umlauts not displayed correctly in gutsy

Bug #132924 reported by Michael Deelwater
4
Affects Status Importance Assigned to Milestone
ttf-uralic (Ubuntu)
In Progress
Undecided
Arne Goetje

Bug Description

After an upgrade from feisty to gutsy, the german umlauts (ä, ö, ü, ß) are displayed by russian characters at some websites. All browsers show this behaviour. Encoding is UTF-8 and German keyboard layout is selected. The Russian characters are even shown right now in this box i am writing in. I could not figure out which fonts are used by the browser. I could post a screenshot if i knew how to do that here (most prominent examle is the font used in wikipedia main page).

Michael

Revision history for this message
Michael Deelwater (launchpad-falsum) wrote :
Revision history for this message
Arne Goetje (arnegoetje) wrote :

Can you please post the following information of your installation:
 1. list of your font packages installed (dpkg -l | grep ttf-)

 2. ls /usr/share/fonts/truetype

 3. ls ~/.fonts/

I suppose there is a font installed in your system which has a wrong encoding. Although currently I can only speculate.
To be able to find the culprit, I'll need the above mentioned information.

Thanks
Arne

Revision history for this message
Michael Deelwater (launchpad-falsum) wrote :
Download full text (7.3 KiB)

After removing ttf-uralic all characters were displayed correctly. Reinstalling ttf-uralic did not change this, so everything works fine now.

output of dpkg -l | grep ttf-

ii libsdl-ttf2.0-0 2.0.9-1 ttf library for Simple DirectMedia Layer wit
ii libttf2 1.4pre.20050518-0.5build1 FreeType 1, The FREE TrueType Font Engine, s
ii ttf-arabeyes 1.1-9 Arabeyes GPL TrueType Arabic fonts
ii ttf-arhangai 20030401-2 A TrueType font with Mongolian Cyrillic lett
ii ttf-arphic-ukai 0.1.20060928-2.2 "AR PL ZenKai Uni" Chinese Unicode TrueType
ii ttf-arphic-uming 0.1.20060928-2.1ubuntu1 "AR PL ShanHeiSun Uni" Chinese Unicode TrueT
ii ttf-baekmuk 2.2-1ubuntu3 Baekmuk series TrueType fonts
ii ttf-bengali-fonts 1:0.4.9 Free TrueType fonts for the Bengali language
ii ttf-bitstream-vera 1.10-7 The Bitstream Vera family of free TrueType f
ii ttf-bpg-georgian-fonts 0.5 BPG Georgian fonts
ii ttf-dejavu 2.17-2 Vera font family derivate with additional ch
ii ttf-devanagari-fonts 1:0.4.9 Free TrueType fonts for languages using the
ii ttf-dustin 20030517-6 Various TrueType fonts from dustismo.com
ii ttf-dzongkha 0.3-1 TrueType fonts for Dzongkha language
ii ttf-farsiweb 0.4.dfsg-6 FarsiWeb free TrueType Farsi fonts
ii ttf-freefont 20060501cvs-12 Freefont Serif, Sans and Mono Truetype fonts
ii ttf-gentium 1.02-2ubuntu2 Gentium TrueType font
ii ttf-georgewilliams 1.0-1 Free unicode TrueType fonts by George Willia
ii ttf-gujarati-fonts 1:0.4.9 Free TrueType fonts for the Gujarati languag
ii ttf-indic-fonts 1:0.4.9 Metapackage for free Indian language fonts
ii ttf-isabella 1.01-6ubuntu1 The Isabella free TrueType font
ii ttf-junicode 0.6.13-1 a Unicode font for medievalists (Latin, IPA
ii ttf-kacst 1.6.2+mry-1 KACST free TrueType Arabic fonts
ii ttf-kannada-fonts 1:0.4.9 Free TrueType fonts for the Kannada language
ii ttf-khmeros 4.0-2...

Read more...

Revision history for this message
Arne Goetje (arnegoetje) wrote :

That's what I expected to find. :)
ttf-uralic is not in Unicode format. Might have confused fontconfig.

Can we close this bug?

Cheers
Arne

Revision history for this message
Arne Goetje (arnegoetje) wrote :

From the README file:
-----------------------------------------------------
Additional letters

The encoding of the fonts is based on the model of the Cyrillic Asian
encoding. The Russian alphabet occupies the place of
the Latin-1 supplement in the Western (Windows CP 1252) encoding and its
own Unicode positions in the Cyrillic (CP 1251) encoding. Additional
Uralic letters can be found in three places - instead of additional
characters in the Western encoding, instead of additional characters in
the Cyrillic encoding and in their own Unicode positions (with the
exceptions of those letters that are not found in this standard).
Existing Mari and Udmurt fonts were taken into consideration while
distributing positions, but incorporating their encodings did not prove
possible. Eventually, Udmurt fonts were used as the starting point. See
the test page for details.

----------------------------------------------------------------------------

These fonts are for Uralic languages only! So, don't use these fonts for any other purpose.
I will nevertheless contact upstream and propose a fix. IMHO the fonts have been wrongfully encoded and are therefor broken. Even when used in a Unicode environment, additional Cyrillic characaters should only show up in the U+00A0 - U+00BF range. At least, that's the mapping of CP1251 to Unicode.

Changed in ttf-uralic:
status: Confirmed → In Progress
To post a comment you must log in.
This report contains Public information  
Everyone can see this information.

Other bug subscribers

Remote bug watches

Bug watches keep track of this bug in other bug trackers.