Comment 18 for bug 16953

Daniel Hartwig (wigs) wrote :

Removing accents from strings before or while comparing them

There is libunac [1] which provides a function to remove accents from a string.

There is an algorithm discussed here [2] which could be implemented with libunistring:

Now, the best algorithm is hinted at in the accepted answer: use NFKD (decomposition) to decompose accented letters into the base letter and a separate accent, and then remove all accents.

[accent characters are in the non-spacing mark category "Mn"]
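
A minimal sketch of that decompose-then-strip approach, for illustration only. It uses Python's standard unicodedata module as a stand-in for libunistring, and the function name strip_accents is made up for this example; it is not taken from libunac or libunistring.

```python
import unicodedata


def strip_accents(text):
    """Return text with combining accents removed (hypothetical helper)."""
    # Step 1: NFKD splits precomposed letters into base letter + combining marks.
    decomposed = unicodedata.normalize("NFKD", text)
    # Step 2: drop every code point in the non-spacing mark category "Mn".
    return "".join(ch for ch in decomposed if unicodedata.category(ch) != "Mn")


def accent_insensitive_equal(a, b):
    """Compare two strings after stripping accents from both (hypothetical helper)."""
    return strip_accents(a) == strip_accents(b)
```

Note that letters without a canonical or compatibility decomposition, such as "ø" (U+00F8), pass through unchanged, and that NFKD also folds compatibility characters such as ligatures. If only accents should be affected, NFD may be the safer choice.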

[1] http://www.nongnu.org/unac/unac-man3.en.html
[2] http://stackoverflow.com/questions/144761/how-to-remove-accents-and-tilde-in-a-c-stdstring