From: Matthias Urlichs (urlichs@smurf.sub.org)
Date: 02/10/93


From: urlichs@smurf.sub.org (Matthias Urlichs)
Subject: Re: 8 bit clean implies what?
Date: 11 Feb 1993 01:49:24 +0100

In comp.os.linux, article <16849@auspex-gw.auspex.com>,
  guy@Auspex.COM (Guy Harris) writes:
>
> I.e., the upper-case version of the German "sz" character has a loweer
> code than the lower-case version?
>
OUCH!

> Warning: that is a trick question, at least as I understand the German
> case conventions. It may be unwise to assume that translating a string
> from lower-case to upper-case can be done simply by replacing each
> lower-case letter in the string with a character that's the upper-case
> version of that letter.

To be more exact, the uppercase version of the "sz" character (which some
people misread for a Greek beta) is the two characters "SS", except when
there already exists a word with "SS" at that position; then it becomes
"SZ". Neat, huh? ;-)

German has some other ugly features, eg. when you're hyphenating them,
some words suddenly change spelling. Fun if you're trying to do soft
hyphens correctly.

-- 
The Dragon Taxi Service - we will take thee... anywhere
-- 
Matthias Urlichs  --  urlichs@smurf.sub.org -- urlichs@smurf.ira.uka.de   /(o\
Humboldtstrasse 7 -- 7500 Karlsruhe 1 -- Germany  --  +49-721-9612521     \o)/