feklee
feklee

Reputation: 7705

Are some string comparisons not character by character?

With collation utf8_general_ci, MySQL considers Munchen and München to be the same, because with that collation u and ü are considered identical.

Are there any collations where character sequences of different length are considered identical, for example where ü is considered identical to ue?

Background concerning the above example: In Germany, ue is used as substitute of ü, when only the latin alphabet is available.

Actually: I hope no such collations exists! They would make my life harder.

Upvotes: 1

Views: 58

Answers (1)

RandomSeed
RandomSeed

Reputation: 29809

Yes, collations like xx_german2_ci do take these equivalences into account (see this fiddle for an example).

Upvotes: 2

Related Questions