Stuart has posted a reminder of this useful trick for confirming that your application properly handles international characters:
Iñtërnâtiônàlizætiøn
In the comments, Dominic points out a further check for Unicode support: Ādam (the previous example only uses Latin-1). It seems my handmade CMS fails these tests miserably.
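To see why the second string is the tougher test, here is a minimal Python sketch that checks whether a string can be encoded as Latin-1 (ISO-8859-1). The helper name `fits_latin1` is made up for illustration; it is not part of any CMS or library mentioned here.

```python
def fits_latin1(text: str) -> bool:
    """Return True if every character in text exists in Latin-1 (ISO-8859-1)."""
    try:
        text.encode("latin-1")
        return True
    except UnicodeEncodeError:
        return False


# The two test strings from the post.
samples = ["Iñtërnâtiônàlizætiøn", "Ādam"]

for sample in samples:
    print(sample, fits_latin1(sample))
```

An application that quietly assumes Latin-1 can pass the first string and still break on the second, which is why it helps to test with both.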
Rich wrote:
Testing the comments system: Iñtërnâtiônàlizætiøn Ādam
Jernej wrote:
How about a real test?
šđčćž
obviously it doesn’t work…
patrick h. lauke wrote:
interestingly, i have been fiddling with my own site, and one of the issues i tweaked was internationalisation. serendipity…
http://www.splintered.co.uk/news/74/
if it’s any help, i found that using htmlentities() (which i relied on heavily before moving to proper utf-8) can mess things up…i simply moved to htmlspecialchars() instead.
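A rough Python sketch of what probably goes wrong here, offered as an illustration rather than a description of PHP internals. It assumes the entity conversion treats the UTF-8 bytes as Latin-1, which I believe was the default behaviour of htmlentities() in older PHP versions when no charset was passed. The function `entities_assuming_latin1` is a hypothetical stand-in, and `html.escape` plays the role of htmlspecialchars().

```python
import html
from html.entities import codepoint2name


def entities_assuming_latin1(utf8_bytes: bytes) -> str:
    """Hypothetical stand-in: misread UTF-8 bytes as Latin-1, then
    replace every character that has a named HTML entity."""
    misread = utf8_bytes.decode("latin-1")  # one character per byte
    parts = []
    for ch in misread:
        name = codepoint2name.get(ord(ch))
        parts.append(f"&{name};" if name else ch)
    return "".join(parts)


text = "ñ <b>"

# Each byte of the two-byte UTF-8 sequence becomes its own entity.
print(entities_assuming_latin1(text.encode("utf-8")))

# Escaping only the markup characters leaves the UTF-8 text alone.
print(html.escape(text))
```

In this sketch the single character ñ turns into two unrelated entities, while escaping only the markup characters passes it through unchanged. That matches the symptom of text being stored correctly but displayed wrongly.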
Rich wrote:
Patrick – serendipity indeed. It’s been htmlentities() that has been causing all my problems too. Everything has been going into the database OK, it’s just displaying it that has caused the problems (more with comments than with my own posts). I’ll try htmlspecialchars() instead.
patrick h. lauke wrote:
seems to work fine now.
Iñtërnâtiônàlizætiøn, Ādam, Uttryck din åsikt om framtiden för HTML, šđčćž
Rich wrote:
Getting there… Here’s a cut and paste test:
Iñtërnâtiônàlizætiøn, Ādam, Uttryck din åsikt om framtiden för HTML, šđčćž
And with a proper quote mark:
if it’s any help