Garbled Characters @ planetapache

Do you know why we encounter mojibake (garbled characters) on planetapache.org?

When I set the encoding in my browser to “Unicode (UTF-8)”, the page is fine and nothing is garbled.
But I (and you) have to set the browser encoding to “Unicode (UTF-8)” by hand *WHENEVER* we access planetapache.org.

> C:\cygwin\bin>wget -S http://www.planetapache.org/
> --16:36:45--  http://www.planetapache.org/
>            => `index.html'
> Resolving www.planetapache.org... done.
> Connecting to www.planetapache.org[195.54.228.42]:80... connected.
> HTTP request sent, awaiting response...
>  1 HTTP/1.1 200 OK
>  2 Date: Thu, 25 Mar 2004 07:37:01 GMT
>  3 Server: Apache/2.0.48 (Unix) DAV/2 mod_python/3.1.3 Python/2.3.2 SVN/1.0.0
>  4 Last-Modified: Thu, 25 Mar 2004 07:06:22 GMT
>  5 ETag: "2256c9-14b98-48d89780"
>  6 Accept-Ranges: bytes
>  7 Content-Length: 84888
>  8 Vary: Accept-Encoding,User-Agent
>  9 Keep-Alive: timeout=15, max=100
> 10 Connection: Keep-Alive
> 11 Content-Type: text/html; charset=ISO-8859-1
                              ^^^^^^^^^^^^^^^^^^

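This is the root of the problem. The HTTP header declares ISO-8859-1 while the page content is UTF-8, and a charset given in the HTTP header takes precedence over the one in the page's meta tag. Perhaps this line (#11) could be simplified to “Content-Type: text/html” or changed to “Content-Type: text/html; charset=utf-8”.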
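To make the effect concrete, here is a minimal sketch of what happens to the bytes. It assumes the page is served as UTF-8 and uses Python's `iso-8859-1` codec as a stand-in for the browser's decoder; the sample string is my own, not taken from planetapache.org.

```python
# Sample text, encoded the way the page is assumed to be stored: UTF-8.
original = "日本語"
wire_bytes = original.encode("utf-8")

# What a decoder produces when it trusts "charset=ISO-8859-1":
# every single byte becomes one (wrong) character.
garbled = wire_bytes.decode("iso-8859-1")

# What it produces when told, or manually switched, to UTF-8.
correct = wire_bytes.decode("utf-8")

print(repr(garbled))
print(correct)
```

Nothing is lost on the wire. The bytes are identical in both cases, and only the label attached to them differs, which is why switching the browser encoding by hand repairs the page.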

I now run planetplanet-powered sites in Japanese, using UTF-8 (e.g. http://www.terra-intl.com/planet/orkut/ ). My server does not specify a charset in the HTTP Content-Type header, so they appear to be working well for now.
This means that each browser can use the charset specified in the meta tag included in each HTML page:
<meta http-equiv="Content-Type" content="text/html; charset=*****" />
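The order in which a browser looks for the charset can be sketched roughly as follows. This is a simplification, not the real browser algorithm: the function names are hypothetical, the regular expressions only handle the common cases, and the `iso-8859-1` fallback is an assumption (actual browsers choose their default differently).

```python
import re


def charset_from_header(content_type):
    """Return the charset parameter of a Content-Type value, or None."""
    match = re.search(r"""charset\s*=\s*["']?([\w-]+)""", content_type, re.IGNORECASE)
    return match.group(1).lower() if match else None


def charset_from_meta(html_bytes):
    """Return the charset named in a <meta> tag near the top of the page, or None."""
    # The tag itself is ASCII, so a lossy ASCII decode is enough to find it.
    head = html_bytes[:1024].decode("ascii", errors="replace")
    match = re.search(r"""<meta[^>]+charset\s*=\s*["']?([\w-]+)""", head, re.IGNORECASE)
    return match.group(1).lower() if match else None


def effective_charset(content_type, html_bytes, default="iso-8859-1"):
    """HTTP header first, then the meta tag, then an assumed default."""
    return (
        charset_from_header(content_type)
        or charset_from_meta(html_bytes)
        or default
    )


page = b'<html><head><meta http-equiv="Content-Type" content="text/html; charset=utf-8" /></head></html>'

print(effective_charset("text/html; charset=ISO-8859-1", page))
print(effective_charset("text/html", page))
```

With the header from planetapache.org the meta tag is never consulted. With a bare `text/html`, as on my server, the meta tag decides.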

If my memory serves me correctly, I saw the same thing in eyebrowse @ nagoya.apache.org in the past. It is fixed now.

cf.

> C:\cygwin\bin>wget -S http://nagoya.apache.org/eyebrowse/
> --17:28:28--  http://nagoya.apache.org/eyebrowse/
>            => `index.html.3'
> Resolving nagoya.apache.org... done.
> Connecting to nagoya.apache.org[192.18.33.10]:80... connected.
> HTTP request sent, awaiting response...
>  1 HTTP/1.1 200 OK
>  2 Date: Thu, 25 Mar 2004 08:29:21 GMT
>  3 Server: Apache Coyote/1.0
>  4 Content-Type: text/html; charset=UTF-8
>  5 Content-Length: 28328
>  6 Connection: close

Look at line #4. — Goooooooood. Perfect!
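The difference between the two responses could be checked automatically with something like the sketch below. It is only a heuristic under my own assumptions: the function name is hypothetical, and a body that decodes cleanly as UTF-8 and contains non-ASCII bytes is merely very likely, not certain, to be UTF-8.

```python
def looks_mislabeled(declared_charset, body):
    """True if body is valid non-ASCII UTF-8 but declared as another charset."""
    if declared_charset.lower() in ("utf-8", "utf8"):
        return False
    try:
        body.decode("utf-8")
    except UnicodeDecodeError:
        # Not UTF-8 at all, so the declared charset may well be right.
        return False
    # Pure ASCII decodes identically under both charsets: nothing to report.
    return any(byte >= 0x80 for byte in body)


sample = "日本語".encode("utf-8")
print(looks_mislabeled("ISO-8859-1", sample))  # the planetapache.org case
print(looks_mislabeled("UTF-8", sample))       # the eyebrowse case
```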

Oh, I forgot to mention: perhaps even *normal* Japanese characters might be perceived as “garbled” by those who do not know (or appreciate) Kanji/Hiragana/Katakana. Whether something is garbled or not is a very RELATIVE and PHILOSOPHICAL issue, indeed 🙂
