Platform’s default charset on different platforms?

Question

Some legacy code relies on the platform's default charset for translations. For Windows and Linux installations in the "western world" I know what that means. But thinking about Russian or Asian platforms I am totally unsure what their platform's default charset is (just UTF-16?). Ther…

Accepted Answer

That's a user-specific setting. On many modern Linux systems, it's UTF-8. On Macs, it has traditionally been MacRoman. On Windows in the US and Western Europe, it's usually CP1252; in Central Europe it's CP1250, and Russian installations typically use CP1251. In Chinese-speaking regions, you often find a GB* encoding (Simplified Chinese) or Big5 (Traditional Chinese).

But that's only the system default, which each user can change at any time. Which is probably the solution: set the encoding when you start your app using the system property file.encoding.

See this answer for how to do that. I suggest putting this into a small script that starts your app, so the user's default isn't tainted.
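To see what a given machine actually uses, and why relying on it is fragile, something like the following may help. It is only a minimal sketch: the class name `CharsetDemo` and the helper `decode` are made up for illustration and are not part of the legacy code in question. It prints the charset the JVM picked up and decodes the same bytes with two explicitly named charsets.

```java
import java.nio.charset.Charset;
import java.nio.charset.StandardCharsets;

// Hypothetical demo class, not taken from the original code base.
class CharsetDemo {

    // Decode bytes with an explicitly named charset instead of the platform default.
    static String decode(byte[] bytes, Charset charset) {
        return new String(bytes, charset);
    }

    public static void main(String[] args) {
        // What the JVM picked up from the platform (or from -Dfile.encoding=...).
        System.out.println("default charset: " + Charset.defaultCharset());
        System.out.println("file.encoding:   " + System.getProperty("file.encoding"));

        // "é" (U+00E9) encoded as UTF-8 is the two bytes 0xC3 0xA9.
        byte[] utf8 = "\u00e9".getBytes(StandardCharsets.UTF_8);

        // Decoded with the charset it was written in: one character.
        String right = decode(utf8, StandardCharsets.UTF_8);
        // Decoded with a different single-byte charset: two wrong characters.
        String wrong = decode(utf8, StandardCharsets.ISO_8859_1);

        System.out.println("as UTF-8:      " + right.length() + " char(s)");
        System.out.println("as ISO-8859-1: " + wrong.length() + " char(s)");
    }
}
```

Started as `java -Dfile.encoding=UTF-8 CharsetDemo` from a small launcher script, the first two lines should report UTF-8 regardless of the user's own settings. Note that on JDK 18 and later the default charset is already UTF-8 unless it is overridden. Where the legacy code can be touched, passing a `Charset` explicitly, as `decode` does, avoids the dependency on the platform default altogether.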

Answer