The Java programming language uses Unicode throughout. In an effort to integrate well with other locales, gcj allows .java files to be written using almost any encoding. gcj knows how to convert these encodings into its internal encoding at compile time.
You can use the --encoding=
NAME option to specify an
encoding (of a particular character set) to use for source files. If
this is not specified, the default encoding comes from your current
locale. If your host system has insufficient locale support, then
gcj assumes the default encoding to be the ‘UTF-8’ encoding
of Unicode.
To implement --encoding
, gcj simply uses the host
platform's iconv
conversion routine. This means that in practice
gcj is limited by the capabilities of the host platform.
The names allowed for the argument --encoding
vary from platform
to platform (since they are not standardized anywhere). However,
gcj implements the encoding named ‘UTF-8’ internally, so if
you choose to use this for your source files you can be assured that it
will work on every host.