iconvavailable code sets for conversion |
Miscellaneous Information |
iconv
converts characters from any
available single byte code set to any other available single byte code set.
The format of an iconv
code set file is:
The first seven bytes (iconv1\nXcodesetnameYverbose_descriptionTABLE
iconv1
and the newline character) are a
signature; the 1
is used for version information. For the
remainder:
A byte containing the length of the code set name which follows.
The main name of the code set. This is normally the file name, but may be different on systems which allow links to files.
A byte containing the length of the verbose description which follows.
A description of the code set. This sequence may not contain null (\0)
bytes. This is the string printed by the command
iconv
-lv
.
The table of characters is 512 bytes long: 256 entries of two bytes
each. The position in the table corresponds to the position in the
character set; the entries are the ISO 10646 two-octet representations of
the characters. For example, in the ISO 8859-6 character set, the Arabic
character alef is character 181 (0xBB). The ISO 10646 two-octet code for
alef is 061B
. If you were writing a code set file for
ISO8859-6, the 181st entry in the table would be 061B
.