Unicode

Formats

For storing text files, currently the following formats are supported. For each of these encodings some relevant information are available, which you can access through the appropriate links. It should be noted that each of these encodings has different advantages and disadvantages, so it depends on the appropriate use, which is the most ideal one. Remember that UTF is an acronym for Unicode Transformation Format while it is not possible to store Unicode with the ANSI format.

How to change the format of a file, you can read in the section at the bottom of this page.

ANSI and ASCII

The ANSI format stores only the 128 ASCII characters and 128 extended characters. ANSI uses only 1 byte per character, but you can save only 256 different characters, so that not all of the Unicode characters can be supported. [more...]

UTF-7

The seldom-used and variable-length format UTF-7 only uses ASCII characters to store Unicode strings, so that you are able to work with Unicode strings also in 7-bit enviroments, where only ASCII can be transmitted and stored. [more...]

UTF-8

The UTF-8 format is the most common encoding for Unicode characters and is optimized for the display of ASCII characters. For this it needs only one byte per character, for all other characters variable from two to four bytes. [more...]

UTF-16 Little Endian and UTF-16 Big Endian

The UTF-16 format is optimized for the most commonly used characters from the Basic Multilingual Plane and encodes characters in a variable-length from two to four bytes. As Byte Order, Little Endian and Big Endian is allowed. [more...]

UTF-32 Little Endian and UTF-32 Big Endian

This format encodes each character using uniform 4 bytes, the byte order can be elected both as Little Endian as well as Big Endian. Because of the high memory requirements, this format is rarely used. [more...]

Change Format of a File

The format of one or more files, you can easily change with the Text Converter. Such a conversion from one format in another one, must be carried out, for example, if you adapt your ANSI homepage to UTF-8 and you have to change a great number of files in its format. This is the course of action:

  • Open all files to be changed with the Text Converter.
  • Go to the menu "Settngs > Output Settings" and choose the desired format from the list, e.g. UTF-8.
  • Choose whether the files should be overwritten or stored as new files.
  • Click on "Convert" in the Text Converter to convert and save all of the files.