Unicode Format Conversion: With the Text Encoder, it is possible to convert a variety of text files from one Unicode format to another one or to or from ANSI. Supported formats are ASCII, ANSI, UTF-7, UTF-8, UTF-16 Big Endian, UTF-16 Little Endian, UTF-32 Big Endian, UTF-32 Little Endian, Latin-1 (ISO 8859-1), Latin-2 (ISO 8859-2), Win-1250 (Central European), Win-1251 (Cyrillic), Win-1252 (Western European) and Win-1253 (Greek). Each of these formats can be converted into any other format.
Change of the Line Break Type: Also the type of the used line break of text files can be changed with the TextEncoder. The TextEncoder supports the line break types CR LF (Windows, DOS, OS/2, CP/M, TOS), LF (Unix, Linux, MacOS, Mac OS X, AmigaOS), CR (Classic Mac OS, Apple II, Commodore) and NL (AIX OS, IBM Mainframe Systems, OS/390) as well as the various Unicode characters for line breaks FF, NEL, LS, PS, VT and TAB by presetting. In addition, any user-defined characters or strings can be defined directly as text or in the form of code points to be used as line breaks.
Fixed line length: In addition to line breaks defined by line break characters, the TextEncoder also supports line breaks that are defined by a fixed number of characters per line. Files with this type of line limitation without a specific sign for a line break can be converted by the TextEncoder to any other character-based formats. Also the other direction and thus the removal of line break characters from text files is possible with the TextEncoder.
Batch Processing: It can be worked on any number of text files at the same time. The files can be easily added to the program, by dragging them from any folder of your computer into the file list. After adding the files and choosing an encoding and/or line break type, you have just to click on "Convert" to apply all changes to all of the files.
Byte Order Mark (BOM): The program can be used to delete the Byte Order Mark (BOM) from files (or to add Byte Order Marks to files). Also when converting files from one format into another format, you can write the files with or without the Byte Order Mark as you want.
Line break at multiple characters: Normally within a file only one character or one string is used as a line break. The TextEncoder, however, also supports the use of a list of any number of characters or strings, at all of them the line is broken. This list can be defined either as text or in the form of code points. This function is useful for example if multiple line bracket types have been mixed in a file. In this case, the TextEncoder can repair such a file by converting the mixed line breaks into any uniform line break type.
Full Unicode Support: All functions listed on these pages can also be performed with Unicode characters. It does not matter whether you want to work with Unicode file names, or whether your files are containing Chinese, Cyrillic, Greek, Hebrew or other special characters.
Verification on unsupported characters: Before saving a file in a specific encoding, the TextEncoder automatically checks whether all characters contained in the file are also representable in the selected coding. If not, a corresponding warning message is displayed with the affected characters and you can decide if you want to continue or not. The warning message is not displayed if the TextEncoder was called via the command line or a script.
Script Control: In the version Batch Text Encoder you can execute all functions of the Text Encoder by using a batch script. So, you can completely automate the converting process. If you start the batch version without parameters, it can be used in the same way like the normal Text Encoder with graphical user interface.
No installation (Portable Software): This software works without installation. This spares your Registry and you can immediately use the program.