In C-language statements
The C compiler must recognize the code set that you use in your C-language statements.
The capabilities of your C compiler might limit your ability to use non-ASCII characters within the C-language statements in a UDR source file. For example, some C-language compilers support multibyte characters in literals or comments only.
- Multibyte characters might contain C-language tokens.
A component of a multibyte character might be indistinguishable from certain single-byte characters such as percent ( % ), comma, backslash ( \ ), and double quotation mark ( " ). If such characters exist in a quoted string, the C compiler might interpret them as C-language tokens, which can result in compilation errors or even lost characters.
- The C compiler might not be 8-bit clean.
If a code set contains non-ASCII characters (with code values that are greater than 127), the C compiler must be 8-bit clean to interpret the characters. To be 8-bit clean, a compiler must read the eighth bit as part of the code value; it must not ignore or put its own interpretation on the meaning of this eighth bit.