|
blocxx
|
Functions | |
| size_t | charCount (const char *utf8str) |
| Count the number of UTF-8 chars in the string. More... | |
| UInt16 | UTF8toUCS2 (const char *utf8char) |
| Convert one UTF-8 char (possibly multiple bytes) into a UCS2 16-bit char. More... | |
| String | UCS2toUTF8 (UInt16 ucs2char) |
| Convert one UCS2 16-bit char into a UTF-8 char (possibly multiple bytes) More... | |
| UInt32 | UTF8toUCS4 (const char *utf8char) |
| Convert one UTF-8 char (possibly multiple bytes) into a UCS4 32-bit char. More... | |
| String | UCS4toUTF8 (UInt32 ucs4char) |
| Convert one UCS4 32-bit char into a UTF-8 char (possibly multiple bytes) More... | |
| void | UCS4toUTF8 (UInt32 ucs4char, StringBuffer &sb) |
| Convert one UCS4 32-bit char into a UTF-8 char (possibly multiple bytes) This version is faster to use in a loop than the version which returns a String. More... | |
| Array< UInt16 > | StringToUCS2ReplaceInvalid (const String &input) |
| Convert a UTF-8 (or ASCII) string into a UCS2 string. More... | |
| Array< UInt16 > | StringToUCS2 (const String &input) |
| Convert a UTF-8 (or ASCII) string into a UCS2 string. More... | |
| String | UCS2ToString (const void *input, size_t inputLength) |
| Convert a UCS2 string into a UTF-8 (or ASCII) string. More... | |
| String | UCS2ToString (const Array< UInt16 > &input) |
| Convert a UCS2 string into a UTF-8 (or ASCII) string. More... | |
| String | UCS2ToString (const Array< char > &input) |
| Convert a UCS2 string into a UTF-8 (or ASCII) string. More... | |
| bool | toUpperCaseInPlace (char *input) |
| Convert the UTF-8 string to upper case. More... | |
| String | toUpperCase (const char *input) |
| Convert the UTF-8 string to upper case and return the result. More... | |
| bool | toLowerCaseInPlace (char *input) |
| Convert the UTF-8 string to lower case. More... | |
| String | toLowerCase (const char *input) |
| Convert the UTF-8 string to lower case and return the result. More... | |
| BLOCXX_COMMON_API int | compareToIgnoreCase (const char *str1, const char *str2) |
| Compares 2 UTF-8 strings, ignoring any case differences as defined by the Unicode spec CaseFolding.txt file. More... | |
| BLOCXX_COMMON_API size_t BLOCXX_NAMESPACE::UTF8Utils::charCount | ( | const char * | utf8str | ) |
Count the number of UTF-8 chars in the string.
This may be different than the number of bytes (as would be returned by strlen()). If utf8str is not a valid UTF-8 string, then the result is undefined.
| utf8str | string in UTF-8 encoding. |
Definition at line 134 of file UTF8Utils.cpp.
Referenced by BLOCXX_NAMESPACE::String::UTF8Length().
| int BLOCXX_NAMESPACE::UTF8Utils::compareToIgnoreCase | ( | const char * | str1, |
| const char * | str2 | ||
| ) |
Compares 2 UTF-8 strings, ignoring any case differences as defined by the Unicode spec CaseFolding.txt file.
| str1 | first string |
| str2 | second string |
Definition at line 105 of file UTF8UtilscompareToIgnoreCase.cpp.
Referenced by BLOCXX_NAMESPACE::String::compareToIgnoreCase(), and BLOCXX_NAMESPACE::String::endsWith().
Convert a UTF-8 (or ASCII) string into a UCS2 string.
| input | The UTF-8 string |
| InvalidUTF8Exception | if input contains invalid UTF-8 characters. |
Definition at line 403 of file UTF8Utils.cpp.
References BLOCXX_NAMESPACE::Array< T >::empty().
| BLOCXX_COMMON_API Array< UInt16 > BLOCXX_NAMESPACE::UTF8Utils::StringToUCS2ReplaceInvalid | ( | const String & | input | ) |
Convert a UTF-8 (or ASCII) string into a UCS2 string.
Invalid characters will be changed to U+FFFD (the Unicode Replacement character)
| input | The UTF-8 string |
Definition at line 397 of file UTF8Utils.cpp.
| BLOCXX_COMMON_API String BLOCXX_NAMESPACE::UTF8Utils::toLowerCase | ( | const char * | input | ) |
Convert the UTF-8 string to lower case and return the result.
Definition at line 2123 of file UTF8Utils.cpp.
Referenced by BLOCXX_NAMESPACE::String::toLowerCase().
| BLOCXX_COMMON_API bool BLOCXX_NAMESPACE::UTF8Utils::toLowerCaseInPlace | ( | char * | input | ) |
Convert the UTF-8 string to lower case.
The string is modified in place. If a character is encountered whose replacement occupies a greater number of bytes than the original, processing will cease and false will be returned. The current implementation does not handle any of the special cases as defined in the Unicode SpecialCasing.txt file, and thus characters will not grow, so currently false will never be returned.
Definition at line 2117 of file UTF8Utils.cpp.
Referenced by BLOCXX_NAMESPACE::String::toLowerCase().
| BLOCXX_COMMON_API String BLOCXX_NAMESPACE::UTF8Utils::toUpperCase | ( | const char * | input | ) |
Convert the UTF-8 string to upper case and return the result.
Definition at line 2111 of file UTF8Utils.cpp.
Referenced by BLOCXX_NAMESPACE::String::toUpperCase().
| BLOCXX_COMMON_API bool BLOCXX_NAMESPACE::UTF8Utils::toUpperCaseInPlace | ( | char * | input | ) |
Convert the UTF-8 string to upper case.
The string is modified in place. If a character is encountered whose replacement occupies a greater number of bytes than the original, processing will cease and false will be returned. The current implementation does not handle any of the special cases as defined in the Unicode SpecialCasing.txt file, and thus characters will not grow, so currently false will never be returned.
Definition at line 2105 of file UTF8Utils.cpp.
Referenced by BLOCXX_NAMESPACE::String::toUpperCase().
Convert a UCS2 string into a UTF-8 (or ASCII) string.
| input | An Array of UCS2 characters |
Definition at line 433 of file UTF8Utils.cpp.
Convert a UCS2 string into a UTF-8 (or ASCII) string.
| input | An Array of UCS2 characters |
Definition at line 424 of file UTF8Utils.cpp.
| BLOCXX_COMMON_API String BLOCXX_NAMESPACE::UTF8Utils::UCS2ToString | ( | const void * | input, |
| size_t | inputLength | ||
| ) |
Convert a UCS2 string into a UTF-8 (or ASCII) string.
| input | An Array of UCS2 characters |
| inputLength | The size (in bytes) of input. |
Definition at line 409 of file UTF8Utils.cpp.
| BLOCXX_COMMON_API String BLOCXX_NAMESPACE::UTF8Utils::UCS2toUTF8 | ( | UInt16 | ucs2char | ) |
Convert one UCS2 16-bit char into a UTF-8 char (possibly multiple bytes)
| ucs2char | UCS2 char to convert. |
Definition at line 165 of file UTF8Utils.cpp.
| BLOCXX_COMMON_API String BLOCXX_NAMESPACE::UTF8Utils::UCS4toUTF8 | ( | UInt32 | ucs4char | ) |
Convert one UCS4 32-bit char into a UTF-8 char (possibly multiple bytes)
| ucs4char | UCS4 char to convert. |
Definition at line 229 of file UTF8Utils.cpp.
| BLOCXX_COMMON_API void BLOCXX_NAMESPACE::UTF8Utils::UCS4toUTF8 | ( | UInt32 | ucs4char, |
| StringBuffer & | sb | ||
| ) |
Convert one UCS4 32-bit char into a UTF-8 char (possibly multiple bytes) This version is faster to use in a loop than the version which returns a String.
| ucs4char | UCS4 char to convert. |
| sb | The corresponding UTF-8 char will be appended to the end of sb. |
Definition at line 237 of file UTF8Utils.cpp.
| BLOCXX_COMMON_API UInt16 BLOCXX_NAMESPACE::UTF8Utils::UTF8toUCS2 | ( | const char * | utf8char | ) |
Convert one UTF-8 char (possibly multiple bytes) into a UCS2 16-bit char.
| utf8char | pointer to the UTF-8 char to convert |
Definition at line 152 of file UTF8Utils.cpp.
| BLOCXX_COMMON_API UInt32 BLOCXX_NAMESPACE::UTF8Utils::UTF8toUCS4 | ( | const char * | utf8char | ) |
Convert one UTF-8 char (possibly multiple bytes) into a UCS4 32-bit char.
| utf8char | pointer to the UTF-8 char to convert |
Definition at line 171 of file UTF8Utils.cpp.
1.8.18