Enumerations | |
| enum | category |
| Each Unicode character is in exactly one of these categories. | |
Functions | |
| unsigned | nonascii_to_utf8 (unsigned ch, char *buf) |
| Convert a single non-ASCII Unicode character to UTF-8. | |
| unsigned | to_utf8 (unsigned ch, char *buf) |
| Convert a single Unicode character to UTF-8. | |
| void | append_utf8 (std::string &s, unsigned ch) |
| Append the UTF-8 representation of a single Unicode character to a std::string. | |
| category | get_category (unsigned ch) |
| Return the category which a given Unicode character falls into. | |
| bool | is_wordchar (unsigned ch) |
| Test if a given Unicode character is a "word character". | |
| bool | is_whitespace (unsigned ch) |
| Test if a given Unicode character is a whitespace character. | |
| bool | is_currency (unsigned ch) |
| Test if a given Unicode character is a currency symbol. | |
| unsigned | tolower (unsigned ch) |
| Convert a Unicode character to lowercase. | |
| unsigned | toupper (unsigned ch) |
| Convert a Unicode character to uppercase. | |
| std::string | tolower (const std::string &term) |
| Convert a UTF-8 std::string to lowercase. | |
| std::string | toupper (const std::string &term) |
| Convert a UTF-8 std::string to uppercase. | |
| enum Xapian::Unicode::category |
Each Unicode character is in exactly one of these categories.
| unsigned Xapian::Unicode::nonascii_to_utf8 (unsigned ch, char *buf) |
Convert a single non-ASCII Unicode character to UTF-8.
This is intended mainly as a helper method for to_utf8().
The character ch (which must be >= 128, i.e. outside the ASCII range) is written to the buffer buf as UTF-8, and the length in bytes of the resulting sequence is returned.
NB buf must have space for (at least) 4 bytes.
Referenced by to_utf8().
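The contract described above (write up to 4 bytes into buf, return the byte count) can be illustrated with a minimal standalone sketch. The name sketch_nonascii_to_utf8 is hypothetical and this is not Xapian's actual implementation; it assumes ch is a valid code point in the range 0x80 to 0x10FFFF and does no validation.

```cpp
// Hypothetical stand-in for the documented helper; NOT Xapian's own code.
// Encodes a code point ch (assumed 0x80 <= ch <= 0x10FFFF) into buf and
// returns the number of bytes written (2, 3 or 4).
// buf must have room for at least 4 bytes.
inline unsigned sketch_nonascii_to_utf8(unsigned ch, char *buf) {
    if (ch < 0x800) {
        // 2 bytes: 110xxxxx 10xxxxxx
        buf[0] = static_cast<char>(0xC0 | (ch >> 6));
        buf[1] = static_cast<char>(0x80 | (ch & 0x3F));
        return 2;
    }
    if (ch < 0x10000) {
        // 3 bytes: 1110xxxx 10xxxxxx 10xxxxxx
        buf[0] = static_cast<char>(0xE0 | (ch >> 12));
        buf[1] = static_cast<char>(0x80 | ((ch >> 6) & 0x3F));
        buf[2] = static_cast<char>(0x80 | (ch & 0x3F));
        return 3;
    }
    // 4 bytes: 11110xxx 10xxxxxx 10xxxxxx 10xxxxxx
    buf[0] = static_cast<char>(0xF0 | (ch >> 18));
    buf[1] = static_cast<char>(0x80 | ((ch >> 12) & 0x3F));
    buf[2] = static_cast<char>(0x80 | ((ch >> 6) & 0x3F));
    buf[3] = static_cast<char>(0x80 | (ch & 0x3F));
    return 4;
}
```

The 4-byte minimum for buf follows from the longest case: code points at or above 0x10000 need one lead byte and three continuation bytes.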
| unsigned Xapian::Unicode::to_utf8 (unsigned ch, char *buf) [inline] |
Convert a single Unicode character to UTF-8.
The character ch is written to the buffer buf as UTF-8, and the length in bytes of the resulting sequence is returned.
NB buf must have space for (at least) 4 bytes.
References nonascii_to_utf8().
Referenced by append_utf8().
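The reference chain above (append_utf8() calls to_utf8(), which calls nonascii_to_utf8()) suggests a small inline function that handles the single-byte ASCII case itself and hands everything else to an out-of-line helper. The following is a sketch of that arrangement under that assumption, not Xapian's source. All three sketch_ names are hypothetical, and the helper assumes a valid code point no greater than 0x10FFFF.

```cpp
#include <string>

// Hypothetical out-of-line helper for ch >= 0x80 (NOT Xapian's code).
// The byte count is chosen from the magnitude of ch; continuation bytes are
// filled from the last byte backwards, 6 bits at a time.
unsigned sketch_encode_multibyte(unsigned ch, char *buf) {
    static const unsigned char lead[] = {0, 0, 0xC0, 0xE0, 0xF0};
    unsigned len = ch < 0x800 ? 2 : (ch < 0x10000 ? 3 : 4);
    for (unsigned i = len - 1; i > 0; --i) {
        buf[i] = static_cast<char>(0x80 | (ch & 0x3F));
        ch >>= 6;
    }
    buf[0] = static_cast<char>(lead[len] | ch);
    return len;
}

// Hypothetical counterpart of to_utf8(): ASCII is copied through as one byte,
// anything else is delegated. buf needs room for at least 4 bytes.
inline unsigned sketch_to_utf8(unsigned ch, char *buf) {
    if (ch < 0x80) {
        *buf = static_cast<char>(ch);
        return 1;
    }
    return sketch_encode_multibyte(ch, buf);
}

// Hypothetical counterpart of append_utf8(): encode into a 4-byte stack
// buffer, then append exactly the bytes that were written.
inline void sketch_append_utf8(std::string &s, unsigned ch) {
    char buf[4];
    unsigned len = sketch_to_utf8(ch, buf);
    s.append(buf, len);
}
```

With the real library, the equivalent call pattern is to declare `char buf[4];`, call `Xapian::Unicode::to_utf8(ch, buf)`, and use only the returned number of bytes; the buffer is not NUL-terminated.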