mariadb/sql/share/charsets
cmiller@zippy.cornsilk.net 1380fb167d Merge zippy.cornsilk.net:/home/cmiller/work/mysql/bug27562/my50-bug27562
into  zippy.cornsilk.net:/home/cmiller/work/mysql/bug27562/my51-bug27562
2007-08-07 06:22:52 -04:00
armscii8.xml
ascii.xml
cp850.xml
cp852.xml
cp866.xml
cp1250.xml
cp1251.xml
cp1256.xml
cp1257.xml
dec8.xml
geostd8.xml
greek.xml
hebrew.xml
hp8.xml
Index.xml
keybcs2.xml
koi8r.xml
koi8u.xml
languages.html
latin1.xml
latin2.xml
latin5.xml
latin7.xml
macce.xml
macroman.xml
README
swe7.xml

This directory holds configuration files which allow MySQL to work with
different character sets.  It contains:

*.conf
    Each conf file contains four tables (ctype, to_lower, to_upper and
    sort_order) which describe character types, lower- and upper-case
    equivalents and sorting order for the character values in the set.

Index
    The Index file lists all of the available charset configurations.

    Each charset is paired with a number.  The number is stored
    IN THE DATABASE TABLE FILES and must not be changed.  Always
    add new character sets to the end of the list, so that the
    numbers of the other character sets will not be changed.
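
    The append-only rule above can be sketched as follows.  This is only
    an illustration: the in-memory list of (name, number) pairs, the
    add_charset helper, the placeholder charset names and the choice to
    start numbering at 1 are all assumptions, not the real Index format.

```python
def add_charset(index, name):
    """Return a new index with `name` appended under a fresh number.

    `index` is a hypothetical list of (name, number) pairs; existing
    entries are never renumbered or reordered.
    """
    if any(existing == name for existing, _ in index):
        raise ValueError(f"charset {name!r} is already listed")
    # Assumption: a new set takes the next number after the highest in use.
    next_number = max((number for _, number in index), default=0) + 1
    return index + [(name, next_number)]


# Placeholder names and numbers, not real MySQL charset ids.
index = [("charset_a", 1), ("charset_b", 2)]
updated = add_charset(index, "charset_c")
```

    Because the new entry goes at the end, every pair that was already
    in the list keeps the number stored in existing table files.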

Compiled in or configuration file?
    When should a character set be compiled into MySQL's string library
    (libmystrings), and when should it be placed in a configuration
    file?

    If the character set requires the strcoll functions or is a
    multi-byte character set, it MUST be compiled into the string
    library.  If it does not require these functions, it should be
    placed in a configuration file.

    If the character set uses any one of the strcoll functions, it
    must define all of them.  Likewise, if the set uses one of the
    multi-byte functions, it must define them all.  See the manual for
    more information on how to add a complex character set to MySQL.
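
    The two rules above can be written down as a small check.  This is a
    sketch only: placement, missing_functions and the function names
    passed to them are hypothetical and do not come from libmystrings.

```python
def placement(needs_strcoll, is_multibyte):
    """Say where a character set belongs under the rule above."""
    if needs_strcoll or is_multibyte:
        return "compiled in"
    return "configuration file"


def missing_functions(defined, family):
    """List the members of `family` that a character set still has to define.

    If the set defines none of the family, nothing is required; if it
    defines at least one, every member of the family must be defined.
    """
    used = set(defined) & set(family)
    if not used:
        return []
    return sorted(set(family) - used)
```

    For example, placement(False, True) reports "compiled in" for a
    multi-byte set even when it needs no strcoll functions.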

Syntax of configuration files
    The syntax is very simple.  Comments start with a '#' character and
    proceed to the end of the line.  Words are separated by arbitrary
    amounts of whitespace.

    For the character set configuration files, every word must be a
    number in hexadecimal format.  The ctype array takes up the first
    257 words; the to_lower, to_upper and sort_order arrays take up 256
    words each after that.
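
    A minimal reader for this syntax might look like the sketch below.
    The name parse_charset_conf and the decision to reject any file that
    does not hold exactly 257 + 3 * 256 words are assumptions made for
    illustration; this is not MySQL's own loader.

```python
CTYPE_WORDS = 257  # size of the ctype array, as stated above
TABLE_WORDS = 256  # size of to_lower, to_upper and sort_order


def parse_charset_conf(text):
    """Split charset configuration text into its four tables."""
    words = []
    for line in text.splitlines():
        # A comment runs from '#' to the end of the line.
        line = line.split("#", 1)[0]
        # Words are separated by whitespace; each one is a hex number.
        words.extend(int(word, 16) for word in line.split())

    # Assumption: the file holds exactly the four tables and nothing else.
    expected = CTYPE_WORDS + 3 * TABLE_WORDS
    if len(words) != expected:
        raise ValueError(f"expected {expected} words, found {len(words)}")

    layout = (
        ("ctype", CTYPE_WORDS),
        ("to_lower", TABLE_WORDS),
        ("to_upper", TABLE_WORDS),
        ("sort_order", TABLE_WORDS),
    )
    tables = {}
    pos = 0
    for name, size in layout:
        tables[name] = words[pos:pos + size]
        pos += size
    return tables
```

    The result is a dictionary keyed by table name, so
    parse_charset_conf(text)["sort_order"] gives the 256 sort weights.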