# Collations and Character Sets

Dolt supports a subset of the character sets and collations that MySQL supports. Notably, the default character set is `utf8mb4`, while the default collation is `utf8mb4_0900_bin` (a case-sensitive collation). This default was chosen as it has the fastest implementation, and also from a legacy perspective, as before proper collation support was added, it was the only real collation that we supported. This differs from a standard MySQL instance, which defaults to `utf8mb4_0900_ai_ci` (a case-insensitive collation). Character sets and collations are added upon request, so please [file an issue](https://github.com/dolthub/dolt/issues) if a character set or collation that you need is missing.

**Currently supporting 181 of 286 MySQL collations.**

| Collation                       | Character Set | Supported |
| ------------------------------- | ------------- | --------- |
| armscii8\_bin                   | armscii8      | ❌         |
| armscii8\_general\_ci           | armscii8      | ❌         |
| ascii\_bin                      | ascii         | ✅         |
| ascii\_general\_ci              | ascii         | ✅         |
| big5\_bin                       | big5          | ❌         |
| big5\_chinese\_ci               | big5          | ❌         |
| binary                          | binary        | ✅         |
| cp1250\_bin                     | cp1250        | ❌         |
| cp1250\_croatian\_ci            | cp1250        | ❌         |
| cp1250\_czech\_cs               | cp1250        | ❌         |
| cp1250\_general\_ci             | cp1250        | ❌         |
| cp1250\_polish\_ci              | cp1250        | ❌         |
| cp1251\_bin                     | cp1251        | ❌         |
| cp1251\_bulgarian\_ci           | cp1251        | ❌         |
| cp1251\_general\_ci             | cp1251        | ❌         |
| cp1251\_general\_cs             | cp1251        | ❌         |
| cp1251\_ukrainian\_ci           | cp1251        | ❌         |
| cp1256\_bin                     | cp1256        | ✅         |
| cp1256\_general\_ci             | cp1256        | ✅         |
| cp1257\_bin                     | cp1257        | ✅         |
| cp1257\_general\_ci             | cp1257        | ✅         |
| cp1257\_lithuanian\_ci          | cp1257        | ✅         |
| cp850\_bin                      | cp850         | ❌         |
| cp850\_general\_ci              | cp850         | ❌         |
| cp852\_bin                      | cp852         | ❌         |
| cp852\_general\_ci              | cp852         | ❌         |
| cp866\_bin                      | cp866         | ❌         |
| cp866\_general\_ci              | cp866         | ❌         |
| cp932\_bin                      | cp932         | ❌         |
| cp932\_japanese\_ci             | cp932         | ❌         |
| dec8\_bin                       | dec8          | ✅         |
| dec8\_swedish\_ci               | dec8          | ✅         |
| eucjpms\_bin                    | eucjpms       | ❌         |
| eucjpms\_japanese\_ci           | eucjpms       | ❌         |
| euckr\_bin                      | euckr         | ❌         |
| euckr\_korean\_ci               | euckr         | ❌         |
| gb18030\_bin                    | gb18030       | ❌         |
| gb18030\_chinese\_ci            | gb18030       | ❌         |
| gb18030\_unicode\_520\_ci       | gb18030       | ❌         |
| gb2312\_bin                     | gb2312        | ❌         |
| gb2312\_chinese\_ci             | gb2312        | ❌         |
| gbk\_bin                        | gbk           | ❌         |
| gbk\_chinese\_ci                | gbk           | ❌         |
| geostd8\_bin                    | geostd8       | ✅         |
| geostd8\_general\_ci            | geostd8       | ✅         |
| greek\_bin                      | greek         | ❌         |
| greek\_general\_ci              | greek         | ❌         |
| hebrew\_bin                     | hebrew        | ❌         |
| hebrew\_general\_ci             | hebrew        | ❌         |
| hp8\_bin                        | hp8           | ❌         |
| hp8\_english\_ci                | hp8           | ❌         |
| keybcs2\_bin                    | keybcs2       | ❌         |
| keybcs2\_general\_ci            | keybcs2       | ❌         |
| koi8r\_bin                      | koi8r         | ❌         |
| koi8r\_general\_ci              | koi8r         | ❌         |
| koi8u\_bin                      | koi8u         | ❌         |
| koi8u\_general\_ci              | koi8u         | ❌         |
| latin1\_bin                     | latin1        | ✅         |
| latin1\_danish\_ci              | latin1        | ✅         |
| latin1\_general\_ci             | latin1        | ✅         |
| latin1\_general\_cs             | latin1        | ✅         |
| latin1\_german1\_ci             | latin1        | ✅         |
| latin1\_german2\_ci             | latin1        | ✅         |
| latin1\_spanish\_ci             | latin1        | ✅         |
| latin1\_swedish\_ci             | latin1        | ✅         |
| latin2\_bin                     | latin2        | ❌         |
| latin2\_croatian\_ci            | latin2        | ❌         |
| latin2\_czech\_cs               | latin2        | ❌         |
| latin2\_general\_ci             | latin2        | ❌         |
| latin2\_hungarian\_ci           | latin2        | ❌         |
| latin5\_bin                     | latin5        | ❌         |
| latin5\_turkish\_ci             | latin5        | ❌         |
| latin7\_bin                     | latin7        | ✅         |
| latin7\_estonian\_cs            | latin7        | ✅         |
| latin7\_general\_ci             | latin7        | ✅         |
| latin7\_general\_cs             | latin7        | ✅         |
| macce\_bin                      | macce         | ❌         |
| macce\_general\_ci              | macce         | ❌         |
| macroman\_bin                   | macroman      | ❌         |
| macroman\_general\_ci           | macroman      | ❌         |
| sjis\_bin                       | sjis          | ❌         |
| sjis\_japanese\_ci              | sjis          | ❌         |
| swe7\_bin                       | swe7          | ✅         |
| swe7\_swedish\_ci               | swe7          | ✅         |
| tis620\_bin                     | tis620        | ❌         |
| tis620\_thai\_ci                | tis620        | ❌         |
| ucs2\_bin                       | ucs2          | ❌         |
| ucs2\_croatian\_ci              | ucs2          | ❌         |
| ucs2\_czech\_ci                 | ucs2          | ❌         |
| ucs2\_danish\_ci                | ucs2          | ❌         |
| ucs2\_esperanto\_ci             | ucs2          | ❌         |
| ucs2\_estonian\_ci              | ucs2          | ❌         |
| ucs2\_general\_ci               | ucs2          | ❌         |
| ucs2\_general\_mysql500\_ci     | ucs2          | ❌         |
| ucs2\_german2\_ci               | ucs2          | ❌         |
| ucs2\_hungarian\_ci             | ucs2          | ❌         |
| ucs2\_icelandic\_ci             | ucs2          | ❌         |
| ucs2\_latvian\_ci               | ucs2          | ❌         |
| ucs2\_lithuanian\_ci            | ucs2          | ❌         |
| ucs2\_persian\_ci               | ucs2          | ❌         |
| ucs2\_polish\_ci                | ucs2          | ❌         |
| ucs2\_roman\_ci                 | ucs2          | ❌         |
| ucs2\_romanian\_ci              | ucs2          | ❌         |
| ucs2\_sinhala\_ci               | ucs2          | ❌         |
| ucs2\_slovak\_ci                | ucs2          | ❌         |
| ucs2\_slovenian\_ci             | ucs2          | ❌         |
| ucs2\_spanish2\_ci              | ucs2          | ❌         |
| ucs2\_spanish\_ci               | ucs2          | ❌         |
| ucs2\_swedish\_ci               | ucs2          | ❌         |
| ucs2\_turkish\_ci               | ucs2          | ❌         |
| ucs2\_unicode\_520\_ci          | ucs2          | ❌         |
| ucs2\_unicode\_ci               | ucs2          | ❌         |
| ucs2\_vietnamese\_ci            | ucs2          | ❌         |
| ujis\_bin                       | ujis          | ❌         |
| ujis\_japanese\_ci              | ujis          | ❌         |
| utf16\_bin                      | utf16         | ✅         |
| utf16\_croatian\_ci             | utf16         | ✅         |
| utf16\_czech\_ci                | utf16         | ✅         |
| utf16\_danish\_ci               | utf16         | ✅         |
| utf16\_esperanto\_ci            | utf16         | ✅         |
| utf16\_estonian\_ci             | utf16         | ✅         |
| utf16\_general\_ci              | utf16         | ✅         |
| utf16\_german2\_ci              | utf16         | ✅         |
| utf16\_hungarian\_ci            | utf16         | ✅         |
| utf16\_icelandic\_ci            | utf16         | ✅         |
| utf16\_latvian\_ci              | utf16         | ✅         |
| utf16\_lithuanian\_ci           | utf16         | ✅         |
| utf16\_persian\_ci              | utf16         | ✅         |
| utf16\_polish\_ci               | utf16         | ✅         |
| utf16\_roman\_ci                | utf16         | ✅         |
| utf16\_romanian\_ci             | utf16         | ✅         |
| utf16\_sinhala\_ci              | utf16         | ✅         |
| utf16\_slovak\_ci               | utf16         | ✅         |
| utf16\_slovenian\_ci            | utf16         | ✅         |
| utf16\_spanish2\_ci             | utf16         | ✅         |
| utf16\_spanish\_ci              | utf16         | ✅         |
| utf16\_swedish\_ci              | utf16         | ✅         |
| utf16\_turkish\_ci              | utf16         | ✅         |
| utf16\_unicode\_520\_ci         | utf16         | ✅         |
| utf16\_unicode\_ci              | utf16         | ✅         |
| utf16\_vietnamese\_ci           | utf16         | ✅         |
| utf16le\_bin                    | utf16le       | ❌         |
| utf16le\_general\_ci            | utf16le       | ❌         |
| utf32\_bin                      | utf32         | ✅         |
| utf32\_croatian\_ci             | utf32         | ✅         |
| utf32\_czech\_ci                | utf32         | ✅         |
| utf32\_danish\_ci               | utf32         | ✅         |
| utf32\_esperanto\_ci            | utf32         | ✅         |
| utf32\_estonian\_ci             | utf32         | ✅         |
| utf32\_general\_ci              | utf32         | ✅         |
| utf32\_german2\_ci              | utf32         | ✅         |
| utf32\_hungarian\_ci            | utf32         | ✅         |
| utf32\_icelandic\_ci            | utf32         | ✅         |
| utf32\_latvian\_ci              | utf32         | ✅         |
| utf32\_lithuanian\_ci           | utf32         | ✅         |
| utf32\_persian\_ci              | utf32         | ✅         |
| utf32\_polish\_ci               | utf32         | ✅         |
| utf32\_roman\_ci                | utf32         | ✅         |
| utf32\_romanian\_ci             | utf32         | ✅         |
| utf32\_sinhala\_ci              | utf32         | ✅         |
| utf32\_slovak\_ci               | utf32         | ✅         |
| utf32\_slovenian\_ci            | utf32         | ✅         |
| utf32\_spanish2\_ci             | utf32         | ✅         |
| utf32\_spanish\_ci              | utf32         | ✅         |
| utf32\_swedish\_ci              | utf32         | ✅         |
| utf32\_turkish\_ci              | utf32         | ✅         |
| utf32\_unicode\_520\_ci         | utf32         | ✅         |
| utf32\_unicode\_ci              | utf32         | ✅         |
| utf32\_vietnamese\_ci           | utf32         | ✅         |
| utf8mb3\_bin                    | utf8mb3       | ✅         |
| utf8mb3\_croatian\_ci           | utf8mb3       | ✅         |
| utf8mb3\_czech\_ci              | utf8mb3       | ✅         |
| utf8mb3\_danish\_ci             | utf8mb3       | ✅         |
| utf8mb3\_esperanto\_ci          | utf8mb3       | ✅         |
| utf8mb3\_estonian\_ci           | utf8mb3       | ✅         |
| utf8mb3\_general\_ci            | utf8mb3       | ✅         |
| utf8mb3\_general\_mysql500\_ci  | utf8mb3       | ✅         |
| utf8mb3\_german2\_ci            | utf8mb3       | ✅         |
| utf8mb3\_hungarian\_ci          | utf8mb3       | ✅         |
| utf8mb3\_icelandic\_ci          | utf8mb3       | ✅         |
| utf8mb3\_latvian\_ci            | utf8mb3       | ✅         |
| utf8mb3\_lithuanian\_ci         | utf8mb3       | ✅         |
| utf8mb3\_persian\_ci            | utf8mb3       | ✅         |
| utf8mb3\_polish\_ci             | utf8mb3       | ✅         |
| utf8mb3\_roman\_ci              | utf8mb3       | ✅         |
| utf8mb3\_romanian\_ci           | utf8mb3       | ✅         |
| utf8mb3\_sinhala\_ci            | utf8mb3       | ✅         |
| utf8mb3\_slovak\_ci             | utf8mb3       | ✅         |
| utf8mb3\_slovenian\_ci          | utf8mb3       | ✅         |
| utf8mb3\_spanish2\_ci           | utf8mb3       | ✅         |
| utf8mb3\_spanish\_ci            | utf8mb3       | ✅         |
| utf8mb3\_swedish\_ci            | utf8mb3       | ✅         |
| utf8mb3\_tolower\_ci            | utf8mb3       | ✅         |
| utf8mb3\_turkish\_ci            | utf8mb3       | ✅         |
| utf8mb3\_unicode\_520\_ci       | utf8mb3       | ✅         |
| utf8mb3\_unicode\_ci            | utf8mb3       | ✅         |
| utf8mb3\_vietnamese\_ci         | utf8mb3       | ✅         |
| utf8mb4\_0900\_ai\_ci           | utf8mb4       | ✅         |
| utf8mb4\_0900\_as\_ci           | utf8mb4       | ✅         |
| utf8mb4\_0900\_as\_cs           | utf8mb4       | ✅         |
| utf8mb4\_0900\_bin              | utf8mb4       | ✅         |
| utf8mb4\_bg\_0900\_ai\_ci       | utf8mb4       | ❌         |
| utf8mb4\_bg\_0900\_as\_cs       | utf8mb4       | ❌         |
| utf8mb4\_bin                    | utf8mb4       | ✅         |
| utf8mb4\_bs\_0900\_ai\_ci       | utf8mb4       | ❌         |
| utf8mb4\_bs\_0900\_as\_cs       | utf8mb4       | ❌         |
| utf8mb4\_croatian\_ci           | utf8mb4       | ✅         |
| utf8mb4\_cs\_0900\_ai\_ci       | utf8mb4       | ✅         |
| utf8mb4\_cs\_0900\_as\_cs       | utf8mb4       | ✅         |
| utf8mb4\_czech\_ci              | utf8mb4       | ✅         |
| utf8mb4\_da\_0900\_ai\_ci       | utf8mb4       | ✅         |
| utf8mb4\_da\_0900\_as\_cs       | utf8mb4       | ✅         |
| utf8mb4\_danish\_ci             | utf8mb4       | ✅         |
| utf8mb4\_de\_pb\_0900\_ai\_ci   | utf8mb4       | ✅         |
| utf8mb4\_de\_pb\_0900\_as\_cs   | utf8mb4       | ✅         |
| utf8mb4\_eo\_0900\_ai\_ci       | utf8mb4       | ✅         |
| utf8mb4\_eo\_0900\_as\_cs       | utf8mb4       | ✅         |
| utf8mb4\_es\_0900\_ai\_ci       | utf8mb4       | ✅         |
| utf8mb4\_es\_0900\_as\_cs       | utf8mb4       | ✅         |
| utf8mb4\_es\_trad\_0900\_ai\_ci | utf8mb4       | ✅         |
| utf8mb4\_es\_trad\_0900\_as\_cs | utf8mb4       | ✅         |
| utf8mb4\_esperanto\_ci          | utf8mb4       | ✅         |
| utf8mb4\_estonian\_ci           | utf8mb4       | ✅         |
| utf8mb4\_et\_0900\_ai\_ci       | utf8mb4       | ✅         |
| utf8mb4\_et\_0900\_as\_cs       | utf8mb4       | ✅         |
| utf8mb4\_general\_ci            | utf8mb4       | ✅         |
| utf8mb4\_german2\_ci            | utf8mb4       | ✅         |
| utf8mb4\_gl\_0900\_ai\_ci       | utf8mb4       | ❌         |
| utf8mb4\_gl\_0900\_as\_cs       | utf8mb4       | ❌         |
| utf8mb4\_hr\_0900\_ai\_ci       | utf8mb4       | ✅         |
| utf8mb4\_hr\_0900\_as\_cs       | utf8mb4       | ✅         |
| utf8mb4\_hu\_0900\_ai\_ci       | utf8mb4       | ✅         |
| utf8mb4\_hu\_0900\_as\_cs       | utf8mb4       | ✅         |
| utf8mb4\_hungarian\_ci          | utf8mb4       | ✅         |
| utf8mb4\_icelandic\_ci          | utf8mb4       | ✅         |
| utf8mb4\_is\_0900\_ai\_ci       | utf8mb4       | ✅         |
| utf8mb4\_is\_0900\_as\_cs       | utf8mb4       | ✅         |
| utf8mb4\_ja\_0900\_as\_cs       | utf8mb4       | ✅         |
| utf8mb4\_ja\_0900\_as\_cs\_ks   | utf8mb4       | ✅         |
| utf8mb4\_la\_0900\_ai\_ci       | utf8mb4       | ✅         |
| utf8mb4\_la\_0900\_as\_cs       | utf8mb4       | ✅         |
| utf8mb4\_latvian\_ci            | utf8mb4       | ✅         |
| utf8mb4\_lithuanian\_ci         | utf8mb4       | ✅         |
| utf8mb4\_lt\_0900\_ai\_ci       | utf8mb4       | ✅         |
| utf8mb4\_lt\_0900\_as\_cs       | utf8mb4       | ✅         |
| utf8mb4\_lv\_0900\_ai\_ci       | utf8mb4       | ✅         |
| utf8mb4\_lv\_0900\_as\_cs       | utf8mb4       | ✅         |
| utf8mb4\_mn\_cyrl\_0900\_ai\_ci | utf8mb4       | ❌         |
| utf8mb4\_mn\_cyrl\_0900\_as\_cs | utf8mb4       | ❌         |
| utf8mb4\_nb\_0900\_ai\_ci       | utf8mb4       | ❌         |
| utf8mb4\_nb\_0900\_as\_cs       | utf8mb4       | ❌         |
| utf8mb4\_nn\_0900\_ai\_ci       | utf8mb4       | ❌         |
| utf8mb4\_nn\_0900\_as\_cs       | utf8mb4       | ❌         |
| utf8mb4\_persian\_ci            | utf8mb4       | ✅         |
| utf8mb4\_pl\_0900\_ai\_ci       | utf8mb4       | ✅         |
| utf8mb4\_pl\_0900\_as\_cs       | utf8mb4       | ✅         |
| utf8mb4\_polish\_ci             | utf8mb4       | ✅         |
| utf8mb4\_ro\_0900\_ai\_ci       | utf8mb4       | ✅         |
| utf8mb4\_ro\_0900\_as\_cs       | utf8mb4       | ✅         |
| utf8mb4\_roman\_ci              | utf8mb4       | ✅         |
| utf8mb4\_romanian\_ci           | utf8mb4       | ✅         |
| utf8mb4\_ru\_0900\_ai\_ci       | utf8mb4       | ✅         |
| utf8mb4\_ru\_0900\_as\_cs       | utf8mb4       | ✅         |
| utf8mb4\_sinhala\_ci            | utf8mb4       | ✅         |
| utf8mb4\_sk\_0900\_ai\_ci       | utf8mb4       | ✅         |
| utf8mb4\_sk\_0900\_as\_cs       | utf8mb4       | ✅         |
| utf8mb4\_sl\_0900\_ai\_ci       | utf8mb4       | ✅         |
| utf8mb4\_sl\_0900\_as\_cs       | utf8mb4       | ✅         |
| utf8mb4\_slovak\_ci             | utf8mb4       | ✅         |
| utf8mb4\_slovenian\_ci          | utf8mb4       | ✅         |
| utf8mb4\_spanish2\_ci           | utf8mb4       | ✅         |
| utf8mb4\_spanish\_ci            | utf8mb4       | ✅         |
| utf8mb4\_sr\_latn\_0900\_ai\_ci | utf8mb4       | ❌         |
| utf8mb4\_sr\_latn\_0900\_as\_cs | utf8mb4       | ❌         |
| utf8mb4\_sv\_0900\_ai\_ci       | utf8mb4       | ✅         |
| utf8mb4\_sv\_0900\_as\_cs       | utf8mb4       | ✅         |
| utf8mb4\_swedish\_ci            | utf8mb4       | ✅         |
| utf8mb4\_tr\_0900\_ai\_ci       | utf8mb4       | ✅         |
| utf8mb4\_tr\_0900\_as\_cs       | utf8mb4       | ✅         |
| utf8mb4\_turkish\_ci            | utf8mb4       | ✅         |
| utf8mb4\_unicode\_520\_ci       | utf8mb4       | ✅         |
| utf8mb4\_unicode\_ci            | utf8mb4       | ✅         |
| utf8mb4\_vi\_0900\_ai\_ci       | utf8mb4       | ✅         |
| utf8mb4\_vi\_0900\_as\_cs       | utf8mb4       | ✅         |
| utf8mb4\_vietnamese\_ci         | utf8mb4       | ✅         |
| utf8mb4\_zh\_0900\_as\_cs       | utf8mb4       | ✅         |
