LogoLogo
DoltHubBlogDiscordGitHubDolt
  • Introduction
    • What Is Dolt?
    • Installation
      • Linux
      • Windows
      • Mac
      • Build from Source
      • Application Server
      • Docker
      • Upgrading
    • Getting Started
      • Version Controlled Database
      • Git For Data
      • Versioned MySQL Replica
    • Use Cases
      • Data Sharing
      • Data and Model Quality Control
      • Manual Data Curation
      • Version Control for your Application
      • Versioned MySQL Replica
      • Audit
      • Configuration Management
      • Offline First
  • Concepts
    • Dolt
      • Git
        • Commits
        • Log
        • Diff
        • Branch
        • Merge
        • Conflicts
        • Remotes
        • Working Set
      • SQL
        • Databases
        • Schema
        • Tables
        • Primary Keys
        • Types
        • Indexes
        • Views
        • Constraints
        • Triggers
        • Procedures
        • Users/Grants
        • Transactions
        • System Variables
      • RDBMS
        • Server
        • Backups
        • Replication
    • DoltHub/DoltLab
      • Permissions
      • Pull Requests
      • Issues
      • Forks
  • SQL Reference
    • Running the Server
      • Configuration
      • Access Management
      • Branch Permissions
      • Backups
      • Garbage Collection
      • Metrics
      • Replication
      • Troubleshooting
    • Version Control Features
      • Using Branches
      • Merges
      • Querying History
      • Using Remotes
      • Procedures
      • Functions
      • System Tables
      • System Variables
      • Saved Queries
    • SQL Language Support
      • Data Description
      • Expressions, Functions, Operators
      • Supported Statements
      • MySQL Information Schema
      • Collations and Character Sets
      • System Variables
      • Miscellaneous
    • Supported Clients
      • Programmatic
      • SQL Editors
    • Benchmarks and Metrics
      • Correctness
      • Latency
      • Import
  • CLI Reference
    • Commands
    • Git Comparison
  • Architecture
    • Overview
    • Storage Engine
      • Commit Graph
      • Prolly Trees
      • Block Store
    • SQL
      • Go MySQL Server
      • Vitess
  • Guides
    • Cheat Sheet
    • Contributing
      • dolt
      • go-mysql-server
    • MySQL to Dolt Replication
    • Importing Data
    • Integrations
  • Other
    • FAQ
    • Roadmap
    • Versioning
  • Products
    • Hosted Dolt
      • Getting Started
      • Notable Features
      • SQL Workbench
      • Cloning a Hosted Database
      • Using DoltHub as a Remote
      • Infrastructure
    • DoltHub
      • Data Sharing
      • API
        • Authentication
        • SQL
        • CSV
        • Database
        • Hooks
      • Continuous Integration
        • Getting Started
        • Workflow Reference
      • Transform File Uploads
      • Workspaces
    • DoltLab
    • Dolt Workbench
    • DoltgreSQL
Powered by GitBook
On this page

Was this helpful?

Edit on GitHub
Export as PDF
  1. SQL Reference
  2. SQL Language Support

Collations and Character Sets

PreviousMySQL Information SchemaNextSystem Variables

Last updated 5 months ago

Was this helpful?

Dolt supports a subset of the character sets and collations that MySQL supports. Notably, the default character set is utf8mb4, while the default collation is utf8mb4_0900_bin (a case-sensitive collation). This default was chosen as it has the fastest implementation, and also from a legacy perspective, as before proper collation support was added, it was the only real collation that we supported. This differs from a standard MySQL instance, which defaults to utf8mb4_0900_ai_ci (a case-insensitive collation). Character sets and collations are added upon request, so please if a character set or collation that you need is missing.

Currently supporting 181 of 286 MySQL collations.

Collation
Character Set
Supported

armscii8_bin

armscii8

❌

armscii8_general_ci

armscii8

❌

ascii_bin

ascii

✅

ascii_general_ci

ascii

✅

big5_bin

big5

❌

big5_chinese_ci

big5

❌

binary

binary

✅

cp1250_bin

cp1250

❌

cp1250_croatian_ci

cp1250

❌

cp1250_czech_cs

cp1250

❌

cp1250_general_ci

cp1250

❌

cp1250_polish_ci

cp1250

❌

cp1251_bin

cp1251

❌

cp1251_bulgarian_ci

cp1251

❌

cp1251_general_ci

cp1251

❌

cp1251_general_cs

cp1251

❌

cp1251_ukrainian_ci

cp1251

❌

cp1256_bin

cp1256

✅

cp1256_general_ci

cp1256

✅

cp1257_bin

cp1257

✅

cp1257_general_ci

cp1257

✅

cp1257_lithuanian_ci

cp1257

✅

cp850_bin

cp850

❌

cp850_general_ci

cp850

❌

cp852_bin

cp852

❌

cp852_general_ci

cp852

❌

cp866_bin

cp866

❌

cp866_general_ci

cp866

❌

cp932_bin

cp932

❌

cp932_japanese_ci

cp932

❌

dec8_bin

dec8

✅

dec8_swedish_ci

dec8

✅

eucjpms_bin

eucjpms

❌

eucjpms_japanese_ci

eucjpms

❌

euckr_bin

euckr

❌

euckr_korean_ci

euckr

❌

gb18030_bin

gb18030

❌

gb18030_chinese_ci

gb18030

❌

gb18030_unicode_520_ci

gb18030

❌

gb2312_bin

gb2312

❌

gb2312_chinese_ci

gb2312

❌

gbk_bin

gbk

❌

gbk_chinese_ci

gbk

❌

geostd8_bin

geostd8

✅

geostd8_general_ci

geostd8

✅

greek_bin

greek

❌

greek_general_ci

greek

❌

hebrew_bin

hebrew

❌

hebrew_general_ci

hebrew

❌

hp8_bin

hp8

❌

hp8_english_ci

hp8

❌

keybcs2_bin

keybcs2

❌

keybcs2_general_ci

keybcs2

❌

koi8r_bin

koi8r

❌

koi8r_general_ci

koi8r

❌

koi8u_bin

koi8u

❌

koi8u_general_ci

koi8u

❌

latin1_bin

latin1

✅

latin1_danish_ci

latin1

✅

latin1_general_ci

latin1

✅

latin1_general_cs

latin1

✅

latin1_german1_ci

latin1

✅

latin1_german2_ci

latin1

✅

latin1_spanish_ci

latin1

✅

latin1_swedish_ci

latin1

✅

latin2_bin

latin2

❌

latin2_croatian_ci

latin2

❌

latin2_czech_cs

latin2

❌

latin2_general_ci

latin2

❌

latin2_hungarian_ci

latin2

❌

latin5_bin

latin5

❌

latin5_turkish_ci

latin5

❌

latin7_bin

latin7

✅

latin7_estonian_cs

latin7

✅

latin7_general_ci

latin7

✅

latin7_general_cs

latin7

✅

macce_bin

macce

❌

macce_general_ci

macce

❌

macroman_bin

macroman

❌

macroman_general_ci

macroman

❌

sjis_bin

sjis

❌

sjis_japanese_ci

sjis

❌

swe7_bin

swe7

✅

swe7_swedish_ci

swe7

✅

tis620_bin

tis620

❌

tis620_thai_ci

tis620

❌

ucs2_bin

ucs2

❌

ucs2_croatian_ci

ucs2

❌

ucs2_czech_ci

ucs2

❌

ucs2_danish_ci

ucs2

❌

ucs2_esperanto_ci

ucs2

❌

ucs2_estonian_ci

ucs2

❌

ucs2_general_ci

ucs2

❌

ucs2_general_mysql500_ci

ucs2

❌

ucs2_german2_ci

ucs2

❌

ucs2_hungarian_ci

ucs2

❌

ucs2_icelandic_ci

ucs2

❌

ucs2_latvian_ci

ucs2

❌

ucs2_lithuanian_ci

ucs2

❌

ucs2_persian_ci

ucs2

❌

ucs2_polish_ci

ucs2

❌

ucs2_roman_ci

ucs2

❌

ucs2_romanian_ci

ucs2

❌

ucs2_sinhala_ci

ucs2

❌

ucs2_slovak_ci

ucs2

❌

ucs2_slovenian_ci

ucs2

❌

ucs2_spanish2_ci

ucs2

❌

ucs2_spanish_ci

ucs2

❌

ucs2_swedish_ci

ucs2

❌

ucs2_turkish_ci

ucs2

❌

ucs2_unicode_520_ci

ucs2

❌

ucs2_unicode_ci

ucs2

❌

ucs2_vietnamese_ci

ucs2

❌

ujis_bin

ujis

❌

ujis_japanese_ci

ujis

❌

utf16_bin

utf16

✅

utf16_croatian_ci

utf16

✅

utf16_czech_ci

utf16

✅

utf16_danish_ci

utf16

✅

utf16_esperanto_ci

utf16

✅

utf16_estonian_ci

utf16

✅

utf16_general_ci

utf16

✅

utf16_german2_ci

utf16

✅

utf16_hungarian_ci

utf16

✅

utf16_icelandic_ci

utf16

✅

utf16_latvian_ci

utf16

✅

utf16_lithuanian_ci

utf16

✅

utf16_persian_ci

utf16

✅

utf16_polish_ci

utf16

✅

utf16_roman_ci

utf16

✅

utf16_romanian_ci

utf16

✅

utf16_sinhala_ci

utf16

✅

utf16_slovak_ci

utf16

✅

utf16_slovenian_ci

utf16

✅

utf16_spanish2_ci

utf16

✅

utf16_spanish_ci

utf16

✅

utf16_swedish_ci

utf16

✅

utf16_turkish_ci

utf16

✅

utf16_unicode_520_ci

utf16

✅

utf16_unicode_ci

utf16

✅

utf16_vietnamese_ci

utf16

✅

utf16le_bin

utf16le

❌

utf16le_general_ci

utf16le

❌

utf32_bin

utf32

✅

utf32_croatian_ci

utf32

✅

utf32_czech_ci

utf32

✅

utf32_danish_ci

utf32

✅

utf32_esperanto_ci

utf32

✅

utf32_estonian_ci

utf32

✅

utf32_general_ci

utf32

✅

utf32_german2_ci

utf32

✅

utf32_hungarian_ci

utf32

✅

utf32_icelandic_ci

utf32

✅

utf32_latvian_ci

utf32

✅

utf32_lithuanian_ci

utf32

✅

utf32_persian_ci

utf32

✅

utf32_polish_ci

utf32

✅

utf32_roman_ci

utf32

✅

utf32_romanian_ci

utf32

✅

utf32_sinhala_ci

utf32

✅

utf32_slovak_ci

utf32

✅

utf32_slovenian_ci

utf32

✅

utf32_spanish2_ci

utf32

✅

utf32_spanish_ci

utf32

✅

utf32_swedish_ci

utf32

✅

utf32_turkish_ci

utf32

✅

utf32_unicode_520_ci

utf32

✅

utf32_unicode_ci

utf32

✅

utf32_vietnamese_ci

utf32

✅

utf8mb3_bin

utf8mb3

✅

utf8mb3_croatian_ci

utf8mb3

✅

utf8mb3_czech_ci

utf8mb3

✅

utf8mb3_danish_ci

utf8mb3

✅

utf8mb3_esperanto_ci

utf8mb3

✅

utf8mb3_estonian_ci

utf8mb3

✅

utf8mb3_general_ci

utf8mb3

✅

utf8mb3_general_mysql500_ci

utf8mb3

✅

utf8mb3_german2_ci

utf8mb3

✅

utf8mb3_hungarian_ci

utf8mb3

✅

utf8mb3_icelandic_ci

utf8mb3

✅

utf8mb3_latvian_ci

utf8mb3

✅

utf8mb3_lithuanian_ci

utf8mb3

✅

utf8mb3_persian_ci

utf8mb3

✅

utf8mb3_polish_ci

utf8mb3

✅

utf8mb3_roman_ci

utf8mb3

✅

utf8mb3_romanian_ci

utf8mb3

✅

utf8mb3_sinhala_ci

utf8mb3

✅

utf8mb3_slovak_ci

utf8mb3

✅

utf8mb3_slovenian_ci

utf8mb3

✅

utf8mb3_spanish2_ci

utf8mb3

✅

utf8mb3_spanish_ci

utf8mb3

✅

utf8mb3_swedish_ci

utf8mb3

✅

utf8mb3_tolower_ci

utf8mb3

✅

utf8mb3_turkish_ci

utf8mb3

✅

utf8mb3_unicode_520_ci

utf8mb3

✅

utf8mb3_unicode_ci

utf8mb3

✅

utf8mb3_vietnamese_ci

utf8mb3

✅

utf8mb4_0900_ai_ci

utf8mb4

✅

utf8mb4_0900_as_ci

utf8mb4

✅

utf8mb4_0900_as_cs

utf8mb4

✅

utf8mb4_0900_bin

utf8mb4

✅

utf8mb4_bg_0900_ai_ci

utf8mb4

❌

utf8mb4_bg_0900_as_cs

utf8mb4

❌

utf8mb4_bin

utf8mb4

✅

utf8mb4_bs_0900_ai_ci

utf8mb4

❌

utf8mb4_bs_0900_as_cs

utf8mb4

❌

utf8mb4_croatian_ci

utf8mb4

✅

utf8mb4_cs_0900_ai_ci

utf8mb4

✅

utf8mb4_cs_0900_as_cs

utf8mb4

✅

utf8mb4_czech_ci

utf8mb4

✅

utf8mb4_da_0900_ai_ci

utf8mb4

✅

utf8mb4_da_0900_as_cs

utf8mb4

✅

utf8mb4_danish_ci

utf8mb4

✅

utf8mb4_de_pb_0900_ai_ci

utf8mb4

✅

utf8mb4_de_pb_0900_as_cs

utf8mb4

✅

utf8mb4_eo_0900_ai_ci

utf8mb4

✅

utf8mb4_eo_0900_as_cs

utf8mb4

✅

utf8mb4_es_0900_ai_ci

utf8mb4

✅

utf8mb4_es_0900_as_cs

utf8mb4

✅

utf8mb4_es_trad_0900_ai_ci

utf8mb4

✅

utf8mb4_es_trad_0900_as_cs

utf8mb4

✅

utf8mb4_esperanto_ci

utf8mb4

✅

utf8mb4_estonian_ci

utf8mb4

✅

utf8mb4_et_0900_ai_ci

utf8mb4

✅

utf8mb4_et_0900_as_cs

utf8mb4

✅

utf8mb4_general_ci

utf8mb4

✅

utf8mb4_german2_ci

utf8mb4

✅

utf8mb4_gl_0900_ai_ci

utf8mb4

❌

utf8mb4_gl_0900_as_cs

utf8mb4

❌

utf8mb4_hr_0900_ai_ci

utf8mb4

✅

utf8mb4_hr_0900_as_cs

utf8mb4

✅

utf8mb4_hu_0900_ai_ci

utf8mb4

✅

utf8mb4_hu_0900_as_cs

utf8mb4

✅

utf8mb4_hungarian_ci

utf8mb4

✅

utf8mb4_icelandic_ci

utf8mb4

✅

utf8mb4_is_0900_ai_ci

utf8mb4

✅

utf8mb4_is_0900_as_cs

utf8mb4

✅

utf8mb4_ja_0900_as_cs

utf8mb4

✅

utf8mb4_ja_0900_as_cs_ks

utf8mb4

✅

utf8mb4_la_0900_ai_ci

utf8mb4

✅

utf8mb4_la_0900_as_cs

utf8mb4

✅

utf8mb4_latvian_ci

utf8mb4

✅

utf8mb4_lithuanian_ci

utf8mb4

✅

utf8mb4_lt_0900_ai_ci

utf8mb4

✅

utf8mb4_lt_0900_as_cs

utf8mb4

✅

utf8mb4_lv_0900_ai_ci

utf8mb4

✅

utf8mb4_lv_0900_as_cs

utf8mb4

✅

utf8mb4_mn_cyrl_0900_ai_ci

utf8mb4

❌

utf8mb4_mn_cyrl_0900_as_cs

utf8mb4

❌

utf8mb4_nb_0900_ai_ci

utf8mb4

❌

utf8mb4_nb_0900_as_cs

utf8mb4

❌

utf8mb4_nn_0900_ai_ci

utf8mb4

❌

utf8mb4_nn_0900_as_cs

utf8mb4

❌

utf8mb4_persian_ci

utf8mb4

✅

utf8mb4_pl_0900_ai_ci

utf8mb4

✅

utf8mb4_pl_0900_as_cs

utf8mb4

✅

utf8mb4_polish_ci

utf8mb4

✅

utf8mb4_ro_0900_ai_ci

utf8mb4

✅

utf8mb4_ro_0900_as_cs

utf8mb4

✅

utf8mb4_roman_ci

utf8mb4

✅

utf8mb4_romanian_ci

utf8mb4

✅

utf8mb4_ru_0900_ai_ci

utf8mb4

✅

utf8mb4_ru_0900_as_cs

utf8mb4

✅

utf8mb4_sinhala_ci

utf8mb4

✅

utf8mb4_sk_0900_ai_ci

utf8mb4

✅

utf8mb4_sk_0900_as_cs

utf8mb4

✅

utf8mb4_sl_0900_ai_ci

utf8mb4

✅

utf8mb4_sl_0900_as_cs

utf8mb4

✅

utf8mb4_slovak_ci

utf8mb4

✅

utf8mb4_slovenian_ci

utf8mb4

✅

utf8mb4_spanish2_ci

utf8mb4

✅

utf8mb4_spanish_ci

utf8mb4

✅

utf8mb4_sr_latn_0900_ai_ci

utf8mb4

❌

utf8mb4_sr_latn_0900_as_cs

utf8mb4

❌

utf8mb4_sv_0900_ai_ci

utf8mb4

✅

utf8mb4_sv_0900_as_cs

utf8mb4

✅

utf8mb4_swedish_ci

utf8mb4

✅

utf8mb4_tr_0900_ai_ci

utf8mb4

✅

utf8mb4_tr_0900_as_cs

utf8mb4

✅

utf8mb4_turkish_ci

utf8mb4

✅

utf8mb4_unicode_520_ci

utf8mb4

✅

utf8mb4_unicode_ci

utf8mb4

✅

utf8mb4_vi_0900_ai_ci

utf8mb4

✅

utf8mb4_vi_0900_as_cs

utf8mb4

✅

utf8mb4_vietnamese_ci

utf8mb4

✅

utf8mb4_zh_0900_as_cs

utf8mb4

✅

file an issue