Base2048 Encoder And Decoder
Encode UTF-8 text as Base2048-style private-use Unicode text, or decode this tool's output back into readable text.
Convert plain text to a compact Base2048-style Unicode string, or decode compatible text produced by this tool back to readable UTF-8 text. Base2048 is a high-radix binary-to-text idea where each output character can represent one of 2,048 values. That lets Unicode text carry more data per character than small ASCII alphabets such as Base32 or Base64. This tool uses a private-use Base2048-style alphabet for UTF-8 text and includes a private-use length marker so decoded text can be recovered exactly, including inputs whose byte length does not align cleanly to 11-bit symbols. Exact compatibility matters. Base2048 implementations can use different alphabets, final-symbol rules, and Unicode safety assumptions. Treat this page's output as compatible with this page unless another tool documents the same alphabet and length-marker behavior. Base2048-style text is an encoding, not encryption, compression, signing, authentication, or privacy protection. Anyone with the same format can decode it, and changing one output character can corrupt the decoded text. The encoded output uses private-use Unicode characters, not printable ASCII. Fonts, copy and paste, Unicode normalization, JSON escaping, HTML, Markdown, databases, command lines, logs, messaging apps, source editors, and older systems may transform, reject, escape, wrap, or display those characters differently. Private-use characters are intentionally undefined by Unicode. They can be valid text while still having no shared visual meaning across fonts, platforms, or applications. If the target workflow needs URLs, shell-safe text, source-code literals, email-safe text, or ASCII-only protocols, use Base64, Base64url, Base32, Base58, or another encoding designed for that transport.What is Base2048?
Unicode and transport cautions