UTF-8, a transformation format of ISO 10646
RFC 3629, “UTF-8, a transformation format of ISO 10646”, is an Internet Standard document published in November 2003 by F. Yergeau. It obsoletes RFC 2279. The canonical text is published by the RFC Editor.
Abstract
ISO/IEC 10646-1 defines a large character set called the Universal Character Set (UCS) which encompasses most of the world's writing systems. The originally proposed encodings of the UCS, however, were not compatible with many current applications and protocols, and this has led to the development of UTF-8, the object of this memo. UTF-8 has the characteristic of preserving the full US-ASCII range, providing compatibility with file systems, parsers and other software that rely on US-ASCII values but are transparent to other values. This memo obsoletes and replaces RFC 2279.
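The ASCII-preservation property described in the abstract can be checked with a short Python sketch. The helper names `utf8_bytes` and `ascii_bytes_preserved` are hypothetical and chosen only for illustration; the sketch relies on Python's built-in `str.encode("utf-8")`, not on any code from the RFC.

```python
def utf8_bytes(text):
    """Return the UTF-8 encoding of text as a list of integer byte values."""
    return list(text.encode("utf-8"))


def ascii_bytes_preserved(text):
    """Check that the bytes below 0x80 in the UTF-8 form are exactly the
    US-ASCII characters of text, in the same order.

    Hypothetical helper for illustration: in UTF-8 every byte of a
    multi-byte sequence has its high bit set, so a byte below 0x80 can
    only ever stand for the US-ASCII character with that value.
    """
    ascii_chars = [ord(c) for c in text if ord(c) < 0x80]
    low_bytes = [b for b in text.encode("utf-8") if b < 0x80]
    return ascii_chars == low_bytes


if __name__ == "__main__":
    sample = "dir/r\u00e9sum\u00e9.txt"  # mixes ASCII with U+00E9
    print(utf8_bytes("A"))               # a single byte, same as US-ASCII
    print(utf8_bytes("\u00e9"))          # two bytes, both >= 0x80
    print(ascii_bytes_preserved(sample))
```

Software that scans for ASCII delimiters such as `/` or `.` can therefore process UTF-8 data byte by byte without mistaking part of a multi-byte character for a delimiter.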
What “Internet Standard” means
An Internet Standard is a mature, widely implemented specification that has completed the full IETF standards process. It is the highest maturity level on the standards track.
The canonical text of RFC 3629 is hosted at rfc-editor.org and is available in TXT and HTML.
- RFC 3628 Policy Requirements for Time-Stamping Authorities
- RFC 3630 Traffic Engineering Extensions to OSPF Version 2
- RFC 3627 Use of /127 Prefix Length Between Routers Considered Harmful
- RFC 3631 Security Mechanisms for the Internet
- RFC 3626 Optimized Link State Routing Protocol
- RFC 3632 VeriSign Registry Registrar Protocol Version 2.0.0
- RFC 3625 The QCP File Format and Media Types for Speech Data
- RFC 3633 IPv6 Prefix Options for Dynamic Host Configuration Protocol version 6