Изменения

Перейти к: навигация, поиск

Представление символов, таблицы кодировок

903 байта убрано, 00:44, 25 октября 2011
Нет описания правки
| <code></code>
|-
| UTF-16 ([[Big Endian|BE]])
| <code>FE FF</code>
| <code>254 255</code>
| <code>þÿ</code>
|-
| UTF-16 ([[Little Endian|LE]])
| <code>FF FE</code>
| <code>255 254</code>
|-
| UTF-7
| <code>2B 2F 76 38<br>2B 2F 76 39<br>2B 2F 76 2B<br>2B 2F 76 2F</code><ref group="t">In UTF-7, the fourth byte of the BOM, before encoding as [[base64]], is <code>001111xx</code> in binary, and <code>xx</code> depends on the next character (the first character after the BOM). Hence, technically, the fourth byte is not purely a part of the BOM, but also contains information about the next (non-BOM) character. For <code>xx=00</code>, <code>01</code>, <code>10</code>, <code>11</code>, this byte is, respectively, <code>38</code>, <code>39</code>, <code>2B</code>, or <code>2F</code> when encoded as base64. If no following character is encoded, <code>38</code> is used for the fourth byte and the following byte is <code>2D</code>.</ref>
| <code>43 47 118 56<br>43 47 118 57<br>43 47 118 43<br>43 47 118 47</code>
| <code>+/v8<br>+/v9<br>+/v+<br>+/v/</code>
|-
| UTF-1<ref name=b group=t/>
| <code>F7 64 4C</code>
| <code>247 100 76</code>
| <code>÷dL</code>
|-
| UTF-EBCDIC<ref name=b group=t/>
| <code>DD 73 66 73</code>
| <code>221 115 102 115</code>
| <code>Ýsfs</code>
|-
| Standard Compression Scheme for Unicode|SCSU<ref name=b group=t/>| <code>0E FE FF</code>{{#tag:ref|SCSU allows other encodings of U+FEFF, the shown form is the signature recommended in UTR #6.</ref>|group=t}}
| <code>14 254 255</code>
| <code>□þÿ</code> (□ ASCII "shift out" символ)
|-
| BOCU-1<ref name=b group=t/>
| <code>FB EE 28
| <code>251 238 40</code>
| <code>ûî</code>
|-
| GB-18030<ref name=b group=t/>
| <code>84 31 95 33</code>
| <code>132 49 149 51</code>
55
правок

Навигация