The Russian windows-1251 encoding is similar to that of gb2313 and gb18030, but is only used in this region.
The difference is that Windows-1251 uses single-byte encoding in Russian, while the latter uses double-byte encoding in Chinese.
The KOI8-R is a KOI-8 series of 8-bit monogram characters, used in Russian and Portuguese. Before Unicode became popular, KOI8-R was the most widely used Russian Code, with a usage even higher than ISO 8859-5. Andrey Chernov after finishing the KOI8-R, submitted to the Internet Association (ISOC), into the RFC 1489 file ("Registration of a Cyrillic character set ").
The advantage of using such encoding is space saving, and the bad thing is that it is not common and incompatible with modern systems that use Unicode or UTF-8 encoding.
For this reason, human conversion is required. The following figure shows the corresponding code table ,,,
Http://msdn.microsoft.com/en-us/goglobal/cc305144.aspx
Windows 1251
This table is provided to help developers move their applications to Unicode. using Unicode is recommended in preference to any code page because it has better language support and is less ambiguous than any of the code pages.
For guidance about what Unicode is and how to migrate your applications to use it, please see theglobalization step-by-step: Unicode enabled article.
00 = u + 0000: NULL
01 = u + 0001: Start of heading
02 = u + 0002: Start of Text
03 = u + 0003: End of Text
04 = u + 0004: End of transmission
05 = u + 0005: Enquiry
06 = u + 0006: Acknowledge
07 = u+ 0007: bell
08 = u + 0008: backspace
09 = u+ 0009: horizontal tabulation
0a = u + 000a: line feed
0b = u + 000b: vertical tabulation
0C = u + 000c: Form feed
0d = u + 000d: Carriage Return
0e = u + 000e: Shift out
0f = u + 000f: Shift in
10 = u + 0010: Data Link escape
11 = u + 0011: Device Control one
12 = u + 0012: Device Control Two
13 = u + 0013: Device Control Three
14 = u + 0014: Device Control four
15 = u + 0015: Negative acknowledge
16 = u + 0016: Synchronous idle
17 = u + 0017: End of transmission Block
18 = u + 0018: Cancel
19 = u + 0019: End of Medium
1A = u + 001a: Substitute
1B = u + 001b: Escape
1c = u + 001c: file Separator
1D = u + 001d: group Separator
1e = u + 001e: Record Separator
1f = u + 001f: Unit Separator
20 = u+ 0020: Space
21 = u + 0021: Exclamation mark
22 = u+ 0022: Quotation Mark
23 = u+ 0023: number sign
24 = u + 0024: Dollar sign
25 = u + 0025: percent sign
26 = u+ 0026: ampersand
27 = u + 0027: apostrophe
28 = u+ 0028: left parenthesis
29 = u+ 0029: Right parenthesis
2a = u + 002a: Asterisk
2b = u + 002b: plus sign
2c = u + 002c: comma
2d = u + 002d: hyphen-minus
2e = u + 002e: Full Stop
2f = u + 002f: solidus
30 = u + 0030: digit zero
31 = u + 0031: digit one
32 = u + 0032: digit two
33 = u + 0033: digit three
34 = u + 0034: digit four
35 = u + 0035: digit five
36 = u + 0036: digit six
37 = u + 0037: digit seven
38 = u + 0038: digit eight
39 = u + 0039: digit nine
3A = u + 003a: Colon
3b = u + 003b: semicolon
3C = u + 003c: Less-than sign
3D = u + 003d: equals sign
3E = u + 003e: greater-than sign
3f = u + 003f: Question mark
40 = u + 0040: commercial
41 = u + 0041: Latin capital letter
42 = u + 0042: Latin capital letter B
43 = u + 0043: Latin capital letter C
44 = u + 0044: Latin capital letter d
45 = u + 0045: Latin capital letter e
46 = u + 0046: Latin capital letter F
47 = u + 0047: Latin capital letter g
48 = u + 0048: Latin capital letter H
49 = u + 0049: Latin capital letter I
4A = u + 004a: Latin capital letter J
4B = u + 004b: Latin capital letter K
4c = u + 004c: Latin capital letter L
4d = u + 004d: Latin capital letter M
4e E = u + 004e: Latin capital letter N
4f = u + 004f: Latin capital letter O
50 = u + 0050: Latin capital letter P
51 = u + 0051: Latin capital letter q
52 = u + 0052: Latin capital letter R
53 = u + 0053: Latin capital letter S
54 = u + 0054: Latin capital letter T
55 = u + 0055: Latin capital letter U
56 = u + 0056: Latin capital letter v
57 = u + 0057: Latin capital letter W
58 = u + 0058: Latin capital letter X
59 = u + 0059: Latin capital letter y
5A = u + 005a: Latin capital letter Z
5B = u + 005b: left square bracket
5C = u + 005c: reverse solidus
5D = u + 005d: right square bracket
5E = u + 005e: circumflex accent
5f = u + 005f: low line
60 = u + 0060: grave accent
61 = u + 0061: Latin small letter
62 = u + 0062: Latin small letter B
63 = u + 0063: Latin small letter C
64 = u + 0064: Latin small letter d
65 = u + 0065: Latin small letter e
66 = u + 0066: Latin small letter F
67 = u + 0067: Latin small letter g
68 = u + 0068: Latin small letter H
69 = u + 0069: Latin small letter I
6a = u + 006a: Latin small letter J
6b = u + 006b: Latin small letter K
6c = u + 006c: Latin small letter L
6D = u + 006d: Latin small letter M
6e = u + 006e: Latin small letter N
6f = u + 006f: Latin small letter O
70 = u + 0070: Latin small letter P
71 = u + 0071: Latin small letter q
72 = u + 0072: Latin small letter R
73 = u + 0073: Latin small letter S
74 = u + 0074: Latin small letter T
75 = u + 0075: Latin small letter U
76 = u + 0076: Latin small letter v
77 = u + 0077: Latin small letter W
78 = u + 0078: Latin small letter X
79 = u + 0079: Latin small letter y
7A = u + 007a: Latin small letter Z
7b = u + 007b: left curly Bracket
7c = u + 007c: vertical line
7D = u + 007d: Right curly Bracket
7E = u + 007e: Tilde
7f = u + 007f: Delete
80 = u + 0402: Cyrillic capital letter dje
81 = u + 0403: Cyrillic capital letter gje
82 = u + 201a: Single Low-9 quotation mark
83 = u + 0453: Cyrillic small letter gje
84 = u + 201e: double low-9 quotation mark
85 = u + 2026: horizontal ellipsis
86 = u+ 2020: dagger
87 = u+ 2021: Double dagger
88 = u + 20ac: euro sign
89 = u + 2030: per mille sign
8A = u + 0409: Cyrillic capital letter lje
8b = u + 2039: single left-pointing angle quotation mark
8C = u + 040a: Cyrillic capital letter nje
8d = u + 040c: Cyrillic capital letter kje
8e = u + 040b: Cyrillic capital letter TShe
8f = u + 040f: Cyrillic capital letter dzhe
90 = u + 0452: Cyrillic small letter dje
91 = u + 2018: Left single quotation mark
92 = u + 2019: Right single quotation mark
93 = u + 201c: Left Double quotation mark
94 = u + 201d: Right double quotation mark
95 = u + 2022: bullet
96 = u + 2013: en dash
97 = u + 2014: em dash
99 = u + 2122: Trade Mark sign
9A = u + 0459: Cyrillic small letter lje
9b = u + 203a: single right-pointing angle quotation mark
9C = u + 045a: Cyrillic small letter nje
9d = u + 045c: Cyrillic small letter kje
9e = u + 045b: Cyrillic small letter TShe
9f = u + 045f: Cyrillic small letter dzhe
A0 = u + 00a0: No-break Space
A1 = u + 040e: Cyrillic capital letter short u
A2 = u + 045e: Cyrillic small letter short u
A3 = u + 0408: Cyrillic capital letter je
A4 = u + 00a4: currency sign
A5 = u + 0490: Cyrillic capital letter ghe with upturn
A6 = u + 00a6: Broken bar
A7 = u + 00a7: Section sign
A8 = u + 0401: Cyrillic capital letter Io
A9 = u + 00a9: Copyright sign
AA = u + 0404: Cyrillic capital letter Ukrainian IE
AB = u + 00ab: Left-pointing double angle quotation mark
AC = u + 00ac: not sign
AD = u + 00ad: Soft hyphen
AE = u + 00ae: registered sign
AF = u + 0407: Cyrillic capital letter Yi
B0 = u + 00b0: Degree sign
B1 = u + 00b1: plus-minus sign
B2 = u + 0406: Cyrillic capital letter BYELORUSSIAN-UKRAINIAN I
B3 = u + 0456: Cyrillic small letter BYELORUSSIAN-UKRAINIAN I
B4 = u + 0491: Cyrillic small letter ghe with upturn
B5 = u + 00b5: Micro sign
B6 = u + 00b6: pilcrow sign
B7 = u + 00b7: middle dot
B8 = u + 0451: Cyrillic small letter Io
B9 = u + 2116: numero sign
BA = u + 0454: Cyrillic small letter Ukrainian IE
BB = u + 00bb: Right-pointing double angle quotation mark
BC = u + 0458: Cyrillic small letter je
BD = u + 0405: Cyrillic capital letter dze
Be = u + 0455: Cyrillic small letter dze
BF = u + 0457: Cyrillic small letter Yi
C0 = u + 0410: Cyrillic capital letter
C1 = u + 0411: Cyrillic capital letter be
C2 = u + 0412: Cyrillic capital letter ve
C3 = u + 0413: Cyrillic capital letter ghe
C4 = u + 0414: Cyrillic capital letter de
C5 = u + 0415: Cyrillic capital letter IE
C6 = u + 0416: Cyrillic capital letter Zhe
C7 = u + 0417: Cyrillic capital letter ze
C8 = u + 0418: Cyrillic capital letter I
C9 = u + 0419: Cyrillic capital letter short I
CA = u + 041a: Cyrillic capital letter Ka
CB = u + 041b: Cyrillic capital letter el
Cc = u + 041c: Cyrillic capital letter em
Cd = u + 041d: Cyrillic capital letter en
Ce = u + 041e: Cyrillic capital letter O
Cf = u + 041f: Cyrillic capital letter PE
D0 = u + 0420: Cyrillic capital letter er
D1 = u + 0421: Cyrillic capital letter es
D2 = u + 0422: Cyrillic capital letter te
D3 = u + 0423: Cyrillic capital letter U
D4 = u + 0424: Cyrillic capital letter ef
D5 = u + 0425: Cyrillic capital letter ha
D6 = u + 0426: Cyrillic capital letter Tse
D7 = u + 0427: Cyrillic capital letter Che
D8 = u + 0428: Cyrillic capital letter Sha
D9 = u + 0429: Cyrillic capital letter shcha
DA = u + 042a: Cyrillic capital letter hard sign
DB = u + 042b: Cyrillic capital letter yeru
Dc = u + 042c: Cyrillic capital letter soft sign
Dd = u + 042d: Cyrillic capital letter e
De = u + 042e: Cyrillic capital letter Yu
DF = u + 042f: Cyrillic capital letter ya
E0 = u + 0430: Cyrillic small letter
E1 = u + 0431: Cyrillic small letter be
E2 = u + 0432: Cyrillic small letter ve
E3 = u + 0433: Cyrillic small letter ghe
E4 = u + 0434: Cyrillic small letter de
E5 = u + 0435: Cyrillic small letter IE
E6 = u + 0436: Cyrillic small letter Zhe
E7 = u + 0437: Cyrillic small letter ze
E8 = u + 0438: Cyrillic small letter I
E9 = u + 0439: Cyrillic small letter short I
Ea = u + 043a: Cyrillic small letter Ka
EB = u + 043b: Cyrillic small letter el
EC = u + 043c: Cyrillic small letter em
Ed = u + 043d: Cyrillic small letter en
EE = u + 043e: Cyrillic small letter O
EF = u + 043f: Cyrillic small letter PE
F0 = u + 0440: Cyrillic small letter er
F1 = u + 0441: Cyrillic small letter es
F2 = u + 0442: Cyrillic small letter te
F3 = u + 0443: Cyrillic small letter U
F4 = u + 0444: Cyrillic small letter ef
F5 = u + 0445: Cyrillic small letter ha
F6 = u + 0446: Cyrillic small letter Tse
F7 = u + 0447: Cyrillic small letter Che
F8 = u + 0448: Cyrillic small letter Sha
F9 = u + 0449: Cyrillic small letter shcha
Fa = u + 044a: Cyrillic small letter hard sign
Fb = u + 044b: Cyrillic small letter yeru
Fc = u + 044c: Cyrillic small letter soft sign
FD = u + 044d: Cyrillic small letter e
Fe = u + 044e: Cyrillic small letter Yu
FF = u + 044f: Cyrillic small letter ya