HTML Basic Tutorial Computer Coding (Character Set)

Computer encoding (character set) - understand

Why there is a character set, because computers can only process binary data. In order for the computer to recognize human language (0-9, a-z, A-Z, special symbols), we need to "encode" each character. The so-called "encoding" means: each character can be represented by a different binary system.

Assumption: A uses binary to represent 1000, B uses binary to represent 1001

ASCII encoding: use 1 byte (8-bit binary) to represent all characters, a total of 2^8 = 256.

ANSI encoding: Other countries have extended the ASCII encoding to display their own language.

ANSI under the Chinese operating system, represents gb2312
ANSI under the traditional operating system, represents big5
ANSI under the Japanese operating system represents JIS
......
uses 2 bytes (16-bit binary) ( To represent, a total of 2^16 = 65536 characters can be represented.
##GB2312 contains a total of 6763 Chinese characters

## Its disadvantages: The encoding table file is too large and inconvenient. Use. Use 32-bit binary to represent a character, causing a huge waste of space.