When using golang to write files, sometimes the file content will be garbled. This is because golang uses UTF-8 encoding to write files by default. If you use other encodings when writing files, garbled characters will occur.
So, how can we avoid the problem of garbled files written in golang? Here are some solutions:
If you want to avoid garbled characters, the easiest way is to use UTF-8 encoding. In golang, you can set the encoding when writing files through the following code:
file, err := os.Create("test.txt") if err != nil { log.Fatal(err) } defer file.Close() file.WriteString("") // 写入UTF-8的BOM字符 writer := bufio.NewWriter(file) encoder := unicode.UTF8.NewEncoder() writer = transform.NewWriter(writer, encoder) _, err = writer.WriteString("这是一段中文文本") if err != nil { log.Fatal(err) }
The BOM character is used in this code to indicate that the file uses UTF-8 encoding. Ensure that the written text is UTF-8 encoded by using the UTF8 encoder encoder and converter transform on the write stream.
In addition to using UTF-8 encoding, golang also supports many other encodings. You can use the encoding/pinyin package to convert text to a specific encoding, such as GBK or Big5.
The following is a sample code that uses GBK encoding to write text to a file:
file, err := os.Create("test.txt") if err != nil { log.Fatal(err) } defer file.Close() writer := bufio.NewWriter(file) encoder := simplifiedchinese.GBK.NewEncoder() writer = transform.NewWriter(writer, encoder) _, err = writer.WriteString("这是一段中文文本") if err != nil { log.Fatal(err) }
This code uses simplifiedchinese.GBK.NewEncoder() to encode the written text. You can choose other encoding formats as needed.
In addition to using golang’s standard library to write files, you can also use third-party libraries, such as github.com/axgle/mahonia. Solve the problem of garbled characters.
The following is a sample code that uses the mahonia library to write text to a file:
import ( "github.com/axgle/mahonia" ) file, err := os.Create("test.txt") if err != nil { log.Fatal(err) } defer file.Close() writer := bufio.NewWriter(file) enc := mahonia.NewEncoder("gbk") writer.WriteString(enc.ConvertString("这是一段中文文本")) writer.Flush()
This code uses mahonia.NewEncoder("gbk") to encode the written text.
Summary
The best way to avoid garbled characters when writing files in golang is to specify the correct encoding format when writing code. UTF-8 encoding is used by default. If you want to use other encoding formats, you can use an encoder or a third-party library to do so.
The above is the detailed content of Golang writes garbled files. For more information, please follow other related articles on the PHP Chinese website!