Home  >  Article  >  Database  >  Introduction to two persistence defects in redis

Introduction to two persistence defects in redis

尚
forward
2020-05-19 09:11:511683browse

Introduction to two persistence defects in redis

1. RDB persistence mode defects

1. Problem description:

Concurrent 200 channels, simulating continuous writing to Redis, after 4 hours, A large number of interface calls began to fail, and the error message was as follows:

{"data":{"sendResult":null},"base":{"returncode":"99999","returndesc":"系统异常:MISCONF Redis is configured to save RDB snapshots, but is currently not able to persist on disk. Commands that may modify the data set are disabled. Please check Redis logs for details about the error."},"qrybase":{"total":0,"count":0,"start":0}}

2. Cause analysis:

Interpreted the error message and thought it was caused by insufficient disk space. It was found that 42% of the disk was left, as follows Display:

Introduction to two persistence defects in redis

So I opened the Redis log according to the error message prompts and continued the stress test. The interface still reported an error, but

can be obtained from the Redis log information. Can't save in background: fork: Cannot allocate memory

It is related to the improper use of memory by the process. Check the memory occupied by the Redis main process as follows: occupying nearly 55%*4G memory

Introduction to two persistence defects in redis

Specific reason: In order to avoid the main process suspended animation when saving data to the hard disk, Redis needs to Fork a copy of the main process, and then complete the operation of saving the data to the hard disk in the Fork process. If the main process uses 2.2GB Of memory, an additional 2.2GB is needed when Fork child process. At this time, the memory is not enough, Fork fails, and the data saving hard disk also fails.

3. Mitigation plan (cannot fundamentally solve the problem):

3.1 Modify the configuration item stop-writes-on-bgsave-error no in the redis.conf file (the default value is yes), that is When an error occurs in the bgsave snapshot operation, stop writing data to the disk. In this way, any subsequent write errors will fail. In order not to affect subsequent write operations, you need to change this value to no

3.2 Modify the kernel parameters (3 below) method), but requires root permissions:

(1)	编辑/etc/sysctl.conf ,改vm.overcommit_memory=1,然后sysctl -p 使配置文件生效
(2)sysctl vm.overcommit_memory=1
(3)echo 1 > /proc/sys/vm/overcommit_memory

2. AOF persistence mode defects

1. Description of problem 1:

Both Redis master and slave nodes enable AOF mode. There were 200 concurrent connections, and the simulation continued to write to Redis. After 15 minutes, a large number of interface calls began to fail, and the Linux virtual server where Redis was located hung.

The interface error is as follows:

{"data":null,"base":{"returndesc":"系统异常","returncode":"999999"},"qrybase":null}
Biz(dubbo)接口报错如下:
2015-06-05 11:28:28.760 [DubboServerHandler-X.X.X.X:20882-thread-173] ERROR  - error while validate jedis!
redis.clients.jedis.exceptions.JedisConnectionException: java.net.SocketTimeoutException: Read timed out

Cause analysis:

From the dubbo interface error message, it is caused by the interface API operation Redis timeout. Judging from the system logs and IO monitoring, it is shown that the above problems are caused by IO bottlenecks (system IO is too busy), as shown below:

Introduction to two persistence defects in redis

Introduction to two persistence defects in redis

It can also be seen from the system log that the IO blocking time exceeds 120 seconds, and the machine hangs due to the system security mechanism.

Summary
The test results prove that the AOF mode has the most obvious flaw, that is, IO will become a performance bottleneck when access pressure is high, resulting in service unavailability.

3. Mitigation plan (cannot fundamentally solve the problem)

Edit /etc/sysctl.conf and add the following configuration:

vm.dirty_background_ratio = 5
vm.dirty_ratio = 10

Then sysctl -p makes the configuration file take effect.

Description of Problem 2:

Whether AOF mode or RDB (snapshot mode) is used, when the size of the two files (.aof or .rdb) exceeds 80% of the system memory, the Redis process will be killed by the system down, causing the service to become unavailable.

Summary

The above problems indicate that we need to plan the system memory capacity in advance when using Redis, because once Redis crashes, a large amount of data will be lost and it is unrecoverable.

For more redis knowledge, please pay attention to the redis introductory tutorial column.

The above is the detailed content of Introduction to two persistence defects in redis. For more information, please follow other related articles on the PHP Chinese website!

Statement:
This article is reproduced at:csdn.net. If there is any infringement, please contact admin@php.cn delete