集群简介

数据分布规则

Redis Cluster采用哈希分区规则将数据分布到不同的节点,键空间被分割为 16384 槽(slot),事实上集群的最大节点数量是 16384 个。(然而建议最大节点数量设置在1000这个数量级上)
所有的主节点都负责 16384 个哈希槽中的一部分。当集群处于稳定状态时,集群中没有在执行重配置(reconfiguration)操作,每个哈希槽都只由一个节点进行处理(不过主节点可以有一个或多个从节点,可以在网络断线或节点失效时替换掉主节点)。
以下是用来把键映射到哈希槽的算法:
HASH_SLOT = CRC16(key) mod 16384
CRC16的16位输出中的14位会被使用(这也是为什么上面的式子中有一个对 16384 取余的操作)。 在我们的测试中,CRC16能相当好地把不同的键均匀地分配到 16384 个槽中。

集群限制

Redis 集群实现了所有在非分布式 Redis 版本中出现的处理单一键值(key)的命令。那些使用多个键值的复杂操作, 比如 set 里的并集(unions)和交集(intersections)操作,就没有实现。通常来说,那些处理命令的节点获取不到键值的所有操作都不会被实现。
Redis 集群不像单机版本的 Redis 那样支持多个数据库,集群只有数据库 0,而且也不支持 SELECT 命令。

1.环境

1.1 3台centos 7 服务器 ip地址如下

10.0.0.2
10.0.0.3
10.0.0.4

1.2 docker 版本

20.10.8
1.3 docker-compose版本
docker-compose -v
docker-compose version 1.29.2, build 5becea4c

1.4 每个节点目录结构

10.0.0.2

10.0.0.3

10.0.0.4

2.搭建集群

2.1 前提

docker 和docker-compose安装配置好

2.2 集群规划

3个节点 每个节点上跑2个redis容器

2.2 配置redis

集群最小配置如下

port 7000
cluster-enabled yes
cluster-config-file nodes.conf
cluster-node-timeout 5000
appendonly yes

以10.0.0.2为例,redis-6379/redis.conf和redis-6380/redis.conf配置文件内容如下,将这两个配置文件复制到10.0.0.3和10.0.0.4对应的目录,并将ip地址改为本节点对应的ip即可。

[root@ceph-01 opt]# egrep -v "^#|^$" /home/zshield/conf/redis-6379/redis.conf 
bind 10.0.0.2
protected-mode yes
port 6379
tcp-backlog 511
timeout 0
tcp-keepalive 300
daemonize no
supervised no
pidfile /var/run/redis_6379.pid
loglevel notice
logfile "/logs/redis.log"
databases 16
always-show-logo yes
save 900 1
save 300 10
save 60 10000
stop-writes-on-bgsave-error yes
rdbcompression yes
rdbchecksum yes
dbfilename dump.rdb
rdb-del-sync-files no
dir /data/
masterauth 123456
replica-serve-stale-data yes
replica-read-only yes
repl-diskless-sync no
repl-diskless-sync-delay 5
repl-diskless-load disabled
repl-disable-tcp-nodelay no
replica-priority 100
acllog-max-len 128
requirepass 123456
lazyfree-lazy-eviction no
lazyfree-lazy-expire no
lazyfree-lazy-server-del no
replica-lazy-flush no
lazyfree-lazy-user-del no
oom-score-adj no
oom-score-adj-values 0 200 800
appendonly yes
appendfilename "appendonly.aof"
appendfsync everysec
no-appendfsync-on-rewrite no
auto-aof-rewrite-percentage 100
auto-aof-rewrite-min-size 64mb
aof-load-truncated yes
aof-use-rdb-preamble yes
lua-time-limit 5000
cluster-enabled yes
cluster-config-file nodes-6379.conf
cluster-node-timeout 15000
cluster-announce-port 6379
cluster-announce-bus-port 16379
slowlog-log-slower-than 10000
slowlog-max-len 128
latency-monitor-threshold 0
notify-keyspace-events ""
hash-max-ziplist-entries 512
hash-max-ziplist-value 64
list-max-ziplist-size -2
list-compress-depth 0
set-max-intset-entries 512
zset-max-ziplist-entries 128
zset-max-ziplist-value 64
hll-sparse-max-bytes 3000
stream-node-max-bytes 4096
stream-node-max-entries 100
activerehashing yes
client-output-buffer-limit normal 0 0 0
client-output-buffer-limit replica 256mb 64mb 60
client-output-buffer-limit pubsub 32mb 8mb 60
hz 10
dynamic-hz yes
aof-rewrite-incremental-fsync yes
rdb-save-incremental-fsync yes
jemalloc-bg-thread yes
[root@ceph-01 zshield]# egrep -v "^$|^#" /home/zshield/conf/redis-6380/redis.conf 
bind 10.0.0.2
protected-mode yes
port 6380
tcp-backlog 511
timeout 0
tcp-keepalive 300
daemonize no
supervised no
pidfile /var/run/redis_6380.pid
loglevel notice
databases 16
always-show-logo yes
save 900 1  #900秒内至少有1个key被更改就执行快照
save 300 10  #300秒内至少有10个key被更改就执行快照
save 60 10000  #60秒内至少有10000个key被更改就执行快照
stop-writes-on-bgsave-error yes  #拍摄快照失败是否继续执行写命令
rdbcompression yes #是否对快照文件进行压缩
rdbchecksum yes #是否进行数据校验
dbfilename dump.rdb #快照文件存储的名称
rdb-del-sync-files no
dir /data/ #数据(包括快照)文件存储的位置
masterauth 123456  #如果redis设置了密码,这个参数必须配置,否则从节点无法提升为主节点
replica-serve-stale-data yes
replica-read-only yes
repl-diskless-sync no
repl-diskless-sync-delay 5
repl-diskless-load disabled
repl-disable-tcp-nodelay no
replica-priority 100
acllog-max-len 128
requirepass 123456
lazyfree-lazy-eviction no
lazyfree-lazy-expire no
lazyfree-lazy-server-del no
replica-lazy-flush no
lazyfree-lazy-user-del no
oom-score-adj no
oom-score-adj-values 0 200 800
appendonly yes
appendfilename "appendonly.aof"
appendfsync everysec
no-appendfsync-on-rewrite no
auto-aof-rewrite-percentage 100
auto-aof-rewrite-min-size 64mb
aof-load-truncated yes
aof-use-rdb-preamble yes
lua-time-limit 5000
cluster-enabled yes
cluster-config-file nodes-6380.conf
cluster-node-timeout 15000
cluster-announce-port 6380
cluster-announce-bus-port 16380
slowlog-log-slower-than 10000
slowlog-max-len 128
latency-monitor-threshold 0
notify-keyspace-events ""
hash-max-ziplist-entries 512
hash-max-ziplist-value 64
list-max-ziplist-size -2
list-compress-depth 0
set-max-intset-entries 512
zset-max-ziplist-entries 128
zset-max-ziplist-value 64
hll-sparse-max-bytes 3000
stream-node-max-bytes 4096
stream-node-max-entries 100
activerehashing yes
client-output-buffer-limit normal 0 0 0
client-output-buffer-limit replica 256mb 64mb 60
client-output-buffer-limit pubsub 32mb 8mb 60
hz 10
dynamic-hz yes
aof-rewrite-incremental-fsync yes
rdb-save-incremental-fsync yes
jemalloc-bg-thread yes

2.3 配置docker-compose

以10.0.0.2为例,redis-6379.yml和redis-6380.yml文件内容如下,将这两个文件复制到10.0.0.3和10.0.0.4对应目录即可。

[root@ceph-01 zshield]# cat /home/zshield/docker/compose/redis-6379.yml 
version: '3'
services:
redis-6379:
container_name: redis-6379
restart: always
image: redis:6.0.15
network_mode: host
volumes:
- "/home/zshield/conf/redis-6379/redis.conf:/usr/local/etc/redis/redis.conf"
- "/home/zshield/BIGdata/redis_data/6379/:/data"
- /etc/localtime:/etc/localtime
command: redis-server /usr/local/etc/redis/redis.conf
[root@ceph-01 zshield]# cat /home/zshield/docker/compose/redis-6380.yml 
version: "3"
services:
redis-6380:
container_name: redis-6380
restart: always
image: redis:6.0.15
network_mode: host
volumes:
- "/home/zshield/conf/redis-6380/redis.conf:/usr/local/etc/redis/redis.conf"
- "/home/zshield/BIGdata/redis_data/6380/:/data"
- /etc/localtime:/etc/localtime
command: redis-server /usr/local/etc/redis/redis.conf

2.4 启动redis容器

在每个节点执行如下命令,并确保容器正常启动。

docker-compose -f /home/zshield/docker/compose/redis-6379.yml up -d

docker-compose -f /home/zshield/docker/compose/redis-6380.yml up -d

2.5 配置redis集群

在任意一个节点的任意一个redis容器内执行如下命令

redis-cli --cluster create --cluster-replicas 1 10.0.0.2:6379 10.0.0.2:6380 10.0.0.3:6379 10.0.0.4:6379 10.0.0.3:6380 10.0.0.4:6380 -a 123456

这里在10.0.0.2的redis-6380 容器里执行命令

docker exec -it redis-6380 redis-cli --cluster create --cluster-replicas 1 10.0.0.2:6379 10.0.0.2:6380 10.0.0.3:6379 10.0.0.4:6379 10.0.0.3:6380 10.0.0.4:6380 -a 123456
Warning: Using a password with '-a' or '-u' option on the command line interface may not be safe.
>>> Performing hash slots allocation on 6 nodes...
Master[0] -> Slots 0 - 5460
Master[1] -> Slots 5461 - 10922
Master[2] -> Slots 10923 - 16383
Adding replica 10.0.0.3:6380 to 10.0.0.2:6379
Adding replica 10.0.0.4:6380 to 10.0.0.3:6379
Adding replica 10.0.0.2:6380 to 10.0.0.4:6379
M: 14718dd3a915378e841c03f1a697773f60411123 10.0.0.2:6379
slots:[0-5460] (5461 slots) master
S: 36eb505404a82e67e98d961c6398dfa558a37999 10.0.0.2:6380
replicates 1e23f1f7c57f921b07de4feab4bdabef8c2d63cc
M: 59c99e409c53356ac29c8a5479c322f2de1f43d7 10.0.0.3:6379
slots:[5461-10922] (5462 slots) master
M: 1e23f1f7c57f921b07de4feab4bdabef8c2d63cc 10.0.0.4:6379
slots:[10923-16383] (5461 slots) master
S: 05f7c63ba8bd535213e4ac14b2e3e7d74d718c94 10.0.0.3:6380
replicates 14718dd3a915378e841c03f1a697773f60411123
S: 390c83dadd16f54b35dc9eb5b8cee1dda28c9832 10.0.0.4:6380
replicates 59c99e409c53356ac29c8a5479c322f2de1f43d7
Can I set the above configuration? (type 'yes' to accept): yes
>>> Nodes configuration updated
>>> Assign a different config epoch to each node
>>> Sending CLUSTER MEET messages to join the cluster
Waiting for the cluster to join
.
>>> Performing Cluster Check (using node 10.0.0.2:6379)
M: 14718dd3a915378e841c03f1a697773f60411123 10.0.0.2:6379
slots:[0-5460] (5461 slots) master
1 additional replica(s)
S: 390c83dadd16f54b35dc9eb5b8cee1dda28c9832 10.0.0.4:6380
slots: (0 slots) slave
replicates 59c99e409c53356ac29c8a5479c322f2de1f43d7
M: 59c99e409c53356ac29c8a5479c322f2de1f43d7 10.0.0.3:6379
slots:[5461-10922] (5462 slots) master
1 additional replica(s)
M: 1e23f1f7c57f921b07de4feab4bdabef8c2d63cc 10.0.0.4:6379
slots:[10923-16383] (5461 slots) master
1 additional replica(s)
S: 36eb505404a82e67e98d961c6398dfa558a37999 10.0.0.2:6380
slots: (0 slots) slave
replicates 1e23f1f7c57f921b07de4feab4bdabef8c2d63cc
S: 05f7c63ba8bd535213e4ac14b2e3e7d74d718c94 10.0.0.3:6380
slots: (0 slots) slave
replicates 14718dd3a915378e841c03f1a697773f60411123
[OK] All nodes agree about slots configuration.
>>> Check for open slots...
>>> Check slots coverage...
[OK] All 16384 slots covered.

从输出可以看到,redis会自动将每个master的slave分配到不同的主机上,这样就确保了,3节点的集群就可以容忍一台主机宕机。

Adding replica 10.0.0.3:6380 to 10.0.0.2:6379
Adding replica 10.0.0.4:6380 to 10.0.0.3:6379
Adding replica 10.0.0.2:6380 to 10.0.0.4:6379

2.6 查看集群状态

[root@ceph-01 opt]# docker exec -it redis-6379 bash
root@ceph-01:/data# redis-cli -a 123456 -c -h 10.0.0.2 
Warning: Using a password with '-a' or '-u' option on the command line interface may not be safe.
10.0.0.2:6379> cluster info
cluster_state:ok
cluster_slots_assigned:16384
cluster_slots_ok:16384
cluster_slots_pfail:0
cluster_slots_fail:0
cluster_known_nodes:6
cluster_size:3
cluster_current_epoch:6
cluster_my_epoch:1
cluster_stats_messages_ping_sent:65
cluster_stats_messages_pong_sent:66
cluster_stats_messages_sent:131
cluster_stats_messages_ping_received:61
cluster_stats_messages_pong_received:65
cluster_stats_messages_meet_received:5
cluster_stats_messages_received:131
10.0.0.2:6379> cluster nodes
343fafe7458d0d52f2394906bab1a1b9fc6c981f 10.0.0.4:6379@16379 master - 0 1632728187539 5 connected 10923-16383
9f75a70689e4e01ce707fafad324e8d038d35413 10.0.0.4:6380@16380 slave 53433c54a23cbac6fbced8c571966d66b4856a5e 0 1632728186523 3 connected
f372c5b042101b9e6dfe9b8b06929ae0598dd1f7 10.0.0.2:6379@16379 myself,master - 0 1632728185000 1 connected 0-5460
f0f3760a605d56d567ab65afdef7968e40bb1e5b 10.0.0.2:6380@16380 slave 343fafe7458d0d52f2394906bab1a1b9fc6c981f 0 1632728184491 5 connected
53433c54a23cbac6fbced8c571966d66b4856a5e 10.0.0.3:6379@16379 master - 0 1632728186000 3 connected 5461-10922
a84fc63bdee373589647feceb9b8bb68cc49a81e 10.0.0.3:6380@16380 slave f372c5b042101b9e6dfe9b8b06929ae0598dd1f7 0 1632728186000 1 connected

从输出可以看出,集群状态正常。
cluster_state:ok

3. 故障转移和数据恢复

3.1故障转移

3.1.1 模拟宕机(不删数据)

到10.0.0.4机器上将redis停掉(模拟该节点宕机)

[root@ceph-03 conf]# docker stop redis-6379
[root@ceph-03 conf]# docker stop redis-6380
记录下操作时间
[root@ceph-03 ~]# date
Mon Sep 27 15:44:30 CST 2021

查看集群状态

10.0.0.2:6379> cluster info
cluster_state:ok
cluster_slots_assigned:16384
cluster_slots_ok:16384
cluster_slots_pfail:0
cluster_slots_fail:0
cluster_known_nodes:6
cluster_size:3
cluster_current_epoch:7
cluster_my_epoch:1
cluster_stats_messages_ping_sent:434
cluster_stats_messages_pong_sent:453
cluster_stats_messages_fail_sent:6
cluster_stats_messages_auth-ack_sent:1
cluster_stats_messages_sent:894
cluster_stats_messages_ping_received:448
cluster_stats_messages_pong_received:433
cluster_stats_messages_meet_received:5
cluster_stats_messages_fail_received:2
cluster_stats_messages_auth-req_received:1
cluster_stats_messages_received:889
10.0.0.2:6379> cluster nodes
343fafe7458d0d52f2394906bab1a1b9fc6c981f 10.0.0.4:6379@16379 master,fail - 1632728516901 1632728511805 5 disconnected
9f75a70689e4e01ce707fafad324e8d038d35413 10.0.0.4:6380@16380 slave,fail 53433c54a23cbac6fbced8c571966d66b4856a5e 1632728518925 1632728515000 3 disconnected
f372c5b042101b9e6dfe9b8b06929ae0598dd1f7 10.0.0.2:6379@16379 myself,master - 0 1632728564000 1 connected 0-5460
f0f3760a605d56d567ab65afdef7968e40bb1e5b 10.0.0.2:6380@16380 master - 0 1632728565938 7 connected 10923-16383
53433c54a23cbac6fbced8c571966d66b4856a5e 10.0.0.3:6379@16379 master - 0 1632728565000 3 connected 5461-10922
a84fc63bdee373589647feceb9b8bb68cc49a81e 10.0.0.3:6380@16380 slave f372c5b042101b9e6dfe9b8b06929ae0598dd1f7 0 1632728563896 1 connected

从输出可以看出,10.0.0.4:6379和10.0.0.4:6380状态都为fail,但集群状态仍然ok,master节点10.0.0.4:6379的slave节点10.0.0.2:6380,提升为了master节点。
现在往目前的集群中写点数据进去,模拟节点宕机后继续数据写入。

root@ceph-01:/data# for i in `seq 1000`;do echo "set foo$i $i" >> /tmp/set.txt;done

root@ceph-01:/data# cat /tmp/set.txt|redis-cli -a 123456 -h 10.0.0.2 -c

等10分钟后,将10.0.0.4:6379和10.0.0.4:6380实例再启动,查看实例是否会自动加入集群。

[root@ceph-03 ~]# date
Mon Sep 27 15:54:51 CST 2021

[root@ceph-03 ~]# docker start redis-6379
redis-6379
[root@ceph-03 ~]# docker start redis-6380
redis-6380

再次查看集群状态

cluster_state:ok
cluster_slots_assigned:16384
cluster_slots_ok:16384
cluster_slots_pfail:0
cluster_slots_fail:0
cluster_known_nodes:6
cluster_size:3
cluster_current_epoch:7
cluster_my_epoch:1
cluster_stats_messages_ping_sent:1176
cluster_stats_messages_pong_sent:1202
cluster_stats_messages_fail_sent:6
cluster_stats_messages_auth-ack_sent:1
cluster_stats_messages_sent:2385
cluster_stats_messages_ping_received:1197
cluster_stats_messages_pong_received:1175
cluster_stats_messages_meet_received:5
cluster_stats_messages_fail_received:2
cluster_stats_messages_auth-req_received:1
cluster_stats_messages_received:2380
10.0.0.2:6379> cluster nodes
343fafe7458d0d52f2394906bab1a1b9fc6c981f 10.0.0.4:6379@16379 slave f0f3760a605d56d567ab65afdef7968e40bb1e5b 0 1632729359274 7 connected
9f75a70689e4e01ce707fafad324e8d038d35413 10.0.0.4:6380@16380 slave 53433c54a23cbac6fbced8c571966d66b4856a5e 0 1632729357244 3 connected
f372c5b042101b9e6dfe9b8b06929ae0598dd1f7 10.0.0.2:6379@16379 myself,master - 0 1632729359000 1 connected 0-5460
f0f3760a605d56d567ab65afdef7968e40bb1e5b 10.0.0.2:6380@16380 master - 0 1632729358000 7 connected 10923-16383
53433c54a23cbac6fbced8c571966d66b4856a5e 10.0.0.3:6379@16379 master - 0 1632729358260 3 connected 5461-10922
a84fc63bdee373589647feceb9b8bb68cc49a81e 10.0.0.3:6380@16380 slave f372c5b042101b9e6dfe9b8b06929ae0598dd1f7 0 1632729357000 1 connected

从输出可以看出,10.0.0.4:6379和10.0.0.4:6380都已经加入集群成为slave,集群状态为ok。

3.1.2 模拟宕机(删除aof和rdb数据,不删除集群配置)

删除数据,停掉10.0.0.2:6379和10.0.0.3:6380实例

[root@ceph-02 ~]# rm -rf rm -rf /home/zshield/BIGdata/redis_data/6379/appendonly.aof 
[root@ceph-02 ~]# rm -rf rm -rf /home/zshield/BIGdata/redis_data/6379/dump.rdb 
[root@ceph-02 ~]# rm -rf rm -rf /home/zshield/BIGdata/redis_data/6380/appendonly.aof 
[root@ceph-02 ~]# rm -rf rm -rf /home/zshield/BIGdata/redis_data/6380/dump.rdb 
[root@ceph-02 ~]# docker stop redis-6379
redis-6379
[root@ceph-02 ~]# docker stop redis-6380
redis-6380
[root@ceph-02 ~]# date
Mon Sep 27 16:01:36 CST 2021

查看集群状态

10.0.0.2:6379> cluster info
cluster_state:ok
cluster_slots_assigned:16384
cluster_slots_ok:16384
cluster_slots_pfail:0
cluster_slots_fail:0
cluster_known_nodes:6
cluster_size:3
cluster_current_epoch:8
cluster_my_epoch:1
cluster_stats_messages_ping_sent:1549
cluster_stats_messages_pong_sent:1591
cluster_stats_messages_fail_sent:9
cluster_stats_messages_auth-ack_sent:2
cluster_stats_messages_sent:3151
cluster_stats_messages_ping_received:1586
cluster_stats_messages_pong_received:1547
cluster_stats_messages_meet_received:5
cluster_stats_messages_fail_received:5
cluster_stats_messages_auth-req_received:2
cluster_stats_messages_received:3145
10.0.0.2:6379> cluster nodes
343fafe7458d0d52f2394906bab1a1b9fc6c981f 10.0.0.4:6379@16379 slave f0f3760a605d56d567ab65afdef7968e40bb1e5b 0 1632729736367 7 connected
9f75a70689e4e01ce707fafad324e8d038d35413 10.0.0.4:6380@16380 master - 0 1632729737390 8 connected 5461-10922
f372c5b042101b9e6dfe9b8b06929ae0598dd1f7 10.0.0.2:6379@16379 myself,master - 0 1632729735000 1 connected 0-5460
f0f3760a605d56d567ab65afdef7968e40bb1e5b 10.0.0.2:6380@16380 master - 0 1632729736000 7 connected 10923-16383
53433c54a23cbac6fbced8c571966d66b4856a5e 10.0.0.3:6379@16379 master,fail - 1632729693527 1632729692000 3 disconnected
a84fc63bdee373589647feceb9b8bb68cc49a81e 10.0.0.3:6380@16380 slave,fail f372c5b042101b9e6dfe9b8b06929ae0598dd1f7 1632729697599 1632729695563 1 disconnected

输出可以看出,10.0.0.3:6379和10.0.0.3:6380状态失败,但集群正常。
再写入数据

root@ceph-01:/data# for i in `seq 1000 2000`;do echo "set foo$i $i" >> /tmp/set2.txt;done

root@ceph-01:/data# cat /tmp/set2.txt | redis-cli -a 123456 -h 10.0.0.2 -c

十分钟后启动10.0.0.3:6379和10.0.0.3:6380实例

[root@ceph-02 ~]# date
Mon Sep 27 16:10:36 CST 2021

[root@ceph-02 ~]# docker-compose -f /home/zshield/docker/compose/redis-6379.yml up -d
[root@ceph-02 ~]# docker-compose -f /home/zshield/docker/compose/redis-6380.yml up -d

验证集群状态

10.0.0.2:6379> cluster info
cluster_state:ok
cluster_slots_assigned:16384
cluster_slots_ok:16384
cluster_slots_pfail:0
cluster_slots_fail:0
cluster_known_nodes:6
cluster_size:3
cluster_current_epoch:8
cluster_my_epoch:1
cluster_stats_messages_ping_sent:2082
cluster_stats_messages_pong_sent:2125
cluster_stats_messages_fail_sent:9
cluster_stats_messages_auth-ack_sent:2
cluster_stats_messages_sent:4218
cluster_stats_messages_ping_received:2120
cluster_stats_messages_pong_received:2080
cluster_stats_messages_meet_received:5
cluster_stats_messages_fail_received:5
cluster_stats_messages_auth-req_received:2
cluster_stats_messages_received:4212
10.0.0.2:6379> cluster nodes
343fafe7458d0d52f2394906bab1a1b9fc6c981f 10.0.0.4:6379@16379 slave f0f3760a605d56d567ab65afdef7968e40bb1e5b 0 1632730293886 7 connected
9f75a70689e4e01ce707fafad324e8d038d35413 10.0.0.4:6380@16380 master - 0 1632730296000 8 connected 5461-10922
f372c5b042101b9e6dfe9b8b06929ae0598dd1f7 10.0.0.2:6379@16379 myself,master - 0 1632730294000 1 connected 0-5460
f0f3760a605d56d567ab65afdef7968e40bb1e5b 10.0.0.2:6380@16380 master - 0 1632730293000 7 connected 10923-16383
53433c54a23cbac6fbced8c571966d66b4856a5e 10.0.0.3:6379@16379 slave 9f75a70689e4e01ce707fafad324e8d038d35413 0 1632730296926 8 connected
a84fc63bdee373589647feceb9b8bb68cc49a81e 10.0.0.3:6380@16380 slave f372c5b042101b9e6dfe9b8b06929ae0598dd1f7 0 1632730295000 1 connected

从输出可以看出,集群状态ok,实例10.0.0.3:6379和10.0.0.3:6380成功加入集群。

3.1.3模拟宕机(删除aof、rdb和集群配置数据)

删除所有数据,停止10.0.0.4:6379和10.0.0.4:6380实例

[root@ceph-03 ~]# rm -rf /home/zshield/BIGdata/redis_data/6379/*
[root@ceph-03 ~]# rm -rf /home/zshield/BIGdata/redis_data/6380/*
[root@ceph-03 ~]# docker stop redis-6379
redis-6379
[root@ceph-03 ~]# docker stop redis-6380
redis-6380

[root@ceph-03 ~]# date
Mon Sep 27 16:19:03 CST 2021

查看集群状态

10.0.0.2:6379> cluster info
cluster_state:ok
cluster_slots_assigned:16384
cluster_slots_ok:16384
cluster_slots_pfail:0
cluster_slots_fail:0
cluster_known_nodes:6
cluster_size:3
cluster_current_epoch:9
cluster_my_epoch:1
cluster_stats_messages_ping_sent:2591
cluster_stats_messages_pong_sent:2619
cluster_stats_messages_fail_sent:9
cluster_stats_messages_auth-ack_sent:3
cluster_stats_messages_sent:5222
cluster_stats_messages_ping_received:2614
cluster_stats_messages_pong_received:2588
cluster_stats_messages_meet_received:5
cluster_stats_messages_fail_received:7
cluster_stats_messages_auth-req_received:3
cluster_stats_messages_received:5217
10.0.0.2:6379> cluster nodes
343fafe7458d0d52f2394906bab1a1b9fc6c981f 10.0.0.4:6379@16379 slave,fail f0f3760a605d56d567ab65afdef7968e40bb1e5b 1632730695228 1632730690130 7 disconnected
9f75a70689e4e01ce707fafad324e8d038d35413 10.0.0.4:6380@16380 master,fail - 1632730697260 1632730692180 8 disconnected
f372c5b042101b9e6dfe9b8b06929ae0598dd1f7 10.0.0.2:6379@16379 myself,master - 0 1632730813000 1 connected 0-5460
f0f3760a605d56d567ab65afdef7968e40bb1e5b 10.0.0.2:6380@16380 master - 0 1632730816662 7 connected 10923-16383
53433c54a23cbac6fbced8c571966d66b4856a5e 10.0.0.3:6379@16379 master - 0 1632730817684 9 connected 5461-10922
a84fc63bdee373589647feceb9b8bb68cc49a81e 10.0.0.3:6380@16380 slave f372c5b042101b9e6dfe9b8b06929ae0598dd1f7 0 1632730816000 1 connected

可以看出,集群状态ok,10.0.0.4:6379和10.0.0.4:6380实例状态为fail。
写入数据

root@ceph-01:/data# cat /tmp/set3.txt | redis-cli -a 123456 -h 10.0.0.2 -c

root@ceph-01:/data# for i in `seq 2000 3000`;do echo "set foo$i $i" >> /tmp/set3.txt;done

十分钟后启动实例

[root@ceph-03 ~]# date
Mon Sep 27 16:29:03 CST 2021

[root@ceph-03 ~]# docker-compose -f /home/zshield/docker/compose/redis-6379.yml up -d
[root@ceph-03 ~]# docker-compose -f /home/zshield/docker/compose/redis-6380.yml up -d

查看集群状态

10.0.0.2:6379> cluster info
cluster_state:ok
cluster_slots_assigned:16384
cluster_slots_ok:16384
cluster_slots_pfail:0
cluster_slots_fail:0
cluster_known_nodes:6
cluster_size:3
cluster_current_epoch:9
cluster_my_epoch:1
cluster_stats_messages_ping_sent:3177
cluster_stats_messages_pong_sent:3184
cluster_stats_messages_fail_sent:9
cluster_stats_messages_auth-ack_sent:3
cluster_stats_messages_sent:6373
cluster_stats_messages_ping_received:3179
cluster_stats_messages_pong_received:3174
cluster_stats_messages_meet_received:5
cluster_stats_messages_fail_received:7
cluster_stats_messages_auth-req_received:3
cluster_stats_messages_received:6368
10.0.0.2:6379> cluster nodes
343fafe7458d0d52f2394906bab1a1b9fc6c981f :0@0 slave,fail,noaddr f0f3760a605d56d567ab65afdef7968e40bb1e5b 1632730695228 1632730690130 7 disconnected
9f75a70689e4e01ce707fafad324e8d038d35413 :0@0 master,fail,noaddr - 1632730697260 1632730692180 8 disconnected
f372c5b042101b9e6dfe9b8b06929ae0598dd1f7 10.0.0.2:6379@16379 myself,master - 0 1632731438000 1 connected 0-5460
f0f3760a605d56d567ab65afdef7968e40bb1e5b 10.0.0.2:6380@16380 master - 0 1632731438495 7 connected 10923-16383
53433c54a23cbac6fbced8c571966d66b4856a5e 10.0.0.3:6379@16379 master - 0 1632731437476 9 connected 5461-10922
a84fc63bdee373589647feceb9b8bb68cc49a81e 10.0.0.3:6380@16380 slave f372c5b042101b9e6dfe9b8b06929ae0598dd1f7 0 1632731436453 1 connected

从输出可以看出,集群状态ok(仍可提供服务),但是实例加入集群失败,原因是因为将实例的集群配置删除了。针对这种情况,要手动干预,将实例加入集群,具体操作方法见下文。

3.1.4 结论

3个节点的redis集群(配置了主备),可容灾一个节点宕机,集群状态ok;只要不丢本节点的实例的集群配置的情况下,启动实例后,节点实例仍然可以正常加入集群;丢失本节点的实例的集群配置信息,启动实例后,节点不可以正常加入集群。

3.2 数据恢复

演示集群中10.0.0.3 6379实例的数据恢复
提前备份10.0.0.3 6379 实例的aof文件

[root@ceph-02 ~]# mkdir /home/zshield/backup/6379 -p

[root@ceph-02 ~]# cp /home/zshield/BIGdata/redis_data/6379/appendonly.aof /home/zshield/backup/6379/

删除10.0.0.3 6379实例数据

10.0.0.3:6379> info keyspace
# Keyspace
db0:keys=992,expires=0,avg_ttl=0

10.0.0.3:6379> flushall
OK
10.0.0.3:6379> info keyspace
# Keyspace

恢复数据

[root@ceph-02 ~]# cp /home/zshield/backup/6379/appendonly.aof /home/zshield/BIGdata/redis_data/6379/
cp: overwrite ‘/home/zshield/BIGdata/redis_data/6379/appendonly.aof’? y
[root@ceph-02 ~]# docker restart redis-6379
redis-6379

验证数据是否恢复

10.0.0.3:6379> info keyspace
# Keyspace
db0:keys=992,expires=0,avg_ttl=0

4 移动哈希槽

4.1 移动哈希槽方式

CLUSTER ADDSLOTS slot1 [slot2] … [slotN]
CLUSTER DELSLOTS slot1 [slot2] … [slotN]
CLUSTER SETSLOT slot NODE node
CLUSTER SETSLOT slot MIGRATING node
CLUSTER SETSLOT slot IMPORTING node

头两个命令,ADDSLOTS 和 DELSLOTS,就是简单地用来给一个 Redis 节点指派(assign)或移除哈希槽。 在哈希槽被指派后,节点会将这个消息通过 gossip 协议向整个集群传播。ADDSLOTS 命令通常是用于在一个集群刚建立的时候快速给所有节点指派哈希槽。
当 SETSLOT 子命令使用 NODE 形式的时候,用来给指定 ID 的节点指派哈希槽。 除此之外哈希槽能通过两个特殊的状态来设定,MIGRATING 和 IMPORTING:
当一个槽被设置为 MIGRATING,原来持有该哈希槽的节点仍会接受所有跟这个哈希槽有关的请求,但只有当查询的键还存在原节点时,原节点会处理该请求,否则这个查询会通过一个 -ASK 重定向(-ASK redirection)转发到迁移的目标节点。
当一个槽被设置为 IMPORTING,只有在接受到 ASKING 命令之后节点才会接受所有查询这个哈希槽的请求。如果客户端一直没有发送 ASKING 命令,那么查询都会通过 -MOVED 重定向错误转发到真正处理这个哈希槽的节点那里。

4.2 NODE形式移动哈希槽

#查看集群节点信息

10.0.0.4:6379> cluster nodes
f130db7007c45c240bf0c3b59aebb4029beba7f9 10.0.0.2:6380@16380 slave 3545e21b7156aeb00fa951615ebe328c98c339cd 0 1632305360230 5 connected
996fabf73d677186eeaa781cbc520e7da70fbeab 10.0.0.4:6380@16380 slave 2d78cef6d9845bf3e3c323660ac139e75b04ed1e 0 1632305362000 3 connected
cdefd06878028807e4dbf1ae94d0e0b46cd3f024 10.0.0.2:6379@16379 master - 0 1632305363290 1 connected 0-5460
c3fab683854482c67cff9f592e81738851f5c47b 10.0.0.3:6380@16380 slave cdefd06878028807e4dbf1ae94d0e0b46cd3f024 0 1632305364308 1 connected
3545e21b7156aeb00fa951615ebe328c98c339cd 10.0.0.4:6379@16379 myself,master - 0 1632305361000 5 connected 10923-16383
2d78cef6d9845bf3e3c323660ac139e75b04ed1e 10.0.0.3:6379@16379 master - 0 1632305362267 3 connected 5461-10922

#尝试通过指定节点ip地址移动哈希槽,由报错可知,在节点之间移动哈希槽的时候不能指定ip地址

10.0.0.4:6379> cluster setslot 6000 node 10.0.0.4
(error) ERR Unknown node 10.0.0.4

#通过指定节点id移动哈希槽(将6000哈希槽从10.0.0.3:6379移动到10.0.0.4:6379)

10.0.0.4:6379> cluster setslot 6000 node 3545e21b7156aeb00fa951615ebe328c98c339cd
OK

#查看哈希槽是否移动成功

10.0.0.4:6379> cluster nodes
f130db7007c45c240bf0c3b59aebb4029beba7f9 10.0.0.2:6380@16380 slave 3545e21b7156aeb00fa951615ebe328c98c339cd 0 1632305926000 5 connected
996fabf73d677186eeaa781cbc520e7da70fbeab 10.0.0.4:6380@16380 slave 2d78cef6d9845bf3e3c323660ac139e75b04ed1e 0 1632305925000 3 connected
cdefd06878028807e4dbf1ae94d0e0b46cd3f024 10.0.0.2:6379@16379 master - 0 1632305925000 1 connected 0-5460
c3fab683854482c67cff9f592e81738851f5c47b 10.0.0.3:6380@16380 slave cdefd06878028807e4dbf1ae94d0e0b46cd3f024 0 1632305926221 1 connected
3545e21b7156aeb00fa951615ebe328c98c339cd 10.0.0.4:6379@16379 myself,master - 0 1632305923000 5 connected 6000 10923-16383
2d78cef6d9845bf3e3c323660ac139e75b04ed1e 10.0.0.3:6379@16379 master - 0 1632305927236 3 connected 5461-5999 6001-10922
10.0.0.4:6379> cluster setslot 6000 node 2d78cef6d9845bf3e3c323660ac139e75b04ed1e
OK

4.3 设置哈希槽的状态移动哈希槽

###把哈希槽 6000 从 节点 10.0.0.3 移到 节点10.0.0.2
#登录10.0.0.2 节点,移动之前查看集群哈希槽分布状态

10.0.0.2:6379> cluster nodes
18e370fb0ffdf3a37ada1db5281aae6ad9ff8a68 10.0.0.2:6379@16379 myself,master - 0 1632379310000 1 connected 0-5460
05b6a8f73a058da489fb50e6de190611fa393148 10.0.0.4:6379@16379 master - 0 1632379311000 5 connected 10923-16383
e81e79ad7244c9ce61c68a613040e92e30aab046 10.0.0.2:6380@16380 slave 05b6a8f73a058da489fb50e6de190611fa393148 0 1632379310000 5 connected
6647d69e7eecb215ee336a622110875e78d05a63 10.0.0.4:6380@16380 slave 91c767b5c3483c9a89de793d57213a682431d021 0 1632379310000 3 connected
91c767b5c3483c9a89de793d57213a682431d021 10.0.0.3:6379@16379 master - 0 1632379313232 3 connected 5461-10922
8a9320198744e08d9f94ad2ae2c0b47211133c83 10.0.0.3:6380@16380 slave 18e370fb0ffdf3a37ada1db5281aae6ad9ff8a68 0 1632379312216 1 connected

#执行CLUSTER SETSLOT slot MIGRATING node命令

10.0.0.2:6379> cluster setslot 6000 importing 91c767b5c3483c9a89de793d57213a682431d021 
OK

#再次查看集群哈希槽状态,可以看见6000哈希槽被10.0.0.2 节点接受
#说明:哈希槽-<-节点id 表示哈希槽来自哪个节点

10.0.0.2:6379> cluster nodes
18e370fb0ffdf3a37ada1db5281aae6ad9ff8a68 10.0.0.2:6379@16379 myself,master - 0 1632379423000 1 connected 0-5460 [6000-<-91c767b5c3483c9a89de793d57213a682431d021]
05b6a8f73a058da489fb50e6de190611fa393148 10.0.0.4:6379@16379 master - 0 1632379423215 5 connected 10923-16383
e81e79ad7244c9ce61c68a613040e92e30aab046 10.0.0.2:6380@16380 slave 05b6a8f73a058da489fb50e6de190611fa393148 0 1632379424230 5 connected
6647d69e7eecb215ee336a622110875e78d05a63 10.0.0.4:6380@16380 slave 91c767b5c3483c9a89de793d57213a682431d021 0 1632379423000 3 connected
91c767b5c3483c9a89de793d57213a682431d021 10.0.0.3:6379@16379 master - 0 1632379422192 3 connected 5461-10922
8a9320198744e08d9f94ad2ae2c0b47211133c83 10.0.0.3:6380@16380 slave 18e370fb0ffdf3a37ada1db5281aae6ad9ff8a68 0 1632379425251 1 connected

#退出登录

10.0.0.2:6379> quit

#登录10.0.0.3

root@ceph-01:/data# redis-cli -c -h 10.0.0.3 -a 123456
Warning: Using a password with '-a' or '-u' option on the command line interface may not be safe.

#查看集群哈希槽分布状态

10.0.0.3:6379> cluster nodes
18e370fb0ffdf3a37ada1db5281aae6ad9ff8a68 10.0.0.2:6379@16379 master - 0 1632379459472 1 connected 0-5460
91c767b5c3483c9a89de793d57213a682431d021 10.0.0.3:6379@16379 myself,master - 0 1632379457000 3 connected 5461-10922
05b6a8f73a058da489fb50e6de190611fa393148 10.0.0.4:6379@16379 master - 0 1632379460495 5 connected 10923-16383
6647d69e7eecb215ee336a622110875e78d05a63 10.0.0.4:6380@16380 slave 91c767b5c3483c9a89de793d57213a682431d021 0 1632379459000 3 connected
8a9320198744e08d9f94ad2ae2c0b47211133c83 10.0.0.3:6380@16380 slave 18e370fb0ffdf3a37ada1db5281aae6ad9ff8a68 0 1632379458451 1 connected
e81e79ad7244c9ce61c68a613040e92e30aab046 10.0.0.2:6380@16380 slave 05b6a8f73a058da489fb50e6de190611fa393148 0 1632379459000 5 connected

#执行CLUSTER SETSLOT slot MIGRATING node 命令

10.0.0.3:6379> cluster setslot 6000 migrating 18e370fb0ffdf3a37ada1db5281aae6ad9ff8a68
OK

#再次查看集群哈希槽分布状态,可以看见6000哈希槽迁移到了10.0.0.2
#说明:哈希槽>-节点id 表示迁移到哪个节点

10.0.0.3:6379> cluster nodes
18e370fb0ffdf3a37ada1db5281aae6ad9ff8a68 10.0.0.2:6379@16379 master - 0 1632379501000 1 connected 0-5460
91c767b5c3483c9a89de793d57213a682431d021 10.0.0.3:6379@16379 myself,master - 0 1632379500000 3 connected 5461-10922 [6000->-18e370fb0ffdf3a37ada1db5281aae6ad9ff8a68]
05b6a8f73a058da489fb50e6de190611fa393148 10.0.0.4:6379@16379 master - 0 1632379500000 5 connected 10923-16383
6647d69e7eecb215ee336a622110875e78d05a63 10.0.0.4:6380@16380 slave 91c767b5c3483c9a89de793d57213a682431d021 0 1632379500245 3 connected
8a9320198744e08d9f94ad2ae2c0b47211133c83 10.0.0.3:6380@16380 slave 18e370fb0ffdf3a37ada1db5281aae6ad9ff8a68 0 1632379501260 1 connected
e81e79ad7244c9ce61c68a613040e92e30aab046 10.0.0.2:6380@16380 slave 05b6a8f73a058da489fb50e6de190611fa393148 0 1632379499000 5 connected

#退出登录

10.0.0.3:6379> quit

#登录10.0.0.4 节点查看集群哈希槽分布,可以发现这种操作对不相干的节点是不可见的

root@ceph-01:/data# redis-cli -c -h 10.0.0.4 -a 123456
Warning: Using a password with '-a' or '-u' option on the command line interface may not be safe.
10.0.0.4:6379> cluster nodes
91c767b5c3483c9a89de793d57213a682431d021 10.0.0.3:6379@16379 master - 0 1632379544000 3 connected 5461-10922
05b6a8f73a058da489fb50e6de190611fa393148 10.0.0.4:6379@16379 myself,master - 0 1632379541000 5 connected 10923-16383
e81e79ad7244c9ce61c68a613040e92e30aab046 10.0.0.2:6380@16380 slave 05b6a8f73a058da489fb50e6de190611fa393148 0 1632379543000 5 connected
18e370fb0ffdf3a37ada1db5281aae6ad9ff8a68 10.0.0.2:6379@16379 master - 0 1632379543000 1 connected 0-5460
6647d69e7eecb215ee336a622110875e78d05a63 10.0.0.4:6380@16380 slave 91c767b5c3483c9a89de793d57213a682431d021 0 1632379544912 3 connected
8a9320198744e08d9f94ad2ae2c0b47211133c83 10.0.0.3:6380@16380 slave 18e370fb0ffdf3a37ada1db5281aae6ad9ff8a68 0 1632379542000 1 connected

5 redis命令

5.1 从文件导入数据、命令

从文件导入key的值

root@ceph-01:/data# redis-cli -a 123456 -h 10.0.0.2 -c -x set foo </etc/adduser.conf 
Warning: Using a password with '-a' or '-u' option on the command line interface may not be safe.
OK
root@ceph-01:/data# redis-cli -a 123456 -h 10.0.0.2 -c get foo 
Warning: Using a password with '-a' or '-u' option on the command line interface may not be safe.
"# /etc/adduser.conf: `adduser' configuration.\n# See adduser(8) and adduser.conf(5) for full documentation.\n\n# The DSHELL variable specifies the default login shell on your\n# system.\nDSHELL=/bin/bash\n\n# The DHOME variable specifies the directory containing users' home\n# directories.\nDHOME=/home\n\n# If GROUPHOMES is \"yes\", then the home directories will be created as\n# /home/groupname/user.\nGROUPHOMES=no\n\n# If LETTERHOMES is \"yes\", then the created home directories will have\n# an extra directory - the first letter of the user name. For example:\n# /home/u/user.\nLETTERHOMES=no\n\n# The SKEL variable specifies the directory containing \"skeletal\" user\n# files; in other words, files such as a sample .profile that will be\n# copied to the new user's home directory when it is created.\nSKEL=/etc/skel\n\n# FIRST_SYSTEM_[GU]ID to LAST_SYSTEM_[GU]ID inclusive is the range for UIDs\n# for dynamically allocated administrative and system accounts/groups.\n# Please note that system software, such as the users allocated by the base-passwd\n# package, may assume that UIDs less than 100 are unallocated.\nFIRST_SYSTEM_UID=100\nLAST_SYSTEM_UID=999\n\nFIRST_SYSTEM_GID=100\nLAST_SYSTEM_GID=999\n\n# FIRST_[GU]ID to LAST_[GU]ID inclusive is the range of UIDs of dynamically\n# allocated user accounts/groups.\nFIRST_UID=1000\nLAST_UID=59999\n\nFIRST_GID=1000\nLAST_GID=59999\n\n# The USERGROUPS variable can be either \"yes\" or \"no\". If \"yes\" each\n# created user will be given their own group to use as a default. If\n# \"no\", each created user will be placed in the group whose gid is\n# USERS_GID (see below).\nUSERGROUPS=yes\n\n# If USERGROUPS is \"no\", then USERS_GID should be the GID of the group\n# `users' (or the equivalent group) on your system.\nUSERS_GID=100\n\n# If DIR_MODE is set, directories will be created with the specified\n# mode. Otherwise the default mode 0755 will be used.\nDIR_MODE=0755\n\n# If SETGID_HOME is \"yes\" home directories for users with their own\n# group the setgid bit will be set. This was the default for\n# versions << 3.13 of adduser. Because it has some bad side effects we\n# no longer do this per default. If you want it nevertheless you can\n# still set it here.\nSETGID_HOME=no\n\n# If QUOTAUSER is set, a default quota will be set from that user with\n# `edquota -p QUOTAUSER newuser'\nQUOTAUSER=\"\"\n\n# If SKEL_IGNORE_REGEX is set, adduser will ignore files matching this\n# regular expression when creating a new home directory\nSKEL_IGNORE_REGEX=\"dpkg-(old|new|dist|save)\"\n\n# Set this if you want the --add_extra_groups option to adduser to add\n# new users to other groups.\n# This is the list of groups that new non-system users will be added to\n# Default:\n#EXTRA_GROUPS=\"dialout cdrom floppy audio video plugdev users\"\n\n# If ADD_EXTRA_GROUPS is set to something non-zero, the EXTRA_GROUPS\n# option above will be default behavior for adding new, non-system users\n#ADD_EXTRA_GROUPS=1\n\n\n# check user and group names also against this regular expression.\n#NAME_REGEX=\"^[a-z][-a-z0-9_]*\\$\"\n"
root@ceph-01:/data# redis-cli -a 123456 -h 10.0.0.2 -c getrange foo 0 50
Warning: Using a password with '-a' or '-u' option on the command line interface may not be safe.
"# /etc/adduser.conf: `adduser' configuration.\n# See"

从文件导入命令执行
需要注意的是:
redis-cli中只支持dos格式的换行符 \r\n ,如果你在Linux下、Mac下或者Windows下创建的文件,最好都转个码。没有转码的文件,执行会失败。
下面是转码指令, 只需要在命令后加入要转码的文件即可:
unix2dos 文件名

root@ceph-01:/data# cat /tmp/command.txt
set foo 100
incr foo
append foo xxx
get foo
root@ceph-01:/data# cat /tmp/command.txt|redis-cli -a 123456 -h 10.0.0.2 -c 
Warning: Using a password with '-a' or '-u' option on the command line interface may not be safe.
-> Redirected to slot [12182] located at 10.0.0.4:6379
OK
(integer) 101
(integer) 6
"101xxx"
root@ceph-01:/data# cat /tmp/command.txt
set foo "This is a single argument"
strlen foo
root@ceph-01:/data# cat /tmp/command.txt|redis-cli -a 123456 -h 10.0.0.2 -c
Warning: Using a password with '-a' or '-u' option on the command line interface may not be safe.
-> Redirected to slot [12182] located at 10.0.0.4:6379
OK
(integer) 25

5.2 监控rss内存大小

root@ceph-01:/data# redis-cli -a 123456 -h 10.0.0.2 -c -r -1 -i 1 INFO |grep rss_human
Warning: Using a password with '-a' or '-u' option on the command line interface may not be safe.
used_memory_rss_human:14.05M
used_memory_rss_human:14.04M
used_memory_rss_human:14.04M
used_memory_rss_human:14.03M
used_memory_rss_human:14.02M
used_memory_rss_human:14.01M
used_memory_rss_human:14.00M
used_memory_rss_human:14.00M
used_memory_rss_human:13.99M
used_memory_rss_human:14.22M
used_memory_rss_human:14.21M
used_memory_rss_human:14.21M
used_memory_rss_human:14.20M
^C

5.3 监测redis状态

root@ceph-01:/data# redis-cli -a 123456 -h 10.0.0.2 -c --stat
Warning: Using a password with '-a' or '-u' option on the command line interface may not be safe.
------- data ------ --------------------- load -------------------- - child -
keys mem clients blocked requests connections 
1 2.55M 1 0 7448 (+0) 8 
1 2.55M 1 0 7450 (+2) 8 
1 2.55M 1 0 7452 (+2) 8 
1 2.55M 1 0 7454 (+2) 8 
1 2.55M 1 0 7456 (+2) 8 
1 2.55M 1 0 7458 (+2) 8 
1 2.55M 1 0 7460 (+2) 8 
1 2.55M 1 0 7462 (+2) 8 
1 2.55M 1 0 7464 (+2) 8 
1 2.55M 1 0 7466 (+2) 8 
1 2.55M 1 0 7468 (+2) 8 
1 2.59M 1 0 7470 (+2) 8 
1 2.59M 1 0 7472 (+2) 8 
1 2.59M 1 0 7474 (+2) 8 
1 2.55M 1 0 7476 (+2) 8 
1 2.59M 1 0 7478 (+2) 8 
1 2.55M 1 0 7479 (+1) 8 
1 2.55M 1 0 7481 (+2) 8 
1 2.55M 1 0 7483 (+2) 8 
1 2.55M 1 0 7485 (+2) 8 
------- data ------ --------------------- load -------------------- - child -
keys mem clients blocked requests connections 
1 2.55M 1 0 7487 (+2) 8 
1 2.55M 1 0 7489 (+2) 8 
1 2.55M 1 0 7491 (+2) 8 
1 2.55M 1 0 7493 (+2) 8 
1 2.55M 1 0 7495 (+2) 8 
^C

5.4 监测会话操作

监测10.0.0.3 6379会话操作

root@ceph-01:/data# redis-cli -a 123456 -h 10.0.0.3 -c monitor 
Warning: Using a password with '-a' or '-u' option on the command line interface may not be safe.
OK
1632465213.356014 [0 10.0.0.3:47034] "COMMAND"
1632465223.045208 [0 10.0.0.3:47034] "set" "test" "1"

连接到10.0.0.3 6379 操作

root@ceph-02:/data# redis-cli -c -a 123456 -h 10.0.0.3   
Warning: Using a password with '-a' or '-u' option on the command line interface may not be safe.
10.0.0.3:6379> set test 1
OK
10.0.0.3:6379> quit

5.4 切换会话

10.0.0.4:6379> connect 10.0.0.2 6379
10.0.0.2:6379>

5.5 bigkeys

在这种特殊模式下,redis-cli作为键空间分析器工作。它扫描数据集的大键,但也提供关于数据集组成的数据类型的信息。这种模式是通过——bigkeys选项启用的,并产生相当详细的输出

root@ceph-02:/data# redis-cli -c -a 123456 -h 10.0.0.2 --bigkeys
Warning: Using a password with '-a' or '-u' option on the command line interface may not be safe.

# Scanning the entire keyspace to find biggest keys as well as
# average sizes per key type. You can use -i 0.1 to sleep 0.1 sec
# per 100 SCAN commands (not usually needed).

[00.00%] Biggest string found so far '"string"' with 119 bytes

-------- summary -------

Sampled 6 keys in the keyspace!
Total key length in bytes is 33 (avg len 5.50)

Biggest string found '"string"' has 119 bytes

0 lists with 0 items (00.00% of keys, avg size 0.00)
0 hashs with 0 fields (00.00% of keys, avg size 0.00)
6 strings with 124 bytes (100.00% of keys, avg size 20.67)
0 streams with 0 entries (00.00% of keys, avg size 0.00)
0 sets with 0 members (00.00% of keys, avg size 0.00)
0 zsets with 0 members (00.00% of keys, avg size 0.00)
5.6 scan
root@ceph-02:/data# redis-cli -c -a 123456 -h 10.0.0.2 --scan
Warning: Using a password with '-a' or '-u' option on the command line interface may not be safe.
string
foo2
mycounter
test4
foo3
test5

5.7scan

root@ceph-02:/data# redis-cli -c -a 123456 -h 10.0.0.2 --scan --pattern "fo*"
Warning: Using a password with '-a' or '-u' option on the command line interface may not be safe.
foo2
foo3

5.8 监控redis实例的延迟时间

CLI有多个工具来研究Redis实例的延迟,并了解延迟的最大值、平均值和分布。
基本的延迟检查工具是——latency选项。使用此选项,CLI运行一个循环,PING命令被发送到Redis实例,并测量得到回复的时间。这种情况每秒发生100次,并且在控制台中实时更新状态
统计数据以毫秒为单位。通常,由于运行redis-cli本身的系统内核调度器的延迟,一个非常快的实例的平均延迟往往被高估了一点。然而,这通常不是一个大问题,因为我们对几毫秒或更长的事件感兴趣。

root@ceph-02:/data# redis-cli -c -a 123456 -h 10.0.0.2 --latency 
Warning: Using a password with '-a' or '-u' option on the command line interface may not be safe.
min: 0, max: 11, avg: 0.94 (14345 samples)^C

有时,研究最大延迟和平均延迟是如何在一段时间内演化的是很有用的。为此使用——latency-history选项:它的工作方式与——latency完全相同,但是每隔15秒(默认情况下)就会从头开始一个新的采样会话

root@ceph-02:/data# redis-cli -c -a 123456 -h 10.0.0.2 --latency-history
Warning: Using a password with '-a' or '-u' option on the command line interface may not be safe.
min: 0, max: 3, avg: 0.87 (1278 samples) -- 15.01 seconds range
min: 0, max: 3, avg: 0.83 (1289 samples) -- 15.00 seconds range
min: 0, max: 4, avg: 0.85 (1288 samples) -- 15.00 seconds range
min: 0, max: 5, avg: 1.01 (1267 samples) -- 15.01 seconds range
min: 0, max: 7, avg: 0.88 (1283 samples) -- 15.01 seconds range
min: 0, max: 10, avg: 0.97 (1269 samples) -- 15.00 seconds range
min: 0, max: 5, avg: 0.87 (1286 samples) -- 15.01 seconds range
min: 0, max: 3, avg: 0.87 (1284 samples) -- 15.01 seconds range
min: 0, max: 4, avg: 0.81 (1291 samples) -- 15.01 seconds range
min: 0, max: 5, avg: 0.95 (1274 samples) -- 15.01 seconds range
min: 0, max: 3, avg: 0.83 (1289 samples) -- 15.01 seconds range
min: 0, max: 3, avg: 0.85 (1287 samples) -- 15.00 seconds range
min: 0, max: 3, avg: 0.82 (1293 samples) -- 15.01 seconds range
min: 0, max: 4, avg: 0.80 (1293 samples) -- 15.01 seconds range
min: 0, max: 3, avg: 0.83 (1291 samples) -- 15.01 seconds range
min: 0, max: 3, avg: 0.78 (1298 samples) -- 15.01 seconds range
min: 0, max: 3, avg: 0.83 (1293 samples) -- 15.00 seconds range

可以通过-i选项指定间隔时间(单位秒)

root@ceph-02:/data# redis-cli -c -a 123456 -h 10.0.0.2 --latency-history -i 1
Warning: Using a password with '-a' or '-u' option on the command line interface may not be safe.
min: 0, max: 2, avg: 1.04 (78 samples) -- 1.01 seconds range
min: 0, max: 3, avg: 1.04 (80 samples) -- 1.01 seconds range
min: 0, max: 2, avg: 0.93 (85 samples) -- 1.01 seconds range
min: 0, max: 2, avg: 0.67 (63 samples)^C

在redis-cli中还实现了另一个非常不寻常的延迟工具。它不检查Redis实例的延迟,但是检查你运行Redis -cli的计算机的延迟。你可能会问什么延迟?内核调度器固有的延迟,虚拟化实例的hypervisor等等。
10表示检测10秒

root@ceph-02:/data# redis-cli --intrinsic-latency 10
Max latency so far: 6 microseconds.
Max latency so far: 7 microseconds.
Max latency so far: 646 microseconds.
Max latency so far: 901 microseconds.
Max latency so far: 1358 microseconds.

710522 total runs (avg latency: 14.0742 microseconds / 14074.16 nanoseconds per run).
Worst run took 96x longer than the average latency.

5.9 备份远端的rdb文件到本地

root@ceph-02:/data# redis-cli -c -a 123456 -h 10.0.0.4 --rdb /tmp/dump.rdb
Warning: Using a password with '-a' or '-u' option on the command line interface may not be safe.
sending REPLCONF capa eof
SYNC sent to master, writing 207 bytes to '/tmp/dump.rdb'
Transfer finished with success.
Transfer finished with success.
root@ceph-02:/data# echo $?
0

5.10 对key做加操作

对key的值做加加操作,并返回新的值。注意incr一个不是int的 value会返回错误,incr一个不存在的key,则设置key为1。范围为 64有符号,-9223372036854775808~9223372036854775807。

10.0.0.4:6379> incr add 
-> Redirected to slot [1943] located at 10.0.0.2:6379
(integer) 1

10.0.0.2:6379> get add
"1"
10.0.0.2:6379> incr add 
(integer) 2
10.0.0.2:6379> get add
"2"

5.11 info

info all命令包含Redis最全的系统状态信息

6 Redis 持久化

6.1 持久化方式

Redis 提供了不同级别的持久化方式:
RDB持久化方式能够在指定的时间间隔能对你的数据进行快照存储.
AOF持久化方式记录每次对服务器写的操作,当服务器重启的时候会重新执行这些命令来恢复原始的数据,AOF命令以redis协议追加保存每次写的操作到文件末尾.Redis还能对AOF文件进行后台重写,使得AOF文件的体积不至于过大.
如果你只希望你的数据在服务器运行的时候存在,你也可以不使用任何持久化方式.
你也可以同时开启两种持久化方式, 在这种情况下, 当redis重启的时候会优先载入AOF文件来恢复原始的数据,因为在通常情况下AOF文件保存的数据集要比RDB文件保存的数据集要完整.
最重要的事情是了解RDB和AOF持久化方式的不同,让我们以RDB持久化方式开始:

6.2 RDB优缺点

RDB的优点
RDB是一个非常紧凑的文件,它保存了某个时间点得数据集,非常适用于数据集的备份,比如你可以在每个小时报保存一下过去24小时内的数据,同时每天保存过去30天的数据,这样即使出了问题你也可以根据需求恢复到不同版本的数据集.
RDB是一个紧凑的单一文件,很方便传送到另一个远端数据中心或者亚马逊的S3(可能加密),非常适用于灾难恢复.
RDB在保存RDB文件时父进程唯一需要做的就是fork出一个子进程,接下来的工作全部由子进程来做,父进程不需要再做其他IO操作,所以RDB持久化方式可以最大化redis的性能.
与AOF相比,在恢复大的数据集的时候,RDB方式会更快一些.
RDB的缺点
如果你希望在redis意外停止工作(例如电源中断)的情况下丢失的数据最少的话,那么RDB不适合你.虽然你可以配置不同的save时间点(例如每隔5分钟并且对数据集有100个写的操作),是Redis要完整的保存整个数据集是一个比较繁重的工作,你通常会每隔5分钟或者更久做一次完整的保存,万一在Redis意外宕机,你可能会丢失几分钟的数据.
RDB 需要经常fork子进程来保存数据集到硬盘上,当数据集比较大的时候,fork的过程是非常耗时的,可能会导致Redis在一些毫秒级内不能响应客户端的请求.如果数据集巨大并且CPU性能不是很好的情况下,这种情况会持续1秒,AOF也需要fork,但是你可以调节重写日志文件的频率来提高数据集的耐久度.

6.3 AOF优缺点

AOF 优点
使用AOF 会让你的Redis更加耐久: 你可以使用不同的fsync策略:无fsync,每秒fsync,每次写的时候fsync.使用默认的每秒fsync策略,Redis的性能依然很好(fsync是由后台线程进行处理的,主线程会尽力处理客户端请求),一旦出现故障,你最多丢失1秒的数据.
AOF文件是一个只进行追加的日志文件,所以不需要写入seek,即使由于某些原因(磁盘空间已满,写的过程中宕机等等)未执行完整的写入命令,你也也可使用redis-check-aof工具修复这些问题.
Redis 可以在 AOF 文件体积变得过大时,自动地在后台对 AOF 进行重写: 重写后的新 AOF 文件包含了恢复当前数据集所需的最小命令集合。 整个重写操作是绝对安全的,因为 Redis 在创建新 AOF 文件的过程中,会继续将命令追加到现有的 AOF 文件里面,即使重写过程中发生停机,现有的 AOF 文件也不会丢失。 而一旦新 AOF 文件创建完毕,Redis 就会从旧 AOF 文件切换到新 AOF 文件,并开始对新 AOF 文件进行追加操作。
AOF 文件有序地保存了对数据库执行的所有写入操作, 这些写入操作以 Redis 协议的格式保存, 因此 AOF 文件的内容非常容易被人读懂, 对文件进行分析(parse)也很轻松。 导出(export) AOF 文件也非常简单: 举个例子, 如果你不小心执行了 FLUSHALL 命令, 但只要 AOF 文件未被重写, 那么只要停止服务器, 移除 AOF 文件末尾的 FLUSHALL 命令, 并重启 Redis , 就可以将数据集恢复到 FLUSHALL 执行之前的状态。
AOF 缺点
对于相同的数据集来说,AOF 文件的体积通常要大于 RDB 文件的体积。
根据所使用的 fsync 策略,AOF 的速度可能会慢于 RDB 。 在一般情况下, 每秒 fsync 的性能依然非常高, 而关闭 fsync 可以让 AOF 的速度和 RDB 一样快, 即使在高负荷之下也是如此。 不过在处理巨大的写入载入时,RDB 可以提供更有保证的最大延迟时间(latency)。
6.4 如何选择使用哪种持久化方式
一般来说, 如果想达到足以媲美 PostgreSQL 的数据安全性, 你应该同时使用两种持久化功能。
如果你非常关心你的数据, 但仍然可以承受数分钟以内的数据丢失, 那么你可以只使用 RDB 持久化。
有很多用户都只使用 AOF 持久化, 但我们并不推荐这种方式: 因为定时生成 RDB 快照(snapshot)非常便于进行数据库备份, 并且 RDB 恢复数据集的速度也要比 AOF 恢复的速度要快, 除此之外, 使用 RDB 还可以避免之前提到的 AOF 程序的 bug 。
Note: 因为以上提到的种种原因, 未来我们可能会将 AOF 和 RDB 整合成单个持久化模型。 (这是一个长期计划。) 接下来的几个小节将介绍 RDB 和 AOF 的更多细节。

7 集群添加新节点

添加新的节点的基本过程就是添加一个空的节点然后移动一些数据给它,有两种情况,添加一个主节点和添加一个从节点(添加从节点时需要将这个新的节点设置为集群中某个节点的复制)

7.1 添加一个主节点

命令
./redis-cli --cluster add-node 新节点ip:端口 集群中已经存在的任意节点的ip:端口

7.1.1 启动一个新的redis实例

写实例compose文件

[root@ceph-01 opt]# cat /home/zshield/docker/compose/redis-6381.yml 
version: "3"
services:
redis-6381:
container_name: redis-6381
restart: always
image: redis:6.0.15
network_mode: host
volumes:
- "/home/zshield/conf/redis-6381/redis.conf:/usr/local/etc/redis/redis.conf"
- "/home/zshield/BIGdata/redis_data/6381/:/data"
- /etc/localtime:/etc/localtime
command: redis-server /usr/local/etc/redis/redis.conf

配置新实例

[root@ceph-01 opt]# egrep -v '^#|^$' /home/zshield/conf/redis-6381/redis.conf 
bind 10.0.0.2
protected-mode yes
port 6381
tcp-backlog 511
timeout 0
tcp-keepalive 300
daemonize no
supervised no
pidfile /var/run/redis_6381.pid
loglevel notice
databases 16
always-show-logo yes
save 900 1
save 300 10
save 60 10000
stop-writes-on-bgsave-error yes
rdbcompression yes
rdbchecksum yes
dbfilename dump.rdb
rdb-del-sync-files no
dir /data/
masterauth 123456
replica-serve-stale-data yes
replica-read-only yes
repl-diskless-sync no
repl-diskless-sync-delay 5
repl-diskless-load disabled
repl-disable-tcp-nodelay no
replica-priority 100
acllog-max-len 128
requirepass 123456
lazyfree-lazy-eviction no
lazyfree-lazy-expire no
lazyfree-lazy-server-del no
replica-lazy-flush no
lazyfree-lazy-user-del no
oom-score-adj no
oom-score-adj-values 0 200 800
appendonly yes
appendfilename "appendonly.aof"
appendfsync everysec
no-appendfsync-on-rewrite no
auto-aof-rewrite-percentage 100
auto-aof-rewrite-min-size 64mb
aof-load-truncated yes
aof-use-rdb-preamble yes
lua-time-limit 5000
cluster-enabled yes
cluster-config-file nodes-6381.conf
cluster-node-timeout 15000
cluster-announce-port 6381
cluster-announce-bus-port 16381
slowlog-log-slower-than 10000
slowlog-max-len 128
latency-monitor-threshold 0
notify-keyspace-events ""
hash-max-ziplist-entries 512
hash-max-ziplist-value 64
list-max-ziplist-size -2
list-compress-depth 0
set-max-intset-entries 512
zset-max-ziplist-entries 128
zset-max-ziplist-value 64
hll-sparse-max-bytes 3000
stream-node-max-bytes 4096
stream-node-max-entries 100
activerehashing yes
client-output-buffer-limit normal 0 0 0
client-output-buffer-limit replica 256mb 64mb 60
client-output-buffer-limit pubsub 32mb 8mb 60
hz 10
dynamic-hz yes
aof-rewrite-incremental-fsync yes
rdb-save-incremental-fsync yes
jemalloc-bg-thread yes

启动新实例

[root@ceph-01 opt]# docker-compose -f /home/zshield/docker/compose/redis-6381.yml up -d

7.1.2 将新加点加入集群

[root@ceph-01 opt]# docker exec -it redis-6379 redis-cli -a 123456 --cluster add-node 10.0.0.2:6381 10.0.0.2:6380
Warning: Using a password with '-a' or '-u' option on the command line interface may not be safe.
>>> Adding node 10.0.0.2:6381 to cluster 10.0.0.2:6380
>>> Performing Cluster Check (using node 10.0.0.2:6380)
S: 580e1d888ee0edb7f458aa45f3c135f4fb30267c 10.0.0.2:6380
slots: (0 slots) slave
replicates a5e9ac8b1fa24e5210a5229d9152d09295de94fa
M: 5236059f29c871c274ccc9dfda397c342b9ba58f 10.0.0.3:6379
slots:[5461-10922] (5462 slots) master
1 additional replica(s)
S: 9c3c6d768605804166e34a038904f25fbc474686 10.0.0.4:6380
slots: (0 slots) slave
replicates 5236059f29c871c274ccc9dfda397c342b9ba58f
M: a5e9ac8b1fa24e5210a5229d9152d09295de94fa 10.0.0.4:6379
slots:[10923-16383] (5461 slots) master
1 additional replica(s)
M: 59fa5deeea5f61fccd198bfdfa55a00529456c30 10.0.0.2:6379
slots:[0-5460] (5461 slots) master
1 additional replica(s)
S: 6bd52218d3dabcee17277bde9c577837df6abc32 10.0.0.3:6380
slots: (0 slots) slave
replicates 59fa5deeea5f61fccd198bfdfa55a00529456c30
[OK] All nodes agree about slots configuration.
>>> Check for open slots...
>>> Check slots coverage...
[OK] All 16384 slots covered.
>>> Send CLUSTER MEET to node 10.0.0.2:6381 to make it join the cluster.
[OK] New node added correctly.
[root@ceph-01 opt]# docker exec -it redis-6379 redis-cli -a 123456 --cluster info 10.0.0.2:6379
Warning: Using a password with '-a' or '-u' option on the command line interface may not be safe.
10.0.0.2:6379 (59fa5dee...) -> 0 keys | 5461 slots | 1 slaves.
10.0.0.3:6379 (5236059f...) -> 0 keys | 5462 slots | 1 slaves.
10.0.0.2:6381 (f4a51f8c...) -> 0 keys | 0 slots | 0 slaves.
10.0.0.4:6379 (a5e9ac8b...) -> 0 keys | 5461 slots | 1 slaves.
[OK] 0 keys in 4 masters.
0.00 keys per slot on average.

移动哈希槽

[root@ceph-01 opt]# docker exec -it redis-6379 redis-cli -a 123456 --cluster reshard 10.0.0.2:6379

输出略,根据提示执行即可

移动成功后查看集群信息

[root@ceph-01 opt]# docker exec -it redis-6379 redis-cli -a 123456 --cluster info 10.0.0.2:6379
Warning: Using a password with '-a' or '-u' option on the command line interface may not be safe.
10.0.0.2:6379 (59fa5dee...) -> 0 keys | 5128 slots | 1 slaves.
10.0.0.3:6379 (5236059f...) -> 0 keys | 5128 slots | 1 slaves.
10.0.0.2:6381 (f4a51f8c...) -> 0 keys | 1000 slots | 0 slaves.
10.0.0.4:6379 (a5e9ac8b...) -> 0 keys | 5128 slots | 1 slaves.
[OK] 0 keys in 4 masters.
0.00 keys per slot on average.

7.2 添加一个从节点

7.2.1 启动一个新的redis实例

写实例compose文件

[root@ceph-02 ~]# docker-compose -f /home/zshield/docker/compose/redis-6380.yml up -d
WARNING: Found orphan containers (redis-6379) for this project. If you removed or renamed this service in your compose file, you can run this command with the --remove-orphans flag to clean it up.
Creating redis-6380 ... done

[root@ceph-02 ~]#

[root@ceph-02 ~]# cat /home/zshield/docker/compose/redis-6381.yml 
version: "3"
services:
redis-6381:
container_name: redis-6381
restart: always
image: redis:6.0.15
network_mode: host
volumes:
- "/home/zshield/conf/redis-6381/redis.conf:/usr/local/etc/redis/redis.conf"
- "/home/zshield/BIGdata/redis_data/6381/:/data"
- /etc/localtime:/etc/localtime
command: redis-server /usr/local/etc/redis/redis.conf

配置实例

[root@ceph-02 ~]# egrep -v "^$|^#" /home/zshield/conf/redis-6381/redis.conf 
bind 10.0.0.3
protected-mode yes
port 6381
tcp-backlog 511
timeout 0
tcp-keepalive 300
daemonize no
supervised no
pidfile /var/run/redis_6381.pid
loglevel notice
databases 16
always-show-logo yes
save 900 1
save 300 10
save 60 10000
stop-writes-on-bgsave-error yes
rdbcompression yes
rdbchecksum yes
dbfilename dump.rdb
rdb-del-sync-files no
dir /data/
masterauth 123456
replica-serve-stale-data yes
replica-read-only yes
repl-diskless-sync no
repl-diskless-sync-delay 5
repl-diskless-load disabled
repl-disable-tcp-nodelay no
replica-priority 100
acllog-max-len 128
requirepass 123456
lazyfree-lazy-eviction no
lazyfree-lazy-expire no
lazyfree-lazy-server-del no
replica-lazy-flush no
lazyfree-lazy-user-del no
oom-score-adj no
oom-score-adj-values 0 200 800
appendonly yes
appendfilename "appendonly.aof"
appendfsync everysec
no-appendfsync-on-rewrite no
auto-aof-rewrite-percentage 100
auto-aof-rewrite-min-size 64mb
aof-load-truncated yes
aof-use-rdb-preamble yes
lua-time-limit 5000
cluster-enabled yes
cluster-config-file nodes-6381.conf
cluster-node-timeout 15000
cluster-announce-port 6381
cluster-announce-bus-port 16381
slowlog-log-slower-than 10000
slowlog-max-len 128
latency-monitor-threshold 0
notify-keyspace-events ""
hash-max-ziplist-entries 512
hash-max-ziplist-value 64
list-max-ziplist-size -2
list-compress-depth 0
set-max-intset-entries 512
zset-max-ziplist-entries 128
zset-max-ziplist-value 64
hll-sparse-max-bytes 3000
stream-node-max-bytes 4096
stream-node-max-entries 100
activerehashing yes
client-output-buffer-limit normal 0 0 0
client-output-buffer-limit replica 256mb 64mb 60
client-output-buffer-limit pubsub 32mb 8mb 60
hz 10
dynamic-hz yes
aof-rewrite-incremental-fsync yes
rdb-save-incremental-fsync yes
jemalloc-bg-thread yes

启动新实例

[root@ceph-02 ~]# docker-compose -f /home/zshield/docker/compose/redis-6381.yml up -d

7.2.2 添加新节点为从节点

#将新节点加入集群

[root@ceph-01 opt]# docker exec -it redis-6379 redis-cli -a 123456 --cluster add-node 10.0.0.3:6381 10.0.0.2:6379
输出略

注意:不要分配哈希槽给新加点

#查询集群中master节点(10.0.0.2:6379)的id

[root@ceph-01 opt]# docker exec -it redis-6379 redis-cli -a 123456 -h 10.0.0.2 cluster nodes
Warning: Using a password with '-a' or '-u' option on the command line interface may not be safe.
a6aceb0be7ec0b335158dbc42c9aa1ff42d78b14 10.0.0.3:6381@16381 master - 0 1632641977310 0 connected
9c3c6d768605804166e34a038904f25fbc474686 10.0.0.4:6380@16380 slave 5236059f29c871c274ccc9dfda397c342b9ba58f 0 1632641978000 3 connected
5236059f29c871c274ccc9dfda397c342b9ba58f 10.0.0.3:6379@16379 master - 0 1632641976294 3 connected 5795-10922
f4a51f8c7b9972cd8f6a08f2aebc86532f6ee3c6 10.0.0.2:6381@16381 master - 0 1632641975000 7 connected 0-332 5461-5794 10923-11255
6bd52218d3dabcee17277bde9c577837df6abc32 10.0.0.3:6380@16380 slave 59fa5deeea5f61fccd198bfdfa55a00529456c30 0 1632641978326 1 connected
59fa5deeea5f61fccd198bfdfa55a00529456c30 10.0.0.2:6379@16379 myself,master - 0 1632641973000 1 connected 333-5460
a5e9ac8b1fa24e5210a5229d9152d09295de94fa 10.0.0.4:6379@16379 master - 0 1632641975279 5 connected 11256-16383
580e1d888ee0edb7f458aa45f3c135f4fb30267c 10.0.0.2:6380@16380 slave a5e9ac8b1fa24e5210a5229d9152d09295de94fa 0 1632641974000 5 connected

#登录10.0.0.3:6381

[root@ceph-01 opt]# docker exec -it redis-6379 redis-cli -a 123456 -c -h 10.0.0.3 -p 6381
Warning: Using a password with '-a' or '-u' option on the command line interface may not be safe.

#将10.0.0.3:6381作为10.0.0.2:6379的从节点

10.0.0.3:6381> cluster replicate 59fa5deeea5f61fccd198bfdfa55a00529456c30

OK

#验证

10.0.0.3:6381> cluster nodes
a5e9ac8b1fa24e5210a5229d9152d09295de94fa 10.0.0.4:6379@16379 master - 0 1632642002000 5 connected 11256-16383
9c3c6d768605804166e34a038904f25fbc474686 10.0.0.4:6380@16380 slave 5236059f29c871c274ccc9dfda397c342b9ba58f 0 1632642003412 3 connected
580e1d888ee0edb7f458aa45f3c135f4fb30267c 10.0.0.2:6380@16380 slave a5e9ac8b1fa24e5210a5229d9152d09295de94fa 0 1632642001000 5 connected
a6aceb0be7ec0b335158dbc42c9aa1ff42d78b14 10.0.0.3:6381@16381 myself,slave 59fa5deeea5f61fccd198bfdfa55a00529456c30 0 1632642001000 1 connected
5236059f29c871c274ccc9dfda397c342b9ba58f 10.0.0.3:6379@16379 master - 0 1632642004422 3 connected 5795-10922
f4a51f8c7b9972cd8f6a08f2aebc86532f6ee3c6 10.0.0.2:6381@16381 master - 0 1632642002390 7 connected 0-332 5461-5794 10923-11255
6bd52218d3dabcee17277bde9c577837df6abc32 10.0.0.3:6380@16380 slave f4a51f8c7b9972cd8f6a08f2aebc86532f6ee3c6 0 1632642003000 7 connected
59fa5deeea5f61fccd198bfdfa55a00529456c30 10.0.0.2:6379@16379 master - 0 1632642002000 1 connected 333-5460

8 移除节点

8.1 移除从节点

命令格式
redis-cli -a 123456 --cluster del-node 集群任意节点:端口 要移除的节点id

#查看集群信息

[root@ceph-01 opt]# docker exec -it redis-6379 redis-cli -a 123456 -c -h 10.0.0.2 cluster nodes
Warning: Using a password with '-a' or '-u' option on the command line interface may not be safe.
a6aceb0be7ec0b335158dbc42c9aa1ff42d78b14 10.0.0.3:6381@16381 slave 59fa5deeea5f61fccd198bfdfa55a00529456c30 0 1632643053672 1 connected
9c3c6d768605804166e34a038904f25fbc474686 10.0.0.4:6380@16380 slave 5236059f29c871c274ccc9dfda397c342b9ba58f 0 1632643054683 3 connected
5236059f29c871c274ccc9dfda397c342b9ba58f 10.0.0.3:6379@16379 master - 0 1632643052000 3 connected 5795-10922
f4a51f8c7b9972cd8f6a08f2aebc86532f6ee3c6 10.0.0.2:6381@16381 master - 0 1632643053570 7 connected 0-332 5461-5794 10923-11255
6bd52218d3dabcee17277bde9c577837df6abc32 10.0.0.3:6380@16380 slave f4a51f8c7b9972cd8f6a08f2aebc86532f6ee3c6 0 1632643050000 7 connected
59fa5deeea5f61fccd198bfdfa55a00529456c30 10.0.0.2:6379@16379 myself,master - 0 1632643052000 1 connected 333-5460
a5e9ac8b1fa24e5210a5229d9152d09295de94fa 10.0.0.4:6379@16379 master - 0 1632643051000 5 connected 11256-16383
580e1d888ee0edb7f458aa45f3c135f4fb30267c 10.0.0.2:6380@16380 slave a5e9ac8b1fa24e5210a5229d9152d09295de94fa 0 1632643053000 5 connected

#移除10.0.0.3:6381节点

[root@ceph-01 opt]# docker exec -it redis-6379 redis-cli -a 123456 --cluster del-node 10.0.0.2:6379 a6aceb0be7ec0b335158dbc42c9aa1ff42d78b14
Warning: Using a password with '-a' or '-u' option on the command line interface may not be safe.
>>> Removing node a6aceb0be7ec0b335158dbc42c9aa1ff42d78b14 from cluster 10.0.0.2:6379
>>> Sending CLUSTER FORGET messages to the cluster...
>>> Sending CLUSTER RESET SOFT to the deleted node.

#验证是否移除

[root@ceph-01 opt]# docker exec -it redis-6379 redis-cli -a 123456 -c -h 10.0.0.2 cluster nodes
Warning: Using a password with '-a' or '-u' option on the command line interface may not be safe.
9c3c6d768605804166e34a038904f25fbc474686 10.0.0.4:6380@16380 slave 5236059f29c871c274ccc9dfda397c342b9ba58f 0 1632643107000 3 connected
5236059f29c871c274ccc9dfda397c342b9ba58f 10.0.0.3:6379@16379 master - 0 1632643106000 3 connected 5795-10922
f4a51f8c7b9972cd8f6a08f2aebc86532f6ee3c6 10.0.0.2:6381@16381 master - 0 1632643104639 7 connected 0-332 5461-5794 10923-11255
6bd52218d3dabcee17277bde9c577837df6abc32 10.0.0.3:6380@16380 slave f4a51f8c7b9972cd8f6a08f2aebc86532f6ee3c6 0 1632643107692 7 connected
59fa5deeea5f61fccd198bfdfa55a00529456c30 10.0.0.2:6379@16379 myself,master - 0 1632643104000 1 connected 333-5460
a5e9ac8b1fa24e5210a5229d9152d09295de94fa 10.0.0.4:6379@16379 master - 0 1632643106676 5 connected 11256-16383
580e1d888ee0edb7f458aa45f3c135f4fb30267c 10.0.0.2:6380@16380 slave a5e9ac8b1fa24e5210a5229d9152d09295de94fa 0 1632643108709 5 connected

8.2 移除主节点

命令格式
redis-cli -a 123456 --cluster del-node 集群任意节点:端口 要移除的节点id
注意:移除主节点之前要先将该主节点上的哈希槽移除
这里演示移除10.0.0.2:6381主节点

将10.0.0.2:6381节点的哈希槽移除

[root@ceph-01 opt]# docker exec -it redis-6379 redis-cli -a 123456 --cluster reshard 10.0.0.2:6379
Warning: Using a password with '-a' or '-u' option on the command line interface may not be safe.
>>> Performing Cluster Check (using node 10.0.0.2:6379)
M: 59fa5deeea5f61fccd198bfdfa55a00529456c30 10.0.0.2:6379
slots:[333-5460] (5128 slots) master
S: 9c3c6d768605804166e34a038904f25fbc474686 10.0.0.4:6380
slots: (0 slots) slave
replicates 5236059f29c871c274ccc9dfda397c342b9ba58f
M: 5236059f29c871c274ccc9dfda397c342b9ba58f 10.0.0.3:6379
slots:[5795-10922] (5128 slots) master
1 additional replica(s)
M: f4a51f8c7b9972cd8f6a08f2aebc86532f6ee3c6 10.0.0.2:6381
slots:[0-332],[5461-5794],[10923-11255] (1000 slots) master
1 additional replica(s)
S: 6bd52218d3dabcee17277bde9c577837df6abc32 10.0.0.3:6380
slots: (0 slots) slave
replicates f4a51f8c7b9972cd8f6a08f2aebc86532f6ee3c6
M: a5e9ac8b1fa24e5210a5229d9152d09295de94fa 10.0.0.4:6379
slots:[11256-16383] (5128 slots) master
1 additional replica(s)
S: 580e1d888ee0edb7f458aa45f3c135f4fb30267c 10.0.0.2:6380
slots: (0 slots) slave
replicates a5e9ac8b1fa24e5210a5229d9152d09295de94fa
[OK] All nodes agree about slots configuration.
>>> Check for open slots...
>>> Check slots coverage...
[OK] All 16384 slots covered.
How many slots do you want to move (from 1 to 16384)? 1000
What is the receiving node ID? 59fa5deeea5f61fccd198bfdfa55a00529456c30
Please enter all the source node IDs.
Type 'all' to use all the nodes as source nodes for the hash slots.
Type 'done' once you entered all the source nodes IDs.
Source node #1: f4a51f8c7b9972cd8f6a08f2aebc86532f6ee3c6

Source node #2: done

输出略

Do you want to proceed with the proposed reshard plan (yes/no)?yes

输出略

#验证是否移除成功

[root@ceph-01 opt]# docker exec -it redis-6379 redis-cli -a 123456 --cluster info 10.0.0.2:6379
Warning: Using a password with '-a' or '-u' option on the command line interface may not be safe.
10.0.0.2:6379 (59fa5dee...) -> 0 keys | 6128 slots | 1 slaves.
10.0.0.3:6379 (5236059f...) -> 0 keys | 5128 slots | 1 slaves.
10.0.0.2:6381 (f4a51f8c...) -> 0 keys | 0 slots | 0 slaves.
10.0.0.4:6379 (a5e9ac8b...) -> 0 keys | 5128 slots | 1 slaves.
[OK] 0 keys in 4 masters.
0.00 keys per slot on average.

移除节点
#查看要移除的节点的id

[root@ceph-01 opt]# docker exec -it redis-6379 redis-cli -a 123456 -h 10.0.0.2 -c cluster nodes
Warning: Using a password with '-a' or '-u' option on the command line interface may not be safe.
9c3c6d768605804166e34a038904f25fbc474686 10.0.0.4:6380@16380 slave 5236059f29c871c274ccc9dfda397c342b9ba58f 0 1632644410306 3 connected
5236059f29c871c274ccc9dfda397c342b9ba58f 10.0.0.3:6379@16379 master - 0 1632644409289 3 connected 5795-10922
f4a51f8c7b9972cd8f6a08f2aebc86532f6ee3c6 10.0.0.2:6381@16381 master - 0 1632644410000 7 connected
6bd52218d3dabcee17277bde9c577837df6abc32 10.0.0.3:6380@16380 slave 59fa5deeea5f61fccd198bfdfa55a00529456c30 0 1632644411325 8 connected
59fa5deeea5f61fccd198bfdfa55a00529456c30 10.0.0.2:6379@16379 myself,master - 0 1632644409000 8 connected 0-5794 10923-11255
a5e9ac8b1fa24e5210a5229d9152d09295de94fa 10.0.0.4:6379@16379 master - 0 1632644407248 5 connected 11256-16383
580e1d888ee0edb7f458aa45f3c135f4fb30267c 10.0.0.2:6380@16380 slave a5e9ac8b1fa24e5210a5229d9152d09295de94fa 0 1632644408265 5 connected

#移除节点

[root@ceph-01 opt]# docker exec -it redis-6379 redis-cli -a 123456 --cluster del-node 10.0.0.2:6379 f4a51f8c7b9972cd8f6a08f2aebc86532f6ee3c6
Warning: Using a password with '-a' or '-u' option on the command line interface may not be safe.
>>> Removing node f4a51f8c7b9972cd8f6a08f2aebc86532f6ee3c6 from cluster 10.0.0.2:6379
>>> Sending CLUSTER FORGET messages to the cluster...
>>> Sending CLUSTER RESET SOFT to the deleted node.

#验证是否移除

[root@ceph-01 opt]# docker exec -it redis-6379 redis-cli -a 123456 -h 10.0.0.2 -c cluster nodes
Warning: Using a password with '-a' or '-u' option on the command line interface may not be safe.
9c3c6d768605804166e34a038904f25fbc474686 10.0.0.4:6380@16380 slave 5236059f29c871c274ccc9dfda397c342b9ba58f 0 1632644500881 3 connected
5236059f29c871c274ccc9dfda397c342b9ba58f 10.0.0.3:6379@16379 master - 0 1632644501901 3 connected 5795-10922
6bd52218d3dabcee17277bde9c577837df6abc32 10.0.0.3:6380@16380 slave 59fa5deeea5f61fccd198bfdfa55a00529456c30 0 1632644502917 8 connected
59fa5deeea5f61fccd198bfdfa55a00529456c30 10.0.0.2:6379@16379 myself,master - 0 1632644501000 8 connected 0-5794 10923-11255
a5e9ac8b1fa24e5210a5229d9152d09295de94fa 10.0.0.4:6379@16379 master - 0 1632644502000 5 connected 11256-16383
580e1d888ee0edb7f458aa45f3c135f4fb30267c 10.0.0.2:6380@16380 slave a5e9ac8b1fa24e5210a5229d9152d09295de94fa 0 1632644499000 5 connected

9 redis配置参数解释

9.1 standalone配置

9.1.2 总体配置

redis集群支持lua脚本吗 redis集群16384_redis


redis集群支持lua脚本吗 redis集群16384_docker_02

9.1.2 最大内存及策略

redis集群支持lua脚本吗 redis集群16384_数据库_03

9.1.3 AOF相关配置

redis集群支持lua脚本吗 redis集群16384_2d_04


redis集群支持lua脚本吗 redis集群16384_redis集群支持lua脚本吗_05

9.1.4 RDB相关配置

redis集群支持lua脚本吗 redis集群16384_2d_06

9.1.5 慢查询配置

redis集群支持lua脚本吗 redis集群16384_docker_07

9.1.6 数据结构优化配置

redis集群支持lua脚本吗 redis集群16384_redis_08

9.1.7 复制相关配置

redis集群支持lua脚本吗 redis集群16384_redis_09


redis集群支持lua脚本吗 redis集群16384_2d_10

9.1.8 客户端相关配置

redis集群支持lua脚本吗 redis集群16384_数据库_11

9.1.9 安全相关配置

redis集群支持lua脚本吗 redis集群16384_数据库_12

9.2 sentinel配置

redis集群支持lua脚本吗 redis集群16384_docker_13

9.3 cluster配置

redis集群支持lua脚本吗 redis集群16384_数据库_14


redis集群支持lua脚本吗 redis集群16384_redis_15

10 redis优化

10.1 操作系统优化

10.1.1 TCP backlog

谈优化前先来看看redis启动日志输出的警告信息

1:M 27 Sep 2021 14:38:47.610 # WARNING: The TCP backlog setting of 511 cannot be enforced because /proc/sys/net/core/somaxconn is set to the lower value of 128.

警告是因为redis配置文件中配置了tcp-backlog 511(511为默认值),而操作系统内核参数/proc/sys/net/core/somaxconn值是128(默认值),小于redis配置值。
具体优化方法,日志中已经给出,如下
注意:root用户操作

echo 511 > /proc/sys/net/core/somaxconn
echo "net.core.somaxconn = 511" >> /etc/sysctl.conf

10.1.2 THP

谈优化前先来看看redis启动日志输出的警告信息

1:M 27 Sep 2021 14:38:47.610 # WARNING you have Transparent Huge Pages (THP) support enabled in your kernel. This will create latency and memory usage issues with Redis. To fix this issue run the command 'echo madvise > /sys/kernel/mm/transparent_hugepage/enabled' as root, and add it to your /etc/rc.local in order to retain the setting after a reboot. Redis must be restarted after THP is disabled (set to 'madvise' or 'never').

警告是因为操作系统内核(2.6.38之后)增加了THP(Transparent Huge Pages)特性,支持大内存页(2MB)分配。这个redis fork操作之后,每个内存页从原来4KB变为2MB,会大幅增加重写期间父进程内存消耗。同时每次写命令引起的复制内存页单位放大了512倍,会拖慢写操作的执行时间,导致大量写操作慢查询,例如简单的incr命令也会出现在慢查询中。
具体优化方法,日志中已经给出,如下
注意:root用户操作

echo never > /sys/kernel/mm/transparent_hugepage/enabled
echo "echo never > /sys/kernel/mm/transparent_hugepage/enabled" >>  /etc/rc.local

10.1.3 ulimit

Redis允许同时有多个客户端通过网络进行连接,可以通过配置maxclients来限制最大客户端连接数。对Linux操作系统来说,这些网络连接都是文件句柄。maxclients的值要小于等于系统最大打开文件数。
Redis建议把open files至少设置成10032,那么这个10032是如何来的呢?因为maxclients默认是10000,这些是用来处理客户端连接的,除此之外,Redis内部会使用最多32个文件描述符,所以这里的10032=10000+32。
优化如下
注意:root用户操作

ulimit -n 10032
echo "* soft nofile 10032" >> /etc/security/limits.conf 
echo "* hard nofile 10032" >> /etc/security/limits.conf

10.1.4 内存分配控制

vm.overcommit_memory用来设置内存分配策略,有三个可选值,如表

redis集群支持lua脚本吗 redis集群16384_docker_16

可用内存代表物理内存与swap之和

值	含义
0	表示内核将检查是否有足够的可用内存。如果有足够的可用内存,内存申请通过,否则内存申请失败,并把错误返回给应用进程
1	表示内核允许超量使用内存直到用完为止
2	表示内核绝不过量的(“never overcommit”)使用内存,即系统整个内存地址空间不能超过swap+50%的RAM值,50%是overcommit_ratio的默认值,此参数同样支持修改

Redis建议把这个值设置为1,是为了让fork操作能够在低内存下也执行成功。
注意:root用户操作

echo "vm.overcommit_memory = 1" >> /etc/sysctl.conf

10.1.5 swappiness

swap对于操作系统来比较重要,当物理内存不足时,可以将一部分内存页进行swap操作,已解燃眉之急。但世界上没有免费午餐,swap空间由硬盘提供,对于需要高并发、高吞吐的应用来说,磁盘IO通常会成为系统瓶颈。在Linux中,并不是要等到所有物理内存都使用完才会使用到swap,系统参数swppiness会决定操作系统使用swap的倾向程度。swappiness的取值范围是0~100,swappiness的值越大,说明操作系统可能使用swap的概率越高,swappiness值越低,表示操作系统更加倾向于使用物理内存。swap的默认值是60,了解这个值的含义后,有利于Redis的性能优化。

swappiness重要值策略说明

redis集群支持lua脚本吗 redis集群16384_redis集群支持lua脚本吗_17

值	策略
0	linux3.5以及以上:宁愿用OOM killer也不用swap
linux3.4以及更早:宁愿用swap也不用OOM killer
1	linux3.5以及以上:宁愿用swap也不用OOM killer
60	默认值
100	操作系统会主动使用swap

注意:root用户操作

echo vm.swappiness=10 >> /etc/sysctl.conf

10.2 redis优化

10.2.1 慢查询

相关知识点解释
Redis提供了slowlog-log-slower-than和slowlog-max-len配置来解决这两个问题。从字面意思就可以看出,slowlog-log-slower-than就是那个预设阀值,它的单位是微秒(1秒=1000毫秒=1000000微秒),默认值是10000,假如执行了一条“很慢”的命令(例如keys*),如果它的执行时间超过了10000微秒,那么它将被记录在慢查询日志中。
如果slowlog-log-slower-than=0会记录所有的命令,slowlog-log-slower-than<0对于任何命令都不会进行记录。
slowlog-max-len只是说明了慢查询日志最多存储多少条,并没有说明存放在哪里?实际上Redis使用了一个列表来存储慢查询日志,slowlog-max-len就是列表的最大长度。默认值128
一个新的命令满足慢查询条件时被插入到这个列表中,当慢查询日志列表已处于其最大长度时,最早插入的一个命令将从列表中移出,例如slowlog-max-len设置为5,当有第6条慢查询插入的话,那么队头的第一条数据就出列,第6条慢查询就会入列。

虽然慢查询日志是存放在Redis内存列表中的,但是Redis并没有暴露这个列表的键,而是通过一组命令来实现对慢查询日志的访问和管理。下面介绍这几个命令。
(1)获取慢查询日志
slowlog get [n]
参数n可以指定要查询的条数
(2)获取慢查询日志当前的长度
slowlog len
(3)慢查询日志重置
slowlog reset
执行该命令会对慢查询列表进行清理操作
优化建议
(1)slowlog-max-len配置建议:线上建议调大慢查询列表,记录慢查询时Redis会对长命令做截断操作,并不会占用大量内存。增大慢查询列表可以减缓慢查询被剔除的可能,例如线上可设置为1000以上。
(2)slowlog-log-slower-than配置建议:默认值超过10毫秒判定为慢查询,需要根据Redis并发量调整该值。由于Redis采用单线程响应命令,对于高流量的场景,如果命令执行时间在1毫秒以上,那么Redis最多可支撑OPS不到1000。因此对于高OPS场景的Redis建议设置为1毫秒。
(3)慢查询只记录命令执行时间,并不包括命令排队和网络传输时间。因此客户端执行命令的时间会大于命令实际执行时间。因为命令执行排队机制,慢查询会导致其他命令级联阻塞,因此当客户端出现请求超时,需要检查该时间点是否有对应的慢查询,从而分析出是否为慢查询导致的命令级联阻塞。
(4)由于慢查询日志是一个先进先出的队列,也就是说如果慢查询比较多的情况下,可能会丢失部分慢查询命令,为了防止这种情况发生,可以定期执行slow get命令将慢查询日志持久化到其他存储中。

10.2.2 fork优化

当Redis做RDB或AOF重写时,一个必不可少的操作就是执行fork操作创建子进程,对于大多数操作系统来说fork是个重量级错误。虽然fork创建的子进程不需要拷贝父进程的物理内存空间,但是会复制父进程的空间内存页表。例如对于10GB的Redis进程,需要复制大约20MB的内存页表,因此fork操作耗时跟进程总内存量息息相关,如果使用虚拟化技术,特别是Xen虚拟机,fork操作会更耗时。
可以在info stats统计中查latest_fork_usec指标获取最近一次fork操作耗时,单位微秒。

优化建议
1)优先使用物理机或者高效支持fork操作的虚拟化技术,避免使用Xen。
2)控制Redis实例最大可用内存,fork耗时跟内存量成正比,线上建议每个Redis实例内存控制在10GB以内。
3)合理配置Linux内存分配策略,避免物理内存不足导致fork失败,具体细节见上文。
4)降低fork操作的频率,如适度放宽AOF自动触发时机,避免不必要的全量复制等。redis默认的AOF触发时机默认值如下,适当调大
auto-aof-rewrite-percentage 100
auto-aof-rewrite-min-size 64mb
5)Redis是CPU密集型服务,不要做绑定单核CPU操作。由于子进程非常消耗CPU,会和父进程产生单核资源竞争。不要和其他CPU密集型服务部署在一起,造成CPU过度竞争。
6)如果部署多个Redis实例,尽量保证同一时刻只有一个子进程执行重写工作,info persistence配合外部程序实现。
7)内存消耗监控。RDB重写时,Redis日志输出容如下:

1:M 28 Sep 2021 10:38:36.109 * Background saving started by pid 26
26:C 28 Sep 2021 10:38:36.111 * DB saved on disk
26:C 28 Sep 2021 10:38:36.112 * RDB: 2 MB of memory used by copy-on-write
1:M 28 Sep 2021 10:38:36.193 * Background saving terminated with success

AOF重写时,Redis日志输出容如下:

Background append only file rewriting started by pid 8937
AOF rewrite child asks to stop sending diffs.
Parent agreed to stop sending diffs. Finalizing AOF...
Concatenating 0.00 MB of AOF diff received from parent.
SYNC append only file rewrite performed
AOF rewrite: 53 MB of memory used by copy-on-write
Background AOF rewrite terminated with success
Residual parent diff successfully flushed to the rewritten AOF (1.49 MB)
Background AOF rewrite finished successfully

通过编写shell脚本监测内存消耗情况。
8)AOF重写时会消耗大量硬盘IO,可以开启配置no-appendfsync-on-rewrite,默认关闭。表示在AOF重写期间不做fsync操作。配置no-appendfsync-on-rewrite=yes时,在极端情况下可能丢失整个AOF重写期间的数据,需要根据数据安全性决定是否配置。
9)对于单机配置多个Redis实例的情况,可以配置不同实例分盘存储AOF文件,分摊硬盘写入压力。