前言
上次分享了barman备份的安装部署过程,这次将继续分享一下barman备份恢复工具的使用过程。
barman是目前postgreSQL数据库的比较成熟的一款备份工具,是基于postgresql的一款物理备份工具。与oracle数据库中的rman较为类似。
其简单的工作原理即为:
物理全备+归档备份(WAL日志备份),从而达到任意时间点的恢复。即恢复全备后,通过指定WAL日志的时间点、LSN等即可恢复到数据库的任意时间点。
barman备份主要分为三类:
1.仅流复制;
2.ssh/rsync;
3.流复制+ssh/rsync
上一次,已经完成了barman工具的安装部署工作。本文中,将对postgresql和barman进行配置并测试备份恢复。
一、postgreSQL配置
参数配置(具体参数可以根据实际情况配置):
listen_addresses = '*'
port = 18801
max_connections = 100
superuser_reserved_connections = 3
tcp_keepalives_idle = 7200
tcp_keepalives_interval = 130
tcp_keepalives_count = 3
shared_buffers = 256MB
work_mem = 16MB
maintenance_work_mem = 128MB
wal_level = logical
full_page_writes = on
max_wal_size = 1GB
min_wal_size = 80MB
wal_keep_segments = 32
logging_collector = on
log_directory = '/PgData/pg_log'
log_filename = 'postgresql-%Y-%m-%d_%H%M%S.log'
log_truncate_on_rotation = on
log_rotation_age = 1d
log_rotation_size = 0
log_min_messages = info
log_min_error_statement = info
shared_preload_libraries = 'pg_stat_statements' #加载pg_stat_statements模块
track_io_timing = on #跟踪IO消耗的时间
pg_stat_statements.max = 10000 #最多保留多少条统计信息,通过LRU算法,覆盖老的记录。
pg_stat_statements.track = all #all:所有SQL包括函数内嵌套的SQL,top:直接执行的SQL(函数内的sql不被跟踪),none:不跟踪
pg_stat_statements.track_utility=on # 是否跟踪非DML语句 (例如DDL,DCL),on表示跟踪, off表示不跟踪
pg_stat_statements.save = on # 重启后是否保留统计信息
#track_activity_query_size = 2048 #设置单条SQL的最长长度,超过被截断显示
修改pg_hba.conf
host all barman 0.0.0.0/0 trust
host replication barmstr 0.0.0.0/0 trust
postgresql数据库上创建用户(barman需要superuser权限,barmanStr需要replication权限):
create user barman superuser password 'barman';
create user barmanStr replication password 'barman';
重启数据库:
pg_ctl -D /PgData start
连接尝试:
在备份服务器上尝试连接验证:
psql -h 192.168.138.141 -U barman -d wangxin -p 18801
psql -h 192.168.138.141 -U barmstr -d wangxin -p 18801
二、barman配置:
创建用户:
useradd barman
修改barman的配置文件:
vi /etc/barman.conf
[barman]
barman_user = barman
configuration_files_directory = /etc/barman.d
;指定barman_home,即备份文件的存放路径
barman_home = /home/barman
log_file = /home/barman/barman.log
log_level = INFO
compression = gzip
parallel_jobs = 1
immediate_checkpoint = true
basebackup_retry_times = 3
basebackup_retry_sleep = 60
check_timeout = 120
retention_policy = REDUNDANCY 2
retention_policy = RECOVERY WINDOW OF 4 WEEKS
修改服务配置文件:
cp streaming-server.conf-template pgsql-streaming.conf
vi pgsql-streaming.conf
内容如下:
[pgsql_streaming]
description = "pgsql11-streaming (Streaming-Only)"
conninfo = host=pgsql11 user=barman dbname=wangxin port=18801
streaming_conninfo = host=pgsql11 user=barmanstr port=18801
backup_method = postgres
streaming_backup_name = barman_streaming_backup
streaming_archiver = on
archiver = on
slot_name = pgsql_streaming
create_slot = auto
streaming_archiver_name = barman_receive_wal
streaming_archiver_batch_size = 50
path_prefix = "/pg_client/bin"
配置服务端和备份端的ssh免密登录:
pgsql:
ssh-keygen -t rsa
cat id_rsa.pub >> authorized_keys
scp authorized_keys >> barman@barman:/home/barman/.ssh/
barman:
ssh-keygen -t rsa
cat id_rsa.pub >> authorized_keys
scp authorized_keys >> pgsql@pgsql11:/home/pgsql/.ssh/
pgsql:
chmod -R 600 /home/pgsql/.ssh/authorized_keys
barman:
chmod -R 600 /home/barman/.ssh/authorized_keys
创建slot槽:
barman receive-wal --create-slot pgsql_streaming
进行check检查:
barman check pgsql_streaming
查看连接服务的信息
[barman@barman ~]$ barman show-server pgsql_streaming
Server pgsql_streaming:
active: True
archive_command: (disabled)
archive_mode: off
archive_timeout: 0
archived_count: 0
archiver: True
archiver_batch_size: 0
backup_directory: /home/barman/pgsql_streaming
backup_method: postgres
backup_options: BackupOptions({'concurrent_backup'})
bandwidth_limit: None
barman_home: /home/barman
barman_lock_directory: /home/barman
basebackup_retry_sleep: 60
basebackup_retry_times: 3
basebackups_directory: /home/barman/pgsql_streaming/base
check_timeout: 120
checkpoint_timeout: 300
compression: gzip
config_file: /PgData/postgresql.conf
connection_error: None
conninfo: host=pgsql11 user=barman dbname=wangxin port=18801
create_slot: auto
current_archived_wals_per_second: 0.0
current_lsn: 0/669C3780
current_size: 1134777182
current_xlog: 000000010000000000000066
custom_compression_filter: None
custom_decompression_filter: None
data_checksums: off
data_directory: /PgData
description: pgsql11-streaming (Streaming-Only)
disabled: False
errors_directory: /home/barman/pgsql_streaming/errors
failed_count: 0
has_backup_privileges: True
hba_file: /PgData/pg_hba.conf
hot_standby: on
ident_file: /PgData/pg_ident.conf
immediate_checkpoint: True
**incoming_wals_directory: /home/barman/pgsql_streaming/incoming**
is_archiving: False
is_in_recovery: False
is_superuser: True
last_archived_time: None
last_archived_wal: None
last_backup_maximum_age: None
last_failed_time: None
last_failed_wal: None
max_incoming_wals_queue: None
max_replication_slots: 10
max_wal_senders: 10
minimum_redundancy: 0
msg_list: []
name: pgsql_streaming
network_compression: False
parallel_jobs: 1
passive_node: False
path_prefix: /pg_client/bin
pg_basebackup_bwlimit: True
pg_basebackup_compatible: True
pg_basebackup_installed: True
pg_basebackup_path: /pg_client/bin/pg_basebackup
pg_basebackup_tbls_mapping: True
pg_basebackup_version: 11.4
pg_receivexlog_compatible: True
pg_receivexlog_installed: True
pg_receivexlog_path: /pg_client/bin/pg_receivewal
pg_receivexlog_supports_slots: True
pg_receivexlog_synchronous: False
pg_receivexlog_version: 11.4
pgespresso_installed: False
post_archive_retry_script: None
post_archive_script: None
post_backup_retry_script: None
post_backup_script: None
post_delete_retry_script: None
post_delete_script: None
post_recovery_retry_script: None
post_recovery_script: None
post_wal_delete_retry_script: None
post_wal_delete_script: None
postgres_systemid: 6901481653023568481
pre_archive_retry_script: None
pre_archive_script: None
pre_backup_retry_script: None
pre_backup_script: None
pre_delete_retry_script: None
pre_delete_script: None
pre_recovery_retry_script: None
pre_recovery_script: None
pre_wal_delete_retry_script: None
pre_wal_delete_script: None
primary_ssh_command: None
recovery_options: RecoveryOptions()
replication_slot: Record(slot_name='pgsql_streaming', active=False, restart_lsn=None)
replication_slot_support: True
retention_policy: RECOVERY WINDOW OF 4 WEEKS
retention_policy_mode: auto
reuse_backup: None
server_txt_version: 11.4
slot_name: pgsql_streaming
ssh_command: None
stats_reset: 2020-12-02 10:33:56.983254+08:00
streaming: True
streaming_archiver: True
streaming_archiver_batch_size: 50
streaming_archiver_name: barman_receive_wal
streaming_backup_name: barman_streaming_backup
streaming_conninfo: host=pgsql11 user=barmanstr port=18801
streaming_supported: True
streaming_systemid: 6901481653023568481
streaming_wals_directory: /home/barman/pgsql_streaming/streaming
synchronous_standby_names: ['']
tablespace_bandwidth_limit: None
timeline: 1
wal_compression: off
wal_keep_segments: 0
wal_level: logical
wal_retention_policy: MAIN
wals_directory: /home/barman/pgsql_streaming/wals
xlog_segment_size: 16777216
xlogpos: 0/669C3780
检查barman是否满足备份要求:
[barman@barman ~]$ barman check pgsql_streaming
Server pgsql_streaming:
WAL archive: FAILED (please make sure WAL shipping is setup)
PostgreSQL: OK
superuser or standard user with backup privileges: OK
PostgreSQL streaming: OK
wal_level: OK
replication slot: FAILED (slot 'pgsql_streaming' not initialised: is 'receive-wal' running?)
directories: OK
retention policy settings: OK
backup maximum age: OK (no last_backup_maximum_age provided)
compression settings: OK
failed backups: OK (there are 0 failed backups)
minimum redundancy requirements: OK (have 0 backups, expected at least 0)
pg_basebackup: OK
pg_basebackup compatible: OK
pg_basebackup supports tablespaces mapping: OK
systemid coherence: OK (no system Id stored on disk)
pg_receivexlog: OK
pg_receivexlog compatible: OK
receive-wal running: FAILED (See the Barman log file for more details)
archive_mode: FAILED (please set it to 'on' or 'always')
archive_command: FAILED (please set it accordingly to documentation)
archiver errors: OK
可以关注到一个错误:
replication slot: FAILED (slot ‘pgsql_streaming’ not initialised: is ‘receive-wal’ running?)
解决该错误,我们需要执行以下命令,让barman服务器来强制接受源端的wal日志:
nohup barman receive-wal pgsql_streaming &
再次进行check:
[barman@barman ~]$ barman check pgsql_streaming
Server pgsql_streaming:
WAL archive: FAILED (please make sure WAL shipping is setup)
PostgreSQL: OK
superuser or standard user with backup privileges: OK
PostgreSQL streaming: OK
wal_level: OK
replication slot: OK
directories: OK
retention policy settings: OK
backup maximum age: OK (no last_backup_maximum_age provided)
compression settings: OK
failed backups: OK (there are 0 failed backups)
minimum redundancy requirements: OK (have 0 backups, expected at least 0)
pg_basebackup: OK
pg_basebackup compatible: OK
pg_basebackup supports tablespaces mapping: OK
systemid coherence: OK (no system Id stored on disk)
pg_receivexlog: OK
pg_receivexlog compatible: OK
receive-wal running: OK
archive_mode: FAILED (please set it to 'on' or 'always')
archive_command: FAILED (please set it accordingly to documentation)
archiver errors: OK
此时,可以发现,错误只有以下内容:
WAL archive: FAILED (please make sure WAL shipping is setup)
archive_mode: FAILED (please set it to ‘on’ or ‘always’)
archive_command: FAILED (please set it accordingly to documentation)
但是这三个错误是通过ssh/rsync的方式,如果我们只用streaming的方式可以忽略,然后直接发起备份,当然如果想要处理这些问题,可以这样做:
1.问题一
执行barman cron:
Starting WAL archiving for server pgsql_streaming
执行barman switch-xlog
barman switch-xlog pgsql_streaming
2.问题二
根据提示在server端设置archive_mod和archive_command参数:
vi postgresql.conf
archive_mod = on
archive_command = 'rsync -a %p barman@barman:/home/barman/pgsql_streaming/incoming/%f' #该路径就是我们刚在barman show-server pg_streaming中查询的
重启数据库
此时再次执行check可以看到状态都正常了:
[root@barman barman.d]# barman check pgsql_streaming
Server pgsql_streaming:
PostgreSQL: OK
superuser or standard user with backup privileges: OK
PostgreSQL streaming: OK
wal_level: OK
replication slot: OK
directories: OK
retention policy settings: OK
backup maximum age: OK (no last_backup_maximum_age provided)
compression settings: OK
failed backups: OK (there are 0 failed backups)
minimum redundancy requirements: OK (have 0 backups, expected at least 0)
pg_basebackup: OK
pg_basebackup compatible: OK
pg_basebackup supports tablespaces mapping: OK
systemid coherence: OK (no system Id stored on disk)
pg_receivexlog: OK
pg_receivexlog compatible: OK
receive-wal running: OK
archive_mode: OK
archive_command: OK
continuous archiving: OK
archiver errors: OK
全部正常以后,我们则可以开始进行备份。
三、barman备份:
进行备份:
barman backup pgsql_streaming
查看备份:
barman list-backup pgsql_streaming
查看备份详细信息
barman show-backup pgsql_streaming 20201204T164310
Backup 20201204T164310:
Server Name : pgsql_streaming
System Id : 6901481653023568481
Status : DONE
PostgreSQL Version : 110004
PGDATA directory : /PgData
Base backup information:
Disk usage : 1.1 GiB (1.1 GiB with WALs)
Incremental size : 1.1 GiB (-0.00%)
Timeline : 1
Begin WAL : 00000001000000000000006F
End WAL : 00000001000000000000006F
WAL number : 1
WAL compression ratio: 99.90%
Begin time : 2020-12-04 16:43:10+08:00
End time : 2020-12-04 16:43:39.799061+08:00
Copy time : 29 seconds
Estimated throughput : 37.4 MiB/s
Begin Offset : 40
End Offset : 0
Begin LSN : 0/6F000028
End LSN : 0/70000000
WAL information:
No of files : 0
Disk usage : 0 B
Last available : 00000001000000000000006F
Catalog information:
Retention Policy : VALID
Previous Backup : - (this is the oldest base backup)
Next Backup : - (this is the latest base backup)
实时查看备份情况:
[root@barman .ssh]# barman status pgsql_streaming
Server pgsql_streaming:
Description: pgsql11-streaming (Streaming-Only)
Active: True
Disabled: False
PostgreSQL version: 11.4
Cluster state: in production
pgespresso extension: Not available
Current data size: 1.1 GiB
PostgreSQL Data directory: /PgData
Current WAL segment: 000000010000000000000073
PostgreSQL 'archive_command' setting: rsync -a %p barman@barman:/home/barman/pgsql_streaming/incoming/%f
Last archived WAL: 000000010000000000000072, at Fri Dec 4 16:50:13 2020
Failures of WAL archiver: 0
Server WAL archiving rate: 0.26/hour
Passive node: False
Retention policies: enforced (mode: auto, retention: RECOVERY WINDOW OF 4 WEEKS, WAL retention: MAIN)
No. of available backups: 1
First available backup: 20201204T164310
Last available backup: 20201204T164310
Minimum redundancy requirements: satisfied (1/0)
说明:
配置文件中的barman_home,即为备份存放路径
打开barman_home下对应的备份服务下,可以看到以下目录:
drwxrwxr-x. 2 barman barman 6 Dec 4 15:36 errors
-rw-r–r--. 1 barman barman 63 Dec 4 16:43 identity.json
drwxrwxr-x. 3 barman barman 45 Dec 4 16:43 wals
drwxrwxr-x. 5 barman barman 75 Dec 4 17:14 base
drwxrwxr-x. 2 barman barman 78 Dec 4 17:14 streaming
drwxrwxr-x. 2 barman barman 6 Dec 4 17:14 incoming
其中,比较重要的有wals、base、streaming、incoming
base中,存放的是数据库的基本备份文件(最近的一份全备)
streaming中,存放的是实时传过来的wal文件
incoming中,存放的是主库传输过来的归档文件
wals中,是所有的归档文件,可以根据这个和基础备份恢复到任意时间点
barman恢复:
查看拥有的备份集:
barman list-backup pgsql_streaming
pgsql_streaming 20201207T231544 - Sun Dec 6 23:15:56 2020 - Size: 356.9 MiB - WAL Size: 0 B
pgsql_streaming 20201207T225457 - Sun Dec 6 22:55:44 2020 - Size: 1.1 GiB - WAL Size: 4.4 MiB
pgsql_streaming 20201204T171433 - Fri Dec 4 17:14:42 2020 - Size: 1.1 GiB - WAL Size: 6.3 MiB
pgsql_streaming 20201204T170853 - Fri Dec 4 17:09:11 2020 - Size: 1.1 GiB - WAL Size: 503.1 KiB
pgsql_streaming 20201204T164310 - Fri Dec 4 16:43:39 2020 - Size: 1.1 GiB - WAL Size: 2.6 MiB
检查选定的备份的详细信息:
[barman@barman 0000000100000000]$ barman show-backup pgsql_streaming 20201207T225457
Backup 20201207T225457:
Server Name : pgsql_streaming
System Id : 6901481653023568481
Status : DONE
PostgreSQL Version : 110004
PGDATA directory : /PgData
Base backup information:
Disk usage : 1.1 GiB (1.1 GiB with WALs)
Incremental size : 1.1 GiB (-0.00%)
Timeline : 1
Begin WAL : 00000001000000000000007C
End WAL : 00000001000000000000007E
WAL number : 3
WAL compression ratio: 99.90%
Begin time : 2020-12-06 22:55:02+08:00
End time : 2020-12-06 22:55:44.674452+08:00
Copy time : 42 seconds
Estimated throughput : 26.4 MiB/s
Begin Offset : 40
End Offset : 96
Begin LSN : 0/7C000028
End LSN : 0/7E000060
WAL information:
No of files : 4
Disk usage : 4.4 MiB
Compression ratio : 93.14%
Last available : 000000010000000000000082
Catalog information:
Retention Policy : VALID
Previous Backup : 20201204T171433
Next Backup : 20201207T231544
barman恢复(恢复到远端数据库):
[barman@barman PgData_recover]$ barman recover pgsql_streaming 20201207T225457 /PgData --remote-ssh-command "ssh postgres@pgsql11-recover" --target-time "2020-12-06 22:57:00.0000"
Processing xlog segments from streaming for pgsql_streaming
000000010000000000000083
Processing xlog segments from file archival for pgsql_streaming
000000010000000000000083
The authenticity of host 'pgsql11-recover (192.168.138.143)' can't be established.
ECDSA key fingerprint is SHA256:vu/gUNno3I8feQxKWcPkcm2kSS+g72zAE2JafM9CBF4.
ECDSA key fingerprint is MD5:59:a7:b9:12:29:31:b3:78:7f:ed:71:44:7a:7e:f2:fc.
Are you sure you want to continue connecting (yes/no)? yes
postgres@pgsql11-recover's password:
Starting remote restore for server pgsql_streaming using backup 20201207T225457
Destination directory: /PgData
Remote command: ssh postgres@pgsql11-recover
Doing PITR. Recovery target time: '2020-12-06 22:57:00+08:00'
postgres@pgsql11-recover's password:
postgres@pgsql11-recover's password:
postgres@pgsql11-recover's password:
postgres@pgsql11-recover's password:
Copying the base backup.
postgres@pgsql11-recover's password:
postgres@pgsql11-recover's password:
postgres@pgsql11-recover's password:
postgres@pgsql11-recover's password:
postgres@pgsql11-recover's password:
Copying required WAL segments.
postgres@pgsql11-recover's password:
Generating recovery configuration
postgres@pgsql11-recover's password:
postgres@pgsql11-recover's password:
postgres@pgsql11-recover's password:
postgres@pgsql11-recover's password:
Identify dangerous settings in destination directory.
postgres@pgsql11-recover's password:
IMPORTANT
These settings have been modified to prevent data losses
postgresql.conf line 706: archive_command = false
Recovery completed (start time: 2020-12-07 23:57:04.056352, elapsed time: 1 minute, 54 seconds)
Your PostgreSQL server has been successfully prepared for recovery!
出现:
Your PostgreSQL server has been successfully prepared for recovery!
则表示恢复成功。
通过pg_dump/dblink等方式,将恢复的表或者数据回写到源端。