时间:2021-05-23
1、2部分是对XLOG生成和清理逻辑的分析,XLOG暴涨的处理直接看第3部分。
1.1 不开启归档时
文件数量受下面几个参数控制,通常不超过
(2 + checkpoint_completion_target) * checkpoint_segments + 1或
checkpoint_segments + wal_keep_segments + 1个文件。
如果一个旧段文件不再需要了会重命名然后继续覆盖使用,如果由于短期的日志输出高峰导致了超过
3 * checkpoint_segments + 1个文件,直接删除文件。
1.2 开启归档时
文件数量:删除归档成功的段文件
抽象来看一个运行的PG生成一个无限长的WAL日志序列。每段16M,这些段文件的名字是数值命名的,反映在WAL序列中的位置。在不用WAL归档的时候,系统通常只是创建几个段文件然后循环使用,方法是把不再使用的段文件重命名为更高的段编号。
当且仅当归档命令成功时,归档命令返回零。 在得到一个零值结果之后,PostgreSQL将假设该WAL段文件已经成功归档,稍后将删除段文件。一个非零值告诉PostgreSQL该文件没有被归档,会周期性的重试直到成功。
2.1 删除逻辑
触发删除动作
RemoveOldXlogFiles> CreateCheckPoint> CreateRestartPointwal_keep_segments判断(调用这个函数修改_logSegNo,然后再传入RemoveOldXlogFiles)
static voidKeepLogSeg(XLogRecPtr recptr, XLogSegNo *logSegNo){ XLogSegNo segno; XLogRecPtr keep; XLByteToSeg(recptr, segno); keep = XLogGetReplicationSlotMinimumLSN(); if (wal_keep_segments > 0) { if (segno <= wal_keep_segments) segno = 1; else segno = segno - wal_keep_segments; } if (max_replication_slots > 0 && keep != InvalidXLogRecPtr) { XLogSegNo slotSegNo; XLByteToSeg(keep, slotSegNo); if (slotSegNo <= 0) segno = 1; else if (slotSegNo < segno) segno = slotSegNo; } if (segno < *logSegNo) *logSegNo = segno;}删除逻辑
static voidRemoveOldXlogFiles(XLogSegNo segno, XLogRecPtr endptr){ ... ... while ((xlde = ReadDir(xldir, XLOGDIR)) != NULL) { if (strlen(xlde->d_name) != 24 || strspn(xlde->d_name, "0123456789ABCDEF") != 24) continue; /* * We ignore the timeline part of the XLOG segment identifiers in * deciding whether a segment is still needed. This ensures that we * won't prematurely remove a segment from a parent timeline. We could * probably be a little more proactive about removing segments of * non-parent timelines, but that would be a whole lot more * complicated. * * We use the alphanumeric sorting property of the filenames to decide * which ones are earlier than the lastoff segment. */ if (strcmp(xlde->d_name + 8, lastoff + 8) <= 0) { if (XLogArchiveCheckDone(xlde->d_name)) # 归档关闭返回真 # 存在done文件返回真 # 存在.ready返回假 # recheck存在done文件返回真 # 重建.ready文件返回假 { UpdateLastRemovedPtr(xlde->d_name); # 回收 或者 直接删除,清理.done和.ready文件 RemoveXlogFile(xlde->d_name, endptr); } } } ... ...}2.2 归档逻辑
static voidpgarch_ArchiverCopyLoop(void){ char xlog[MAX_XFN_CHARS + 1]; # 拿到最老那个没有被归档的xlog文件名 while (pgarch_readyXlog(xlog)) { int failures = 0; for (;;) { /* * Do not initiate any more archive commands after receiving * SIGTERM, nor after the postmaster has died unexpectedly. The * first condition is to try to keep from having init SIGKILL the * command, and the second is to avoid conflicts with another * archiver spawned by a newer postmaster. */ if (got_SIGTERM || !PostmasterIsAlive()) return; /* * Check for config update. This is so that we'll adopt a new * setting for archive_command as soon as possible, even if there * is a backlog of files to be archived. */ if (got_SIGHUP) { got_SIGHUP = false; ProcessConfigFile(PGC_SIGHUP); } # archive_command没设的话不再执行 # 我们的command没有设置,走的是这个分支 if (!XLogArchiveCommandSet()) { /* * Change WARNING to DEBUG1, since we will left archive_command empty to * let external tools to manage archive */ ereport(DEBUG1, (errmsg("archive_mode enabled, yet archive_command is not set"))); return; } # 执行归档命令! if (pgarch_archiveXlog(xlog)) { # 成功了,把.ready改名为.done pgarch_archiveDone(xlog); /* * Tell the collector about the WAL file that we successfully * archived */ pgstat_send_archiver(xlog, false); break; } else { /* * Tell the collector about the WAL file that we failed to * archive */ pgstat_send_archiver(xlog, true); if (++failures >= NUM_ARCHIVE_RETRIES) { ereport(WARNING, (errmsg("archiving transaction log file \"%s\" failed too many times, will try again later", xlog))); return; } pg_usleep(1000000L); } } }}2.3 ready生成逻辑
static voidXLogWrite(XLogwrtRqst WriteRqst, bool flexible){... if (finishing_seg) { issue_xlog_fsync(openLogFile, openLogSegNo); WalSndWakeupRequest(); LogwrtResult.Flush = LogwrtResult.Write; # 归档打开 && wal_level >= archive if (XLogArchivingActive()) # 生成ready文件 XLogArchiveNotifySeg(openLogSegNo); XLogCtl->lastSegSwitchTime = (pg_time_t) time(NULL);...2.4 总结
ready文件只要满足archive_mode=on和wal_lever>=archive,就总会生成(XLogWrite函数调用生成)
因为archive_command设置空,所以ready文件的消费完全由外部程序控制
done文件的处理由PG完成,两个地方会触发done文件处理,检查点和重启点
处理多少done文件受wal_keep_segments和replication_slot控制(KeepLogSeg函数)
注意:无论如何注意不要手动删除xlog文件
注意:checkpoint产生的日志回不立即生成ready文件,是在下一个xlog后一块生成的
3.1 ReplicationSlot
打开流了复制槽
-- 流复制插槽-- 如果restart_lsn和当前XLOG相差非常大的字节数, 需要排查slot的订阅者是否能正常接收XLOG, -- 或者订阅者是否正常. 长时间不将slot的数据取走, pg_xlog目录可能会撑爆select pg_xlog_location_diff(pg_current_xlog_location(),restart_lsn), * from pg_replication_slots;删除
select pg_drop_replication_slot('xxx');删除后PG会在下一个checkpoint清理xlog
3.2 较大的wal_keep_segments
检查参数配置,注意打开这个参数会使xlog和ready有一定延迟
3.3 回收出现问题
如果不使用PG自动回收机制,数据库依赖外部程序修改.ready文件,需要检测回收进程
(archive_mode=on archive_command='')3.4 检查点间隔过长
检查参数配置
以上为个人经验,希望能给大家一个参考,也希望大家多多支持。如有错误或未考虑完全的地方,望不吝赐教。
声明:本页内容来源网络,仅供用户参考;我单位不保证亦不表示资料全面及准确无误,也不保证亦不表示这些资料为最新信息,如因任何原因,本网内容或者用户因倚赖本网内容造成任何损失或损害,我单位将不会负任何法律责任。如涉及版权问题,请提交至online#300.cn邮箱联系删除。
一、逻辑操作符:常用的逻辑操作符有:AND、OR和NOT。其语义与其它编程语言中的逻辑操作符完全相同。二、比较操作符:下面是PostgreSQL中提供的比较操作
####PostgreSQLLogicalReplication(逻辑复制)####LogicalReplication(逻辑复制)为PostgreSQL自己提
用Python随机生成学生姓名,三科成绩和班级数据,再插入到PostgreSQL中。模块用psycopg2randomimportrandomimportpsy
最近有网友问小编这样一个问题:xlog是什么文件?为什么将手机中的xlog文件传到电脑后,也打不开呢?其实,很多人容易把xlog文件误以为是常见的Log日志文件
WAL是WriteAheadLog的简写,和oracle的redo日志类似,存放在$PGDATA/pg_xlog中,10版本以后在$PGDATA/pg_wal目