www.infradead.org Git - users/hch/xfsprogs.git/log

Christoph Hellwig [Thu, 15 Aug 2024 07:43:48 +0000 (09:43 +0200)]

repair: stop tracking duplicate RT extents with rtgroups

Nothing ever looks them up, so don't bother with tracking them by
overloading the AG numbers.

Signed-off-by: Christoph Hellwig <hch@lst.de>

Christoph Hellwig [Thu, 15 Aug 2024 07:18:08 +0000 (09:18 +0200)]

repair: use a separate bmaps array for real time groups

Stop pretending RTGs are high numbered AGs and just use separate
structures instead.

Signed-off-by: Christoph Hellwig <hch@lst.de>

Christoph Hellwig [Thu, 15 Aug 2024 06:44:30 +0000 (08:44 +0200)]

repair: add a real per-AG bitmap abstraction

Add a struct bmap that contains the btree root and the lock, and provide
helpers for locking instead of directly poking into the data structure.

Signed-off-by: Christoph Hellwig <hch@lst.de>

Christoph Hellwig [Thu, 15 Aug 2024 06:52:04 +0000 (08:52 +0200)]

repair: simplify rt_lock handling

No need to cacheline align rt_lock if we move it next to the data
it protects. Also reduce the critical section to just where those
data structures are accessed.

Signed-off-by: Christoph Hellwig <hch@lst.de>

Christoph Hellwig [Thu, 15 Aug 2024 06:53:29 +0000 (08:53 +0200)]

repair: remove rmap handling in process_rt_rec

rtrmap is only supported with RT groups, which don't use this path.

Signed-off-by: Christoph Hellwig <hch@lst.de>

Christoph Hellwig [Wed, 14 Aug 2024 08:20:49 +0000 (10:20 +0200)]

db: simplify rtinode checks

Stop discovering the RT inodes and just look at di_metatype instead.

Signed-off-by: Christoph Hellwig <hch@lst.de>

Darrick J. Wong [Wed, 7 Aug 2024 22:54:59 +0000 (15:54 -0700)]

xfs_repair: allow adding rmapbt to reflink filesystems

New debugging knob so that I can upgrade a filesystem to have rmap
btrees even if reflink was already enabled. We cannot easily precompute
the space requirements, so this is dangerous.

Signed-off-by: Darrick J. Wong <djwong@kernel.org>

Darrick J. Wong [Wed, 7 Aug 2024 22:54:59 +0000 (15:54 -0700)]

xfs_repair: skip free space checks when upgrading

Add a debug knob to disable the free space checks when upgrading a
system. This is extremely risky and will cause severe tire damage!!!

Signed-off-by: Darrick J. Wong <djwong@kernel.org>

Darrick J. Wong [Wed, 7 Aug 2024 22:54:58 +0000 (15:54 -0700)]

xfs: upgrade filesystem features

Add the ability to upgrade *some* filesystem features. Note that you'll
have to run online fsck immediately afterwards to build metadata!

XXX DO NOT MERGE

Signed-off-by: Darrick J. Wong <djwong@kernel.org>

Darrick J. Wong [Wed, 7 Aug 2024 22:54:58 +0000 (15:54 -0700)]

debian: enable xfs_scrubbed on the root filesystem by default

Now that we're finished building autonomous repair, enable the service
on the root filesystem by default. The root filesystem is mounted by
the initrd prior to starting systemd, which is why the udev rule cannot
autostart the service for the root filesystem.

dh_installsystemd won't activate a template service (aka one with an
at-sign in the name) even if it provides a DefaultInstance directive to
make that possible. Use a fugly shim for this.

Signed-off-by: Darrick J. Wong <djwong@kernel.org>

Darrick J. Wong [Wed, 7 Aug 2024 22:54:58 +0000 (15:54 -0700)]

xfs_scrubbed: use the autofsck fsproperty to select mode

Make the xfs_scrubbed background service query the autofsck filesystem
property to figure out which operating mode it should use.

Signed-off-by: Darrick J. Wong <djwong@kernel.org>

Darrick J. Wong [Wed, 7 Aug 2024 22:54:58 +0000 (15:54 -0700)]

xfs_scrubbed: don't start service if kernel support unavailable

Use ExecCondition= in the system service to check if kernel support for
the health monitor is available. If not, we don't want to run the
service, have it fail, and generate a bunch of silly log messages.

Signed-off-by: Darrick J. Wong <djwong@kernel.org>

Darrick J. Wong [Wed, 7 Aug 2024 22:54:58 +0000 (15:54 -0700)]

xfs_scrubbed: create a background monitoring service

Create a systemd service and activate it automatically.

Signed-off-by: Darrick J. Wong <djwong@kernel.org>

Darrick J. Wong [Wed, 7 Aug 2024 22:54:57 +0000 (15:54 -0700)]

builddefs: refactor udev directory specification

Refactor the code that finds the udev rules directory to detect the
location of the parent udev directory instead. IOWs, we go from:

UDEV_RULE_DIR=/foo/bar/rules.d

to:

UDEV_DIR=/foo/bar
UDEV_RULE_DIR=/foo/bar/rules.d

This is needed by the next patch, which adds a helper script.

Signed-off-by: Darrick J. Wong <djwong@kernel.org>

Darrick J. Wong [Wed, 7 Aug 2024 22:54:57 +0000 (15:54 -0700)]

xfs_scrubbed: use getparents to look up file names

If the kernel tells us about something that happened to a file, use the
GETPARENTS ioctl to try to look up the path to that file for more
ergonomic reporting.

Signed-off-by: Darrick J. Wong <djwong@kernel.org>

Darrick J. Wong [Wed, 7 Aug 2024 22:54:57 +0000 (15:54 -0700)]

xfs_scrubbed: check for fs features needed for effective repairs

Online repair relies heavily on back references such as reverse mappings
and directory parent pointers to add redundancy to the filesystem.
Check for these two features and whine a bit if they are missing.

Signed-off-by: Darrick J. Wong <djwong@kernel.org>

Darrick J. Wong [Wed, 7 Aug 2024 22:54:57 +0000 (15:54 -0700)]

xfs_scrubbed: enable repairing filesystems

Make it so that our health monitoring daemon can initiate repairs.
Repairs can take a while to run, so we don't actually want to be doing
that work in the event thread, because the kernel queue can drop events
if userspace doesn't respond in time.

Therefore, create a subprocess executor to run the repairs in the
background, and do the repairs from there. The subprocess executor is
similar in concept to what a libfrog workqueue does, but the workers do
not share address space, which eliminates GIL contention.
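
The idea of pushing slow repairs out of the event thread can be sketched
roughly as follows. This is only an illustration, not the code from this
commit: start_repair() and reap() are hypothetical names, and the child
command is a stand-in for whatever repair tool the daemon really runs.

```python
import subprocess
import sys


def start_repair(target):
    """Launch a (stand-in) repair for `target` in a child process.

    Returns immediately, so the caller can go back to reading events.
    """
    # Hypothetical placeholder command: a real daemon would run its
    # repair program here instead of a one-line Python script.
    cmd = [
        sys.executable,
        "-c",
        "import sys; print('repaired ' + sys.argv[1])",
        target,
    ]
    return subprocess.Popen(cmd, stdout=subprocess.PIPE, text=True)


def reap(children):
    """Wait for each child and return (exit status, output) pairs."""
    results = []
    for child in children:
        out, _ = child.communicate()
        results.append((child.returncode, out.strip()))
    return results
```

Each worker is a separate process with its own interpreter, so a long
repair never holds the GIL of the process that is draining the event
queue.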

Signed-off-by: Darrick J. Wong <djwong@kernel.org>

Darrick J. Wong [Wed, 7 Aug 2024 22:54:56 +0000 (15:54 -0700)]

xfs_scrubbed: check events against schema

Validate that the event objects that we get from the kernel actually
obey the schema that the kernel publishes.
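
As a rough sketch of what such a check might look like, assume each
event has been decoded into a dictionary and the schema names required
and optional fields with their types. The schema layout, the field
names, and validate_event() are all made up for illustration; the
format the kernel actually publishes is not described here.

```python
def validate_event(event, schema):
    """Return a list of problems; an empty list means the event conforms."""
    required = schema["required"]
    optional = schema.get("optional", {})
    problems = []

    # Every required field must be present and have the expected type.
    for field, expected_type in required.items():
        if field not in event:
            problems.append(f"missing field: {field}")
        elif not isinstance(event[field], expected_type):
            problems.append(
                f"bad type for {field}: {type(event[field]).__name__}"
            )

    # Fields the schema does not mention at all are reported as well.
    for field in event:
        if field not in required and field not in optional:
            problems.append(f"unknown field: {field}")

    return problems


# Hypothetical schema, for illustration only.
EXAMPLE_SCHEMA = {
    "required": {"type": str, "time_ns": int},
    "optional": {"inumber": int},
}
```

Returning a list of problems rather than raising on the first one lets
the daemon log everything that is wrong with an event in one message.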

Signed-off-by: Darrick J. Wong <djwong@kernel.org>

Darrick J. Wong [Wed, 7 Aug 2024 22:54:56 +0000 (15:54 -0700)]

xfs_scrubbed: create daemon to listen for health events

Create a daemon program that can listen for and log health events.

Signed-off-by: Darrick J. Wong <djwong@kernel.org>

Darrick J. Wong [Wed, 7 Aug 2024 22:54:56 +0000 (15:54 -0700)]

xfs_io: monitor filesystem health events

Create a subcommand to monitor for health events generated by the kernel.

Signed-off-by: Darrick J. Wong <djwong@kernel.org>

Darrick J. Wong [Wed, 7 Aug 2024 22:54:56 +0000 (15:54 -0700)]

xfs: report file io errors through healthmon

Set up a file io error event hook so that we can send events about read
errors, writeback errors, and directio errors to userspace.

Signed-off-by: Darrick J. Wong <djwong@kernel.org>
