]> www.infradead.org Git - users/hch/xfsprogs.git/log
users/hch/xfsprogs.git
11 months agoxfs_db: dump rt bitmap blocks
Darrick J. Wong [Wed, 7 Aug 2024 23:04:43 +0000 (16:04 -0700)]
xfs_db: dump rt bitmap blocks

Now that rtbitmap blocks have a header, make it so that xfs_db can
analyze the structure.

Signed-off-by: Darrick J. Wong <djwong@kernel.org>
11 months agoxfs_db: discover rtgroup inodes
Darrick J. Wong [Fri, 9 Aug 2024 21:30:04 +0000 (14:30 -0700)]
xfs_db: discover rtgroup inodes

If we need to, discover the rtgroup inodes.

Signed-off-by: Darrick J. Wong <djwong@kernel.org>
11 months agoxfs_db: hoist bit scraping function
Darrick J. Wong [Wed, 7 Aug 2024 23:04:43 +0000 (16:04 -0700)]
xfs_db: hoist bit scraping function

Hoist the bit scraping code into a separate helper function so that we
can create a _le version later, and also enable some testing functions.

Signed-off-by: Darrick J. Wong <djwong@kernel.org>
11 months agoxfs_db: metadump realtime devices
Darrick J. Wong [Wed, 7 Aug 2024 23:04:42 +0000 (16:04 -0700)]
xfs_db: metadump realtime devices

Teach the metadump device to dump the filesystem metadata of a realtime
device to the metadump file.  Currently, this is limited to the realtime
superblock.

Signed-off-by: Darrick J. Wong <djwong@kernel.org>
11 months agoxfs_db: metadump metadir rt bitmap and summary files
Christoph Hellwig [Wed, 7 Aug 2024 23:04:42 +0000 (16:04 -0700)]
xfs_db: metadump metadir rt bitmap and summary files

Don't skip dumping the data fork for regular files that are marked as
metadata inodes.  This catches rtbitmap and summary inodes on rtgroup
enabled file systems where their inode numbers aren't recorded in the
superblock.

Signed-off-by: Christoph Hellwig <hch@lst.de>
Reviewed-by: Darrick J. Wong <djwong@kernel.org>
Signed-off-by: Darrick J. Wong <djwong@kernel.org>
11 months agoxfs_db: report rtgroups via version command
Darrick J. Wong [Wed, 7 Aug 2024 23:04:42 +0000 (16:04 -0700)]
xfs_db: report rtgroups via version command

Report the rtgroups feature in the version command output.

Signed-off-by: Darrick J. Wong <djwong@kernel.org>
11 months agoxfs_db: enable conversion of rt space units
Darrick J. Wong [Wed, 7 Aug 2024 23:04:42 +0000 (16:04 -0700)]
xfs_db: enable conversion of rt space units

Teach the xfs_db convert function about realtime extents, blocks, and
realtime group numbers.

Signed-off-by: Darrick J. Wong <djwong@kernel.org>
11 months agoxfs_db: support changing the label and uuid of rt superblocks
Darrick J. Wong [Wed, 7 Aug 2024 23:04:41 +0000 (16:04 -0700)]
xfs_db: support changing the label and uuid of rt superblocks

Update the label and uuid commands to change the rt superblocks along
with the filesystem superblocks.

Signed-off-by: Darrick J. Wong <djwong@kernel.org>
11 months agoxfs_db: support dumping realtime superblocks
Darrick J. Wong [Wed, 7 Aug 2024 23:04:41 +0000 (16:04 -0700)]
xfs_db: support dumping realtime superblocks

Allow debugging of realtime superblocks, and add the relevant fields in
the fs superblock that point us at the existence and location of the rt
supers.

Signed-off-by: Darrick J. Wong <djwong@kernel.org>
11 months agoxfs_db: listify the definition of enum typnm
Darrick J. Wong [Wed, 7 Aug 2024 23:04:41 +0000 (16:04 -0700)]
xfs_db: listify the definition of enum typnm

Convert the enum definition into a list so that future patches adding
things to enum typnm don't have to reflow the entire thing.

Signed-off-by: Darrick J. Wong <djwong@kernel.org>
11 months agoxfs_repair: support adding rtgroups to a filesystem
Darrick J. Wong [Wed, 7 Aug 2024 23:04:41 +0000 (16:04 -0700)]
xfs_repair: support adding rtgroups to a filesystem

Allow users to add the rtgroups feature to a filesystem if the
filesystem does not already have a realtime volume attached.

Signed-off-by: Darrick J. Wong <djwong@kernel.org>
11 months agoxfs_repair: repair rtbitmap and rtsummary block headers
Darrick J. Wong [Wed, 7 Aug 2024 23:04:40 +0000 (16:04 -0700)]
xfs_repair: repair rtbitmap and rtsummary block headers

Check and repair the new block headers attached to rtbitmap and
rtsummary blocks.

Signed-off-by: Darrick J. Wong <djwong@kernel.org>
11 months agoxfs_repair: support realtime superblocks
Darrick J. Wong [Wed, 7 Aug 2024 23:04:40 +0000 (16:04 -0700)]
xfs_repair: support realtime superblocks

Support the realtime superblock feature.

Signed-off-by: Darrick J. Wong <djwong@kernel.org>
11 months agoxfs_repair: find and clobber rtgroup bitmap and summary files
Darrick J. Wong [Tue, 13 Aug 2024 19:27:56 +0000 (12:27 -0700)]
xfs_repair: find and clobber rtgroup bitmap and summary files

On a rtgroups filesystem, if the rtgroups bitmap or summary files are
garbage, we need to clear the dinode and update the incore bitmap so
that we don't bother to check the old rt freespace metadata.

However, we regenerate the entire rt metadata directory tree during
phase 6.  If the bitmap and summary files are ok, we still want to clear
the dinode, but we can still use the incore inode to check the old
freespace contents.  Split the clear_dinode function into two pieces,
one that merely zeroes the inode, and the old clear_dinode now turns off
checking.

Signed-off-by: Darrick J. Wong <djwong@kernel.org>
11 months agoxfs_repair: support realtime groups
Christoph Hellwig [Wed, 7 Aug 2024 23:04:40 +0000 (16:04 -0700)]
xfs_repair: support realtime groups

Make repair aware of multiple rtgroups.  This now uses the same code as the
AG-based data device for block usage tracking instead of the less optimal
AVL trees and bitmaps used for the traditonal RT device.

Note this is still a bit hacky at the moment by just going beyond the AG
arrays and not fully supporting the unknown state for RT allocation yet.

All this should be fixable.

Large parts of the code are based on patches from Darrick J. Wong.

Signed-off-by: Christoph Hellwig <hch@lst.de>
Reviewed-by: Darrick J. Wong <djwong@kernel.org>
Signed-off-by: Darrick J. Wong <djwong@kernel.org>
11 months agoxfs_repair: improve rtbitmap discrepancy reporting
Darrick J. Wong [Wed, 7 Aug 2024 23:04:40 +0000 (16:04 -0700)]
xfs_repair: improve rtbitmap discrepancy reporting

Improve the reporting of discrepancies in the realtime bitmap and
summary files by creating a separate helper function that will pinpoint
the exact (word) locations of mismatches.  This will help developers to
diagnose problems with the rtgroups feature and users to figure out
exactly what's bad in a filesystem.

Signed-off-by: Darrick J. Wong <djwong@kernel.org>
11 months agoxfs_repair: refactor offsetof+sizeof to offsetofend
Darrick J. Wong [Wed, 7 Aug 2024 23:04:39 +0000 (16:04 -0700)]
xfs_repair: refactor offsetof+sizeof to offsetofend

Replace this open-coded logic with the kernel's offsetofend macro before
we start adding more in the next patch.

Signed-off-by: Darrick J. Wong <djwong@kernel.org>
11 months agoxfs_repair: refactor phase4
Christoph Hellwig [Wed, 7 Aug 2024 23:04:39 +0000 (16:04 -0700)]
xfs_repair: refactor phase4

Split out helpers to process all duplicate extents in an AG and the RT
duplicate extents.

Signed-off-by: Christoph Hellwig <hch@lst.de>
Reviewed-by: Darrick J. Wong <djwong@kernel.org>
Signed-off-by: Darrick J. Wong <djwong@kernel.org>
11 months agolibfrog: add bitmap_clear
Darrick J. Wong [Wed, 7 Aug 2024 23:04:39 +0000 (16:04 -0700)]
libfrog: add bitmap_clear

Uncomment and fix bitmap_clear so that xfs_repair can start using it.

Signed-off-by: Darrick J. Wong <djwong@kernel.org>
[hch: split from a larger patch]
Signed-off-by: Christoph Hellwig <hch@lst.de>
11 months agolibfrog: report rt groups in output
Darrick J. Wong [Wed, 7 Aug 2024 23:04:39 +0000 (16:04 -0700)]
libfrog: report rt groups in output

Report realtime group geometry.

Signed-off-by: Darrick J. Wong <djwong@kernel.org>
11 months agolibfrog: support scrubbing rtgroup metadata paths
Darrick J. Wong [Wed, 7 Aug 2024 23:42:10 +0000 (16:42 -0700)]
libfrog: support scrubbing rtgroup metadata paths

Support scrubbing the metadata paths of rtgroup metadata inodes.

Signed-off-by: Darrick J. Wong <djwong@kernel.org>
11 months agolibxfs: implement some sanity checking for enormous rgcount
Darrick J. Wong [Wed, 7 Aug 2024 23:04:39 +0000 (16:04 -0700)]
libxfs: implement some sanity checking for enormous rgcount

Similar to what we do for suspiciously large sb_agcount values, if
someone tries to get libxfs to load a filesystem with a very large
realtime group count, let's do some basic checks of the rt device to
see if it's really that large.  If the read fails, only load the first
rtgroup and warn the user.

Signed-off-by: Darrick J. Wong <djwong@kernel.org>
11 months agolibxfs: dirty buffers should be marked uptodate too
Darrick J. Wong [Wed, 7 Aug 2024 22:54:37 +0000 (15:54 -0700)]
libxfs: dirty buffers should be marked uptodate too

I started fuzz-testing the realtime rmap feature with a very large
number of realtime allocation groups.  There were so many rt groups that
repair had to rebuild /realtime in the metadata directory tree, and that
directory was big enough to spur the creation of a block format
directory.

Unfortunately, repair then walks both directory trees to look for
unconnceted files.  This part of phase 6 emits CRC errors on the newly
created buffers for the /realtime directory, declares the directory to
be garbage, and moves all the rt rmap inodes to /lost+found, resulting
in a corrupt fs.

Poking around in gdb, I noticed that the buffer contents were indeed
zero, and that UPTODATE was not set.  This was very strange, until I
added a watch on bp->b_flags to watch for accesses.  It turns out that
xfs_repair's prefetch code will _get a buffer and zero the contents if
UPTODATE is not set.

The directory tree code in libxfs will also _get a buffer, initialize
it, and log it to the coordinating transaction, which in this case is
the transactions used to reconnect the rmap btree inodes to /realtime.
At no point does any of that code ever set UPTODATE on the buffer, which
is why prefetch zaps the contents.

Hence change both buffer dirtying functions to set UPTODATE, since a
dirty buffer is by definition at least as recent as whatever's on disk.

Signed-off-by: Darrick J. Wong <djwong@kernel.org>
11 months agoxfs: scrub metadir paths for rtgroup metadata
Darrick J. Wong [Wed, 7 Aug 2024 23:33:25 +0000 (16:33 -0700)]
xfs: scrub metadir paths for rtgroup metadata

Add the code we need to scan the metadata directory paths of rt group
metadata files.

Signed-off-by: Darrick J. Wong <djwong@kernel.org>
11 months agoxfs: scrub the realtime group superblock
Darrick J. Wong [Wed, 7 Aug 2024 23:04:38 +0000 (16:04 -0700)]
xfs: scrub the realtime group superblock

Enable scrubbing of realtime group superblocks.

Signed-off-by: Darrick J. Wong <djwong@kernel.org>
11 months agoxfs_logprint: report realtime EFIs
Darrick J. Wong [Wed, 7 Aug 2024 23:04:38 +0000 (16:04 -0700)]
xfs_logprint: report realtime EFIs

Decode the EFI format just enough to report if an EFI targets the
realtime device or not.

Signed-off-by: Darrick J. Wong <djwong@kernel.org>
11 months agoxfs: add wrapping rtgroup iterator
Darrick J. Wong [Thu, 8 Aug 2024 18:14:22 +0000 (11:14 -0700)]
xfs: add wrapping rtgroup iterator

Add a bunch of iterator macros to walk all rtgroups from a starting
rgno and wrapping around to rtgroup 0, then refactor the new allocator
to use it.

Signed-off-by: Darrick J. Wong <djwong@kernel.org>
11 months agoxfs: make the RT allocator rtgroup aware
Christoph Hellwig [Wed, 7 Aug 2024 23:04:38 +0000 (16:04 -0700)]
xfs: make the RT allocator rtgroup aware

Source kernel commit: 48429f5e97276b8b99a938136db513851bf04ad3

Make the allocator rtgroup aware by either picking a specific group if
there is a hint, or loop over all groups otherwise.  A simple rotor is
provided to pick the placement for initial allocations.

Signed-off-by: Christoph Hellwig <hch@lst.de>
Reviewed-by: Darrick J. Wong <djwong@kernel.org>
Signed-off-by: Darrick J. Wong <djwong@kernel.org>
11 months agofixup
Darrick J. Wong [Wed, 7 Aug 2024 23:04:38 +0000 (16:04 -0700)]
fixup

11 months agoxfs: use realtime EFI to free extents when rtgroups are enabled
Darrick J. Wong [Wed, 7 Aug 2024 23:04:37 +0000 (16:04 -0700)]
xfs: use realtime EFI to free extents when rtgroups are enabled

Source kernel commit: 97634bd4ecbe878557002eff401d688a9b56fc32

When rmap is enabled, XFS expects a certain order of operations, which
is: 1) remove the file mapping, 2) remove the reverse mapping, and then
3) free the blocks.  When reflink is enabled, XFS replaces (3) with a
deferred refcount decrement operation that can schedule freeing the
blocks if that was the last refcount.

For realtime files, xfs_bmap_del_extent_real tries to do 1 and 3 in the
same transaction, which will break both rmap and reflink unless we
switch it to use realtime EFIs.  Both rmap and reflink depend on the
rtgroups feature, so let's turn on EFIs for all rtgroups filesystems.

Signed-off-by: Darrick J. Wong <djwong@kernel.org>
Signed-off-by: Christoph Hellwig <hch@lst.de>
11 months agoxfs: support error injection when freeing rt extents
Darrick J. Wong [Wed, 7 Aug 2024 23:04:37 +0000 (16:04 -0700)]
xfs: support error injection when freeing rt extents

Source kernel commit: 33a3273ca66de4072947c5e3606400f22c64c296

A handful of fstests expect to be able to test what happens when extent
free intents fail to actually free the extent.  Now that we're
supporting EFIs for realtime extents, add to xfs_rtfree_extent the same
injection point that exists in the regular extent freeing code.

Signed-off-by: Darrick J. Wong <djwong@kernel.org>
Signed-off-by: Christoph Hellwig <hch@lst.de>
11 months agofixup
Christoph Hellwig [Wed, 7 Aug 2024 23:04:37 +0000 (16:04 -0700)]
fixup

11 months agoxfs: support logging EFIs for realtime extents
Darrick J. Wong [Wed, 7 Aug 2024 23:04:37 +0000 (16:04 -0700)]
xfs: support logging EFIs for realtime extents

Source kernel commit: ba22ecc455d21c9f2c9936b9864e4cf55bba2724

Teach the EFI mechanism how to free realtime extents.  We're going to
need this to enforce proper ordering of operations when we enable
realtime rmap.

Declare a new log intent item type (XFS_LI_EFI_RT) and a separate defer
ops for rt extents.  This keeps the ondisk artifacts and processing code
completely separate between the rt and non-rt cases.  Hopefully this
will make it easier to debug filesystem problems.

Signed-off-by: Darrick J. Wong <djwong@kernel.org>
Signed-off-by: Christoph Hellwig <hch@lst.de>
11 months agofixup
Christoph Hellwig [Wed, 7 Aug 2024 23:04:36 +0000 (16:04 -0700)]
fixup

11 months agoxfs: store rtgroup information with a bmap intent
Darrick J. Wong [Wed, 7 Aug 2024 23:04:36 +0000 (16:04 -0700)]
xfs: store rtgroup information with a bmap intent

Source kernel commit: b6e3ae52a23a48ba4b7872a228f01138ab65e812

Make the bmap intent items take an active reference to the rtgroup
containing the space that is being mapped or unmapped.  We will need
this functionality once we start enabling rmap and reflink on the rt
volume.

Signed-off-by: Darrick J. Wong <djwong@kernel.org>
Signed-off-by: Christoph Hellwig <hch@lst.de>
11 months agoxfs: grow the realtime section when realtime groups are enabled
Darrick J. Wong [Wed, 7 Aug 2024 23:04:36 +0000 (16:04 -0700)]
xfs: grow the realtime section when realtime groups are enabled

Source kernel commit: 44d787858eaf22fbc930e3cadc2228f4ffcf2c3a

Enable growing the rt section when realtime groups are enabled.

Signed-off-by: Darrick J. Wong <djwong@kernel.org>
Signed-off-by: Christoph Hellwig <hch@lst.de>
11 months agofixup
Christoph Hellwig [Wed, 7 Aug 2024 23:04:36 +0000 (16:04 -0700)]
fixup

11 months agoxfs: encode the rtsummary in big endian format
Darrick J. Wong [Wed, 7 Aug 2024 23:04:35 +0000 (16:04 -0700)]
xfs: encode the rtsummary in big endian format

Source kernel commit: 59fc91c095f19228610035ac2e8a86cfed79c855

Currently, the ondisk realtime summary file counters are accessed in
units of 32-bit words.  There's no endian translation of the contents of
this file, which means that the Bad Things Happen(tm) if you go from
(say) x86 to powerpc.  Since we have a new feature flag, let's take the
opportunity to enforce an endianness on the file.  Encode the summary
information in big endian format, like most of the rest of the
filesystem.

Signed-off-by: Darrick J. Wong <djwong@kernel.org>
Signed-off-by: Christoph Hellwig <hch@lst.de>
11 months agofixup
Christoph Hellwig [Wed, 7 Aug 2024 23:04:35 +0000 (16:04 -0700)]
fixup

11 months agoxfs: encode the rtbitmap in big endian format
Darrick J. Wong [Wed, 7 Aug 2024 23:04:35 +0000 (16:04 -0700)]
xfs: encode the rtbitmap in big endian format

Source kernel commit: d54d402cff2f2f565b630334aea4b95fa6a3f9e6

Currently, the ondisk realtime bitmap file is accessed in units of
32-bit words.  There's no endian translation of the contents of this
file, which means that the Bad Things Happen(tm) if you go from (say)
x86 to powerpc.  Since we have a new feature flag, let's take the
opportunity to enforce an endianness on the file.

Signed-off-by: Darrick J. Wong <djwong@kernel.org>
Signed-off-by: Christoph Hellwig <hch@lst.de>
11 months agofixup
Christoph Hellwig [Wed, 7 Aug 2024 23:04:35 +0000 (16:04 -0700)]
fixup

11 months agoxfs: add block headers to realtime bitmap and summary blocks
Darrick J. Wong [Wed, 7 Aug 2024 23:04:20 +0000 (16:04 -0700)]
xfs: add block headers to realtime bitmap and summary blocks

Upgrade rtbitmap and rtsummary blocks to have self describing metadata
like most every other thing in XFS.

Signed-off-by: Darrick J. Wong <djwong@kernel.org>
11 months agofixup
Christoph Hellwig [Wed, 7 Aug 2024 23:04:34 +0000 (16:04 -0700)]
fixup

11 months agoxfs: export the geometry of realtime groups to userspace
Darrick J. Wong [Wed, 7 Aug 2024 23:04:34 +0000 (16:04 -0700)]
xfs: export the geometry of realtime groups to userspace

Source kernel commit: f5a734e12dae026218c45e5f46fe771492f2249d

Create an ioctl so that the kernel can report the status of realtime
groups to userspace.

Signed-off-by: Darrick J. Wong <djwong@kernel.org>
Signed-off-by: Christoph Hellwig <hch@lst.de>
11 months agofixup
Christoph Hellwig [Wed, 7 Aug 2024 23:04:34 +0000 (16:04 -0700)]
fixup

11 months agoxfs: record rt group metadata errors in the health system
Darrick J. Wong [Wed, 7 Aug 2024 23:04:19 +0000 (16:04 -0700)]
xfs: record rt group metadata errors in the health system

Record the state of per-rtgroup metadata sickness in the rtgroup
structure for later reporting.

Signed-off-by: Darrick J. Wong <djwong@kernel.org>
11 months agofixup
Christoph Hellwig [Wed, 7 Aug 2024 23:04:33 +0000 (16:04 -0700)]
fixup

11 months agoxfs: add frextents to the lazysbcounters when rtgroups enabled
Darrick J. Wong [Wed, 7 Aug 2024 23:04:33 +0000 (16:04 -0700)]
xfs: add frextents to the lazysbcounters when rtgroups enabled

Source kernel commit: 89d25a402bd8d2ec7fa2904640dfae961f148bf9

Make the free rt extent count a part of the lazy sb counters when the
realtime groups feature is enabled.  This is possible because the patch
to recompute frextents from the rtbitmap during log recovery predates
the code adding rtgroup support, hence we know that the value will
always be correct during runtime.

Signed-off-by: Darrick J. Wong <djwong@kernel.org>
Signed-off-by: Christoph Hellwig <hch@lst.de>
11 months agoxfs: add a helper to prevent bmap merges across rtgroup boundaries
Christoph Hellwig [Wed, 7 Aug 2024 23:04:33 +0000 (16:04 -0700)]
xfs: add a helper to prevent bmap merges across rtgroup boundaries

Source kernel commit: a3ae21f825b8790c5cf231e366309dbec3677ddf

Except for the rt superblock, realtime groups do not store any metadata
at the start (or end) of the group.  There is nothing to prevent the
bmap code from merging allocations from multiple groups into a single
bmap record.  Add a helper to check for this case.

Signed-off-by: Christoph Hellwig <hch@lst.de>
Reviewed-by: Darrick J. Wong <djwong@kernel.org>
[djwong: massage the commit message after pulling this into rtgroups]
Signed-off-by: Darrick J. Wong <djwong@kernel.org>
Signed-off-by: Christoph Hellwig <hch@lst.de>
11 months agoxfs: check that rtblock extents do not break rtsupers or rtgroups
Darrick J. Wong [Wed, 7 Aug 2024 23:04:32 +0000 (16:04 -0700)]
xfs: check that rtblock extents do not break rtsupers or rtgroups

Source kernel commit: 503fca119d2c8df89256e555c00ecc732fc98e4b

Check that rt block pointers do not point to the realtime superblock and
that allocated rt space extents do not cross rtgroup boundaries.

Signed-off-by: Darrick J. Wong <djwong@kernel.org>
Signed-off-by: Christoph Hellwig <hch@lst.de>
11 months agoxfs: export realtime group geometry via XFS_FSOP_GEOM
Darrick J. Wong [Wed, 7 Aug 2024 23:04:32 +0000 (16:04 -0700)]
xfs: export realtime group geometry via XFS_FSOP_GEOM

Source kernel commit: a580e4a73a9eaa96153080f7c81a30333bcc4d81

Export the realtime geometry information so that userspace can query it.

Signed-off-by: Darrick J. Wong <djwong@kernel.org>
Signed-off-by: Christoph Hellwig <hch@lst.de>
11 months agofixup
Christoph Hellwig [Wed, 7 Aug 2024 23:04:32 +0000 (16:04 -0700)]
fixup

11 months agoxfs: update realtime super every time we update the primary fs super
Darrick J. Wong [Wed, 7 Aug 2024 23:04:32 +0000 (16:04 -0700)]
xfs: update realtime super every time we update the primary fs super

Source kernel commit: 8bbee1210019f3cb32d8ddd39223ce65d9da4039

Every time we update parts of the primary filesystem superblock that are
echoed in the rt superblock, we must update the rt super.  Avoid
changing the log to support logging to the rt device by using ordered
buffers.

Signed-off-by: Darrick J. Wong <djwong@kernel.org>
Signed-off-by: Christoph Hellwig <hch@lst.de>
11 months agofixup
Christoph Hellwig [Wed, 7 Aug 2024 23:04:32 +0000 (16:04 -0700)]
fixup

11 months agolibfrog: add memchr_inv
Darrick J. Wong [Wed, 7 Aug 2024 23:04:31 +0000 (16:04 -0700)]
libfrog: add memchr_inv

Add this kernel function so we can use it in userspace.

Signed-off-by: Darrick J. Wong <djwong@kernel.org>
[hch: split from a larger patch]
Signed-off-by: Christoph Hellwig <hch@lst.de>
11 months agoxfs: define the format of rt groups
Darrick J. Wong [Wed, 7 Aug 2024 23:04:31 +0000 (16:04 -0700)]
xfs: define the format of rt groups

Source kernel commit: f13677b8a76a61cbb1c937d250a3bd45019748e8

Define the ondisk format of realtime group metadata, and a superblock
for realtime volumes.  rt supers are protected by a separate rocompat
bit so that we can leave them off if the rt device is zoned.

Add a xfs_sb_version_hasrtgroups so that xfs_repair knows how to zero
the tail of superblocks.

For rt group enabled file systems there is a separate bitmap and summary
file for each group and thus the number of bitmap and summary blocks
needs to be calculated differently.

Signed-off-by: Darrick J. Wong <djwong@kernel.org>
Signed-off-by: Christoph Hellwig <hch@lst.de>
11 months agofixup
Christoph Hellwig [Wed, 7 Aug 2024 23:04:31 +0000 (16:04 -0700)]
fixup

11 months agoxfs: make RT extent numbers relative to the rtgroup
Christoph Hellwig [Wed, 7 Aug 2024 23:04:12 +0000 (16:04 -0700)]
xfs: make RT extent numbers relative to the rtgroup

To prepare for adding per-rtgroup bitmap files, make the xfs_rtxnum_t
type encode the RT extent number relative to the rtgroup.  The biggest
part of this to clearly distinguish between the relative extent number
that gets masked when converting from a global block number and length
values that just have a factor applied to them when converting from
file system blocks.

Signed-off-by: Christoph Hellwig <hch@lst.de>
Reviewed-by: Darrick J. Wong <djwong@kernel.org>
Signed-off-by: Darrick J. Wong <djwong@kernel.org>
11 months agofixup
Christoph Hellwig [Wed, 7 Aug 2024 23:04:30 +0000 (16:04 -0700)]
fixup

11 months agoxfs: refactor xfs_rtsummary_blockcount
Christoph Hellwig [Wed, 7 Aug 2024 23:04:11 +0000 (16:04 -0700)]
xfs: refactor xfs_rtsummary_blockcount

Make xfs_rtsummary_blockcount take all the required information from
the mount structure and return the number of summary levels from it
as well.  This cleans up many of the callers and prepares for making the
rtsummary files per-rtgroup where they need to look at different value.

This means we recalculate some values in some callers, but as all these
calculations are outside the fast path and cheap that seems like a price
worth paying.

Signed-off-by: Christoph Hellwig <hch@lst.de>
Reviewed-by: Darrick J. Wong <djwong@kernel.org>
Signed-off-by: Darrick J. Wong <djwong@kernel.org>
11 months agoxfs: refactor xfs_rtbitmap_blockcount
Christoph Hellwig [Wed, 7 Aug 2024 23:04:10 +0000 (16:04 -0700)]
xfs: refactor xfs_rtbitmap_blockcount

Rename the existing xfs_rtbitmap_blockcount to
xfs_rtbitmap_blockcount_len and add a new xfs_rtbitmap_blockcount wrapper
around it that takes the number of extents from the mount structure.

This will simplify the move to per-rtgroup bitmaps as those will need to
pass in the number of extents per rtgroup instead.

Signed-off-by: Christoph Hellwig <hch@lst.de>
Reviewed-by: Darrick J. Wong <djwong@kernel.org>
Signed-off-by: Darrick J. Wong <djwong@kernel.org>
11 months agoxfs: remove XFS_ILOCK_RT*
Darrick J. Wong [Wed, 7 Aug 2024 23:04:30 +0000 (16:04 -0700)]
xfs: remove XFS_ILOCK_RT*

Now that we've centralized the realtime metadata locking routines, get
rid of the ILOCK subclasses since we now use explicit lockdep classes.

Signed-off-by: Darrick J. Wong <djwong@kernel.org>
11 months agofixup
Christoph Hellwig [Wed, 7 Aug 2024 23:04:21 +0000 (16:04 -0700)]
fixup

11 months agoxfs: move RT bitmap and summary information to the rtgroup
Christoph Hellwig [Wed, 7 Aug 2024 23:04:05 +0000 (16:04 -0700)]
xfs: move RT bitmap and summary information to the rtgroup

Move the pointers to the RT bitmap and summary inodes as well as the
summary cache to the rtgroups structure to prepare for having a
separate bitmap and summary inodes for each rtgroup.

Code using the inodes now needs to operate on a rtgroup.  Where easily
possible such code is converted to iterate over all rtgroups, else
rtgroup 0 (the only one that can currently exist) is hardcoded.

Signed-off-by: Christoph Hellwig <hch@lst.de>
Reviewed-by: Darrick J. Wong <djwong@kernel.org>
Signed-off-by: Darrick J. Wong <djwong@kernel.org>
11 months agofixup
Christoph Hellwig [Wed, 7 Aug 2024 23:04:12 +0000 (16:04 -0700)]
fixup

11 months agoxfs: support caching rtgroup metadata inodes
Darrick J. Wong [Wed, 7 Aug 2024 23:04:03 +0000 (16:04 -0700)]
xfs: support caching rtgroup metadata inodes

Create the necessary per-rtgroup infrastructure that we need to load
metadata inodes into memory.

Signed-off-by: Darrick J. Wong <djwong@kernel.org>
11 months agoxfs: add a lockdep class key for rtgroup inodes
Darrick J. Wong [Wed, 7 Aug 2024 23:04:11 +0000 (16:04 -0700)]
xfs: add a lockdep class key for rtgroup inodes

Source kernel commit: 8c2ef9cdb92b08063962cbd2f95571042225c125

Add a dynamic lockdep class key for rtgroup inodes.  This will enable
lockdep to deduce inconsistencies in the rtgroup metadata ILOCK locking
order.  Each class can have 8 subclasses, and for now we will only have
2 inodes per group.  This enables rtgroup order and inode order checks
when nesting ILOCKs.

Signed-off-by: Darrick J. Wong <djwong@kernel.org>
Signed-off-by: Christoph Hellwig <hch@lst.de>
11 months agoxfs: define locking primitives for realtime groups
Darrick J. Wong [Wed, 7 Aug 2024 23:04:01 +0000 (16:04 -0700)]
xfs: define locking primitives for realtime groups

Define helper functions to lock all metadata inodes related to a
realtime group.  There's not much to look at now, but this will become
important when we add per-rtgroup metadata files and online fsck code
for them.

Signed-off-by: Darrick J. Wong <djwong@kernel.org>
11 months agofixups
Christoph Hellwig [Wed, 7 Aug 2024 23:04:01 +0000 (16:04 -0700)]
fixups

11 months agoxfs: create incore realtime group structures
Darrick J. Wong [Wed, 7 Aug 2024 23:04:00 +0000 (16:04 -0700)]
xfs: create incore realtime group structures

Create an incore object that will contain information about a realtime
allocation group.  This will eventually enable us to shard the realtime
section in a similar manner to how we shard the data section, but for
now just a single object for the entire RT subvolume is created.

Signed-off-by: Darrick J. Wong <djwong@kernel.org>
11 months agofixup
Christoph Hellwig [Wed, 7 Aug 2024 23:02:00 +0000 (16:02 -0700)]
fixup

11 months agoxfs: replace m_rsumsize with m_rsumblocks
Christoph Hellwig [Wed, 7 Aug 2024 23:03:58 +0000 (16:03 -0700)]
xfs: replace m_rsumsize with m_rsumblocks

Track the RT summary file size in blocks, just like the RT bitmap
file.  While we have users of both units, blocks are used slightly
more often and this matches the bitmap file for consistency.

Signed-off-by: Christoph Hellwig <hch@lst.de>
Reviewed-by: Darrick J. Wong <djwong@kernel.org>
Signed-off-by: Darrick J. Wong <djwong@kernel.org>
11 months agoxfs: remove xfs_{rtbitmap,rtsummary}_wordcount
Christoph Hellwig [Wed, 7 Aug 2024 23:03:57 +0000 (16:03 -0700)]
xfs: remove xfs_{rtbitmap,rtsummary}_wordcount

xfs_rtbitmap_wordcount and xfs_rtsummary_wordcount are currently unused,
so remove them to simplify refactoring other rtbitmap helpers.  They
can be added back or simply open coded when actually needed.

Signed-off-by: Christoph Hellwig <hch@lst.de>
Reviewed-by: Darrick J. Wong <djwong@kernel.org>
Signed-off-by: Darrick J. Wong <djwong@kernel.org>
11 months agoxfs: clean up the ISVALID macro in xfs_bmap_adjacent
Christoph Hellwig [Wed, 7 Aug 2024 23:03:52 +0000 (16:03 -0700)]
xfs: clean up the ISVALID macro in xfs_bmap_adjacent

Turn the  ISVALID macro defined and used inside in xfs_bmap_adjacent
that relies on implict context into a proper inline function.

Signed-off-by: Christoph Hellwig <hch@lst.de>
Reviewed-by: Darrick J. Wong <djwong@kernel.org>
Signed-off-by: Darrick J. Wong <djwong@kernel.org>
11 months agoxfs: simplify xfs_rtalloc_query_range
Christoph Hellwig [Wed, 7 Aug 2024 23:01:58 +0000 (16:01 -0700)]
xfs: simplify xfs_rtalloc_query_range

There isn't much of a good reason to pass the xfs_rtalloc_rec structures
that describe extents to xfs_rtalloc_query_range as we really just want
a lower and upper bound xfs_rtxnum_t.  Pass the rtxnum directly and
simply the interface.

Signed-off-by: Christoph Hellwig <hch@lst.de>
Reviewed-by: Darrick J. Wong <djwong@kernel.org>
Signed-off-by: Darrick J. Wong <djwong@kernel.org>
11 months agoxfs: remove xfs_rtb_to_rtxrem
Christoph Hellwig [Wed, 7 Aug 2024 23:01:58 +0000 (16:01 -0700)]
xfs: remove xfs_rtb_to_rtxrem

Simplify the number of block number conversion helpers by removing
xfs_rtb_to_rtxrem.  Any recent compiler is smart enough to eliminate
the double divisions if using separate xfs_rtb_to_rtx and
xfs_rtb_to_rtxoff calls.

Signed-off-by: Christoph Hellwig <hch@lst.de>
Reviewed-by: Darrick J. Wong <djwong@kernel.org>
Signed-off-by: Darrick J. Wong <djwong@kernel.org>
11 months agoxfs: ensure rtx mask/shift are correct after growfs
Christoph Hellwig [Wed, 7 Aug 2024 23:01:58 +0000 (16:01 -0700)]
xfs: ensure rtx mask/shift are correct after growfs

Source kernel commit: 101587dd747fd4fa4df9688892e3ad86dcebc63b

When growfs sets an extent size, it doesn't updated the m_rtxblklog and
m_rtxblkmask values, which could lead to incorrect usage of them if they
were set before and can't be used for the new extent size.

Add a xfs_mount_sb_set_rextsize helper that updates the two fields, and
also use it when calculating the new RT geometry instead of disabling
the optimization there.

Signed-off-by: Christoph Hellwig <hch@lst.de>
Reviewed-by: Darrick J. Wong <djwong@kernel.org>
Signed-off-by: Darrick J. Wong <djwong@kernel.org>
11 months agoxfs_db: stop using libxfs_rtsummary_wordcount
Christoph Hellwig [Wed, 7 Aug 2024 23:01:59 +0000 (16:01 -0700)]
xfs_db: stop using libxfs_rtsummary_wordcount

Just use the pre-calculated mount value and shift it.

Signed-off-by: Christoph Hellwig <hch@lst.de>
Reviewed-by: Darrick J. Wong <djwong@kernel.org>
Signed-off-by: Darrick J. Wong <djwong@kernel.org>
11 months agoxfs_repair: stop using libxfs_rt{bitmap,summary}_wordcount
Christoph Hellwig [Wed, 7 Aug 2024 23:01:59 +0000 (16:01 -0700)]
xfs_repair: stop using libxfs_rt{bitmap,summary}_wordcount

We can easily calculate the values by just shifting the mount/sb based
sized without the need of these helpers.

Signed-off-by: Christoph Hellwig <hch@lst.de>
Reviewed-by: Darrick J. Wong <djwong@kernel.org>
Signed-off-by: Darrick J. Wong <djwong@kernel.org>
11 months agoxfs_repair: refactor generate_rtinfo
Christoph Hellwig [Wed, 7 Aug 2024 23:01:57 +0000 (16:01 -0700)]
xfs_repair: refactor generate_rtinfo

Move the allocation of the computed values into generate_rtinfo, and thus
make the variables holding them private in rt.c, and clean up a few
formatting nits.

Signed-off-by: Christoph Hellwig <hch@lst.de>
Reviewed-by: Darrick J. Wong <djwong@kernel.org>
[djwong: move functions to fix build errors]
Signed-off-by: Darrick J. Wong <djwong@kernel.org>
11 months agoxfs_repair: stop preallocating blocks in mk_rbmino and mk_rsumino
Christoph Hellwig [Wed, 7 Aug 2024 23:01:57 +0000 (16:01 -0700)]
xfs_repair: stop preallocating blocks in mk_rbmino and mk_rsumino

Now that repair is using libxfs_rtfile_initialize_blocks to write to the
rtbitmap and rtsummary inodes, space allocation is already taken care of
that helper and there is no need to preallocate it.  Remove the code to
do so.

Signed-off-by: Christoph Hellwig <hch@lst.de>
Reviewed-by: Darrick J. Wong <djwong@kernel.org>
Signed-off-by: Darrick J. Wong <djwong@kernel.org>
11 months agoxfs_repair: use libxfs_rtfile_initialize_blocks
Christoph Hellwig [Wed, 7 Aug 2024 23:01:57 +0000 (16:01 -0700)]
xfs_repair: use libxfs_rtfile_initialize_blocks

Use libxfs_rtfile_initialize_blocks to write the re-computed rtbitmap
and rtsummary contents.  This removes duplicate code and prepares for
even more sharing once the rtgroup features adds a metadata header to
the rtbitmap and rtsummary blocks.

Signed-off-by: Christoph Hellwig <hch@lst.de>
Reviewed-by: Darrick J. Wong <djwong@kernel.org>
Signed-off-by: Darrick J. Wong <djwong@kernel.org>
11 months agomkfs: use xfs_rtfile_initialize_blocks
Christoph Hellwig [Wed, 7 Aug 2024 23:01:57 +0000 (16:01 -0700)]
mkfs: use xfs_rtfile_initialize_blocks

Use the new libxfs helper for initializing the rtbitmap/summary files
for rtgroup-enabled file systems.  Also skip the zeroing of the blocks
for rtgroup file systems as we'll overwrite every block instantly.

Signed-off-by: Christoph Hellwig <hch@lst.de>
Reviewed-by: Darrick J. Wong <djwong@kernel.org>
Signed-off-by: Darrick J. Wong <djwong@kernel.org>
11 months agomkfs: remove a pointless rtfreesp_init forward declaration
Christoph Hellwig [Wed, 7 Aug 2024 23:01:56 +0000 (16:01 -0700)]
mkfs: remove a pointless rtfreesp_init forward declaration

Signed-off-by: Christoph Hellwig <hch@lst.de>
Reviewed-by: Darrick J. Wong <djwong@kernel.org>
Signed-off-by: Darrick J. Wong <djwong@kernel.org>
11 months agoxfs_repair: use xfs_validate_rt_geometry
Christoph Hellwig [Wed, 7 Aug 2024 23:01:56 +0000 (16:01 -0700)]
xfs_repair: use xfs_validate_rt_geometry

Use shared libxfs code with the kernel instead of reimplementing it.

Signed-off-by: Christoph Hellwig <hch@lst.de>
Reviewed-by: Darrick J. Wong <djwong@kernel.org>
Signed-off-by: Darrick J. Wong <djwong@kernel.org>
11 months agoxfs_repair: checking rt free space metadata must happen during phase 4
Darrick J. Wong [Wed, 14 Aug 2024 00:23:30 +0000 (17:23 -0700)]
xfs_repair: checking rt free space metadata must happen during phase 4

Back in the really old days, xfs_repair would generate the new free
space information for the realtime section during phase 5, and write the
contents to the rtbitmap and summary files during phase 6.  This was ok
because the incore information isn't used until phase 6.

Then I changed the behavior to check the generated information against
what was on disk and complain about the discrepancies.  Unfortunately,
there was a subtle flaw here -- for a non -n run, we'll have regenerated
the AG metadata before we actually check the rt free space information.
If the AG btree regeneration should clobber one of the old rtbitmap or
summary blocks, this will be reported as a corruption even though
nothing's wrong.

Move check_rtmetadata to the end of phase 4 so that this doesn't happen.

Fixes: f2e388616d74 ("xfs_repair: check free rt extent count")
Signed-off-by: Darrick J. Wong <djwong@kernel.org>
11 months agoxfs: push transaction join out of xfs_rtbitmap_lock and xfs_rtgroup_lock
Christoph Hellwig [Wed, 7 Aug 2024 23:01:56 +0000 (16:01 -0700)]
xfs: push transaction join out of xfs_rtbitmap_lock and xfs_rtgroup_lock

To prepare for being able to join an already locked rtbitmap inode to a
transaction split out separate helpers for joining the transaction from
the locking helpers.

Signed-off-by: Christoph Hellwig <hch@lst.de>
Reviewed-by: Darrick J. Wong <djwong@kernel.org>
Signed-off-by: Darrick J. Wong <djwong@kernel.org>
11 months agoxfs: factor out rtbitmap/summary initialization helpers
Christoph Hellwig [Wed, 7 Aug 2024 23:01:56 +0000 (16:01 -0700)]
xfs: factor out rtbitmap/summary initialization helpers

Add helpers to libxfs that can be shared by growfs and mkfs for
initializing the rtbitmap and summary, and by passing the optional
data pointer also by repair for rebuilding them.  This will become
even more useful when the rtgroups feature adѕ a metadata header
to each block, which means even more shared code.

Signed-off-by: Christoph Hellwig <hch@lst.de>
Reviewed-by: Darrick J. Wong <djwong@kernel.org>
[djwong: minor documentation and data advance tweaks]
Signed-off-by: Darrick J. Wong <djwong@kernel.org>
11 months agoxfs: add bounds checking to xfs_rt{bitmap,summary}_read_buf
Christoph Hellwig [Wed, 7 Aug 2024 23:01:55 +0000 (16:01 -0700)]
xfs: add bounds checking to xfs_rt{bitmap,summary}_read_buf

Add a corruption check for passing an invalid block number, which is a
lot easier to understand than the xfs_bmapi_read failure later on.

Signed-off-by: Christoph Hellwig <hch@lst.de>
Reviewed-by: Darrick J. Wong <djwong@kernel.org>
Signed-off-by: Darrick J. Wong <djwong@kernel.org>
11 months agoxfs: assert a valid limit in xfs_rtfind_forw
Christoph Hellwig [Wed, 7 Aug 2024 23:01:55 +0000 (16:01 -0700)]
xfs: assert a valid limit in xfs_rtfind_forw

Protect against developers passing stupid limits when refactoring the
RT code once again.

Signed-off-by: Christoph Hellwig <hch@lst.de>
Reviewed-by: Darrick J. Wong <djwong@kernel.org>
Signed-off-by: Darrick J. Wong <djwong@kernel.org>
11 months agoxfs: remove the limit argument to xfs_rtfind_back
Christoph Hellwig [Wed, 7 Aug 2024 23:01:55 +0000 (16:01 -0700)]
xfs: remove the limit argument to xfs_rtfind_back

All callers pass a 0 limit to xfs_rtfind_back, so remove the argument
and hard code it.

Signed-off-by: Christoph Hellwig <hch@lst.de>
Reviewed-by: Darrick J. Wong <djwong@kernel.org>
Signed-off-by: Darrick J. Wong <djwong@kernel.org>
11 months agoxfs: factor out a xfs_validate_rt_geometry helper
Christoph Hellwig [Wed, 7 Aug 2024 23:01:55 +0000 (16:01 -0700)]
xfs: factor out a xfs_validate_rt_geometry helper

Split the RT geometry validation in the early mount code into a
helper than can be reused by repair (from which this code was
apparently originally stolen anyway).

Signed-off-by: Christoph Hellwig <hch@lst.de>
Reviewed-by: Darrick J. Wong <djwong@kernel.org>
[djwong: u64 return value for calc_rbmblocks]
Signed-off-by: Darrick J. Wong <djwong@kernel.org>
11 months agoxfs: remove xfs_validate_rtextents
Christoph Hellwig [Wed, 7 Aug 2024 23:01:54 +0000 (16:01 -0700)]
xfs: remove xfs_validate_rtextents

Replace xfs_validate_rtextents with an open coded check for 0
rtextents.  The name for the function implies it does a lot more
than a zero check, which is more obvious when open coded.

Signed-off-by: Christoph Hellwig <hch@lst.de>
Reviewed-by: Darrick J. Wong <djwong@kernel.org>
Signed-off-by: Darrick J. Wong <djwong@kernel.org>
11 months agoxfs_db: allow setting current address to log blocks
Darrick J. Wong [Wed, 7 Aug 2024 22:54:09 +0000 (15:54 -0700)]
xfs_db: allow setting current address to log blocks

Add commands so that users can target blocks on an external log device.

Signed-off-by: Darrick J. Wong <djwong@kernel.org>
11 months agoxfs_db: convert rtsummary geometry
Darrick J. Wong [Wed, 7 Aug 2024 22:54:09 +0000 (15:54 -0700)]
xfs_db: convert rtsummary geometry

Teach the rtconvert command to be able to convert realtime blocks and
extents to locations within the rt summary.

Signed-off-by: Darrick J. Wong <djwong@kernel.org>
11 months agoxfs_db: convert rtbitmap geometry
Darrick J. Wong [Wed, 7 Aug 2024 22:54:09 +0000 (15:54 -0700)]
xfs_db: convert rtbitmap geometry

Teach the rtconvert command to be able to convert realtime blocks and
extents to locations within the rt bitmap.

Signed-off-by: Darrick J. Wong <djwong@kernel.org>
11 months agoxfs_db: enable conversion of rt space units
Darrick J. Wong [Wed, 7 Aug 2024 22:54:09 +0000 (15:54 -0700)]
xfs_db: enable conversion of rt space units

Teach the xfs_db convert function about rt extents, rt block numbers,
and how to compute offsets within the rt bitmap and summary files.

Signed-off-by: Darrick J. Wong <djwong@kernel.org>
11 months agoxfs_db: access arbitrary realtime blocks and extents
Darrick J. Wong [Wed, 7 Aug 2024 22:54:08 +0000 (15:54 -0700)]
xfs_db: access arbitrary realtime blocks and extents

Add two commands to xfs_db so that we can point ourselves at any
arbitrary realtime block or extent.

Signed-off-by: Darrick J. Wong <djwong@kernel.org>
11 months agoxfs_db: access realtime file blocks
Darrick J. Wong [Wed, 7 Aug 2024 22:54:08 +0000 (15:54 -0700)]
xfs_db: access realtime file blocks

Now that we have the ability to point the io cursor at the realtime
device, let's make it so that the "dblock" command can walk the contents
of realtime files.

Signed-off-by: Darrick J. Wong <djwong@kernel.org>
11 months agoxfs_db: make the daddr command target the realtime device
Darrick J. Wong [Wed, 7 Aug 2024 22:54:08 +0000 (15:54 -0700)]
xfs_db: make the daddr command target the realtime device

Make it so that users can issue the command "daddr -r XXX" to select
disk block XXX on the realtime device.

Signed-off-by: Darrick J. Wong <djwong@kernel.org>