HDDS-15105. Revise snapshot defrag design doc and add implementation details by smengcl · Pull Request #10119 · apache/ozone

smengcl · 2026-04-24T06:26:18Z

Generated-by: GPT-5.5

What changes were proposed in this pull request?

Revise snapshot defrag design doc. And add implementation details. GPT-5.5 generated most and I tweaked some.

Current design doc was merged in #8514

What is the link to the Apache JIRA

https://issues.apache.org/jira/browse/HDDS-15105

How was this patch tested?

n/a

…details Generated-by: GPT-5.5

smengcl · 2026-04-24T06:35:22Z

I eyeballed it and looks good. Inaccuraties are corrected in the process. Need to at least give it another pass before this can be merged.

jojochuang

This looks like a complete rewrite and hard to know what exactly was added.

smengcl · 2026-04-24T22:53:57Z

This looks like a complete rewrite and hard to know what exactly was added.

That's fair. I should convert it into a user doc

smengcl · 2026-04-24T23:25:05Z

Uh oh, I just realized that design doc should be under hadoop-hdds/docs/content/design/ . But the current design doc is already merged under hadoop-hdds/docs/content/feature/ . When I try to move the original design doc to the correct place while adding the user feature doc, it made the diff more confusing. Let me fix that by splitting the change into two PRs.

This reverts commit 1e89d36.

This reverts commit 935dda7.

Copilot

Pull request overview

This PR revises the Snapshot Defragmentation documentation to align the original design proposal with the current implementation details in Ozone (local YAML metadata, locking, on-disk layout, and service workflows).

Changes:

Document implemented snapshot-local YAML metadata (OmSnapshotLocalData) and how defrag state/versioning is tracked locally (not via Ratis SnapshotInfo).
Update the described locking, directory layout, diff computation paths, and purge/cleanup behavior to match current code.
Refresh the workflow and mermaid diagrams to reflect the implemented defrag flow.

💡 Add Copilot custom instructions for smarter, more guided reviews. Learn how to get started.

Copilot · 2026-04-24T23:52:39Z

+structure and calls out where the implemented behavior differs from the initial
+proposal. For the current user-facing behavior, configuration, and on-disk
+layout, see
+[Snapshot Defragmentation]({{< ref "feature/Snapshot-Defragmentation.md" >}}).


The Hugo ref points to feature/Snapshot-Defragmentation.md, but there is no such doc under hadoop-hdds/docs/content/feature/ (only SnapshotDefragmentation.md). This will render as a broken link; update the reference to an existing page (or add the missing target doc).

Suggested change

[Snapshot Defragmentation]({{< ref "feature/Snapshot-Defragmentation.md" >}}).

[Snapshot Defragmentation]({{< ref "feature/SnapshotDefragmentation.md" >}}).

Copilot · 2026-04-24T23:52:40Z

+1. **Acquire the bootstrap read lock** and load `SnapshotInfo` plus local YAML.
+2. **Create a temporary checkpoint** in `tmp_defrag`.
+   * If this is the first snapshot in the bucket path chain, checkpoint the
+     current snapshot.
+   * Otherwise, checkpoint the current version of the path previous snapshot.
+3. **Drop non-incremental column families** from the temporary checkpoint. They
+   are reloaded from the current snapshot later.
+4. **Defrag tracked tables** (`keyTable`, `fileTable`, `directoryTable`).
+   * For the first snapshot in the path chain, perform full defrag by deleting
+     ranges outside the bucket prefix and compacting the tracked tables.
+   * For later snapshots, compute incremental delta SSTs, write changed records
+     or tombstones into temporary SST files, and ingest those files.
+5. **Acquire `SNAPSHOT_DB_CONTENT_LOCK`** for the current snapshot. Snapshot
+   reads and deep-clean writes follow the same DAG-based lock hierarchy.
+6. **Reload non-incremental tables** from the current snapshot into the
+   checkpoint.
+7. **Move the checkpoint directory** to
+   `db.snapshots/checkpointState/om.db-<snapshot_id>-<next_version>`.
+8. **Update local YAML**, incrementing `version`, adding the new
+   `versionSstFileInfos` entry, and clearing `needsDefrag`.
+9. **Delete older checkpoint directories** for the same snapshot under
+   `SNAPSHOT_DB_CONTENT_LOCK` and the snapshot DB cache write lock. Older YAML
+   version metadata is removed later by `OmSnapshotLocalDataManagerService`
+   only when no local version depends on it.
+10. **Release `SNAPSHOT_DB_CONTENT_LOCK`**.


Markdown list nesting looks inconsistent in the updated workflow section: the 10-step numbered list (starting with "1. Acquire the bootstrap read lock") is not indented under item "5. ### Snapshot Defragmentation Workflow". This will typically render as a separate top-level list (and can restart numbering unexpectedly). Consider indenting the sub-steps to be children of item 5, or convert item 5 into its own heading (not part of the outer numbered list) to avoid ambiguous rendering.

Suggested change

1. **Acquire the bootstrap read lock** and load `SnapshotInfo` plus local YAML.

2. **Create a temporary checkpoint** in `tmp_defrag`.

* If this is the first snapshot in the bucket path chain, checkpoint the

current snapshot.

* Otherwise, checkpoint the current version of the path previous snapshot.

3. **Drop non-incremental column families** from the temporary checkpoint. They

are reloaded from the current snapshot later.

4. **Defrag tracked tables** (`keyTable`, `fileTable`, `directoryTable`).

* For the first snapshot in the path chain, perform full defrag by deleting

ranges outside the bucket prefix and compacting the tracked tables.

* For later snapshots, compute incremental delta SSTs, write changed records

or tombstones into temporary SST files, and ingest those files.

5. **Acquire `SNAPSHOT_DB_CONTENT_LOCK`** for the current snapshot. Snapshot

reads and deep-clean writes follow the same DAG-based lock hierarchy.

6. **Reload non-incremental tables** from the current snapshot into the

checkpoint.

7. **Move the checkpoint directory** to

`db.snapshots/checkpointState/om.db-<snapshot_id>-<next_version>`.

8. **Update local YAML**, incrementing `version`, adding the new

`versionSstFileInfos` entry, and clearing `needsDefrag`.

9. **Delete older checkpoint directories** for the same snapshot under

`SNAPSHOT_DB_CONTENT_LOCK` and the snapshot DB cache write lock. Older YAML

version metadata is removed later by `OmSnapshotLocalDataManagerService`

only when no local version depends on it.

10. **Release `SNAPSHOT_DB_CONTENT_LOCK`**.

1. **Acquire the bootstrap read lock** and load `SnapshotInfo` plus local YAML.

2. **Create a temporary checkpoint** in `tmp_defrag`.

* If this is the first snapshot in the bucket path chain, checkpoint the

current snapshot.

* Otherwise, checkpoint the current version of the path previous snapshot.

3. **Drop non-incremental column families** from the temporary checkpoint. They

are reloaded from the current snapshot later.

4. **Defrag tracked tables** (`keyTable`, `fileTable`, `directoryTable`).

* For the first snapshot in the path chain, perform full defrag by deleting

ranges outside the bucket prefix and compacting the tracked tables.

* For later snapshots, compute incremental delta SSTs, write changed records

or tombstones into temporary SST files, and ingest those files.

5. **Acquire `SNAPSHOT_DB_CONTENT_LOCK`** for the current snapshot. Snapshot

reads and deep-clean writes follow the same DAG-based lock hierarchy.

6. **Reload non-incremental tables** from the current snapshot into the

checkpoint.

7. **Move the checkpoint directory** to

`db.snapshots/checkpointState/om.db-<snapshot_id>-<next_version>`.

8. **Update local YAML**, incrementing `version`, adding the new

`versionSstFileInfos` entry, and clearing `needsDefrag`.

9. **Delete older checkpoint directories** for the same snapshot under

`SNAPSHOT_DB_CONTENT_LOCK` and the snapshot DB cache write lock. Older YAML

version metadata is removed later by `OmSnapshotLocalDataManagerService`

only when no local version depends on it.

10. **Release `SNAPSHOT_DB_CONTENT_LOCK`**.

jojochuang

a few future sidenotes but the doc itself is good. Let's merge it.

jojochuang · 2026-04-27T20:19:47Z

-   Snapshots currently reside under `db.snapshots/checkpointState/` directory. The proposal introduces a `db.snapshots/checkpointStateDefragged/` directory for defragged snapshots. The directory format should be as follows:
+   Snapshots reside under the OM metadata directory in
+   `db.snapshots/checkpointState/`. The implementation does not use a separate
+   `checkpointStateDefragged` directory. Defragged versions are sibling


btw, this is no longer used checkpointStateDefragged / OM_SNAPSHOT_CHECKPOINT_DEFRAGGED_DIR
consider removing it in the future.

jojochuang · 2026-04-27T20:22:32Z

+   The important YAML fields are `snapshotId`, `previousSnapshotId`, `version`,
+   `needsDefrag`, `versionSstFileInfos`, `dbTxSequenceNumber`,
+   `transactionInfo`, `lastDefragTime`, `checksum`, and `isSSTFiltered`.
+   `lastDefragTime` is serialized, but current defrag decisions are based on


maybe somewhere in the code we want to mark lastDefragTime deprecated.

smengcl added 2 commits April 23, 2026 22:42

HDDS-15105. Revise snapshot defrag design doc and add implementation …

c5863ab

…details Generated-by: GPT-5.5

Clarification

8301932

smengcl added AI-gen documentation Improvements or additions to documentation labels Apr 24, 2026

jojochuang reviewed Apr 24, 2026

View reviewed changes

smengcl added 2 commits April 24, 2026 16:12

Convert new revision into user doc; restore design doc structure

82f5e93

Design doc should live under hadoop-hdds/docs/content/design/

1e89d36

smengcl added 4 commits April 24, 2026 16:39

Revert "Design doc should live under hadoop-hdds/docs/content/design/"

9eb8a15

This reverts commit 1e89d36.

Remove user doc (feature doc) to be added in a follow-up PR

d06b16e

Wording

a391afc

Clean up

b458e62

smengcl marked this pull request as ready for review April 24, 2026 23:49

Copilot AI review requested due to automatic review settings April 24, 2026 23:49

Copilot started reviewing on behalf of smengcl April 24, 2026 23:50 View session

smengcl added 2 commits April 24, 2026 16:50

Move it under hadoop-hdds/docs/content/design/

935dda7

Revert "Move it under hadoop-hdds/docs/content/design/"

00aecf2

This reverts commit 935dda7.

Copilot AI reviewed Apr 24, 2026

View reviewed changes

smengcl commented Apr 24, 2026

View reviewed changes

Comment thread hadoop-hdds/docs/content/feature/SnapshotDefragmentation.md Outdated

smengcl mentioned this pull request Apr 25, 2026

HDDS-15113. Add snapshot defrag feature doc #10131

Draft

1 task

jojochuang added the snapshot https://issues.apache.org/jira/browse/HDDS-6517 label Apr 27, 2026

Make CI happy

a975a32

jojochuang approved these changes Apr 27, 2026

View reviewed changes

jojochuang reviewed Apr 27, 2026

View reviewed changes

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

HDDS-15105. Revise snapshot defrag design doc and add implementation details#10119

HDDS-15105. Revise snapshot defrag design doc and add implementation details#10119
smengcl wants to merge 11 commits intoapache:masterfrom
smengcl:HDDS-15105-revise-defrag-doc

smengcl commented Apr 24, 2026

Uh oh!

smengcl commented Apr 24, 2026

Uh oh!

jojochuang left a comment

Uh oh!

smengcl commented Apr 24, 2026

Uh oh!

smengcl commented Apr 24, 2026

Uh oh!

Copilot AI left a comment

Uh oh!

Copilot AI Apr 24, 2026

Uh oh!

Copilot AI Apr 24, 2026

Uh oh!

Uh oh!

jojochuang left a comment

Uh oh!

jojochuang Apr 27, 2026

Uh oh!

jojochuang Apr 27, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

	[Snapshot Defragmentation]({{< ref "feature/Snapshot-Defragmentation.md" >}}).
	[Snapshot Defragmentation]({{< ref "feature/SnapshotDefragmentation.md" >}}).

Conversation

smengcl commented Apr 24, 2026

What changes were proposed in this pull request?

What is the link to the Apache JIRA

How was this patch tested?

Uh oh!

smengcl commented Apr 24, 2026

Uh oh!

jojochuang left a comment

Choose a reason for hiding this comment

Uh oh!

smengcl commented Apr 24, 2026

Uh oh!

smengcl commented Apr 24, 2026

Uh oh!

Copilot AI left a comment

Choose a reason for hiding this comment

Pull request overview

Uh oh!

Copilot AI Apr 24, 2026

Choose a reason for hiding this comment

Uh oh!

Copilot AI Apr 24, 2026

Choose a reason for hiding this comment

Uh oh!

Uh oh!

jojochuang left a comment

Choose a reason for hiding this comment

Uh oh!

jojochuang Apr 27, 2026

Choose a reason for hiding this comment

Uh oh!

jojochuang Apr 27, 2026

Choose a reason for hiding this comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants