PHOENIX-7820 ConnectionQueryServicesImpl.createSnapshot bounded retry on transient exception #2438

Open
apurtell wants to merge 1 commit into apache:master from apurtell:PHOENIX-7820
apurtell (Contributor) commented May 1, 2026

ConnectionQueryServicesImpl.createSnapshot() invokes admin.snapshot() during the Phoenix upgrade path. Because the call is not retried, transient HMaster conditions surface as upgrade failures. The master's per-table lock can be briefly held by a concurrent admin operation, or RPC-level retries can resubmit a snapshot request that was already accepted, causing the master to reject the duplicate. The fix wraps the snapshot call in a small bounded retry loop (5 attempts, 1 s backoff).
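
For readers who want the shape of such a loop without opening the diff, here is a minimal, self-contained sketch. It is not the code from this patch: `BoundedRetrySketch`, `SnapshotCall`, and `runWithRetry` are hypothetical names, `SnapshotCall` merely stands in for the `admin.snapshot()` invocation, and the sketch retries on any `IOException`, whereas the actual change may be more selective about which exceptions count as transient.

```java
import java.io.IOException;
import java.util.concurrent.atomic.AtomicInteger;

public class BoundedRetrySketch {

    /** Hypothetical stand-in for the admin.snapshot() call; not an HBase or Phoenix type. */
    @FunctionalInterface
    public interface SnapshotCall {
        void run() throws IOException;
    }

    // Values taken from the PR description: 5 attempts, 1 s backoff.
    static final int MAX_ATTEMPTS = 5;
    static final long BACKOFF_MS = 1000L;

    /**
     * Runs the call up to maxAttempts times, sleeping backoffMs between failed attempts.
     * Returns the attempt number that succeeded, or rethrows the last failure.
     * Assumption: every IOException is treated as retryable in this sketch.
     */
    public static int runWithRetry(SnapshotCall call, int maxAttempts, long backoffMs)
            throws IOException {
        if (maxAttempts < 1) {
            throw new IllegalArgumentException("maxAttempts must be at least 1");
        }
        IOException last = null;
        for (int attempt = 1; attempt <= maxAttempts; attempt++) {
            try {
                call.run();
                return attempt;
            } catch (IOException e) {
                last = e;
                if (attempt == maxAttempts) {
                    break; // out of attempts, no point sleeping again
                }
                try {
                    Thread.sleep(backoffMs);
                } catch (InterruptedException ie) {
                    // Restore the interrupt flag and stop retrying.
                    Thread.currentThread().interrupt();
                    IOException wrapped =
                        new IOException("Interrupted while waiting to retry", ie);
                    wrapped.addSuppressed(e);
                    throw wrapped;
                }
            }
        }
        throw last;
    }

    public static void main(String[] args) throws IOException {
        // Simulate two transient failures followed by success.
        AtomicInteger calls = new AtomicInteger();
        int attempts = runWithRetry(() -> {
            if (calls.incrementAndGet() < 3) {
                throw new IOException("simulated transient failure");
            }
        }, MAX_ATTEMPTS, 10L); // short backoff so the demo finishes quickly
        System.out.println("succeeded on attempt " + attempts);
    }
}
```

Bounding the loop keeps a persistent failure from stalling the upgrade indefinitely: after the last attempt the original exception propagates to the caller unchanged.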
