Add source archive #294

Merged: probably-jaden merged 1 commit into main from source-archive-clean on Jun 25, 2026
Conversation

probably-jaden (Contributor) commented Jun 25, 2026

Adds the source_archive package: captures HTML + screenshot + markdown for the URLs a forecasting bot cited and stores them (S3 or local) with provenance, deduplicated by url + content-hash. Heavy backends (Playwright, Firecrawl, etc.) are an optional source-archive extra.
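As a rough illustration of "deduplicated by url + content-hash", the sketch below keys each capture on the URL plus a digest of the captured bytes, so re-archiving an unchanged page is a no-op while a changed page gets a new record. It is not the package's actual code: `LocalArchive`, `store`, `content_hash`, the `cited_by` provenance field, and the choice of SHA-256 are all assumptions made for the example.

```python
import hashlib
from dataclasses import dataclass, field
from datetime import datetime, timezone


def content_hash(data: bytes) -> str:
    """Hex digest of the captured bytes (SHA-256 is an assumption here)."""
    return hashlib.sha256(data).hexdigest()


@dataclass
class LocalArchive:
    """Hypothetical in-memory stand-in for the S3 or local store."""

    records: dict = field(default_factory=dict)

    def store(self, url: str, html: bytes, cited_by: str) -> tuple[str, bool]:
        """Store a capture; return (key, True) if new, (key, False) if duplicate."""
        digest = content_hash(html)
        key = f"{url}#{digest}"  # dedup key: url + content hash
        if key in self.records:
            return key, False  # same URL, same content: skip
        self.records[key] = {
            "url": url,
            "sha256": digest,
            "cited_by": cited_by,  # hypothetical provenance field
            "captured_at": datetime.now(timezone.utc).isoformat(),
            "html": html,
        }
        return key, True
```

Under this scheme the same URL can appear more than once in the archive, once per distinct version of its content, which is what lets a later reader see the page as it was when the bot cited it.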

Capture HTML + screenshot + markdown for the URLs a forecasting bot cited and
store them (S3 or local) with provenance, deduplicated by url + content-hash.
Heavy backends are an optional `source-archive` extra.

Co-Authored-By: Claude Opus 4.8 <noreply@anthropic.com>
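One common way to keep heavy backends optional is to check for the backend module at call time and fail with a hint that names the extra. The sketch below shows that pattern with the standard library only. The helper names `backend_available` and `require_backend` are hypothetical, and whether the package guards its imports this way is an assumption.

```python
import importlib.util


def backend_available(module_name: str) -> bool:
    """Return True if a top-level module (e.g. "playwright") can be imported."""
    return importlib.util.find_spec(module_name) is not None


def require_backend(module_name: str) -> None:
    """Raise a descriptive ImportError when an optional backend is missing."""
    if not backend_available(module_name):
        raise ImportError(
            f"Optional backend '{module_name}' is not installed; "
            "install the `source-archive` extra to enable it."
        )
```

Calling `require_backend("playwright")` at the top of a capture function would then surface a clear message instead of a bare import failure deep in the call stack.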
probably-jaden self-assigned this Jun 25, 2026
probably-jaden marked this pull request as ready for review June 25, 2026 22:42
probably-jaden merged commit 30fa3f1 into main Jun 25, 2026
2 checks passed
probably-jaden deleted the source-archive-clean branch June 25, 2026 22:43