Skip to content

Latest commit

 

History

History
110 lines (87 loc) · 4.87 KB

File metadata and controls

110 lines (87 loc) · 4.87 KB

Source Registry

This file records source families for the stable research release. A source listed here is not automatically trusted; every event still carries its own source_url and confidence.

Candidate Official Sources

Candidate Issuer Sources

  • Issuer investor-relations pages, SEC filings, and company press releases.
  • Use issuer_release and keep the direct stable URL in source_url.

Candidate Lead Sources

  • Reputable financial media can be used as low-confidence leads only. Leads should be upgraded only after matching official filings, official remarks, issuer releases, or other primary material.

Deferred Sources

The stable release intentionally excludes X / Twitter, Truth Social, and Longbridge community/profile/following-list ingestion. They can be revisited later only if a stable official interface and clear operating policy exist.

Current Configured RSS Feeds

config/free_rss_feeds.csv keeps the stable default set limited to official feeds that can be replayed without login:

  • White House presidential actions
  • SEC press releases
  • SEC speeches and statements
  • Federal Reserve press releases
  • Federal Reserve speeches and testimony
  • NIST general, cybersecurity, and energy news
  • CISA news

FTC RSS URLs are documented above as candidates, but are not in the default CSV because the feed returned 403 to the current Python fetcher during verification.

data/live/source_manifest.json records source row counts, feed health, covered symbols, confidence counts, and event-type counts for every published live run.

Alias Coverage Notes

config/core_us_equity_aliases.csv includes targeted aliases for the current AI and data-center names that often appear through products or infrastructure terms, including MU, INTC, AMD, DELL, and VRT. Alias updates should prefer company names, product names, or clear infrastructure phrases over very broad hype terms.

Confidence Guide

  • high: official government source or direct filing with stable URL.
  • medium: issuer release or other primary issuer material.
  • low: financial-media lead pending primary source review.

Cross-Sector Coverage Boundary

Stable source coverage should remain cross-sector and should not be limited to AI or the current market hot list. The source layer can collect official evidence for semiconductors, data-center power, cybersecurity, defense, energy, financials, healthcare, consumer platforms, industrials, crypto infrastructure, and EV/auto themes when a durable primary source exists.

Theme membership and long-horizon semantic bias are owned by ResearchSignalContextPipelines. This repository only emits point-in-time source items and events with URLs, dates, source type, and confidence. A source item must not be promoted just because a theme or symbol is popular in the current market.

Long-Horizon Evidence Boundary

Long-horizon recommendations in QuantAdvisorResearch should not depend on this repository alone. This repository can improve confidence when durable primary evidence exists, such as SEC filings, issuer releases, official procurement, Federal Register notices, agency grants, or official policy documents. If those events are absent, the Advisor may still rank a symbol through theme context and momentum; that should be interpreted as low event/news support rather than an ingestion failure in this repository.