Surface untested source files at 0% coverage in reports by olivembo · Pull Request #9 · eclipse-score/score_cpp_policies

olivembo · 2026-06-23T09:46:38Z

Summary

llvm-cov only reports files whose object files are linked into at least one test.

Source files that exist in the workspace but no cc_test pulls in silently disappear from the coverage report.

This PR adds a Bazel aspect that walks the dependency graph to collect all C/C++ sources, compares them against what llvm-cov actually reported, and augments the LCOV, HTML, and text outputs with synthetic 0%-coverage entries for the missing files.

The coverage pipeline (merger, reporter, justify, effective_coverage) was originally developed by Jan Schlosser in eclipse-score/communication#549.
This PR centralizes it in score_cpp_policies, adds the baseline-coverage aspect, and improves the reporter.

Approach

Why an aspect + manifest instead of patching `--instrumentation_filter`?

instrumentation_filter controls which targets get built with coverage instrumentation, but llvm-cov still only reports files whose .o is linked into a test binary. There is no Bazel-native way to surface the gap. The aspect walks deps/srcs/implementation_deps transitively and writes a manifest of all reachable .cpp/.cc/.cxx/.c files. The reporter then diffs manifest vs. LCOV output.

Why heuristic line counts instead of exact numbers?

Without running llvm-cov against actual instrumented object files (which don't exist for untested sources), we can't get exact instrumentable line counts. The heuristic (_count_instrumentable_lines) filters blank lines, comments, preprocessor directives, lone braces, and namespace declarations. All outputs explicitly label these as estimates
(~N, "estimated via heuristic") to avoid false precision. The TOTALS line in summary.txt is intentionally left untouched. Only a WARNING banner is appended.

Known limitations (tracked for follow-up)

Baseline coverage / rules_cc gap: Proper baseline generation requires upstream support in rules_cc first. Current heuristic line counts are estimates.
QNX: Coverage on QNX may pull in Rust coverage data; how to separate these is still an open question.
constexpr functions: These appear uncovered due to how llvm-cov handles them.
toolchains_llvm: Once the above are resolved, contributing upstream there is the intended next step.

What changed

coverage/defs.bzl: _collect_sources_aspect, score_instrumented_sources_manifest rule, instrumented_sources_manifest parameter on score_coverage_reporter macro
coverage/reporter.py: LCOV augmentation, HTML augmentation (top-banner + per-file pages + detail table), text summary banner, workspace-bounds check, HTML escaping including quotes
coverage/BUILD.bazel: reporter_lib py_library for testability
tests/coverage/: uncovered.cpp/.h fixture (library with no test), reporter_test.py with unit tests for all augmentation helpers including path-traversal rejection

Consumer usage

load("@score_cpp_policies//coverage:defs.bzl",
     "score_coverage_reporter", "score_instrumented_sources_manifest")

score_instrumented_sources_manifest(
    name = "instrumented_sources",
    targets = ["//src:mylib"],
)

score_coverage_reporter(
    name = "reporter_wrapper",
    llvm_cov = "@llvm_toolchain//:llvm-cov",
    llvm_profdata = "@llvm_toolchain//:llvm-profdata",
    instrumented_sources_manifest = ":instrumented_sources",
)

Introduces a reusable coverage toolchain based on llvm-cov: - coverage/merger.py: per-test profraw -> profdata + object file packaging - coverage/reporter.py: cross-test aggregation, HTML/LCOV/text reports - coverage/effective_coverage.py: justification overlay + effective metrics - coverage/justify.py: justification manifest resolution - coverage/defs.bzl: score_coverage_reporter macro for consumer wiring - coverage/coverage.bazelrc: shared coverage flags - coverage/filter_regexes.txt: baseline source exclusions - coverage/generate_coverage_html.sh: convenience entry point Adds an end-to-end example under tests/coverage exercising the pipeline with a small instrumented library, test, justification file, and consumer filter regexes. Known limitation: source files not linked into any cc_test are not yet included in the report (no instrumented object file -> invisible to llvm-cov).

llvm-cov only reports files linked into at least one test binary. Sources that exist in the workspace but no test pulls in silently disappear from the report, causing coverage to appear higher than it actually is. Three mechanisms work together to fix this: Bazel aspect + manifest rule (coverage/defs.bzl): _collect_sources_aspect walks the dependency graph of all configured targets and collects C/C++ source files. score_instrumented_sources_manifest writes a workspace-relative path-per-line manifest. score_coverage_reporter gains an optional instrumented_sources_manifest parameter that passes the manifest to the reporter via --instrumented_sources_manifest. Reporter augmentation (coverage/reporter.py): After llvm-cov export, the reporter compares the manifest against covered sources from the LCOV output. For each missing file a synthetic 0%-coverage LCOV record is appended (SF + DA per non-blank line + LF/LH). The llvm-cov text summary TOTALS line is updated in-place using re.finditer to preserve fixed-width column alignment. Per-file HTML pages and a "Not Linked Into Tests" index section are generated for visibility. Correctness and security fixes applied during review: - Use str.replace() instead of str.format() for HTML template rendering so that { and } in C++ source bodies do not crash the reporter with KeyError/ValueError. - Separate stderr from stdout for run_llvm_cov_export and run_llvm_cov_report (separate_stderr=True) so that llvm-cov warning messages are not mixed into LCOV/summary output. - Validate that resolved manifest paths stay within workspace_root via Path.is_relative_to() before reading files. - Extend _escape_html to cover ' (') and " (") so that file paths with apostrophes do not break HTML attributes. - Count only non-blank lines for LF in synthetic LCOV records to avoid inflating the denominator in aggregate metrics. Test fixture (tests/coverage/uncovered.cpp, uncovered.h, BUILD.bazel): cc_library intentionally not linked into any cc_test. Verifies that the reporter surfaces the file at 0% coverage rather than omitting it. Docs (coverage/README.md): new section 5a with usage example.

- Use heuristic to identify instrumentable lines instead of counting all non-blank lines. Filters comments, preprocessor directives, lone braces, namespace declarations, and access specifiers to avoid inflating LF values in synthetic LCOV records. - Augment summary.txt and console output with untested file line counts so the visible TOTALS reflect the true coverage including 0%-files. - Parse the llvm-cov column header to determine the Lines group index dynamically instead of hardcoding position 1. - Add workspace-bounds check after resolve() in _find_untested_sources to prevent path traversal via symlinks. - Escape single and double quotes in _escape_html to prevent attribute breakout in generated HTML pages.

- Narrow _NON_EXECUTABLE_RE block-comment pattern from `|\*.*` to `|\*(?:[/\s].*)?` so that pointer dereferences (`*ptr = value;`) are correctly classified as executable. - Add py_test with unit tests for all reporter augmentation helpers: _is_likely_executable, _count_instrumentable_lines, _covered_sources_from_lcov, _find_untested_sources, _append_zero_coverage_lcov, _augment_text_summary, _escape_html. Includes a path-traversal rejection test for _find_untested_sources.

- Add py_library target for reporter so unit tests can import it. - Remove coverable_test from instrumented_sources manifest targets to avoid testonly dependency violation (test sources don't need to appear in the manifest — they're tested by definition).

The heuristic line count (_count_instrumentable_lines) cannot replicate what llvm-cov would report for actually-instrumented objects. Rewriting TOTALS with approximate numbers gives false precision. - _augment_text_summary: no longer rewrites the TOTALS line; appends a clearly-labelled WARNING banner with ~N estimated lines instead. - _inject_untested_section_into_index: injects a prominent banner right after <body> so it is the first thing reviewers see. Detail table uses ~N notation and includes a disclaimer about the heuristic. - _append_zero_coverage_lcov docstring documents the approximation. - Tests updated to match banner-only behavior.

nradakovic

To be honest. I will need a lot of time (and coffee) to do proper review. Hopefully by Monday I will have feedback.

olivembo · 2026-06-25T15:50:29Z

To be honest. I will need a lot of time (and coffee) to do proper review. Hopefully by Monday I will have feedback.

Thanks for the quick feedback Nikola.

In fact, I took most of the code from this PR in the communication repository. Additionally, I fixed that files without unittests are shown in the report so not a higher coverage is show that is actually the case.

Maybe for the review you seek help in the communication team.

Happy review and hopefully good ☕😉

nradakovic · 2026-06-25T16:05:16Z

To be honest. I will need a lot of time (and coffee) to do proper review. Hopefully by Monday I will have feedback.

Thanks for the quick feedback Nikola.

In fact, I took most of the code from this PR in the communication repository. Additionally, I fixed that files without unittests are shown in the report so not a higher coverage is show that is actually the case.

Maybe for the review you seek help in the communication team.

Happy review and hopefully good ☕😉

And this is also another problem. I'll for sure have some questions why certain implementation blocks are done in such way, but since communication PR is already merged (and in use?) not sure if I want to dive into large discussion and diverge from original PR. Anyway, let's see once I have concrete feedback. If something is on critical path, I will request that communication change their implementation.

nradakovic · 2026-06-25T16:27:41Z

So something that I see by just looking at. This implementation relies on a generated Bash wrapper to reconstruct Bazel runfiles and workspace paths before invoking the actual reporter. This feels a bit non-Bazel-native. Can we instead model this as an executable Starlark rule, or move runfiles/path resolution into the reporter binary using Bazel runfiles libraries?

nradakovic

For start I left 2 comments. It's enough to discuss Bazel implementation blocks (I still haven't check Python scripts) but I think they look OK. Have to do better look.

I see two paths for this PR to get merged:

I will approve as is but only if we have plan to improve current implementation (tickets to track it with clear plan how to improve it). What communication guys did is on them to support it. This will be untilized by every single module in S-CORE so we do not want to find nasty surprises 2 days before release of S-CORE.
We start now changing blocks but this will require some time and probably we will need to include more people who have more exp. with Bazel and code coverage.

@RSingh1511 Please provide your thoughts on this as well.

nradakovic · 2026-06-25T19:48:34Z

+reporter can add 0%-coverage entries for files that no test linked against.""",
+)
+
+def score_coverage_reporter(


As I already said in my previous comment, this simply looks wrong from Bazel native perspective. This is one of the biggest anti-patterns in Bazel and people very often fall into this trap which later brings nothing more than pain in the head. Genrules are for simple content transformation (cat, grep, sed), not what we have here.

The solution here would be to encapsulate all this into custom rules with executable python binaries + Python's Runfiles API as dependency.

Then we would benefint in multiple fronts:

Escaping horrible complexity with genrule (\\$$, $$, $)

Increasing reproducibility since shell has differences per OS.

Increasing debug capabilites.

Thanks for the feedback. Bazel is a b...east 🤪 ... I still try to tame it 😉

Addressed with d762cc5

nradakovic · 2026-06-25T19:51:17Z

+# file:
+#
+#   coverage --instrumentation_filter="^//<your_top_level_package>[/:]"
+#   coverage --coverage_report_generator=//<your_dir>:reporter_wrapper


And here is where things become fun. What if reference integration wants to execute code coverage which is in the end overall plan. How reference integration will set reporter for different modules? For start, everything that needs to be set in module .bazelrc file needs to be implemented over module extention (if possible).

RSingh1511 · 2026-06-26T04:23:18Z

Hello @olivembo , Thanks for this. Since you referenced communication, could you create an adoption PR in communication or logging showing how we can consume it? A real data example would help with verification and allow everyone to understand the use case.

Copilot

Pull request overview

This PR extends the repo’s Bazel/llvm-cov coverage tooling so that C/C++ source files reachable from configured coverage roots—but not linked into any executed test binary—are still surfaced in coverage outputs as synthetic 0%-coverage entries (LCOV, HTML, and text summary). It does so by generating a manifest of reachable sources via a Bazel aspect and then augmenting the reporter outputs based on a manifest-vs-LCOV diff.

Changes:

Adds a Bazel aspect + score_instrumented_sources_manifest rule to enumerate transitive C/C++ sources and plumbs an optional instrumented_sources_manifest into score_coverage_reporter.
Implements report augmentation in coverage/reporter.py (LCOV + HTML index/pages + text summary banner) and adds unit tests for the augmentation helpers.
Adds coverage adoption docs/config (coverage/README.md, coverage.bazelrc, filter regex baseline) and a consumer smoke-test workspace under tests/coverage.

Reviewed changes

Copilot reviewed 5 out of 27 changed files in this pull request and generated 1 comment.

Show a summary per file

File	Description
tests/MODULE.bazel	Adds `rules_python` + Python toolchain setup for the test workspace.
tests/coverage/uncovered.h	Adds an intentionally untested header fixture for smoke-testing “0% untested file” surfacing.
tests/coverage/uncovered.cpp	Adds an intentionally untested source fixture for smoke-testing “0% untested file” surfacing.
tests/coverage/reporter_test.py	Adds unit tests for coverage reporter augmentation helpers (LCOV/HTML/text/escaping/path checks).
tests/coverage/coverage_justifications.yaml	Adds a sample justification DB used by the coverage smoke test.
tests/coverage/coverage_filter_regexes.txt	Adds smoke-test consumer-side extra ignore regexes.
tests/coverage/coverable.h	Adds a small API exercised by the coverage smoke test (with an intentionally uncovered branch).
tests/coverage/coverable.cpp	Adds the implementation for the smoke-test library.
tests/coverage/coverable_test.cpp	Adds a test that intentionally leaves one branch uncovered for report validation.
tests/coverage/BUILD.bazel	Wires the smoke test targets, manifest generation, and reporter wrapper.
tests/BUILD.bazel	Exports `MODULE.bazel` so the reporter wrapper can resolve workspace root at runtime.
tests/.bazelrc	Imports coverage bazelrc and adds coverage smoke-test settings for the tests workspace.
README.md	Updates top-level README to document coverage tooling availability and link to coverage guide.
MODULE.bazel	Adds `rules_python`, `rules_shell`, Python toolchain, and pip hub wiring for coverage tooling.
coverage/requirements.in	Adds `pyyaml` as the direct dependency for justification processing.
coverage/requirements_lock.txt	Adds a locked `pyyaml` pin with hashes for Bazel pip integration.
coverage/reporter.py	Implements final report generation + augmentation with synthetic 0%-coverage entries for untested sources.
coverage/README.md	Adds an adoption guide for consumers (setup, configs, optional untested-files manifest).
coverage/merger.py	Adds per-test merger that packages profdata + object file metadata for final reporting.
coverage/justify.py	Adds YAML + in-source marker processing to generate a justification manifest.
coverage/generate_coverage_html.sh	Adds a consumer-facing driver to extract reports, run justification/effective-coverage, and optionally archive artifacts.
coverage/filter_regexes.txt	Adds baseline ignore regexes (tests/mocks/fakes/external/benchmarks).
coverage/effective_coverage.py	Adds HTML post-processing + effective coverage computation incorporating justifications.
coverage/defs.bzl	Adds the manifest aspect/rule and updates `score_coverage_reporter` to pass the manifest into the reporter.
coverage/coverage.bazelrc	Adds generic Bazel coverage flags/config to be imported by consumers.
coverage/BUILD.bazel	Adds Bazel targets for merger/reporter/justify/effective_coverage and the driver script.
.gitignore	Ignores generated `cpp_coverage/` output directories.

💡 Add Copilot custom instructions for smarter, more guided reviews. Learn how to get started.

 - **`clang_tidy/.clang-tidy`** — centralized default check set (conservative baseline, tailorable per module)
 - **`clang_tidy/clang_tidy.bazelrc`** — `--config=clang-tidy` bazelrc config consumers can import
+- **`//coverage:reporter` + `score_coverage_reporter` macro** — llvm-cov source-based
+coverage with a shared baseline of ignore regexes (test/mock/fake/external), pluggable 


…e_reporter The previous implementation used a genrule + heredoc to generate the reporter wrapper shell script. This required three-level escaping (\\$$) to handle Bazel make-variable and bash variable expansion in the same string which makes the code fragile and hard to debug. Replace with a custom Starlark rule (`_score_coverage_reporter_rule`) that uses `ctx.actions.write()` to produce the wrapper script directly. A new `_rlocation_path()` helper computes `Runfiles.Rlocation()`-compatible paths from File objects, eliminating all genrule make-variable gymnastics. The `--workspace_root` argument (previously computed via `readlink -f` in bash) is replaced by `--module_bazel` (a plain rlocation string). The reporter resolves it with `r.Rlocation()` and derives the workspace root in Python. The generated wrapper is now a readable 7-line script with no escaping. `rules_shell` is no longer a dependency of defs.bzl. Addresses review comment: eclipse-score#9 (comment)

olivembo · 2026-06-26T12:41:10Z

Hello @olivembo , Thanks for this. Since you referenced communication, could you create an adoption PR in communication or logging showing how we can consume it? A real data example would help with verification and allow everyone to understand the use case.

--> eclipse-score/communication#604

FScholPer · 2026-06-26T13:18:55Z

Gentleman still wait or proceed with merge?

castler · 2026-06-27T07:07:33Z

Two cents from the guy where this was copied over from.

As I wrote already in the Slack, the proper generation of baseline coverage requires a deeper look. Especially, as rules_cc is missing this support today - and I believe for a proper generation of baseline coverage, we need to look there first. But, this will require quite some time.

Second, I am still in the workings to make the coverage also correct for QNX. First experiments also showed that the coverage for QNX will bring coverage results for rust included. Thus, I am also not sure yet how to best handle this and if this will be at the end an "cpp_policies" only thing, or might not include Rust also.

Third, we still have a problem with constexpr functions, as they appear not covered.

Fourth, my original plan would have been, once the three mentioned points are solved, to contribute this solution to toolchains_llvm, as obviously this can be reused by a way broader community.

Anyhow, it can be decided that this is "good enough" for now, and these problems are tackled in an incremental way.
Also just raising awareness for them.

On a personal note - while the licenses and copyrights that we utilize in this project fully allow a copying of the code to different places without touching the license + copyright, I still think it follows best practices to give at least some credit / blame for the original author. Feels quite bad if a topic where quite some time was invested is just copied over. I would have loved to see at least a clean commit with the original state and author, and then the improvements, refactorings on top of it. (I do not require to change this, but I would like to raise awareness).

olivembo · 2026-06-29T06:08:50Z

Gentleman still wait or proceed with merge?

It's not approved yet

olivembo · 2026-06-29T11:43:24Z

Two cents from the guy where this was copied over from.

As I wrote already in the Slack, the proper generation of baseline coverage requires a deeper look. Especially, as rules_cc is missing this support today - and I believe for a proper generation of baseline coverage, we need to look there first. But, this will require quite some time.

Second, I am still in the workings to make the coverage also correct for QNX. First experiments also showed that the coverage for QNX will bring coverage results for rust included. Thus, I am also not sure yet how to best handle this and if this will be at the end an "cpp_policies" only thing, or might not include Rust also.

Third, we still have a problem with constexpr functions, as they appear not covered.

Fourth, my original plan would have been, once the three mentioned points are solved, to contribute this solution to toolchains_llvm, as obviously this can be reused by a way broader community.

Anyhow, it can be decided that this is "good enough" for now, and these problems are tackled in an incremental way. Also just raising awareness for them.

On a personal note - while the licenses and copyrights that we utilize in this project fully allow a copying of the code to different places without touching the license + copyright, I still think it follows best practices to give at least some credit / blame for the original author. Feels quite bad if a topic where quite some time was invested is just copied over. I would have loved to see at least a clean commit with the original state and author, and then the improvements, refactorings on top of it. (I do not require to change this, but I would like to raise awareness).

Hi Jan, thanks for the detailed write-up. This is genuinely useful context.

On the technical points:

Baseline coverage / rules_cc: Agreed. The heuristic line counts in this PR are explicitly labeled as estimates precisely because we know proper baseline generation needs upstream work in rules_cc first. That's not something we tried to paper over.
QNX + Rust interleaving: That's not on our radar yet. Good to know it's an active problem. If the solution ends up language-agnostic, landing it there rather than in cpp_policies alone makes sense.
constexpr: Known and annoying. Nothing to add here.

On attribution: you're right to call this out, and I'll be direct: It should have been handled better from the start. A commit that establishes the original state with proper authorship, followed by the changes on top, would have been the clean way to do it. That ship has sailed for the commit history, but I've updated the PR description to explicitly credit your PR #549 as the origin of the pipeline. Not a substitute for getting the history right, but at least the record is there.

The plan: Merge this as "good enough for now", and track the three open items as follow-ups. Does that work for you?

olivembo added 6 commits June 23, 2026 08:25

olivembo requested a review from RSingh1511 June 23, 2026 09:52

olivembo mentioned this pull request Jun 23, 2026

use llvm cov for coverage and justification eclipse-score/communication#549

Merged

AlexanderLanin marked this pull request as ready for review June 25, 2026 15:13

AlexanderLanin requested a review from nradakovic June 25, 2026 15:13

AlexanderLanin added this to INF - Infrastructure Community Jun 25, 2026

olivembo closed this Jun 25, 2026

github-project-automation Bot moved this to Done in INF - Infrastructure Community Jun 25, 2026

olivembo reopened this Jun 25, 2026

github-project-automation Bot moved this from Done to In Progress in INF - Infrastructure Community Jun 25, 2026

nradakovic reviewed Jun 25, 2026

View reviewed changes

nradakovic requested a review from Copilot June 26, 2026 08:26

Copilot started reviewing on behalf of nradakovic June 26, 2026 08:27 View session

Copilot AI reviewed Jun 26, 2026

View reviewed changes

nradakovic added enhancement New feature or request wip Work in progress p3 Medium/Low - handle it within normal process code coverage Issue or pull request for code coverage labels Jun 26, 2026

olivembo mentioned this pull request Jun 26, 2026

chore: adopt @score_cpp_policies//coverage in communication eclipse-score/communication#604

Open

Uh oh!

Conversation

olivembo commented Jun 23, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Summary

Approach

Why an aspect + manifest instead of patching --instrumentation_filter?

Why heuristic line counts instead of exact numbers?

Known limitations (tracked for follow-up)

What changed

Consumer usage

Uh oh!

nradakovic left a comment

Choose a reason for hiding this comment

Uh oh!

olivembo commented Jun 25, 2026

Uh oh!

nradakovic commented Jun 25, 2026

Uh oh!

nradakovic commented Jun 25, 2026

Uh oh!

nradakovic left a comment

Choose a reason for hiding this comment

Uh oh!

nradakovic Jun 25, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Uh oh!

olivembo Jun 26, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Uh oh!

nradakovic Jun 25, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Uh oh!

RSingh1511 commented Jun 26, 2026

Uh oh!

Copilot AI left a comment

Choose a reason for hiding this comment

Pull request overview

Reviewed changes

Uh oh!

olivembo commented Jun 26, 2026

Uh oh!

FScholPer commented Jun 26, 2026

Uh oh!

castler commented Jun 27, 2026

Uh oh!

olivembo commented Jun 29, 2026

Uh oh!

olivembo commented Jun 29, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

7 participants

olivembo commented Jun 23, 2026 •

edited

Loading

Why an aspect + manifest instead of patching `--instrumentation_filter`?

nradakovic Jun 25, 2026 •

edited

Loading

olivembo Jun 26, 2026 •

edited

Loading

nradakovic Jun 25, 2026 •

edited

Loading