topmark check¶
Purpose: Verify TopMark headers and optionally insert or update them with --apply.
The check command verifies the presence and correctness of TopMark headers in targeted files. It
does not modify files (dry-run) but reports which files would need updates. In this mode summaries
end with - previewed. When run with --apply, files are actually modified and summaries end with
- inserted, - replaced, or other terminal statuses.
Note
The canonical vocabulary used throughout the documentation is defined in Terminology and Canonical Vocabulary.
Quick start¶
# Dry-run: show which files would get a TopMark header or be updated
topmark check src/
# Apply in place
topmark check --apply src/
# Show unified diffs (human output)
topmark check --diff src/
# Summary-only view (CI-friendly)
topmark check --summary src/
# Suppress TEXT rendering and rely on the exit code
topmark check --quiet src/
# Render document-oriented Markdown output
topmark check --output-format markdown src/
# Treat staged configuration-loading validation warnings as errors for this run
topmark check --strict src/
# Read targets from stdin (one path per line) and generate unified diff output
git ls-files | topmark check --files-from - --diff
Input applicability¶
- Dry-run by default; exit code
WOULD_CHANGE (2)when changes would occur. - Preserves the file's original newline style (LF/CRLF/CR).
- Preserves a leading UTF-8 BOM if present.
- Places headers according to file-type policy (shebang and PEP 263 in Python; XML declaration/DOCTYPE in XML/HTML; no insertion inside Markdown fenced code).
- Idempotent: re-running on already-correct files results in no changes.
STDIN modes¶
check supports both list STDIN mode (--files-from -, --include-from -, or --exclude-from -)
and content STDIN mode (- plus --stdin-filename NAME). These modes are mutually exclusive.
With --apply in content mode, transformed content is written to STDOUT and diagnostics are routed
to STDERR.
See shared input modes for the full STDIN contract,
including why TopMark does not provide a --stdin option flag.
Configuration and validation¶
check supports --strict / --no-strict to override the effective strict value for the run.
Before any file processing begins, TopMark performs whole-source TOML schema validation during configuration loading. TOML-source diagnostics (including missing-section INFO diagnostics) are evaluated together with merged-config and runtime applicability diagnostics during staged configuration-loading validation for the run.
Note
[config].strict is a TOML-source-local strictness preference controlling staged
configuration-loading validation for the current TOML source.
Effective strictness is evaluated across:
- TOML-source diagnostics;
- merged-config diagnostics;
- runtime applicability diagnostics.
strict is resolved during TOML loading and does not become a layered configuration field.
TopMark resolves configuration from defaults, user config, the project chain, explicit --config
files, and CLI overrides before staged validation produces the effective runtime configuration. See
Configuration discovery, precedence, and policy for the full
configuration-loading and validation contract.
Filtering and file discovery¶
TopMark determines which files to process using a combination of path-based filters and file-type filters.
Path arguments, include/exclude patterns, --files-from, and file-type filters follow the shared
TopMark filtering pipeline. Positional paths and relative patterns are resolved from the current
working directory; path-based filters run before file-type filters, and exclude rules take
precedence. See Filtering for the full path discovery
contract.
File type filters¶
--include-file-types / -tRestrict processing to the given file type identifiers. May be repeated and/or provided as a comma-separated list.--exclude-file-types / -TExclude the given file type identifiers. May be repeated and/or provided as a comma-separated list.
Exclude rules take precedence over include rules.
TopMark accepts file type identifiers in local form, such as python, or qualified form, such as
topmark:python.
Local identifiers are accepted only when unambiguous. Internally, TopMark normalizes identifiers to canonical qualified file type identities before filtering, runtime resolution, policy evaluation, diagnostics, and registry lookup.
See file-type filtering for the full identifier contract.
Examples:
topmark check --include-file-types python src/
topmark check --include-file-types topmark:python src/
topmark check --exclude-file-types topmark:markdown docs/
Path-based filters¶
--include,--excludeInclude or exclude glob patterns.--include-from,--exclude-fromLoad patterns from files (one per line).--files-fromProvide an explicit list of files to process.
Notes:
- Positional arguments are resolved relative to the current working directory (CWD), Black-style.
- Patterns in
--include,--exclude, and the files passed to--include-from/--exclude-fromare also resolved relative to CWD. Absolute patterns are not supported. - Path-based filters are evaluated before file-type filters.
- Exclude rules win over include rules when both match a path.
- File-type filters are applied after path-based include/exclude filtering.
- File-type filters are normalized to canonical qualified file type identities before filtering, runtime resolution, policy evaluation, diagnostics, and registry lookup.
- Explicit missing literal paths (for example
fubar.py) are reported asFILE_NOT_FOUND (66). - Unmatched glob patterns (for example
missing/**/*.py) are treated as soft discovery diagnostics and do not failcheck.
--reportcontrols the scope of human per-file TEXT rendering only. It does not affect processing, mutation behavior, summaries, machine-readable output, or exit-code selection.
Values:
actionable: show files that would change, changed, failed, or otherwise require attention; hide unsupported entries from the per-file listing while summaries may still count them.noncompliant: show actionable entries plus unsupported entries.all: show every processed result, including unchanged/compliant entries.
Example¶
# Use include/exclude files with relative patterns
printf "*.py\n" > inc.txt
printf "tests/*\n# ignored\n" > exc.txt
topmark check --include-from inc.txt --exclude-from exc.txt --diff
Command-specific policy options¶
The check command supports policy overrides that control how headers are inserted or updated.
See also: Policy guide.
Policy overrides passed to check follow the same resolution semantics as TOML configuration and
API overlays.
Use --header-mutation-mode to control mutation behavior for check:
all(default): insert missing headers and update existing headersadd-only: insert missing headers only; existing headers are not updatedupdate-only: update existing headers only; missing headers are not inserted
This policy affects dry-run reporting, --apply behavior, API result views, and semantic runtime
outcome bucketing. It is a check-only policy; strip removes existing headers and
probe is read-only.
Safety gates still take precedence. Malformed headers, unreadable files, unsupported files, blocked
filesystem states, and other non-mutable conditions are not made mutable by
--header-mutation-mode.
Empty file behavior¶
--allow-header-in-empty-files / --no-allow-header-in-empty-files--empty-insert-mode
These options control how check classifies empty files and whether headers may be inserted.
--empty-insert-mode defines which empty or empty-like files are eligible for insertion:
bytes_empty: only true 0-byte fileslogical_empty: true 0-byte files plus logically empty placeholderswhitespace_empty: any decoded content containing only whitespace or newlines
This policy affects dry-run reporting, --apply behavior, API result views, and semantic runtime
outcome bucketing.
This classification is evaluated together with --allow-header-in-empty-files:
- when disabled (default), empty-like files are treated as unchanged and compliant
- when enabled, eligible empty-like files may receive headers, subject to safety gates
--render-empty-header-when-no-fields is separate and controls whether an otherwise empty header
may be rendered when no fields are configured.
Safety gates still take precedence. Unreadable files, unsupported files, malformed headers, blocked filesystem states, and other non-mutable conditions are not made mutable by these options.
Formatting and safety¶
--allow-reflow / --no-allow-reflow--render-empty-header-when-no-fields / --no-render-empty-header-when-no-fields
These options influence rendering behavior and idempotence.
Shared policy¶
--allow-content-probe / --no-allow-content-probe
Controls whether file-type detection may inspect file contents when needed.
Behavior details¶
- Placement rules (processor-aware):
- Pound-style processors (for example Python, Shell, Ruby, Makefile): after shebang (and optional encoding line), else at top; keep exactly one blank around the block as per policy.
- Slash-style processors (for example C, C++, TypeScript): at top with consistent spacing.
- XML/HTML processors: after XML declaration and DOCTYPE; maintain a single intentional blank; never break the declaration.
- Markdown processor: uses HTML comments for the header; fenced code blocks are ignored for detection.
- Newline/BOM preservation: preserved across all paths (insert/replace). Reader normalizes in memory; updater reattaches BOM and keeps line endings.
- Idempotency: running
topmark checkagain on a file that already has a correct header produces no diff and exit code 0 (unless other files would change).
Output behavior¶
Output format, TEXT verbosity, quiet mode, color output, and shared exit-code behavior are documented in shared options and exit codes.
Shared output controls¶
TEXT verbosity is separate from internal logging:
-v,--verboseincreases TEXT output detail forcheck, such as per-line diagnostics and additional hints.-q,--quietsuppresses TEXT rendering while preserving the command's exit status.- Markdown output is document-oriented and renders diagnostics and hints when present without
requiring
-v. - Machine-readable JSON and NDJSON output ignore TEXT-oriented verbosity and quiet controls.
Notes:
- Summary mode aggregates outcomes and suppresses per-file guidance lines.
- In TEXT rendering, per-line diagnostics are shown with
-vand above. - Primary/headline hint selection is presentation-level guidance and is not part of the stable CLI contract; rely on exit codes and machine-readable output for automation.
- Diffs (
--diff) are always human-readable only and never included in JSON or NDJSON output.
Machine-readable output¶
Use --output-format json or --output-format ndjson to emit output suitable for tooling:
- JSON: a single machine-readable JSON document containing
meta, the effective runtime configuration snapshot,config_diagnostics, and then eitherresults(detail mode) orsummary(summary mode). - NDJSON: one machine-readable NDJSON record per line. Every record includes
kindandmeta, and the payload is stored under a container key that matcheskind.
For the canonical schema, stable kind values, and shared conventions, see:
Note
- Verbosity (
-v/--verbose) affects only TEXT rendering. - Quiet mode (
-q/--quiet) suppresses TEXT rendering for commands that support it. - Markdown and machine-readable output are not affected by TEXT verbosity controls.
Machine-readable output emits resolved file type identities using canonical qualified identity
strings when available. Configuration payloads also emit normalized file type filters and
policy_by_type keys.
Notes:
- Summary mode aggregates outcomes and suppresses per-file guidance lines.
- The
configpayload in JSON and NDJSON is the resolved runtime configuration snapshot after per-source TOML validation, layered configuration merge, staged configuration-loading validation, and CLI override application.
JSON schema (detail mode)¶
When --summary is not set, topmark check emits a single JSON object:
{
"meta": { /* MetaPayload */ },
"config": { /* RuntimeConfigPayload */ },
"config_diagnostics": { /* ConfigDiagnosticsPayload */ },
"results": [
{ /* per-file result payload */ }
]
}
The per-file result payload mirrors strip but reflects the check intent (e.g.
outcome.check.* fields instead of outcome.strip.*).
JSON schema (summary mode)¶
In summary mode (--summary), results is omitted and replaced by a flat summary list of rows:
{
"meta": { /* MetaPayload */ },
"config": { /* RuntimeConfigPayload */ },
"config_diagnostics": { /* ConfigDiagnosticsPayload */ },
"summary": [
{ "outcome": "unchanged", "reason": "up-to-date", "count": 30 },
{ "outcome": "would insert", "reason": "header missing, changes found", "count": 1 }
]
}
NDJSON schema (detail vs summary)¶
NDJSON is a stream with a stable prefix followed by either per-file result records (detail mode)
or per-bucket summary records (summary mode):
- Prefix records:
kind="config"(effective runtime configuration snapshot)kind="config_diagnostics"(counts-only)- zero or more
kind="diagnostic"records (each withdomain="config"; these may originate from TOML-source, merged-config, or runtime applicability diagnostics) - Then:
- detail mode (no
--summary): onekind="result"record per file - summary mode (
--summary): onekind="summary"record per(outcome, reason)bucket
Example (summary mode):
{"kind":"config","meta":{...},"config":{...}}
{"kind":"config_diagnostics","meta":{...},"config_diagnostics":{"diagnostic_counts":{"info":0,"warning":0,"error":0}}}
{"kind":"summary","meta":{...},"summary":{"outcome":"would insert","reason":"header missing, changes found","count":1}}
Command-specific options¶
| Option | Description |
|---|---|
--apply |
Write changes to files (off by default). |
--diff |
Show unified diffs (human output only). |
--summary |
Show outcome counts instead of per-file details. |
-q, --quiet |
Suppress TEXT rendering while preserving the command's exit status. |
--files-from |
Read newline-delimited paths from file (use '-' for STDIN). |
- (PATH) |
Read one virtual file from STDIN content (requires --stdin-filename). |
--include |
Add paths by glob (can be used multiple times). |
--include-from |
File of patterns to include (one per line, # comments allowed). |
--exclude |
Exclude paths by glob (can be used multiple times). |
--exclude-from |
File of patterns to exclude. |
--include-file-types / -t |
Restrict to local or qualified TopMark file type identifiers. |
--exclude-file-types / -T |
Exclude local or qualified TopMark file type identifiers. |
--report |
Control reporting scope: actionable, noncompliant, or all. |
--header-mutation-mode |
Check-only policy override: all, add-only, or update-only. |
--empty-insert-mode |
Check-only policy override controlling empty-file classification. |
--strict / --no-strict |
Override effective configuration-loading validation strictness for this run. |
--stdin-filename |
Assumed filename when PATH is '-' (content from STDIN). |
Run
topmark check -hfor the full list of options and help text.
Exit codes¶
topmark check uses exit code WOULD_CHANGE (2) as a stable dry-run signal when changes would be
needed. Successful clean runs and successful --apply runs exit with SUCCESS (0).
Common check exit codes:
| Scenario | Exit code |
|---|---|
| Clean run / successful apply | SUCCESS (0) |
| Dry-run would add or update | WOULD_CHANGE (2) |
| Missing explicit input path | FILE_NOT_FOUND (66) |
| Write/apply failure | IO_ERROR (74) |
| Permission failure | PERMISSION_DENIED (77) |
| Configuration error | CONFIG_ERROR (78) |
| Invalid CLI usage | USAGE_ERROR (64) |
Notes:
- Explicit missing literal paths are hard input errors and produce
FILE_NOT_FOUND (66). - Unmatched glob patterns are soft discovery diagnostics and do not fail
check. - In mixed-result runs, hard input and filesystem errors take precedence over
WOULD_CHANGE (2).
See Exit codes for the complete CLI-wide exit-code contract.
Typical workflows¶
1) Add headers to a project¶
2) Review a change set¶
3) CI: summarize and fail when changes are needed¶
4) Run with strict config checking¶
# Fail when staged configuration-loading validation warnings are present
# (for example TOML-source, merged-config, or runtime applicability warnings)
topmark check --strict src/
Pre-commit integration¶
topmark check is the command used by the non-destructive topmark-check pre-commit hook.
The hook runs topmark check against files selected by pre-commit and follows the same resolution,
filtering, policy, configuration, output, and exit-code behavior documented on this page.
For general pre-commit integration guidance, CI workflows, and repository hook configuration, see Pre-commit integration.
Related commands¶
topmark strip- remove detected TopMark headers instead of inserting or updating them.topmark probe- explain file-type and processor resolution.topmark config check- validate the effective runtime configuration and report diagnostics.topmark config dump- inspect the effective runtime configuration, including normalized file type identifiers.
Related docs¶
- Command overview
- Configuration
- Filtering
- Policies
- Shared options
- Exit codes
- Header placement rules
- Machine-readable output
- Machine-readable format conventions
- Pre-commit integration
- Terminology and Canonical Vocabulary
Troubleshooting¶
- No files to process: Ensure you passed positional paths, or selected the correct STDIN mode
(
--files-from -for list mode, or-with--stdin-filenamefor content mode). Use-vvfor detailed TEXT rendering; use logging options for internal debug logs. - Patterns do not match: Remember that include/exclude patterns are relative to CWD.
cdinto the project root before running. - File type filter does not match: use
topmark probeto inspect resolution decisions, and prefer qualified identifiers such astopmark:pythonwhen local identifiers may be ambiguous. - Missing file error: A literal path such as
fubar.pyis treated as an explicit input and fails withFILE_NOT_FOUND (66)when it does not exist. Use a glob pattern when an empty match set should be non-fatal. - Unexpected placement: For pound/slash formats, check for leading banners or shebang/encoding lines. For XML/HTML, verify declaration/doctype positions.