A document is the source of content that Doc Reviewer analyzes. You load a document by uploading a file or providing a URL, and Doc Reviewer automatically parses it into sections, classifies each section, and presents the result as a tree you can browse and evaluate. Every document belongs to a project, and multiple documents in the same project share the project’s product context during evaluation.Documentation Index
Fetch the complete documentation index at: https://www.doc-reviewer.site/llms.txt
Use this file to discover all available pages before exploring further.
Supported formats
Bold text is preserved so the LLM can recognize UI element names and distinguish them from surrounding prose.
DOCX
Bold formatting is preserved the same way as in PDF, keeping UI element names readable for the LLM.
Markdown (.md)
Parsed natively. Heading structure is used directly to build the section tree.
Plain text (.txt)
Parsed as plain text. Sections are split based on heading-like patterns.
Web pages
In addition to files, you can load content directly from a URL. Doc Reviewer uses a headless Chromium browser (Playwright) to fetch the page, which means JavaScript-rendered sites and single-page applications work correctly. To add a web page, select the By URL tab on the evaluation screen, paste the URL, and click Load. After the first page loads, you can add more pages from the same documentation site using the + Add page button — all pages are then treated as a single document for evaluation.The web parser is optimized for Positive Technologies web help, which uses custom
<instruction>, <action>, and <task> tags. Each <instruction> block becomes a separate section in the document tree. For other sites, Doc Reviewer falls back to a generic HTML-to-Markdown conversion that works for most pages but may produce lower-quality results on complex layouts.What happens when a document is uploaded
When you upload a file or load a URL, Doc Reviewer processes it in three steps:Parsing
The file is read and split into sections based on headings. Each section gets a title, body content, heading level, and a path that reflects its position in the document hierarchy.
Instruction detection
Each section is analyzed by the instruction detector, which checks three signals: whether the title is phrased as a deverbal noun naming a task (such as “Connection setup” or “Adding a user”), whether the body contains a purpose phrase starting with “To [verb]:”, and whether the body contains a numbered list of steps. Sections that match all three signals are classified as
instruction; those that match one or two signals are classified as possible; the rest are classified as non-instruction.Section classifications
Every section in a document receives one of three classifications:instruction
instruction
The section matched all three detection signals: deverbal noun in the title, a purpose phrase in the body, and a numbered list of steps. Doc Reviewer treats these sections as confirmed instructions and includes them in evaluation by default.
possible
possible
The section matched one or two detection signals but not all three. It may be an instruction that is missing a standard element, or it may be a different type of content. Doc Reviewer includes
possible sections in evaluation alongside confirmed instructions, so you can review their results and decide whether to keep them.non-instruction
non-instruction
The section matched no detection signals. These are typically introductory text, overview pages, glossaries, reference tables, and similar non-procedural content.
non-instruction sections are never sent to the LLM for evaluation. They are used instead as source material for generating the project’s product context.The document tree
When you open a document, Doc Reviewer shows its full section structure as a tree. The tree reflects the heading hierarchy of the original file — top-level headings at the root, subsections nested underneath. Each node in the tree shows:- The section title
- Its classification (
instruction,possible, ornon-instruction) - Whether it is included or excluded from evaluation
- Its evaluation result color (once evaluated)