ParsersParser Settings

Parsing Configuration

Configure which pages and file types to parse

Parsing Configuration

In Parser Settings → Parsing Configuration, you can control which pages and which file types DigiParser processes from your documents.

Where to find it

  1. Open your parser and go to Settings → Parser Settings.
  2. Scroll to the Parsing Configuration section.

Pages to be parsed

By default, DigiParser processes all pages in a document. You can limit which pages are extracted (e.g. only odd or even pages, or specific page ranges). This applies to PDFs only.

To set: In the Pages to be parsed section, turn the toggle on and choose Only odd pages, Only even pages, or Page ranges and enter a custom range (e.g. 1,2,8 or 4,7,12-16 or 2n-1). Then click Save Parser.

See Pages to be parsed for full details and examples.

Document Types

By default, DigiParser processes all supported file types. You can restrict it to specific types.

Supported types:

  • PDF
  • Images: PNG, JPEG/JPG
  • Office: Word (.docx), Excel (.xlsx), PowerPoint (.pptx)
  • Text: Markdown, plain text, CSV, HTML, JSON

When to use:

  • PDFs only: If you only process PDFs and want to ignore images or Office files
  • Specific formats: If you only want to process certain file types (e.g. only PDFs and images)

To set: Use the Document Types dropdown to select which file types to process, then click Save Parser.

Calculate confidence scores

If enabled, DigiParser shows how confident it is in each extracted value (e.g. as a percentage or color). This helps you spot values that might be wrong.

Cost: No additional credits beyond standard processing. Confidence scores may increase compute time slightly but do not change credit usage.

When to use:

  • When you want to prioritize review on low-confidence documents
  • When you need to spot potential errors before export

To enable: Turn on Calculate confidence scores for extracted fields and click Save Parser.

Enable advanced table data extraction

This setting helps DigiParser read larger or more complex tables more accurately.

When to use:

  • Your documents include long line-item tables (for example 100+ rows)
  • Table rows or columns are sometimes missed or misaligned
  • You process documents with mixed table layouts from different vendors

Options:

  • Auto: DigiParser decides when advanced table extraction is needed
  • Enabled: Always use advanced table extraction
  • Disabled: Turn advanced table extraction off

To set: In Enable advanced table data extraction, choose Auto, Enabled, or Disabled, then click Save Parser.

See Advanced table extraction for detailed guidance and best practices.

Data extraction mode

This controls the speed vs quality balance when extracting data.

Options:

  • Fast: Prioritizes speed
  • Accurate: High accuracy with balanced speed
  • Critical: Maximum accuracy for important workflows

Credit notes:

  • Fast and Accurate use standard credits
  • Critical uses more credits per page

To set: In Data extraction mode, choose Fast, Accurate, or Critical, then click Save Parser.

See Data extraction modes for mode selection guidance and detailed examples.

Enable markdown parsing

When enabled, documents are converted to markdown so you can view and download them from the Markdown tab in the document viewer.

Cost: Uses 1 credit per page (increases processing costs).

When to use:

  • When you want to view documents as markdown text
  • When you need to download markdown versions of documents

To enable: Turn on Enable markdown parsing and click Save Parser.

Tips

  • Start with defaults: Use default settings (all pages, all file types) unless you have a specific need
  • Test page ranges: If limiting pages, test on a few documents first to make sure you're not missing data
  • Advanced table extraction: Start with Auto and switch to Enabled only if complex tables need extra help
  • Data extraction mode: Start with Accurate, then adjust to Fast or Critical by business need
  • Confidence scores: Enable when you need better visibility into extraction quality (no extra credits)
  • Markdown parsing: Only enable if you need markdown view/download—it increases costs

Next steps

How is this guide?

On this page