Parsing Configuration

In Parser Settings → Parsing Configuration, you can control which pages and which file types DigiParser processes from your documents.

Where to find it

Open your parser and go to Settings → Parser Settings.
Scroll to the Parsing Configuration section.

Pages to be parsed

By default, DigiParser processes all pages in a document. You can limit which pages are extracted (e.g. only odd or even pages, or specific page ranges). This applies to PDFs only.

To set: In the Pages to be parsed section, turn the toggle on and choose Only odd pages, Only even pages, or Page ranges and enter a custom range (e.g. 1,2,8 or 4,7,12-16 or 2n-1). Then click Save Parser.

See Pages to be parsed for full details and examples.

Document Types

By default, DigiParser processes all supported file types. You can restrict it to specific types.

Supported types:

PDF
Images: PNG, JPEG/JPG
Office: Word (.docx), Excel (.xlsx), PowerPoint (.pptx)
Text: Markdown, plain text, CSV, HTML, JSON

When to use:

PDFs only: If you only process PDFs and want to ignore images or Office files
Specific formats: If you only want to process certain file types (e.g. only PDFs and images)

To set: Use the Document Types dropdown to select which file types to process, then click Save Parser.

Calculate confidence scores

If enabled, DigiParser shows how confident it is in each extracted value (e.g. as a percentage or color). This helps you spot values that might be wrong.

Cost: No additional credits beyond standard processing. Confidence scores may increase compute time slightly but do not change credit usage.

When to use:

When you want to prioritize review on low-confidence documents
When you need to spot potential errors before export

To enable: Turn on Calculate confidence scores for extracted fields and click Save Parser.

Enable advanced table data extraction

This setting helps DigiParser read larger or more complex tables more accurately.

When to use:

Your documents include long line-item tables (for example 100+ rows)
Table rows or columns are sometimes missed or misaligned
You process documents with mixed table layouts from different vendors

Options:

Auto: DigiParser decides when advanced table extraction is needed
Enabled: Always use advanced table extraction
Disabled: Turn advanced table extraction off

To set: In Enable advanced table data extraction, choose Auto, Enabled, or Disabled, then click Save Parser.

See Advanced table extraction for detailed guidance and best practices.

Data extraction mode

This controls the speed vs quality balance when extracting data.

Options:

Fast: Prioritizes speed
Accurate: High accuracy with balanced speed
Critical: Maximum accuracy for important workflows

Credit notes:

Fast and Accurate use standard credits
Critical uses more credits per page

To set: In Data extraction mode, choose Fast, Accurate, or Critical, then click Save Parser.

See Data extraction modes for mode selection guidance and detailed examples.

Enable markdown parsing

When enabled, documents are converted to markdown so you can view and download them from the Markdown tab in the document viewer.

Cost: Uses 1 credit per page (increases processing costs).

When to use:

When you want to view documents as markdown text
When you need to download markdown versions of documents

To enable: Turn on Enable markdown parsing and click Save Parser.

Tips

Start with defaults: Use default settings (all pages, all file types) unless you have a specific need
Test page ranges: If limiting pages, test on a few documents first to make sure you're not missing data
Advanced table extraction: Start with Auto and switch to Enabled only if complex tables need extra help
Data extraction mode: Start with Accurate, then adjust to Fast or Critical by business need
Confidence scores: Enable when you need better visibility into extraction quality (no extra credits)
Markdown parsing: Only enable if you need markdown view/download—it increases costs

Next steps

General Settings – Parser name and description
Advanced table extraction – Improve extraction for complex tables
Data extraction modes – Choose speed vs precision
Pages to be parsed – Limit which PDF pages are extracted
Email Processing – Email processing options
Split Documents – PDF splitting options

Parsing Configuration

On this page