Scan Documents

Overview

Task Operations represent asynchronous jobs that you can initiate through the Scan Documents API to process or transform a File.

When you request an operation (like extracting text from an image or merging PDF documents), the API creates a Task object to track its progress. You can then query the status of this task using its unique ID.

Example Task Object

Here's what a typical File object representing a PNG image might look like:

{
  "id": "task_euyrvozb9302uwhq",
  "operation": "extract-text",
  "status": "completed",
  "parameters": {
    "input": "file_abc123xyz",
    "format": "markdown"
  },
  "result": {
    "format": "markdown",
    "content": "**This** is the *extracted* text content"
  },
  "callback_url": "https://example.com/webhook",
  "created_at": "2021-05-03T10:00:00Z",
  "updated_at": "2021-05-03T10:05:00Z"
}

Now, let's break down the properties of this Task object.

Properties

Every Task object shares a common structure, regardless of the specific operation being performed:

string

A unique identifier for the task (e.g., task_euyrvozb9302uwhq). You use this ID to check the task's status.

operation

string

A string indicating the type of operation requested (e.g., extract-text, convert, merge).

status

string

The current state of the task. See Task Statuses below.

parameters

object

An object containing the specific inputs you provided when creating the task (e.g., the input file ID, target format, quality settings). The structure varies depending on the operation.

result

object

An object containing the outcome of the task. Its structure depends on the status and operation.

If status is pending or processing, this object is usually empty.
If status is completed, this object contains the successful output (e.g., extracted text content, list of generated file IDs).
If status is failed, this object contains error details (error message and details).

callback_urloptional

string

An optional URL where the API will send a webhook notification when the task is completed. If not provided, you must manually check the task status OR use the webhooks to listen for task completion events.

This is useful for integrating task completion notifications into your application without polling the API. However, for API users is recommended to use the webhooks for better performance and reliability.

created_at

string

The date and time when the task was created, in ISO format (e.g., 2021-05-03T10:00:00Z).

updated_at

string

The date and time when the task's status was last updated, in ISO format (e.g., 2021-05-03T10:05:00Z).

Task Statuses

A task can be in one of the following states:

pending: The task has been accepted but has not yet started processing.
processing: The task is currently being executed.
completed: The task finished successfully. The result object contains the output.
failed: The task could not be completed due to an error. The result object contains details about the failure.

Available Operations

Tasks are initiated by making POST requests to specific endpoints under /v1/image-operations/ or /v1/pdf-operations/.

Image Operations

These operations work on image files (image/png, image/jpeg, image/webp).

Convert

Converts an image file to a different format (PNG, JPEG, WebP).

Detect Documents

Detects the boundaries of documents within an image.

Extract Text

Extracts text content from an image using OCR.

Warp

Applies perspective correction to an image based on four corner points.

Apply Effect

Applies a predefined visual effect to an image.

PDF Operations

These operations work on PDF files (application/pdf).

Render

Converts specific pages of a PDF document into image files (PNG).

Split

Splits a multi-page PDF into multiple single-page PDF files.

Merge

Combines multiple PDF files into a single PDF document.

Extract Pages

Creates a new PDF file containing only specified pages from a source PDF.

Error Handling

If a task encounters an issue, its status will change to failed. The result object will then contain:

error

string

A string describing the error.

details

object

An object containing additional context or specifics about the error, if available.

Common reasons for failure include providing an invalid file ID, using incorrect parameters (e.g., invalid page range, unsupported format), or internal processing errors.

Example

Here is an example of a failed task object:

{
  "id": "task_euyrvozb9302uwhq",
  "operation": "extract-text",
  "status": "failed",
  "parameters": {
    "input": "file_abc123xyz",
    "format": "markdown"
  },
  "result": {
    "error": "Source file not found.",
    "details": {
        "file_id": "file_abc123xyz",
        "reason": "The file might have been deleted."
    }
  },
  "created_at": "2021-05-03T10:00:00Z",
  "updated_at": "2021-05-03T10:05:00Z"
}

Waiting for Task Completion

Operations are asynchronous, meaning they may take time to complete.

You can check the status of a task by making a GET request to the task's endpoint, wait for the callback URL to be invoked if you provided one, or listen for a webhook notification for the event task.completed to be triggered when the task is completed.

Overview

Example Task Object

Properties

Task Statuses

Available Operations

Image Operations

Convert

Detect Documents

Extract Text

Warp

Apply Effect

PDF Operations

Render

Split

Merge

Extract Pages

Error Handling

Example

Waiting for Task Completion

On this page