Tools and Commands

Tools enable agents to interact with web browsers and external services and to orchestrate complex workflows.

Written by Sam B.

| Category | Command | Purpose |
| --- | --- | --- |
| Actions | click | Click a visible page element such as a button, link, field, or icon |
| Actions | clickFill | Click into a field and replace its current value in one step |
| Actions | copy | Copy the provided text to the system clipboard |
| Actions | drag | Drag one visible element and drop it onto another area |
| Actions | fillCredentialsAndSubmit | Fill a login form using stored credentials |
| Actions | getSelectedText | Capture text that is already highlighted on the page for use in a future step |
| Actions | move | Move the mouse pointer to hover over an element without clicking |
| Actions | paste | Paste the current clipboard contents into the focused field |
| Actions | pressKeys | Press one or more keyboard keys or keyboard shortcuts |
| Actions | reportStop | End the current agent and return its final status and output |
| Actions | scroll | Scroll the page or a specific container by a set amount |
| Actions | scrollToPattern | Scroll until text matching a regex pattern is visible |
| Actions | scrollToText | Scroll until specific text is visible |
| Actions | type | Type into the currently focused field character by character |
| Files | downloadFile | Download a file from the current page and save it to Magical storage |
| Files | downloadNextDocumentRequest | Capture the next document-style file response triggered from the page or a URL and save it to Magical storage |
| Files | downloadNextNetworkRequest | Capture the next matching network response triggered from the page and save it to Magical storage as a file |
| Files | listRemoteFiles | List files and folders on the configured remote SFTP server |
| Files | printFocusedFrameToPDF | Turn the currently focused frame into a PDF and save that PDF to Magical storage |
| Files | printFrameToPDF | Turn a specific frame into a PDF and save that PDF to Magical storage |
| Files | printToPDF | Turn the current page into a PDF and save that PDF to Magical storage |
| Files | readDownloadedFile | Read or parse a file that already exists in Magical storage |
| Files | readRemoteFile | Read a text file from the configured remote SFTP server |
| Files | uploadFile | Take a file from Magical storage and upload it into a file input on the current page |
| Files | uploadScreenshot | Capture a screenshot of the current page and save it to Magical storage |
| Files | writeRemoteFile | Write text to a file on the configured remote SFTP server |
| Files | writeRemoteFileFromStorage | Copy a file from Magical storage to a configured remote SFTP server |
| Forms and dropdowns | getDropdownOptions | List the available choices in a dropdown |
| Forms and dropdowns | selectDropdownOptions | Select one or more dropdown options |
| Logic | ifElse | Run one branch or another based on a condition |
| Navigation | listTabs | List open browser tabs and show which one is active |
| Navigation | closeTabs | Close one or more open browser tabs (must follow listTabs) |
| Navigation | switchToTab | Switch to an existing browser tab (must follow listTabs) |
| Navigation | getCurrentUrl | Get the URL of the current page |
| Navigation | navigateBack | Go back to the previous page in browser history |
| Navigation | navigateTo | Open a specific URL in the current tab |
| Navigation | navigateToLinkFromNewTab | Open a link that normally opens in a new tab in the current tab instead |
| Navigation | openNewTab | Open a new browser tab, optionally at a specific URL |
| Navigation | reloadPage | Reload the current page |
| Notes and control | callSpecializedAgent | Run another agent and wait for its output |
| Notes and control | notepad | Save notes or extracted data for later steps in the same agent |
| Notes and control | notify | Send a notification through the configured notification channel |
| Notes and control | queueAgents | Start another agent in the background without waiting for its output |
| Notes and control | requestHumanIntervention | Pause the run and ask a human to step in |
| Notes and control | wait | Pause for a set number of seconds before continuing |
| Special commands | setCheckpointId | Set a unique ID used to group checkpoints |
| Special commands | checkpoint | Record evidence for an important milestone |
| Special commands | getNDCFromMedication | Look up the best matching NDC for a medication using its details |
| Special commands | somFilter | Filter the SOM overlay so only relevant page elements are shown |
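
The catalog notes that closeTabs and switchToTab must follow listTabs. The Python sketch below is one way to check a planned command sequence against that rule before building an agent. It is illustrative only and not part of the product: the function name and the plain list-of-names input are hypothetical, and it assumes "must follow" means listTabs appears somewhere earlier in the same sequence, not necessarily immediately before.

```python
# Commands that the catalog marks as "must follow listTabs".
REQUIRES_LIST_TABS = {"closeTabs", "switchToTab"}


def find_tab_order_violations(commands):
    """Return indexes of tab commands that appear before any listTabs call.

    Assumption: "must follow listTabs" means listTabs has run at some
    earlier point in the same sequence, not necessarily right before.
    """
    violations = []
    tabs_listed = False
    for index, name in enumerate(commands):
        if name == "listTabs":
            tabs_listed = True
        elif name in REQUIRES_LIST_TABS and not tabs_listed:
            violations.append(index)
    return violations


good = ["openNewTab", "listTabs", "switchToTab", "closeTabs"]
bad = ["openNewTab", "switchToTab", "listTabs", "closeTabs"]

print(find_tab_order_violations(good))  # []
print(find_tab_order_violations(bad))   # [1]
```

In the second sequence, switchToTab at position 1 runs before any listTabs call, so it is flagged. The later closeTabs is fine because listTabs has already run by then.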

Available tools and commands by agent type

Trigger agents

  • All commands available, based on environment selections

Orchestrate agents

  • Universal commands:

    • reportStop

    • ifElse

    • custom

    • callSpecializedAgent

    • queueAgents

    • notepad

    • requestHumanIntervention

    • wait

    • notify

    • callAPI

    • checkpoint

    • getNDCFromMedication

    • setCheckpointId

    • listRemoteFiles

    • readDownloadedFile

    • readRemoteFile

    • writeRemoteFileFromStorage

    • writeRemoteFile

Browser agents

  • Universal commands:

    • reportStop

    • ifElse

    • custom

    • callSpecializedAgent

    • queueAgents

    • notepad

    • requestHumanIntervention

    • wait

    • notify

    • callAPI

    • checkpoint

    • getNDCFromMedication

    • setCheckpointId

    • listRemoteFiles

    • readDownloadedFile

    • readRemoteFile

    • writeRemoteFileFromStorage

    • writeRemoteFile

  • Browser-specific commands:

    • click

    • clickFill

    • copy

    • drag

    • getSelectedText

    • move

    • paste

    • pressKeys

    • scroll

    • scrollToText

    • scrollToPattern

    • type

    • fillCredentialsAndSubmit

    • downloadFile

    • downloadNextDocumentRequest

    • downloadNextNetworkRequest

    • printFrameToPDF

    • printFocusedFrameToPDF

    • printToPDF

    • capturePrintOutput

    • uploadFile

    • uploadScreenshot

    • getDropdownOptions

    • selectDropdownOptions

    • getCurrentUrl

    • navigateBack

    • navigateTo

    • navigateToLinkFromNewTab

    • listTabs

    • openNewTab

    • switchToTab

    • closeTabs

    • reloadPage

    • somFilter

Desktop agents

  • Universal commands:

    • reportStop

    • ifElse

    • custom

    • callSpecializedAgent

    • queueAgents

    • notepad

    • requestHumanIntervention

    • wait

    • notify

    • callAPI

    • checkpoint

    • getNDCFromMedication

    • setCheckpointId

    • listRemoteFiles

    • readDownloadedFile

    • readRemoteFile

    • writeRemoteFileFromStorage

    • writeRemoteFile

  • Desktop-specific commands:

    • click

    • clickFill

    • drag

    • move

    • scroll

    • uploadScreenshot
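
The lists above can be read as sets: every Orchestrate, Browser, and Desktop agent gets the universal commands, and Browser and Desktop agents add their own. The Python sketch below models that to answer "is this command available for this agent type?". The command names are copied from the lists above. The dictionary keys and the `is_available` helper are hypothetical names chosen for illustration, and Trigger agents are left out because their commands depend on environment selections.

```python
# Universal commands shared by Orchestrate, Browser, and Desktop agents.
UNIVERSAL = frozenset({
    "reportStop", "ifElse", "custom", "callSpecializedAgent", "queueAgents",
    "notepad", "requestHumanIntervention", "wait", "notify", "callAPI",
    "checkpoint", "getNDCFromMedication", "setCheckpointId",
    "listRemoteFiles", "readDownloadedFile", "readRemoteFile",
    "writeRemoteFileFromStorage", "writeRemoteFile",
})

BROWSER_SPECIFIC = frozenset({
    "click", "clickFill", "copy", "drag", "getSelectedText", "move", "paste",
    "pressKeys", "scroll", "scrollToText", "scrollToPattern", "type",
    "fillCredentialsAndSubmit", "downloadFile", "downloadNextDocumentRequest",
    "downloadNextNetworkRequest", "printFrameToPDF", "printFocusedFrameToPDF",
    "printToPDF", "capturePrintOutput", "uploadFile", "uploadScreenshot",
    "getDropdownOptions", "selectDropdownOptions", "getCurrentUrl",
    "navigateBack", "navigateTo", "navigateToLinkFromNewTab", "listTabs",
    "openNewTab", "switchToTab", "closeTabs", "reloadPage", "somFilter",
})

DESKTOP_SPECIFIC = frozenset({
    "click", "clickFill", "drag", "move", "scroll", "uploadScreenshot",
})

# Hypothetical lookup keyed by agent type (keys are illustrative labels).
COMMANDS_BY_AGENT_TYPE = {
    "orchestrate": UNIVERSAL,
    "browser": UNIVERSAL | BROWSER_SPECIFIC,
    "desktop": UNIVERSAL | DESKTOP_SPECIFIC,
}


def is_available(agent_type, command):
    """Return True if the command is listed for the given agent type."""
    return command in COMMANDS_BY_AGENT_TYPE[agent_type]


print(is_available("desktop", "click"))        # True
print(is_available("desktop", "paste"))        # False
print(is_available("orchestrate", "callAPI"))  # True
```

Modeled this way, every Desktop-specific command is also a Browser-specific command, while commands such as paste and navigateTo are listed for Browser agents only.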

Code agents
Uses code instructions, not the normal command editor flow.

Branch agents
Does not use the normal command catalog. Its condition outcomes are limited to reportStop, callSpecializedAgent, and queueAgents.

API agents
No command-style instructions. Instead, you set up a GET, POST, PUT, PATCH, or DELETE API call.
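
For readers less familiar with HTTP methods, the Python sketch below shows what such a call consists of: a method, a URL, and an optional JSON body. It uses only the Python standard library and builds the request without sending it. It does not show how API agents are configured in the product. The `build_request` helper and the `api.example.com` URL are hypothetical, and the method list is the standard GET, POST, PUT, PATCH, DELETE set.

```python
import json
import urllib.request

# Standard HTTP methods assumed to be selectable for an API call.
ALLOWED_METHODS = {"GET", "POST", "PUT", "PATCH", "DELETE"}


def build_request(method, url, payload=None):
    """Build (but do not send) an HTTP request with an optional JSON body."""
    method = method.upper()
    if method not in ALLOWED_METHODS:
        raise ValueError(f"unsupported HTTP method: {method}")
    data = None
    headers = {}
    if payload is not None:
        # Encode the body as JSON and label it accordingly.
        data = json.dumps(payload).encode("utf-8")
        headers["Content-Type"] = "application/json"
    return urllib.request.Request(url, data=data, headers=headers, method=method)


# Placeholder URL for illustration only.
request = build_request("patch", "https://api.example.com/items/1", {"status": "done"})
print(request.get_method())  # PATCH
print(request.full_url)      # https://api.example.com/items/1
```

As a rough guide, GET reads data, POST creates it, PUT replaces it, PATCH updates part of it, and DELETE removes it.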

File agents
No command-style instructions. This agent requires only input files and a set of outputs that represent the information you'd like to extract from those files.
