Instruct your AI Agent to perform a precise sequence of actions or to achieve a high-level goal on a webpage.
Interact with Page
block is a highly versatile tool for controlling the AI agent’s actions on a webpage. You can use it in two ways: by providing a precise, sequential list of commands, or by defining a single, high-level goal that the agent will attempt to achieve by performing multiple sub-steps automatically.
This makes it essential for everything from simple button clicks to complex tasks like filling out forms or performing a search.
Interact with Page
block when you need your AI Agent to:
type '{{username}}' into 'Username or Email field'
type '{{password}}' into 'Password input'
click on button 'Sign In'
Find car part by part number in the search bar. Part number is {{part_number}}.
click
: Clicks on a target element (click on button 'Log In'
).type
: Types text into a field (type '{{text}}' into 'Search bar'
).goto
: Opens a specific URL (goto 'https://example.com/contact'
).press
: Presses a keyboard key (press 'Enter'
).back
: Goes to the previous page.idle
: Waits for 30 seconds.stop
: Immediately stops the workflow run (stop 'Item not found'
).{{part_number}}
) are crucial for both modes, allowing you to make your interactions dynamic.
1
. For a goal like finding a car part, you must increase this value (e.g., to 5
or higher) to give the agent enough “action credits” to perform all the necessary sub-steps (close pop-ups, click, type, click again).Find car part by part number in the search bar. Part number is {{part_number}}.
5
type '{{username}}' into 'Username or Email field'
type '{{password}}' into 'Password input'
press 'Enter'
idle
goto 'https://app.example.com/settings/profile'
Interact with Page
block gives you a flexible spectrum of control, from micromanaging every action to defining a high-level goal and letting the AI handle the details.