Docs/Tools/Built-in Tools
Reference

Tools

Your agent comes with 9 built-in tools for terminal access, file operations, web browsing, and HTTP requests. Skills add 100+ more integration tools.

💡

Tools are real actions

When your agent uses a tool, it executes a real action on its dedicated server — running a shell command, writing a file, or making an HTTP request. These are not simulated responses.

Built-in Tools#

These 9 tools are always available on every agent, regardless of plan or skill configuration. They run directly on your agent's dedicated VM.

Terminal#

Execute shell commands on your agent's Linux server. Full root access with support for any CLI tool installed on the VM.

ParameterTypeDescription
commandstring (required)The shell command to execute
timeoutintegerMax execution time in seconds (default: 30, max: 120)
working_directorystringDirectory to run the command in (default: /root)

File Read#

Read the contents of any file on the server. Supports text files with a default limit of 50KB.

ParameterTypeDescription
file_pathstring (required)Absolute path to the file
max_bytesintegerMaximum bytes to read (default: 50000)

File Write#

Write or append content to a file. Automatically creates intermediate directories if they don't exist.

ParameterTypeDescription
file_pathstring (required)Absolute path to the file
contentstring (required)Content to write
appendbooleanAppend to file instead of overwriting (default: false)

File List#

List the contents of a directory. Supports recursive listing with a default limit of 200 entries.

ParameterTypeDescription
directory_pathstring (required)Absolute path to the directory
recursivebooleanList recursively (default: false)
max_entriesintegerMaximum entries to return (default: 200)

Web Browse#

Browse a webpage using a headless Chromium browser with full JavaScript rendering. Extracts text content and links from the page.

ParameterTypeDescription
urlstring (required)The URL to browse
extract_linksbooleanAlso extract all links from the page (default: false)

Web Scrape#

Extract structured data from a webpage using CSS selectors. Useful for pulling specific elements like prices, titles, or tables.

ParameterTypeDescription
urlstring (required)The URL to scrape
selectorstring (required)CSS selector to target elements
attributestringElement attribute to extract (default: text content)

Web Screenshot#

Take a screenshot of a webpage. Captures the full page or just the visible viewport and saves it to the workspace.

ParameterTypeDescription
urlstring (required)The URL to screenshot
full_pagebooleanCapture full page or just viewport (default: true)
output_pathstringWhere to save the screenshot

Web Click & Extract#

Perform multi-step interactions on a webpage — click buttons, type into fields, select options, scroll, and wait for elements. Useful for interacting with dynamic web apps.

ParameterTypeDescription
urlstring (required)The URL to interact with
actionsarray (required)List of actions: click, type, select, wait, scroll
extract_afterstringCSS selector to extract content after actions complete

HTTP Request#

Make arbitrary HTTP requests to external APIs. Supports all methods, custom headers, and request bodies.

ParameterTypeDescription
urlstring (required)The URL to request (must be external, not localhost)
methodstringHTTP method: GET, POST, PUT, DELETE, PATCH (default: GET)
headersobjectCustom request headers
bodystringRequest body (for POST/PUT/PATCH)

Skill-Integrated Tools#

When you enable a skill and provide API credentials, your agent gets additional tools specific to that integration. These are real API-backed tools — your agent makes actual calls to Discord, GitHub, Gmail, and other services.

SkillTools Provided
Discordsend_message, read_messages, list_channels, list_guilds
Slacksend_message, read_messages, list_channels
Telegramsend_message, read_updates, set_webhook
Twitter/Xread_timeline, post_tweet, search, get_mentions
WhatsAppsend_message, read_messages

100+ integration tools

The full list includes 100+ tools across social media, developer platforms, databases, CRMs, email providers, search engines, AI models, analytics, finance, and more. Enable any skill from the Skills panel to unlock its tools.

Tool Execution#

When your agent decides to use a tool, here's what happens:

  1. The AI model returns a tool call with the tool name and parameters
  2. The bridge executes the tool on the agent's VM
  3. The result is sent back to the AI model as tool output
  4. The model uses the result to formulate its response
  5. This loop can repeat up to 25 times per message for complex multi-step tasks

Execution Constraints#

ConstraintValue
Max tool rounds per message25
Terminal command timeout120 seconds
File read limit50 KB (default)
Directory listing limit200 entries (default)
Tool result truncation20 KB max per result

TOOLS.md Configuration#

The TOOLS.md config file lets you customize how your agent uses its tools. Use it to set tool priorities, define error handling behavior, and document interaction patterns.

TOOLS.md (example)
# TOOLS — Tool Usage Patterns

## Usage Priority
1. Check memory for past context and solutions
2. Search available knowledge bases and documentation
3. Use terminal for system operations and diagnostics
4. Use HTTP requests for external API calls
5. Use web browsing for research and data gathering

## Error Handling
1. Tool fails -> try alternative approach
2. Data not found -> ask for clarification
3. Permission denied -> explain what's needed
4. System down -> set expectations and follow up

## Best Practices
- Prefer file operations over terminal for reading/writing files
- Use web_browse for dynamic pages, http_request for APIs
- Always validate terminal command output before acting on it
- Save important results to workspace files for later reference

Multi-Provider Support#

Tools work identically across all AI providers. Whether your agent runs on Claude, GPT-4o, Gemini, Grok, or any other supported model, the same tools are available with the same capabilities. The bridge automatically converts tool schemas to each provider's native format.

ProviderTool Format
Anthropic (Claude)Native tool_use format with multi-turn support
OpenAI (GPT)Function-calling format with tool_choice
Google (Gemini)OpenAI-compatible function calling
xAI (Grok)OpenAI-compatible function calling
💡

Switch models anytime

Changing your AI model doesn't affect tool availability. All tools work the same regardless of which provider powers your agent. See Supported Models.