Tool Integration Guide
Learn how to integrate external tools and services with your Agent5ive agents.
Composio tools can now auto-authenticate directly through Agent5ive! When you add a Composio MCP server, a popup will appear on your screen with the needed API keys and OAuth flows.
How it works:
- Add a Composio MCP server during agent creation or editing
- A popup window will automatically appear for authentication
- Complete the OAuth flow or provide API keys as prompted
- Close the authentication window when finished
- The advanced MCP server will be added to your agent automatically
If auto-authentication fails: You'll receive a guide to manually set up the integration using the traditional Composio dashboard method described below.
Understanding Tool Integration
Tool integration allows your agents to perform actions or retrieve information beyond their base language model capabilities. Agent5ive uses Composio to integrate external tools with your agents, enabling them to interact with APIs, search the web, process data, and more.
How Tool Integration Works
- Tool Selection - During agent creation, you select tools that match your agent's purpose
- Configuration - The platform configures the agent to use these tools appropriately
- Runtime Integration - When interacting with users, the agent determines when to use tools
- Tool Execution - The agent calls the appropriate tool with the necessary parameters
- Result Processing - The agent processes the tool's output and incorporates it into its response
Tool Approval Policy System
The Tool Approval Policy System gives you fine-grained control over how tools are executed in your agent deployments. You can configure whether tools require user confirmation before execution, are automatically approved, or are disabled entirely.
How Tool Approval Works
When configuring your agent's tools, you can set approval policies at two levels: a default rule for all tools, and custom rules for individual tools. This gives you flexibility to balance automation with user control.
All tools execute immediately without user confirmation. Best for trusted environments and streamlined user experiences.
Users must confirm each tool execution. Provides maximum transparency and control over agent actions.
Set individual approval policies for each tool. Allows you to require approval for sensitive operations while auto-approving routine tasks.
Configuring Tool Approval Policies

Example showing custom approval rules for standard tools: Google Maps Search set to auto-approve, Image Search requiring approval, and Web Browsing disabled.
Configuration Steps:
- Select Tool Approval Mode - Choose from "Automatically approve every tool", "Require approval for every tool", or "Custom rules"
- Set Default Rule - When using custom rules, set a default behavior for tools without explicit overrides
- Configure Individual Tools - For each tool, choose:
- Auto (use without confirmation) - Tool executes immediately
- Require approval - User must confirm before tool executes
- Disable - Tool is not available for this deployment

Example showing custom approval rules for MCP tools: TODOIST_CREATE_TASK and TODOIST_GET_ALL_COMMENTS require approval, TODOIST_DELETE_TASK is disabled, while TODOIST_GET_ALL_LABELS and TODOIST_GET_ALL_PROJECTS auto-approve.
Best Practice for MCP Tools:
Consider requiring approval for destructive operations (like DELETE or UPDATE) while auto-approving read-only operations (like GET or SEARCH). This balances user control with convenience.
Use Cases for Tool Approval Policies
- Read-only operations - Web searches, data retrieval, information lookups
- Trusted environments - Internal company tools with limited risk
- Streamlined UX - Consumer-facing agents where speed matters
- Low-risk tools - Maps searches, weather lookups, general information
- Destructive operations - Deleting data, modifying records, sending emails
- Financial transactions - Payment processing, fund transfers, purchases
- Sensitive data access - Personal information, confidential documents
- External communications - Sending messages, posting content, API calls
- Tool approval policies apply to all interactive deployment types: SDK/IFRAME/REACT, embedded widgets, MCP integrations, voice conversations, WhatsApp, and Telegram
- On WhatsApp and Telegram, approval-required tools prompt users via interactive messages or verbal consent
- Connecting an agent via MCP, Scheduled tasks and event-driven triggers always auto-approve tools - these run autonomously without user interaction
- Disabled tools are completely unavailable to the agent in that deployment
- Default rules apply to any tools without explicit custom rules
- Consider the user experience when requiring approval for frequently-used tools
Available Tools
Agent5ive provides a comprehensive set of search tools that allow your agents to find and retrieve information from various sources. These tools enhance your agent's ability to provide accurate and up-to-date information in both text and voice conversations.
Available Search Tools:
- Web Browsing - Search the web for information
Enables your agent to find general information across the internet.
- Google Maps Search - Search for locations on Google Maps
Helps your agent find specific locations, businesses, and points of interest.
- Google Maps Directions - Get directions and location information
Allows your agent to provide navigation guidance between locations.
- Image Search - Search for images on the web
Enables your agent to find relevant images for visual references.
- News Search - Search for news articles
Helps your agent stay current with the latest news and developments.
- Finance Search - Search for financial information
Provides access to financial data, stock information, and market trends.
- Scholar Search - Search for scholarly articles
Enables your agent to access academic and research publications.
- Trends Search - Search for trends and insights
Helps your agent identify popular topics and trending subjects.
- Shopping Search - Search for shopping deals
Allows your agent to find products, compare prices, and identify deals.
- Event Search - Search for events and activities
Enables your agent to find upcoming events, concerts, conferences, and activities.
- Amazon Search - Search for products on Amazon
Find and compare products available on Amazon marketplace.
- DuckDuckGo Search - Search the web with DuckDuckGo
Privacy-focused web search for general information retrieval.
- Exa Answer - Get answers with citations using the Exa API
Provides well-researched answers with proper source citations.
- Exa Similar Link - Find similar links with the Exa API
Discover related content and resources based on similarity.
- Flights Search - Search for flights
Find and compare flight options for travel planning.
- Hotels Search - Search for hotels and vacation rentals
Locate accommodation options for travel destinations.
- Tavily Search - Perform a search with the Tavily API
Advanced search capabilities using Tavily's specialized search engine.
- TripAdvisor Search - Search for travel recommendations on TripAdvisor
Find travel recommendations, reviews, and destination information.
- Walmart Search - Search for products on Walmart
Find and compare products available at Walmart.
- HackerNews Search Posts - Search for posts on HackerNews
Find and retrieve discussions, articles, and posts from the HackerNews community.
- Yelp AI Assistant - Search for places and chat on Yelp AI Assistant
Search for local businesses, restaurants, and services with AI-powered recommendations from Yelp.
- Mail Service - Send emails on behalf of the agent
Enables your agent to send emails to users or other recipients. Best practice: Include email template instructions in your system prompt or knowledge base notes to ensure consistent and appropriate email formatting.
Best Practice for Mail Service:
Define email templates and formatting guidelines in your agent's system prompt or knowledge base. For example: "When sending confirmation emails, use a professional tone and include: greeting, confirmation details, next steps, and signature." This ensures your agent sends consistent, well-formatted emails that align with your brand voice.
Example Usage:
A user asks your travel assistant about attractions in Paris. The agent uses the Google Maps Search tool to find popular tourist destinations, then uses the Web Browsing tool to gather information about opening hours and ticket prices.
Agent Persistent Memory enables your agents to remember information across different conversations and threads. This powerful feature is available only for Builder, Pro, and Business tier subscribers, allowing your agents to build a knowledge base over time.
How Agent Persistent Memory Works:
- Write Operation - Store information under specific keys
Requires 'key' and 'value' parameters. Related data is concatenated under a single key.
- Read Operation - Retrieve information stored under a specific key
Requires 'key' parameter to fetch the stored information.
- Read All Operation - Retrieve all stored information (up to 100 entries)
Returns up to 100 entries with no parameters needed.
- Search Operation - Find information based on a query
Searches both keys and values using the 'query' parameter. Optionally takes 'limit' parameter (default: 10, max: 50).
- Memory Scoping - Control the scope of stored information
All operations can optionally take 'threadId' to scope memory to a specific conversation; otherwise, memory is global to the agent deployment.
Example Usage:
A customer support agent can remember user preferences across multiple conversations. When a user mentions they prefer email communication, the agent stores this preference and applies it in future interactions without asking again.
Best Practices:
- Use descriptive key names that clearly indicate the type of information stored
- Structure your data consistently to make retrieval and searching more effective
- Include validation in your system prompt to prevent storing sensitive or inappropriate information
- Consider using thread-specific memory for personal preferences and global memory for general knowledge
- Regularly review stored information to ensure accuracy and relevance
Important Considerations:
- Privacy Impact - Information stored in agent memory persists across conversations and may be accessed in future interactions
- System Prompt Guidance - Your system prompt should clearly define what information the agent should store and how it should use that information
- User Consent - Consider informing users that the agent may remember information from previous conversations
- Data Accuracy - Stored information may become outdated; consider implementing a verification mechanism
- Memory Limitations - There are limits to how much information can be stored and retrieved
Schedule Task allows you to automate your agents to execute specific tasks at regular intervals without any manual intervention. Results can be automatically sent to your email or Telegram, making it perfect for daily reports, monitoring, reminders, and automated data processing.
How Schedule Task Works:
- Talk to Your Agent - Simply ask your agent to schedule a task for you
You don't need to configure anything manually. Just tell the agent what you want and when.
- Specify the Task - Clearly describe what your agent should do
Be specific and detailed in your instructions for best results.
- Set the Schedule - Tell the agent when and how often to run the task
Use natural language like "every morning at 8 AM" or "every Monday at 5 PM."
- Choose Output Destination - Results are sent to email or Telegram
Email is always available. Telegram output requires a configured Telegram bot on the deployment and scheduling through Telegram chat.
- Manage Tasks - View, pause, and delete tasks from the chat interface
Expand "Scheduled Tasks" in the chat to manage all your scheduled tasks.
Example Usage:
A market research agent is scheduled to run every morning at 9 AM to gather the latest news about competitors, analyze trends, and summarize key insights. The results are automatically sent to your email, providing you with a daily briefing without any manual effort.
Common Use Cases:
- Daily reports and summaries (news digests, market updates, analytics reports)
- Regular monitoring and alerts (system checks, price tracking, status updates)
- Automated reminders (task follow-ups, appointment notifications, deadline alerts)
- Data collection and processing (gathering information, analyzing trends, updating records)
- Periodic content generation (social media posts, newsletters, blog updates)
Important Considerations:
- Minimum Interval: Tasks must be scheduled at least 10 minutes apart
- Clear Prompts: Write specific, actionable prompts for consistent results
- Telegram Output: Only available for deployments with a Telegram bot when scheduling through Telegram chat
- Message Limits: Scheduled task executions count toward your daily message limits
Configuration Note:
Image Upload & Vision can be enabled or disabled after agent creation using the "Edit Details" button. Simply toggle the setting and redeploy your agent to apply the changes.
Image vision capabilities now extend beyond the web client to messaging platforms including WhatsApp and Telegram.
When users send images through these platforms, the images are automatically fetched, converted to data URLs, and processed by your vision-enabled agent. Users on non-vision agents will receive a notification that image processing is not supported.
Agent5ive allows you to enable image upload and vision capabilities for your agents. This powerful feature lets users share images with your agent, which can then analyze and respond to the visual content.
How Image Vision Works:
- Enable the Feature - Toggle on "Enable Image Upload & Vision" during creation or in the Edit Details section
This requires a vision-capable model (like Gemini Flash).
- User Uploads - Users can upload images directly in the chat interface
The upload button appears in the chat input area when this feature is enabled.
- Messaging Platform Support - Images sent via WhatsApp and Telegram are also processed
Images are fetched from the messaging platform, converted to data URLs, and sent to your agent for analysis.
- Image Analysis - Your agent can recognize content in images
For example, a travel agent can identify landmarks or destinations from photos.
- Contextual Responses - Your agent responds based on the image content
The agent can provide information, recommendations, or ask follow-up questions related to what it sees.
Example Usage:
A user shares a photo of the Eiffel Tower with your travel agent. The agent recognizes Paris, and asks specific questions about when they're planning to visit, their budget, and interests to help create a personalized Paris itinerary.
Agent5ive allows you to enable document file upload capabilities for your agents. This feature lets users share documents with your agent, which can then analyze and respond to the content within those files.
Configuration Note:
File Upload capabilities can be enabled or disabled after agent creation using the "Edit Details" button. Simply toggle the setting and redeploy your agent to apply the changes.
How File Upload Works:
- Enable the Feature - Toggle on "Enable File Upload (PDF, DOCX, TXT)" during creation or in the Edit Details section
This allows your agent to process various document formats.
- User Uploads - Users can upload document files directly in the chat interface
The upload button appears in the chat input area when this feature is enabled.
- Document Processing - Your agent can extract and analyze content from uploaded files
For example, a research assistant can analyze data from uploaded PDFs or documents.
- Contextual Responses - Your agent responds based on the document content
The agent can provide summaries, answer questions, or perform tasks based on the information in the uploaded files.
Example Usage:
A user shares a research paper PDF with your academic assistant agent. The agent analyzes the paper, extracts key findings, and can answer specific questions about the methodology or conclusions presented in the document.
Configuration Note:
MCP integration can be added or removed after agent creation using the "Edit Details" button. You can add new MCP servers, remove existing ones, or modify MCP configurations and redeploy your agent to apply the changes.
The Advanced Tools (MCP) feature allows you to connect your agents to external Server-Sent Events (SSE/MCP [Http Streamable]) hosted servers, such as those provided by Composio, using the Model Context Protocol (MCP). This enables your agents to access a wide range of third-party services and APIs that aren't available through standard tools.
Available Tutorials:
Follow these step-by-step tutorials to set up specific MCP integrations with your agents.
Getting Started with MCP Servers:
Composio tools now support automatic authentication! When you add a Composio MCP server during agent creation or editing, a popup will automatically handle authentication for you.
If auto-authentication doesn't work, you can still use the manual setup methods below. Alternatively, you can find equivalent tools on Smithery or create custom workflows with n8n.
There are multiple platforms for accessing MCP servers. If the auto-authentication feature doesn't work for Composio, you can use these manual methods:
Composio MCP Dashboard (Manual Setup)
Official platform for enterprise-grade MCP integrations with popular services like Gmail, Google Calendar, and GitHub.
- Visit the Composio MCP Dashboard
- Create an account or sign in
- Browse available integrations
- Create a server for your desired integration
- Configure authentication and permissions
- Copy the MCP Server URL
Smithery.ai
Community platform with hundreds of MCP servers including Exa search, Context7 documentation, Browserbase automation, and more.
- Visit Smithery.ai
- Browse or search for MCP servers
- Select a server that matches your needs
- Click "Get URL with keys instead" (required for most servers)
- Copy the generated URL with embedded keys
- Paste directly into Agent5ive MCP configuration
See our Smithery.ai tutorial for detailed instructions.
For detailed documentation on creating and configuring MCP servers, visit the tutorials page.
Example Usage:
You create a personal assistant agent and connect it to a Google Calendar MCP server from Composio. Users can now ask your agent to "Schedule a meeting with John on Friday at 2 PM" or "What meetings do I have tomorrow?", and the agent will use the Google Calendar API to create events or retrieve schedule information.
Important Considerations:
- MCP servers require proper authentication and authorization to access third-party services
- Users may need to grant permissions for your agent to access their data through the connected services
- Some integrations may have rate limits or usage restrictions imposed by the service provider
- Always test thoroughly before deploying agents with MCP tools in production environments
Configuration Note:
Voice capabilities can be enabled or modified at any time using the "Edit Details" button. After making changes, redeploy your agent for the updates to take effect. Note that voice usage has associated costs based on your subscription tier.
The Real-time Voice Chat feature allows your agents to listen and respond using natural speech, creating a more intuitive and accessible interaction experience. This feature leverages OpenAI's advanced GPT-realtime model to provide high-quality voice interactions with support for MCP tools and agent communication.
How Voice Chat Works:
- Enable the Feature - Expand the "Real-time Voice Chat" section and toggle it on during creation or in the Edit Details section. Voice settings can be modified at any time.
- Select a Voice - Choose from available voice options to personalize your agent's speaking style
- Provide API Key - Enter your OpenAI API key to enable the voice processing capabilities
- Voice Interaction - Users can speak to your agent using their microphone and hear responses in the selected voice
- MCP Tool Support - Voice conversations can use MCP tools for enhanced functionality
- Agent Communication - Voice agents can communicate with other agents as tools
Example Usage:
A user can have a hands-free conversation with your customer support agent while multitasking. They can ask questions verbally and receive spoken responses, making the interaction more natural and accessible.
Important Note:
Agents as Tools configuration can be modified at any time using the "Edit Details" button. After making changes, redeploy your agent for the updates to take effect. Use "Clone this Agent" if you want to clone an agent as a starting point for a new one.
Agent5ive now allows you to use your publicly deployed agents as tools for other agents. This powerful feature enables you to create specialized agents that can work together, with one agent calling on the expertise of others when needed.
How Multi-Agent Integration Works:
- Deploy and Publish Specialized Agents - First, create, deploy, and publish agents with specific expertise
Agents must be published publicly to be available as tools for other agents.
- Select Agents as Tools - When creating a new agent, you'll see a "Select Your Agents as Tools" section
This section allows you to select from your publicly deployed agents to use as tools.
- Multi-Agent Collaboration - Your primary agent can now call on these specialized agents when needed
For example, a fitness blog writer agent might consult a nutrition expert agent for diet recommendations.
- Seamless Integration - The process is transparent to end users, who simply interact with the primary agent
Behind the scenes, the primary agent is delegating specific questions to specialized agents.
Example Usage:
A user asks your fitness coach agent for a complete wellness plan. The fitness coach agent uses a nutrition specialist agent to create the diet portion, a workout planner agent to design the exercise routine, and a meditation guide agent to suggest mindfulness practices, then combines all this information into a comprehensive response.
Multi-Agent in Voice Conversations:
- Multi-agent functionality now works in voice conversations
- Agent calls are automatically approved in voice mode for a seamless experience
- Users won't be prompted to confirm when the primary agent calls other agents during voice interactions
- This ensures smooth, natural conversations without interruptions for approval
Important Notes:
- Only publicly published agents can be used as tools for other agents
- Agents with the "Multi Agent" badge are using at least one other agent as a tool
- In voice conversations, approval-required tools prompt for verbal confirmation
- In text conversations, agent approval behavior follows your configured tool approval policies
Custom tool integration via Composio is planned for a future update of Agent5ive. This feature will allow you to create your own tools that connect to your specific systems, APIs, or data sources.
Important Note:
Custom tool integration is not currently available. The platform currently supports only the pre-built tools listed above. We'll update this documentation when custom tool integration becomes available.
Current Limitations:
- No Local File Access - Agents cannot access files on your computer or local devices
- No Direct Database Connections - Agents cannot directly connect to your databases
- No Custom API Integration - Custom API connections are not currently supported
If you need your agent to have access to specific information, consider using the Knowledge Base feature to upload documents or provide URLs that contain the necessary information.
Agent5ive provides powerful content generation tools that allow your agents to create visual and document content on demand. These tools enable your agents to generate images, videos, and PDF documents based on text descriptions.

Available Content Generation Tools:
- Generate Image - Generate images based on text descriptions
Creates images from text prompts and provides links to the generated images. Perfect for creating visual content, illustrations, or concept art based on user descriptions.
- Generate Video - Generate VEO3 videos based on text descriptions
Creates short videos using Google's VEO3 model based on text prompts. Ideal for creating video content, animations, or visual demonstrations.
- Wait for Video - Wait for VEO3 video generation to complete
Monitors the video generation process and retrieves the completed video. This tool works in conjunction with Generate Video to handle the asynchronous nature of video creation.
- Convert Text to PDF - Convert text content into a PDF document
Transforms text content into downloadable PDF documents. Useful for creating reports, summaries, or formatted documents from conversation content.
When using video generation, you MUST select both tools:
- Generate Video - Initiates the video creation process
- Wait for Video - Monitors and retrieves the completed video
Video generation is an asynchronous process that takes time to complete. The "Wait for Video" tool is essential for your agent to know when the video is ready and to retrieve the final result. Without both tools, video generation will not work properly.
Example Usage:
A user asks your marketing agent to create a promotional video for a new product. The agent uses "Generate Video" to start creating the video based on the product description, then uses "Wait for Video" to monitor the generation process. Once complete, the agent provides the user with a link to the finished video.
Best Practices for Content Generation:
- Image Generation: Provide detailed, descriptive prompts for better results. Include style, mood, colors, and composition details.
- Video Generation: Always enable both "Generate Video" and "Wait for Video" tools together. Be patient as video generation can take several minutes.
- PDF Conversion: Structure your text content with clear headings and formatting before conversion for better-looking documents.
- Set clear expectations with users about generation times, especially for video content which may take longer to produce.
Event-Driven Triggers allow your agents to automatically execute tasks when specific events occur in external platforms. This transforms your agent from a passive responder into an active automation tool.
Automatic Syncing:
Agent5ive automatically fetches and syncs trigger types from Composio, ensuring you always have access to the latest triggers for platforms like GitHub, Slack, Gmail, and more.
Key Features:
- Wide Platform Support - Connect to over 30 platforms including GitHub, Slack, Gmail, and more.
- Real-time Execution - Agents react immediately when the event occurs.
- Event Tracking - View detailed logs of every trigger execution, including the payload and execution results.
- Status Monitoring - See exactly when a trigger was last fired with the "Last triggered" timestamp.
Example Usage:
Configure a "Pull Request Event" trigger for GitHub. When a new PR is opened, your agent automatically analyzes the code changes, checks for best practices, and posts a review comment—all without manual intervention.
For a complete guide on setting up and managing triggers, visit the Trigger Management Guide.