Enhance your agents with additional capabilities beyond standard conversational AI. These features enable more immersive and context-aware interactions.

Available Capabilities

Webcam Vision

Enable your agent to see the user’s camera feed for more authentic, context-aware conversations. The agent can respond to visual cues, gestures, and what the user shows on camera.
Use cases:
  • Emotional connection: Read facial expressions for more empathetic interactions
  • Fitness coaching: Observe form and technique during exercises
  • Educational tutors: Provide feedback on physical work and materials
  • Virtual shopping assistants: Recommend products based on what customers show
Webcam vision is currently in preview and available to select customers. Contact support@beyondpresence.ai to discuss access for your use case.

Wakeup Mode

Configure your agent to respond only to specific trigger keywords or phrases (e.g., “jarvis”, “hey agent”, “computer”). This is ideal for assistant-style interactions where the agent should remain passive until explicitly activated. You specify a list of triggers that activate the agent; each trigger is either a single word or a phrase of space-separated words, and matching is case-insensitive.
Use cases:
  • Always-on assistants: Listen continuously, respond only when called
  • Hands-free environments: Enable voice activation when clicking isn’t possible
  • Ambient experiences: Remain present but non-intrusive in the background
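To make the trigger behavior concrete, here is a minimal sketch of case-insensitive matching for single words and space-separated phrases. It is an illustration only, not the platform's actual implementation: the function names `tokenize` and `find_trigger` are hypothetical, and whole-word matching is an assumption.

```python
import re
from typing import List, Optional


def tokenize(text: str) -> List[str]:
    # Lowercase the text and split it into word tokens, dropping punctuation.
    return re.findall(r"[\w']+", text.lower())


def find_trigger(transcript: str, triggers: List[str]) -> Optional[str]:
    """Return the first configured trigger found in the transcript, or None.

    Assumption: a trigger matches only on whole words, so "computer"
    does not fire on "computers".
    """
    words = tokenize(transcript)
    for trigger in triggers:
        phrase = tokenize(trigger)
        n = len(phrase)
        if n == 0:
            continue  # Ignore empty or punctuation-only triggers.
        # Slide a window of n words across the transcript.
        for i in range(len(words) - n + 1):
            if words[i:i + n] == phrase:
                return trigger
    return None
```

With triggers `["jarvis", "hey agent"]`, the utterance "Hey Agent, what's the weather?" would activate the agent in this sketch, while an utterance containing none of the triggers would leave it passive.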

Configuration

Capabilities are currently configured through our API; dashboard support is planned for a future release. Specify capabilities when creating an agent with the Create Agent endpoint. Capabilities are additive and agent-specific: you can enable several capabilities on one agent and give different agents different sets.
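The sketch below shows how additive, per-agent capabilities might be assembled into request bodies before calling the Create Agent endpoint. The field names (`name`, `capabilities`, `type`, `triggers`) and the capability identifiers (`wakeup_mode`, `webcam_vision`) are assumptions made for illustration; check the Create Agent endpoint reference for the actual schema. No request is sent here.

```python
import json
from typing import Any, Dict, List


def wakeup_capability(triggers: List[str]) -> Dict[str, Any]:
    # Hypothetical shape. Triggers are lowercased and whitespace-collapsed
    # locally, since matching is described as case-insensitive.
    cleaned = [" ".join(t.lower().split()) for t in triggers if t.strip()]
    return {"type": "wakeup_mode", "triggers": cleaned}


def webcam_capability() -> Dict[str, Any]:
    # Hypothetical shape. Webcam vision is in preview for select customers.
    return {"type": "webcam_vision"}


def build_agent_payload(name: str, capabilities: List[Dict[str, Any]]) -> Dict[str, Any]:
    # Capabilities are additive: each agent carries its own list.
    return {"name": name, "capabilities": list(capabilities)}


# Two agents with different capability sets.
assistant = build_agent_payload(
    "Desk assistant", [wakeup_capability(["Jarvis", "hey  agent"])]
)
coach = build_agent_payload(
    "Fitness coach", [webcam_capability(), wakeup_capability(["coach"])]
)

print(json.dumps(assistant, indent=2))
```

Keeping each capability as its own entry in a list mirrors the additive model described above: enabling another capability means appending an entry, not changing the existing ones.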
