A living map of the assistant’s anatomy. Drag the head to look around, and tap any glowing point to see how that part works and what powers it.
Every assistant belongs to one organization — an isolated tenant with its own data, members and policies.
Nothing is shared between organizations. Each one keeps its own knowledge, conversations, integrations and audit trail, so your data never mixes with anyone else’s.
Admins decide which capabilities are switched on and set sensible usage limits, keeping the assistant predictable and costs in check across the whole team.
The body that every other ability lives inside — it shapes what the Brain and Connection are allowed to do.
Learn moreOne or more AI models do the thinking — reading what you typed or said, and writing the reply.
You can wire up several models side by side — OpenAI, Google Gemini, local Ollama and more — and switch which one is active in the moment. The right model for the job, never a single lock-in.
For natural spoken conversation, a realtime model can listen and speak in one continuous flow instead of passing audio between separate steps.
Takes everything the senses gather, reasons over it, and hands its answer to the Mouth to be spoken.
Learn moreShare your screen and microphone so the assistant can see what you’re doing and help in context.
Perfect for walkthroughs, demos and getting unstuck while you work — the assistant follows along live rather than guessing from a description.
It reasons with the same AI model as the rest of chat, and can talk you through what it sees when spoken replies are available.
Feeds what it sees straight to the Brain so the reasoning has real visual context.
Learn moreYour microphone audio is turned into text the assistant can read.
On-device transcription runs privately, works offline and needs no third-party service — ideal when speech should never leave the device.
For natural, flowing conversation, a streaming cloud service transcribes speech the instant you say it.
Sends the words it heard to the Brain to be understood.
Learn moreThe assistant turns its written answer into a natural spoken voice.
A neural text-to-speech voice reads the reply aloud, so a voice session feels like a real conversation rather than reading from a screen.
You pick the voice provider that best fits your needs for quality, latency and language coverage.
Rides the Connection out to your speakers — the final step of a spoken reply.
Learn moreEvery voice or video stream flows through a LiveKit media server.
LiveKit moves audio and video between you and the assistant — and between teammates — with the low latency that real conversation needs.
Run it as a managed cloud service with nothing to maintain, or self-host it for full control and data residency. Either way it carries the ears, mouth and eyes for everyone in the organization.
Carries all voice and video between you and the assistant — Listening, Speaking and Seeing all ride on it.
Learn more