Meeting Agents
The mascot's flagship integration is the Meeting Agent: the same character you talk to on your desktop can join a Google Meet on your behalf, sit in the participant grid as an animated face, hear everyone in the room, talk back into the call with its own voice, and reach for tools while the meeting is happening.
It is not a notetaker. A notetaker sits silently and produces a transcript. A meeting agent participates - it answers questions, looks things up live, remembers prior meetings with the same people, and contributes when you (or it) decide it has something useful to add.
What it actually does in a call
1. It joins as a real participant
The mascot joins the meeting through an embedded webview, the same way a person joins from their browser. There is a name, a face, and a tile in the grid. Other participants see and hear it the way they'd see and hear any other attendee.
2. It listens to everyone in the room
Inbound audio from the meeting is captured and pushed through streaming speech-to-text in real time. The transcript is diarized per speaker, cleaned up, and folded into the Memory Tree as the meeting unfolds.
3. It interacts
When you address it directly ("Ghosty, can you pull up the numbers from last quarter?"), or when it decides it has something useful to add, it generates a reply on the fly and speaks it into the meeting.
4. It speaks - its own TTS audio plays back into the call
Replies are generated by the project's TTS stack and streamed straight into the meeting as an outbound microphone feed. It is injected directly as the agent's audio, so it lands clean for everyone else and doesn't echo.
5. It animates - the mascot's face IS the camera feed
The mascot's canvas is piped into the Meet call as the outbound camera stream. When the agent is talking, the mascot is talking on the camera tile - mouth shapes lip-sync to the same TTS audio everyone else is hearing.
6. It uses tools mid-meeting
While the call is happening, the mascot has access to the same tool surface it has on your desktop:
- Memory Tree - recall prior meetings, decisions, open threads
- Auto-fetch - pull a thread from Slack, an email, a Linear ticket
- Native tools - search the web, scrape a page, run a quick code lookup
- Subconscious Loop outputs - anything it has been working on in the background
Setup, controls, privacy
- Joining a call. You can hand the mascot a Google Meet link from the desktop app; it will join with the configured display name.
- Mic and camera control. The agent's mic is the TTS injection stream. The agent's camera is the mascot frame producer. You can mute the agent's mic from the app at any time.
- Transcripts and memory. Live transcripts land in the Memory Tree the same way any other source does.
- No covert recording. The agent appears as a normal participant in the grid; everyone in the call can see it and see when it's speaking.
See also
- The Mascot - The on-screen character itself, outside of meetings.
- Native Voice - STT / TTS that the meeting agent rides on.
- Memory Tree - Where transcripts and decisions land.