跳到主要内容

Realtime Mascot

OpenHuman has a face. The mascot is an animated character that lives on your desktop and acts as the visible surface of the agent, what it's saying, what it's thinking about, when it's idle, when it's busy, when it has something to tell you.

It is not a chrome ornament. The mascot is wired into the same pieces as the rest of the agent: voice, memory, the subconscious loop, and the Google Meet integration.

What it does

It speaks, and lip-syncs to its own voice

When the agent replies, the audio is generated through a hosted TTS model and streamed to your speakers. At the same time, the mascot drives a viseme map against the audio so its mouth shapes match the words coming out.

It joins your meetings, as a real participant

The mascot can join a Google Meet call as a real participant: it hears everyone, takes notes into your Memory Tree, speaks back into the call when it has something to say, and pipes its own animated face into the meeting as the camera feed.

It moves and reacts to its surroundings

The mascot has mood states (idle, thinking, listening, talking, surprised, dreaming) and it transitions between them based on what the agent is doing. When you start typing it shifts into a listening pose. When the model is reasoning, it shows that.

It remembers you

The mascot is the visible part of an agent that has the Memory Tree underneath it. It remembers what you've talked about, who the people in your life are, what's open on your plate, what's been decided, and what's outstanding.

It thinks in the background

Even when you've stopped typing, the mascot keeps thinking. The Subconscious Loop is a background tick that loads your standing tasks, reads the current state of your workspace, and decides what to do about each one.

It dreams

When you're away long enough, the mascot enters a dreaming state. Dreaming is the agent's offline consolidation pass, distilling the day's chunks into longer-horizon summaries.

Why have a mascot at all?

Most assistants are a blinking text input. That's fine for a tool. It's not fine for something that's meant to be alongside you all day, with persistent memory of your life, taking actions on your behalf.

The mascot exists because:

  • Presence beats panels. A face you can glance at tells you, in one frame, whether the agent is busy, idle, dreaming, or trying to get your attention.
  • It makes voice calls feel like a conversation. A camera feed of an animated character lip-syncing to its own speech is a different experience than a robotic voice with a black tile.
  • Personality is a UX surface. A consistent character on screen is easier to trust, talk to, and forgive when it makes a mistake than a faceless API.

See also