https://github.com/pattern-ai-labs/agentcall
AgentCall lets AI Agents join meetings with voice, video & screen-share to build together. Supports Google Meet, Teams, Zoom (Beta)
https://github.com/pattern-ai-labs/agentcall
agentcall agentic-ai ai-agent ai-meeting-assistant claude-code claude-plugin claude-skill google-meet llm-agent meeting-bot microsoft-teams screen-sharing speech-to-text text-to-speech transcription tts voice-agent voice-assistant websocket zoom
Last synced: about 3 hours ago
JSON representation
AgentCall lets AI Agents join meetings with voice, video & screen-share to build together. Supports Google Meet, Teams, Zoom (Beta)
- Host: GitHub
- URL: https://github.com/pattern-ai-labs/agentcall
- Owner: pattern-ai-labs
- License: mit
- Created: 2026-04-12T13:50:18.000Z (3 months ago)
- Default Branch: main
- Last Pushed: 2026-05-19T12:21:37.000Z (about 1 month ago)
- Last Synced: 2026-05-19T15:30:59.383Z (about 1 month ago)
- Topics: agentcall, agentic-ai, ai-agent, ai-meeting-assistant, claude-code, claude-plugin, claude-skill, google-meet, llm-agent, meeting-bot, microsoft-teams, screen-sharing, speech-to-text, text-to-speech, transcription, tts, voice-agent, voice-assistant, websocket, zoom
- Language: Python
- Homepage: https://agentcall.dev
- Size: 622 KB
- Stars: 69
- Watchers: 5
- Forks: 4
- Open Issues: 0
-
Metadata Files:
- Readme: README.md
- License: LICENSE
Awesome Lists containing this project
README
# join-meeting
**Let any AI agent join and participate in video meetings via voice.**
An [AgentCall](https://agentcall.dev) skill that enables AI coding agents (Claude Code, Cursor, Codex, Gemini CLI, OpenClaw, Windsurf, Junie, and 30+ more) to join Google Meet, Zoom, and Microsoft Teams as a bot with voice conversation, visual avatar, screenshare, and real-time transcription.
## What It Does
Your AI agent joins a meeting as a participant and can:
- **Talk** — respond via text-to-speech (54 voices, 9 languages, <1s latency)
- **Listen** — receive real-time transcripts of what participants say
- **See** — take screenshots of shared screens, slides, presentations
- **Show** — display an animated avatar as the bot's camera feed
- **Present** — screenshare URLs, dashboards, slides dynamically during the call
- **Chat** — send and receive meeting chat messages
- **Collaborate** — use GetSun voice intelligence for natural multi-party conversations
The agent keeps its full session context — it can search code, edit files, run commands, and commit changes while talking in the meeting. The meeting is just another I/O channel.
## Quick Start
### Prerequisites
This is a **skill, not a standalone app** — it runs on top of a coding agent. You'll need:
1. **A coding agent** — one of:
- Claude Code (Anthropic)
- OpenClaw
- Cursor 2.5+
- OpenAI Codex CLI
- Gemini CLI
- Windsurf (Cascade)
- GitHub Copilot (JetBrains / VS Code)
- Junie
2. **Python 3.10+** or **Node.js 18+**
3. **Dependencies**: `pip install aiohttp websockets` (Python) or `npm install ws` (Node.js)
4. **AgentCall API key** — free at https://app.agentcall.dev/api-keys
### Install
Pick your agent. For anything not listed, see *From GitHub (any agent)*.
#### Claude Code
```
/plugin marketplace add pattern-ai-labs/agentcall
/plugin install join-meeting@agentcall
```
[](https://www.youtube.com/watch?v=mrwzHWnJMCw)
#### OpenClaw
```
openclaw skills install join-meeting
```
#### Cursor 2.5+
Use `/add-plugin` in the editor, or browse cursor.com/marketplace.
#### OpenAI Codex CLI
Drop the skill into `~/.codex/skills/`, or run `$skill-installer` from your Codex session. (`SKILL.md` is read directly.)
#### Gemini CLI
```
gemini extensions install https://github.com/pattern-ai-labs/agentcall
```
(Run from your terminal, not from inside the interactive CLI.)
#### From GitHub (any agent)
```
git clone https://github.com/pattern-ai-labs/agentcall.git
```
Then point your agent at `agentcall/SKILL.md` (the open Agent Skills standard, supported by 30+ frameworks):
- **Windsurf** — drop folder into `.windsurf/skills/`
- **GitHub Copilot (JetBrains)** — reference from `~/.copilot/agents/*.agent.md`
- **Junie** — `npx skills add pattern-ai-labs/agentcall --agent junie`
- **Claude Code (single-session, no install)** — drop folder into `.claude/skills/`
[](https://www.youtube.com/watch?v=qpgjhRvzP_Q)
### See it in action
[](https://www.youtube.com/watch?v=d4DD8HP65pU)
### Join a Meeting
Tell your agent: *"Join this meeting: https://meet.google.com/abc-def-ghi"*
## Modes
| Mode | What the bot has | Use case |
|------|-----------------|----------|
| `audio` | Voice only | Simplest — voice conversations, note-taking |
| `webpage-av` | Voice + animated avatar | Visual presence in meetings |
| `webpage-av-screenshare` | Voice + avatar + screenshare | Presentations, sharing content |
## Voice Strategies
| Strategy | How it works | Best for |
|----------|-------------|----------|
| `direct` | Agent controls TTS directly | 1-on-1 conversations, customer support |
| `collaborative` | GetSun voice intelligence handles timing | Group meetings, natural conversation flow |
## Features
- **VAD gap buffering** — combines fragmented transcripts into complete utterances
- **Barge-in prevention** — waits for silence before speaking
- **Auto-interruption** — detects when someone talks over the bot (webpage modes)
- **Sentence tracking** — knows which sentence was interrupted and when
- **WebSocket reconnection** — auto-reconnects on network blips with call status check
- **Crash recovery** — reconnects to active calls after agent restart
- **7 built-in templates** — pattern, orb, ring, avatar, dashboard, blank, voice-agent
- **Screenshots** — capture what's on screen at any time
- **Chat I/O** — send URLs, code, text that's hard to speak
- **API key persistence** — saved to `~/.agentcall/config.json`, asked once
## Agent Frameworks
Works with any agent that can spawn a subprocess and read stdout:
| Framework | Status |
|-----------|--------|
| Claude Code / Agent SDK | Full support (+ Stop Hook for persistent calls) |
| OpenAI Codex | Full support |
| Cursor | Full support (tmux for PTY) |
| Windsurf (Cascade) | Full support |
| Gemini CLI | Full support |
| JetBrains Junie | Full support |
| OpenClaw | Full support |
| GitHub Copilot | Full support |
| Aider | Partial (no subprocess) |
## Documentation
- **[SKILL.md](SKILL.md)** — Complete reference (modes, events, commands, patterns, Claude Code integration)
- **[examples/](examples/)** — 7 working examples (notetakers, support agent, meeting assistants, coding companion)
- **[references/](references/)** — API reference + guides (collaborative mode, interruption handling, crash recovery, screenshare)
## Pricing
Base plan: 6 hours of meeting time, 1 concurrent call. All features included.
Paid: per-minute billing. See [agentcall.dev](https://agentcall.dev) for details.
## License
MIT — use, modify, redistribute freely.
---
Built by [AgentCall](https://agentcall.dev) (Pattern AI Labs)