An open API service indexing awesome lists of open source software.

https://github.com/pattern-ai-labs/agentcall

AgentCall lets AI Agents join meetings with voice, video & screen-share to build together. Supports Google Meet, Teams, Zoom (Beta)
https://github.com/pattern-ai-labs/agentcall

agentcall agentic-ai ai-agent ai-meeting-assistant claude-code claude-plugin claude-skill google-meet llm-agent meeting-bot microsoft-teams screen-sharing speech-to-text text-to-speech transcription tts voice-agent voice-assistant websocket zoom

Last synced: about 3 hours ago
JSON representation

AgentCall lets AI Agents join meetings with voice, video & screen-share to build together. Supports Google Meet, Teams, Zoom (Beta)

Awesome Lists containing this project

README

          

# join-meeting

**Let any AI agent join and participate in video meetings via voice.**

An [AgentCall](https://agentcall.dev) skill that enables AI coding agents (Claude Code, Cursor, Codex, Gemini CLI, OpenClaw, Windsurf, Junie, and 30+ more) to join Google Meet, Zoom, and Microsoft Teams as a bot with voice conversation, visual avatar, screenshare, and real-time transcription.

## What It Does

Your AI agent joins a meeting as a participant and can:

- **Talk** — respond via text-to-speech (54 voices, 9 languages, <1s latency)
- **Listen** — receive real-time transcripts of what participants say
- **See** — take screenshots of shared screens, slides, presentations
- **Show** — display an animated avatar as the bot's camera feed
- **Present** — screenshare URLs, dashboards, slides dynamically during the call
- **Chat** — send and receive meeting chat messages
- **Collaborate** — use GetSun voice intelligence for natural multi-party conversations

The agent keeps its full session context — it can search code, edit files, run commands, and commit changes while talking in the meeting. The meeting is just another I/O channel.

## Quick Start

### Prerequisites

This is a **skill, not a standalone app** — it runs on top of a coding agent. You'll need:

1. **A coding agent** — one of:
- Claude Code (Anthropic)
- OpenClaw
- Cursor 2.5+
- OpenAI Codex CLI
- Gemini CLI
- Windsurf (Cascade)
- GitHub Copilot (JetBrains / VS Code)
- Junie
2. **Python 3.10+** or **Node.js 18+**
3. **Dependencies**: `pip install aiohttp websockets` (Python) or `npm install ws` (Node.js)
4. **AgentCall API key** — free at https://app.agentcall.dev/api-keys

### Install

Pick your agent. For anything not listed, see *From GitHub (any agent)*.

#### Claude Code

```
/plugin marketplace add pattern-ai-labs/agentcall
/plugin install join-meeting@agentcall
```

[![Plugin setup in VS Code](https://img.youtube.com/vi/mrwzHWnJMCw/hqdefault.jpg)](https://www.youtube.com/watch?v=mrwzHWnJMCw)

#### OpenClaw

```
openclaw skills install join-meeting
```

#### Cursor 2.5+

Use `/add-plugin` in the editor, or browse cursor.com/marketplace.

#### OpenAI Codex CLI

Drop the skill into `~/.codex/skills/`, or run `$skill-installer` from your Codex session. (`SKILL.md` is read directly.)

#### Gemini CLI

```
gemini extensions install https://github.com/pattern-ai-labs/agentcall
```

(Run from your terminal, not from inside the interactive CLI.)

#### From GitHub (any agent)

```
git clone https://github.com/pattern-ai-labs/agentcall.git
```

Then point your agent at `agentcall/SKILL.md` (the open Agent Skills standard, supported by 30+ frameworks):

- **Windsurf** — drop folder into `.windsurf/skills/`
- **GitHub Copilot (JetBrains)** — reference from `~/.copilot/agents/*.agent.md`
- **Junie** — `npx skills add pattern-ai-labs/agentcall --agent junie`
- **Claude Code (single-session, no install)** — drop folder into `.claude/skills/`

[![Generic install with GitHub link](https://img.youtube.com/vi/qpgjhRvzP_Q/hqdefault.jpg)](https://www.youtube.com/watch?v=qpgjhRvzP_Q)

### See it in action

[![Demo](https://img.youtube.com/vi/d4DD8HP65pU/hqdefault.jpg)](https://www.youtube.com/watch?v=d4DD8HP65pU)

### Join a Meeting

Tell your agent: *"Join this meeting: https://meet.google.com/abc-def-ghi"*

## Modes

| Mode | What the bot has | Use case |
|------|-----------------|----------|
| `audio` | Voice only | Simplest — voice conversations, note-taking |
| `webpage-av` | Voice + animated avatar | Visual presence in meetings |
| `webpage-av-screenshare` | Voice + avatar + screenshare | Presentations, sharing content |

## Voice Strategies

| Strategy | How it works | Best for |
|----------|-------------|----------|
| `direct` | Agent controls TTS directly | 1-on-1 conversations, customer support |
| `collaborative` | GetSun voice intelligence handles timing | Group meetings, natural conversation flow |

## Features

- **VAD gap buffering** — combines fragmented transcripts into complete utterances
- **Barge-in prevention** — waits for silence before speaking
- **Auto-interruption** — detects when someone talks over the bot (webpage modes)
- **Sentence tracking** — knows which sentence was interrupted and when
- **WebSocket reconnection** — auto-reconnects on network blips with call status check
- **Crash recovery** — reconnects to active calls after agent restart
- **7 built-in templates** — pattern, orb, ring, avatar, dashboard, blank, voice-agent
- **Screenshots** — capture what's on screen at any time
- **Chat I/O** — send URLs, code, text that's hard to speak
- **API key persistence** — saved to `~/.agentcall/config.json`, asked once

## Agent Frameworks

Works with any agent that can spawn a subprocess and read stdout:

| Framework | Status |
|-----------|--------|
| Claude Code / Agent SDK | Full support (+ Stop Hook for persistent calls) |
| OpenAI Codex | Full support |
| Cursor | Full support (tmux for PTY) |
| Windsurf (Cascade) | Full support |
| Gemini CLI | Full support |
| JetBrains Junie | Full support |
| OpenClaw | Full support |
| GitHub Copilot | Full support |
| Aider | Partial (no subprocess) |

## Documentation

- **[SKILL.md](SKILL.md)** — Complete reference (modes, events, commands, patterns, Claude Code integration)
- **[examples/](examples/)** — 7 working examples (notetakers, support agent, meeting assistants, coding companion)
- **[references/](references/)** — API reference + guides (collaborative mode, interruption handling, crash recovery, screenshare)

## Pricing

Base plan: 6 hours of meeting time, 1 concurrent call. All features included.
Paid: per-minute billing. See [agentcall.dev](https://agentcall.dev) for details.

## License

MIT — use, modify, redistribute freely.

---

Built by [AgentCall](https://agentcall.dev) (Pattern AI Labs)