Feat/linux pulseaudio audio by xingjianll · Pull Request #152 · Project-NEURIA/OpenNeuro

xingjianll · 2026-04-06T06:20:06Z

No description provided.

On Linux, use parec/paplay subprocess to access PulseAudio/PipeWire virtual devices (e.g. cable_a.monitor, cable_b) that sounddevice can't see through ALSA. Device dropdown lists PulseAudio sources/sinks via pactl. Falls back to sounddevice on macOS/Windows. Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>

Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>

Single component replaces separate VAD + ASR pipeline. Streams audio to Deepgram's WebSocket API (nova-3), returns final transcriptions with speaker labels (e.g. "[Speaker 0] hello"). Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>

Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>

- Removes messages output, adds token/text/tool_calls/eos outputs (same as LLM) - Calls OpenAI Responses API directly with streaming - Recursive _call_llm for tool call → tool result → LLM loop - Accepts ToolDef inputs for function calling - Drains speech/feedback/vision/objects/pose/memory each iteration - Only calls LLM when new user input arrives Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>

…tchpad tool - AgentLoop calls OpenAI Responses API directly (no external LLM component) - Accepts VideoFrame input, encodes as base64 JPEG for vision - Built-in diary tool for tracking mental state (handled internally) - ScratchpadTool component with read/update tools - Continuous loop with no speech gating — agent is always active - No recursion — flat loop with fresh transient context each iteration - System prompt emphasizes embodiment and vision grounding Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>

- Use ToolResult frame instead of TextFrame for tool_result input - Use result.call_id instead of local tc dict for correct pairing - Add head look heading alongside body heading in pose context - Add debug prints for pose data Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>

xingjianll and others added 8 commits April 6, 2026 00:52

docs: add Linux PulseAudio virtual cable setup for VRChat audio routing

8e19fc1

Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>

feat: add AgentLoop component (copy of AgentState for future iteration)

ac872e3

Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>

refactor: AgentLoop uses while loop instead of iterating on request

83252b9

Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Feat/linux pulseaudio audio#152

Feat/linux pulseaudio audio#152
xingjianll wants to merge 8 commits into
mainfrom
feat/linux-pulseaudio-audio

xingjianll commented Apr 6, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

Conversation

xingjianll commented Apr 6, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant