nanoclaw

Author	SHA1	Message	Date
gavrielc	2e6dc21748	refactor(v2): per-group filesystem init, persistent across spawns Each group's on-disk state (CLAUDE.md, .claude-shared/, agent-runner-src/) is now initialized exactly once at group creation and owned by the group forever after. Spawn does only mounts — no copies, no settings.json overwrites, no skill clobbers, no source resyncs. Global memory composition switches from "host reads /workspace/global/CLAUDE.md at bootstrap and stuffs it into systemPrompt.append" to "group CLAUDE.md imports it via @/workspace/global/CLAUDE.md at the top." Edits to global propagate instantly through the existing read-only mount; no copy, no restart. - src/group-init.ts: new initGroupFilesystem(group, opts?) — idempotent, populates groups/<folder>/, .claude-shared/, agent-runner-src/ only when paths don't already exist. - src/container-runner.ts: buildMounts() calls init defensively at the top (catches existing groups on first spawn after this change), drops the inline settings.json write, skills cpSync loop, and agent-runner-src rm-then-copy. Just mounts now. - src/delivery.ts: create_agent flow uses initGroupFilesystem with optional instructions, replacing the inline mkdirSync + writeFileSync. - container/agent-runner/src/index.ts: drops GLOBAL_CLAUDE_MD reading. systemContext.instructions is now only the runtime-generated destinations addendum. - scripts/migrate-group-claude-md.ts: one-shot migration that prepends the @-import to existing groups' CLAUDE.md. Skips if global doesn't exist or if the @-import is already present (regex match on the @ form to avoid false positives from prose mentions of the path). - groups/main/CLAUDE.md: prepended by the migration. Existing groups need a one-time wipe of their agent-runner-src/ dir so init re-populates from current host source — done locally before this commit. 
Future host-side updates to container/skills/ or container/agent-runner/src/ won't auto-propagate; that's the trade-off for unconditional persistence and will be covered by host-mediated refresh tools in a follow-up. Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>	2026-04-13 14:17:50 +03:00
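Note on 2e6dc21748: the migration's skip rule (prepend the @-import unless it is already present, without being fooled by prose that mentions the path) can be sketched as below. This is an illustration only, not the contents of scripts/migrate-group-claude-md.ts. The function name `prependGlobalImport` and the exact regex are assumptions.

```typescript
// Hypothetical helper illustrating the idempotent prepend described in the commit.
const GLOBAL_IMPORT = "@/workspace/global/CLAUDE.md";

// Assumed pattern: the import counts only when it stands alone on a line,
// so a sentence that merely mentions the path is not treated as an import.
const IMPORT_LINE = /^@\/workspace\/global\/CLAUDE\.md\s*$/m;

function prependGlobalImport(content: string): string {
  if (IMPORT_LINE.test(content)) {
    return content; // already migrated, leave the file untouched
  }
  return `${GLOBAL_IMPORT}\n\n${content}`;
}
```

Running the helper twice on the same text returns the first result unchanged, which is the property that makes a one-shot migration safe to re-run.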
gavrielc	b63dd186df	refactor(agent-runner): decouple provider interface from Claude specifics Reshape AgentProvider so provider-specific assumptions stop leaking into the generic layer. No change to what reaches sdkQuery() — same values, different plumbing. - QueryInput: opaque `continuation` replaces `sessionId` + `resumeAt`; `systemContext.instructions` replaces ambiguous `systemPrompt`; `mcpServers`, `env`, `additionalDirectories` move to `ProviderOptions` at construction time. - AgentProvider gains `isSessionInvalid(err)` and `supportsNativeSlashCommands` so the poll-loop stops regex-matching Claude error strings and gates passthrough slash commands per provider. - ClaudeProvider owns `CLAUDE_CODE_AUTO_COMPACT_WINDOW` and the stale-session regex internally. - ProviderEvent.activity kept and documented as the liveness signal (fires on every SDK message so the idle timer stays honest during long tool runs); init carries `continuation` instead of `sessionId`. - poll-loop drops mcpServers/env/systemPrompt from its config; admin user id now passed explicitly. Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>	2026-04-13 10:25:29 +03:00
gavrielc	e07158e194	fix(agent-runner): preserve thread_id when sending to current channel send_file and send_message with an explicit `to` parameter were always setting thread_id to null, causing files and messages to land in the Discord channel root instead of the thread the session is bound to. Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>	2026-04-12 18:13:42 +03:00
Gabi Simons	b140b3655b	fix(agent-runner): reply to originating channel in single-destination shortcut When an agent has one configured destination (e.g. Discord) but receives a message from a different channel (e.g. Slack), the single-destination shortcut was routing replies to the destination instead of the originating channel. Now uses the inbound message's routing context (channel_type, platform_id) when available, falling back to the destination table only when routing context is absent. Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>	2026-04-12 12:34:21 +00:00
gavrielc	9dda75bb21	docs(v2): cross-mount invariants + diagrams; inline a2a routing - session-manager.ts: shrink the cross-mount invariant header from 31 lines to 12, keeping each invariant's cause and consequence inline. - agent-runner/db/connection.ts: parallel cross-mount comment for the container-side reader (inbound.db must be journal_mode=DELETE). - agent-runner/db/messages-out.ts: document that even/odd seq parity is load-bearing — seq is the agent-facing message ID returned by send_message and consumed by edit_message / add_reaction, looked up across both tables. - v2-checklist.md: record the cross-mount invariants and seq parity under Core Architecture so future "simplifications" don't regress them. - scripts/sanity-live-poll.ts: empirical validation harness for the three cross-mount invariants — flips each one and observes silent message loss / corruption. - delivery.ts: inline routeAgentMessage at its single callsite (-17 net lines). The wrapper added more boilerplate than it factored. - docs/v2-architecture-diagram.{md,html}: rendered Mermaid diagrams of the v2 system, message flow, named destinations, entity model, and the two-DB split. - channels/adapter.ts, chat-sdk-bridge.ts, credentials.ts, db/sessions.ts, db/db-v2.test.ts: prettier format pass. Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>	2026-04-12 00:21:12 +03:00
gavrielc	062b0cb6bf	fix(agent-runner): add updated_at column to session_state on older DBs session_state was added after the initial v2 schema with a lazy `CREATE TABLE IF NOT EXISTS` in getOutboundDb(), so older session outbound.db files have a session_state table from before updated_at existed. The lazy create is a no-op when the table already exists, leaving the column missing and causing: Error: table session_state has no column named updated_at on every `INSERT OR REPLACE INTO session_state` call. Follow up the CREATE IF NOT EXISTS with a PRAGMA table_info check and ALTER TABLE ADD COLUMN when updated_at is missing. Cheap on every open, only runs DDL once per DB. Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>	2026-04-11 17:18:34 +03:00
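Note on 062b0cb6bf: the "lazy create, then check and add the column" pattern can be sketched as below. The `SchemaDb` interface is a hypothetical stand-in for the real SQLite handle (where `columnNames` would be backed by `PRAGMA table_info`). The column list in the CREATE statement is also an assumption, since the commit only names `updated_at`.

```typescript
// Hypothetical minimal view of a SQLite handle, used only for this sketch.
interface SchemaDb {
  exec(sql: string): void;
  // In a real driver this would wrap `PRAGMA table_info(<table>)`.
  columnNames(table: string): string[];
}

// Returns true when the ALTER TABLE actually ran, false when nothing was needed.
function ensureSessionStateSchema(db: SchemaDb): boolean {
  // No-op on DBs that already have the table, including ones with the old shape.
  db.exec(
    "CREATE TABLE IF NOT EXISTS session_state (key TEXT PRIMARY KEY, value TEXT, updated_at TEXT)"
  );
  if (db.columnNames("session_state").includes("updated_at")) {
    return false; // schema already current
  }
  db.exec("ALTER TABLE session_state ADD COLUMN updated_at TEXT");
  return true;
}
```

The check is a cheap read on every open, and the DDL runs at most once per database file, matching the cost profile the commit describes.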
gavrielc	e92b245399	feat(v2): OneCLI 0.3.1 — approvals, credential collection, threaded routing Three features built on top of @onecli-sh/sdk 0.3.1, landed together because they share wiring surfaces (session DB schema, delivery dispatcher, Chat SDK bridge, channel adapter contract). ## OneCLI manual-approval handler * `src/onecli-approvals.ts` — long-polls OneCLI via the SDK's `configureManualApproval`; on each request, delivers an `ask_question` card to the admin agent group's first messaging group, persists a `pending_approvals` row, and waits on an in-memory Promise resolved by the admin's button click or an expiry timer. Expired cards are edited to "Expired (...)" and a startup sweep flushes any rows left over from a previous process. * Short 11-byte approval id (`oa-<8 base36>`) instead of the SDK's UUID so the Telegram 64-byte `callback_data` limit is respected; the OneCLI UUID stays in the persisted payload for audit. * Migration 003 consolidated: `pending_approvals` now has the OneCLI-aware columns from the start (`agent_group_id`, `channel_type`, `platform_id`, `platform_message_id`, `expires_at`, `status`), `session_id` relaxed to nullable so cross-session approvals fit. * `handleQuestionResponse` in `src/index.ts` now routes OneCLI approvals through `resolveOneCLIApproval` before falling back to the session-bound approval path. ## Credential collection from chat New `trigger_credential_collection` MCP tool — the agent researches a third-party API, calls the tool with `{name, hostPattern, headerName, valueFormat, description}`, and blocks until the host reports saved, rejected, or failed. The credential value never enters the agent's context: the user submits it into a Chat SDK Modal on the host side, the host writes it to OneCLI via a thin facade (`src/onecli-secrets.ts` — shells out to `onecli secrets create`, shape mirrors the SDK we expect upstream), and only the status string flows back to the container via a system message. 
* `src/credentials.ts` — host-side handler: delivers the card to the conversation's own channel (not the admin channel — credential collection is a user-facing flow, distinct from admin approval), persists a `pending_credentials` row, drives the submit → `createSecret` → notify pipeline. Falls back gracefully when the channel doesn't support modals. * `src/db/credentials.ts` + migration 005: `pending_credentials` table. * `src/channels/chat-sdk-bridge.ts`: renders a `credential_request` card, handles the `nccr:` action prefix by opening a Modal with a TextInput, registers an `onModalSubmit` handler for the `nccm:` callback prefix. * `container/agent-runner/src/mcp-tools/credentials.ts`: the blocking MCP tool, mirroring the `ask_user_question` polling pattern. * `container/agent-runner/src/db/messages-in.ts`: `findCredentialResponse` helper to pick up the system message the host writes back. ## Threaded adapter routing The destination layer previously didn't carry thread context, so agent replies to Discord always landed in the root channel regardless of which thread the inbound came from. * `ChannelAdapter.supportsThreads: boolean` — declared by every channel skill at `createChatSdkBridge`. Threaded: Discord, Slack, Teams, Google Chat, Linear, GitHub, Webex. Non-threaded: Telegram, WhatsApp Cloud, Matrix, Resend, iMessage. * `src/router.ts`: non-threaded adapters strip `threadId` at ingest (threads collapse to channel-level sessions). Threaded adapters override the wiring's `session_mode` to `'per-thread'` so each thread = a session (except `agent-shared`, which is preserved as a cross-channel intent the adapter can't know about). * `session_routing` table in `inbound.db` — single-row default reply routing written by the host on every container wake from `session.messaging_group_id` + `session.thread_id`. Forward-compat `CREATE TABLE IF NOT EXISTS` handles older session DBs lazily. * `container/agent-runner/src/db/session-routing.ts` — container-side reader. 
* `send_message` / `send_file` / `ask_user_question` / `send_card` / scheduling tools all default their routing (channel, platform, and thread) from the session when no explicit `to` is given. Explicit `to` uses the destination's channel with `thread_id = null` (cross-destination sends start a new conversation elsewhere). * `poll-loop.ts::sendToDestination` (the final-text single-destination shortcut) now inherits `thread_id` from `RoutingContext` too — this was the root cause of Discord replies landing in the root channel even after `send_message` was wired correctly. ## Related cleanups * `src/container-runner.ts`: OneCLI agent identifier switched from the lossy folder-derived string to `agent_group.id`, making `getAgentGroup(externalId)` a trivial reverse lookup for per-agent scoping. * `wakeContainer` race fix via an in-flight promise map — concurrent wakes during the async buildContainerArgs / OneCLI `applyContainerConfig` window no longer double-spawn containers against the same session directory. * `src/db/db-v2.test.ts`: dropped the brittle `expect(row.v).toBe(N)` schema version assertion — it had to be bumped on every migration addition. Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>	2026-04-11 17:18:21 +03:00
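Note on e92b245399: the `wakeContainer` race fix is described as an in-flight promise map. A minimal sketch of that technique follows. The names `inFlightWakes` and `wakeOnce` are hypothetical, and the real function takes different arguments.

```typescript
// Hypothetical sketch: one pending wake per session, shared by concurrent callers.
const inFlightWakes = new Map<string, Promise<void>>();

function wakeOnce(sessionId: string, spawn: () => Promise<void>): Promise<void> {
  const existing = inFlightWakes.get(sessionId);
  if (existing) {
    return existing; // a wake is already running for this session, reuse it
  }
  const clear = () => {
    inFlightWakes.delete(sessionId);
  };
  // Remove the entry whether the spawn succeeds or fails, so later wakes can retry.
  const wake = spawn().then(
    () => clear(),
    (err) => {
      clear();
      throw err;
    }
  );
  inFlightWakes.set(sessionId, wake);
  return wake;
}
```

Because the map entry is set before any await completes, two callers arriving during the async setup window receive the same promise, so only one container is spawned per session directory.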
gavrielc	b59216c299	fix(v2): persist SDK session ID across container restarts The v2 poll loop held the session ID in a local variable, so every container restart started a fresh SDK session even though the .jsonl transcript was still sitting in the shared .claude mount. Store it in outbound.db (container-owned, already per channel/thread), seed the loop on startup, clear on /clear, and recover from stale-session errors the same way v1 did. Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>	2026-04-11 01:17:42 +03:00
gavrielc	b591d7ce96	refactor: move destinations from JSON file into inbound.db The per-session destination map was being written as a sidecar JSON file (/workspace/.nanoclaw-destinations.json) — inconsistent with the rest of v2, where all host↔container IO goes through inbound.db / outbound.db. Move it into a `destinations` table in INBOUND_SCHEMA. The host writes it before every container wake AND on demand (e.g. after create_agent) so the creator sees the new child destination mid-session without a restart. The container queries the table live on every lookup — no cache, no staleness window. - src/db/schema.ts: add `destinations` table to INBOUND_SCHEMA. - src/session-manager.ts: writeDestinationsFile → writeDestinations, writes via DELETE + INSERT inside a transaction. - src/delivery.ts: create_agent handler calls writeDestinations on the creator's session after inserting the new destination rows. - container/agent-runner/src/destinations.ts: queries inbound.db directly in every findByName/getAllDestinations/findByRouting call. No more cache. No setDestinationsForTest (obsolete). No fs import. - container/agent-runner/src/index.ts and mcp-tools/index.ts: remove loadDestinations() calls — no longer needed. - Test helper initTestSessionDb creates the destinations table. Integration test inserts a row directly instead of mocking the cache. No backwards compatibility: sessions predating the schema update must be recreated. This is fine on the v2 branch. Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>	2026-04-10 16:45:53 +03:00
gavrielc	09e1861a22	feat: single-destination shortcut — no wrapping needed when there's only one When an agent has exactly one configured destination, wrapping output in <message to="..."> blocks is unnecessary. Plain text goes to the sole destination automatically. This preserves the simple "just reply" flow for the common case of one user on one channel. Applies in three places: - System prompt addendum: single-destination case gets a simplified explanation ("your messages are delivered to X, just write directly"). Multi-destination case keeps the <message to="..."> syntax docs. - Main output parser: if zero <message> blocks are found and there is exactly one destination, the entire cleaned text (with <internal> stripped) is sent to that destination. - send_message / send_file MCP tools: `to` parameter is now optional. With one destination, omitted defaults to it. With multiple, omitting returns an error listing the options. Multi-destination behavior is unchanged — explicit <message to="..."> is still required, and untagged text is still scratchpad. groups/global/CLAUDE.md updated to describe both cases. Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>	2026-04-10 16:36:09 +03:00
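Note on 09e1861a22: the output-parser rule (explicit `<message to="...">` blocks win; otherwise plain text goes to the sole destination; otherwise it is scratchpad) can be sketched as below. `parseAgentOutput` and `OutboundMessage` are hypothetical names, and the regexes are assumptions rather than the repository's own.

```typescript
interface OutboundMessage {
  to: string;
  text: string;
}

// Hypothetical sketch of the single-destination shortcut in the main output parser.
function parseAgentOutput(output: string, destinations: string[]): OutboundMessage[] {
  const blockPattern = /<message to="([^"]+)">([\s\S]*?)<\/message>/g;
  const messages: OutboundMessage[] = [];
  let match: RegExpExecArray | null;
  while ((match = blockPattern.exec(output)) !== null) {
    messages.push({ to: match[1], text: match[2].trim() });
  }
  if (messages.length > 0) {
    return messages; // explicit blocks always take precedence
  }
  if (destinations.length !== 1) {
    return []; // multi-destination: untagged text stays scratchpad
  }
  // Single destination: send everything except <internal> scratchpad.
  const cleaned = output.replace(/<internal>[\s\S]*?<\/internal>/g, "").trim();
  return cleaned ? [{ to: destinations[0], text: cleaned }] : [];
}
```

The sketch does not validate names inside explicit blocks; per the later commits, the host re-checks every outbound route.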
gavrielc	e83ffbc103	feat: named destinations + permission enforcement + fire-and-forget self-mod Replaces implicit routing context (NANOCLAW_PLATFORM_ID env vars) with per-agent named destination maps. Agents reference channels and peer agents by local names; the host re-validates every outbound route against a new agent_destinations table that is both the routing map and the ACL. Model changes: - New migration 004 adds agent_destinations (agent_group_id, local_name, target_type, target_id). Backfills from existing messaging_group_agents. - Host writes /workspace/.nanoclaw-destinations.json before every container wake so admin changes take effect on next start. - Container loads map at startup, appends system-prompt addendum listing available destinations and the <message to="name">…</message> syntax. - Agent main output is parsed for <message to="..."> blocks; each block becomes a messages_out row with routing resolved via the local map. Untagged text and <internal>…</internal> are scratchpad (logged only). - send_message MCP tool now takes `to` (destination name) instead of raw routing fields. send_to_agent deleted (redundant — agents are just destinations). send_file/edit_message/add_reaction route via map too. - Inbound formatter adds from="name" attribute via reverse-lookup so the agent sees a consistent namespace in both directions. Permission enforcement: - Host checks hasDestination() before every channel delivery AND every agent-to-agent route. Unauthorized messages dropped and logged. - routeAgentMessage simplified: ~15 lines, no JSON parse, content copied verbatim (target formatter resolves the sender via its own local map). - create_agent is admin-only, checked at both the container (tool not registered for non-admins) and the host (re-check on receive). Inserts bidirectional destination rows so parent↔child comms work immediately. Includes path-traversal guard on folder name. 
Self-modification cleanup: - add_mcp_server now requires admin approval (previously had none). - install_packages validates package names on BOTH sides (container tool + host receiver) with strict regex. Max 20 packages per request. - All three self-mod tools are fire-and-forget: write request, return immediately with "submitted" message. Admin approval triggers a chat notification to the requesting agent — no tool-call polling, no 5-min holds. On rebuild/mcp_server approval, the container is killed so the next wake picks up new config/image. - Approval delivery extracted into requestApproval() helper (the one place where three call sites were literally identical). Also folded in the phase-1 dynamic import cleanup (create_agent no longer does `await import('./db/agent-groups.js')`) and removes NANOCLAW_PLATFORM_ID / CHANNEL_TYPE / THREAD_ID env-var routing entirely. Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>	2026-04-10 16:31:37 +03:00
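Note on e83ffbc103: the commit says `install_packages` validates names with a strict regex and caps requests at 20 packages, but does not show the regex. The sketch below uses an assumed pattern (lowercase names with an optional npm-style scope) purely for illustration. `validatePackages` is a hypothetical name.

```typescript
const MAX_PACKAGES = 20; // limit stated in the commit message

// Assumed pattern, not the repository's: optional "@scope/" then a plain name.
const PACKAGE_NAME = /^(@[a-z0-9][a-z0-9._-]*\/)?[a-z0-9][a-z0-9._+-]*$/;

// Returns null when the request is acceptable, otherwise a human-readable error.
function validatePackages(names: string[]): string | null {
  if (names.length > MAX_PACKAGES) {
    return `too many packages: ${names.length} (max ${MAX_PACKAGES})`;
  }
  const bad = names.find((name) => !PACKAGE_NAME.test(name));
  return bad === undefined ? null : `invalid package name: ${bad}`;
}
```

Running the same function in both the container tool and the host receiver is one way to get the two-sided check the commit describes, since the host cannot assume the container-side check ran.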
gavrielc	d8fbd3b239	feat: agent-to-agent communication, dynamic agent creation, self-modification tools Agent-to-agent: host routes messages with channel_type='agent' to target agent's inbound.db, enriches with sender info, wakes target container. Bidirectional routing works via inherited routing context. Dynamic agents: create_agent MCP tool + system action handler creates agent groups, folders, and optional CLAUDE.md on the fly. Self-modification: install_packages (apt/npm, requires admin approval), add_mcp_server (no approval), request_rebuild (builds per-agent-group Docker image with approved packages). Approval flow reuses interactive card infrastructure with pending_approvals table. Also includes fixes from prior session: attachment download, reply context extraction, message editing (platform message ID tracking), delivery retry limits, and card update on button click. Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>	2026-04-10 01:11:06 +03:00
gavrielc	82cb363f84	v2: split session DB into inbound/outbound for write isolation Eliminates SQLite write contention across the host-container mount boundary by splitting the single session.db into two files, each with exactly one writer: inbound.db — host writes (messages_in, delivered tracking) outbound.db — container writes (messages_out, processing_ack) Key changes: - Host uses even seq numbers, container uses odd (collision-free) - Container heartbeat via file touch instead of DB UPDATE - Scheduling MCP tools now emit system actions via messages_out (host applies them to inbound.db during delivery) - Host sweep reads processing_ack + heartbeat file for stale detection - OneCLI ensureAgent() call added (was missing from v2, caused applyContainerConfig to reject unknown agent identifiers) Verified: tsc clean, 327 tests pass, real e2e through Docker works. Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>	2026-04-09 12:17:31 +03:00
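Note on 82cb363f84: the even/odd split lets each side allocate message sequence numbers without coordinating writes. A minimal sketch, assuming each writer can read the highest seq currently visible across both tables. `nextSeq` and `Writer` are hypothetical names.

```typescript
type Writer = "host" | "container";

// Hypothetical allocator: the host takes even numbers, the container takes odd ones.
function nextSeq(currentMax: number, writer: Writer): number {
  const wantOdd = writer === "container";
  let next = currentMax + 1;
  if ((next % 2 === 1) !== wantOdd) {
    next += 1; // skip the value that belongs to the other writer
  }
  return next;
}
```

For example, with a current maximum of 2 the host would take 4 and the container would take 3, so simultaneous inserts on the two sides cannot collide.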
gavrielc	9486d56b01	v2: make v2 the main entry point, move v1 to src/v1/ - Move all v1 files (index, router, container-runner, db, ipc, types, logger, channels/registry, and all utilities) to src/v1/ as a fully self-contained archive with no shared dependencies - Rename v2 files to remove -v2 suffix (index-v2.ts → index.ts, etc.) - Update all imports across v2 source, tests, and setup files - Migrate shared utilities (config, env, container-runtime, mount-security, timezone, group-folder) from pino logger to v2 log module - Migrate setup/ files from logger to log with argument order swap - Container agent-runner: move v1 entry to v1/, rename v2 to index.ts - Update setup skill to offer all 13 v2 channels - Install all Chat SDK adapter packages - dist/index.js now runs v2; dist/v1/index.js runs v1 Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>	2026-04-09 11:40:36 +03:00
gavrielc	8a06b01646	v2: SQLite state adapter, admin commands, compact feedback - Replace in-memory Chat SDK state with SqliteStateAdapter — thread subscriptions now persist across restarts - Add migration 002 for chat_sdk_kv, subscriptions, locks, lists tables - Handle /clear in agent-runner (reset sessionId) — SDK has supportsNonInteractive:false for this command - Pass /compact, /context, /cost, /files through to SDK as admin commands - Skip admin commands in follow-up poll so they start fresh queries - Emit compact_boundary events as user-visible feedback messages - Pass NANOCLAW_ADMIN_USER_ID and NANOCLAW_ASSISTANT_NAME to containers Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>	2026-04-09 03:58:35 +03:00
gavrielc	c31bb02c06	v2 phase 5: pending questions with interactive cards End-to-end ask_user_question flow: - Agent MCP tool writes question card to messages_out - Host delivery creates pending_questions row, delivers as Discord Card with buttons - Local webhook server receives Gateway INTERACTION_CREATE events - Acknowledges interaction + updates card to show selected answer - Routes response back to session DB as system message - MCP tool poll picks up response and returns to agent Key fixes: - Poll loop now skips system messages (reserved for MCP tool responses) - Gateway listener uses webhookUrl forwarding mode for interaction support - Button custom_id encodes questionId + option text for self-contained routing Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>	2026-04-09 03:26:16 +03:00
gavrielc	c348fabf22	v2 phase 5: scheduling fixes, media handling, command processing - Host sweep: fix DELETE journal mode, busy_timeout, seq in recurrence INSERT - Outbound files: delivery reads from outbox dir, passes buffers to adapter, cleans up after delivery. Chat SDK bridge sends files via postMessage. - Inbound attachments: formatter includes attachment info in prompts - Commands: categorize /commands as admin, filtered, or passthrough. Admin commands check sender against NANOCLAW_ADMIN_USER_ID. Filtered commands silently dropped. Passthrough sent raw to agent. Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>	2026-04-09 02:59:33 +03:00
gavrielc	afbc20a6c4	v2 phase 4+5: Discord via Chat SDK, expanded MCP tools, message seq IDs - Chat SDK bridge + Discord adapter (gateway listener, message routing) - MCP tools refactored into modular structure: core (send_message, send_file, edit_message, add_reaction), scheduling (schedule/list/cancel/pause/resume tasks), interactive (ask_user_question, send_card), agents (send_to_agent) - Message seq IDs: shared integer sequence across messages_in/out so agents see small numeric IDs instead of platform snowflakes - busy_timeout=5000 for session DB (poll loop + MCP server concurrent access) - Always copy agent-runner source to fix stale cache when non-index files change - Seed script for Discord testing, e2e test script Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>	2026-04-09 02:53:39 +03:00
gavrielc	6f2a7314d0	v2: fix agent-runner lifecycle and session DB reliability - Use DELETE journal mode for session DBs instead of WAL. WAL doesn't sync reliably across Docker volume mounts (VirtioFS), causing dropped writes and duplicate deliveries. - Add 20s idle detection to end the query stream. The concurrent poll tracks SDK activity via a new 'activity' provider event. When no SDK events arrive for 20s and no messages are pending, the stream ends and the poll loop continues. - Add touchProcessing heartbeat so the host can distinguish active agents from idle ones by checking status_changed recency. - Catch query errors in the poll loop and write error responses to messages_out instead of crashing the process. Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>	2026-04-09 01:34:59 +03:00
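Note on 6f2a7314d0: the idle rule (end the query stream when no SDK events have arrived for 20s and no messages are pending) reduces to a small predicate. The sketch below is an illustration with hypothetical names, not the poll loop's actual code.

```typescript
const IDLE_TIMEOUT_MS = 20_000; // 20s, as stated in the commit message

// Hypothetical predicate: both conditions from the commit must hold.
function shouldEndStream(
  nowMs: number,
  lastActivityMs: number,
  pendingMessages: number,
  idleTimeoutMs: number = IDLE_TIMEOUT_MS
): boolean {
  const idleFor = nowMs - lastActivityMs;
  return pendingMessages === 0 && idleFor >= idleTimeoutMs;
}
```

`lastActivityMs` would be refreshed on every 'activity' provider event, which is why long tool runs that keep emitting SDK messages do not trip the timer.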
gavrielc	03c4e3b672	v2: fix container launch for v2 agent-runner - Override entrypoint to compile and run index-v2.js (no stdin) - Add better-sqlite3 + @types to agent-runner dependencies - Exclude test files from agent-runner tsconfig (Docker build) - Add real e2e test script (host → container → Claude → session DB) Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>	2026-04-08 23:49:30 +03:00
gavrielc	18d0b6e53f	v2: add agent-runner integration tests Poll loop end-to-end with mock provider: message pickup, batch processing, concurrent polling for late arrivals. Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>	2026-04-08 23:40:00 +03:00
gavrielc	5a0098edc9	v2 phase 2: agent-runner — provider interface, poll loop, formatter AgentProvider abstraction with Claude and Mock implementations. Poll loop reads messages_in, formats by kind, queries provider, writes results to messages_out. Concurrent polling pushes follow-up messages into active queries. - providers/types.ts: AgentProvider, AgentQuery, ProviderEvent - providers/claude.ts: wraps Agent SDK with MessageStream, hooks, transcript archiving - providers/mock.ts: canned responses with push() support - providers/factory.ts: createProvider() - formatter.ts: format by kind (chat/task/webhook/system), XML escaping, routing extraction - poll-loop.ts: poll → format → query → write, concurrent polling - mcp-tools.ts: MCP server with send_message tool - index-v2.ts: new entry point (config from env, enters poll loop) - 11 new tests, all 288 tests pass Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>	2026-04-08 23:36:55 +03:00
gavrielc	3f0451b7b0	v2 phase 1: foundation — types, DB layer, logging Add the v2 data layer: typed interfaces, central DB with migration runner, per-entity CRUD, and agent-runner session DB operations. - src/log.ts: concise message-first logging API - src/types-v2.ts: AgentGroup, MessagingGroup, Session, MessageIn/Out - src/db/: connection (WAL), migration runner, 001-initial schema, CRUD for agent_groups, messaging_groups, sessions, pending_questions - container/agent-runner/src/db/: session DB connection, messages_in reads + status transitions, messages_out writes - 31 new tests, all 277 tests pass Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>	2026-04-08 23:34:09 +03:00
gavrielc	f77f9ce2c4	feat: set auto-compact threshold to 165k tokens Compact earlier to preserve more context fidelity before the window fills. Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>	2026-04-05 16:15:56 +03:00
gavrielc	db3440f662	feat: upgrade agent SDK to 0.2.92 with 1M context and 200k auto-compact Use sonnet[1m] for full 1M context window and set auto-compact at 200k tokens to keep costs down while preserving access to extended context. Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>	2026-04-04 23:47:17 +03:00
gavrielc	032ba77a7f	feat: mount store rw for main agent and add requiresTrigger to register_group - Mount store/ separately as read-write so the main agent can access the SQLite database directly. - Add requiresTrigger parameter to the register_group MCP tool (host IPC already supported it, but the tool never exposed it). Defaults to false (no trigger). - Update group registration instructions to ask user about trigger. Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>	2026-04-03 16:17:57 +03:00
gavrielc	87e89147c9	style: run prettier on container/agent-runner/src/ Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>	2026-04-01 21:53:02 +03:00
Koshkoshinsk	d622a79fe2	fix: suppress spurious chat message on script skip When a script returns wakeAgent=false, set result to null so the host doesn't forward an internal status string to the user's chat. Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>	2026-03-25 11:41:25 +00:00
Gabi Simons	9f5aff99b6	feat: add script execution phase to agent-runner Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>	2026-03-18 14:04:11 +02:00
Gabi Simons	eb65121938	feat: add script to ContainerInput and task snapshot Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>	2026-03-18 14:04:11 +02:00
Gabi Simons	a516cc5cfe	feat: add script parameter to MCP task tools Add optional `script` field to schedule_task and update_task MCP tools, allowing agents to attach a pre-flight bash script that controls whether the task agent is woken up. Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>	2026-03-18 14:04:11 +02:00
gavrielc	e7318be0a2	chore: bump claude-agent-sdk to ^0.2.76 Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>	2026-03-14 15:24:15 +02:00
Gabi Simons	13ce4aaf67	feat: enhance container environment isolation via credential proxy (#798) * feat: implement credential proxy for enhanced container environment isolation Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com> * fix: address PR review — bind proxy to loopback, scope OAuth injection, add tests - Bind credential proxy to 127.0.0.1 instead of 0.0.0.0 (security) - OAuth mode: only inject Authorization on token exchange endpoint - Add 5 integration tests for credential-proxy.ts - Remove dangling comment - Extract host gateway into container-runtime.ts abstraction - Update Apple Container skill for credential proxy compatibility Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com> * fix: scope OAuth token injection by header presence instead of path Path-based matching missed auth probe requests the CLI sends before the token exchange. Now the proxy replaces Authorization only when the container actually sends one, leaving x-api-key-only requests (post-exchange) untouched. Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com> * fix: bind credential proxy to docker0 bridge IP on Linux On bare-metal Linux Docker, containers reach the host via the bridge IP (e.g. 172.17.0.1), not loopback. Detect the docker0 interface address via os.networkInterfaces() and bind there instead of 0.0.0.0, so the proxy is reachable by containers but not exposed to the LAN. Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com> * fix: bind credential proxy to loopback on WSL WSL uses Docker Desktop with the same VM routing as macOS, so 127.0.0.1 is correct and secure. Without this, the fallback to 0.0.0.0 was triggered because WSL has no docker0 interface. Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com> * fix: detect WSL via /proc instead of env var WSL_DISTRO_NAME isn't set under systemd. Use /proc/sys/fs/binfmt_misc/WSLInterop which is always present on WSL. 
Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com> --------- Co-authored-by: Claude Opus 4.6 <noreply@anthropic.com>	2026-03-09 00:27:13 +02:00
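Note on 13ce4aaf67: the bind-address decisions spread across the squashed commits (loopback on macOS and WSL, the docker0 bridge IP on bare-metal Linux, 0.0.0.0 only as a fallback) can be collected into one function. The sketch below takes the host facts as inputs instead of probing the system. `HostInfo` and `proxyBindHost` are hypothetical names, and handling of other platforms is an assumption.

```typescript
// Hypothetical description of the host; real code would fill this from
// process.platform, the WSLInterop file check, and os.networkInterfaces().
interface HostInfo {
  platform: string;
  isWsl: boolean;
  docker0Address?: string;
}

function proxyBindHost(info: HostInfo): string {
  // Docker Desktop routes through a VM, so loopback is reachable and not LAN-exposed.
  if (info.platform === "darwin" || info.isWsl) {
    return "127.0.0.1";
  }
  // Bare-metal Linux: containers reach the host via the bridge IP.
  if (info.docker0Address !== undefined) {
    return info.docker0Address;
  }
  return "0.0.0.0"; // last-resort fallback when no bridge interface is found
}
```

Keeping the probing separate from the decision makes the WSL case easy to test, which is where the original fallback to 0.0.0.0 was being triggered.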
Minwoo Kim	ec0e42b034	fix: correct misleading send_message tool description for scheduled tasks (#729) The send_message tool description incorrectly stated that a scheduled task's final output is not delivered to the user, instructing agents to use the MCP tool for any communication. In reality, task-scheduler.ts unconditionally forwards the agent's result to the user via a streaming output callback (deps.sendMessage), which is a direct call to the channel layer — entirely separate from the MCP tool path. This caused agents following the description to call send_message explicitly, resulting in duplicate messages: once via MCP and once via the native streaming callback. - Remove the incorrect note from the send_message tool description - Fix the misleading comment at task-scheduler.ts which attributed result delivery to the MCP tool rather than the streaming callback	2026-03-06 12:23:09 +02:00
Gavriel Cohen	68123fdd81	feat: add update_task tool and return task ID from schedule_task schedule_task was creating duplicate tasks when users asked to modify a schedule, because the agent had no way to update an existing task and didn't know the ID of the task it created. Now schedule_task generates and returns the task ID, and a new update_task tool allows modifying prompt, schedule_type, and schedule_value in place. Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>	2026-03-06 12:16:42 +02:00
gavrielc	5955cd6ee5	chore: update claude-agent-sdk to 0.2.68 Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>	2026-03-04 22:02:11 +02:00
Gabi Simons	0210aa9ef1	refactor: implement multi-channel architecture (#500 ) * refactor: implement channel architecture and dynamic setup - Introduced ChannelRegistry for dynamic channel loading - Decoupled WhatsApp from core index.ts and config.ts - Updated setup wizard to support ENABLED_CHANNELS selection - Refactored IPC and group registration to be channel-aware - Verified with 359 passing tests and clean typecheck * style: fix formatting in config.ts to pass CI * refactor(setup): full platform-agnostic transformation - Harmonized all instructional text and help prompts - Implemented conditional guards for WhatsApp-specific steps - Normalized CLI terminology across all 4 initial channels - Unified troubleshooting and verification logic - Verified 369 tests pass with clean typecheck * feat(skills): transform WhatsApp into a pluggable skill - Created .claude/skills/add-whatsapp with full 5-phase interactive setup - Fixed TS7006 'implicit any' error in IpcDeps - Added auto-creation of STORE_DIR to prevent crashes on fresh installs - Verified with 369 passing tests and clean typecheck * refactor(skills): move WhatsApp from core to pluggable skill - Move src/channels/whatsapp.ts to add-whatsapp skill add/ folder - Move src/channels/whatsapp.test.ts to skill add/ folder - Move src/whatsapp-auth.ts to skill add/ folder - Create modify/ for barrel file (src/channels/index.ts) - Create tests/ with skill package validation test - Update manifest with adds/modifies lists - Remove WhatsApp deps from core package.json (now skill-managed) - Remove WhatsApp-specific ghost language from types.ts - Update SKILL.md to reflect skill-apply workflow Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com> * refactor(skills): move setup/whatsapp-auth.ts into WhatsApp skill The WhatsApp auth setup step is channel-specific — move it from core to the add-whatsapp skill so core stays minimal. 
Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com> * refactor(skills): convert Telegram skill to pluggable channel pattern Replace the old direct-integration approach (modifying src/index.ts, src/config.ts, src/routing.test.ts) with self-registration via the channel registry, matching the WhatsApp skill pattern. Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com> * fix(skills): fix add-whatsapp build failure and improve auth flow - Add missing @types/qrcode-terminal to manifest npm_dependencies (build failed after skill apply without it) - Make QR-browser the recommended auth method (terminal QR too small, pairing codes expire too fast) - Remove "replace vs alongside" question — channels are additive - Add pairing code retry guidance and QR-browser fallback Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com> * fix: remove hardcoded WhatsApp default and stale Baileys comment - ENABLED_CHANNELS now defaults to empty (fresh installs must configure channels explicitly via /setup; existing installs already have .env) - Remove Baileys-specific comment from storeMessageDirect() in db.ts Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com> * refactor(skills): convert Discord, Slack, Gmail skills to pluggable channel pattern All channel skills now use the same self-registration pattern: - registerChannel() factory at module load time - Barrel file append (src/channels/index.ts) instead of orchestrator modifications - No more _ONLY flags (DISCORD_ONLY, SLACK_ONLY) — use ENABLED_CHANNELS instead - Removed ~2500 lines of old modify/ files (src/index.ts, src/config.ts, src/routing.test.ts) Gmail retains its container-runner.ts and agent-runner modifications (MCP mount + server config) since those are independent of channel wiring. Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com> refactor: use getRegisteredChannels instead of ENABLED_CHANNELS Remove the ENABLED_CHANNELS env var entirely. 
The orchestrator now iterates getRegisteredChannelNames() from the channel registry — channels self-register via barrel imports and their factories return null when credentials are missing, so unconfigured channels are skipped automatically. Deleted setup/channels.ts (and its tests) since its sole purpose was writing ENABLED_CHANNELS to .env. Refactored verify, groups, and environment setup steps to detect channels by credential presence instead of reading ENABLED_CHANNELS. Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com> * docs: add breaking change notice and whatsapp migration instructions CHANGELOG.md documents the pluggable channel architecture shift and provides migration steps for existing WhatsApp users. CLAUDE.md updated: Quick Context reflects multi-channel architecture, Key Files lists registry.ts instead of whatsapp.ts, and a new Troubleshooting section directs users to /add-whatsapp if WhatsApp stops connecting after upgrade. Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com> * docs: rewrite READMEs for pluggable multi-channel architecture Reflects the architectural shift from a hardcoded WhatsApp bot to a pluggable channel platform. Adds upgrading notice, Mermaid architecture diagram, CI/License/TypeScript/PRs badges, and clarifies that slash commands run inside the Claude Code CLI. Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com> * docs: move pluggable channel architecture details to SPEC.md Revert READMEs to original tone with only two targeted changes: - Add upgrading notice for WhatsApp breaking change - Mention pluggable channels in "What It Supports" Move Mermaid diagram, channel registry internals, factory pattern explanation, and self-registration walkthrough into docs/SPEC.md. Update stale WhatsApp-specific references in SPEC.md to be channel-agnostic. 
Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com> * docs: move upgrading notice to CHANGELOG, add changelog link Remove the "Upgrading from Pre-Pluggable Versions" section from README.md — breaking change details belong in the CHANGELOG. Add a Changelog section linking to CHANGELOG.md. Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com> * docs: expand CHANGELOG with full PR #500 changes Cover all changes: channel registry, WhatsApp moved to skill, removed core dependencies, all 5 skills simplified, orchestrator refactored, setup decoupled. Use Claude Code CLI instructions for migration. Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com> * chore: bump version to 1.2.0 for pluggable channel architecture Minor version bump — new functionality (pluggable channels) with a managed migration path for existing WhatsApp users. Update version references in CHANGELOG and update skill. Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com> * Fix skill application * fix: use slotted barrel file to prevent channel merge conflicts Pre-allocate a named comment slot for each channel in src/channels/index.ts, separated by blank lines. Each skill's modify file only touches its own slot, so three-way merges never conflict when applying multiple channels. Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com> * fix: resolve real chat ID during setup for token-based channels Instead of registering with `pending@telegram` (which never matches incoming messages), the setup skill now runs an inline bot that waits for the user to send /chatid, capturing the real chat ID before registration. Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com> * fix: setup delegates to channel skills, fix group sync and Discord metadata - Restructure setup SKILL.md to delegate channel setup to individual channel skills (/add-whatsapp, /add-telegram, etc.) 
instead of reimplementing auth/registration inline with broken placeholder JIDs - Move channel selection to step 5 where it's immediately acted on - Fix setup/groups.ts: write sync script to temp file instead of passing via node -e which broke on shell escaping of newlines - Fix Discord onChatMetadata missing channel and isGroup parameters - Add .tmp-* to .gitignore for temp sync script cleanup Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com> * fix: align add-whatsapp skill with main setup patterns Add headless detection for auth method selection, structured inline error handling, dedicated number DM flow, and reorder questions to match main's trigger-first flow. Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com> * fix: add missing auth script to package.json The add-whatsapp skill adds src/whatsapp-auth.ts but doesn't add the corresponding npm script. Setup and SKILL.md reference `npm run auth` for WhatsApp QR terminal authentication. Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com> * fix: update Discord skill tests to match onChatMetadata signature The onChatMetadata callback now takes 5 arguments (jid, timestamp, name, channel, isGroup) but the Discord skill tests only expected 3. This caused skill application to roll back on test failure. Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com> * docs: replace 'pluggable' jargon with clearer language User-facing text now says "multi-channel" or describes what it does. Developer-facing text uses "self-registering" or "channel registry". Also removes extra badge row from README. Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com> * docs: align Chinese README with English version Remove extra badges, replace pluggable jargon, remove upgrade section (now in CHANGELOG), add missing intro line and changelog section, fix setup FAQ answer. 
Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com> * fix: warn on installed-but-unconfigured channels instead of silent skip Channels with missing credentials now emit WARN logs naming the exact missing variable, so misconfigurations surface instead of being hidden. Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com> * docs: simplify changelog to one-liner with compare link Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com> * feat: add isMain flag and channel-prefixed group folders Replace MAIN_GROUP_FOLDER constant with explicit isMain boolean on RegisteredGroup. Group folders now use channel prefix convention (e.g., whatsapp_main, telegram_family-chat) to prevent cross-channel collisions. - Add isMain to RegisteredGroup type and SQLite schema (with migration) - Replace all folder-based main group checks with group.isMain - Add --is-main flag to setup/register.ts - Strip isMain from IPC payload (defense in depth) - Update MCP tool description for channel-prefixed naming - Update all channel SKILL.md files and documentation Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com> --------- Co-authored-by: Claude Opus 4.6 <noreply@anthropic.com> Co-authored-by: gavrielc <gabicohen22@yahoo.com> Co-authored-by: Koshkoshinski <daniel.milliner@gmail.com>	2026-03-03 00:35:45 +02:00
gavrielc	5958175ba1	fix: use 'Assistant' as fallback name instead of 'AssistantNameMissing' Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>	2026-02-23 00:04:16 +02:00
gavrielc	495b7df5fc	merge: resolve conflict with origin/main Keep ASSISTANT_NAME import, drop removed GROUPS_DIR import. Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>	2026-02-23 00:03:20 +02:00
gavrielc	77f7423172	fix: pass host timezone to container and reject UTC-suffixed timestamps (#371) Containers had no TZ set, so any time-aware code inside ran in UTC while the host interpreted bare timestamps as local time. Now TIMEZONE from config.ts is passed via -e TZ= to the container args. Also rejects Z-suffixed or offset-suffixed timestamps in the container's schedule_task validation, since bare timestamps are expected to be local time and silently accepting UTC suffixes would cause an offset mismatch. Co-authored-by: Claude Opus 4.6 <noreply@anthropic.com>	2026-02-22 23:23:34 +02:00
Dan Shapiro	107aff850c	fix: pass assistantName to container agent instead of hardcoding 'Andy' The container agent-runner had 'Andy' hardcoded as the sender name in archived conversation transcripts. This ignored the configurable ASSISTANT_NAME setting, so users who changed their assistant's name (via .env or config) would still see 'Andy' in transcripts. - Add assistantName field to ContainerInput interface (both host and container copies) - Pass ASSISTANT_NAME from config through to container in index.ts and task-scheduler.ts - Thread assistantName through createPreCompactHook and formatTranscriptMarkdown in the agent-runner - Use 'AssistantNameMissing' as fallback instead of 'Andy' so a missing name is visible rather than silently wrong Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>	2026-02-22 12:22:07 -08:00
gavrielc	1549ad503e	security: pass secrets via SDK env option and delete temp file (#213) Pass secrets to the SDK via the `env` query option instead of setting process.env, so Bash subprocesses never inherit API keys. Delete /tmp/input.json immediately after reading to remove secrets from disk. Co-authored-by: Claude Opus 4.6 <noreply@anthropic.com>	2026-02-13 22:46:42 +02:00
Cole	1a07869329	security: sanitize env vars from agent Bash subprocesses (#171) Use a PreToolUse SDK hook to prepend `unset ANTHROPIC_API_KEY CLAUDE_CODE_OAUTH_TOKEN` to every Bash command Kit runs, preventing secret leakage via env/printenv/echo/$PROC. Secrets are now passed via stdin JSON instead of mounted env files, closing all known exfiltration vectors. Co-authored-by: Claude Opus 4.6 <noreply@anthropic.com>	2026-02-13 22:33:39 +02:00
Gavriel Cohen	b5a6757211	fix: pass requiresTrigger through IPC and auto-discover additional directories - IPC register_group handler now passes requiresTrigger field to registerGroup(), fixing groups silently defaulting to trigger-required mode - Agent runner scans /workspace/extra/* and passes them as additionalDirectories to the SDK query, so CLAUDE.md files in mounted dirs are loaded automatically Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>	2026-02-13 12:18:16 +02:00
gavrielc	2b56fecfdc	Refactor index (#156 ) * feat: add Telegram channel with agent swarm support Add Telegram as a messaging channel that can run alongside WhatsApp or standalone (TELEGRAM_ONLY mode). Includes bot pool support for agent swarms where each subagent appears as a different bot identity in the group. - Add grammy dependency for Telegram Bot API - Route messages through tg: JID prefix convention - Add storeMessageDirect for non-Baileys channels - Add sender field to IPC send_message for swarm identity - Support TELEGRAM_BOT_TOKEN, TELEGRAM_ONLY, TELEGRAM_BOT_POOL config Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com> * docs: add index.ts refactor plan Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com> * refactor: extract channel abstraction, IPC, and router from index.ts Break the 1088-line monolith into focused modules: - src/channels/whatsapp.ts: WhatsAppChannel class implementing Channel interface - src/ipc.ts: IPC watcher and task processing with dependency injection - src/router.ts: message formatting, outbound routing, channel lookup - src/types.ts: Channel interface, OnInboundMessage, OnChatMetadata types Also adds regression test suite (98 tests), updates all documentation and skill files to reflect the new architecture. Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com> * ci: add test workflow for PRs Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com> * chore: remove accidentally committed pool-bot assets Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com> * fix(ci): remove grammy from base dependencies Grammy is installed by the /add-telegram skill, not a base dependency. Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>	2026-02-11 00:36:37 +02:00
gavrielc	6f02ee530b	Adds Agent Swarms * feat: streaming container mode, IPC messaging, agent teams support Major architectural shift from single-shot container runs to long-lived streaming containers with IPC-based message injection. - Agent runner: query loop with AsyncIterable prompt to keep stdin open for agent teams (fixes isSingleUserTurn premature shutdown) - New standalone stdio MCP server (ipc-mcp-stdio.ts) inheritable by subagents, with send_message and schedule_task tools - Streaming output: parse OUTPUT_START/END markers in real-time, send results to WhatsApp as they arrive - IPC file-based messaging: host writes to ipc/{group}/input/, agent polls for follow-up messages without respawning containers - Per-group settings.json with CLAUDE_CODE_EXPERIMENTAL_AGENT_TEAMS=1 - SDK bumped to 0.2.34 for TeamCreate tool support - Container idle timeout (30min) with _close sentinel for shutdown - Orphaned container cleanup on startup - alwaysRespond flag for groups that skip trigger pattern check - Uncaught exception/rejection handlers with timestamps in logger - Combined SDK documentation into single deep dive reference Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com> * chore: remove unused ipc-mcp.ts (replaced by ipc-mcp-stdio.ts) Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com> * fix: clarify agent communication model in docs and tool descriptions - CLAUDE.md (main + global): split communication instructions into "responding to messages" vs "scheduled tasks" sections - send_message tool: note that scheduled task output is not sent to user - Remove structured output (outputFormat) — not needed with current flow - Regular output is sent to WhatsApp; scheduled task output is only logged Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com> * chore: ignore dynamic group data while preserving base structure Only track groups/main/CLAUDE.md and groups/global/CLAUDE.md. 
All other group directories and files are ignored to prevent tracking user-specific session data. Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com> * fix: resolve critical bugs in streaming container mode Bug 1 (scheduled task hang): Task scheduler now passes onOutput callback with idle timer that writes _close sentinel after IDLE_TIMEOUT, so containers exit cleanly instead of blocking queue slots for 30 minutes. Scheduled tasks stay alive for interactive follow-up via IPC. Bug 2 (timeout disabled): Remove resetTimeout() from stderr handler. SDK writes debug logs continuously, resetting the timer on every line. Timeout now only resets on actual output markers in stdout. Bug 3 (trigger bypass): Piped messages in startMessageLoop now check trigger pattern for non-main groups. Non-trigger messages accumulate in DB and are pulled as context via getMessagesSince when a trigger arrives. Bug 7 (non-atomic IPC writes): GroupQueue.sendMessage uses temp file + rename for atomic writes, matching ipc-mcp-stdio.ts pattern. Also: flip isVerbose back to false (debug leftover), add isScheduledTask to host-side ContainerInput interface. Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com> * fix: idle timer not starting + scheduled task groupFolder missing Two bugs that prevented the scheduled task idle timeout fix from working: 1. onOutput was only called when parsed.result !== null, but session update markers have result: null. The idle timer never started for "silent" query completions, leaving containers parked at waitForIpcMessage until hard timeout. 2. Scheduler's onProcess callback didn't pass groupFolder to queue.registerProcess, so closeStdin no-oped (groupFolder was null). The _close sentinel was never written even when the idle timer fired. Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com> * fix: duplicate messages and timestamp rollback in piping path Two bugs introduced by the trigger context accumulation change: 1. 
processGroupMessages didn't advance lastAgentTimestamp until after the container finished. The piping path's getMessagesSince(lastAgentTimestamp) re-fetched messages already sent as the initial prompt, causing duplicates. 2. processGroupMessages overwrote lastAgentTimestamp with the original batch timestamp on completion, rolling back any advancement made by the piping path while the container was running. Fix: advance lastAgentTimestamp immediately after building the prompt, before starting the container. This matches the piping path behavior and eliminates both the overlap and the rollback. Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com> * fix: container idles 30 extra minutes after _close during query When _close was detected during pollIpcDuringQuery, it was consumed (deleted) and stream.end() was called. But after runQuery returned, main() still emitted a session-update marker (resetting the host's idle timer) and called waitForIpcMessage (which polled forever since _close was already gone). The container had to wait for a second _close. Fix: runQuery now returns closedDuringQuery. When true, main() skips the session-update marker and waitForIpcMessage, exiting immediately. Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com> * fix: resume branching, internal tags, and output forwarding - Fix resume branching: pass resumeSessionAt with last assistant UUID to anchor each query loop resume to the correct conversation tree position. Prevents agent responses landing on invisible branches when agent teams subagents create parallel JSONL entries. - Add <internal> tag stripping: agent can wrap internal reasoning in <internal> tags which are logged but not sent to WhatsApp. Prevents duplicate messages and internal monologue reaching users. - Forward scheduled task output: scheduled tasks now send result text to WhatsApp (with <internal> stripping), matching regular message behavior. No more special-case instructions. 
- Update Communication guidance in CLAUDE.md: simplified to "your output is sent to the user or group" with soft guidance on <internal> tags and send_message usage. - Add messaging behavior docs to schedule_task tool: prompts the scheduling agent to include guidance on whether the task should always/conditionally/never message the user. - Mount security: containerPath now optional, defaults to basename of hostPath. Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com> * fix: cursor rollback on error, flush guard, verbose logging - Roll back lastAgentTimestamp on container error so retries can re-process the messages instead of silently losing them. - Add guard flag to flushOutgoingQueue to prevent duplicate sends from concurrent flushes during rapid WA reconnects. - Revert isVerbose from hardcoded false back to env-based check (LOG_LEVEL=debug|trace). Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com> * fix: orphan container cleanup was silently failing The startup cleanup used `container ls --format {{.Names}}` which is Docker Go-template syntax. Apple Container only supports `--format json` or `--format table`. The command errored with exit code 64, but the catch block silently swallowed it — orphan containers were never cleaned up on restart. Fixed to use `--format json` and parse `configuration.id` from the JSON output. Also filters by `status: running` and logs a warning on failure instead of silently catching. Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com> * docs: add Discord badge and community section Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com> * fix: idle timer reset on null results and flush queue message loss - Only reset idle timer on actual results (non-null), not session-update markers. Prevents containers staying alive 30 extra minutes after the agent finishes work. - flushOutgoingQueue now uses shift() instead of splice(0) so unattempted messages stay in the queue if an unexpected error bails the loop. 
Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com> * docs: add Agent Swarms to README Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com> * fix: update Telegram skill for current architecture Rewrite integration instructions to match the per-group queue/SQLite architecture: remove onMessage callback pattern (store to DB, let message loop pick up), fix startSchedulerLoop signature, add TELEGRAM_ONLY service startup, SQLite registration, data/env/env sync, @mention-to-trigger translation, and BotFather group privacy docs. Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com> * fix: Telegram skill message chunking, media placeholders, chat discovery - Split long messages at Telegram's 4096 char limit to prevent silent send failures - Store placeholder text for non-text messages (photos, voice, stickers, etc.) so the agent knows media was sent - Update getAvailableGroups filter to include tg: chats so the agent can discover and register Telegram chats via IPC - Fix removal step numbering Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com> * docs: update REQUIREMENTS.md and SPEC.md for SQLite architecture - Replace all registered_groups.json / sessions.json / router_state.json references with SQLite equivalents - Fix CONTAINER_TIMEOUT default (300000 → 1800000) - Add missing config exports (IDLE_TIMEOUT, MAX_CONCURRENT_CONTAINERS) - Update folder structure: add missing src files (logger, group-queue, mount-security), remove non-existent utils.ts, list all skills - Fix agent-runner entry (ipc-mcp.ts → ipc-mcp-stdio.ts) - Update startup sequence to reflect per-group queue architecture - Fix env mounting description (data/env/env, not extracted vars) - Update troubleshooting to use sqlite3 commands Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com> * docs: fix README architecture description, revert SPEC.md env error - README: update architecture blurb to mention per-group queue, add group-queue.ts to key files, update file descriptions - SPEC.md: 
restore correct credential filtering description (only auth vars are extracted from .env, not the full file) Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com> --------- Co-authored-by: Claude Opus 4.6 <noreply@anthropic.com>	2026-02-09 02:50:43 +02:00
gavrielc	f26468c9b0	fix: setup skill reliability, requiresTrigger option, agent-browser visibility Setup skill fixes: - Run QR auth in foreground with long timeout, not background - Replace fragile message-based registration with DB group sync lookup - Personal chats: ask for phone number instead of querying empty DB - Consolidate trigger word + security model + channel selection into one step - Remove `timeout` shell command (unavailable on macOS), use Bash tool timeout - Query 40 groups, display 10 at a time, support name lookup requiresTrigger support: - Add requiresTrigger field to RegisteredGroup type and DB schema - Skip trigger check when requiresTrigger is false (for solo/personal chats) - Main group still always processes all messages (unchanged) Agent-browser visibility: - Append global CLAUDE.md to non-main agent system prompts via SDK - Add browser tool docs to global and main CLAUDE.md - Update skill description to be broader (not just "web testing") - Reference agent-browser.md in root CLAUDE.md key files Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>	2026-02-07 01:39:31 +02:00
gavrielc	8dd27bc58d	fix: defend against missing structured output and message without content - Fall back to text result when success subtype has no structured_output - Treat outputType 'message' without userMessage as 'log' with warning Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>	2026-02-06 20:29:04 +02:00
gavrielc	44f0b3d99c	fix: improve agent output schema, tool descriptions, and shutdown robustness - Rename status→outputType, responded/silent→message/log for clarity - Remove scheduled task special-casing: userMessage now sent for all contexts - Update schema, tool, and CLAUDE.md descriptions to be clear and non-contradictory about communication mechanisms - Use full tool name mcp__nanoclaw__send_message in docs - Change schedule_task target_group to accept JID instead of folder name - Only show target_group_jid parameter to main group agents - Add defense-in-depth sanitization and error callback to exec() in shutdown - Use "user or group" consistently (supports both 1:1 and group chats) Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>	2026-02-06 20:22:45 +02:00
gavrielc	ae177156ec	feat: per-group queue, SQLite state, graceful shutdown (#111) * fix: wire up queue processMessagesFn before recovery to prevent silent message loss recoverPendingMessages() was called after startMessageLoop(), which meant: 1. Recovery could race with the message loop's first iteration 2. processMessagesFn was set inside startMessageLoop, so recovery enqueues would fire runForGroup with processMessagesFn still null, silently skipping message processing Move setProcessMessagesFn and recoverPendingMessages before startMessageLoop so the queue is fully wired before any messages are enqueued. https://claude.ai/code/session_01PCY8zNjDa2N29jvBAV5vfL * feat: structured agent output to fix infinite retry on silent responses (#113) Use Agent SDK's outputFormat with json_schema to get typed responses from the agent. The agent now returns { status: 'responded' | 'silent', userMessage?, internalLog? } instead of a plain string. This fixes a critical bug where a null/empty agent response caused infinite 5-second retry loops by conflating "nothing to say" with "error". - Agent runner: add AGENT_RESPONSE_SCHEMA and parse structured_output - Host: advance lastAgentTimestamp on both responded AND silent status - GroupQueue: add exponential backoff (5s-80s) with max 5 retries for actual errors, replacing unbounded fixed-interval retries https://claude.ai/code/session_014SLc8MxP9BYhEhDCLox9U8 Co-authored-by: Claude <noreply@anthropic.com> --------- Co-authored-by: Claude <noreply@anthropic.com>	2026-02-06 18:54:26 +02:00
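
Illustrative note on the last commit above (ae177156ec): the message describes exponential backoff from 5s to 80s with at most 5 retries, and treats a 'silent' agent result as success rather than an error. The sketch below shows one way that schedule could be computed. It is not the repository's GroupQueue code; `retryDelayMs`, `runWithBackoff`, `BASE_DELAY_MS`, and `MAX_RETRIES` are hypothetical names, and the doubling factor is an assumption inferred from the 5s-80s range over 5 retries.

```typescript
// Hypothetical sketch; the real GroupQueue implementation is not shown in this log.
const BASE_DELAY_MS = 5000; // first retry waits 5s
const MAX_RETRIES = 5; // assumed schedule: 5s, 10s, 20s, 40s, 80s

/** Delay before retry number `attempt` (1-based), or null once retries are exhausted. */
function retryDelayMs(attempt: number): number | null {
  if (!Number.isInteger(attempt) || attempt < 1) {
    throw new RangeError("attempt must be a positive integer");
  }
  if (attempt > MAX_RETRIES) return null;
  return BASE_DELAY_MS * 2 ** (attempt - 1);
}

// Shape taken from the commit message; internalLog omitted for brevity.
type AgentResult = { status: "responded" | "silent"; userMessage?: string };

/**
 * Run `task` until it resolves. A 'silent' result resolves normally and is
 * never retried; only thrown errors trigger the backoff schedule.
 */
async function runWithBackoff(
  task: () => Promise<AgentResult>,
  sleep: (ms: number) => Promise<void> = (ms) =>
    new Promise<void>((resolve) => setTimeout(resolve, ms)),
): Promise<AgentResult> {
  for (let attempt = 1; ; attempt++) {
    try {
      return await task();
    } catch (err) {
      const delay = retryDelayMs(attempt);
      if (delay === null) throw err; // give up after MAX_RETRIES retries
      await sleep(delay);
    }
  }
}
```

Passing `sleep` as a parameter keeps the schedule testable without real timers; a caller would normally rely on the default.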

61 Commits