nanoclaw

Author	SHA1	Message	Date
gavrielc	684a98d078	test: add host-side routing and session resolution tests Host-side (vitest): - Routed message preserves platformId/channelType/threadId on messages_in - Fan-out gives each agent correct per-agent routing - writeSessionRouting populates session_routing from messaging group - writeSessionRouting writes null routing for agent-shared sessions - Per-thread session includes thread_id in session_routing - Agent-shared resolves to same session on repeated calls - Agent-shared session has null messaging_group_id - findSessionByAgentGroup returns channel-bound session (documents #2332) - Skip: agent-shared/channel-bound coexistence (blocked on #2332 fix) Container-side (bun:test): - Internal tags stripped between message blocks - Mixed task + chat batch with correct routing The agent-shared tests uncovered the exact bug from #2332: findSessionByAgentGroup doesn't distinguish agent-shared from channel-bound sessions, so A2A resolution reuses a channel session when one exists. Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>	2026-05-08 00:26:41 +03:00
gavrielc	fc3c11b6b9	fix(session-manager): apply outbox path-confinement to inbound attachments Mirrors the four defenses on the outbound side onto extractAttachmentFiles: 1. Reject unsafe messageId via isSafeAttachmentName before any inbox path is built. WhatsApp passes msg.key.id through raw and that field is client generated, so a peer can craft it; future end to end encrypted adapters will have the same property. 2. lstatSync on the inbox dir refuses a pre placed symlink before mkdirSync would silently follow it. 3. realpathSync + isPathInside contains the resolved dir under the session inbox root. 4. writeFileSync uses the wx flag so a pre placed symlink at the file path is refused atomically by the kernel; EEXIST surfaces as a logged skip. Threat: the session dir is mounted writable into the container at /workspace, so a compromised agent can pre place inbox/<future msgId>/ as a symlink and wait for a chat message with a matching id to redirect the host write. The four guards together close that window. Consolidates with the existing isSafeAttachmentName helper from attachment-safety.ts rather than introducing a duplicate basename validator inside session-manager. Co-Authored-By: Daisuke Tsuji <dim0627@gmail.com> Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>	2026-05-01 01:27:09 +03:00
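The four guards listed in the fc3c11b6b9 message above can be sketched roughly as follows. This is a minimal illustration under stated assumptions, not the repository's code: `writeInboundAttachment`, `isSafeName`, and this `isPathInside` are hypothetical stand-ins (the real helpers are `extractAttachmentFiles` and `isSafeAttachmentName`, whose signatures are not shown in the log), and a POSIX filesystem is assumed.

```typescript
import * as fs from "node:fs";
import * as os from "node:os";
import * as path from "node:path";

// Hypothetical stand-in for isSafeAttachmentName: accept one path component only.
function isSafeName(name: string): boolean {
  return (
    name.length > 0 &&
    name !== "." &&
    name !== ".." &&
    !name.includes("/") &&
    !name.includes("\\") &&
    !name.includes("\0")
  );
}

// True when `child` is `parent` itself or lies beneath it.
function isPathInside(parent: string, child: string): boolean {
  const rel = path.relative(parent, child);
  if (rel === "") return true;
  return rel !== ".." && !rel.startsWith(".." + path.sep) && !path.isAbsolute(rel);
}

type WriteResult = "written" | "skipped";

function writeInboundAttachment(
  inboxRoot: string,
  messageId: string,
  filename: string,
  data: Uint8Array,
): WriteResult {
  // Guard 1: reject unsafe names before any inbox path is built.
  if (!isSafeName(messageId) || !isSafeName(filename)) return "skipped";

  const dir = path.join(inboxRoot, messageId);

  // Guard 2: lstat does not follow symlinks, so a pre-placed symlink
  // (or any non-directory) is refused before mkdirSync could follow it.
  const existing = fs.lstatSync(dir, { throwIfNoEntry: false });
  if (existing && !existing.isDirectory()) return "skipped";
  fs.mkdirSync(dir, { recursive: true });

  // Guard 3: the resolved directory must stay under the resolved inbox root.
  const realRoot = fs.realpathSync(inboxRoot);
  const realDir = fs.realpathSync(dir);
  if (!isPathInside(realRoot, realDir)) return "skipped";

  // Guard 4: "wx" opens with O_CREAT | O_EXCL, so an existing file or
  // symlink at the target path fails with EEXIST instead of being followed.
  try {
    fs.writeFileSync(path.join(realDir, filename), data, { flag: "wx" });
    return "written";
  } catch (err) {
    if ((err as { code?: string }).code === "EEXIST") return "skipped";
    throw err;
  }
}

// Usage against a throwaway temp directory.
const demoRoot = fs.mkdtempSync(path.join(os.tmpdir(), "inbox-demo-"));
console.log(writeInboundAttachment(demoRoot, "msg-1", "a.txt", new Uint8Array([1]))); // "written"
console.log(writeInboundAttachment(demoRoot, "msg-1", "a.txt", new Uint8Array([2]))); // "skipped"
console.log(writeInboundAttachment(demoRoot, "../escape", "a.txt", new Uint8Array([3]))); // "skipped"
```

Returning a result instead of throwing mirrors the "logged skip" behaviour the message describes for EEXIST; whether the other guards skip or throw in the real code is not stated in the log.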
hinotoi-agent	852009dcb1	fix(container): confine outbound attachment paths	2026-05-01 01:27:09 +03:00
gavrielc	7e37b13aab	Fix path traversal in attachment handling on channel-inbound path	2026-04-28 13:26:44 +03:00
gavrielc	6c26c0413a	feat(router,cli): replyTo override + CLI admin-transport flows - InboundEvent gains an optional replyTo; router stamps the row's address fields from it when set, so replies can route to a different channel than the one the inbound came in on. - ChannelSetup adds onInboundEvent for admin-transport adapters that build the full event themselves. - CLI wire format accepts {text, to, reply_to}. Routed messages go through onInboundEvent and do not evict an active chat client. - init-first-agent hands the DM welcome to the running service via data/cli.sock — synchronous wake, no sweep wait. Fails loudly if the service is down; no silent fallback. - Split the CLI scratch-agent bootstrap into scripts/init-cli-agent.ts; init-first-agent is DM-only. Agents cannot set replyTo: it lives only on the inbound/router seam and is consumed once when writing messages_in. Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>	2026-04-20 23:30:47 +03:00
gavrielc	a4061a0012	refactor(channels,router): move all policy to router; bridge is transport Follow-up to `b159722`. That shrank the bridge's shouldEngage to a flood gate + coarse sticky-subscribe signal. This completes the move — policy lives exclusively in the router, the bridge is transport-only, and the conversations map + ChannelSetup.conversations + ChannelAdapter.updateConversations are all gone. Key shifts: 1. Subscribe moves from bridge to router. Bridge used to call `thread.subscribe()` from its onNewMention / onDirectMessage handlers based on a coarse "any mention-sticky wiring exists on this channel" check. That forced the decision before the router could apply per-wiring engage logic, and it relied on the conversations map being current (staleness risk). ChannelAdapter gains `subscribe?(platformId, threadId)`. The Chat SDK bridge implements it via SqliteStateAdapter.subscribe(threadId) (idempotent — a repeat call on an already-subscribed thread is a no-op). The router's fan-out loop calls it once per message when the first mention-sticky wiring actually engages. Precise, not coarse. 2. Short-circuit the drop path with one combined query. New `getMessagingGroupWithAgentCount(channelType, platformId)` does the messaging_groups lookup AND counts wirings in a single SELECT, using the existing UNIQUE(channel_type, platform_id) index on messaging_groups and UNIQUE(messaging_group_id, agent_group_id) on messaging_group_agents for the JOIN. No new indexes needed. routeInbound now short-circuits: - No messaging_groups row AND not addressed (no mention/DM) → return silently. One DB read, nothing written. This is the Discord-bot-in-a-big-guild case; we no longer auto-create rows for every plain message in every channel the bot can see. - Messaging group exists but no wirings AND not addressed → return silently. One DB read. - Otherwise fall through to sender resolution + fan-out as before. 
Behavioral change: plain chatter on unwired channels no longer gets dropped_messages audit rows, which used to bloat the table. Audit still fires on addressed-to-bot drops where the admin cares ("someone @-mentioned us but nobody's wired"). 3. Bridge is now purely transport. Deleted entirely: ConversationConfig, ChannelSetup.conversations, ChannelAdapter.updateConversations?, bridge's `conversations` map, buildConversationMap, shouldEngage, EngageSource, engageDecision, bridge.updateConversations method, src/index.ts buildConversationConfigs. Four handlers reduce to "resolve channel id, build InboundMessage with isMention, call onInbound". Net ~130 LOC deleted from the bridge. Collateral: the conversations-map staleness problem is gone. The upcoming channel-registration feature doesn't need any map-refresh plumbing — when an approval creates a new wiring, the next message hits the DB fresh and just works. Bridge tests prune to the narrow platform-adjacent surface (openDM delegation, subscribe presence). Host-core test that asserted the old "auto-create on every unknown message" behavior updates to reflect the new escalation-gated semantics: plain messages on unknown channels don't auto-create, mentions do. 159 tests pass (was 172 — net -13, almost entirely from bridge-engage-mode tests that covered logic now owned by the router and exercised through host-core.test.ts). Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>	2026-04-20 13:55:49 +03:00
gavrielc	16b9499532	feat(routing): engage modes + sender scope + accumulate/drop + per-agent fan-out Replaces the opaque trigger_rules JSON + response_scope enum on messaging_group_agents with four explicit orthogonal columns: engage_mode 'pattern' | 'mention' | 'mention-sticky' engage_pattern regex source; required when mode='pattern'; '.' is the "always" sentinel sender_scope 'all' | 'known' ignored_message_policy 'drop' | 'accumulate' Inbound routing becomes a fan-out — every wired agent is evaluated independently. A match gets its own session + container wake. A miss with accumulate keeps the message as context-only (trigger=0) in that agent's session, so when the agent does eventually engage it sees the prior chatter. ## Schema - Migration 010 (`engage-modes`): adds the 4 new columns, backfills from trigger_rules.pattern + requiresTrigger + response_scope, drops the legacy columns. - messages_in gains `trigger INTEGER NOT NULL DEFAULT 1` (session DB schema + `migrateMessagesInTable` forward-compat). - countDueMessages gates waking on `trigger = 1`. ## Routing - `pickAgent` (returns one) → loop over all wired agents. Per agent: evaluate engage_mode; run access gate + sender-scope gate; on full match → resolveSession + writeSessionMessage(trigger=1) + wake. On miss with accumulate → writeSessionMessage(trigger=0), no wake. On miss with drop → skip. - New `findSessionForAgent(agentGroupId, mgId, threadId)` scopes session lookup by agent so fan-out doesn't cross sessions. - `messageIdForAgent` namespaces inbound message ids by agent_group_id so PRIMARY KEY doesn't collide across per-agent session DBs. ## Adapter layer - `ConversationConfig` replaces `triggerPattern` + `requiresTrigger` with `engageMode` + `engagePattern`. 
- Chat SDK bridge stores `Map<platformId, ConversationConfig[]>` (multi- agent per conversation) and applies union gating pre-onInbound: * onSubscribedMessage: engage if any wiring keeps firing in subscribed state (mention-sticky or pattern) * onNewMention: engage on mention; only subscribes the thread if at least one wiring is `mention-sticky` * onDirectMessage: engage per mode; sticky follows same rule - Bridge no longer unconditionally calls `thread.subscribe()`. ## Sender scope - Permissions module registers a second hook `setSenderScopeGate` that runs per-wiring after the existing access gate. `sender_scope='known'` requires canAccessAgentGroup(); `'all'` is a no-op. Not installed → no-op everywhere (default allow). ## Container side - Host passes `NANOCLAW_MAX_MESSAGES_PER_PROMPT` (reuses existing MAX_MESSAGES_PER_PROMPT config; was dead code from v1). - `getPendingMessages` queries `ORDER BY seq DESC LIMIT N`, reverses to chronological order for the prompt — accumulated context rides along with trigger rows up to the cap. - `MessageInRow` gains `trigger: number` so the container can tell them apart in downstream code (container still processes both; only the host uses `trigger=0` for don't-wake). ## Defaults (per ACTION-ITEMS item 1 decision) - DM (is_group=0): `engage_mode='pattern'`, `engage_pattern='.'` (always) - Threaded group: `engage_mode='mention-sticky'` (seed-discord) - Non-threaded group / CLI: pattern '.' in bootstrap scripts ## Tests - src/host-core.test.ts: 3 new cases — fan-out (2 agents, 2 sessions, 2 wakes), accumulate (trigger=0 + no wake), drop (no session created). - Existing 10 host-core tests still pass. - Migration 010 runs on an empty DB in 0-row path — verified. Closes: ACTION-ITEMS items 1, 4. Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>	2026-04-20 01:30:04 +03:00
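The per-agent fan-out described in the 16b9499532 message can be illustrated with a small pure function. This is a sketch under assumptions, not the router's code: the `Wiring`, `Inbound`, and `Outcome` shapes and the `engages` / `fanOut` names are hypothetical, the access and sender-scope gates are omitted, `'.'` is short-circuited as "always", and `mention-sticky` is assumed to keep firing once the thread is subscribed.

```typescript
type EngageMode = "pattern" | "mention" | "mention-sticky";
type IgnoredMessagePolicy = "drop" | "accumulate";

// Hypothetical in-memory view of one messaging_group_agents row.
interface Wiring {
  agentGroupId: string;
  engageMode: EngageMode;
  engagePattern: string | null; // regex source; required when mode is "pattern"
  ignoredMessagePolicy: IgnoredMessagePolicy;
}

interface Inbound {
  text: string;
  isMention: boolean;
  threadSubscribed: boolean; // assumed signal for the sticky state
}

// One entry per agent that receives a messages_in row.
interface Outcome {
  agentGroupId: string;
  trigger: 0 | 1; // 1 = wake-worthy, 0 = context only
  wake: boolean;
}

function engages(w: Wiring, msg: Inbound): boolean {
  switch (w.engageMode) {
    case "pattern":
      if (w.engagePattern === null) return false;
      // "." is treated as the "always" sentinel.
      return w.engagePattern === "." || new RegExp(w.engagePattern).test(msg.text);
    case "mention":
      return msg.isMention;
    case "mention-sticky":
      return msg.isMention || msg.threadSubscribed;
  }
}

// Evaluate every wired agent independently; dropped misses produce no entry.
function fanOut(wirings: Wiring[], msg: Inbound): Outcome[] {
  const outcomes: Outcome[] = [];
  for (const w of wirings) {
    if (engages(w, msg)) {
      outcomes.push({ agentGroupId: w.agentGroupId, trigger: 1, wake: true });
    } else if (w.ignoredMessagePolicy === "accumulate") {
      outcomes.push({ agentGroupId: w.agentGroupId, trigger: 0, wake: false });
    }
  }
  return outcomes;
}
```

For a plain "hello" with no mention, an always-on pattern agent would get `trigger: 1`, a mention agent with `accumulate` would get `trigger: 0` without a wake, and a mention agent with `drop` would get nothing.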
gavrielc	6a815190c0	feat(lifecycle): stuck detection + heartbeat lifecycle + SDK tool blocklist Replaces the two overlapping old mechanisms (30-min setTimeout kill in container-runner, 10-min heartbeat STALE_THRESHOLD reset in host-sweep) with message-scoped stuck detection anchored to the processing_ack claim age + an absolute 30-min ceiling that extends for long-declared Bash tools. Old model problems: - IDLE_TIMEOUT setTimeout fired on plain wall-clock time; slow-but-alive agents got killed at 30min regardless of activity - 10-min STALE_THRESHOLD in the sweep was unreliable — the heartbeat is only touched on SDK events, so legitimate silent tool work (sleep 30, long WebFetch, npm install) looked identical to a hung container - Two overlapping sources of truth for "when to let go of a container" New model: - Host sweep is the single source of truth. - Container exposes a new `container_state` single-row table in outbound.db (schema added; container writes, host reads). PreToolUse hook writes current_tool + tool_declared_timeout_ms (read from Bash's tool_input); PostToolUse / PostToolUseFailure clear it. - Sweep decides with a pure helper `decideStuckAction`: * absolute ceiling — kill if heartbeat age > max(30min, bash_timeout) * per-claim stuck — kill if any processing_ack row has claim_age > max(60s, bash_timeout) AND heartbeat hasn't been touched since claim * otherwise ok Kill paths reset leftover processing rows with exponential backoff, reusing the existing retry machinery. Tool blocklist expanded: - AskUserQuestion (SDK placeholder; we have mcp__nanoclaw__ask_user_question) - EnterPlanMode, ExitPlanMode, EnterWorktree, ExitWorktree (Claude Code UI affordances; would hang in headless containers) PreToolUse hook is also defense-in-depth: if a disallowed tool name slips through, it returns `{ decision: 'block' }` so the agent sees a clear error instead of appearing stuck. 
Removed: - container-runner.ts: IDLE_TIMEOUT setTimeout, resetIdle callback on activeContainers entry, resetContainerIdleTimer export. - delivery.ts: the resetContainerIdleTimer call on successful delivery. - poll-loop.ts: IDLE_END_MS + its setInterval. Keeping the query open is cheaper than close+reopen (no cold prompt cache). Liveness is now a host-side concern. - host-sweep.ts: 10-min STALE_THRESHOLD_MS + getStuckProcessingIds in the stale-detection path (still exported for kill reset). Tests: - src/host-sweep.test.ts — 9 tests for decideStuckAction covering: fresh heartbeat, absolute ceiling, absent heartbeat, Bash-timeout extension (both ceiling and per-claim), claim age below tolerance, heartbeat touched after claim, unparseable timestamps. Ref: docs/v1-vs-v2/ACTION-ITEMS.md items 9, 6a, 10. Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>	2026-04-20 01:16:57 +03:00
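The two kill rules stated for `decideStuckAction` in the 6a815190c0 message can be written as a pure function along these lines. This is a hedged sketch, not the repository's helper: the input shape, the `decideStuckActionSketch` name, and the use of millisecond numbers instead of parsed timestamps are assumptions, as is treating an absent heartbeat as "never touched" for the per-claim rule while skipping the ceiling check.

```typescript
type StuckDecision =
  | { action: "ok" }
  | { action: "kill"; reason: "absolute-ceiling" | "claim-stuck" };

// Hypothetical input shape; all times are epoch milliseconds.
interface StuckInput {
  nowMs: number;
  heartbeatMs: number | null; // null = no heartbeat recorded (assumption)
  claimTimesMs: number[]; // claim time of each processing_ack row
  declaredToolTimeoutMs: number | null; // e.g. a declared Bash timeout
}

const ABSOLUTE_CEILING_MS = 30 * 60 * 1000; // 30 min
const CLAIM_TOLERANCE_MS = 60 * 1000; // 60 s

function decideStuckActionSketch(input: StuckInput): StuckDecision {
  const toolMs = input.declaredToolTimeoutMs ?? 0;
  const ceilingMs = Math.max(ABSOLUTE_CEILING_MS, toolMs);
  const toleranceMs = Math.max(CLAIM_TOLERANCE_MS, toolMs);

  // Rule 1: absolute ceiling on heartbeat age, extended by the declared timeout.
  if (input.heartbeatMs !== null && input.nowMs - input.heartbeatMs > ceilingMs) {
    return { action: "kill", reason: "absolute-ceiling" };
  }

  // Rule 2: a claim older than the tolerance with no heartbeat touch since the claim.
  for (const claimMs of input.claimTimesMs) {
    const claimAgeMs = input.nowMs - claimMs;
    const touchedSinceClaim = input.heartbeatMs !== null && input.heartbeatMs >= claimMs;
    if (claimAgeMs > toleranceMs && !touchedSinceClaim) {
      return { action: "kill", reason: "claim-stuck" };
    }
  }

  return { action: "ok" };
}
```

Keeping the decision free of I/O is what makes the table of cases in the commit's test list (fresh heartbeat, ceiling, timeout extension, touched after claim) cheap to cover.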
gavrielc	7169c25e70	refactor: relocate outbox I/O to session-manager + dead-code sweep ## Outbox extraction (delivery.ts → session-manager.ts) File I/O for outbound attachments now lives in session-manager.ts alongside the symmetric inbound extractAttachmentFiles. delivery.ts no longer touches the filesystem — it hands buffers to the adapter and calls clearOutbox on success. - New `readOutboxFiles(agentGroupId, sessionId, messageId, filenames)` and `clearOutbox(agentGroupId, sessionId, messageId)` in session-manager.ts. - deliverMessage in delivery.ts loses ~35 lines of fs/path code and its `fs`/`path` imports. ## Dead-code sweep TypeScript's --noUnusedLocals surfaced several cruft imports. Fixed: - src/container-runner.ts: drop unused `markContainerIdle` import; drop unused `session` parameter from `buildContainerArgs` signature. - src/delivery.ts: drop unused `getSession`, `writeSessionMessage`, `wakeContainer` imports. - src/host-sweep.ts: drop unused `updateSession`, `outboundDbPath` imports. - container/agent-runner/src/poll-loop.ts: drop unused `config`, `processingIds` params from `processQuery`. - Test files: drop unused imports in channel-registry.test, db-v2.test, host-core.test. Skipped: `conversations` state in chat-sdk-bridge.ts (never read but tangled with public `updateConversations` method; cleaning it risks a merge conflict with the channels branch at the next sync). ## Validation - `pnpm run build` clean - `pnpm test` — 137 host tests pass - `bun test` in container/agent-runner — 17 tests pass - Service boots (`NanoClaw running`, `OneCLI approval handler started`) and shuts down cleanly on SIGTERM Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>	2026-04-18 21:34:08 +03:00
gavrielc	75c2fde2b5	feat(v2): builder-agent self-modification WIP + container-config as per-group file Checkpoints the builder-agent dev-agent/worktree/swap flow (create_dev_agent, request_swap, classifier, deadman, promote) before pivoting to a unified draft-activate approach with OS-level RO enforcement. Lifts container_config out of the agent_groups row into groups/<folder>/container.json so install_packages, add_mcp_server, and rebuild flows can eventually route through the same draft path as source edits. Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>	2026-04-15 21:15:13 +03:00
gavrielc	0d3326aae5	feat(v2): user-level privilege model + cold DM infra + init-first-agent skill Replaces the agent-group-centric "main group" concept with user-level privileges and adds the cold-DM infrastructure needed for proactive outbound messaging (pairing, approvals, welcome flows). Privilege model - New tables: users, user_roles (owner global-only; admin global or scoped to an agent_group), agent_group_members (explicit non- privileged access; admin/owner imply membership), user_dms (cold-DM resolution cache). - Removed agent_groups.is_admin, messaging_groups.admin_user_id. Replaced with messaging_groups.unknown_sender_policy (strict | request_approval | public) for per-chat unknown-sender gating. - src/access.ts: canAccessAgentGroup, pickApprover, pickApprovalDelivery. - src/router.ts: access gate on every inbound, honoring unknown_sender_policy for unknown senders. - src/channels/telegram.ts: pairing interceptor upserts the paired user and promotes them to owner if hasAnyOwner() is false (first-pair-wins). Cold DM infrastructure - ChannelAdapter.openDM?(handle) — optional method. Chat-SDK-bridge wires it to chat.openDM() for resolution-required channels (Discord, Slack, Teams, Webex, gChat); direct-addressable channels (Telegram, WhatsApp, iMessage, Matrix, Resend) fall through to the handle directly. - src/user-dm.ts: ensureUserDm(userId) — resolves + caches via user_dms. Approval routing - onecli-approvals + delivery use pickApprover + pickApprovalDelivery: scoped admins → global admins → owners (dedup), first reachable via ensureUserDm, same-channel-kind tie-break. Approvals land in the approver's DM, not the origin chat. Delivery fixes - delivery.ts ACL rejection now throws instead of returning undefined — the outer loop previously marked rejected messages as delivered. - Implicit-origin allow: session.messaging_group_id === target skips the destination check. 
- createMessagingGroupAgent auto-creates the companion agent_destinations row (normalized local_name from the messaging group's name, collision- broken within the agent's namespace). Container - container-runner.ts: /workspace/global always read-only; drops NANOCLAW_IS_ADMIN; adds NANOCLAW_ADMIN_USER_IDS (owners + global admins + scoped admins for this agent group). Agent-runner poll-loop gates slash commands against that set. New skill: /init-first-agent - Walks the operator through standing up the first agent for a channel: channel pick → identity lookup (reads each channel SKILL.md's ## Channel Info > how-to-find-id) → DM platform_id resolution (direct- addressable, cold-DM via "user DMs bot first + sqlite lookup", or Telegram pair-code fallback) → run scripts/init-first-agent.ts → verify via tail of nanoclaw.log. - scripts/init-first-agent.ts: parameterized helper that upserts the user + grants owner (if none), creates dm-with-<display-name> agent group + initGroupFilesystem, reuses/creates the DM messaging_group, wires it (auto-creates destination), resolves the session, and writes a kind:'chat' / sender:'system' welcome message into inbound.db. Host sweep wakes the container and the agent DMs the operator via the normal delivery path. /manage-channels rewrite - Drops --is-main / --jid / main-vs-non-main isolation references. - First-channel flow delegates to /init-first-agent. - Explains createMessagingGroupAgent auto-creates destinations. - Adds a privileged-users show section. setup/ - register.ts: drop --is-main, --jid, --local-name, --trigger requiresTrigger defaults; call initGroupFilesystem; normalize to v2 schema (no is_admin, no admin_user_id, sets unknown_sender_policy 'strict'); let createMessagingGroupAgent handle the destination row. - pair-telegram.ts: emit PAIRED_USER_ID (namespaced "telegram:<id>") instead of ADMIN_USER_ID; update header comment. - register.test.ts deleted — was v1-only, tested a registered_groups table that no longer exists. 
Docs - v2-architecture-diagram.{md,html}: ER diagram updated to drop is_admin/admin_user_id, add unknown_sender_policy, and include users/user_roles/agent_group_members/user_dms. - v2-architecture-draft.md: approval-routing paragraph rewritten for pickApprover/pickApprovalDelivery/ensureUserDm; SQL schema block updated; admin-verification paragraph references NANOCLAW_ADMIN_USER_IDS. - v2-setup-wiring.md: entity-model sketch rewritten. - v2-checklist.md: marked privilege refactor / container filtering / approval routing / unknown-sender gating done; removed obsolete admin_user_id and main-vs-non-main items. Scripts - scripts/init-first-agent.ts (new) replaces scripts/welcome-owner-dm.ts (removed; welcome-owner was a Discord-specific one-off). - test-v2-host.ts, test-v2-channel-e2e.ts, seed-discord.ts: drop is_admin + admin_user_id, use unknown_sender_policy. Tests - src/access.test.ts (new): 14 tests for canAccessAgentGroup, role helpers, pickApprover, ensureUserDm, pickApprovalDelivery. - src/db/db-v2.test.ts: adds 3 tests for the auto-created agent_destinations row (normalized name, no duplicates, collision break within an agent group). - host-core.test.ts, channel-registry.test.ts: updated fixtures to use unknown_sender_policy: 'public' where the test exercises routing rather than the access gate. 🤖 Generated with [Claude Code](https://claude.com/claude-code) Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>	2026-04-15 00:03:51 +03:00
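The approver ordering named in the 0d3326aae5 message (scoped admins, then global admins, then owners, deduplicated) can be sketched as a pure ordering step. This is an illustration only: the `RoleRow` shape and the `approverCandidates` name are hypothetical, and the reachability check via `ensureUserDm` and the same-channel-kind tie-break are left out because the log does not describe them in enough detail.

```typescript
// Hypothetical view of a user_roles row; agentGroupId is null for a global role.
interface RoleRow {
  userId: string;
  role: "owner" | "admin";
  agentGroupId: string | null;
}

// Returns user ids in approval priority order, each user at most once.
function approverCandidates(roles: RoleRow[], agentGroupId: string): string[] {
  const scopedAdmins = roles.filter((r) => r.role === "admin" && r.agentGroupId === agentGroupId);
  const globalAdmins = roles.filter((r) => r.role === "admin" && r.agentGroupId === null);
  const owners = roles.filter((r) => r.role === "owner");

  const ordered = [...scopedAdmins, ...globalAdmins, ...owners].map((r) => r.userId);

  // Keep the first (highest-priority) occurrence of each user.
  return ordered.filter((id, index) => ordered.indexOf(id) === index);
}
```

A caller would then walk this list and stop at the first user for whom a DM can be resolved.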
gavrielc	b76fd425c8	style: prettier formatting fixes Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>	2026-04-09 12:18:31 +03:00
gavrielc	82cb363f84	v2: split session DB into inbound/outbound for write isolation Eliminates SQLite write contention across the host-container mount boundary by splitting the single session.db into two files, each with exactly one writer: inbound.db — host writes (messages_in, delivered tracking) outbound.db — container writes (messages_out, processing_ack) Key changes: - Host uses even seq numbers, container uses odd (collision-free) - Container heartbeat via file touch instead of DB UPDATE - Scheduling MCP tools now emit system actions via messages_out (host applies them to inbound.db during delivery) - Host sweep reads processing_ack + heartbeat file for stale detection - OneCLI ensureAgent() call added (was missing from v2, caused applyContainerConfig to reject unknown agent identifiers) Verified: tsc clean, 327 tests pass, real e2e through Docker works. Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>	2026-04-09 12:17:31 +03:00
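The even/odd sequence split mentioned in the 82cb363f84 message can be illustrated as follows. This is a sketch under assumptions, not the repository's allocator: `nextSeq` and its "highest seq seen so far" input are hypothetical; only the parity rule (host even, container odd) comes from the message.

```typescript
type Writer = "host" | "container";

// Returns the smallest seq greater than maxSeenSeq with the writer's parity:
// even for the host, odd for the container.
function nextSeq(maxSeenSeq: number, writer: Writer): number {
  const parity = writer === "host" ? 0 : 1;
  let candidate = maxSeenSeq + 1;
  if (candidate % 2 !== parity) candidate += 1;
  return candidate;
}

console.log(nextSeq(0, "host")); // 2
console.log(nextSeq(0, "container")); // 1
console.log(nextSeq(2, "container")); // 3
```

Because each writer only ever emits one parity, the host writing to inbound.db and the container writing to outbound.db cannot allocate the same number even though neither coordinates with the other.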
gavrielc	9486d56b01	v2: make v2 the main entry point, move v1 to src/v1/ - Move all v1 files (index, router, container-runner, db, ipc, types, logger, channels/registry, and all utilities) to src/v1/ as a fully self-contained archive with no shared dependencies - Rename v2 files to remove -v2 suffix (index-v2.ts → index.ts, etc.) - Update all imports across v2 source, tests, and setup files - Migrate shared utilities (config, env, container-runtime, mount-security, timezone, group-folder) from pino logger to v2 log module - Migrate setup/ files from logger to log with argument order swap - Container agent-runner: move v1 entry to v1/, rename v2 to index.ts - Update setup skill to offer all 13 v2 channels - Install all Chat SDK adapter packages - dist/index.js now runs v2; dist/v1/index.js runs v1 Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>	2026-04-09 11:40:36 +03:00
gavrielc	d35386a46e	style: apply prettier formatting to v2 source files Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>	2026-04-08 23:59:08 +03:00
gavrielc	8535875d0c	v2: add host core integration tests Tests for session manager (folder/DB creation, shared vs per-thread resolution, message writing), router (end-to-end routing, auto-create messaging groups), and delivery (undelivered message detection). Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>	2026-04-08 23:44:26 +03:00

16 Commits