nanoclaw

Author	SHA1	Message	Date
gavrielc	cc784ff94b	refactor(v2): remove trigger_credential_collection MCP tool Drops the in-chat credential-collection flow introduced in `e92b245`. Agents can no longer collect API keys via a secure modal — users must add secrets through OneCLI directly. Keeps the OneCLI manual-approval handler and threaded-routing work from the same commit intact. Removed: * container/agent-runner/src/mcp-tools/credentials.ts (MCP tool) * src/credentials.ts (host-side modal/OneCLI pipeline) * src/db/credentials.ts + migration 005 (pending_credentials table) * src/onecli-secrets.ts (createSecret CLI facade, only caller was credentials.ts) * findCredentialResponse from agent-runner DB layer * PendingCredential types * Four credential hooks from ChannelSetup (getCredentialForModal, onCredentialReject, onCredentialSubmit, onCredentialChannelUnsupported) * Credential card/modal handling in chat-sdk-bridge (nccr/nccm prefixes, Modal/TextInput imports) * credential_request text fallback in WhatsApp adapter * request_credential system-action case in delivery.ts Added: * Migration 009 drops pending_credentials on existing installs. Vercel skill now tells the agent to ask the user to register the token via OneCLI instead of invoking the removed tool. Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>	2026-04-16 21:41:41 +03:00
gavrielc	e55ed0f4e8	fix(whatsapp): upgrade Baileys 6.7→6.17, fix proto import and 515 restart Baileys 6.7.21 silently failed the pairing handshake. Upgrade to 6.17.16 which fixes this. Three related issues: 1. proto is no longer a named ESM export in 6.17.x — use createRequire to import via CJS (matching the proven v1 pattern). 2. Setup auth script didn't handle the 515 stream restart that WhatsApp sends after successful pairing. Refactored to reconnect (matching v1's connectSocket(isReconnect) pattern) instead of hanging until timeout. 3. Added succeeded guard and process.exit(0) to prevent timeout race after successful auth. Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>	2026-04-16 21:01:55 +03:00
exe.dev user	8ef26d323f	fix(v2/telegram): await pairing interceptor work to serialize DB commits The Telegram pairing interceptor fired DB writes (createMessagingGroup, upsertUser, grantRole) and the pairing-success confirmation inside an unawaited `void (async () => {...})()`. Recent changes (`0d3326a` user privilege model, `c483860` pairing confirmation) widened the work done inside this closure to include an extra two DB writes and a Telegram API round-trip, making the race between match and commit reproducible — a paired message could appear "lost" until a second send. Change onInbound to optionally return a Promise, await it in the chat-sdk-bridge dispatch callbacks, and make the pairing interceptor async so its DB writes + confirmation send complete before the handler resolves. Note: the upstream @chat-adapter/telegram SDK itself does not await processUpdate in its polling loop, so the adapter's getUpdates offset still advances before our handler resolves. A true restart-safe fix needs a corresponding change in chat-adapter. Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>	2026-04-15 12:16:49 +00:00
gavrielc	0d3326aae5	feat(v2): user-level privilege model + cold DM infra + init-first-agent skill Replaces the agent-group-centric "main group" concept with user-level privileges and adds the cold-DM infrastructure needed for proactive outbound messaging (pairing, approvals, welcome flows). Privilege model - New tables: users, user_roles (owner global-only; admin global or scoped to an agent_group), agent_group_members (explicit non- privileged access; admin/owner imply membership), user_dms (cold-DM resolution cache). - Removed agent_groups.is_admin, messaging_groups.admin_user_id. Replaced with messaging_groups.unknown_sender_policy (strict \| request_approval \| public) for per-chat unknown-sender gating. - src/access.ts: canAccessAgentGroup, pickApprover, pickApprovalDelivery. - src/router.ts: access gate on every inbound, honoring unknown_sender_policy for unknown senders. - src/channels/telegram.ts: pairing interceptor upserts the paired user and promotes them to owner if hasAnyOwner() is false (first-pair-wins). Cold DM infrastructure - ChannelAdapter.openDM?(handle) — optional method. Chat-SDK-bridge wires it to chat.openDM() for resolution-required channels (Discord, Slack, Teams, Webex, gChat); direct-addressable channels (Telegram, WhatsApp, iMessage, Matrix, Resend) fall through to the handle directly. - src/user-dm.ts: ensureUserDm(userId) — resolves + caches via user_dms. Approval routing - onecli-approvals + delivery use pickApprover + pickApprovalDelivery: scoped admins → global admins → owners (dedup), first reachable via ensureUserDm, same-channel-kind tie-break. Approvals land in the approver's DM, not the origin chat. Delivery fixes - delivery.ts ACL rejection now throws instead of returning undefined — the outer loop previously marked rejected messages as delivered. - Implicit-origin allow: session.messaging_group_id === target skips the destination check. - createMessagingGroupAgent auto-creates the companion agent_destinations row (normalized local_name from the messaging group's name, collision- broken within the agent's namespace). Container - container-runner.ts: /workspace/global always read-only; drops NANOCLAW_IS_ADMIN; adds NANOCLAW_ADMIN_USER_IDS (owners + global admins + scoped admins for this agent group). Agent-runner poll-loop gates slash commands against that set. New skill: /init-first-agent - Walks the operator through standing up the first agent for a channel: channel pick → identity lookup (reads each channel SKILL.md's ## Channel Info > how-to-find-id) → DM platform_id resolution (direct- addressable, cold-DM via "user DMs bot first + sqlite lookup", or Telegram pair-code fallback) → run scripts/init-first-agent.ts → verify via tail of nanoclaw.log. - scripts/init-first-agent.ts: parameterized helper that upserts the user + grants owner (if none), creates dm-with-<display-name> agent group + initGroupFilesystem, reuses/creates the DM messaging_group, wires it (auto-creates destination), resolves the session, and writes a kind:'chat' / sender:'system' welcome message into inbound.db. Host sweep wakes the container and the agent DMs the operator via the normal delivery path. /manage-channels rewrite - Drops --is-main / --jid / main-vs-non-main isolation references. - First-channel flow delegates to /init-first-agent. - Explains createMessagingGroupAgent auto-creates destinations. - Adds a privileged-users show section. setup/ - register.ts: drop --is-main, --jid, --local-name, --trigger requiresTrigger defaults; call initGroupFilesystem; normalize to v2 schema (no is_admin, no admin_user_id, sets unknown_sender_policy 'strict'); let createMessagingGroupAgent handle the destination row. - pair-telegram.ts: emit PAIRED_USER_ID (namespaced "telegram:<id>") instead of ADMIN_USER_ID; update header comment. - register.test.ts deleted — was v1-only, tested a registered_groups table that no longer exists. Docs - v2-architecture-diagram.{md,html}: ER diagram updated to drop is_admin/admin_user_id, add unknown_sender_policy, and include users/user_roles/agent_group_members/user_dms. - v2-architecture-draft.md: approval-routing paragraph rewritten for pickApprover/pickApprovalDelivery/ensureUserDm; SQL schema block updated; admin-verification paragraph references NANOCLAW_ADMIN_USER_IDS. - v2-setup-wiring.md: entity-model sketch rewritten. - v2-checklist.md: marked privilege refactor / container filtering / approval routing / unknown-sender gating done; removed obsolete admin_user_id and main-vs-non-main items. Scripts - scripts/init-first-agent.ts (new) replaces scripts/welcome-owner-dm.ts (removed; welcome-owner was a Discord-specific one-off). - test-v2-host.ts, test-v2-channel-e2e.ts, seed-discord.ts: drop is_admin + admin_user_id, use unknown_sender_policy. Tests - src/access.test.ts (new): 14 tests for canAccessAgentGroup, role helpers, pickApprover, ensureUserDm, pickApprovalDelivery. - src/db/db-v2.test.ts: adds 3 tests for the auto-created agent_destinations row (normalized name, no duplicates, collision break within an agent group). - host-core.test.ts, channel-registry.test.ts: updated fixtures to use unknown_sender_policy: 'public' where the test exercises routing rather than the access gate. 🤖 Generated with [Claude Code](https://claude.com/claude-code) Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>	2026-04-15 00:03:51 +03:00
Gabi Simons	8676c07448	feat(v2): support async channel adapter factories Channel adapter factories can now return a Promise, enabling adapters that need async initialization like loading auth state from disk (e.g. WhatsApp reading credentials via useMultiFileAuthState). Existing sync factories are unaffected — await on a sync return is a no-op. All current adapters remain synchronous. Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>	2026-04-13 11:11:06 +00:00
gavrielc	9dda75bb21	docs(v2): cross-mount invariants + diagrams; inline a2a routing - session-manager.ts: shrink the cross-mount invariant header from 31 lines to 12, keeping each invariant's cause and consequence inline. - agent-runner/db/connection.ts: parallel cross-mount comment for the container-side reader (inbound.db must be journal_mode=DELETE). - agent-runner/db/messages-out.ts: document that even/odd seq parity is load-bearing — seq is the agent-facing message ID returned by send_message and consumed by edit_message / add_reaction, looked up across both tables. - v2-checklist.md: record the cross-mount invariants and seq parity under Core Architecture so future "simplifications" don't regress them. - scripts/sanity-live-poll.ts: empirical validation harness for the three cross-mount invariants — flips each one and observes silent message loss / corruption. - delivery.ts: inline routeAgentMessage at its single callsite (-17 net lines). The wrapper added more boilerplate than it factored. - docs/v2-architecture-diagram.{md,html}: rendered Mermaid diagrams of the v2 system, message flow, named destinations, entity model, and the two-DB split. - channels/adapter.ts, chat-sdk-bridge.ts, credentials.ts, db/sessions.ts, db/db-v2.test.ts: prettier format pass. Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>	2026-04-12 00:21:12 +03:00
gavrielc	e92b245399	feat(v2): OneCLI 0.3.1 — approvals, credential collection, threaded routing Three features built on top of @onecli-sh/sdk 0.3.1, landed together because they share wiring surfaces (session DB schema, delivery dispatcher, Chat SDK bridge, channel adapter contract). ## OneCLI manual-approval handler * `src/onecli-approvals.ts` — long-polls OneCLI via the SDK's `configureManualApproval`; on each request, delivers an `ask_question` card to the admin agent group's first messaging group, persists a `pending_approvals` row, and waits on an in-memory Promise resolved by the admin's button click or an expiry timer. Expired cards are edited to "Expired (...)" and a startup sweep flushes any rows left over from a previous process. * Short 11-byte approval id (`oa-<8 base36>`) instead of the SDK's UUID so the Telegram 64-byte `callback_data` limit is respected; the OneCLI UUID stays in the persisted payload for audit. * Migration 003 consolidated: `pending_approvals` now has the OneCLI-aware columns from the start (`agent_group_id`, `channel_type`, `platform_id`, `platform_message_id`, `expires_at`, `status`), `session_id` relaxed to nullable so cross-session approvals fit. * `handleQuestionResponse` in `src/index.ts` now routes OneCLI approvals through `resolveOneCLIApproval` before falling back to the session-bound approval path. ## Credential collection from chat New `trigger_credential_collection` MCP tool — the agent researches a third-party API, calls the tool with `{name, hostPattern, headerName, valueFormat, description}`, and blocks until the host reports saved, rejected, or failed. The credential value never enters the agent's context: the user submits it into a Chat SDK Modal on the host side, the host writes it to OneCLI via a thin facade (`src/onecli-secrets.ts` — shells out to `onecli secrets create`, shape mirrors the SDK we expect upstream), and only the status string flows back to the container via a system message. * `src/credentials.ts` — host-side handler: delivers the card to the conversation's own channel (not the admin channel — credential collection is a user-facing flow, distinct from admin approval), persists a `pending_credentials` row, drives the submit → `createSecret` → notify pipeline. Falls back gracefully when the channel doesn't support modals. * `src/db/credentials.ts` + migration 005: `pending_credentials` table. * `src/channels/chat-sdk-bridge.ts`: renders a `credential_request` card, handles the `nccr:` action prefix by opening a Modal with a TextInput, registers an `onModalSubmit` handler for the `nccm:` callback prefix. * `container/agent-runner/src/mcp-tools/credentials.ts`: the blocking MCP tool, mirroring the `ask_user_question` polling pattern. * `container/agent-runner/src/db/messages-in.ts`: `findCredentialResponse` helper to pick up the system message the host writes back. ## Threaded adapter routing The destination layer previously didn't carry thread context, so agent replies to Discord always landed in the root channel regardless of which thread the inbound came from. * `ChannelAdapter.supportsThreads: boolean` — declared by every channel skill at `createChatSdkBridge`. Threaded: Discord, Slack, Teams, Google Chat, Linear, GitHub, Webex. Non-threaded: Telegram, WhatsApp Cloud, Matrix, Resend, iMessage. * `src/router.ts`: non-threaded adapters strip `threadId` at ingest (threads collapse to channel-level sessions). Threaded adapters override the wiring's `session_mode` to `'per-thread'` so each thread = a session (except `agent-shared`, which is preserved as a cross-channel intent the adapter can't know about). * `session_routing` table in `inbound.db` — single-row default reply routing written by the host on every container wake from `session.messaging_group_id` + `session.thread_id`. Forward-compat `CREATE TABLE IF NOT EXISTS` handles older session DBs lazily. * `container/agent-runner/src/db/session-routing.ts` — container-side reader. * `send_message` / `send_file` / `ask_user_question` / `send_card` / scheduling tools all default their routing (channel, platform, and thread) from the session when no explicit `to` is given. Explicit `to` uses the destination's channel with `thread_id = null` (cross-destination sends start a new conversation elsewhere). * `poll-loop.ts::sendToDestination` (the final-text single-destination shortcut) now inherits `thread_id` from `RoutingContext` too — this was the root cause of Discord replies landing in the root channel even after `send_message` was wired correctly. ## Related cleanups * `src/container-runner.ts`: OneCLI agent identifier switched from the lossy folder-derived string to `agent_group.id`, making `getAgentGroup(externalId)` a trivial reverse lookup for per-agent scoping. * `wakeContainer` race fix via an in-flight promise map — concurrent wakes during the async buildContainerArgs / OneCLI `applyContainerConfig` window no longer double-spawn containers against the same session directory. * `src/db/db-v2.test.ts`: dropped the brittle `expect(row.v).toBe(N)` schema version assertion — it had to be bumped on every migration addition. Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>	2026-04-11 17:18:21 +03:00
gavrielc	d8fbd3b239	feat: agent-to-agent communication, dynamic agent creation, self-modification tools Agent-to-agent: host routes messages with channel_type='agent' to target agent's inbound.db, enriches with sender info, wakes target container. Bidirectional routing works via inherited routing context. Dynamic agents: create_agent MCP tool + system action handler creates agent groups, folders, and optional CLAUDE.md on the fly. Self-modification: install_packages (apt/npm, requires admin approval), add_mcp_server (no approval), request_rebuild (builds per-agent-group Docker image with approved packages). Approval flow reuses interactive card infrastructure with pending_approvals table. Also includes fixes from prior session: attachment download, reply context extraction, message editing (platform message ID tracking), delivery retry limits, and card update on button click. Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>	2026-04-10 01:11:06 +03:00
gavrielc	57a6491c7e	v2: channel isolation model, manage-channels skill, refactored channel skills - Add three-level isolation model (shared session, same agent, separate agent) with agent-shared session mode for cross-channel shared sessions - Create /manage-channels skill for wiring channels to agent groups - Refactor all 12 v2 channel skills: lean SKILL.md + VERIFY.md + REMOVE.md with structured Channel Info section for platform-specific metadata - Create /add-discord-v2 skill (was missing) - Add step 5a to setup SKILL.md invoking /manage-channels after channel install - Update setup/verify.ts to check all 12 channel token types - Add docs/v2-isolation-model.md explaining the isolation model - Update v2-checklist.md and v2-setup-wiring.md to reflect completed work Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>	2026-04-09 13:19:19 +03:00
gavrielc	c31bb02c06	v2 phase 5: pending questions with interactive cards End-to-end ask_user_question flow: - Agent MCP tool writes question card to messages_out - Host delivery creates pending_questions row, delivers as Discord Card with buttons - Local webhook server receives Gateway INTERACTION_CREATE events - Acknowledges interaction + updates card to show selected answer - Routes response back to session DB as system message - MCP tool poll picks up response and returns to agent Key fixes: - Poll loop now skips system messages (reserved for MCP tool responses) - Gateway listener uses webhookUrl forwarding mode for interaction support - Button custom_id encodes questionId + option text for self-contained routing Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>	2026-04-09 03:26:16 +03:00
gavrielc	c348fabf22	v2 phase 5: scheduling fixes, media handling, command processing - Host sweep: fix DELETE journal mode, busy_timeout, seq in recurrence INSERT - Outbound files: delivery reads from outbox dir, passes buffers to adapter, cleans up after delivery. Chat SDK bridge sends files via postMessage. - Inbound attachments: formatter includes attachment info in prompts - Commands: categorize /commands as admin, filtered, or passthrough. Admin commands check sender against NANOCLAW_ADMIN_USER_ID. Filtered commands silently dropped. Passthrough sent raw to agent. Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>	2026-04-09 02:59:33 +03:00
gavrielc	7201fe5032	v2 phase 4: channel adapter interface, registry, and host wiring ChannelAdapter interface with setup/deliver/teardown/setTyping lifecycle. Self-registration pattern via channel-registry. Host wiring in index-v2 bridges inbound messages to routeInbound and outbound delivery to adapters. Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>	2026-04-09 00:10:46 +03:00

12 Commits