Craft Agents - The way working with AI Agents should feel
Based on Craft Docs's video on YouTube. If you like this content, support the original creators by watching, liking and subscribing to their content.
Craft Agents separates riskier actions into explore mode (plan only) and execute mode (command execution), with an additional “ask to edit” permission gate for specific filesystem changes.
Briefing
Craft Agents is an open-source, multiplatform AI “agent” app built to make day-to-day work with AI feel controlled, auditable, and fast—especially when the agent needs to touch a user’s files or connect to external tools. The core pitch is an agent workflow that separates planning from execution, shows exactly what commands and configuration changes it intends to make, and then asks for permission when real-world actions are at stake.
After installing from agents.craft.2 (Mac with Apple silicon, Mac with Intel, Windows, and Linux) users choose how to connect to an AI endpoint. Craft Agents supports cloud subscriptions by default (recommended for best experience) and also offers alternatives such as direct API key connections, OpenRouter, Vercel AI gateway, Ollama, or other custom providers. Once connected, new chats can start immediately, but the app’s standout behavior appears when tasks require system access. In a demo, the agent is asked to reorganize the user’s downloads folder, identify deletable items, and estimate storage savings. Instead of blindly running commands, it uses an “explore mode” that blocks execution and produces a plan. The UI then reveals the steps it would take—down to the commands and outputs—along with why certain actions were blocked (for example, because execution is restricted in explore mode).
Users can then accept the plan to switch into “execute mode,” where the agent carries out file operations (deleting, moving, creating folders) and provides a running to-do list. If trust is an issue, Craft Agents supports an “ask to edit” style: the agent proposes a specific filesystem change, lists the exact files involved, and prompts for permission (including options like always allow or deny). For users who want fewer interruptions, switching back to execute mode allows the agent to continue without repeated confirmations.
Beyond local actions, Craft Agents emphasizes “sources” as the mechanism for connecting tools. A dedicated sources menu supports APIs, MCPs, and even local folders. Rather than requiring users to understand every integration, the app can guide setup through documentation and guided prompts. In examples, it walks through connecting to Linear (issues/tickets) and GitHub (listing issues and organizing pull requests), handling authentication via secure credential storage and prompting for API keys only when needed. The app also provides an edit view showing configuration diffs—lines added or removed—so users can audit what changed.
Organization is handled through statuses, labels, and auto-apply rules. Conversations start in a default status but can be moved with one click or hashtag shortcuts. Labels can be assigned automatically using rules (e.g., any session containing “bill” or “invoice” gets an “invoices” label). Search is designed for scale, with fast retrieval across thousands of sessions and the ability to highlight matches inside both the session list and the conversation body, plus AI-generated title regeneration.
Finally, Craft Agents turns multi-step workflows into reusable “skills.” A weekend-planning skill for Lake Balaton demonstrates combining sources (Outlook calendar, weather lookup, and cat-facts API) into a single plan that can be triggered later. Skills can be visualized as Mermaid flowcharts, showing decision branches (season-based activity suggestions) and the sequence of actions (web search, weather-based recommendations, calendar event creation, and final presentation). The overall message: agentic automation is most usable when it’s planned, permissioned, connected to real tools, and easy to audit.
Cornell Notes
Craft Agents is an open-source, multiplatform AI agent app designed for practical work: it plans before it acts, shows what it will do, and then executes with user permission when tasks affect files or external systems. Users connect to AI endpoints (cloud subscriptions recommended, but API keys, OpenRouter, Vercel AI gateway, Ollama, and custom options are supported), then run chats that can switch between explore mode (plan only) and execute mode (run commands). The app also manages “sources” like APIs and MCPs, guiding setup for tools such as Linear and GitHub while storing credentials securely and displaying configuration diffs. Organization features—statuses, labels, auto-apply rules, and fast search—help users manage thousands of conversations. Reusable “skills” bundle multi-step workflows and can be rendered as Mermaid flowcharts for clarity.
How does Craft Agents prevent an AI from taking risky actions immediately?
What does “accept and compact” do, and why does it matter for long tasks?
How are external tools integrated, and what makes setup less painful?
What audit and security features show up when the agent changes configuration or touches credentials?
How do statuses, labels, and auto-apply rules organize conversations?
What are skills, and how do they help with repeatable workflows?
Review Questions
- When would a user choose explore mode versus execute mode, and what UI evidence confirms the agent’s plan before execution?
- How does Craft Agents handle authentication for sources like Linear and GitHub, and what mechanisms prevent credentials from being stored in chat sessions?
- Describe how statuses, labels, and auto-apply rules work together to keep large conversation histories navigable.
Key Points
- 1
Craft Agents separates riskier actions into explore mode (plan only) and execute mode (command execution), with an additional “ask to edit” permission gate for specific filesystem changes.
- 2
The UI provides step-by-step transparency, including the exact commands the agent intends to run and the reasons execution is blocked in explore mode.
- 3
Sources unify integrations across APIs, MCPs, and local folders, with guided setup that can use documentation and web search to reduce manual configuration.
- 4
Configuration changes are auditable via diff-style edit views (lines added/removed), while credentials are handled through secure authentication cards and local credential storage.
- 5
Statuses and labels organize conversations, and auto-apply rules can automatically tag sessions based on content keywords.
- 6
Search is optimized for speed across thousands of sessions, highlighting matches both in the session list and inside conversation bodies, with AI-assisted title regeneration.
- 7
Skills turn multi-source workflows into reusable routines and can be visualized as Mermaid flowcharts for easier understanding and auditing.