The One Rule Claude Code's Creator Never Breaks

AAI LABS
Computing/SoftwareAdult EducationInternet Technology

Transcript

00:00:00This is Boris, and when people get stuck with Claude Code, they ask him how to use it.
00:00:04But why would they ask him?
00:00:05It's because Boris works at Anthropic and he's the one who actually created Claude Code.
00:00:09Even if he says that his setup is pretty simple, it's not.
00:00:12That's just his perspective.
00:00:13What's simple to him might be a whole system to everyone else.
00:00:16Now we've got a lot to talk about, as I was honestly surprised at some of the points,
00:00:20and while some things were technically older,
00:00:22he still gave me a fresh perspective on those features as well.
00:00:25In one of his tips, Boris corrects one of the biggest misconceptions people have
00:00:29when trying to build something with Claude Code.
00:00:31They basically give it a task, and then when there are minor errors,
00:00:34or it's not able to complete it 100%, they get disappointed.
00:00:38But humans work in a very similar way.
00:00:40That's why he tells us that we should always give Claude a way to verify its work.
00:00:44For example, if you're going to write any Python code for building a project,
00:00:47you should tell it to use tests as well.
00:00:49This is because Claude gets that feedback loop,
00:00:51and this instantly increases the quality of the final result.
00:00:54And if you don't even want to get into the code,
00:00:56you can actually ask it which tests you should use,
00:00:59and it's going to give you a test for every single thing that it implements.
00:01:02Even if you want it to work autonomously,
00:01:04which Boris also has a lot of tips on,
00:01:06you can just edit the Claude.md file, which gets sent into every Claude Code session.
00:01:11In there you can tell it that before it does any work,
00:01:14it should mention how it would verify that work, so it gets that constant feedback loop.
00:01:18Another important point that Boris identifies is that verification is different for each domain.
00:01:22Again, it might be as simple as running a bash command,
00:01:25or for UI-based verification, you have a lot of different ways to do that as well,
00:01:29one of which is Claude's own browser extension.
00:01:32We made a pretty extensive video on that too, and we'll link it down below.
00:01:35Basically, Claude navigates the whole site and can verify everything.
00:01:39This works as verification for pretty much anything.
00:01:41For example, someone asked how they could verify a mobile app that was being built.
00:01:45There are actually a lot of iOS and Android simulator MCPs that you can use as well.
00:01:50But if you think these MCPs are too token hungry,
00:01:52then you can use not just verification, but verification-led techniques.
00:01:56For example, you can use test-driven development where tests for a new feature are written first,
00:02:01and then you do the actual implementation.
00:02:03You can also use screenshots in that process to verify the UI as well.
00:02:06Speaking of the Claude.md, Boris also emphasizes how important this file is
00:02:11and how unique it should be to each Claude Code repo.
00:02:14Think of it this way, it should contain all the basic information
00:02:17that a newly created Claude session should have about your project.
00:02:20This includes the tech stack, the basic project structure,
00:02:23some code style and conventions that your team follows,
00:02:26and most importantly, the things that it shouldn't do.
00:02:28This is because if you encounter errors in any of your sessions,
00:02:32you probably don't want Claude to repeat them.
00:02:34So he has made it a habit to ask his whole team to contribute to the Claude.md multiple times a week.
00:02:39Anytime they see it doing something incorrectly, they add it there as well.
00:02:43Each member of the team also has a specific responsibility for maintaining a specific file.
00:02:47Another great structure for full stack apps is that each microservice you're running can have its own Claude.md.
00:02:53The front end can have its own, and the back end can have its own as well.
00:02:56Another important thing is not to bloat the Claude.md.
00:02:59He confirms this by saying that his own Claude.md is about 2.5K tokens,
00:03:04and he even highlighted exactly what they cover in it.
00:03:06You might already know about the different modes you can use Claude Code with,
00:03:10with plan mode being specifically for planning.
00:03:12We'll come back to this later, as it's an important point.
00:03:15Then there's auto-accept mode, which basically auto-accepts all file edits.
00:03:19But there's another option as well, which is the dangerously skip permissions mode,
00:03:23where Claude essentially has permission to run everything.
00:03:26Boris says he doesn't use this, and he's right, especially in production environments,
00:03:30where a single terminal command can mess up the entire project.
00:03:33Instead, he uses the permissions in Claude Code to set up which commands are allowed,
00:03:37which commands it should ask before running, and which commands are completely denied.
00:03:41You can pretty much add any rule you want and set it at the project level,
00:03:45and Claude is going to follow that.
00:03:46To share this with his team, the .Claude folder also contains a settings.json file
00:03:51where all of these commands are written down,
00:03:53and he can then share that file with the rest of the team.
00:03:56In a way, Claude Code has been one of the most badly named products from Anthropic,
00:04:00because it's not only used for code.
00:04:02You might have even seen people generating full-blown Anetian workflows with it,
00:04:06or even setting up entire SEO pipelines or different agents for analyzing business markets and other things.
00:04:12The point is that it's a full system for people to control a lot of different workflows.
00:04:16The example that Boris gives is that Claude pretty much uses all of his other tools for him as well.
00:04:21For instance, he uses Slack with Claude Code using the MCP server,
00:04:25along with a bunch of other tools like BigQuery and Sentry,
00:04:28all through their CLIs inside Claude Code.
00:04:30Again, this just shows that you can use Claude Code as a form of orchestrator.
00:04:33For me specifically, one of the biggest ways I use it in this orchestration is inside Notion,
00:04:38where Claude connects to Notion via MCP.
00:04:40It's able to create databases, populate them, and understand the context of all our video ideas.
00:04:45Of course, you're going to see them blurred because these are upcoming video ideas,
00:04:49but I hope you get the point.
00:04:50Now, if you're wondering how Boris actually uses Claude Code before starting to implement any feature or task,
00:04:56he first switches over to plan mode and clearly describes what he wants.
00:05:00Once Claude has a plan, he continues from there.
00:05:02He argues that having a good plan is extremely important
00:05:05and that all the back and forth he needs to do with the agent happens during plan mode.
00:05:09He only starts the agent once he's verified that the plan is actually valid.
00:05:13After that, he kicks it off in auto-accept edits mode, not the dangerously skip permissions mode.
00:05:17As for how he uses it at scale, he also mentions that he runs five of these Claude Code sessions in parallel.
00:05:23While working with them, he basically numbers his tabs.
00:05:26He does this so that when he receives notifications from different tabs, he knows which terminal tab they're coming from.
00:05:32On top of that, he also heavily uses the web sessions of Claude Code.
00:05:35This is where you connect the web version of Claude Code to your GitHub,
00:05:39give it access to a demo repo, and it's able to work on that in the cloud.
00:05:43The way Boris works with this is that he uses the background agent's feature to put any long-running task into the background.
00:05:49Sometimes, he even uses it on his phone, where Claude can automatically do all the work in a cloud session,
00:05:54and when it's done, it creates a new branch and pushes all the code changes there for you to review.
00:05:59There is also the teleport command, which brings the cloud sessions back into the local terminal and allows you to view and work on it yourself.
00:06:06And this is truly the definition of working with any AI agent, where he's basically treating it as a junior developer.
00:06:11Another interesting point he mentioned is that he pretty much uses Opus 4.5 with thinking enabled for everything.
00:06:17He argues that even though it's a bigger and slower model and takes more time than Sonnet and significantly more time than the Haiku models,
00:06:24it has a much lower chance of making errors.
00:06:26As a result, the overall time spent working with the model and steering it is actually much lower with Opus 4.5.
00:06:33He also uses the Claude Code GitHub action, which is basically a bot that you can call inside your authorized GitHub repos.
00:06:40He's provided a really strong use case for this as well.
00:06:43During PR reviews, whenever he finds mistakes, he simply asks Claude to add those mistakes to the Claude.md file.
00:06:49As for long running tasks, he uses three different approaches.
00:06:52The first is asking Claude to verify its work using different tests that Claude Code itself comes up with and then putting those tasks into the background.
00:07:00These are not the cloud agents we're talking about.
00:07:02These are just background tasks that run silently and report back to the main agent once they're done.
00:07:07The second approach is using a stop hook to automatically trigger this verification when Claude stops outputting.
00:07:12The third option he mentions is based on this idea as well, which is what the Ralph Wiggum plugin is built on.
00:07:17We did a pretty extensive video on the Ralph Wiggum plugin.
00:07:20We showed how we use the Ralph plugin and how we test during development to verify that the UI Claude Code is producing is actually correct.
00:07:28It essentially removes the human from the loop entirely.
00:07:31Again, repeating the pattern of verification that he's talked about,
00:07:34he also uses it to verify the format of the code being written by using different kinds of linters and formatters for various languages.
00:07:41He basically says that while the models do generate correct code, these formatters help clean up the remaining 10%.
00:07:47And this is important so there aren't any errors when the code runs in CI as part of their GitHub integration pipelines.
00:07:53Now moving on to some of the simpler features that have been in Claude Code for a long time but still save a lot of time and that many beginners don't know very well.
00:08:01As he has already mentioned, his own workflow is surprisingly vanilla.
00:08:04This applies to most of us.
00:08:06We don't need to overcomplicate everything in Claude Code to be truly productive.
00:08:09These simple built-in features already get you about 90% of the productivity you're trying to achieve.
00:08:15Starting with /commands, he refers to his repetitive workflows as his "inner loop" workflows, things he has to repeat many times a day.
00:08:22He gives a very simple example of a GitHub /command that he uses frequently.
00:08:26Since all of these /commands are stored inside your .claud folder in a command subfolder, you can actually commit them to git.
00:08:33This is also how he shares them with his entire team.
00:08:36After that, another really simple built-in feature is subagents.
00:08:39People have gone a bit crazy with this feature, building subagents for almost every use case imaginable like market researchers or product validators.
00:08:47Everyone has their own use cases, but Boris uses subagents to automate simple workflows that require completely different instructions.
00:08:55For example, most of his agents are used to verify that the architecture of his code is correct, to refactor the code that's been written and to validate that the final builds actually work.
00:09:04Let's talk about Automata now.
00:09:05After teaching millions of people how to build with AI, we started implementing these workflows ourselves.
00:09:11We discovered we could build better products, faster than ever before.
00:09:14We helped bring your ideas to life, whether it's apps or websites.
00:09:18Maybe you've watched our videos thinking, "I have a great idea, but I don't have a tech team to build it."
00:09:22That's exactly where we come in.
00:09:24Think of us as your technical co-pilot.
00:09:26We apply the same workflows we've taught millions directly to your project, turning concepts into real, working solutions without the headaches of hiring or managing a dev team.
00:09:36Ready to accelerate your idea into reality?
00:09:39Reach out at hello@automata.dev
00:09:41That brings us to the end of this video.
00:09:43If you'd like to support the channel and help us keep making videos like this, you can do so by using the super thanks button below.
00:09:50As always, thank you for watching and I'll see you in the next one.

Key Takeaway

Boris, Claude Code's creator at Anthropic, emphasizes that successful AI-assisted development requires verification loops, careful planning in plan mode, and treating Claude as a junior developer with proper guardrails rather than autonomous dangerous permissions.

Highlights

Always give Claude a way to verify its work - use tests, bash commands, or browser extensions to create a feedback loop that instantly improves code quality

The Claude.md file should contain tech stack, project structure, code conventions, and things Claude shouldn't do - Boris's team updates it multiple times per week when errors occur

Boris never uses 'dangerously skip permissions' mode; instead he configures allowed/denied commands at the project level and shares settings via .claude/settings.json

Always start in plan mode before implementing - do all back-and-forth during planning, only switch to auto-accept edits mode once the plan is verified

Use Opus 4.5 with thinking enabled for everything - despite being slower, it makes fewer errors which saves overall time on steering and corrections

Claude Code is badly named because it's not just for coding - it's an orchestrator for workflows including Slack, BigQuery, Sentry, Notion databases, and other tools via MCP servers

Run multiple Claude Code sessions in parallel (Boris runs 5), use numbered tabs for tracking, leverage cloud sessions with background agents for long-running tasks, and use the GitHub action bot for PR reviews

Timeline

Introduction to Boris and Core Verification Philosophy

Boris works at Anthropic and created Claude Code, making him the go-to expert when people get stuck using the tool. He corrects a major misconception where users expect Claude to complete tasks 100% perfectly on the first try and get disappointed with minor errors. Boris emphasizes that humans work similarly and make mistakes too. The fundamental principle he advocates is always giving Claude a way to verify its work, such as through tests when writing Python code. This verification creates a feedback loop that instantly increases the quality of the final result.

Implementing Verification and Feedback Loops

Users can ask Claude which tests to implement, and it will provide tests for every feature it builds, even without users needing to understand the code details. For autonomous work, Boris recommends editing the Claude.md file, which gets sent into every Claude Code session, to instruct Claude to mention how it would verify work before starting any task. This ensures a constant feedback loop is maintained throughout the development process. The Claude.md file serves as persistent instructions that guide Claude's behavior across all sessions. Even non-technical users can benefit from this approach by having Claude suggest appropriate verification methods.

Domain-Specific Verification Methods

Boris explains that verification differs across domains and should be tailored to the specific type of work being done. For simple cases, verification might be as straightforward as running a bash command to check results. For UI-based verification, multiple approaches exist including Claude's own browser extension, which can navigate entire websites and verify everything automatically. For mobile app development, iOS and Android simulator MCPs are available for verification purposes. Beyond basic verification, Boris recommends verification-led techniques like test-driven development (TDD), where tests for new features are written first before implementation, and screenshots can be used to verify UI changes.

Mastering the Claude.md Configuration File

Boris emphasizes the critical importance of the Claude.md file and how it should be unique to each repository. This file should contain all basic information a newly created Claude session needs including the tech stack, basic project structure, code style and team conventions, and most importantly, things Claude shouldn't do. When errors occur in sessions, teams should document them in Claude.md to prevent repetition. Boris has made it a habit for his entire team to contribute to Claude.md multiple times per week whenever they observe Claude doing something incorrectly. Each team member has specific responsibility for maintaining particular sections of the file, ensuring comprehensive coverage and accountability.

Claude.md Structure and Token Management

For full-stack applications, Boris recommends a structured approach where each microservice has its own Claude.md file, with separate files for frontend and backend components. This modular approach prevents configuration bloat and keeps instructions relevant to specific contexts. However, Boris warns against bloating the Claude.md file with excessive information. His own Claude.md is approximately 2.5K tokens, which he considers an optimal size. He provided specific examples of what sections his team covers in their Claude.md file. The key is balancing comprehensiveness with conciseness to ensure Claude receives essential guidance without overwhelming context.

Understanding Claude Code Modes and Permissions

Claude Code offers different operational modes including plan mode for planning, auto-accept mode that automatically accepts file edits, and dangerously skip permissions mode where Claude has permission to run everything. Boris explicitly states he never uses the dangerous permissions mode, especially in production environments where a single terminal command could break the entire project. Instead, he configures Claude Code's permissions system to specify which commands are allowed automatically, which require confirmation before running, and which are completely denied. Users can add any rules they want and set them at the project level, and Claude will follow these guardrails consistently.

Sharing Team Settings and Beyond Coding Use Cases

The .claude folder contains a settings.json file where all permission commands and configurations are documented, allowing Boris to share these settings with his entire team for consistency. He argues that Claude Code is one of Anthropic's most badly named products because it's not limited to coding tasks. Users have successfully generated full Anetian workflows, set up entire SEO pipelines, created different agents for analyzing business markets, and built various other non-coding systems. Claude Code functions as a comprehensive system for controlling many different workflows, not just software development. Boris describes it as an orchestrator that can coordinate multiple tools and processes simultaneously.

Claude Code as an Orchestration Tool

Boris explains that Claude essentially uses all of his other tools for him through integration and orchestration. He uses Slack with Claude Code via the MCP server, along with tools like BigQuery and Sentry, all accessed through their CLIs inside Claude Code. This demonstrates Claude Code's capability as a form of orchestrator that can coordinate multiple services and tools simultaneously. The speaker shares a personal example of using Claude Code for orchestration with Notion, where Claude connects via MCP to create databases, populate them, and understand the context of video ideas. This practical example shows how Claude Code can manage content workflows and handle complex database operations through natural language instructions.

Boris's Workflow: Plan Mode First Approach

Boris reveals his actual workflow for implementing features or tasks, which always begins by switching to plan mode and clearly describing the desired outcome. Once Claude generates a plan, he continues from there after thorough review. He argues that having a good plan is extremely important and that all back-and-forth discussion with the agent should happen during plan mode. Boris only starts the agent once he has verified that the plan is actually valid and comprehensive. After plan validation, he kicks off execution in auto-accept edits mode, not the dangerously skip permissions mode, maintaining safety guardrails even during automated work.

Working at Scale with Multiple Sessions

For working at scale, Boris runs five Claude Code sessions in parallel simultaneously to maximize productivity. While working with multiple sessions, he numbers his tabs to maintain organization and track which session is which. This numbering system is crucial because when he receives notifications from different tabs, he immediately knows which terminal tab they're coming from and can respond appropriately. This parallel workflow demonstrates how experienced users can leverage Claude Code's architecture to work on multiple independent tasks or features simultaneously. The numbered tab system is a simple but effective organizational technique for managing concurrent AI-assisted development work.

Web Sessions and Background Agents

Boris heavily utilizes web sessions of Claude Code, where users connect the web version to their GitHub account, grant access to a demo repository, and Claude works on it in the cloud. His workflow involves using the background agents feature to put long-running tasks into the background, freeing him to work on other things. He even uses Claude Code on his phone, where Claude automatically performs all work in a cloud session, and when complete, creates a new branch and pushes all code changes for review. There's also a teleport command that brings cloud sessions back into the local terminal, allowing users to view and work on the code themselves. This truly exemplifies working with an AI agent as if it were a junior developer operating independently.

Why Boris Uses Opus 4.5 with Thinking Enabled

Boris reveals that he uses Opus 4.5 with thinking enabled for everything, despite it being a bigger and slower model that takes more time than Sonnet and significantly more time than Haiku models. He provides a compelling rationale: even though Opus 4.5 is slower per task, it has a much lower chance of making errors compared to smaller models. As a result, the overall time spent working with the model and steering it back on track is actually much lower with Opus 4.5. This counterintuitive insight suggests that the slower but more accurate model saves time in the aggregate by reducing the need for corrections and iterations. Quality and accuracy outweigh raw speed when considering the full development cycle.

GitHub Action Bot and PR Review Workflow

Boris uses the Claude Code GitHub action, which is essentially a bot that can be called inside authorized GitHub repositories. He provides a strong use case for this integration during the pull request review process. Whenever he finds mistakes during PR reviews, he simply asks Claude to add those mistakes to the Claude.md file automatically. This creates a continuous improvement loop where common errors get documented and prevented in future development. For long-running tasks, Boris employs three different approaches to maintain productivity. The first approach involves asking Claude to verify its work using different tests that Claude Code itself creates, then putting those tasks into the background to run independently.

Advanced Verification with Hooks and Plugins

Boris's background tasks are not cloud agents but local background processes that run silently and report back to the main agent once completed. The second approach for long-running tasks uses a stop hook to automatically trigger verification when Claude stops outputting, ensuring no work completes without validation. The third option is based on the Ralph Wiggum plugin, which the video creator covered extensively in another video. The Ralph plugin removes humans from the loop entirely during UI development by automatically testing and verifying that the UI Claude Code produces is actually correct. This represents the ultimate automation in verification, where Claude can develop, test, and validate interface changes without human intervention at each step.

Code Formatting and CI Integration

Continuing the verification pattern, Boris uses linters and formatters for various programming languages to verify the format of code being written by Claude. He acknowledges that while AI models do generate correct code, these formatters help clean up the remaining 10% of issues that might exist. This formatting verification is particularly important to prevent errors when code runs in CI (Continuous Integration) as part of their GitHub integration pipelines. Proper formatting ensures that code meets team standards and passes automated checks before merging. The combination of functional correctness from Claude and formatting perfection from automated tools creates production-ready code with minimal manual intervention.

Surprisingly Vanilla Workflow and Built-in Features

Boris reveals that his own workflow is surprisingly vanilla despite his expertise, challenging the assumption that power users need complex setups. This applies to most users who don't need to overcomplicate everything in Claude Code to be truly productive. The simple built-in features already provide about 90% of the productivity users are trying to achieve. This insight is valuable for beginners who might feel overwhelmed by advanced configurations. Boris emphasizes that mastering the fundamentals and built-in features is more important than creating elaborate custom setups. The next sections will cover these simple but powerful features that have been in Claude Code for a long time but many beginners don't know well.

Slash Commands for Inner Loop Workflows

Boris discusses /commands (slash commands) which he uses for his repetitive workflows, which he calls his 'inner loop' workflows - tasks he repeats many times daily. He provides a simple example of a GitHub /command that he uses frequently for common GitHub operations. All of these /commands are stored inside the .claude folder in a command subfolder, which means they can be committed to git version control. This is exactly how Boris shares his custom commands with his entire team, ensuring everyone has access to the same productivity shortcuts. Custom commands reduce repetitive typing and standardize common operations across the team, making collaboration more efficient and consistent.

Subagents for Different Instruction Sets

Another powerful built-in feature is subagents, which people have extensively utilized for various use cases like market researchers or product validators. However, Boris takes a more focused approach, using subagents to automate simple workflows that require completely different instructions from the main agent. Most of his subagents are used to verify that the architecture of his code is correct, to refactor code that's been written, and to validate that final builds actually work. This specialization allows different subagents to have specific expertise and instruction sets optimized for their particular verification or transformation task. By delegating specific responsibilities to subagents, Boris creates a modular system where each component excels at its designated function.

Sponsored Segment - Automata Services

The video includes a sponsored segment for Automata, the company behind the channel. After teaching millions of people how to build with AI, they started implementing these workflows themselves and discovered they could build better products faster than ever. Automata helps bring ideas to life by building apps or websites for clients who have great ideas but lack a technical team. They position themselves as a technical co-pilot, applying the same workflows they've taught millions directly to client projects. The service aims to turn concepts into real, working solutions without the headaches of hiring or managing a development team. Interested viewers are invited to reach out at hello@automata.dev to accelerate their ideas into reality.

Closing and Call to Action

The video concludes with a brief closing segment where the creator thanks viewers for watching. He mentions that viewers can support the channel and help them continue making similar educational videos by using the Super Thanks button below the video. This monetization option allows viewers to directly contribute to the channel's sustainability. The creator ends with his standard sign-off, thanking the audience again and promising to see them in the next video. This closing maintains the channel's approachable and appreciative tone while encouraging community support for continued content creation.

Community Posts

View all posts