Quality anyone?

Englishالعربية Deutsch Español Français हिन्दी Bahasa Indonesia 日本語 한국어 Português Русский 中文

MMaximilian Schwarzmüller

Computing/SoftwareJob SearchInternet Technology

Transcript

00:00:00Cursor announced the release of GPT 5.2 Codex in Cursor, a new frontier model for long running

00:00:07tasks. But that's actually not the main point of my video. Instead, the main point of my video here

00:00:14is about this post here by Michael Truel, the CEO of Cursor, where he mentions that they used this

00:00:22model, I assume at least, they're mentioning GPT 5.2 here, not Codex, but I guess he means Codex,

00:00:28that they used this model to build a browser with AI, with just AI, as I understand it,

00:00:35because it ran uninterrupted for one week. So the AI in Cursor ran for one week and built a browser.

00:00:43It wrote three plus millions lines of code across thousands of files and the rendering engine it

00:00:49wrote was written from scratch and it handles HTML parsing, CSS cascade, all the things you would

00:00:55expect from a browser, I assume. However, there then is one important restriction. It kind of

00:01:04works. And I totally get where the Cursor team is coming from here. It is impressive that just AI on

00:01:13its own wrote a browser that for the most part works. However, even though I never built a browser

00:01:19and probably never will, it's probably fair to say that it's all the parts that take it from 80 to 100%

00:01:27that are complex. And that's not just true for browsers. If you have built anything in your life,

00:01:34even outside of coding, you know that for most projects, the difficult part starts once you're

00:01:4080% done. And I'm not even talking about the marketing and so on, which is super difficult.

00:01:45I'm talking about just building. And for many projects, for many software, you don't need to

00:01:51get to 100%, but 80% or 70% might not cut it. And it is that extra bit that can be super hard

00:02:00to achieve and where AI might not get you to. Just AI, I mean. And I want to be very clear here

00:02:08because it's easy to misinterpret or misunderstand that video. I am 100% positive on AI. I use it all

00:02:16the time. For example, buildmygraphic.com has most of its code written by AI. Not vibe coded though,

00:02:23instead with my instructions, with myself reviewing the code, with myself going into the code and

00:02:30tweaking stuff when it needs tweaking. But I used a lot of AI for this site. I also just released

00:02:38a huge update for my AI for developers course where I walk you through using GitHub Copilot and Cursor

00:02:44efficiently and explore the different features they offer to help you get more out of AI. Because

00:02:49I believe and I've shared that in other videos too, AI is the future for developers. It's a super

00:02:56useful tool and using it heavily and efficiently will be vital. That is something I'm totally

00:03:03convinced of. I'm not so much convinced that vibe coding in its purest form will get us there. And

00:03:09that's probably worth explaining. Because there is a spectrum, I would say, between vibe coding and

00:03:18agentic engineering. Well, of course, you could also say there also is not using AI at all. But

00:03:23again, I'm convinced you should use AI. And the question is where on that spectrum are you? Are you

00:03:29here? Are you here? Are you in the middle? And you can't be anywhere there. But there are different

00:03:37trade offs or use cases, I would say. The question also is how you define vibe coding. Vibe coding,

00:03:45as I understand it, is all about letting AI write the code, having no code reviews, having no

00:03:52understanding of the code base, and also passing no code specific instructions like use this pattern or

00:04:00use this package. So really not knowing anything about the code. That's 100% vibe coding, as I

00:04:08would define it. And there definitely are different other definitions out there as well. That's just

00:04:13what I mean with vibe coding. This form of coding does not have a future, in my opinion,

00:04:21for commercial products for real products. It can be great, however, for other things for other kinds

00:04:30of products. So vibe coding, for example, can be great, I would say for personal utility tools,

00:04:36or for throw away software. So something which you use once or twice and don't care about too much,

00:04:46or maybe also for free software, where you don't really charge people money and therefore it doesn't

00:04:54really matter if it works that well. You could make these arguments and I would say these are use cases

00:04:58where pure vibe coding is viable. You can absolutely use AI to just request a script that does something

00:05:06and you don't care if it covers all edge cases, if it maybe has some potential bugs,

00:05:12because if it gets the job done for you, you're happy. That is absolutely fine. And you can do

00:05:19vibe coding fine. Now on the other hand of the spectrum, we have agentic engineering. And with

00:05:26agentic engineering, which is what I do and what I think is the future, you use AI as a tool. This

00:05:33does not mean that you use it just for the dumb tasks that can include complex tasks. Very important

00:05:43to me because it's easy to get this wrong, but this can include complex tasks. But it means that you

00:05:50have clear instructions regarding patterns, libraries, etc. you want to use. It also means

00:06:00that you do review the code in one way or the other, can also include automated reviews with

00:06:05help of other AI tools, but you will look at the code from time to time to understand what's

00:06:12going on. And it also means that you get into the code yourself when the AI gets stuck or when you

00:06:20want to get it started with a certain implementation where you know how a certain interface or should

00:06:26look like or which pattern you want to use so that the AI can then finish your thoughts. So to say,

00:06:32I would say this is the future. This year, agentic engineering, that is my future at least. And of

00:06:39course, I could be wrong here. Maybe in a couple of years, vibe coding is the only way because the AI

00:06:46is so good that it can do everything. I don't think it will, but it absolutely could. I think the only

00:06:54wrong decision right now, however, is to not be anywhere on this spectrum. You should be anywhere

00:07:01here. You should definitely use AI. And I've shared that in other videos. However, coming back to this

00:07:07post, I have a problem with that kind of works thing. And I understand it as mentioned here in

00:07:13the context of this cursor post. It's also worth noting that clearly the cursor team kind of wants

00:07:18to shift the narrative or maybe gain more visibility again, especially on X where the past weeks have

00:07:26been dominated by developers using Claude code with the Rolf loop to let AI build everything in the end

00:07:33in a vibe coding inspired way. It makes sense that the cursor team wants to show that you can use

00:07:39cursor too, to do long running tasks with AI and let AI build software autonomously, because that

00:07:47clearly is something that's gaining a lot of visibility right now, especially on X. So I totally

00:07:52get this. And again, cursor is an amazing tool. I want to be very clear regarding that. I just have

00:07:58a problem with this kind of works attitude because I think it's accelerating. It's becoming more and

00:08:05more a thing now with AI. And we've seen it for years. We've seen it long before AI that operating

00:08:13systems like iOS or Windows got worse. They're full of bugs. You can see it in video games, which

00:08:19are often unplayable on day one. You can see it in so much software. It has nothing to do with AI.

00:08:26Software quality got worse. And I get it. We can iterate quickly. You can patch things up. That's

00:08:33kind of the mindset that developed over the last 15 years or so. And that is the mindset I see

00:08:40continuing and accelerating now with AI, because with AI you can patch things up quickly, of course.

00:08:47And if you are wipe coding, for example, then you might not care too much about bugs because you can

00:08:54fix them in an instant anyways. And having a horrible code quality in your code base might

00:08:59not matter because no human needs to get in there. The AI can figure it out and fix it. And if your

00:09:06fix is a bunch of if statements to fix all the different things that could go wrong instead of

00:09:11one clean implementation, that might not matter. And again, that is absolutely one future we could

00:09:18have. I don't think it's the future. I certainly don't hope it's the future, but we could have that

00:09:25as a future. But I also think that as developers, as companies building software, there will be a

00:09:32real market for high quality software, software that's not broken on day one, software that's not

00:09:40shitty. And you could use AI to build better software too. There is no law that forces you

00:09:46to move quick and sacrifice software quality. You can use AI to build better software, to get the

00:09:53best out of both worlds, to combine your skills with AI, to use AI as an extra pair of eyes to

00:10:00look over your code. And I would hope that we kind of go more into that direction because I believe

00:10:08that whilst the majority probably won't, valuable opportunities will open up for companies and

00:10:15developers that do build high quality software and that do try to get the best out of both worlds.

Key Takeaway

While AI can autonomously write millions of lines of code, the future of software development lies not in pure vibe coding but in agentic engineering where developers use AI as a tool with clear instructions and quality oversight to build reliable, high-quality products.

Highlights

Cursor announced GPT 5.2 Codex, which autonomously built a browser over one week, writing 3+ million lines of code across thousands of files
The AI-built browser 'kind of works' but highlights the challenge that the final 20% of any project (from 80% to 100% completion) is often the most difficult part
Vibe coding (letting AI write code with no reviews or understanding) may work for personal tools or throwaway software, but not for commercial products
Agentic engineering (using AI as a tool with clear instructions, code reviews, and developer involvement) is presented as the sustainable future approach
Software quality has been declining across industries (iOS, Windows, video games) and AI could accelerate this trend if developers prioritize speed over quality
There's a market opportunity for developers who use AI to build high-quality software rather than just moving fast and sacrificing quality
The speaker uses AI extensively (buildmygraphic.com is mostly AI-written) but with human oversight, code reviews, and manual tweaking when needed

Timeline

Cursor's GPT 5.2 Codex Announcement and Browser Demo

Cursor announced the release of GPT 5.2 Codex, a new frontier model designed for long-running tasks. The CEO Michael Truel revealed that they used this model to build a browser entirely with AI, running uninterrupted for one week. The AI wrote over 3 million lines of code across thousands of files and created a rendering engine from scratch that handles HTML parsing, CSS cascade, and other browser fundamentals. However, there's one critical caveat: the browser only 'kind of works,' which becomes the central theme of the video's argument about software quality and AI development.

The 80/20 Rule and Software Development Complexity

The speaker argues that while it's impressive AI built a working browser, the real complexity in any project lies in the final 20% that takes it from 80% to 100% completion. This principle applies not just to browsers but to virtually any project in life. For many software products, achieving only 70-80% functionality isn't sufficient, and that extra bit can be extraordinarily difficult to achieve. The speaker suggests that AI alone might not be capable of delivering that crucial final quality level. This sets up the core tension between impressive AI capabilities and the reality of production-ready software requirements.

Personal AI Usage and Clarifying the Stance

The speaker emphasizes being 100% positive on AI and uses it extensively in their own work. They cite buildmygraphic.com as an example, where most code was written by AI, but crucially with human instructions, code reviews, and manual tweaking when necessary. They recently released a major update to their 'AI for Developers' course covering GitHub Copilot and Cursor features. The speaker firmly believes AI is the future for developers and that using it heavily and efficiently will be vital. This section establishes that the critique is not about AI usage itself, but about how developers approach AI integration in their workflow.

The Spectrum: Vibe Coding vs Agentic Engineering

The speaker introduces a spectrum between vibe coding and agentic engineering. Vibe coding is defined as letting AI write code with no code reviews, no understanding of the codebase, and no specific instructions about patterns or packages—essentially zero developer knowledge or involvement. The speaker argues this pure form of vibe coding does not have a future for commercial or real products. However, they acknowledge different definitions exist and that their definition represents the most extreme form. The key point is that developers should position themselves somewhere on this spectrum of AI usage rather than avoiding AI entirely.

Valid Use Cases for Pure Vibe Coding

The speaker identifies legitimate scenarios where pure vibe coding is viable: personal utility tools, throwaway software used once or twice, and potentially free software where quality standards are lower because users aren't paying. In these contexts, it doesn't matter if the code covers all edge cases or has potential bugs, as long as it gets the immediate job done. For example, requesting a quick script to accomplish a one-time task is perfectly appropriate for vibe coding. This nuanced perspective shows the speaker isn't dismissing vibe coding entirely, but rather defining appropriate contexts for different approaches to AI-assisted development.

Agentic Engineering as the Sustainable Future

Agentic engineering is presented as the future approach, where AI is used as a tool rather than autonomous creator. This methodology includes giving AI clear instructions about patterns and libraries, reviewing code regularly (potentially with automated AI-assisted reviews), and having developers directly engage with code when AI gets stuck or when establishing initial implementations. The speaker emphasizes that this doesn't mean AI only handles trivial tasks—complex tasks are absolutely included—but rather that human oversight and architecture decisions remain central. The speaker acknowledges they could be wrong and pure vibe coding might become viable in a few years if AI improves dramatically, but currently believes agentic engineering is the right path.

Context of Cursor's Announcement and Market Positioning

The speaker analyzes Cursor's motivation for this announcement, noting that recent weeks have been dominated by developers using Claude Code with the ROLF loop for vibe coding-style autonomous development. Cursor's demo appears to be an effort to shift the narrative and show they can also handle long-running autonomous tasks. The speaker acknowledges Cursor is an amazing tool and understands the competitive positioning, but expresses concern about the 'kind of works' attitude. They emphasize the only wrong decision is not using AI at all—developers should be somewhere on the spectrum between vibe coding and agentic engineering.

The Declining Software Quality Problem

The speaker identifies a broader software quality crisis that predates AI: operating systems like iOS and Windows becoming buggier, video games being unplayable on launch day, and general software quality degradation over 15+ years. This decline stems from an 'iterate quickly and patch later' mindset that developed in the industry. The speaker warns that AI is accelerating this trend because it enables even faster patching and may lead developers to care less about bugs (since they can be fixed instantly) or code quality (since humans don't need to read it). Vibe coding could result in codebases full of messy if-statement patches rather than clean implementations, representing a dystopian future for software quality.

Market Opportunity in High-Quality Software

The speaker concludes with an optimistic vision: there will be a genuine market for high-quality software that isn't broken on release. There's no requirement to sacrifice software quality for speed—AI can be used to build better software by combining human skills with AI capabilities. Developers can use AI as an extra pair of eyes for code review and quality assurance. While the speaker expects most companies will continue prioritizing speed, they believe valuable opportunities will emerge for companies and developers who use AI to achieve the best of both worlds: high quality and efficient development. This represents a call to action for developers to resist the race-to-the-bottom in software quality and instead leverage AI for excellence.

Community Posts

Write about this video