AI Skeptic Friends
Based on ThePrimeTime's video on YouTube. If you like this content, support the original creators by watching, liking, and subscribing to their content.
Briefing
AI-assisted coding is drawing both hype and backlash, but the most consistent through-line is a split between “faster coding” and “free coding.” One side treats LLMs as a fancy autocomplete that can accelerate output while still leaving developers responsible for correctness, security, and long-term maintainability. The other side argues that modern progress comes less from raw model brilliance and more from agents—systems that can inspect a codebase, run tools, compile, test, and iterate—turning language models into practical engineering workflows.
A central skepticism targets the idea that LLMs will erase engineering jobs quickly. The argument isn’t that automation is impossible; it’s that software work is liability-heavy: every line of code creates risk, and generating more code doesn’t automatically create a strategic moat. Even if hardware and models improve dramatically, the work of building secure, reliable systems still demands judgment. The discussion also pushes back on “AI will take over everything” narratives, suggesting that the real timeline is tied to practical constraints—security, testing, integration, and the messy realities of production systems.
Where the conversation becomes concrete is in how serious LLM coding is actually done today. Instead of pasting prompts into a chat window and manually copying results, the more effective approach uses agents that can poke around a repository, author files, run Unix tools, interact with git, and call linters, formatters, and model checkers via tooling such as the Model Context Protocol (MCP). The key claim: hallucinations become less dangerous when the agent has guardrails (tests, compilers, and feedback loops), so invented details get caught and corrected. One example is an agent attempting to call a nonexistent API method (a chessboard “config” call), then revising after checking the official documentation.
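To make the guardrail idea concrete, here is a minimal sketch of such a feedback loop, assuming a project whose checks run under ruff and pytest; `generate_patch` is a hypothetical stand-in for whatever LLM call edits the repository, not an API from the video.

```python
import subprocess

MAX_ATTEMPTS = 3

def run_checks() -> tuple[bool, str]:
    """Run the project's ground-truth checks (linter, then tests).

    Any tool with a pass/fail exit code works here: a compiler,
    a type checker, a formatter in check mode, or a test suite.
    """
    for cmd in (["ruff", "check", "."], ["pytest", "-q"]):
        result = subprocess.run(cmd, capture_output=True, text=True)
        if result.returncode != 0:
            return False, result.stdout + result.stderr
    return True, ""

def agent_loop(task: str, generate_patch) -> bool:
    """Generate code, validate it against real tools, feed errors back.

    A hallucinated API surfaces as a concrete lint/test failure,
    which becomes the feedback for the next generation attempt.
    """
    feedback = ""
    for _ in range(MAX_ATTEMPTS):
        generate_patch(task, feedback)   # model edits files in the repo
        ok, feedback = run_checks()      # compilers/tests are ground truth
        if ok:
            return True
    return False                         # escalate to a human reviewer
```

The design point is the loop itself: the model's output is never trusted directly, only the exit codes of tools that cannot hallucinate.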
Still, the skepticism doesn’t vanish. Participants warn that “practically solved” doesn’t mean “solved,” especially for complex UI testing and production incidents where LLMs lack direct access to live systems. There’s also a recurring concern about security and compliance as code generation scales: risks to session integrity and cryptographic tamper resistance grow, and the likelihood of subtle vulnerabilities increases when teams ship faster. Another major friction point is intellectual property and licensing. Developers worry that LLMs ingest code and snippets from public repositories, potentially creating downstream licensing obligations (e.g., GPL-style requirements) that teams may ignore until they become a legal or architectural problem.
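As one illustration of the session-integrity concern, the sketch below signs and verifies a session token with an HMAC using only the Python standard library. This is a generic pattern chosen for illustration, not a scheme from the discussion, and the key handling is deliberately simplified.

```python
import hashlib
import hmac
import secrets

SECRET_KEY = secrets.token_bytes(32)  # in practice: loaded from a secret store

def sign_session(session_id: str) -> str:
    """Attach an HMAC tag so the token cannot be altered undetected."""
    tag = hmac.new(SECRET_KEY, session_id.encode(), hashlib.sha256).hexdigest()
    return f"{session_id}.{tag}"

def verify_session(token: str) -> str | None:
    """Return the session id if the tag checks out, else None."""
    session_id, _, tag = token.rpartition(".")
    expected = hmac.new(SECRET_KEY, session_id.encode(), hashlib.sha256).hexdigest()
    # compare_digest avoids the timing side channel a naive == would leak
    return session_id if hmac.compare_digest(tag, expected) else None
```

Details like the constant-time comparison are exactly the kind of subtlety that tends to get lost when generated code ships without careful review.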
The debate also turns philosophical: some argue that LLMs will reduce tedious boilerplate work (like OAuth/OpenID flows or repetitive login plumbing), freeing humans to focus on higher-level architecture. Others counter that speed can erode craftsmanship, pushing teams toward “enshittification”: more abstraction, more automation, and less understanding. Even so, the mood is not purely anti-LLM. Many participants say they’re actively using tools like Cursor and experimenting with agent modes, while insisting that developers must still curate, review, and enforce standards. The emerging consensus is pragmatic: agents can reduce drudgery and catch certain classes of errors, but software engineering remains a discipline of judgment, verification, and responsibility, especially when security, testing, and licensing enter the picture.
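For a sense of the boilerplate in question, this is roughly the shape of the first step of an OAuth 2.0 authorization-code flow. The parameter names follow the generic spec; the endpoint, client id, and redirect URI are placeholders, not values from the discussion.

```python
import secrets
from urllib.parse import urlencode

# Placeholder values; a real integration reads these from configuration.
AUTH_ENDPOINT = "https://auth.example.com/authorize"
CLIENT_ID = "my-client-id"
REDIRECT_URI = "https://app.example.com/callback"

def build_authorization_url() -> tuple[str, str]:
    """Build the redirect that starts an OAuth 2.0 authorization-code flow.

    Returns the URL plus the `state` value, which the callback handler
    must later compare against to reject forged responses (CSRF protection).
    """
    state = secrets.token_urlsafe(16)
    params = {
        "response_type": "code",
        "client_id": CLIENT_ID,
        "redirect_uri": REDIRECT_URI,
        "scope": "openid profile",
        "state": state,
    }
    return f"{AUTH_ENDPOINT}?{urlencode(params)}", state
```

Plumbing like this is repetitive and spec-driven, which is why it is often cited as a good fit for generation, provided a human still reviews details such as the `state` check.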
Cornell Notes
The discussion draws a hard line between LLMs as fast text generators and LLMs as practical engineering assistants. Skepticism centers on “liability”: generating code quickly doesn’t make software risk-free, and it doesn’t automatically create durable advantages. The pro case focuses on agents—systems that can navigate a codebase, run tools, compile, test, and iterate—so hallucinations get corrected through feedback loops. Even with agents, the group flags unresolved risks: security gaps, weak testing for complex UI, and licensing/IP concerns from training data and copied snippets. Net effect: LLMs may reduce tedious work, but developers still own correctness, guardrails, and long-term maintainability.
- Why do some participants call AI coding “liability,” not “strategy”?
- What separates “chat-based coding” from “serious LLM-assisted programming” in this conversation?
- How do agents reduce the harm of hallucinations?
- What security and correctness risks remain even when agents are used?
- Why does licensing/IP come up as a “real” concern rather than a theoretical one?
- What’s the practical “positive case” for LLMs that the skeptics still accept?
Review Questions
- What does “liability” mean in the context of AI-generated code, and why doesn’t faster generation automatically solve it?
- How do agents change the hallucination story compared with using an LLM through chat alone?
- Which remaining risks—security, testing, or licensing—do participants treat as most likely to matter in the near term?
Key Points
1. Fast LLM output doesn’t eliminate engineering risk; every generated line of code adds liability unless teams enforce verification and security guardrails.
2. The biggest practical gains come from agent workflows that can run tools, compile, test, and iterate—not from chat-only copy/paste coding.
3. Hallucinations aren’t gone, but agents can make them less dangerous by validating against ground truth like compilers, linters, tests, and documentation.
4. Security and testing remain hard problems, especially for complex UI behavior and production incident scenarios where automated checks may be insufficient.
5. Licensing/IP concerns are treated as operationally real because generated code can reflect repository code with obligations that teams may not track.
6. LLMs are widely viewed as useful for removing tedious boilerplate (e.g., login/auth plumbing), but developers still must curate, review, and maintain the result.
7. The debate isn’t only about AGI timelines; it’s about near-term software quality, security, and compliance as adoption accelerates.