
Memory Safe C

The PrimeTime · 6 min read

Based on The PrimeTime's video on YouTube. If you like this content, support the original creators by watching, liking and subscribing to their content.

TL;DR

Fil-C adds runtime memory-safety enforcement to C by instrumenting LLVM IR and using hidden shadow metadata to validate pointer provenance and bounds.

Briefing

Fil-C ("memory-safe C") is pitched as a practical way to keep C's programming model while blocking a large class of memory-corruption exploits: it adds runtime bounds and pointer-validity checks that stop a program when it would otherwise read or write out of bounds or use freed memory. The core pitch is that it targets the same root causes behind many real-world vulnerabilities (such as unchecked indexing leading to arbitrary reads/writes), but aims to preserve C's existing logic and behavior rather than forcing a full rewrite into a different language.

A key mechanism described is that Fil-C tracks allocated objects and attaches hidden metadata to pointers so the runtime can verify not only that an access is in bounds, but also that a pointer is being used with the specific allocation it originated from. That hidden metadata lives in a separate "shadow" region that normal C code cannot directly access. When a program attempts an invalid access, such as indexing past an array's upper bound, Fil-C's checks halt the program instead of letting the bug become exploitable memory corruption. The transcript also emphasizes that freeing is handled differently: calling free marks memory as freed, and an asynchronous garbage collector reclaims it later, which lets the system remember enough about allocations to enforce safety.
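The provenance-and-bounds check described above can be sketched in plain C. This is an illustrative model only, assuming a simple side table; the real system keeps its metadata in a shadow region the program cannot reach, and all names here (`alloc_meta`, `meta_alloc`, `meta_check`) are invented for this sketch.

```c
#include <stdint.h>
#include <stdlib.h>

/* Illustrative model of "shadow" metadata: a side table mapping each
 * allocation to its base, size, and liveness. NOT the real
 * implementation; it only sketches the kind of check described above. */
#define MAX_TRACKED 64

typedef struct {
    void  *base;   /* start of the allocation */
    size_t size;   /* its length in bytes */
    int    live;   /* 0 once free() has been called on it */
} alloc_meta;

static alloc_meta shadow_table[MAX_TRACKED];
static int tracked = 0;

void *meta_alloc(size_t size) {
    void *p = malloc(size);
    if (p && tracked < MAX_TRACKED)
        shadow_table[tracked++] = (alloc_meta){ p, size, 1 };
    return p;
}

/* Returns 1 only if `ptr` addresses `access_size` bytes that lie inside
 * the allocation `origin` came from, and that allocation is still live.
 * This is both a bounds check and a provenance check: a pointer is
 * validated against its own allocation, not against memory in general. */
int meta_check(void *origin, void *ptr, size_t access_size) {
    for (int i = 0; i < tracked; i++) {
        if (shadow_table[i].base == origin) {
            if (!shadow_table[i].live)
                return 0;                      /* use-after-free */
            uintptr_t b = (uintptr_t)shadow_table[i].base;
            uintptr_t q = (uintptr_t)ptr;
            return q >= b
                && access_size <= shadow_table[i].size
                && q - b <= shadow_table[i].size - access_size;
        }
    }
    return 0;                                  /* unknown provenance */
}
```

In a checked build, the compiler would emit a call like `meta_check` before every load and store: indexing element 3 of a 4-element int array passes, while element 4 fails and the runtime halts the program.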

Tradeoffs are central to the discussion. Runtime checking and garbage collection carry performance penalties, with estimates ranging from about 1.2× to 4× slowdown, plus increased memory usage. The conversation also pushes back on the idea that "sanitizers are enough." AddressSanitizer (ASAN) is framed as a development-time tool for making crashes surface quickly during fuzzing and CI, not as an exploit mitigation, because it relies on deterministic shadow patterns that attackers could work around. Fil-C, by contrast, is described as a runtime enforcement model: it aims to make certain memory-unsafe behaviors impossible during execution, using compiler-level LLVM IR instrumentation inspired by hardware tagging concepts.

Security motivations come into focus through sudo (a setuid-root utility) and the push to rewrite critical binaries. The transcript notes that even Rust rewrites can still ship vulnerabilities (including recent CVEs), because rewriting doesn't automatically eliminate logic errors or behavioral mismatches. That leads to the "middle ground" argument: instead of rewriting everything in a new language, add memory-safety enforcement to the existing C code so battle-tested logic stays intact while memory corruption is constrained.

Still, Fil-C isn't presented as a universal replacement for Rust. Rust's advantages are framed as compile-time guarantees and stronger type/ownership constraints that can prevent entire bug classes earlier in development, especially around resource lifetimes and "handle" patterns. Fil-C is also said to have compatibility constraints with the LLVM toolchain and to complicate certain C allocation patterns (like suballocating from arenas), though the creator indicates arena allocators can be adapted by routing allocations through individual malloc-like calls.

Overall, the discussion lands on a pragmatic conclusion: Fil-C is most compelling when C must remain (legacy code, performance-sensitive systems, or environments where rewrites are unrealistic), and when memory-safety enforcement is worth the runtime and memory costs. Rust remains attractive for greenfield projects where compile-time guarantees and type-driven correctness are priorities.

Cornell Notes

Fil-C aims to make C memory-safe at runtime without rewriting programs into another language. It instruments C via LLVM IR so pointers carry hidden metadata (stored in a shadow region), letting the runtime verify both bounds and pointer provenance; invalid accesses cause the program to halt rather than continue into undefined behavior. It also changes memory reclamation by marking frees and relying on an asynchronous garbage collector, which helps preserve safety checks for use-after-free scenarios. The tradeoff is real overhead: estimates in the discussion range from about 1.2× to 4× slowdown and higher memory usage. The debate contrasts this runtime enforcement with ASAN (a fuzzing/CI crash-detection tool) and with Rust (compile-time guarantees and type/ownership constraints).

What does Fil-C try to prevent, and how does it decide an access is invalid?

It targets memory-corruption patterns such as out-of-bounds indexing and use-after-free. The runtime checks go beyond "within the array's bounds" by also verifying that the pointer being used corresponds to the allocation it originated from. That provenance check is enabled by hidden metadata stored in a shadow region: the pointer value a program manipulates looks like a normal C pointer, but Fil-C consults the shadow metadata to confirm the access is permitted. When the check fails, execution stops rather than allowing arbitrary reads/writes.
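Provenance matters because an out-of-bounds pointer can land inside some *other* live allocation, where an address-only validity check would wave it through. A minimal, deterministic demonstration (two adjacent 16-byte "allocations" simulated inside one buffer; `points_into` is an invented helper):

```c
#include <stddef.h>
#include <stdint.h>

/* Invented helper: does `ptr` fall inside [base, base + size)? This is
 * what a naive address-validity check amounts to. */
int points_into(const void *base, size_t size, const void *ptr) {
    uintptr_t b = (uintptr_t)base, q = (uintptr_t)ptr;
    return q >= b && q < b + size;
}

/* Simulate two adjacent 16-byte allocations inside one buffer so the
 * memory layout is deterministic. A pointer derived from `obj_a` that
 * runs 4 bytes past its end lands inside `obj_b`: the address is
 * "valid" memory, but the access violates obj_a's provenance. */
static char heap_sim[32];
static char *obj_a = heap_sim;       /* "allocation" a: bytes 0..15  */
static char *obj_b = heap_sim + 16;  /* "allocation" b: bytes 16..31 */
```

A provenance-aware runtime asks "is this pointer inside the allocation it was derived from?" and rejects `obj_a + 20`, even though the address itself is readable; that is exactly the gap arbitrary-read/write exploits walk through.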

Why is ASAN described as insufficient for exploit mitigation?

ASAN is portrayed as a development-time sanitizer: it places deterministic "shadow" patterns in memory so memory bugs crash quickly during fuzzing or CI. Because the patterns are predictable, an attacker could potentially craft inputs to avoid triggering ASAN while still violating memory safety. Fil-C is framed as different in spirit: it enforces a runtime safety model so that certain unsafe behaviors cannot proceed during execution.
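The "deterministic shadow pattern" point can be made concrete. ASAN's documented shadow mapping on x86-64 Linux is `Shadow = (Addr >> 3) + 0x7fff8000`, with one shadow byte describing the poisoning state of eight application bytes. Because the mapping is fixed and publicly known, there is no secret an attacker must guess, which is fine for making bugs crash in CI but is not a hardening boundary:

```c
#include <stdint.h>

/* ASAN's documented shadow mapping on x86-64 Linux: each shadow byte
 * describes 8 application bytes at a fixed, publicly known offset.
 * (Sketch for illustration; the sanitizer runtime computes this
 * internally on every instrumented load and store.) */
uintptr_t asan_shadow_addr(uintptr_t addr) {
    return (addr >> 3) + 0x7fff8000UL;
}
```

An exploit that only ever touches bytes whose shadow entries are unpoisoned (for example, by jumping over a redzone with a controlled offset) never triggers the check, which is the gap the discussion is pointing at.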

How does garbage collection relate to memory safety in Fil-C?

Calling free doesn’t immediately make memory disappear. Instead, free marks memory as freed, and an asynchronous garbage collector later reclaims it. That delay helps the system keep enough allocation state to enforce checks, particularly for use-after-free and related lifetime issues. The goal is to ensure that pointers to freed regions can’t be used to access memory without being detected.
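That "mark now, reclaim later" flow can be sketched as a quarantine. This is a toy model with invented names (`safe_alloc`, `safe_free`, `is_live`, `sweep`); the actual system uses an asynchronous collector rather than an explicit sweep call:

```c
#include <stdlib.h>

/* Toy model of deferred reclamation: free() only marks a block; a later
 * sweep actually returns memory to the allocator. Until then, the
 * runtime still remembers the block, so a use of a freed pointer is
 * detected instead of silently hitting recycled memory. */
#define MAX_BLOCKS 64

typedef struct { void *p; int freed; } block_rec;
static block_rec blocks[MAX_BLOCKS];
static int n_blocks = 0;

void *safe_alloc(size_t size) {
    void *p = malloc(size);
    if (p && n_blocks < MAX_BLOCKS)
        blocks[n_blocks++] = (block_rec){ p, 0 };
    return p;
}

void safe_free(void *p) {              /* mark, don't reclaim */
    for (int i = 0; i < n_blocks; i++)
        if (blocks[i].p == p) blocks[i].freed = 1;
}

int is_live(void *p) {                 /* would an access be allowed? */
    for (int i = 0; i < n_blocks; i++)
        if (blocks[i].p == p) return !blocks[i].freed;
    return 0;
}

void sweep(void) {                     /* stand-in for the async collector */
    for (int i = 0; i < n_blocks; i++)
        if (blocks[i].freed && blocks[i].p) {
            free(blocks[i].p);
            blocks[i].p = NULL;        /* record persists as "reclaimed" */
        }
}
```

The key property: between `safe_free` and `sweep`, the metadata still exists, so a stale pointer fails the liveness check deterministically rather than racing the allocator's reuse of that memory.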

Why do some participants argue rewriting critical C utilities into Rust isn’t automatically safer?

Even when a program is rewritten in Rust, logic errors and behavioral mismatches can still introduce vulnerabilities. The transcript uses sudo as an example: sudo is a frequent target for vulnerability research because it runs as root via setuid. The discussion notes that Rust implementations of sudo have still produced CVEs, suggesting that memory safety alone doesn't eliminate other classes of bugs.

Where does Fil-C fit relative to Rust's compile-time guarantees?

Fil-C is positioned as a runtime enforcement layer for memory safety, while Rust is positioned as stronger on compile-time prevention via ownership/borrowing and type-driven constraints. The transcript highlights that Rust can prevent certain lifetime/escape problems at compile time (e.g., preventing a handle from being used outside a callback region). Fil-C, by contrast, focuses on stopping invalid memory accesses during execution, not on proving correctness at compile time.
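The "handle escaping a callback region" point can be shown with a pattern C compiles without complaint but Rust's borrow checker rejects outright. All names here are invented for illustration; the dereference after the callback returns is the bug Rust prevents at compile time and a runtime-checked build can only catch at execution time:

```c
/* C happily compiles a callback-scoped handle escaping its region.
 * The equivalent Rust program (a closure stashing a &mut borrow into a
 * longer-lived variable) fails to compile. */
static int *stashed_handle;   /* the escape hatch C does not stop */
static int seen_value;

void with_handle(void (*cb)(int *)) {
    int resource = 42;        /* handle is valid only while cb runs */
    cb(&resource);
}                             /* resource's storage is gone here */

void leaky_callback(int *handle) {
    seen_value = *handle;     /* fine: still inside the callback */
    stashed_handle = handle;  /* escapes: dereferencing this later is
                                 undefined behavior that only runtime
                                 checking (or Rust's compiler) catches */
}
```

This is the tradeoff in miniature: runtime enforcement turns the later `*stashed_handle` into a deterministic halt, while Rust refuses to build the program at all.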

What compatibility or porting issues come up for existing C code?

Fil-C is said to require compatibility with the LLVM toolchain. Porting can also be tricky for allocation patterns like arena allocators and suballocation, because pointer provenance and bounds tracking become harder when many sub-objects share a larger allocation region. The creator indicates that a practical approach is to route arena allocations through individual malloc-like calls (often already abstracted behind macros), so the safety metadata can be tracked per allocation.
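When the arena API is already behind a macro, that rerouting can be a one-line build-flag switch. A sketch under stated assumptions: `MEMSAFE_BUILD` is an invented stand-in for the real build flag, and `arena_bump` is a typical bump-pointer allocator, not any particular project's code:

```c
#include <stdlib.h>

/* A typical bump-pointer arena: one big buffer, suballocated linearly.
 * Under provenance tracking this is awkward, because every sub-object
 * shares the single allocation's metadata. */
typedef struct {
    char  *buf;
    size_t cap, used;
} arena;

void *arena_bump(arena *a, size_t size) {
    if (a->used + size > a->cap) return NULL;
    void *p = a->buf + a->used;
    a->used += size;
    return p;
}

/* Because call sites already go through a macro, a safety-instrumented
 * build can reroute every sub-allocation through malloc, giving the
 * runtime one metadata record (bounds + provenance) per object instead
 * of one per arena. MEMSAFE_BUILD is a hypothetical flag name. */
#ifdef MEMSAFE_BUILD
#define ARENA_ALLOC(a, size) malloc(size)
#else
#define ARENA_ALLOC(a, size) arena_bump((a), (size))
#endif
```

Call sites are unchanged; only the macro's expansion differs between the fast build and the checked build, which is why the porting cost stays low for codebases that already abstract their arenas.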

Review Questions

  1. If an attacker could avoid ASAN triggers, what does that imply about ASAN’s role in real-world exploit mitigation versus fuzzing effectiveness?
  2. What additional safety property does Fil-C aim to enforce beyond simple bounds checking, and why does that matter for arbitrary read/write exploits?
  3. In what kinds of projects would the runtime and memory overhead of Fil-C be hardest to justify, and what alternative safety strategy might be preferred instead?

Key Points

  1. Fil-C adds runtime memory-safety enforcement to C by instrumenting LLVM IR and using hidden shadow metadata to validate pointer provenance and bounds.

  2. Invalid memory accesses are designed to halt/crash the program, preventing many memory-corruption bugs from turning into exploitable arbitrary reads/writes.

  3. free is decoupled from immediate reclamation: memory is marked freed and later reclaimed by an asynchronous garbage collector to support use-after-free detection.

  4. Performance and memory overhead are significant, with estimates discussed around 1.2×–4× slowdown and increased memory usage.

  5. ASAN is framed as a fuzzing/CI crash-detection tool rather than a robust exploit mitigation, because its deterministic patterns can be worked around.

  6. Rust rewrites can still produce vulnerabilities due to logic errors and behavioral mismatches; memory safety alone doesn't guarantee correctness.

  7. Fil-C is presented as a "middle ground" when rewriting C to Rust is unrealistic, but it has compatibility and porting constraints (notably around LLVM integration and some allocation patterns).

Highlights

Fil-C's safety checks are described as provenance-aware: it verifies not just "in bounds," but whether a pointer is being used with the allocation it originally came from.
ASAN is portrayed as useful for finding bugs quickly during fuzzing, but not as a dependable exploit mitigation because its shadow patterns are predictable.
The sudo discussion frames Fil-C as a way to preserve battle-tested logic while adding memory-safety enforcement, avoiding the risk of new vulnerabilities from full rewrites.
The biggest practical barrier is cost: participants cite overhead ranging from roughly 1.2× to 4× and higher memory consumption.
Arena/suballocation patterns can be adapted by routing allocations through malloc-like calls so Fil-C can track provenance per allocation.

Topics

Mentioned

  • Casey Muratori
  • Lowlevel
  • Azam Chad
  • Tee
  • Deb
  • TJ
  • Ed
  • ASAN
  • LLVM
  • IR
  • ASLR
  • CVE
  • DoS
  • CPU
  • ARM
  • x64
  • CI