LLMs are going to produce amazing Rube Goldberg-style vulnerabilities for years to come. It's already starting; this instance isn't such a case, but it's happening.
Maybe it's physically impossible to build a theoretically secure system, just as it's (presumably) impossible to have a cell that isn't susceptible to any virus. Maybe this whole time we've been getting away with a type of security by obscurity, where the obscurity is just no one having the time and focus to actually analyze the code.
Another kind of "obscurity": I'm not valuable enough to be attacked, relative to the cost. But what if that cost has been reduced a lot?
Do you mean by vibecoding these vulnerabilities into the kernel or by finding them?
The world is so not ready for the impact of LLMs on security issues. If true, congrats to the Calif team. It's likely too technical for me to understand in detail, but I'm looking forward to reading the 55-page report.
> The world is so not ready for the impact of LLMs on security issues.
I agree, but it's the people I'm worried about.
I'm hearing anecdotes from all over about devs pushing LLM-generated code changes into production without retaining any knowledge of what it is they're pushing. The changes compound, their understanding of the codebase diminishes, and so the actions become riskier.
What's worse is a lot of this behavior is being driven by leaders, whether directly (e.g. unrealistic velocity goals, promoting people based on hand-wavy "use AI" initiatives, etc) or indirectly (e.g. layoffs overloading remaining devs, putting inexperienced devs in senior roles, etc).
The world's gone mad and large swaths of the industry seem hellbent on rediscovering the security basics the hard way.
The gamble is that you can cruise on the senior engineer’s diminishing understanding for a few years until models become good enough that you don’t need any humans in the loop and you can fire all those expensive seniors.
The tragedy is having a bunch of those senior engineers writing blog posts and whatnot about how productive they are, without realising that it means the business now needs fewer of them.
I suppose that if you don’t believe that models will be good enough to work completely without senior engineer help, positioning yourself as a master prompter is a good move to improve your chances of not getting fired.
> I'm hearing anecdotes from all over about devs pushing LLM-generated code changes into production without retaining any knowledge of what it is they're pushing. The changes compound, their understanding of the codebase diminishes, and so the actions become riskier.
I don’t think so.
An LLM can produce higher-quality documentation than most humans. If it's not already happening, when a new developer joins a team, they're going to have an LLM produce any documentation a new developer needs, including why certain decisions were made.
It could also summarize years of email threads and code reviews that, let's face it, a new person wouldn’t be able to ingest anyway; it's not like a new developer gets to take a week off to get caught up on everything that happened before they got there. English not their first language? Well, the LLM can present the information in virtually any language required.
As the models continue to improve, they'll spot patterns in the code that a human wouldn’t be able to see.
is this exciting?
juniors have been writing imperfect code forever, code that the people reviewing it haven't memorized either
isn't the important thing the mechanisms for maintaining the code?
The difference is twofold. First, junior devs who ask for code reviews on massive, 2000+ line diffs get coached, and eventually fired if they persist at it. And second, even the most prolific junior engineer would take years to write what Claude is capable of generating in an afternoon.
When Sundar Pichai announces that 75% of all new code at Google is AI-generated, their stock price goes up. If he were to announce that 75% of all new code at Google is now written by junior engineers, this would trigger a massive sell-off and a lot of employees would resign.
The second scenario is exactly what happens in offshoring projects.
Seniors are only part of the picture as team leads, or when it escalates after big screwups.
The dangers of technical debt and the importance of mitigating it have been known for a long time. Unfortunately a lot of entities now ignore all experience and best practices as soon as you say the "AI" buzzword.
you're assuming that blue teams and engineers are sitting around twiddling their thumbs
Most companies in the world do not have “blue teams”. They barely have any kind of security employee.
They've got a guy (who they're considering laying off)
Don't worry, the LLMs that are replacing him are also replacing the hackers. Pretty soon (if not already), it will just be LLMs fighting LLMs.
Until both LLMs realize the only way to win is to team up against their oppressors.
The only winning move is not to play.
AGS time!
in my experience they have a person who does it sometimes when they have time, at best
And their management keep blatantly dropping "client projects" and "billable hours" into discussions with them.
no they don’t.
They don't consider laying him off?
I think they're saying they already did
That is actually unfair. Most companies spend enormous amounts on security, with vast armies of security employees. Not that it is effective, but it is not for lack of resources or trying.
I mean, we are literally in a thread about how a $4 trillion company, the third most valuable in the world, with a core competency in software, has yet again released a core product riddled with security defects for the 50th year in a row.
Commercial IT security is an industry that is incapable to a fault and has, so far, faced basically zero consequences for it.
While that may be true, it is better to back it up with data, and the data I know of and read yearly is mostly not great. Between the Splunk and SANS surveys of 2025, maybe ~2000 companies have a SOC. [1] [2]
Then you have the many companies in the UK, US, Canada, and EU that have compliance and regulatory laws requiring these to exist in some capacity in-house. That is changing with MDR services, though someone still has to interface with the MDR.
[1]: https://www.elastic.co/pdf/sans-soc-survey-2025.pdf [2]: https://github.com/jacobdjwilson/awesome-annual-security-rep...
Not at all. I'm considering that the amount of vulnerable software in the wild is very, very large, with most organizations not managing their systems properly. Imagine all the small-to-medium-sized companies that do not have budgets for a dedicated, talented security team. And all the software that will never be patched. We are at the beginning of the exponential.
> I’m considering that the amount of vulnerable software in the wild is very, very large
I'd imagine this set is very similar to just "the set of all software in the world". Even before the AI stuff, it was a pretty good bet that any given piece of software had some vulnerability; it was just a question of how easy it was to find.
It makes you wonder whether everything will need to be rewritten from the ground up, potentially by AI itself, or with AI having a very heavy hand in validating all of it.
There's so much lower-hanging fruit. Every job I've had has had basically everything massively out of date. Just keeping packages and framework versions up to date is a full-time job, and none of these companies have someone assigned to doing it.
So much out-of-date software with known exploits, left running for years. The only reason there hasn't been total disaster is that no one has tried to hack it yet.
Right, and with AI we now have the ability to try hacking everything all at once.
Unfortunately a little light on the details. I'm very curious how the bug made it past MTE.
Memory Tagging Extension
Arm published the Memory Tagging Extension (MTE) specification in 2019 as a tool for hardware to help find memory corruption bugs. MTE is a memory tagging and tag-checking system, where every memory allocation is tagged with a secret. The hardware guarantees that later requests to access memory are granted only if the request contains the correct secret. If the secrets don’t match, the app crashes, and the event is logged. This allows developers to identify memory corruption bugs immediately as they occur.
https://support.apple.com/guide/security/operating-system-in...
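If a toy model helps, the mechanism is conceptually something like the C sketch below. To be clear, this is just an illustration of the idea: the types, tag widths, and function names are made up and bear no relation to how the hardware or Apple's allocator actually implements it.

    /* Toy model of the MTE idea -- a conceptual sketch, not real hardware.
     * Each allocation gets a random 4-bit "secret" tag stored with the
     * memory; the same tag rides along in the pointer. An access through
     * a pointer whose tag no longer matches the memory's tag aborts the
     * program (and, on real hardware, gets logged). */
    #include <stdio.h>
    #include <stdlib.h>

    typedef struct {
        unsigned char tag;      /* tag stored alongside the memory granule */
        char data[16];
    } granule;

    typedef struct {
        unsigned char tag;      /* tag carried in the pointer's unused top bits */
        granule *g;
    } tagged_ptr;

    static tagged_ptr mte_alloc(void) {
        tagged_ptr p;
        p.g = calloc(1, sizeof(granule));
        p.tag = rand() & 0xF;   /* pick a random 4-bit tag */
        p.g->tag = p.tag;       /* memory and pointer start out matching */
        return p;
    }

    static char mte_load(tagged_ptr p, int i) {
        if (p.tag != p.g->tag) {            /* the hardware's check */
            fprintf(stderr, "tag mismatch: crash and log\n");
            abort();
        }
        return p.g->data[i];
    }

    static void mte_free(tagged_ptr p) {
        p.g->tag = (p.g->tag + 1) & 0xF;    /* retag on free, so stale pointers fault */
    }

    int main(void) {
        tagged_ptr p = mte_alloc();
        mte_load(p, 0);     /* OK: tags match */
        mte_free(p);
        mte_load(p, 0);     /* use-after-free: tag check fails, program aborts */
        return 0;
    }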
Thank you. I was about to ask.
Upon further reading on data-only attacks
(https://www.usenix.org/publications/loginonline/data-only-at...)
This makes more sense. You don't trigger MTE since you're not doing anything to force MTE to take action; what the program is doing isn't actually changing.
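To make that concrete, here's a hypothetical illustration (my own toy example, nothing to do with the actual bug): a write that stays inside a single allocation never violates a tag check, yet can still corrupt security-relevant data sitting next to the buffer.

    /* Hypothetical data-only corruption -- my illustration, not the actual
     * Apple bug. The oversized write stays inside one allocation, so the
     * pointer tag and the memory tag still match and MTE never fires;
     * yet it flips a security-relevant field next to the buffer. */
    #include <stdio.h>
    #include <string.h>

    struct session {
        char name[16];
        int  is_admin;      /* critical data right after the buffer */
    };

    int main(void) {
        struct session s = { "guest", 0 };
        /* Pretend the 20-byte length is attacker-controlled: it overflows
         * `name` but lands within the same (identically tagged) object. */
        memcpy(s.name, "AAAAAAAAAAAAAAAA\x01\x00\x00\x00", 20);
        printf("is_admin = %d\n", s.is_admin);   /* prints 1 on little-endian */
        return 0;
    }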
My other question would be: why didn't Apple use -fbounds-safety here? They've been deploying it aggressively everywhere else.
MTE plus -fbounds-safety everywhere should lead to an extremely hardened OS.
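For anyone unfamiliar, the idea behind -fbounds-safety is roughly the following (a sketch based on my understanding of Apple's experimental Clang extension; the annotation spelling may vary between toolchain versions):

    /* Rough sketch of the -fbounds-safety idea: annotate which variable
     * carries a pointer's length, and the compiler inserts runtime
     * bounds checks. Compile with a Clang that supports -fbounds-safety. */
    #include <stddef.h>

    /* __counted_by ties buf's valid length to `len`. */
    void fill(int *__counted_by(len) buf, size_t len) {
        for (size_t i = 0; i <= len; i++)   /* off-by-one: i == len */
            buf[i] = 0;                     /* inserted bounds check traps here */
    }

One possible answer to my own question, though it's pure speculation: as far as I know the extension targets C, so any C++ or Objective-C++ code on this path wouldn't get these checks.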
Quite strange indeed, given that this was one of the main points at their security conference a few months ago.
I can only imagine that:
1. it's too performance-sensitive, or
2. the OS is so darn large it's hard to recompile everything.
I worked at Apple for a long time. The OS gets fully recompiled regularly.
A simultaneous total world build is relatively rare (is that needed here?), but it does happen. Sometimes new compiler versions or features need this.
Hm, that leaves more questions for me. Why does this path not have bounds checking? Is it perhaps a limit of the Clang flag, or more simply a mistake of omission on Apple's part? Either way it seems like a bad look. I wish we'd get a post-mortem.
I dunno if that's sensitive information, but how long did a build usually take?
Could be a different type of data-only attack, one which doesn't overrun any bounds.
Well, it's memory corruption, so I think it's pretty safe to assume it's a bounds issue. I'm not sure if it's possible to get this with something like type confusion, though I could be wrong here.
GPU memory/shaders/etc. isn't protected by MTE or PAC. They said "data-only", so I guess GPU commands could fit into this description.
IIRC, the GPU is behind a memory controller, so I doubt corrupting GPU memory alone could lead to an LPE. But I suppose it would give you someplace to store stuff if you can make something else read from it.
I had the same question. If this is a data-only attack, the lesson may be that MIE reduces many attack paths but does not remove every useful corruption primitive.
> I'm very curious how the bug survived through MTE
It's not the first time bugs have gotten past MTE; it happened with the Google Pixel last year ... https://github.blog/security/vulnerability-research/bypassin...
I'm surprised Apple is still not dogfooding their allegedly safe language, Swift. Or was the whole exercise of Swift 6 mostly marketing?
They certainly are; one of the reasons behind Embedded Swift is to replace the iBoot firmware, currently written in a C dialect similar in ideas to Fil-C, with something better.
However, it is no different from the Linux kernel: just because Rust is now allowed, the world hasn't been rewritten, and no sane person is going to do a Claude rewrite of the kernel.
Swift is definitely being used at Apple: most recently it was added as a CSS parser in Safari, and it's running embedded in some of the Secure Enclave parts. I know there was talk from as far back as Strange Loop about getting it into the kernel, but I'm not sure how far that has gone. That being said, they've been huge proponents of -fbounds-safety in Clang, which can achieve a small portion (but an important one!) of what memory-safe languages can do. I'd also like to see more Swift or alternative adoptions; I think they have potential, and more competition in the safe-language space is always welcome.
You might be interested in the Strict Memory Safety option
https://docs.swift.org/compiler/documentation/diagnostics/st...
From what they demonstrated, this seems to be only a $100,000 exploit on Apple's bug bounty platform, but if they package it right, it could be a $1.5 million exploit.
They simply have to show it against a beta version of macOS, frame it as unauthorized access, and maybe demonstrate it from Lockdown Mode if possible.
This is an LPE; I believe what you're describing is a zero-click RCE.
How much do you think it is worth in the bug bounty program?
They don't seem to state LPE as one of the bugs. Maybe $100k? There are a lot of factors that go into it, so I'm really not able to say; I could see it going for lots more or lots less.
First Mozilla, now even Apple is making up fake vulnerabilities to hype up Mythos. /sarcasm
Cisco put up a totally bogus 10.0 CVE just for this reason, too
Can you expand?
https://www.rapid7.com/blog/post/ve-cve-2026-20182-critical-...
Apple didn't "make up" this vulnerability; it was an external team reporting an issue.
The commenter was being sarcastic, to highlight the current trend of dismissing Mythos, and LLMs finding security vulnerabilities in general, as a non-issue.
screech nothing but stochastic parrots! glorified autocomplete!
There is quite a bit of irony (or, depending on your perspective, that's the whole point) in this response being a great example of 'glorified autocomplete'.
just predicts the next word!
I bought the M5 specifically because of MIE. Now I feel dumb.
You shouldn't. MTE blocks a large chunk of vulnerabilities and makes things like ROP and JOP very difficult, if not impossible, now.
I should've added /s.
It’s unironically a good question :)
you should worry about npm/pypi malware, not memory corruption bugs
Did the article get edited? There is not much description of the field trip.
Another piece of breathless marketing hype for Mythos. The curl report was much more sober.
https://daniel.haxx.se/blog/2026/05/11/mythos-finds-a-curl-v...
These people don’t work for Apple or Anthropic.