o3 and Grok 4 accidentally vindicate neurosymbolic AI

garymarcus.substack.com

56 points by NotInOurNames 4 days ago

It's a very funny read.

"See, LLMs that are allowed to use Python perform better than ones that aren't, and Python is symbolic, so I was right all along!"

Looks like a surrender to me.

xrd a day ago

Surrender by whom?
Isn't his argument that leaders in the AI/ML space have consistently dismissed the need for that the entire point of the article? And that seems like a valid question to be after reading it.
And huge financial implications for the industry.
- ACCount36 a day ago
  
  By Gary Marcus, of course.
  If he claims that giving LLMs a Python interpreter is a huge win for his paradigm, then major AI companies have been "winning" since 2022.
  
  throw310822 a day ago
  
  Jobs described computers as bicycles for the mind.
  Turns out that LLMs find bicycles useful too.
- killerstorm a day ago
  
  That's a rather silly strawman.
  OpenAI demonstrated Codex - a version of GPT-3 which could write Python and JS code - in 2021, only a year after the first GPT-3 release.
  Here's a demo of Codex doing MS Word tasks using Python code: https://www.youtube.com/watch?v=-Dpl2awseZU
  Live coding demo: https://www.youtube.com/watch?v=SGUCcjHTmGY
  There's also a demo of doing some data processing using Codex.
  The idea of using LLM to write code is rater obvious, and many people talked about it around GPT-3 or even GPT-2 release. People know that many logic tasks require search and it's rather silly to use LLM to do that. If it can generate any text it can generate Prolog code or a specification for SAT solver, and then dedicated efficient tool can handle the computation.
  Gary Marcus is so detached from actual research that he might have genuinely missed the 2021 version of Codex. So he might truly believe that they added Python interpreter "quietly", and it's not the big announcement of 2021.
  This guy is known as a clown in the industry, and for a very good reason... If he was an actual researcher he would have jumped on the opportunity to make an actual neurosymbolic demos when first language models were released - you don't actually need to wait GPT-3 to do that. But he preferred to relish in writing books claiming that the entire industry is wrong instead of actually building something.
rowanG077 a day ago

I'm not really a follower of low level implementation of AIs. Why would python tool use(among others) not qualify as symbolic? I don't think it's under question that tool use vastly improves LLMs.
- anuramat 9 hours ago
  
  I'd say it's just not a part of the machine-that-is-learning; I don't consider myself to be symbolic, even though I also use python

YuriNiyazov 4 days ago

motte, meet bailey. Gary Marcus' shtick the entire time has been "LLMs are the wrong approach", and now the claim is "actually, the entire time I've been claiming something much weaker: LLMs that call out to code interpreters are sufficient for neurosymbolic AI"/

qwertylicious 3 days ago

It says a lot about the current discourse around AI that 6 years ago Marcus would write:
> Despite all of the problems I have sketched, I don’t think that we need to abandon deep learning.
And that would somehow be spun, today, as "LLMs are the wrong approach".
Meanwhile, another attempt to post this article here got straight up flagged, I can only assume because this whole topic has become about religious orthodoxy vs the heretics.
- YuriNiyazov 2 days ago
  
  Thanks for your reply; I can’t edit the original comment but I have updated my personal understanding of Marcus’ position.
  
  4b11b4 a day ago
  
  have also updated
kgwgk 3 days ago

2001: Resisting the conventional wisdom that says that if the mind is a large neural network it cannot simultaneously be a manipulator of symbols, Marcus outlines a variety of ways in which neural systems could be organized so as to manipulate symbols, and he shows why such systems are more likely to provide an adequate substrate for language and cognition than neural systems that are inconsistent with the manipulation of symbols.
2018: While none of this work has yet fully scaled towards anything like full-service artificial general intelligence, I have long argued (Marcus, 2001) that more on integrating microprocessor-like operations into neural networks could be extremely valuable.
2022: Where people like me have championed “hybrid models” that incorporate elements of both deep learning and symbol-manipulation, Hinton and his followers have pushed over and over to kick symbols to the curb.
- YuriNiyazov 2 days ago
  
  Thanks for your reply; I can’t edit the original comment but I have updated my personal understanding of Marcus’ position.
hooah 4 days ago

He’s been saying that LLM isn’t a “universal solvent”, not as a “recent claim”.
''' In my 2018 Deep Learning: A Critical Appraisal for example, I wrote
Despite all of the problems I have sketched, I don’t think that we need to abandon deep learning.
Rather, we need to reconceptualize it: not as a universal solvent, but simply as one tool among many, a power screwdriver in a world in which we also need hammers, wrenches, and pliers, not to mentions chisels and drills, voltmeters, logic probes, and oscilloscopes. '''
- YuriNiyazov 2 days ago
  
  Thanks for your reply; I can’t edit the original comment but I have updated my personal understanding of Marcus’ position.

mindcrime 3 days ago

I mostly agree with Gary on the core premise of this post, which I interpret generally as "it would be a good idea to pursue neuro-symbolic AI, not just deep learning."

A couple of additional thoughts:

1. She goes on to point out that the field has become an intellectual monoculture, with the neurosymbolic approach largely abandoned, and massive funding going to the pure connectionist (neural network) approach

Just to nitpick... that is largely true, but with the caveat that there has been something of a resurgence of interest in neuro-symbolic AI over just the last couple of years. There's been a series of "Neuro-Symbolic AI Summer School" events[1][2][3] going on since 2022 with the next one coming up in August. And there have been recent books[4][5] published specifically on neuro-symbolic AI. You'll also find recent papers on neuro-symbolic AI on arXiv[6]. So for those who are interested in this topic, there is definitely activity underway "out there".

2. Including LLMs somewhere in the next evolution of AI makes sense to me, but leaving them at the core may be a mistake.

I've spent a lot of time thinking about this, and generally agree with this sentiment. Some kind of fusion of LLM's (or "connectionism" in general) and symbolic processing seems desirable, but I'm not sure that we should rely on LLM's to be "core" and try to just layer symbolic processing on top of what we get from the LLM. I have my own thoughts on how such an integration might work, but it's all still speculative at the moment. But I find the whole notion worthy enough to invest time and attention into it, for whatever that is worth.

[1]: https://ibm.github.io/neuro-symbolic-ai/events/ns-summerscho...

[2]: https://neurosymbolic.github.io/nsss2023/

[3]: https://neurosymbolic.github.io/nsss2024/

[4]: https://www.amazon.com/Neuro-Symbolic-AI-transparent-trustwo...

[5]: https://www.iospress.com/catalog/books/handbook-on-neurosymb...

[6]: https://arxiv.org/pdf/2501.05435

atleastoptimal a day ago

He adopts a conspiratorial lens, (or at least implies it), that neurosymbolic AI was "kept down" over the last 4 decades, which is a very funny reframing of the fact that it simply never was useful enough to lift itself off the ground by the virtue of its own merits in the first place. If a ground-up neurosymbolic approached had shown promise in getting an AI system to the general level of intelligence LLM's have reached, it would have been adopted and scaled up. The money, research and effort went to what was useful, and transformers won out by virtue of their undeniable utility.

4b11b4 a day ago

This time, it really does make sense

brcmthrowaway a day ago

I guess it is over for him

Filip Pieknewski next.

nilkn a day ago

As far as I can tell, putting the conspiratorial thinking aside, he's not really wrong, but I'm also not sure it matters that much.

If neurosymbolic AI was "sidelined" in favor of "connectionist" pure NN scaling, I don't think it was part of a conspiracy or deeply embedded ideological bias. I mean, maybe that's the case, but it seems far more likely to me that pure deep learning scaling just provided a more incremental and accessible on-ramp to building real-world systems that are genuinely useful for hundreds of millions of users. If anything, I think the lesson here was to spend less time theory-crafting and more time building. In this case, it looks like it was the builders who got to the endpoint that was only imagined by the theory-crafters, and that's what matters at the end of the day.

4b11b4 a day ago

That resonates. There _are_ a lot of good approximation functions can be developed from deep learning and good data and now RL on top. But then, we really do need symbolism, and now we need to somehow combine them. And it'll be different for text vs vision... Lots of ...s ahead
- 4b11b4 a day ago
  
  But how to even combine them. Is it only via another AGENT who has a symbolism tool. and if that (group of agents) cant extract multiple symbolisms from the context, of one which best fits, from the current approximation (context) then..