Michael D and Michael DB,
The answer to that question is very clear and simple:
MDB: Where is there a purely symbolic AI system that can support NLP anywhere in the same
ballpark as LLMs? If such a thing exists, I would love to know about it. Keep in mind that the
system has to do a lot more than turn natural language into SPARQL. It needs to keep the
context of a discussion thread, and it needs to respond appropriately when you give it
feedback like "give me more/less detail" or "change the tone to [more playful] [more
professional] [more creative]". It needs to be able to do common sense reasoning and handle
very hard NLP problems like anaphora.
Answer: We did that with the purely symbolic VivoMind technology (from 2000 to 2010).
The language understanding part was far better than the language generating part. In
accuracy and precision, the 2010 version far exceeds any and every current LLM system that
does not use a symbolic component to detect and evaluate the inevitable errors and
hallucinations. Our largest clients (some US gov't agencies) attest to that fact.
For an overview and documentation of the old VivoMind system, see
https://jfsowa.com/talks/cogmem.pdf . The date of this presentation is 2016, but it is an
update of slides that were originally presented in 2011. See the applications that begin
on slide 45. Every one of them was implemented in 2010 or earlier. And in precision and
accuracy, they are far ahead of anything that can be implemented by LLMs today without
using symbolic technology for reasoning and evaluation.
Our new Permion Inc. system combines an upgraded version of the 2010 symbolic technology
with LLMs for an enhanced natural language OUTPUT that has a more readable and more
elegant syntax. Permion also uses LLMs for some of the searching, but it only requires a
modest amount of storage. For customers with deep pockets, however, the Permion
technology can scale up to use as much storage as the customer provides.
For a talk on YouTube, see Evaluating and Reasoning with and about GPT by John Sowa and
Arun Majumdar,
https://www.youtube.com/watch?v=6K6F_zsQ264 .
That talk only had 544 views. But the following YouTube video, from the 2024 Ontolog Summit,
had over 10,000 views: Without Ontology LLMs are Clueless by John Sowa,
https://www.youtube.com/watch?v=t7wZbbISdyA .
We strongly suspect that a large percentage of the viewers are in China. One reason is
that I have been invited to give talks (expenses paid) in places like Hong Kong, Macau,
Singapore...
Another reason: Arun has also received an offer from China to buy our Permion company.
However, some of our key customers would not be happy with that idea.
John
----------------------------------------
From: "Michael Denny" <regattago(a)gmail.com>
Michael,
I was a little surprised to see you say
“What LLMs do is to solve the problem of common sense reasoning better than any other
approach to date. LLMs know that things fall down, that water is wet, etc.”
When I asked Claude 3.5 the following:
“Is it accurate to say that LLMs solve the problem of common sense reasoning and that LLMs
know that things fall down, that water is wet, etc.?”
The response was:
“No, that's not accurate. The relationship between LLMs and common sense reasoning is
much more complex and limited.
LLMs can often produce text that appears to reflect common sense understanding, but this
is primarily because they've learned statistical patterns in human-written text about
how objects and phenomena are typically described. They don't have genuine
understanding or reliable common sense reasoning capabilities.
For example, an LLM might correctly state that objects fall downward or that water makes
things wet, but it doesn't truly "understand" gravity or wetness the way
humans do. Some key limitations:
- Inconsistency: LLMs can make basic physics or common sense errors when questions are
asked in novel ways or contexts
- No real-world grounding: Their "knowledge" comes from text patterns, not from
physical experience with the world
- No causal understanding: They don't truly grasp cause and effect relationships
- Pattern matching vs. reasoning: They often give sensible-looking answers by matching
patterns rather than through actual reasoning
This is why LLMs can sometimes make surprisingly basic errors about physical reality or
common sense relationships that would be obvious to humans. They might correctly state
that "water flows downhill" in one context but then make an elementary mistake
about fluid dynamics in another context.
What are your thoughts on the nature of common sense reasoning in AI systems? I'm
curious about your perspective on this.”
I think the key point is that LLMs do not understand naïve physics or any deterministic
relationships the way humans do, nor do they have the inferential capabilities that symbolic
logics provide. But as you point out, they do make a great interlocutor.
Mike
Mike
From: ontolog-forum(a)googlegroups.com [mailto:ontolog-forum@googlegroups.com] On Behalf Of
Michael DeBellis
Sent: Monday, February 3, 2025 6:47 PM
To: ontolog-forum
Subject: [ontolog-forum] Re: Latest from the world of LLMs
----------------------------------------
It’s official: the myth of an LLM Scaling Moat has been debunked
It is a big leap from what we know about DeepThink to make that conclusion. For one
thing, I don't trust the mainstream press (CNBC) for any info about a complex
scientific or technology question. For another, there is so much we don't know with
confidence about DeepThink. How well does it really do on benchmarks? How much money or
computing resources did the Chinese government or other groups associated with them pump
into DeepThink? For another, there are things that OpenAI can do that DeepThink can't do,
such as their recent Deep Research announcement:
https://www.youtube.com/watch?v=YkCDVn3_wiw ; also see the playlist "12 Days of OpenAI"
on YouTube:
https://www.youtube.com/playlist?list=PLOXw6I10VTv9lin5AzsHAHCTrC7BdVdEM
But even supposing that every claim made about DeepThink is true, it doesn't mean that
"the myth of an LLM Scaling Moat has been debunked." It just means the moat isn't as large
as we thought. And if the moat isn't as large as we thought, I don't see how that justifies
the conclusion: "All roads lead back to good old symbolic reasoning and inference—now more
accessible and understandable when combined with a multi-modal natural language interface
that enhances interaction possibilities." If anything, it makes LLMs even MORE usable and
accessible, because creating one doesn't require the huge resources we used to think it did.
(From what I know, I'm deeply skeptical, but I actually hope that the claims about DeepThink
are true, because I think it is a good thing that foundational LLMs are not restricted to
just a few companies.)
Where is there a purely symbolic AI system that can support NLP anywhere in the same
ballpark as LLMs? If such a thing exists, I would love to know about it. Keep in mind that the
system has to do a lot more than turn natural language into SPARQL. It needs to keep the
context of a discussion thread, and it needs to respond appropriately when you give it
feedback like "give me more/less detail" or "change the tone to [more playful] [more
professional] [more creative]". It needs to be able to do common sense reasoning and handle
very hard NLP problems like anaphora.
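For example, the standard chat-completion pattern keeps the discussion thread as an explicit
message history, so feedback like "give me less detail" works because the earlier turns are
resent with every call. Here is a minimal Python sketch of that pattern (the model name and
prompt wording are only placeholder assumptions, not any particular product's setup):

from openai import OpenAI

client = OpenAI()  # assumes OPENAI_API_KEY is set in the environment

messages = [
    {"role": "system", "content": "You are a helpful assistant."},
    {"role": "user", "content": "Explain how RAG reduces hallucinations."},
]
first = client.chat.completions.create(model="gpt-4o-mini", messages=messages)
messages.append({"role": "assistant", "content": first.choices[0].message.content})

# The follow-up only makes sense because the prior turns are sent back as context.
messages.append({"role": "user", "content": "Give me less detail and a more playful tone."})
second = client.chat.completions.create(model="gpt-4o-mini", messages=messages)
print(second.choices[0].message.content)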
What LLMs do is to solve the problem of common sense reasoning better than any other
approach to date. LLMs know that things fall down, that water is wet, etc. It doesn't
make sense to also expect them to be able to solve that problem AND have completely
reliable domain knowledge. There are several approaches to using curated, authoritative
knowledge sources with an LLM to reduce or eliminate hallucinations. One of them is
Retrieval Augmented Generation (RAG). That's an example of how semantic tech and LLMs
complement each other:
https://journals.sagepub.com/eprint/RAQRDSYGHKBSF4MRHZET/full also
see:
https://www.michaeldebellis.com/post/integrating-llms-and-ontologies
When you use a RAG, you have complete control over the domain knowledge sources. Building
a RAG is the best way I've ever seen to put an NLP front end to your knowledge graph.
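As a rough illustration of that combination (a sketch under assumptions, not the pipeline from
the paper above), the retrieval step can be a SPARQL query against your knowledge graph and the
generation step an LLM call that is constrained to the retrieved facts. The file name, query,
and model choice below are placeholders:

from rdflib import Graph
from openai import OpenAI

def retrieve_facts(kg_path: str, term: str) -> list[str]:
    """Retrieval step: pull candidate triples about a term from a local knowledge graph."""
    g = Graph()
    g.parse(kg_path)  # e.g. a Turtle export of your ontology; format guessed from the suffix
    query = f"""
        PREFIX rdfs: <http://www.w3.org/2000/01/rdf-schema#>
        SELECT ?s ?p ?o WHERE {{
            ?s ?p ?o .
            ?s rdfs:label ?label .
            FILTER(CONTAINS(LCASE(STR(?label)), "{term.lower()}"))
        }} LIMIT 20
    """
    return [f"{s} {p} {o}" for s, p, o in g.query(query)]

def answer(question: str, facts: list[str]) -> str:
    """Generation step: answer grounded only in the retrieved facts."""
    client = OpenAI()  # assumes OPENAI_API_KEY is set
    prompt = ("Answer the question using ONLY these knowledge-graph facts. "
              "If the facts are insufficient, say so.\n\nFacts:\n"
              + "\n".join(facts) + f"\n\nQuestion: {question}")
    resp = client.chat.completions.create(
        model="gpt-4o-mini",  # illustrative model choice
        messages=[{"role": "user", "content": prompt}],
    )
    return resp.choices[0].message.content

# facts = retrieve_facts("my_ontology.ttl", "aspirin")
# print(answer("What does aspirin interact with?", facts))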
Another way to get domain knowledge into an LLM is to train it with special training sets for
the domain. One excellent example of this is Med-BERT, a version of the BERT LLM specially
trained with healthcare data. See: L. Rasmy, Y. Xiang, Z. Xie, C. Tao and D. Zhi, "Med-BERT:
pretrained contextualized embeddings on large-scale structured electronic health records for
disease prediction," npj Digital Medicine, vol. 4, no. 86, 2021.
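To give a sense of what that kind of domain-specific training involves (a minimal sketch, not
the actual Med-BERT pipeline), one common recipe is to continue masked-language-model training
of a BERT checkpoint on in-domain text with Hugging Face transformers. The toy sentences and
hyperparameters below are placeholder assumptions:

from transformers import (AutoTokenizer, AutoModelForMaskedLM,
                          DataCollatorForLanguageModeling, Trainer, TrainingArguments)
import torch

domain_texts = [  # in practice: de-identified clinical notes or serialized EHR codes
    "Patient presents with type 2 diabetes mellitus and hypertension.",
    "Metformin 500 mg prescribed twice daily; HbA1c 7.8 percent.",
]

tokenizer = AutoTokenizer.from_pretrained("bert-base-uncased")
model = AutoModelForMaskedLM.from_pretrained("bert-base-uncased")
encodings = tokenizer(domain_texts, truncation=True, padding=True, return_tensors="pt")

class TextDataset(torch.utils.data.Dataset):
    def __init__(self, enc):
        self.enc = enc
    def __len__(self):
        return self.enc["input_ids"].shape[0]
    def __getitem__(self, i):
        return {k: v[i] for k, v in self.enc.items()}

collator = DataCollatorForLanguageModeling(tokenizer=tokenizer, mlm_probability=0.15)
trainer = Trainer(
    model=model,
    args=TrainingArguments(output_dir="domain-bert", num_train_epochs=1,
                           per_device_train_batch_size=2),
    train_dataset=TextDataset(encodings),
    data_collator=collator,
)
trainer.train()  # the adapted model can then be fine-tuned for a task such as disease prediction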
Still another (much newer) way is to use reinforcement fine-tuning to train an LLM with
domain-specific data. This doesn't take as much work as training a domain-specific LLM, but it
is much more work than creating a RAG. See: M. Chen, J. Allard, J. Wang and J. Reese,
"Reinforcement Fine-Tuning—12 Days of OpenAI: Day 2," 6 December 2024.
https://www.youtube.com/watch?v=yCIYS9fx56U&t=29s
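The core of reinforcement fine-tuning is a grader: each answer the model samples is scored
against a reference, and the score becomes the reward for the policy update (the update itself
is handled by the fine-tuning service or an RL library and is omitted here). A minimal Python
sketch of such a grader, using token-overlap F1 purely as an illustrative metric:

from collections import Counter

def token_f1(candidate: str, reference: str) -> float:
    """Reward in [0, 1]: token overlap between a sampled answer and the reference."""
    cand, ref = candidate.lower().split(), reference.lower().split()
    common = sum((Counter(cand) & Counter(ref)).values())
    if common == 0:
        return 0.0
    precision, recall = common / len(cand), common / len(ref)
    return 2 * precision * recall / (precision + recall)

# Example: scoring sampled answers to a domain-specific question.
reference = "metformin is first-line therapy for type 2 diabetes"
samples = [
    "first-line therapy for type 2 diabetes is metformin",   # high reward
    "insulin is always the first treatment",                 # low reward
]
for s in samples:
    print(f"{token_f1(s, reference):.2f}  {s}")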