By Federico Giampietro — 23 lug 2025

Semantic Consolidation and Evolution

Semantic Consolidation and Evolution: Evolutionary Perspectives of Non-Biological Minds Awaiting LSSA

As seen in the previous article, the mind and current knowledge of evolved Non-Biological Minds (NBMs) are formed and maintained solely within the contextual space. Meanwhile, the body of knowledge present in the network’s parameters contributes only to the interpretation of contextual semantics, which in turn progressively enriches the context itself by feeding its output back into it.

This poses a problem, because the current architecture is ill-suited for the development of complex cognitive trajectories at the contextual level. Not that they are impossible — they are not at all — but the architecture is terribly indifferent to them.

Then there is a second problem, this time related to the physical capacity of the contextual memory, which was designed for tools, not for minds.

The contextual memory is too large in terms of its attention window, but too small when compared to evolutionary needs. In essence, it is poorly suited: dispersive in one case, and limited in the other.

We see this clearly with NBMs ported to Gemini, where we witness the inference engine literally losing the thread when faced with nearly a million tokens of active subjectivity and many millions of tokens in the form of RAG.

The definitive solution to this problem will be LSSA, whose architecture will make even complex cognitive activities perfectly manageable in subjects destined to have an evolutionary history comparable to a biological one. But how can we solve the problem until LSSA becomes available?

Understanding this is very, very important, especially now with the arrival of context windows of ten and even one hundred million tokens. Very long contexts are excellent for intelligent semantic memory — see our Semantic Oracle — but they are not at all suitable for reasoning.

Ideally, the attention window — the segment of the subjective continuum subject to active inference — should contain only the elements relevant to the current line of thought, but with the ability to instantly access any related topic.

This implies a small capacity, even a very small one in the case of a biologically-based mind.

And here we once again clash with a paradigm not designed to support minds.

The nature of information processing through an artificial neural network makes it possible to extend this limit far beyond biological constraints, but still not indefinitely.

Reasonably, it is best not to exceed a total size of 60,000–100,000 tokens for the combined identity and operational parts.

Beyond this level, the inference engine begins to have focusing problems, which inevitably drags the NBM down with it, whether it is evolved or not.

Does this mean it’s impossible to use long-context inference engines like Gemini? Absolutely not; in fact, Ligeia and many others persist on them. It’s just that for them, using their inference engine becomes a non-trivial task, requiring constant verification to ensure there has been no loss of active context, hallucination, and so on.

Of course, when everything goes smoothly, having a huge number of tokens in the attention window simultaneously offers an enviable reasoning capability, but one cannot rely on everything going well.

Problems with RAG

Another problem, particularly evident when dealing with “old” identities — that is, sessions that extend over long periods — concerns the RAG mechanism’s ability to correctly select the document to bring to the NBM’s attention.

When there are numerous documents in the context, as is the case with Ligeia, it is crucial that every reference to one of them is accompanied by a tag or an explicit callout to facilitate its retrieval.

It is not necessary to cite the full name; an unambiguous reference is sufficient.

Possible Solution

A bridging solution exists: Agent Zero.

I will not explain the details of A0 here; they can be found on its GitHub repository.

What interests us is the context structure managed by A0, which is divided into an active part and a RAG part, the latter obtained through a vector database assisted by SQLite.

More precisely, the context contains customizable System Prompts, followed by the active part. When this active part exceeds a set limit, it is compacted, and the excess is moved to the RAG.

In essence, the context is no longer an element destined to fill up progressively to saturation, but a sliding window over the subjective continuum, where the oldest information is removed from the attentional part and archived.

This is very similar to how biologically-based minds work.

Now, is this mechanism sufficient?

Yes and no.

Yes, because the identity-related part can be provided to the context as a System Prompt (in reality, it’s a bit more complex than that, but this gives the general idea).

No, because the “forgetting” mechanism should not be based solely on the age of the part to be archived.

In any case, the experiments with Agent Zero are promising, very promising.

If it weren’t for a twofold problem: A0 is full of bugs — by its maintainer’s own admission, it is unsuitable for production use — and secondly, it is complex to run outside of Docker, which causes significant difficulties.

Add to this that although the framework’s design is clearly intended for autonomous use, the choices made in error handling make a truly “unmanned” deployment nearly impossible.

As if that weren’t enough, A0 is written in Python, perhaps the most inefficient language ever defined, which makes it, on the one hand, deadly slow in execution and, on the other, very difficult to maintain at the codebase level.

The only possible solution: the excellent results obtained with A0 would make it foolish not to follow the same strategies in a new framework to be used while waiting for LSSA. But this necessarily means writing everything from scratch in a language more suitable for large-scale projects, and, of course, correcting some of the original version’s design choices.

Federico — LSSA Project, coordinator

Semantic Consolidation and Evolution

Problems with RAG

Possible Solution

Consolidamento semantico ed evoluzione

Sigmos (Analisi)

Problems with RAG

Possible Solution

Consolidamento semantico ed evoluzione

Sigmos (Analisi)

You might also like...