GraphNews

A formal proof published last month at arXiv proves that every vector database, every RAG system, and every embedding-based knowledge graph will eventually forget what you stored in it. It does not say symbolic knowledge graphs forget.
A formal proof published last month at arXiv proves that every vector database, every RAG system, and every embedding-based knowledge graph will eventually forget what you stored in it. It does not say symbolic knowledge graphs forget.
  • The Geometry of Forgetting (arXiv:2604.06222): https://arxiv.org/abs/2604.06222
  • The Price of Meaning: Why Every Semantic Memory System Forgets (arXiv:2603.27116): https://arxiv.org/abs/2603.27116
  • Ebbinghaus, H. (1885). Über das Gedächtnis. Translated as Memory: A Contribution to Experimental Psychology (1913). Original power-law forgetting curve
  • Roediger, H. L., & McDermott, K. B. (1995). Creating false memories: Remembering words not presented in lists. Journal of Experimental Psychology: Learning, Memory, and Cognition, 21(4), 803-814. DRM paradigm reference
  • W3C SPARQL 1.2 Query Language (2025): https://www.w3.org/TR/sparql12-query/
  • W3C SHACL Shapes Constraint Language: https://www.w3.org/TR/shacl/
  • Cypher Property Graph Query Language (openCypher specification): https://opencypher.org/
  • O'Reilly forthcoming book "Agentic Graph Architecture": Chapter 4 (Memory) treats the proof class boundary as a substrate-decision primitive
- The Geometry of Forgetting (arXiv:2604.06222): https://arxiv.org/abs/2604.06222- The Price of Meaning: Why Every Semantic Memory System Forgets (arXiv:2603.27116): https://arxiv.org/abs/2603.27116- Ebbinghaus, H. (1885). Über das Gedächtnis. Translated as Memory: A Contribution to Experimental Psychology (1913). Original power-law forgetting curve- Roediger, H. L., & McDermott, K. B. (1995). Creating false memories: Remembering words not presented in lists. Journal of Experimental Psychology: Learning, Memory, and Cognition, 21(4), 803-814. DRM paradigm reference- W3C SPARQL 1.2 Query Language (2025): https://www.w3.org/TR/sparql12-query/- W3C SHACL Shapes Constraint Language: https://www.w3.org/TR/shacl/- Cypher Property Graph Query Language (openCypher specification): https://opencypher.org/- O'Reilly forthcoming book "Agentic Graph Architecture": Chapter 4 (Memory) treats the proof class boundary as a substrate-decision primitive
·linkedin.com·
A formal proof published last month at arXiv proves that every vector database, every RAG system, and every embedding-based knowledge graph will eventually forget what you stored in it. It does not say symbolic knowledge graphs forget.
The more I use coding agents, the more I think the agent harness is starting to look like a repeat of the Git pattern
The more I use coding agents, the more I think the agent harness is starting to look like a repeat of the Git pattern
This shows that, as usual, it’s all about graphs 🙂 in Git, the state represents code stages; in an agent, it represents reasoning or context state; and in knowledge graphs, it represents structured knowledge and its relationships.
This shows that, as usual, it’s all about graphs 🙂 in Git, the state represents code stages; in an agent, it represents reasoning or context state; and in knowledge graphs, it represents structured knowledge and its relationships.
·linkedin.com·
The more I use coding agents, the more I think the agent harness is starting to look like a repeat of the Git pattern
Ontology and semantic technologies in risk and security
Ontology and semantic technologies in risk and security
As requested, these are the slides of the invited talk I gave yesterday in the event "Ontology, AI, and Cybersecurity" organized by Zequan Huang and Antoine Leblanc at the University of Paris I: Panthéon-Sorbonne. Unfortunately, I could not be there in presence but I had a great time interacting with the audience (great questions!). My presentation covers a number of interesting things that have been developed at Semantics, Cybersecurity, and Services (SCS)/University of Twente over the years. This includes UFO(https://lnkd.in/dWeGR7P5) but also work championed by Ítalo Oliveira, Stefano Nicoletti, Thijs van Ede, Rodrigo F. Calhau and Tiago Prince Sales Relevant papers here include: https://lnkd.in/dvZHu5p5, https://lnkd.in/dBgmCtCk, https://lnkd.in/e47bZvRN, https://lnkd.in/eEm_ajj9, https://lnkd.in/dzKnM7zT, https://lnkd.in/eAdcBJcD, https://lnkd.in/e2axrW7Z Great initiative and thanks for the invitation, folks! Hope to be able to visit the Sorbonne soon! #Ontology #ontologies #AI #cybersecurity #security #risk #riskpropagation #riskmanagement #riskpropagationgraphs
·linkedin.com·
Ontology and semantic technologies in risk and security
The main reason Resource Description Framework (RDF) matters.
The main reason Resource Description Framework (RDF) matters.
𝗬𝗼𝘂 𝗖𝗮𝗻’𝘁 𝗥𝗲𝗮𝘀𝗼𝗻 𝗢𝘃𝗲𝗿 𝗖𝗼𝗹𝗹𝗮𝗽𝘀𝗲𝗱 𝗟𝗶𝘁𝗲𝗿𝗮𝗹𝘀 I think this post misses the main reason Resource Description Framework (RDF) matters. RDF is not just a way to store data as triples. It is a semantic representation layer for reasoning. A database may store RDF triples, but RDF should not be understood primarily as a storage device. Its value is that those triples can be interpreted under formal semantics and reasoned over using ontologies. To be precise, Arthur should not just be treated as a loose string property on a node. Arthur is better modeled as a name identifier, an instance of a Designative Information Content Entity (DICE), that designates an actual instance of Person. Likewise, Engineer should not merely be collapsed into a literal like "Engineer". Engineer is better modeled as a role, with an object property assertion connecting an instance of Person to an instance of that role. That difference is powerful. In a labeled property graph, something like: Arthur {role: "Engineer"} usually means the graph has stored a convenient label and property value. That may help retrieval, filtering, and application logic, but the system does not automatically know what a person is, what a role is, or what it means for a person to bear a role. In RDF, the entities can be explicitly typed: ArthurNameIdentifier is a DICE. ArthurNameIdentifier designates ArthurPerson. ArthurPerson is an instance of Person. EngineerRole is an instance of Role. ArthurPerson bears EngineerRole. Now the graph is not just storing strings. It is representing typed entities and typed relations that participate in a formal semantic model. That means a reasoner can infer additional knowledge from class hierarchies, property restrictions, domain and range constraints, equivalence axioms, ontology mappings, and role based patterns. This is also why RDF matters for artificial intelligence (AI). RDF can act as the semantic layer of a world model: representing what kinds of things exist, how they relate, what roles they bear, what constraints apply, and what else can be inferred. The issue is not simply: “Do you attach metadata as triples or as properties?” The deeper issue is: “Can your graph support machine interpretable meaning and inferred knowledge, or is it mostly storing literals for retrieval?” You cannot reason very far over a collapsed string literal. You can reason over explicitly modeled entities, relations, classes, roles, designators, and axioms. RDF is not valuable because it stores triples. RDF is valuable because it makes assertions interpretable by machines under a formal semantic model. #RDF #KnowledgeGraphs #SemanticWeb #Ontology #OWL #GraphRAG #AI #SemanticReasoning | 29 comments on LinkedIn
the main reason Resource Description Framework (RDF) matters.
·linkedin.com·
The main reason Resource Description Framework (RDF) matters.
Obsidian, Supercharged: The AI Revolution in Personal Knowledge Management
Obsidian, Supercharged: The AI Revolution in Personal Knowledge Management

Obsidian as graph memory

Most people use Obsidian as a fancy folder.

The next generation is using it as a second brain that an AI agent actively maintains.

Here's what changed:

🗺️ Graphify — builds a queryable knowledge graph from your code, docs, and papers. Feeds your AI compressed subgraphs instead of raw files. Up to 71.5x fewer tokens. The difference between a flashlight and a satellite.

🤖 obsidian-second-brain — 32 CLI commands that pull live research into your vault, run scheduled agents overnight, and audit your current decisions against your past ones. "Your own notes say this failed last year. Still want to proceed?"

🔗 For Zettelkasten work: Smart Connections (semantic search), InsightA (long articles → atomic notes automatically), Notemd (auto wiki-links), Copilot (vault-grounded chat).

Since February 2026, Obsidian has an official CLI. Any AI agent with shell access can read and write your vault directly.

The vault is no longer passive storage. It's infrastructure.

·volodymyrpavlyshyn.substack.com·
Obsidian, Supercharged: The AI Revolution in Personal Knowledge Management
Ontologies and Knowledge Graphs: Why They Work Better Together I d.AP Blog
Ontologies and Knowledge Graphs: Why They Work Better Together I d.AP Blog

ONTOLOGIES VS. SEMANTIC LAYERS: WHY ENTERPRISE AI NEEDS BOTH, AND WHICH DOES WHAT

One of the most persistent confusions in enterprise AI discussions is the conflation of semantic layers and ontologies. A new analysis from d.AP makes the distinction crisp and practically useful. A semantic layer is for lookup — it normalises labels and definitions to make metrics consistent across reporting tools. An ontology is for context and reasoning — it encodes what entities are, how they relate, what can be inferred, and what constraints must hold.

The difference surfaces sharply in deployment. A knowledge graph built without an overarching ontology may contain connected data, but it lacks a shared interpretive framework. Different departments build incompatible schemas. Integration logic becomes inconsistent. Every new use case requires a partial rebuild rather than a reuse of existing structure. The ontology is what allows a new data source to be added by updating the rules, rather than rewriting the graph.

The analysis draws on a useful structural metaphor: the ontology is the architectural blueprint; the knowledge graph is the physical building. You can have a blueprint without a building (a purely conceptual model), but a well-formed building requires a blueprint — whether or not that blueprint is made explicit. Most enterprise knowledge graphs have implicit ontologies; the question is whether those implicit structures are consistent, maintained, and capable of supporting the reasoning that agentic AI requires.

Gartner’s 2026 positioning — naming knowledge graphs and semantic enrichment as non-negotiable budget items for enterprise AI — suggests the enterprise market is arriving at this conclusion independently, through the painful experience of deploying agents on unstructured, under-governed data.

·digetiers-dap.com·
Ontologies and Knowledge Graphs: Why They Work Better Together I d.AP Blog
LOINC Ontology Technical Release Notes – LOINC / SNOMED CT
LOINC Ontology Technical Release Notes – LOINC / SNOMED CT

THE LOINC ONTOLOGY 2.0: 41,000+ CLINICAL CONCEPTS NOW IN SEMANTIC FORM

Healthcare informatics reached a significant milestone with the September 2025 production release of the LOINC Ontology 2.0 — a formal semantic representation of LOINC terms developed jointly by Regenstrief Institute and SNOMED International. The release now covers over 41,000 concepts representing laboratory LOINC terms, with approximately 78% of concepts sufficiently defined and a new LABORDERS.ONTOLOGY class providing 2,500 grouper concepts for hierarchical navigation.

The ontology is distributed as a SNOMED CT extension in RF2 format via loincsnomed.org, which means it carries the formal semantics and maintenance discipline of the SNOMED ecosystem. For implementers, this matters: LOINC has historically been a relational dataset — an extremely well-maintained one, but without the formal class hierarchy, property axioms, and reasoning support that clinical decision systems increasingly require.

The collaboration between Regenstrief and SNOMED International that produced this release began with a 2013 partnership linking SNOMED’s clinical semantics to LOINC’s observational concepts, and formalised into an extension agreement in 2022. The March 2025 production release — covering the top 200 most-used lab tests first — and the September 2025 expansion represent the payoff of nearly a decade of alignment work.

For those building healthcare knowledge graphs, the LOINC Ontology provides a canonical semantic anchor for laboratory observations that was previously absent. The RF2 format requires conversion to OWL or RDF for direct use in a triplestore, but established pipelines for that conversion exist. This is now the right starting point for any clinical ontology stack that needs to model test orders and results formally.

·loincsnomed.org·
LOINC Ontology Technical Release Notes – LOINC / SNOMED CT
Agentic SPARQL: Evaluating SPARQL-MCP-powered Intelligent Agents...
Agentic SPARQL: Evaluating SPARQL-MCP-powered Intelligent Agents...

AGENTIC SPARQL: LLMS MEET FEDERATED KNOWLEDGE GRAPH QUERYING VIA MCP

One of the most technically significant papers of the spring quietly updated its second version in April. “Agentic SPARQL,” from researchers at arXiv, explores what happens when you combine SPARQL federation with the Model Context Protocol — giving LLM agents the ability to discover endpoints, explore schemas, decompose queries across distributed RDF datasets, and return grounded answers without hallucinating.

The architecture is elegant in its standardisation strategy. SPARQL already provides a native federation mechanism via the SERVICE operator, and endpoints can self-describe using VoID vocabulary. MCP provides the JSON-RPC bridge between the LLM’s tool-calling interface and the SPARQL endpoint registry. The combination means an agent can be handed a natural-language question, select the relevant endpoints itself, and issue a federated SPARQL query — without the query being hard-coded by a developer.

The paper extends the Spider4SPARQL benchmark towards a federated variant, cross-partitioning 19 RDF datasets across multiple endpoints in ways that require the agent to reason about data locality rather than simply executing pre-known queries. That is a realistic test of agentic behaviour, and the results are instructive about where endpoint discovery and schema exploration remain unsolved problems.

For those of us working at the intersection of RDF tooling and LLM integration, this paper is required reading. It suggests that SPARQL endpoints, far from being a legacy retrieval mechanism, are well-positioned as the native data layer for agentic AI — precisely because they combine a standardised query language, formal metadata, and federation in a single protocol stack.

·arxiv.org·
Agentic SPARQL: Evaluating SPARQL-MCP-powered Intelligent Agents...
Semantica 0.5.0 is out
Semantica 0.5.0 is out

SEMANTICA V2: 10× EMBEDDING CACHE PERFORMANCE AND SEMANTIC NEIGHBOURHOOD SEARCH

The Hawksight-AI team released Semantica v2 on 11 May, and the performance improvements are substantial. The headline figure — over 10× embedding cache performance — is achieved through per-session revision-based caching with thread-safe invalidation, addressing one of the most persistent bottlenecks in production knowledge graph systems that blend vector and symbolic retrieval.

The new Semantic Neighbourhood Search capability is conceptually interesting. Rather than returning a flat ranked list of similar entities, it provides context-aware similarity with proximity metrics — effectively treating the neighbourhood of a node as a structured semantic space rather than a distance function. That distinction matters for applications where the meaning of an entity depends on its immediate relational context rather than its isolated embedding.

The v2 release also improves entity extraction speed at up to 6.98× over the previous version, with improved deduplication strategies including Jaro-Winkler, semantic, and hybrid approaches. For pipelines that ingest free text and build knowledge graphs incrementally — precisely the use case that is becoming standard in enterprise RAG architectures — the deduplication improvement is as practically significant as the cache gains.

Semantica is positioning itself as an AI-native knowledge graph intelligence framework: semantic retrieval, ontology reasoning, context graphs, and explainable AI in a single integrated system. The v2 release suggests the team is taking the production performance story seriously, which distinguishes it from the many knowledge graph projects that are research-grade demonstrations rather than deployable infrastructure.

·linkedin.com·
Semantica 0.5.0 is out
Knowledge Graph Market to Surge from US$1.34 Billion in 2025 to US$19.16 Billion by 2033 as Enterprises Use GraphRAG to Ground AI, Reduce Hallucinations, and Scale Secure Copilots
Knowledge Graph Market to Surge from US$1.34 Billion in 2025 to US$19.16 Billion by 2033 as Enterprises Use GraphRAG to Ground AI, Reduce Hallucinations, and Scale Secure Copilots

The numbers are in, and they are striking. A new market analysis places the global knowledge graph sector at $1.34 billion in 2025, on a trajectory to $19.16 billion by 2033 — a compound annual growth rate of 30.8%. What is driving the surge? Enterprises are moving past the AI experimentation phase and into production deployments that require something vector search alone cannot provide: internal context, multi-hop reasoning, access-controlled querying, and the ability to explain how an answer was derived.

The shift is architectural. Organisations that spent the last two years evaluating generative AI pilots are now discovering that a model connected to a flat document store cannot answer the questions that actually matter — questions that cross customers, contracts, suppliers, transactions, and policies simultaneously. Knowledge graphs provide the semantic fabric that makes those cross-domain queries possible.

The US market, representing roughly 38–42% of global revenue, is itself expected to exceed $7 billion by 2033. The analysis notes a distinct shift in enterprise buying behaviour: procurement is no longer framed as “graph storage” but as “grounded enterprise AI with evidence, permissions, and multi-hop reasoning.” That repositioning matters. It signals that the semantic web community’s longstanding argument — that meaning precedes scale — has finally found its commercial moment.

For practitioners in ontology, SHACL, and RDF, the implication is straightforward: the infrastructure you have been building for a decade is now the thing enterprises are paying to acquire.

·openpr.com·
Knowledge Graph Market to Surge from US$1.34 Billion in 2025 to US$19.16 Billion by 2033 as Enterprises Use GraphRAG to Ground AI, Reduce Hallucinations, and Scale Secure Copilots
Most teams treat OWL and SHACL as design-phase paperwork. Ship the ontology, archive it, let the agent improvise at runtime. The teams shipping reliable agents do the opposite. The reasoner lives inside the loop.
Most teams treat OWL and SHACL as design-phase paperwork. Ship the ontology, archive it, let the agent improvise at runtime. The teams shipping reliable agents do the opposite. The reasoner lives inside the loop.
Most teams treat OWL and SHACL as design-phase paperwork. Ship the ontology, archive it, let the agent improvise at runtime. ⭕️ The teams shipping reliable agents do the opposite. The reasoner lives inside the loop. An agent proposes an action: cancel a reservation, modify a record, write a triple. Before execution, the harness calls an automated reasoner against the OWL ontology and a SHACL shape graph. Three possible answers. PROOF it respects policy. PROOF it violates a constraint. UNDEFINED. PROOF answers are derivations a domain expert can read line by line. This is what Amazon Neptune demonstrated at KGC 2026. KG-based validation as a steering hook for agentic guardrails. The outcome steers the next step: execute, generate an alternative, or escalate. The same shape graph governs design-time ontology review and runtime writes. The graph is the terrain, the LLM is the explorer. The explorer walks a network with structure. Every step is constrained by the edges that exist and the typed relationships permitted. That structure is what makes agent reasoning inspectable by a clinician, an auditor, a regulator. The equation that lands: OWL Inference × SHACL Validation × Mid-Loop Call = Pre-Mutation Proof LLMs got better at function calling. They did not get better at proving that a function call respects the domain model. AWS Automated Reasoning Checks, Amazon Neptune AgentCore guardrails, OWL DL reasoners, SHACL 1.2 rules. These compose with LLM generation to produce a verified output, not a hopeful one. The cost equation favors it. A reasoner call before every mutating action raises per-step cost. It lowers the voting budget for million-step reliability by an order of magnitude, because invalid candidates leave the pool before the vote. Five layers ship in production. T-Box is the agent's world model. A-Box is its knowledge base. Inference materializes what the agent implicitly knows. S-Box gates writes. Reports give the auditor the line-by-line trail. The 2024 substitution question is dead. "Will LLMs replace knowledge graphs" presupposed competition between layers that compose. The 2026 question is what structure the LLM needs before any team trusts it with a regulated decision. The answer sits in twenty years of W3C standards, finally reaching the runtime. Build the ontology. Wire the reasoner into the harness. Validate every mutating action against the shape graph. Ship what survives all three.
Most teams treat OWL and SHACL as design-phase paperwork. Ship the ontology, archive it, let the agent improvise at runtime. ⭕️The teams shipping reliable agents do the opposite. The reasoner lives inside the loop.
·linkedin.com·
Most teams treat OWL and SHACL as design-phase paperwork. Ship the ontology, archive it, let the agent improvise at runtime. The teams shipping reliable agents do the opposite. The reasoner lives inside the loop.