<?xml version="1.0" encoding="UTF-8"?><rss xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:atom="http://www.w3.org/2005/Atom" version="2.0" xmlns:itunes="http://www.itunes.com/dtds/podcast-1.0.dtd" xmlns:googleplay="http://www.google.com/schemas/play-podcasts/1.0"><channel><title><![CDATA[Rob Hanna, Precision Content]]></title><description><![CDATA[I've been in the structured authoring and information architecture space for three decades and witnessed tremendous growth around the discipline of writing to a single source of truth.]]></description><link>https://www.trustinyourcontent.com</link><image><url>https://substackcdn.com/image/fetch/$s_!dMie!,w_256,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F0b0cf2c9-c03c-41de-8c1d-d252e69c03ba_748x748.png</url><title>Rob Hanna, Precision Content</title><link>https://www.trustinyourcontent.com</link></image><generator>Substack</generator><lastBuildDate>Wed, 03 Jun 2026 13:19:55 GMT</lastBuildDate><atom:link href="https://www.trustinyourcontent.com/feed" rel="self" type="application/rss+xml"/><copyright><![CDATA[Rob Hanna]]></copyright><language><![CDATA[en]]></language><webMaster><![CDATA[singlesourceror@substack.com]]></webMaster><itunes:owner><itunes:email><![CDATA[singlesourceror@substack.com]]></itunes:email><itunes:name><![CDATA[Rob Hanna, Precision Content]]></itunes:name></itunes:owner><itunes:author><![CDATA[Rob Hanna, Precision Content]]></itunes:author><googleplay:owner><![CDATA[singlesourceror@substack.com]]></googleplay:owner><googleplay:email><![CDATA[singlesourceror@substack.com]]></googleplay:email><googleplay:author><![CDATA[Rob Hanna, Precision Content]]></googleplay:author><itunes:block><![CDATA[Yes]]></itunes:block><item><title><![CDATA[Derivative Reuse at Scale: The Core-Exception Model for DITA Structured Content]]></title><description><![CDATA[How to preserve reuse, reduce conditional complexity, and make high-variant publications manageable]]></description><link>https://www.trustinyourcontent.com/p/derivative-reuse-at-scale-the-core</link><guid isPermaLink="false">https://www.trustinyourcontent.com/p/derivative-reuse-at-scale-the-core</guid><dc:creator><![CDATA[Rob Hanna, Precision Content]]></dc:creator><pubDate>Sun, 31 May 2026 18:46:50 GMT</pubDate><enclosure url="https://substackcdn.com/image/fetch/$s_!jemm!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fd4e15427-bd59-47ff-bedf-6076bb470edb_1920x1081.png" length="0" type="image/jpeg"/><content:encoded><![CDATA[<div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!jemm!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fd4e15427-bd59-47ff-bedf-6076bb470edb_1920x1081.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!jemm!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fd4e15427-bd59-47ff-bedf-6076bb470edb_1920x1081.png 424w, https://substackcdn.com/image/fetch/$s_!jemm!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fd4e15427-bd59-47ff-bedf-6076bb470edb_1920x1081.png 848w, https://substackcdn.com/image/fetch/$s_!jemm!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fd4e15427-bd59-47ff-bedf-6076bb470edb_1920x1081.png 1272w, https://substackcdn.com/image/fetch/$s_!jemm!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fd4e15427-bd59-47ff-bedf-6076bb470edb_1920x1081.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!jemm!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fd4e15427-bd59-47ff-bedf-6076bb470edb_1920x1081.png" width="1456" height="820" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/d4e15427-bd59-47ff-bedf-6076bb470edb_1920x1081.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:820,&quot;width&quot;:1456,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:2349018,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:&quot;image/png&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:false,&quot;topImage&quot;:true,&quot;internalRedirect&quot;:&quot;https://www.trustinyourcontent.com/i/200011073?img=https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fd4e15427-bd59-47ff-bedf-6076bb470edb_1920x1081.png&quot;,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!jemm!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fd4e15427-bd59-47ff-bedf-6076bb470edb_1920x1081.png 424w, https://substackcdn.com/image/fetch/$s_!jemm!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fd4e15427-bd59-47ff-bedf-6076bb470edb_1920x1081.png 848w, https://substackcdn.com/image/fetch/$s_!jemm!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fd4e15427-bd59-47ff-bedf-6076bb470edb_1920x1081.png 1272w, https://substackcdn.com/image/fetch/$s_!jemm!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fd4e15427-bd59-47ff-bedf-6076bb470edb_1920x1081.png 1456w" sizes="100vw" fetchpriority="high"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div><p>Derivative reuse has always been one of the most practical&#8212;and one of the most dangerous&#8212;forms of content reuse. It begins with a sensible premise: use an existing content component wherever possible, but allow limited changes when the reuse context requires them. In <em>Managing Enterprise Content</em>, Ann Rockley and The Rockley Group distinguish derivative reuse from locked reuse by allowing reused content to change while retaining a relationship to the original content source. In principle, derivative reuse gives content teams the best of both worlds: reuse without forcing false sameness, and variation without losing traceability. [1]</p><p>In practice, derivative reuse can become one of the hardest reuse patterns to govern. The problem is not that derivatives exist. The problem is that derivatives often accumulate at the wrong level of granularity. When a sentence, word, phrase, or even a character varies by product, jurisdiction, market, customer, audience, or effective date, authors often respond by embedding more and more conditional logic directly inside the content. Over time, the source becomes less like an authored topic and more like a dense programming artifact. The content may still publish correctly, but it becomes increasingly difficult for authors, reviewers, subject matter experts, translators, and auditors to understand what the source actually says.</p><div class="subscription-widget-wrap-editor" data-attrs="{&quot;url&quot;:&quot;https://www.trustinyourcontent.com/subscribe?&quot;,&quot;text&quot;:&quot;Subscribe&quot;,&quot;language&quot;:&quot;en&quot;}" data-component-name="SubscribeWidgetToDOM"><div class="subscription-widget show-subscribe"><div class="preamble"><p class="cta-caption">Thanks for reading! Subscribe for free to receive new posts and support my work.</p></div><form class="subscription-widget-subscribe"><input type="email" class="email-input" name="email" placeholder="Type your email&#8230;" tabindex="-1"><input type="submit" class="button primary" value="Subscribe"><div class="fake-input-wrapper"><div class="fake-input"></div><div class="fake-button"></div></div></form></div></div><p>This article describes a field-tested approach for managing derivative reuse in high-variant structured content environments: the Core-Exception model for DITA content. The model was developed at NCCI (National Council for Compensation Insurance) to support a complex, state-specific publication with dozens of variants, frequent regulatory updates, strict traceability requirements, and a need to move from a manual exception-compendium model to state-specific publishing at build time. The approach was later presented publicly in the ConVEx conference session <em>On the Road to Boca Raton</em>, delivered in Tempe, Arizona, on May 4, 2022, where the architectural principles, publishing model, and implementation lessons were shared with the broader DITA community. [7][8]</p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!L1BU!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F81806594-2f23-4934-96b1-b82fb5099ef4_1448x1086.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!L1BU!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F81806594-2f23-4934-96b1-b82fb5099ef4_1448x1086.png 424w, https://substackcdn.com/image/fetch/$s_!L1BU!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F81806594-2f23-4934-96b1-b82fb5099ef4_1448x1086.png 848w, https://substackcdn.com/image/fetch/$s_!L1BU!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F81806594-2f23-4934-96b1-b82fb5099ef4_1448x1086.png 1272w, https://substackcdn.com/image/fetch/$s_!L1BU!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F81806594-2f23-4934-96b1-b82fb5099ef4_1448x1086.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!L1BU!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F81806594-2f23-4934-96b1-b82fb5099ef4_1448x1086.png" width="1448" height="1086" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/81806594-2f23-4934-96b1-b82fb5099ef4_1448x1086.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:1086,&quot;width&quot;:1448,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:1355621,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:&quot;image/png&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:false,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:&quot;https://singlesourceror.substack.com/i/200011073?img=https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F81806594-2f23-4934-96b1-b82fb5099ef4_1448x1086.png&quot;,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!L1BU!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F81806594-2f23-4934-96b1-b82fb5099ef4_1448x1086.png 424w, https://substackcdn.com/image/fetch/$s_!L1BU!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F81806594-2f23-4934-96b1-b82fb5099ef4_1448x1086.png 848w, https://substackcdn.com/image/fetch/$s_!L1BU!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F81806594-2f23-4934-96b1-b82fb5099ef4_1448x1086.png 1272w, https://substackcdn.com/image/fetch/$s_!L1BU!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F81806594-2f23-4934-96b1-b82fb5099ef4_1448x1086.png 1456w" sizes="100vw"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div><p>The central idea is simple: stop placing most conditional logic inside the smallest pieces of prose. Instead, move the condition to a managed block of microcontent. One block represents the core rule that applies to most contexts. Zero or more additional blocks represent exceptions that apply to specific contexts. Publishing logic selects an applicable exception when one exists; otherwise, it falls back to the core.</p><p>The result is not the elimination of conditional content. It is the disciplined containment of conditional content.</p><h2>Derivative reuse and the granularity trap</h2><p>Rockley&#8217;s reuse model treats derivative reuse as a legitimate reuse option. A derivative is created when reused content is modified for a new context, while the relationship to the original is preserved. This is valuable when the substance of the content is shared but the expression must differ: spelling, tense, order, emphasis, regional needs, translation, or other contextual variation may justify a derivative. [1]</p><p>The difficulty begins when derivative reuse is implemented as a proliferation of tiny conditional fragments. DITA provides powerful mechanisms for profiling and conditional processing. Conditional attributes can be used to include, exclude, or flag content during processing, typically through DITAVAL profiles. [3] These mechanisms are essential for structured publishing. They make it possible to produce multiple deliverables from a common source.</p><p>But the same mechanism that works elegantly for a topic, section, paragraph, or reusable component can become fragile when applied repeatedly inside sentences. A phrase conditioned for five audiences may be manageable. A sentence containing conditions for dozens of jurisdictions, effective dates, business rules, and approval states is not. Multiply that pattern across hundreds or thousands of topics and the source becomes difficult to review, difficult to audit, and dangerous to modify.</p><p>This is the granularity trap: the content team chooses reuse at a level so small that the management cost begins to exceed the reuse benefit.</p><p>At that point, the content architecture needs a different pattern. The question is no longer &#8220;Can this word be reused?&#8221; The better question is &#8220;At what level should variation be governed?&#8221;</p><h2>The NCCI challenge: many variants, high traceability, frequent updates</h2><p>The NCCI use case involved a large policy-type publication of approximately 800 pages, chunked into numbered sections. The publication had historically been maintained through a manual workflow. A national version of the rules was distributed along with a compendium of state exceptions. Clients then had to use both the national content and the applicable exception material to determine how the rules applied in a particular state. [7]</p><p>The publication was not static. It required incremental updates four to six times per year. Some content applied only to certain states. Some content was excluded for certain states. Other content was substantively the same across states but differed in wording. During updates, impacted numbered sections were revised and sent to state regulators for review and approval. Regulators could approve, reject, or approve with changes, and each regulator worked on its own timeline. [7]</p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!d4GA!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F6e020b5f-066e-4282-a8f1-b21c1a448a6f_1448x1086.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!d4GA!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F6e020b5f-066e-4282-a8f1-b21c1a448a6f_1448x1086.png 424w, https://substackcdn.com/image/fetch/$s_!d4GA!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F6e020b5f-066e-4282-a8f1-b21c1a448a6f_1448x1086.png 848w, https://substackcdn.com/image/fetch/$s_!d4GA!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F6e020b5f-066e-4282-a8f1-b21c1a448a6f_1448x1086.png 1272w, https://substackcdn.com/image/fetch/$s_!d4GA!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F6e020b5f-066e-4282-a8f1-b21c1a448a6f_1448x1086.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!d4GA!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F6e020b5f-066e-4282-a8f1-b21c1a448a6f_1448x1086.png" width="1448" height="1086" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/6e020b5f-066e-4282-a8f1-b21c1a448a6f_1448x1086.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:1086,&quot;width&quot;:1448,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:1786810,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:&quot;image/png&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:&quot;https://singlesourceror.substack.com/i/200011073?img=https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F6e020b5f-066e-4282-a8f1-b21c1a448a6f_1448x1086.png&quot;,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!d4GA!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F6e020b5f-066e-4282-a8f1-b21c1a448a6f_1448x1086.png 424w, https://substackcdn.com/image/fetch/$s_!d4GA!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F6e020b5f-066e-4282-a8f1-b21c1a448a6f_1448x1086.png 848w, https://substackcdn.com/image/fetch/$s_!d4GA!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F6e020b5f-066e-4282-a8f1-b21c1a448a6f_1448x1086.png 1272w, https://substackcdn.com/image/fetch/$s_!d4GA!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F6e020b5f-066e-4282-a8f1-b21c1a448a6f_1448x1086.png 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div><p>Traceability was not optional. The organization needed to track historical, current, approved future, and proposed changes. Some content could be approved but not take effect for six to eighteen months. Some historical versions remained relevant because of court cases, regulatory interpretation, or future product contracts. [7]</p><p>In this environment, ordinary conditional processing at the phrase level would have produced an authoring and governance problem. With 37 state clients at the time, and the possibility of more in the future, the content team needed a way to publish state-specific versions without forcing authors to manage dozens of inline conditions inside every affected topic.</p><p>As described in <em>On the Road to Boca Raton</em>, the challenge was not simply one of publishing multiple outputs. It was a challenge of balancing reuse, regulatory variation, workflow, version control, and long-term maintainability. The solution was to combine DITA, microcontent, structured writing, and conditional publishing into a more governable architecture. [8]</p><h2>From topics to microcontent blocks</h2><p>The first architectural move was to break each numbered section into a block of microcontent. Because workflow, approval, versioning, and traceability needed to be managed in the component content management system, each microcontent block was managed as a topic. [7]</p><p>This had an important consequence: the &#8220;topic&#8221; no longer represented a traditional, full-bodied unit of exposition. Instead, it became a managed block of structured information. The team applied strict writing and titling rules aligned with Precision Content practices and typed each block according to its information purpose: concept, task, process, rule, or reference. [7]</p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!j3Mi!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F5897ec9a-88dc-4dea-935b-4d50609001e8_1448x1086.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!j3Mi!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F5897ec9a-88dc-4dea-935b-4d50609001e8_1448x1086.png 424w, https://substackcdn.com/image/fetch/$s_!j3Mi!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F5897ec9a-88dc-4dea-935b-4d50609001e8_1448x1086.png 848w, https://substackcdn.com/image/fetch/$s_!j3Mi!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F5897ec9a-88dc-4dea-935b-4d50609001e8_1448x1086.png 1272w, https://substackcdn.com/image/fetch/$s_!j3Mi!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F5897ec9a-88dc-4dea-935b-4d50609001e8_1448x1086.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!j3Mi!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F5897ec9a-88dc-4dea-935b-4d50609001e8_1448x1086.png" width="1448" height="1086" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/5897ec9a-88dc-4dea-935b-4d50609001e8_1448x1086.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:1086,&quot;width&quot;:1448,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:1299181,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:&quot;image/png&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:&quot;https://singlesourceror.substack.com/i/200011073?img=https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F5897ec9a-88dc-4dea-935b-4d50609001e8_1448x1086.png&quot;,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!j3Mi!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F5897ec9a-88dc-4dea-935b-4d50609001e8_1448x1086.png 424w, https://substackcdn.com/image/fetch/$s_!j3Mi!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F5897ec9a-88dc-4dea-935b-4d50609001e8_1448x1086.png 848w, https://substackcdn.com/image/fetch/$s_!j3Mi!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F5897ec9a-88dc-4dea-935b-4d50609001e8_1448x1086.png 1272w, https://substackcdn.com/image/fetch/$s_!j3Mi!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F5897ec9a-88dc-4dea-935b-4d50609001e8_1448x1086.png 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div><p>This decision also aligned with the broader logic of information typing in DITA. DITA uses information types such as concept, task, and reference to distinguish different kinds of information and to help authors keep content focused, modular, searchable, navigable, and reusable. [5] Precision Content similarly emphasizes modular, reusable, typed content, organized through repeatable patterns and governed standards. [6]</p><p>The key architectural insight was that the standard topic body structure could be too heavy for this use case. Instead of authoring a large task topic with context, prerequisites, steps, results, and related supporting information inside one body, the team decomposed those structures into separately managed microcontent topics. The body was removed from the topic design, and the key semantic structures were moved into a specialization of the topic abstract for each information type. [7]</p><p>As discussed in the ConVEx presentation, this shift represented a move away from thinking of topics as documents and toward thinking of topics as governed information assets. The architecture emphasized management, reuse, and traceability at the microcontent level rather than at the publication level. [8]</p><p>That move made the block&#8212;not the paragraph, phrase, or character&#8212;the primary unit of governance.</p><h2>The Core-Exception model</h2><p>Once the content was managed as typed microcontent blocks, the team could address derivative reuse at the correct level. Instead of embedding state-specific conditions inside a single block of prose, the team organized each affected section around a Core-Exception pattern. [7]</p><p>The topic model has three main components:</p><ol><li><p>A typed container topic that holds the title and prolog for the topic blocks.</p></li><li><p>A Core Block: an embedded microcontent topic block representing the content that applies to the majority of states.</p></li><li><p>Zero or more Exception Blocks: embedded microcontent topic blocks representing content that applies to one or more specific states.</p></li></ol><p>Each Exception Block carries the conditions for the states to which it applies. The publishing logic looks for exceptions first. If an exception matches the state being published, that exception is included and the Core Block is suppressed. If no exception matches, the Core Block is included. Non-applicable Exception Blocks are excluded. [7]</p><p>This changes the authoring model dramatically. Without the Core-Exception model, a topic serving 37 states might require many conditional fragments inside the same source topic. With the Core-Exception model, the topic contains one core expression and only the exceptions that actually exist. If 31 states use the core wording and six states require changes, the author manages one Core Block and six Exception Blocks&#8212;not 37 parallel variants. [7]</p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!UIbW!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F2b5ce3ce-1969-467e-b6a9-016f7a3518b6_1448x1086.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!UIbW!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F2b5ce3ce-1969-467e-b6a9-016f7a3518b6_1448x1086.png 424w, https://substackcdn.com/image/fetch/$s_!UIbW!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F2b5ce3ce-1969-467e-b6a9-016f7a3518b6_1448x1086.png 848w, https://substackcdn.com/image/fetch/$s_!UIbW!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F2b5ce3ce-1969-467e-b6a9-016f7a3518b6_1448x1086.png 1272w, https://substackcdn.com/image/fetch/$s_!UIbW!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F2b5ce3ce-1969-467e-b6a9-016f7a3518b6_1448x1086.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!UIbW!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F2b5ce3ce-1969-467e-b6a9-016f7a3518b6_1448x1086.png" width="1448" height="1086" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/2b5ce3ce-1969-467e-b6a9-016f7a3518b6_1448x1086.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:1086,&quot;width&quot;:1448,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:1273177,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:&quot;image/png&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:&quot;https://singlesourceror.substack.com/i/200011073?img=https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F2b5ce3ce-1969-467e-b6a9-016f7a3518b6_1448x1086.png&quot;,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!UIbW!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F2b5ce3ce-1969-467e-b6a9-016f7a3518b6_1448x1086.png 424w, https://substackcdn.com/image/fetch/$s_!UIbW!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F2b5ce3ce-1969-467e-b6a9-016f7a3518b6_1448x1086.png 848w, https://substackcdn.com/image/fetch/$s_!UIbW!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F2b5ce3ce-1969-467e-b6a9-016f7a3518b6_1448x1086.png 1272w, https://substackcdn.com/image/fetch/$s_!UIbW!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F2b5ce3ce-1969-467e-b6a9-016f7a3518b6_1448x1086.png 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div><p>An important characteristic of the model is that most content requires only a Core Block. Exception Blocks are created only when a meaningful variation exists. As a result, the majority of topics can remain simple and free of conditional complexity, while still supporting extensive variation where necessary.</p><p>The ConVEx presentation emphasized that the architecture intentionally treats exceptions as managed content objects rather than conditional fragments. This distinction is critical because it preserves readability, reviewability, and governance while still leveraging DITA&#8217;s conditional processing capabilities. [8]</p><p>The pattern also scales more gracefully. When a new state is added, the team does not need to touch every topic. Only the sections that differ for that state require new exception blocks. [7]</p><h2>Exception-first publishing logic</h2><p>The Core-Exception model depends on clear publishing logic. The processor must evaluate exception applicability before selecting the core. Conceptually, the rule is:</p><ul><li><p>If a matching exception exists, publish the exception, and</p><ul><li><p>Exclude the core and non-matching exceptions.</p></li></ul></li><li><p>Else if no matching exception exists, publish the core, and</p><ul><li><p>Exclude all exceptions.</p></li></ul></li><li><p>Else if the matching block is marked as not applicable, exclude the topic, and</p><ul><li><p>Exclude its dependent child topics in the map.</p></li></ul></li></ul><p>This final point is important. The model was extended beyond individual topics into the publication map. If a topic was excluded according to the Core-Exception rules, subsequent child topics in the map under that excluded topic were also excluded. [7]</p><p>DITA already supports conditional processing through profiling attributes and DITAVAL, and DITA branch filtering allows filtering conditions to be applied to specific branches of a map rather than only globally. [3][4] The Core-Exception model can be understood as a business architecture and authoring governance layer built on top of such conditional publishing concepts. It gives authors a predictable pattern for where variation belongs and gives publishing logic a deterministic way to resolve that variation.</p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!O91d!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ffb75919d-8206-4c00-8aba-b734b24f233b_1448x1086.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!O91d!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ffb75919d-8206-4c00-8aba-b734b24f233b_1448x1086.png 424w, https://substackcdn.com/image/fetch/$s_!O91d!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ffb75919d-8206-4c00-8aba-b734b24f233b_1448x1086.png 848w, https://substackcdn.com/image/fetch/$s_!O91d!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ffb75919d-8206-4c00-8aba-b734b24f233b_1448x1086.png 1272w, https://substackcdn.com/image/fetch/$s_!O91d!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ffb75919d-8206-4c00-8aba-b734b24f233b_1448x1086.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!O91d!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ffb75919d-8206-4c00-8aba-b734b24f233b_1448x1086.png" width="1448" height="1086" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/fb75919d-8206-4c00-8aba-b734b24f233b_1448x1086.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:1086,&quot;width&quot;:1448,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:1440823,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:&quot;image/png&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:&quot;https://singlesourceror.substack.com/i/200011073?img=https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ffb75919d-8206-4c00-8aba-b734b24f233b_1448x1086.png&quot;,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!O91d!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ffb75919d-8206-4c00-8aba-b734b24f233b_1448x1086.png 424w, https://substackcdn.com/image/fetch/$s_!O91d!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ffb75919d-8206-4c00-8aba-b734b24f233b_1448x1086.png 848w, https://substackcdn.com/image/fetch/$s_!O91d!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ffb75919d-8206-4c00-8aba-b734b24f233b_1448x1086.png 1272w, https://substackcdn.com/image/fetch/$s_!O91d!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ffb75919d-8206-4c00-8aba-b734b24f233b_1448x1086.png 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div><p>As demonstrated in <em>On the Road to Boca Raton</em>, the publishing framework effectively creates a controlled inheritance model in which the core serves as the default expression and exceptions override that default only when applicable. This exception-first approach significantly reduces the amount of conditional markup required in source content. [8]</p><h2>Why the model works</h2><p>The Core-Exception model works because it separates three concerns that are often tangled together:</p><ul><li><p>the stable identity of the content unit</p></li><li><p>the default expression of the content, and</p></li><li><p>the variant expressions required for specific conditions.</p></li></ul><p>The typed container topic preserves the identity of the numbered section. The Core Block represents the default content. Exception Blocks represent controlled deviations from the default. Conditions are attached to the exception blocks rather than scattered throughout the prose.</p><p>This makes the content easier to reason about. Reviewers can compare the core and the exception directly. Authors can see whether a state has a true exception or simply inherits the core. Information architects can govern the number and type of exceptions. Publishing engineers can implement a clear fallback rule. Auditors can trace which block appeared in which state-specific output and why.</p><p>The model also protects readability in the source. Authors are no longer forced to read a sentence interrupted by multiple conditional fragments. Instead, they read complete, coherent blocks. This is especially important when reviewers are regulators, legal experts, or business stakeholders who must approve the meaning of the content, not the cleverness of the markup.</p><h2>Relationship to profiling and conditional content</h2><p>It would be a mistake to present the Core-Exception model as an alternative to DITA profiling. It is better understood as a disciplined use of profiling.</p><p>DITA conditional processing is designed to filter or flag content based on processing-time criteria. It allows attributes such as audience, platform, delivery target, props, and specialized props to determine whether elements are included, excluded, or flagged during processing. DITAVAL defines the conditional-processing profile. [3]</p><p>The Core-Exception model uses that general capability but changes where the conditions live. Instead of conditioning many small fragments inside a topic, it conditions whole microcontent blocks. This creates a more governable relationship between reuse and variation.</p><p>In simple publications, inline profiling may be entirely appropriate. If a short phrase differs for two products, an inline condition may be the simplest solution. But as the number of variants rises, the burden shifts. The question becomes not &#8220;Can the tool support this?&#8221; but &#8220;Can people safely understand, review, and maintain this over time?&#8221;</p><p>The ConVEx presentation explicitly positioned the Core-Exception model as a response to the limitations of highly granular conditional content. Rather than replacing profiling, the model establishes architectural boundaries around where profiling should be applied and how variation should be represented. [8]</p><p>The Core-Exception model is most useful when:</p><ul><li><p>variants are numerous</p></li><li><p>exceptions are sparse relative to the full variant set</p></li><li><p>reviewers need to approve complete statements</p></li><li><p>traceability and effective dating are important</p></li><li><p>the publication has a stable logical structure</p></li><li><p>new variants may be added over time, and</p></li><li><p>differences are meaningful enough to deserve block-level governance.</p></li></ul><h2>Change tracking and version baselines</h2><p>The final major challenge in the NCCI implementation was change tracking. Each state could potentially be associated with different versions of the same section. Some states might have no change between releases. Others might have substantial changes. Some approved changes might not yet be effective. [7]</p><p>The Core-Exception model supports this complexity because each microcontent block is managed as a discrete topic in the CCMS. That means the system can version, review, approve, and baseline the Core Block and each Exception Block independently. Rather than treating a large topic as a monolithic unit with hidden internal variation, the architecture exposes the meaningful units of change.</p><p>This has several governance benefits. First, it allows the organization to know exactly which block was active for which state at which time. Second, it enables review workflows to focus only on affected blocks. Third, it reduces the risk that a change intended for one state will inadvertently affect another. Fourth, it gives auditors a clearer record of the relationship between source content, approval status, effective date, and published output.</p><p>For regulatory and policy content, this is often more important than authoring efficiency. Faster publishing matters, but trustworthy publishing matters more.</p><h2>Design principles for adopting the Core-Exception model</h2><p>Organizations considering this pattern should treat it as a content architecture decision, not merely a publishing customization. The following principles are essential.</p><h3>1. Establish the correct unit of variation</h3><p>Do not begin by asking how small a reusable element can be. Begin by asking what unit reviewers, approvers, and auditors need to understand. In many high-variant environments, the right unit is not the word or phrase. It is the rule, requirement, procedure, condition, or statement.</p><h3>2. Make the core explicit</h3><p>The core is not &#8220;everything that remains after exceptions are removed.&#8221; It is an authored block that represents the majority case. It should be complete, reviewable, and valid on its own.</p><h3>3. Treat exceptions as first-class content</h3><p>An exception should not be a hidden fragment. It should be a managed content block with metadata, ownership, review status, version history, effective dates, and applicability conditions.</p><h3>4. Use exception-first resolution</h3><p>Publishing logic must look for applicable exceptions before falling back to the core. Without this rule, exceptions become unpredictable and authors lose confidence in the model.</p><h3>5. Avoid overusing null exceptions</h3><p>The NCCI model included triggers to indicate that a block was not applicable as a core or as a state exception. This is powerful, but it should be governed carefully. &#8220;Not applicable&#8221; is a business statement, not just a publishing trick.</p><h3>6. Extend the rule to the map when needed</h3><p>If excluding a parent topic leaves child topics without context, exclusion logic must cascade through the map. Otherwise, the publication may contain orphaned content.</p><h3>7. Keep authors out of unnecessary logic</h3><p>The authoring experience should make the model visible without making authors think like build engineers. Authors should understand core, exception, applicability, and effective date. They should not need to mentally simulate the publishing pipeline.</p><h2>Conclusion: reuse needs architecture</h2><p>Derivative reuse is not a mistake. It is an inevitable requirement in enterprise content. Real organizations serve multiple products, jurisdictions, channels, audiences, and timeframes. Content must vary.</p><p>The danger is unmanaged variation. When derivatives are handled through copy-and-paste, they drift. When they are handled through excessive inline conditioning, they become unreadable. When they are handled as governed, typed, block-level exceptions, they can remain reusable, reviewable, traceable, and publishable.</p><p>The Core-Exception model demonstrates that the solution to derivative reuse at scale is not simply &#8220;more reuse&#8221; or &#8220;more conditions.&#8221; The solution is better content architecture. By moving variation to the block level, preserving a clear core, treating exceptions as managed microcontent, and resolving applicability at publishing time, organizations can support complex derivative reuse without sacrificing authoring clarity or governance control.</p><p>The experience documented by NCCI and shared publicly in <em>On the Road to Boca Raton</em> reinforces a broader lesson for structured content practitioners: successful reuse is not merely a technical capability. It is an architectural discipline that requires thoughtful decisions about granularity, governance, workflow, and publishing behavior. [8]</p><p>For high-variant DITA publications, especially in regulated or policy-driven environments, the Core-Exception model offers a practical path forward: write once where content is truly common, vary only where variation is real, and make every exception visible enough to govern.</p><h2>References</h2><p>[1] Ann Rockley and The Rockley Group, &#8220;Fundamental Concepts of Reuse,&#8221; excerpt from <em>Managing Enterprise Content: A Unified Content Strategy</em>.</p><p>[2] IBM, &#8220;Content Reuse,&#8221; technical content practices documentation.</p><p>[3] OASIS, DITA 1.3 Architectural Specification, &#8220;Conditional processing (profiling).&#8221;</p><p>[4] OASIS, DITA 1.3 Architectural Specification, &#8220;Branch filtering.&#8221;</p><p>[5] DITA language specification, &#8220;Information typing.&#8221;</p><p>[6] Precision Content, &#8220;The Precision Content Method.&#8221;</p><p>[7] Internal NCCI source document, &#8220;Derivative Content Reuse: The Core-Exception Model.&#8221;</p><p>[8] NCCI, &#8220;On the Road to Boca Raton,&#8221; presentation delivered at ConVEx Conference, Tempe, Arizona, May 4, 2022.</p><div><hr></div><h1>For more information</h1><p>Visit our site at www.precisioncontent.com to download our <a href="https://www.precisioncontent.com/wp-content/uploads/On-the-road-to-Boca-Raton.pdf">ConVEx presentation</a> and <a href="https://www.precisioncontent.com/wp-content/uploads/Precision-Content_NCCI-WhitePaper_web.pdf">white paper</a>.</p><div class="subscription-widget-wrap-editor" data-attrs="{&quot;url&quot;:&quot;https://www.trustinyourcontent.com/subscribe?&quot;,&quot;text&quot;:&quot;Subscribe&quot;,&quot;language&quot;:&quot;en&quot;}" data-component-name="SubscribeWidgetToDOM"><div class="subscription-widget show-subscribe"><div class="preamble"><p class="cta-caption">Thanks for reading! Subscribe for free to receive new posts and support my work.</p></div><form class="subscription-widget-subscribe"><input type="email" class="email-input" name="email" placeholder="Type your email&#8230;" tabindex="-1"><input type="submit" class="button primary" value="Subscribe"><div class="fake-input-wrapper"><div class="fake-input"></div><div class="fake-button"></div></div></form></div></div>]]></content:encoded></item><item><title><![CDATA[From Performance Support to Intelligent Assistance]]></title><description><![CDATA[Why Structured Content and the 5 Moments of Need&#174; Are Foundational to an AI-Ready Future]]></description><link>https://www.trustinyourcontent.com/p/from-performance-support-to-intelligent</link><guid isPermaLink="false">https://www.trustinyourcontent.com/p/from-performance-support-to-intelligent</guid><dc:creator><![CDATA[Rob Hanna, Precision Content]]></dc:creator><pubDate>Fri, 15 May 2026 18:26:15 GMT</pubDate><enclosure url="https://substack-post-media.s3.amazonaws.com/public/images/f2e74c00-f220-49df-803e-4c2d2a935913_1200x630.png" length="0" type="image/jpeg"/><content:encoded><![CDATA[<p>Artificial intelligence is rapidly changing how people interact with information at work. Search boxes are giving way to conversations. Documentation is being replaced by answers. Employees increasingly expect systems not just to retrieve information, but to <strong>understand intent, context, and urgency</strong>.</p><p>Yet beneath the excitement lies a hard truth: <em>AI does not fix content problems&#8212;it exposes them.</em></p><div class="subscription-widget-wrap-editor" data-attrs="{&quot;url&quot;:&quot;https://www.trustinyourcontent.com/subscribe?&quot;,&quot;text&quot;:&quot;Subscribe&quot;,&quot;language&quot;:&quot;en&quot;}" data-component-name="SubscribeWidgetToDOM"><div class="subscription-widget show-subscribe"><div class="preamble"><p class="cta-caption">Thanks for reading! Subscribe for free to receive new posts and support my work.</p></div><form class="subscription-widget-subscribe"><input type="email" class="email-input" name="email" placeholder="Type your email&#8230;" tabindex="-1"><input type="submit" class="button primary" value="Subscribe"><div class="fake-input-wrapper"><div class="fake-input"></div><div class="fake-button"></div></div></form></div></div><p>AI systems can only be as effective as the content they rely on. When content is ambiguous, bloated, inconsistent, or poorly structured, AI simply delivers confusion faster. When content is precise, modular, and semantically clear, AI becomes transformative.</p><p>This is where the Five Moments of Need&#174; and structured authoring intersect&#8212;not as complementary ideas, but as <em>mutually dependent enablers of intelligent delivery</em>.</p><h1>The 5 Moments of Need&#174;: Context is the missing piece</h1><div class="captioned-image-container"><figure><a class="image-link image2" target="_blank" href="https://substackcdn.com/image/fetch/$s_!HHGO!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ff6b5e0b4-6deb-4d84-9c4f-ff9fe5c2aab9_110x110.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!HHGO!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ff6b5e0b4-6deb-4d84-9c4f-ff9fe5c2aab9_110x110.png 424w, https://substackcdn.com/image/fetch/$s_!HHGO!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ff6b5e0b4-6deb-4d84-9c4f-ff9fe5c2aab9_110x110.png 848w, https://substackcdn.com/image/fetch/$s_!HHGO!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ff6b5e0b4-6deb-4d84-9c4f-ff9fe5c2aab9_110x110.png 1272w, https://substackcdn.com/image/fetch/$s_!HHGO!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ff6b5e0b4-6deb-4d84-9c4f-ff9fe5c2aab9_110x110.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!HHGO!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ff6b5e0b4-6deb-4d84-9c4f-ff9fe5c2aab9_110x110.png" width="116" height="116" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/f6b5e0b4-6deb-4d84-9c4f-ff9fe5c2aab9_110x110.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:110,&quot;width&quot;:110,&quot;resizeWidth&quot;:116,&quot;bytes&quot;:9726,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:&quot;image/png&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:false,&quot;topImage&quot;:true,&quot;internalRedirect&quot;:&quot;https://singlesourceror.substack.com/i/197874703?img=https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ff6b5e0b4-6deb-4d84-9c4f-ff9fe5c2aab9_110x110.png&quot;,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!HHGO!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ff6b5e0b4-6deb-4d84-9c4f-ff9fe5c2aab9_110x110.png 424w, https://substackcdn.com/image/fetch/$s_!HHGO!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ff6b5e0b4-6deb-4d84-9c4f-ff9fe5c2aab9_110x110.png 848w, https://substackcdn.com/image/fetch/$s_!HHGO!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ff6b5e0b4-6deb-4d84-9c4f-ff9fe5c2aab9_110x110.png 1272w, https://substackcdn.com/image/fetch/$s_!HHGO!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ff6b5e0b4-6deb-4d84-9c4f-ff9fe5c2aab9_110x110.png 1456w" sizes="100vw" fetchpriority="high"></picture><div></div></div></a></figure></div><p><strong>The most important contribution of the 5 Moments of Need&#174; to adult learning is not simply instructional&#8212;it is contextual.<a href="#_ftn1">[1]</a></strong></p><p>By distinguishing between learning moments and doing moments, the framework made explicit something AI systems still struggle with today.</p><p>The same information is not equally useful in every situation.</p><ul><li><p>Someone learning for the first time needs explanation, structure, and progression</p></li><li><p>Someone applying knowledge needs speed, clarity, and confirmation</p></li><li><p>Someone troubleshooting needs diagnostic guidance, not background theory</p></li><li><p>Someone adapting to change needs contrast&#8212;what&#8217;s different, and why</p></li></ul><p>The 5 Moments of Need&#174; provides a human performance map that defines why information is needed in a given moment&#8212;not just <em>what</em> information exists.</p><p>For AI systems, this is critical. Without contextual signals, AI can retrieve content&#8212;but it cannot reliably deliver <em>the right content, in the right way, at the right time</em>.</p><h2>Structured authoring: Giving AI something it can actually work with</h2><p>If the 5 Moments of Need&#174; supplies context, structured authoring supplies precision.</p><p>AI systems do not reason well over:</p><ul><li><p>Long-form narrative documents</p></li><li><p>Implicit meaning</p></li><li><p>Mixed intent content</p></li><li><p>Inconsistent terminology</p></li></ul><p>They excel when content is:</p><ul><li><p>Modular and atomic</p></li><li><p>Explicitly typed by purpose</p></li><li><p>Predictably structured</p></li><li><p>Semantically labeled</p></li></ul><p>Structured authoring turns content into <strong>intelligent building blocks</strong>&#8212;microcontent that answers one question, supports one task, or explains one concept. This is the level at which both humans and machines perform best.</p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!MKha!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F89a8585d-d611-49b7-93d8-c616a7eda8c9_1695x624.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!MKha!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F89a8585d-d611-49b7-93d8-c616a7eda8c9_1695x624.png 424w, https://substackcdn.com/image/fetch/$s_!MKha!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F89a8585d-d611-49b7-93d8-c616a7eda8c9_1695x624.png 848w, https://substackcdn.com/image/fetch/$s_!MKha!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F89a8585d-d611-49b7-93d8-c616a7eda8c9_1695x624.png 1272w, https://substackcdn.com/image/fetch/$s_!MKha!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F89a8585d-d611-49b7-93d8-c616a7eda8c9_1695x624.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!MKha!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F89a8585d-d611-49b7-93d8-c616a7eda8c9_1695x624.png" width="658" height="242.23076923076923" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/89a8585d-d611-49b7-93d8-c616a7eda8c9_1695x624.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:536,&quot;width&quot;:1456,&quot;resizeWidth&quot;:658,&quot;bytes&quot;:60168,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:&quot;image/png&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:&quot;https://singlesourceror.substack.com/i/197874703?img=https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F89a8585d-d611-49b7-93d8-c616a7eda8c9_1695x624.png&quot;,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!MKha!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F89a8585d-d611-49b7-93d8-c616a7eda8c9_1695x624.png 424w, https://substackcdn.com/image/fetch/$s_!MKha!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F89a8585d-d611-49b7-93d8-c616a7eda8c9_1695x624.png 848w, https://substackcdn.com/image/fetch/$s_!MKha!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F89a8585d-d611-49b7-93d8-c616a7eda8c9_1695x624.png 1272w, https://substackcdn.com/image/fetch/$s_!MKha!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F89a8585d-d611-49b7-93d8-c616a7eda8c9_1695x624.png 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div><p>Instead of asking AI to &#8220;figure it out,&#8221; structured content <em>removes ambiguity before delivery ever happens</em>.</p><h2>Precision + Context: How AI delivers the right answer, not just an answer</h2><p>In an AI-enabled environment, structured content and the 5 Moments of Need&#174; play distinct but inseparable roles:</p><ul><li><p>Structured content ensures accuracy, clarity, and reuse, while</p></li><li><p>The 5 Moments of Need&#174; informs delivery logic and situational relevance.</p></li></ul><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!bVV9!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fe0558347-aeb8-4fc8-adbf-c2d0bfb18286_1370x804.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!bVV9!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fe0558347-aeb8-4fc8-adbf-c2d0bfb18286_1370x804.png 424w, https://substackcdn.com/image/fetch/$s_!bVV9!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fe0558347-aeb8-4fc8-adbf-c2d0bfb18286_1370x804.png 848w, https://substackcdn.com/image/fetch/$s_!bVV9!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fe0558347-aeb8-4fc8-adbf-c2d0bfb18286_1370x804.png 1272w, https://substackcdn.com/image/fetch/$s_!bVV9!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fe0558347-aeb8-4fc8-adbf-c2d0bfb18286_1370x804.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!bVV9!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fe0558347-aeb8-4fc8-adbf-c2d0bfb18286_1370x804.png" width="500" height="293.43065693430657" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/e0558347-aeb8-4fc8-adbf-c2d0bfb18286_1370x804.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:804,&quot;width&quot;:1370,&quot;resizeWidth&quot;:500,&quot;bytes&quot;:125658,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:&quot;image/png&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:&quot;https://singlesourceror.substack.com/i/197874703?img=https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fe0558347-aeb8-4fc8-adbf-c2d0bfb18286_1370x804.png&quot;,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!bVV9!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fe0558347-aeb8-4fc8-adbf-c2d0bfb18286_1370x804.png 424w, https://substackcdn.com/image/fetch/$s_!bVV9!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fe0558347-aeb8-4fc8-adbf-c2d0bfb18286_1370x804.png 848w, https://substackcdn.com/image/fetch/$s_!bVV9!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fe0558347-aeb8-4fc8-adbf-c2d0bfb18286_1370x804.png 1272w, https://substackcdn.com/image/fetch/$s_!bVV9!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fe0558347-aeb8-4fc8-adbf-c2d0bfb18286_1370x804.png 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div><p>Together, they allow AI systems to</p><ul><li><p>distinguish between learning support and performance support</p></li><li><p>adjust tone, depth, and format based on the moment</p></li><li><p>assemble just-enough guidance dynamically, and</p></li><li><p>deliver answers that feel intentional rather than generic.</p></li></ul><p>This is the difference between an AI system that retrieves information and one that supports performance.</p><p>Without structured content, AI lacks reliable input.</p><p>Without the 5 Moments of Need&#174;, AI lacks meaningful context.</p><h2>From content assets to AI-ready knowledge infrastructure</h2><p>When organizations adopt structured authoring aligned to the 5 Moments of Need&#174;, content stops behaving like static artifacts and starts functioning as knowledge infrastructure.</p><p>The same content components can:</p><ul><li><p>Power formal learning experiences<a href="#_ftn2">[2]</a></p></li><li><p>Support in-workflow performance guidance</p></li><li><p>Feed chatbots and conversational assistants</p></li><li><p>Adapt to new tools, processes, and policies</p></li><li><p>Scale across channels without rewriting</p></li></ul><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!x43l!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F3f8cb214-c432-4849-9df8-f42fa02e843c_381x256.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!x43l!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F3f8cb214-c432-4849-9df8-f42fa02e843c_381x256.png 424w, https://substackcdn.com/image/fetch/$s_!x43l!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F3f8cb214-c432-4849-9df8-f42fa02e843c_381x256.png 848w, https://substackcdn.com/image/fetch/$s_!x43l!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F3f8cb214-c432-4849-9df8-f42fa02e843c_381x256.png 1272w, https://substackcdn.com/image/fetch/$s_!x43l!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F3f8cb214-c432-4849-9df8-f42fa02e843c_381x256.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!x43l!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F3f8cb214-c432-4849-9df8-f42fa02e843c_381x256.png" width="447" height="300.34645669291336" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/3f8cb214-c432-4849-9df8-f42fa02e843c_381x256.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:256,&quot;width&quot;:381,&quot;resizeWidth&quot;:447,&quot;bytes&quot;:18576,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:&quot;image/png&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:&quot;https://singlesourceror.substack.com/i/197874703?img=https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F3f8cb214-c432-4849-9df8-f42fa02e843c_381x256.png&quot;,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!x43l!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F3f8cb214-c432-4849-9df8-f42fa02e843c_381x256.png 424w, https://substackcdn.com/image/fetch/$s_!x43l!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F3f8cb214-c432-4849-9df8-f42fa02e843c_381x256.png 848w, https://substackcdn.com/image/fetch/$s_!x43l!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F3f8cb214-c432-4849-9df8-f42fa02e843c_381x256.png 1272w, https://substackcdn.com/image/fetch/$s_!x43l!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F3f8cb214-c432-4849-9df8-f42fa02e843c_381x256.png 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div><p>This is especially important in AI environments, where content must be:</p><ul><li><p>Continuously updated</p></li><li><p>Instantly reusable</p></li><li><p>Trustworthy in every context</p></li></ul><p>Well-structured content also reduces risk. Changes propagate cleanly. Contradictions are minimized. Outdated guidance is easier to identify and remove&#8212;an essential capability when AI systems are amplifying delivery at scale.</p><h2>An AI-ready strategy that pays off today</h2><p>Perhaps the most compelling aspect of this approach is that it delivers value immediately.</p><p>Organizations that can align structured authoring with the 5 Moments of Need&#174; will</p><ul><li><p>improve findability and usability for today&#8217;s workforce</p></li><li><p>reduce dependence on formal training for performance issues</p></li><li><p>increase confidence and speed at the point of work, and</p></li><li><p>strengthen trust in content.</p></li></ul><p>At the same time, they are quietly building an AI-ready foundation&#8212;content that is already modular, contextualized, and semantically clear.</p><p>When AI capabilities mature or expand, these organizations are not scrambling to retrofit content. They are ready.</p><h2>Preparing for the future by designing for reality</h2><p>AI will not replace learning content. It will reshape how learning and performance support are delivered. Organizations that succeed will be those that recognize a simple principle:</p><p><em>Intelligence in delivery depends on discipline in content.</em></p><p>The 5 Moments of Need&#174; gives us a proven model for understanding human performance. Structured authoring gives us a proven method for creating content machines&#8212;and people&#8212;can trust.</p><p>Together, they form a strategy that improves work today and enables intelligent systems tomorrow.</p><p>That is not just content modernization.</p><p>It is intelligent, organizational AI-enablement.<a href="#_ftn3">[3]</a></p><div><hr></div><p><a href="#_ftnref1">[1]</a> For more information, visit https://www.5momentsofneed.com</p><p><a href="#_ftnref2">[2]</a> Microcontent and the 5 Moments of Need&#174;, Rob Hanna, ConVex Virtual Conference, September 2020</p><p><a href="#_ftnref3">[3]</a> Precision Content&#174; is a registered trademark of Precision Content Authoring Solutions, inc. The 5 Moments of Need&#174; is a registered trademark of APPLY Synergies.</p><div class="subscription-widget-wrap-editor" data-attrs="{&quot;url&quot;:&quot;https://www.trustinyourcontent.com/subscribe?&quot;,&quot;text&quot;:&quot;Subscribe&quot;,&quot;language&quot;:&quot;en&quot;}" data-component-name="SubscribeWidgetToDOM"><div class="subscription-widget show-subscribe"><div class="preamble"><p class="cta-caption">Thanks for reading! Subscribe for free to receive new posts and support my work.</p></div><form class="subscription-widget-subscribe"><input type="email" class="email-input" name="email" placeholder="Type your email&#8230;" tabindex="-1"><input type="submit" class="button primary" value="Subscribe"><div class="fake-input-wrapper"><div class="fake-input"></div><div class="fake-button"></div></div></form></div></div>]]></content:encoded></item><item><title><![CDATA[Lean RAG Architecture]]></title><description><![CDATA[Optimizing Performance and Reducing Cost Through Standardized Structured Content]]></description><link>https://www.trustinyourcontent.com/p/lean-rag-architecture</link><guid isPermaLink="false">https://www.trustinyourcontent.com/p/lean-rag-architecture</guid><dc:creator><![CDATA[Rob Hanna, Precision Content]]></dc:creator><pubDate>Sat, 02 May 2026 13:34:39 GMT</pubDate><enclosure url="https://substackcdn.com/image/fetch/$s_!lfhd!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fa9dbbd27-2083-4186-bf80-1f07469558c4_1448x1086.png" length="0" type="image/jpeg"/><content:encoded><![CDATA[<p>Retrieval-augmented generation (RAG) has become the preferred pattern for giving generative AI systems access to fast-changing enterprise knowledge. Instead of retraining a large language model every time a product, policy, feature, or procedure changes, organizations can retrieve current source content at answer time and ask the model to synthesize a response from that evidence. In principle, this gives organizations a practical path to conversational support, intelligent documentation, and just-in-time product assistance.</p><p>In practice, many RAG pilots succeed for the wrong reason. A small repository, one product version, and a narrow set of documents can make hallucination rates appear low. As the repository grows to include multiple product versions, regional variants, archived manuals, overlapping procedures, duplicate topics, and inconsistent terminology, the same system becomes harder to trust. The problem is not simply model quality. It is content supply-chain quality.</p><div class="subscription-widget-wrap-editor" data-attrs="{&quot;url&quot;:&quot;https://www.trustinyourcontent.com/subscribe?&quot;,&quot;text&quot;:&quot;Subscribe&quot;,&quot;language&quot;:&quot;en&quot;}" data-component-name="SubscribeWidgetToDOM"><div class="subscription-widget show-subscribe"><div class="preamble"><p class="cta-caption">Thanks for reading! Subscribe for free to receive new posts and support my work.</p></div><form class="subscription-widget-subscribe"><input type="email" class="email-input" name="email" placeholder="Type your email&#8230;" tabindex="-1"><input type="submit" class="button primary" value="Subscribe"><div class="fake-input-wrapper"><div class="fake-input"></div><div class="fake-button"></div></div></form></div></div><p>Lean RAG is the discipline of reducing unnecessary content volume, semantic duplication, and retrieval ambiguity before content reaches the vector store, search index, graph, or prompt. It combines structured source content, governed metadata, semantic chunking, reuse-aware publishing, provenance, and lifecycle control. For technical documentation teams, the most practical path to Lean RAG is often already present in their existing investments: Rich XML for structured source content, a component content management system for governance and lifecycle management, globally-standardized metadata for exchange and discovery, and a publishing pipeline that transforms authoritative source into AI-ready retrieval units.</p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!lfhd!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fa9dbbd27-2083-4186-bf80-1f07469558c4_1448x1086.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!lfhd!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fa9dbbd27-2083-4186-bf80-1f07469558c4_1448x1086.png 424w, https://substackcdn.com/image/fetch/$s_!lfhd!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fa9dbbd27-2083-4186-bf80-1f07469558c4_1448x1086.png 848w, https://substackcdn.com/image/fetch/$s_!lfhd!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fa9dbbd27-2083-4186-bf80-1f07469558c4_1448x1086.png 1272w, https://substackcdn.com/image/fetch/$s_!lfhd!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fa9dbbd27-2083-4186-bf80-1f07469558c4_1448x1086.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!lfhd!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fa9dbbd27-2083-4186-bf80-1f07469558c4_1448x1086.png" width="1448" height="1086" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/a9dbbd27-2083-4186-bf80-1f07469558c4_1448x1086.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:1086,&quot;width&quot;:1448,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:1264561,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:&quot;image/png&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:false,&quot;topImage&quot;:true,&quot;internalRedirect&quot;:&quot;https://singlesourceror.substack.com/i/196164497?img=https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fa9dbbd27-2083-4186-bf80-1f07469558c4_1448x1086.png&quot;,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!lfhd!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fa9dbbd27-2083-4186-bf80-1f07469558c4_1448x1086.png 424w, https://substackcdn.com/image/fetch/$s_!lfhd!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fa9dbbd27-2083-4186-bf80-1f07469558c4_1448x1086.png 848w, https://substackcdn.com/image/fetch/$s_!lfhd!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fa9dbbd27-2083-4186-bf80-1f07469558c4_1448x1086.png 1272w, https://substackcdn.com/image/fetch/$s_!lfhd!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fa9dbbd27-2083-4186-bf80-1f07469558c4_1448x1086.png 1456w" sizes="100vw" fetchpriority="high"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div><p>The goal is not to put less useful information into the system. The goal is to put fewer ambiguous copies of consistently chunked information into the system, with enough context, metadata, and lineage for the retrieval layer to select the right evidence at the right time.</p><h2>Why RAG became the default pattern</h2><p>RAG emerged as a response to a basic limitation of large language models: even when models contain broad factual knowledge in their parameters, they do not reliably know the current, proprietary, or highly specific facts that enterprise applications require. The original RAG research framed retrieval as a way to combine parametric memory, the model&#8217;s learned knowledge, with non-parametric memory, an external index of source documents. This matters especially for knowledge-intensive tasks, where source access, specificity, and provenance are central to trust (Lewis et al., 2020).</p><p>For product and service organizations, the appeal is obvious. Product documentation changes constantly. Software features are released, procedures are corrected, warnings are updated, parts are replaced, supported configurations shift, and regulatory language evolves. Training or fine-tuning a model every time these facts change is not a viable operating model. A RAG architecture lets the enterprise keep the model stable while updating the knowledge source underneath it.</p><p>This is why RAG is surfacing as the default strategy for chatbots, voice assistants, support copilots, field-service tools, and intelligent delivery portals. It promises current answers without repeated model retraining. It also promises traceability: an answer can be grounded in retrieved source material rather than generated solely from model memory.</p><p>But RAG does not solve the content problem by itself. Retrieval only works as well as the evidence available to retrieve.</p><h2>The hidden failure mode: RAG looks best before it scales</h2><p>Many RAG projects begin with a small proof of concept. A team loads one manual, one policy set, or one product documentation collection into a vector database. The early results look impressive. The assistant answers recognizable questions. It quotes relevant passages. It appears to avoid hallucinations. Stakeholders conclude that the system is nearly production-ready.</p><p>This early success can be misleading.</p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!CJiw!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ff52e2902-fa29-4ca8-a3c4-9875b0a64f2e_1448x1086.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!CJiw!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ff52e2902-fa29-4ca8-a3c4-9875b0a64f2e_1448x1086.png 424w, https://substackcdn.com/image/fetch/$s_!CJiw!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ff52e2902-fa29-4ca8-a3c4-9875b0a64f2e_1448x1086.png 848w, https://substackcdn.com/image/fetch/$s_!CJiw!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ff52e2902-fa29-4ca8-a3c4-9875b0a64f2e_1448x1086.png 1272w, https://substackcdn.com/image/fetch/$s_!CJiw!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ff52e2902-fa29-4ca8-a3c4-9875b0a64f2e_1448x1086.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!CJiw!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ff52e2902-fa29-4ca8-a3c4-9875b0a64f2e_1448x1086.png" width="1448" height="1086" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/f52e2902-fa29-4ca8-a3c4-9875b0a64f2e_1448x1086.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:1086,&quot;width&quot;:1448,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:1282457,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:&quot;image/png&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:&quot;https://singlesourceror.substack.com/i/196164497?img=https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ff52e2902-fa29-4ca8-a3c4-9875b0a64f2e_1448x1086.png&quot;,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!CJiw!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ff52e2902-fa29-4ca8-a3c4-9875b0a64f2e_1448x1086.png 424w, https://substackcdn.com/image/fetch/$s_!CJiw!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ff52e2902-fa29-4ca8-a3c4-9875b0a64f2e_1448x1086.png 848w, https://substackcdn.com/image/fetch/$s_!CJiw!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ff52e2902-fa29-4ca8-a3c4-9875b0a64f2e_1448x1086.png 1272w, https://substackcdn.com/image/fetch/$s_!CJiw!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ff52e2902-fa29-4ca8-a3c4-9875b0a64f2e_1448x1086.png 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div><p>With only one version of a documentation suite in the repository, the retrieval problem is artificially simple. There are fewer near-duplicates, fewer conflicting procedures, fewer obsolete variants, and fewer semantically similar passages competing for selection. As more versions and variations accumulate, the system must distinguish between the current procedure and an outdated one, the domestic version and the European version, the standard product and the premium model, the installation guide and the maintenance guide, the warning that applies to one configuration and the warning that applies to another.</p><p>Long context windows do not eliminate this problem. Research on long-context language models shows that models may perform worse when the relevant evidence is buried in the middle of a long input, even when the input technically fits within the context window (Liu et al., 2024). In other words, simply giving the model more content is not the same as giving it the right content.</p><p>This is the central argument for Lean RAG: a production RAG system must control ambiguity before retrieval and prompting, not merely compensate for ambiguity afterward.</p><h2>Lean RAG defined</h2><blockquote><p><strong>Lean RAG applies lean process principles to the content supply chain: it removes semantic waste, reduces retrieval variation, preserves context, and builds quality into AI-ready content before the model is asked to generate an answer.</strong></p></blockquote><p>A Lean RAG system is not thin. It is disciplined. It contains the right content at the right granularity, with the right metadata, linked to the right source, governed by the right lifecycle controls.</p><p>A lean repository has six characteristics:</p><ol><li><p><strong>Authoritative source:</strong> retrieved units originate from approved source content, not arbitrary file dumps.</p></li><li><p><strong>Semantic boundaries:</strong> meaningfully labelled chunks defining expressions of intent such as tasks, warnings, error codes, prerequisites, and reference facts.</p></li><li><p><strong>Context preservation:</strong> small units retain enough surrounding context to remain intelligible outside the page where they were authored.</p></li><li><p><strong>Metadata control:</strong> retrieval can filter or rank by product, version, audience, lifecycle phase, component, information type, and safety relevance.</p></li><li><p><strong>Lineage and provenance:</strong> every answerable unit can be traced back to its source, version, publication, and approval state.</p></li><li><p><strong>Incremental update:</strong> changed content can be reprocessed without duplicating every unchanged unit across every publication or product release.</p></li></ol><p>Lean RAG is therefore not only an AI architecture. It is a content operations model.</p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!Bv0w!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F2a4142a8-0b73-4456-8fb9-926f9989eb71_822x498.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!Bv0w!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F2a4142a8-0b73-4456-8fb9-926f9989eb71_822x498.png 424w, https://substackcdn.com/image/fetch/$s_!Bv0w!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F2a4142a8-0b73-4456-8fb9-926f9989eb71_822x498.png 848w, https://substackcdn.com/image/fetch/$s_!Bv0w!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F2a4142a8-0b73-4456-8fb9-926f9989eb71_822x498.png 1272w, https://substackcdn.com/image/fetch/$s_!Bv0w!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F2a4142a8-0b73-4456-8fb9-926f9989eb71_822x498.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!Bv0w!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F2a4142a8-0b73-4456-8fb9-926f9989eb71_822x498.png" width="822" height="498" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/2a4142a8-0b73-4456-8fb9-926f9989eb71_822x498.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:498,&quot;width&quot;:822,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:67364,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:&quot;image/png&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:&quot;https://singlesourceror.substack.com/i/196164497?img=https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F2a4142a8-0b73-4456-8fb9-926f9989eb71_822x498.png&quot;,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!Bv0w!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F2a4142a8-0b73-4456-8fb9-926f9989eb71_822x498.png 424w, https://substackcdn.com/image/fetch/$s_!Bv0w!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F2a4142a8-0b73-4456-8fb9-926f9989eb71_822x498.png 848w, https://substackcdn.com/image/fetch/$s_!Bv0w!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F2a4142a8-0b73-4456-8fb9-926f9989eb71_822x498.png 1272w, https://substackcdn.com/image/fetch/$s_!Bv0w!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F2a4142a8-0b73-4456-8fb9-926f9989eb71_822x498.png 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div><h2>Start with the source content</h2><p>The two most important factors in RAG quality are the source content and the system used to manage that content. Models matter. Embeddings matter. Vector databases, re-rankers, graph retrieval, and prompt patterns all matter. But none of them can reliably compensate for content that is duplicated, inconsistent, stale, context-poor, or unmanaged.</p><p>For best results, enterprise content needs to be lean and structured before it enters the RAG pipeline. It should be written in focused units, governed by standards, enriched with contextual metadata, and maintained as reusable components rather than uncontrolled files. Word documents, PDFs, and unmanaged Markdown pages can be useful delivery formats, but they are weak source formats for large-scale RAG governance. They often hide structure, flatten metadata, obscure reuse, and make it difficult to maintain referential integrity across publications and variants.</p><h2>The CCMS as the control plane for RAG</h2><p>A component content management system (CCMS) is not merely a repository for documentation files. In a Lean RAG architecture, the CCMS becomes the control plane for the content supply chain.</p><p>A CCMS can manage reusable topics and fragments, workflow states, approvals, version history, product applicability, publication maps, conditional processing, metadata inheritance, and relationships among content units. These are not peripheral features. They are the exact controls that a RAG system needs in order to avoid retrieving the wrong information.</p><p>Systems that treat documents as binary objects cannot easily see inside the content to manage metadata, reuse, lineage, or semantic relationships. A code repository or document management system may provide version control at the file level, but file-level versioning is not the same as content-unit governance. Code repositories lack the ability to preserve referential integrity of link and dependencies. A RAG system needs to know whether a procedure is approved, which product variant it applies to, whether it replaced an older procedure, which warnings must travel with it, and which publication contexts contain it.</p><p>The CCMS houses the single source of truth for product and service information. The RAG repository should be a governed publication target, not a shadow content store. RAG output becomes another delivery target along with any number of other channels.</p><h2>Why DITA is a strong source model for Lean RAG</h2><p>DITA was created for topic-based, information-typed, reusable technical content. Its core building blocks are topics and maps, and its architecture supports metadata, content reuse, conditional processing, and structured relationships. The OASIS specification describes topics and maps as the basic building blocks of DITA, with metadata available on maps, topics, and elements to support reuse and conditional publishing (OASIS DITA Technical Committee, 2010; OASIS DITA 1.3, 2015/2018).</p><p>This makes DITA unusually well aligned with the needs of RAG, even though DITA predates modern generative AI. DITA&#8217;s value comes from several properties that map directly to AI retrieval needs:</p><ul><li><p><strong>Topics define meaningful boundaries.</strong> A topic is already a candidate retrieval unit because it is designed to be coherent and reusable.</p></li><li><p><strong>Information typing expresses intent.</strong> Task, concept, reference, troubleshooting, and other specialized structures help distinguish what a unit is meant to do.</p></li><li><p><strong>Maps preserve publication context.</strong> A topic can be reused in multiple outputs while the map supplies structure, sequence, and product scope.</p></li><li><p><strong>Metadata supports filtering and applicability.</strong> Audience, product, platform, lifecycle phase, and other attributes can guide retrieval.</p></li><li><p><strong>Reuse reduces duplication.</strong> A shared topic can remain one authoritative object even when it appears in many publications.</p></li><li><p><strong>Specialization supports enterprise semantics.</strong> Organizations can define structures and constraints that reflect their products and domains.</p></li></ul><p>This is why content that was originally designed for human cognition can become valuable for machine cognition. DITA&#8217;s topic boundaries, information types, and metadata are not just publishing conveniences. They are retrieval signals.</p><h2>XML ingestion challenges</h2><p>DITA XML is an excellent source format, but it is not usually the best format to hand directly to a language model as prompt evidence. DITA was designed for structured authoring, reuse, specialization, and multichannel publishing. A RAG pipeline needs something different: readable, context-rich, deduplicated retrieval units that can be ranked, filtered, cited, and assembled into reliable answers.</p><p>Content reuse is one of DITA&#8217;s greatest strengths, but it creates ingestion challenges. Topics, fragments, and phrases may appear in many outputs through conrefs, keyrefs, maps, and conditional processing. If every rendered instance is indexed as independent evidence, the retrieval system may treat repeated text as more important than it really is. The problem is not reuse itself; the problem is unmanaged reuse in the retrieval layer.</p><h2>Preparing DITA XML for Lean RAG</h2><p>A Lean RAG pipeline should rationalize reuse before indexing. Topic-level and chunk-level reuse should be deduplicated in the repository, while metadata records each location, product, variant, or deliverable where the chunk is reused. This preserves the value of reuse without allowing repetition to distort retrieval ranking or answer generation.</p><p>Phrase-level reuse requires a different approach. Product names, feature names, variant labels, and other key-based terms may resolve differently across maps or versions. These resolved values should be preserved in the rendered Markdown and added to the search thesaurus or lexical index as aliases, deprecated terms, or variant-specific terms. This helps users retrieve the right chunk even when product terminology changes.</p><p>Much of DITA&#8217;s semantic markup should be stripped away. Relevant markup should be transformed into explicit retrieval metadata where it improves filtering, ranking, or answer assembly. Topic type, task structure, product applicability, lifecycle phase, audience, component, variant, and safety information can all become useful metadata fields. The readable content itself should be rendered as clean Markdown, preserving headings, lists, procedures, tables, and warnings where formatting carries meaning.</p><h2>Transform source DITA into AI-ready retrieval units</h2><p>The goal is to transform DITA into AI-ready evidence: deduplicated where reuse could mislead retrieval, enriched where semantic context improves ranking, rendered where structure helps the model read, and preserved through metadata where DITA&#8217;s original semantics matter.</p><p>A Lean RAG pipeline should treat RAG delivery as a publishing output. Just as the same DITA source can be transformed into HTML, PDF, help systems, or training materials, it can also be transformed into AI-ready retrieval packages. There is no need to connect to content repositories via APIs when it can be optimized by the publishing system.</p><p>The DITA Open Toolkit already provides an extensible transformation architecture for converting DITA maps and topics into deliverable formats. In a Lean RAG model, a DITA-OT plug-in can generate semantically chunked, metadata-enriched output for search indexes, vector stores, or graph databases.</p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!ooXJ!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F01a3c604-c4e1-4c53-b18a-1a94d67d0998_1448x1086.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!ooXJ!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F01a3c604-c4e1-4c53-b18a-1a94d67d0998_1448x1086.png 424w, https://substackcdn.com/image/fetch/$s_!ooXJ!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F01a3c604-c4e1-4c53-b18a-1a94d67d0998_1448x1086.png 848w, https://substackcdn.com/image/fetch/$s_!ooXJ!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F01a3c604-c4e1-4c53-b18a-1a94d67d0998_1448x1086.png 1272w, https://substackcdn.com/image/fetch/$s_!ooXJ!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F01a3c604-c4e1-4c53-b18a-1a94d67d0998_1448x1086.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!ooXJ!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F01a3c604-c4e1-4c53-b18a-1a94d67d0998_1448x1086.png" width="1448" height="1086" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/01a3c604-c4e1-4c53-b18a-1a94d67d0998_1448x1086.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:1086,&quot;width&quot;:1448,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:1363831,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:&quot;image/png&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:&quot;https://singlesourceror.substack.com/i/196164497?img=https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F01a3c604-c4e1-4c53-b18a-1a94d67d0998_1448x1086.png&quot;,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!ooXJ!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F01a3c604-c4e1-4c53-b18a-1a94d67d0998_1448x1086.png 424w, https://substackcdn.com/image/fetch/$s_!ooXJ!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F01a3c604-c4e1-4c53-b18a-1a94d67d0998_1448x1086.png 848w, https://substackcdn.com/image/fetch/$s_!ooXJ!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F01a3c604-c4e1-4c53-b18a-1a94d67d0998_1448x1086.png 1272w, https://substackcdn.com/image/fetch/$s_!ooXJ!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F01a3c604-c4e1-4c53-b18a-1a94d67d0998_1448x1086.png 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div><p>The transformation should not simply strip tags and split text by token count. It should use the source structure to determine the retrieval unit:</p><ul><li><p>A <strong>task topic</strong> may become one topic-level retrieval unit, with steps preserved in sequence.</p></li><li><p>A <strong>warning</strong> may become a required child fragment that must travel with the relevant task.</p></li><li><p>An <strong>error-code table</strong> may become row-level reference units, each linked back to the parent table and product context.</p></li><li><p>A <strong>prerequisite</strong> may become a reusable fragment attached to multiple procedures.</p></li><li><p>A <strong>troubleshooting topic</strong> may become a set of symptom/cause/remedy units, while retaining the parent topic context.</p></li></ul><p>The result should be a canonical chunk object with a stable identifier, source lineage, lifecycle state, applicability metadata, relationships, and a model-facing body. JSON is usually a strong internal representation for this object because it supports validation, schema control, stable IDs, and metadata fields. Markdown is usually better for the human-readable body that the model receives, because headings, lists, steps, and tables remain easy for both humans and models to interpret. Current prompt engineering guidance from OpenAI also supports Markdown for hierarchy and XML-style tags or attributes for clear boundaries and metadata in prompts (OpenAI, 2026).</p><p>The practical pattern is therefore hybrid:</p><ul><li><p>DITA XML as governed source</p></li><li><p>JSON as the canonical chunk and metadata record</p></li><li><p>Markdown as the model-facing content body</p></li><li><p>RDF or JSON-LD when standards-based semantic exchange is required</p></li></ul><h2>Semantic chunking beats blind chunking</h2><p>Chunking is one of the most consequential design decisions in any RAG system. Chunks that are too large may contain irrelevant information and increase cost. Chunks that are too small may lose context and become misleading. Microsoft&#8217;s RAG guidance emphasizes that effective chunking and search strategies are necessary to optimize relevance, reduce false positives and false negatives, and avoid poor outcomes from chunks that lack sufficient context (Microsoft Azure Architecture Center, 2025).</p><p>Blind chunking divides content by character count, token count, paragraph count, page breaks, or generic document layout. Semantic chunking divides content by meaning and function. For structured technical content, semantic chunking can use the authoring model itself: topic type, element role, heading hierarchy, step sequence, warning placement, table structure, relationship metadata, and map context.</p><p>Internal semantic chunking tests at Precision Content point in the same direction. In a comparison of four chunking approaches, semantically enriched subtopic chunks were directionally preferred over non-semantic markup-based chunks, topic-level chunks, and non-semantic PDF-based chunking. The study supports the practical hypothesis that semantic enrichment is a promising path for improving RAG for technical documentation.</p><p>The key lesson is not that the smallest chunk always wins. It is that chunks need semantic integrity. A procedure step without the procedure title, actor, prerequisites, result, or warning may be too thin. A whole chapter may be too broad. A Lean RAG pipeline should choose the smallest unit that remains safe, coherent, and answerable.</p><h2>Content standards matter</h2><p>How the content is written has a material impact on the quality of the response. To create content that can apply relevant chunking consistently at scale, standards need to be in place that govern how information is typed to align with intent or the function of information. Precision Content&#8217;s structured authoring methods are constructed to provide guidance on writing for intent to produce viable microcontent for better human cognition and machine processing.</p><p>Precision Content defines microcontent as concise blocks of information written, structured, and labeled based on the intended user response for the information. Microcontent must have</p><ul><li><p><strong>Focus</strong> &#8211; based on a single idea or concept</p></li><li><p><strong>Function</strong> &#8211; based on user intent according to defined information types</p></li><li><p><strong>Structure</strong> &#8211; use repeatable patterns and language according to intent, and</p></li><li><p><strong>Context</strong> &#8211; metadata that can draw microcontent back together.</p></li></ul><p>These principles tie into Lean RAG to eliminate content overlap, redundancy, and consistency for chunking.</p><h2>Context is not optional</h2><p>Context loss is one of the most common reasons RAG systems fail. A retrieved fragment may be textually relevant but semantically incomplete. For example, the sentence &#8220;Remove the cover&#8221; is not useful unless the system knows which product, which procedure, which cover, which conditions, and which safety precautions apply.</p><p>Anthropic&#8217;s Contextual Retrieval work addresses this issue by prepending a short chunk-specific context summary before embedding and indexing. Anthropic reports that contextual embeddings and contextual BM25 reduced failed retrievals substantially in their evaluation, and that additional reranking improved results further (Anthropic, 2024). The principle is directly applicable to structured technical content: a chunk should carry concise inherited context from the source topic, map, publication, and metadata.</p><p>In DITA terms, this means that a retrieval unit may need context from multiple layers including</p><ul><li><p>element or block</p></li><li><p>topic metadata</p></li><li><p>map metadata</p></li><li><p>the publication or product context</p></li><li><p>system-level metadata</p></li><li><p>conditional processing values</p></li><li><p>related warnings and prerequisites, and</p></li><li><p>version and approval metadata.</p></li></ul><p>This inherited context should not necessarily be dumped into the prompt as a large metadata blob. Too much metadata can become noise. Instead, the pipeline should distinguish between metadata used for filtering and metadata rendered for the model. Filterable metadata can stay in the search index. Prompt-visible metadata should be concise: product, variant, component, information type, audience or role, lifecycle phase, safety relevance, and source date or version when needed.</p><h2>iiRDS and the role of standardized metadata</h2><p>DITA provides structure at the source level. iiRDS (Intelligent Information Retrieval and Delivery Standard) provides a standardized metadata vocabulary and exchange model for intelligent information delivery. iiRDS specifies a package format and metadata vocabulary for technical documentation, with RDF as a core expression of metadata and, as of version 1.3, optional JSON-LD serialization (iiRDS Consortium, 2025).</p><p>For Lean RAG, iiRDS is valuable because it describes the kinds of contextual distinctions that retrieval systems need to make: document, topic, fragment, product, component, event, lifecycle phase, information type, role or qualification, and relationships among information units. These distinctions can reduce retrieval ambiguity when a repository contains many similar units.</p><p>However, metadata should be purposeful. The iiRDS guidance emphasizes that annotation requires significant concept knowledge and should be driven by use cases. A Lean RAG implementation should not &#8220;tag everything&#8221; simply because fields exist. It should prioritize metadata that changes retrieval behavior:</p><ul><li><p>product and product variant</p></li><li><p>component or feature</p></li><li><p>lifecycle phase</p></li><li><p>information type</p></li><li><p>audience, role, or qualification</p></li><li><p>geography or regulatory scope</p></li><li><p>source publication</p></li><li><p>effective date and version</p></li><li><p>safety or compliance relevance, and</p></li><li><p>relationships such as prerequisite, replacement, version-of, and related procedure.</p></li></ul><p>This metadata can support hybrid retrieval, where vector search handles semantic similarity, keyword search handles exact terms such as part numbers and error codes, and filters enforce applicability.</p><h2>Provenance and trust</h2><p>A generated answer is only as trustworthy as the organization&#8217;s ability to show where it came from. Provenance is therefore not an afterthought. It is a core requirement for production RAG.</p><p>The W3C PROV model defines provenance as information about entities, activities, and agents involved in producing a piece of data or thing, which can be used to assess quality, reliability, or trustworthiness (W3C, 2013). Applied to Lean RAG, provenance means every retrieval unit should be traceable to its source topic, map, publication, version, transformation event, approval state, and publication date.</p><p>A strong RAG chunk should answer questions such as:</p><ul><li><p>What source object produced this chunk?</p></li><li><p>Which publication or product release included it?</p></li><li><p>Which version replaced it or was replaced by it?</p></li><li><p>Was it approved, draft, deprecated, or superseded?</p></li><li><p>Which transformation generated it?</p></li><li><p>Which metadata was inherited, and from where?</p></li><li><p>Which warnings, prerequisites, or related fragments must accompany it?</p></li></ul><p>This lineage is essential for audit, troubleshooting, and user trust. It also supports incremental updates: when a source topic changes, the pipeline can reprocess only affected retrieval units and update their metadata without duplicating all unchanged content.</p><h2>Why Lean RAG reduces cost and saves energy</h2><p>RAG cost is not only the cost of model tokens. It includes indexing, storage, embedding, reranking, prompt size, evaluation, governance, support escalation, and ongoing maintenance. A bloated repository increases these costs in several ways.</p><p>Duplicate and near-duplicate content increases embedding volume and retrieval competition. Overly large chunks increase prompt tokens and may include irrelevant evidence. Overly small chunks increase retrieval calls and require more context reconstruction. Poor metadata increases false positives, which then require reranking, longer prompts, or model reasoning to resolve. Stale content increases risk, which increases the need for human review and escalation.</p><p>Lean RAG reduces these costs by keeping one authoritative copy of reusable content, carrying applicability through metadata, and publishing only the retrieval units needed by the target system. The same chunk of content can be valid across multiple publications and product releases without being physically duplicated in the RAG repository. The repository can store one approved unit with metadata that links it to the applicable products, versions, publications, and contexts.</p><p>This is the same logic that made structured content and reuse valuable long before generative AI. RAG simply makes the cost of poor content operations more visible.</p><h2>Governance: The missing layer in many RAG projects</h2><p>AI governance is often discussed in terms of model behavior, privacy, bias, security, and human oversight. Those concerns are real, but content governance deserves equal attention in enterprise RAG. The NIST AI Risk Management Framework organizes AI risk work around governance, mapping, measurement, and management. Lean RAG applies those same principles to the knowledge supply chain that feeds the model (NIST, 2023).</p><p>A governed Lean RAG program should include:</p><ul><li><p><strong>Content standards</strong> for topic types, microcontent blocks, terminology, warnings, titles, and metadata.</p></li><li><p><strong>Source governance</strong> through workflows, approvals, review states, and release controls.</p></li><li><p><strong>Retrieval-unit standards</strong> defining when to publish topics, fragments, table rows, examples, troubleshooting units, and safety notes.</p></li><li><p><strong>Metadata governance</strong> defining required fields, inherited fields, controlled vocabularies, and validation rules.</p></li><li><p><strong>Evaluation practices</strong> using test questions, reference answers, pairwise judgments, regression tests, and retrieval diagnostics.</p></li><li><p><strong>Lifecycle rules</strong> for replacing, deprecating, archiving, and removing retrieval units.</p></li><li><p><strong>Operational ownership</strong> across documentation, product, support, engineering, compliance, localization, and AI teams.</p></li></ul><p>A RAG system is not production-ready just because it can answer a test question. It is production-ready when the organization can keep the source content accurate, the retrieval layer current, the metadata trustworthy, and the answer behavior measurable over time.</p><h2>A practical Lean RAG pipeline</h2><p>A practical Lean RAG pipeline for technical documentation can be implemented in seven stages.</p><ol><li><p>Authors create and maintain structured source content in DITA or another semantically rich authoring model. They write using clear information types and focused blocks that correspond to user intent.</p></li><li><p>The CCMS manages reuse, lifecycle state, product applicability, variant conditions, relationships, and publication context.</p></li><li><p>The publishing pipeline resolves keys, conditions, reusable components, and map-level context so the output reflects the correct publication and product scope.</p></li><li><p>A semantic chunking Open Toolkit transformation generates retrieval units at the appropriate granularity: topics where the topic is the best answer unit, fragments where reusable microcontent is safe and useful, and row-level units where tables contain independent facts.</p></li><li><p>The pipeline enriches each unit with inherited metadata, relationship links, provenance, and a concise context summary.</p></li><li><p>The retrieval layer indexes the units for hybrid retrieval, using vector embeddings, keyword search, metadata filters, and, where useful, graph relationships.</p></li><li><p>The prompt renderer gives the model compact evidence in Markdown, with clear boundaries and only the metadata needed to answer accurately.</p></li></ol><p>This pipeline allows DITA XML to remain the governed source while each downstream AI system receives the representation it needs.</p><h2>Conclusion: The future of RAG is a content supply chain issue</h2><p>RAG is often presented as a model architecture. For enterprise technical content, it is better understood as a content supply-chain problem.</p><p>A RAG system retrieves what the organization gives it. If the repository contains uncontrolled documents, duplicate versions, conflicting variants, stale procedures, weak metadata, and context-poor chunks, the model will inherit those weaknesses. If the repository contains structured, approved, semantically coherent, metadata-rich, provenance-aware units, the model has a far better chance of producing accurate, current, and trustworthy answers.</p><p>Lean RAG is the bridge between technical communication discipline and generative AI performance. It does not abandon structured authoring principles. It extends them into AI delivery. DITA supplies structure. The CCMS supplies lifecycle control. iiRDS-style metadata supplies machine-readable context. Semantic intent-based chunking supplies answerable units. Provenance supplies trust. Governance supplies durability.</p><p>The organizations that succeed with RAG will not be the ones that simply load the most documents into the largest vector database. They will be the ones that treat AI delivery as another output of a governed, lean, intelligent content supply chain.</p><div><hr></div><p><strong>References</strong></p><p>Anthropic. (2024). <em>Introducing Contextual Retrieval.</em> Anthropic Engineering.</p><p>DITA Open Toolkit. (2026). <em>DITA Open Toolkit architecture documentation.</em></p><p>iiRDS Consortium. (2025). <em>iiRDS Version 1.3 materials and Guide for the Standardized Use of iiRDS.</em></p><p>Lewis, P., Perez, E., Piktus, A., Petroni, F., Karpukhin, V., Goyal, N., K&#252;ttler, H., Lewis, M., Yih, W., Rockt&#228;schel, T., Riedel, S., &amp; Kiela, D. (2020). <em>Retrieval-Augmented Generation for Knowledge-Intensive NLP Tasks.</em> Advances in Neural Information Processing Systems.</p><p>Liu, N. F., Lin, K., Hewitt, J., Paranjape, A., Bevilacqua, M., Petroni, F., &amp; Liang, P. (2024). <em>Lost in the Middle: How Language Models Use Long Contexts.</em> Transactions of the Association for Computational Linguistics.</p><p>Microsoft Azure Architecture Center. (2025). <em>Develop a RAG solution: Chunking phase.</em> Microsoft Learn.</p><p>National Institute of Standards and Technology. (2023). <em>Artificial Intelligence Risk Management Framework (AI RMF 1.0).</em></p><p>OASIS DITA Technical Committee. (2010). <em>Darwin Information Typing Architecture (DITA) Version 1.2: Architectural Specification.</em> OASIS.</p><p>OASIS DITA Technical Committee. (2015/2018). <em>Darwin Information Typing Architecture (DITA) Version 1.3.</em> OASIS.</p><p>OpenAI. (2026). <em>Prompt engineering, retrieval, and structured output guidance.</em> OpenAI Developers.</p><p>World Wide Web Consortium. (2013). <em>PROV-DM: The PROV Data Model.</em> W3C Recommendation.</p><div class="subscription-widget-wrap-editor" data-attrs="{&quot;url&quot;:&quot;https://www.trustinyourcontent.com/subscribe?&quot;,&quot;text&quot;:&quot;Subscribe&quot;,&quot;language&quot;:&quot;en&quot;}" data-component-name="SubscribeWidgetToDOM"><div class="subscription-widget show-subscribe"><div class="preamble"><p class="cta-caption">Thanks for reading! Subscribe for free to receive new posts and support my work.</p></div><form class="subscription-widget-subscribe"><input type="email" class="email-input" name="email" placeholder="Type your email&#8230;" tabindex="-1"><input type="submit" class="button primary" value="Subscribe"><div class="fake-input-wrapper"><div class="fake-input"></div><div class="fake-button"></div></div></form></div></div>]]></content:encoded></item></channel></rss>