6:["$","$La",null,{"initialPosts":[{"slug":"2026-04-30-agentic-rag-public-status","title":"Agentic RAG Public Engineering Status","date":"2026-04-30","excerpt":"A public-safe engineering status note for Cortagent Agentic RAG: routing, retrieval, evidence selection, memory, caching, safety, and evaluation remain the active surface.","content":"\nThis is a public-safe status note for Cortagent Agentic RAG. The active engineering surface remains focused on retrieval as part of the reasoning loop.\n\n## Active surface\n\n- Query structure parsing\n- Retrieval routing\n- Hybrid retrieval\n- Evidence selection\n- Conversation context\n- Memory boundaries\n- Cache correctness\n- Safety validators\n- Circuit breaker behavior\n- Reflex policies\n- Evaluation harnesses\n\n## Current rule\n\nWe are not treating top-k retrieval as sufficient. The system must preserve the distinction between retrieval, evidence selection, reasoning, and answer synthesis.\n\n\nThis note intentionally avoids internal PR references, private traces, customer data, and unpublished benchmark numbers. It only summarizes the implemented engineering surface at a public-safe level.\n\n","tags":["Agentic RAG","Engineering","Status"],"author":"$undefined","authorId":"$undefined","authorIds":["alper-yilmaz","osman-homek"],"authorImage":"$undefined","readingTime":1,"featured":true,"coverImage":"$undefined"},{"slug":"2026-04-24-memory-retrieval-boundary","title":"Memory and Retrieval Boundary","date":"2026-04-24","excerpt":"We clarified the boundary between conversation memory, retrieved evidence, and answer synthesis in the Agentic RAG runtime.","content":"\nWe clarified the boundary between memory and retrieval. Memory can stabilize follow-ups, but it should not silently become evidence unless the system can inspect and explain its use.\n\n## Boundary rules\n\n- Conversation state can help interpret the query.\n- Retrieved evidence remains separate from remembered context.\n- Answer synthesis should not treat unresolved memory as proof.\n- Memory writes must be intentional and bounded.\n\n\nMemory is useful only when it is inspectable. If memory cannot be reviewed, corrected, or scoped, it becomes another hidden prompt input.\n\n\n## Trade-off\n\nUsing memory can reduce recomputation and improve follow-up handling. It also increases the risk of stale or irrelevant context. The runtime keeps that trade-off explicit.\n","tags":["Memory","Retrieval","Grounding"],"author":"$undefined","authorId":"$undefined","authorIds":["alper-yilmaz","osman-homek"],"authorImage":"$undefined","readingTime":1,"featured":false,"coverImage":"$undefined"},{"slug":"2026-04-17-evaluation-case-shaping","title":"Evaluation Case Shaping","date":"2026-04-17","excerpt":"We shaped Agentic RAG evaluation cases around follow-ups, weak evidence, source targeting, and complex multi-part questions.","content":"\nWe shaped evaluation cases around the failure modes that matter for Agentic RAG. The useful tests are not only \"can it answer a single question\"; they need to show what happens when retrieval and reasoning interact.\n\n## Case types\n\n- Follow-up questions where prior context changes the retrieval target.\n- Complex questions that should trigger decomposition.\n- Direct questions that should avoid unnecessary decomposition.\n- Weak-evidence cases where the system should avoid confident synthesis.\n- Source-targeting cases where the router should avoid irrelevant corpora.\n\n\nThis is evaluation harness work, not a published accuracy result. Accuracy claims require ground truth, reproducible runs, and documented failures.\n\n\n## Engineering note\n\nThe evaluation direction favors failure visibility over inflated pass rates. If evidence is missing, the expected behavior is to surface that condition.\n","tags":["Evaluation","QA","Agentic RAG"],"author":"$undefined","authorId":"$undefined","authorIds":["alper-yilmaz","osman-homek"],"authorImage":"$undefined","readingTime":1,"featured":false,"coverImage":"$undefined"},{"slug":"2026-04-10-grounding-trace-pass","title":"Grounding Trace Pass","date":"2026-04-10","excerpt":"We tightened how retrieval diagnostics, evidence boundaries, and answer synthesis traces connect in the Agentic RAG loop.","content":"\nWe tightened the grounding trace across the Agentic RAG loop. The goal is simple: answer synthesis should be explainable from retrieval and evidence state, not from a post-hoc narrative.\n\n## Trace shape\n\nThe public-safe shape is:\n\n1. Query structure is parsed.\n2. Retrieval method selection is recorded.\n3. Candidate evidence is collected.\n4. Evidence selection narrows the context.\n5. Answer synthesis uses the selected context.\n6. Diagnostics remain available for review.\n\n\nIf retrieval, evidence selection, and synthesis collapse into one opaque step, a correct-looking answer cannot be audited. The trace exists to keep those stages separable.\n\n\n## What stayed out\n\nThis update does not expose private corpora, internal PRs, customer data, or operational traces.\n","tags":["Grounding","Evidence","Diagnostics"],"author":"$undefined","authorId":"$undefined","authorIds":["alper-yilmaz","osman-homek"],"authorImage":"$undefined","readingTime":1,"featured":false,"coverImage":"$undefined"},{"slug":"2026-04-03-latency-discipline-pass","title":"Latency Discipline Pass for Agentic RAG","date":"2026-04-03","excerpt":"We tightened the Agentic RAG path around avoidable work: repeated decomposition, repeated embeddings, and unnecessary retrieval fan-out.","content":"\nWe tightened the Agentic RAG path around avoidable work. The target was not to make the system look faster in a demo; the target was to make expensive steps visible and defensible.\n\n## What changed\n\n- Repeated decomposition paths were reviewed against cache behavior.\n- Retrieval fan-out stayed tied to query complexity rather than becoming a default.\n- Embedding reuse stayed explicit so unchanged inputs do not trigger unnecessary work.\n- Diagnostics remain part of the loop so latency reductions do not hide retrieval behavior.\n\n\nThis update does not publish latency numbers. Latency claims require measured runs, environment details, and variance. This pass only documents implementation direction and control points.\n\n\n## Engineering note\n\nLow latency and grounding can conflict. The system should avoid repeated work, but it cannot skip evidence selection just to save time. That boundary remains explicit.\n","tags":["Agentic RAG","Latency","Cache"],"author":"$undefined","authorId":"$undefined","authorIds":["alper-yilmaz","osman-homek"],"authorImage":"$undefined","readingTime":1,"featured":false,"coverImage":"$undefined"},{"slug":"2026-03-20-ingestion-guardrails","title":"Ingestion Guardrails","date":"2026-03-20","excerpt":"Ingestion guardrails were added to make document handling and chunk quality more explicit before retrieval uses the content.","content":"\nWe added ingestion guardrails around document and chunk handling. Retrieval quality depends on what enters the index, not only on the query path.\n\nThe focus is explicit failure. Bad input, duplicate state, or invalid chunks should be caught before they become retrieval noise.\n","tags":["Ingestion","Chunking","Reliability"],"author":"$undefined","authorId":"$undefined","authorIds":["alper-yilmaz","osman-homek"],"authorImage":"$undefined","readingTime":1,"featured":false,"coverImage":"$undefined"},{"slug":"2026-03-17-qa-audit-benchmark","title":"QA Audit and Benchmark Harness","date":"2026-03-17","excerpt":"QA audit and benchmark tooling was added to separate measurable behavior from impression-based assessment.","content":"\nWe added QA audit and benchmark tooling around RAG behavior. Accuracy claims require reproducible inputs, expected outputs, and documented failure cases.\n\nThis update does not publish benchmark results. It adds the harness needed to measure behavior without relying on subjective demos.\n\n## Evaluation dimensions\n\nThe harness direction focuses on:\n\n- retrieval coverage,\n- evidence relevance,\n- answer grounding,\n- follow-up stability,\n- empty-context behavior,\n- and failure transparency.\n\n\nBenchmarks without methodology are invalid. A useful result needs the task set, environment, expected answers, scoring rules, and failure notes.\n\n\n## Why we added it\n\nAgentic RAG can look good in a hand-picked conversation and still fail on repeatable cases. The harness is there to make repeated evaluation possible.\n","tags":["Evaluation","QA","Benchmark"],"author":"$undefined","authorId":"$undefined","authorIds":["alper-yilmaz","osman-homek"],"authorImage":"$undefined","readingTime":1,"featured":false,"coverImage":"$undefined"},{"slug":"2026-03-13-feedback-store","title":"Feedback Store Added","date":"2026-03-13","excerpt":"A feedback store was added so retrieval and answer behavior can be improved from recorded outcomes instead of anecdotes.","content":"\nWe added a feedback store to support recorded outcomes. This gives the system a place to persist feedback signals about retrieval and answer behavior.\n\nFeedback is not the same as proof. It becomes useful when tied to reproducible examples, ground truth, and failure categories.\n","tags":["Feedback","Evaluation","Reliability"],"author":"$undefined","authorId":"$undefined","authorIds":["alper-yilmaz","osman-homek"],"authorImage":"$undefined","readingTime":1,"featured":false,"coverImage":"$undefined"},{"slug":"2026-03-10-datasource-routing","title":"Datasource Routing Work","date":"2026-03-10","excerpt":"Datasource routing work was added so retrieval can target relevant sources instead of searching every corpus the same way.","content":"\nWe added datasource routing work to make retrieval more selective. A question about a runbook and a question about a policy should not necessarily hit the same source set.\n\nThe trade-off is recall versus latency. Source targeting can reduce noise and cost, but the router must be explicit about what it skipped.\n","tags":["Datasources","Routing","Retrieval"],"author":"$undefined","authorId":"$undefined","authorIds":["alper-yilmaz","osman-homek"],"authorImage":"$undefined","readingTime":1,"featured":false,"coverImage":"$undefined"},{"slug":"2026-03-06-keyword-retrieval-path","title":"Keyword Retrieval Path","date":"2026-03-06","excerpt":"Keyword retrieval work gave the RAG layer a lexical path for exact terms, identifiers, and phrases.","content":"\nWe added a keyword retrieval path to complement vector retrieval. Exact terms, identifiers, file names, and policy phrases often need lexical matching.\n\nThis path exists because semantic retrieval alone is not sufficient for every question. Hybrid orchestration can use both signals where appropriate.\n","tags":["Keyword Retrieval","Evidence","Retrieval"],"author":"$undefined","authorId":"$undefined","authorIds":["alper-yilmaz","osman-homek"],"authorImage":"$undefined","readingTime":1,"featured":false,"coverImage":"$undefined"},{"slug":"2026-03-03-retrieval-stitching","title":"Retrieval Stitching Work","date":"2026-03-03","excerpt":"Retrieval stitching work was added to join evidence from multiple retrieval paths without losing source boundaries.","content":"\nWe added retrieval stitching work for cases where evidence comes from more than one path. The system needs to merge useful context without erasing where each result came from.\n\nThis is a grounding requirement. If source boundaries disappear, later reasoning cannot explain why a piece of evidence was trusted.\n","tags":["Evidence","Retrieval","Hybrid Retrieval"],"author":"$undefined","authorId":"$undefined","authorIds":["alper-yilmaz","osman-homek"],"authorImage":"$undefined","readingTime":1,"featured":false,"coverImage":"$undefined"},{"slug":"2026-02-24-memory-consolidation","title":"Memory Consolidation Work","date":"2026-02-24","excerpt":"Memory consolidation work was added to reduce duplicate memory state and keep useful context retrievable over time.","content":"\nWe added memory consolidation work around the RAG system. The goal is to avoid unbounded, duplicate, or low-value memory growth.\n\nMemory has to remain inspectable. Consolidation should improve future retrieval without turning memory into invisible prompt stuffing.\n","tags":["Memory","Consolidation","Follow Ups"],"author":"$undefined","authorId":"$undefined","authorIds":["alper-yilmaz","osman-homek"],"authorImage":"$undefined","readingTime":1,"featured":false,"coverImage":"$undefined"},{"slug":"2026-02-20-structured-answer-path","title":"Structured Answer Path","date":"2026-02-20","excerpt":"Structured answer work was added so synthesis can carry explicit fields instead of only free-form text.","content":"\nWe added structured answer work to the RAG path. Free-form text is not always enough when the caller needs citations, status fields, uncertainty, or follow-up actions.\n\nThe goal is not to make answers longer. It is to make answer synthesis more explicit and easier to validate.\n","tags":["Answer Synthesis","Structure","Grounding"],"author":"$undefined","authorId":"$undefined","authorIds":["alper-yilmaz","osman-homek"],"authorImage":"$undefined","readingTime":1,"featured":false,"coverImage":"$undefined"},{"slug":"2026-02-17-reranker-selector","title":"Reranker Selector Work","date":"2026-02-17","excerpt":"Reranker selection work was added to keep evidence ordering separate from initial retrieval.","content":"\nWe added reranker selection work so result ordering can be handled after initial retrieval. The first retrieval pass finds candidates; reranking decides what deserves attention.\n\nKeeping those steps separate is important. It lets the system inspect retrieval quality and evidence ordering independently before synthesis.\n","tags":["Reranking","Evidence","Retrieval"],"author":"$undefined","authorId":"$undefined","authorIds":["alper-yilmaz","osman-homek"],"authorImage":"$undefined","readingTime":1,"featured":false,"coverImage":"$undefined"},{"slug":"2026-02-13-prompt-loader-auditability","title":"Prompt Loading Becomes Auditable","date":"2026-02-13","excerpt":"Prompt loading work made behavior dependencies easier to inspect instead of hiding important instructions inside ad hoc strings.","content":"\nWe added prompt loading structure around the RAG runtime. Prompt-dependent behavior should be versioned and inspectable, not hidden in scattered string concatenation.\n\nThis supports future debugging. If answer behavior changes, the prompt surface must be part of the evidence trail.\n","tags":["Prompts","Auditability","Runtime"],"author":"$undefined","authorId":"$undefined","authorIds":["alper-yilmaz","osman-homek"],"authorImage":"$undefined","readingTime":1,"featured":false,"coverImage":"$undefined"},{"slug":"2026-02-10-language-routing","title":"Language Routing Work","date":"2026-02-10","excerpt":"Language routing work was added so multilingual queries can be handled without treating language as a knowledge silo.","content":"\nWe added language routing work for RAG queries. The goal is to keep knowledge access language-aware without forcing every path into translation first.\n\nThis matters for retrieval because evidence can exist in a different language from the user query. Language routing gives the system an explicit place to decide how to handle that case.\n","tags":["Multilingual","Retrieval","Routing"],"author":"$undefined","authorId":"$undefined","authorIds":["alper-yilmaz","osman-homek"],"authorImage":"$undefined","readingTime":1,"featured":false,"coverImage":"$undefined"},{"slug":"2026-02-06-graph-retrieval-path","title":"Graph Retrieval Path Added to the Retrieval Surface","date":"2026-02-06","excerpt":"Graph retrieval modules were added so relationship-oriented evidence can become a selectable retrieval path.","content":"\nWe added graph-oriented retrieval modules to the RAG surface. This gives the runtime a path for relationship-heavy questions where plain chunks may not be enough.\n\nGraph retrieval remains one retrieval option, not a replacement for vector or keyword search. The system still needs to route based on query shape and evidence needs.\n","tags":["Graph","Retrieval","KFE"],"author":"$undefined","authorId":"$undefined","authorIds":["alper-yilmaz","osman-homek"],"authorImage":"$undefined","readingTime":1,"featured":false,"coverImage":"$undefined"},{"slug":"2026-02-03-kfe-public-boundary","title":"KFE Retrieval Boundary Defined","date":"2026-02-03","excerpt":"KFE was positioned as an additional structured knowledge-access path alongside chunk retrieval.","content":"\nWe clarified the engineering boundary for KFE, the Knowledge Fabric Engine. KFE is an additional structured knowledge-access path alongside chunk retrieval.\n\n## Boundary\n\nKFE is not a replacement for retrieval. It is a second path for knowledge that benefits from structure, relationships, or claim-aware access.\n\nThe Agentic RAG runtime still needs to decide:\n\n- when chunk retrieval is enough,\n- when graph or structured access is more appropriate,\n- how evidence from different paths should be stitched,\n- and how answer synthesis should cite or explain the selected evidence.\n\n\nThis update describes the role of KFE without exposing private datasets, source contents, internal traces, or unpublished validation artifacts.\n\n","tags":["KFE","Knowledge","Architecture"],"author":"$undefined","authorId":"$undefined","authorIds":["alper-yilmaz","osman-homek"],"authorImage":"$undefined","readingTime":1,"featured":false,"coverImage":"$undefined"},{"slug":"2026-01-30-observability-surface","title":"Observability Surface for RAG Operations","date":"2026-01-30","excerpt":"Metrics and health modules were added around RAG infrastructure so operational state can be inspected.","content":"\nWe expanded the observability surface around RAG infrastructure. The codebase includes health and metrics modules so runtime state can be inspected instead of inferred.\n\nThis does not claim production uptime or performance. It establishes the plumbing required to measure and debug those properties.\n","tags":["Observability","Metrics","Reliability"],"author":"$undefined","authorId":"$undefined","authorIds":["alper-yilmaz","osman-homek"],"authorImage":"$undefined","readingTime":1,"featured":false,"coverImage":"$undefined"},{"slug":"2026-01-16-reflex-policies","title":"Reflex Policies Added Around Retrieval","date":"2026-01-16","excerpt":"Reflex policy hooks were added for pre-retrieval, post-retrieval, and post-generation checks.","content":"\nWe added reflex policy hooks around the RAG lifecycle. Policies can run before retrieval, after retrieval, and after generation.\n\nThis creates a control surface for behaviors such as warning on low coverage, refusing low-confidence paths, blocking prompt injection patterns, and enforcing token budgets.\n\n## Policy positions\n\nThe hooks are placed where they can stop different classes of failure:\n\n- **Pre-retrieval**: reject unsafe or malformed inputs before they create expensive work.\n- **Post-retrieval**: detect empty context, low coverage, or weak evidence before synthesis.\n- **Post-generation**: check whether the answer drifted away from the selected evidence.\n\n\nPolicies must surface explicit reasons. A policy that silently rewrites behavior is another hidden prompt layer.\n\n\n## Why this matters\n\nAgentic RAG needs controls that are close to the reasoning loop. Policy checks after the final answer are too late for many retrieval failures.\n","tags":["Policies","Safety","Grounding"],"author":"$undefined","authorId":"$undefined","authorIds":["alper-yilmaz","osman-homek"],"authorImage":"$undefined","readingTime":1,"featured":false,"coverImage":"$undefined"},{"slug":"2026-01-13-circuit-breaker","title":"Circuit Breaker for RAG Failures","date":"2026-01-13","excerpt":"A circuit breaker path was added so repeated retrieval or generation failures can be surfaced instead of retried blindly.","content":"\nWe added circuit breaker support for RAG failure paths. Repeating the same failing operation can waste latency and hide the real issue.\n\nThe breaker is part of failure transparency. When the system cannot proceed reliably, it should stop or degrade explicitly rather than producing a confident answer from weak evidence.\n","tags":["Reliability","Safety","Runtime"],"author":"$undefined","authorId":"$undefined","authorIds":["alper-yilmaz","osman-homek"],"authorImage":"$undefined","readingTime":1,"featured":false,"coverImage":"$undefined"},{"slug":"2026-01-09-safety-validators","title":"Safety Validators for the RAG Path","date":"2026-01-09","excerpt":"Safety validators were added around the RAG path to make invalid inputs and unsafe retrieval states explicit.","content":"\nWe added safety validators around the RAG path. These checks are meant to surface invalid inputs, unsafe states, and policy-sensitive paths before they become answer quality problems.\n\nThe implementation remains conservative: validation should block or warn with explicit reasons. Silent fallback is not acceptable for this layer.\n","tags":["Safety","Validation","Agentic RAG"],"author":"$undefined","authorId":"$undefined","authorIds":["alper-yilmaz","osman-homek"],"authorImage":"$undefined","readingTime":1,"featured":false,"coverImage":"$undefined"},{"slug":"2026-01-06-vector-integrity-contract","title":"Vector Integrity Contract","date":"2026-01-06","excerpt":"Vector integrity checks were added to catch invalid retrieval state before it leaks into answer synthesis.","content":"\nWe added vector integrity contract work to make retrieval failures explicit. Bad vector state should fail as a system issue, not silently degrade into weak answers.\n\nThis improves failure transparency. It does not guarantee answer accuracy by itself, but it reduces one class of hidden retrieval errors.\n","tags":["Reliability","Retrieval","Validation"],"author":"$undefined","authorId":"$undefined","authorIds":["alper-yilmaz","osman-homek"],"authorImage":"$undefined","readingTime":1,"featured":false,"coverImage":"$undefined"},{"slug":"2025-12-30-lineage-and-events","title":"Lineage and Retrieval Event Logging","date":"2025-12-30","excerpt":"RAG event journals and lineage artifacts were added so retrieval behavior can be inspected after a run.","content":"\nWe added retrieval event logging and lineage-oriented artifacts. The goal is operational clarity: a run should leave enough trace data to understand what happened.\n\nThis is not user-facing analytics. It is engineering evidence for debugging retrieval decisions, cache behavior, and grounding failures.\n","tags":["Diagnostics","Lineage","Retrieval"],"author":"$undefined","authorId":"$undefined","authorIds":["alper-yilmaz","osman-homek"],"authorImage":"$undefined","readingTime":1,"featured":false,"coverImage":"$undefined"},{"slug":"2025-12-23-evidence-selection-boundaries","title":"Evidence Selection Boundaries","date":"2025-12-23","excerpt":"We tightened the boundary between retrieval output and evidence selection so answer synthesis does not consume context blindly.","content":"\nRetrieval is not the same as evidence. We started tightening the boundary between candidate chunks and the context that should actually be used for synthesis.\n\nThis keeps the RAG path inspectable. Retrieval can return many candidates, but evidence selection needs its own reasoning and failure modes.\n","tags":["Evidence","Grounding","Reliability"],"author":"$undefined","authorId":"$undefined","authorIds":["alper-yilmaz","osman-homek"],"authorImage":"$undefined","readingTime":1,"featured":false,"coverImage":"$undefined"},{"slug":"2025-12-19-pronoun-resolution-pass","title":"Pronoun Resolution Pass","date":"2025-12-19","excerpt":"A pronoun resolution path was added to reduce failures on follow-ups like 'what about that' and 'what was her role'.","content":"\nWe added a pronoun resolution pass for follow-up handling. This work targets references like \"that\", \"it\", \"he\", and \"she\" when prior conversation state exists.\n\nThe implementation goal is narrow: improve retrieval targeting when a referent can be grounded. If the referent cannot be resolved, the system should surface uncertainty instead of guessing.\n","tags":["Follow Ups","Memory","Retrieval"],"author":"$undefined","authorId":"$undefined","authorIds":["alper-yilmaz","osman-homek"],"authorImage":"$undefined","readingTime":1,"featured":false,"coverImage":"$undefined"},{"slug":"2025-12-16-conversation-context","title":"Conversation Context Becomes a Retrieval Input","date":"2025-12-16","excerpt":"Conversation context handling was added so follow-up questions can reuse prior state instead of being treated as isolated prompts.","content":"\nFollow-up questions are a core failure mode for naive RAG. We added conversation context handling so retrieval can account for what the user already said.\n\nThe system still cannot assume missing facts. The purpose is to preserve state that exists, not to fabricate state that was never established.\n\n## Follow-up behavior we targeted\n\n- \"What about that?\" where the referent exists in the prior turn.\n- \"What was her role?\" where a named person was already introduced.\n- \"Compare it with the earlier option\" where the comparison target is in the conversation state.\n- \"Do the same for the second source\" where source ordering matters.\n\n## Boundary\n\nConversation context can help interpret a query. It is not automatically evidence. The runtime still needs retrieved or remembered context that can be inspected.\n\n\nIf a referent cannot be resolved, the correct behavior is to ask for clarification or surface uncertainty. Guessing creates a false grounding trail.\n\n","tags":["Memory","Follow Ups","Retrieval"],"author":"$undefined","authorId":"$undefined","authorIds":["alper-yilmaz","osman-homek"],"authorImage":"$undefined","readingTime":1,"featured":false,"coverImage":"$undefined"},{"slug":"2025-12-12-semantic-cache-context-isolation","title":"Semantic Cache Context Isolation","date":"2025-12-12","excerpt":"Semantic cache work added context checks so similar wording does not automatically mean the same retrieval state.","content":"\nWe continued cache work with context isolation for semantic reuse. Similar language is not enough; the surrounding conversation can change what a question refers to.\n\nThe cache path therefore needs to account for language, pronoun resolution, and conversation state. This favors correctness over aggressive reuse.\n","tags":["Cache","Memory","Safety"],"author":"$undefined","authorId":"$undefined","authorIds":["alper-yilmaz","osman-homek"],"authorImage":"$undefined","readingTime":1,"featured":false,"coverImage":"$undefined"},{"slug":"2025-12-09-decomposition-cache","title":"Decomposition Cache for Repeated Complex Questions","date":"2025-12-09","excerpt":"Decomposition results gained their own cache path so repeated complex queries do not always restart from zero.","content":"\nComplex questions can require the same decomposition more than once. We added a decomposition cache so the system can reuse intermediate planning work when the request and context are compatible.\n\nThis keeps sequential conversations cheaper without hiding the decomposition step. The system still needs to expose what sub-queries were used and why they were reused.\n","tags":["Decomposition","Cache","Agentic RAG"],"author":"$undefined","authorId":"$undefined","authorIds":["alper-yilmaz","osman-homek"],"authorImage":"$undefined","readingTime":1,"featured":false,"coverImage":"$undefined"},{"slug":"2025-12-05-fingerprint-cache","title":"Fingerprint Cache Added to Reduce Repeated Work","date":"2025-12-05","excerpt":"Fingerprint caching was added as an explicit performance layer for repeated retrieval and reasoning inputs.","content":"\nWe added a fingerprint cache path to reduce repeated work when inputs are equivalent. This is not an accuracy feature by itself; it is a latency control.\n\nThe constraint is correctness. Cache keys must preserve enough context to avoid reusing an answer or retrieval result where the surrounding conversation changed the meaning of the request.\n","tags":["Cache","Latency","Reliability"],"author":"$undefined","authorId":"$undefined","authorIds":["alper-yilmaz","osman-homek"],"authorImage":"$undefined","readingTime":1,"featured":false,"coverImage":"$undefined"},{"slug":"2025-12-02-hybrid-retrieval-shape","title":"Hybrid Retrieval Shape","date":"2025-12-02","excerpt":"Vector and keyword retrieval paths were shaped into a hybrid orchestration layer with explicit diagnostics.","content":"$b","tags":["Hybrid Retrieval","Evidence","Diagnostics"],"author":"$undefined","authorId":"$undefined","authorIds":["alper-yilmaz","osman-homek"],"authorImage":"$undefined","readingTime":1,"featured":false,"coverImage":"$undefined"},{"slug":"2025-11-28-complexity-gates","title":"Complexity Gates for Decomposition","date":"2025-11-28","excerpt":"We added query complexity logic so decomposition can be based on explicit signals instead of defaulting every request into a heavy path.","content":"\nAgentic retrieval should not make every request expensive. We added complexity evaluation so the system can distinguish direct questions from requests that need decomposition.\n\nThis is a latency and accuracy trade-off. Decomposition can improve recall for multi-part questions, but it adds work. The gate keeps that cost visible and avoids treating \"more retrieval\" as automatically better.\n","tags":["Decomposition","Latency","Retrieval"],"author":"$undefined","authorId":"$undefined","authorIds":["alper-yilmaz","osman-homek"],"authorImage":"$undefined","readingTime":1,"featured":false,"coverImage":"$undefined"},{"slug":"2025-11-26-retrieval-routing-baseline","title":"Retrieval Routing Baseline","date":"2025-11-26","excerpt":"A retrieval router baseline was introduced to make vector, keyword, graph, and swarm-style paths selectable instead of implicit.","content":"\nThe retrieval path gained an explicit routing layer. The codebase now contains selectable retrieval methods rather than a single hidden path.\n\nThe important change is inspectability. Routing can produce diagnostics about method choice, bypass behavior, and whether a query should stay simple or use a more complex path. That makes later debugging possible without guessing what the runtime did.\n","tags":["Retrieval","Router","Architecture"],"author":"$undefined","authorId":"$undefined","authorIds":["alper-yilmaz","osman-homek"],"authorImage":"$undefined","readingTime":1,"featured":false,"coverImage":"$undefined"},{"slug":"2025-11-24-query-structure-parsing","title":"Query Structure Parsing Enters the RAG Path","date":"2025-11-24","excerpt":"Early work moved query handling toward explicit structure parsing so retrieval could respond to intent and complexity.","content":"\nWe added structure-aware query handling to the Agentic RAG path. The goal was to stop treating every user request as a flat search string.\n\nThis gave the pipeline room to separate simple lookups from multi-part questions that need decomposition, context tracking, or stricter evidence selection. It did not replace retrieval; it made retrieval decisions more explicit before the system fetched context.\n","tags":["Retrieval","Query Processing","Agentic RAG"],"author":"$undefined","authorId":"$undefined","authorIds":["alper-yilmaz","osman-homek"],"authorImage":"$undefined","readingTime":1,"featured":false,"coverImage":"$undefined"},{"slug":"2025-11-21-agentic-rag-work-begins","title":"Agentic RAG Work Begins","date":"2025-11-21","excerpt":"We started the Agentic RAG track by defining retrieval as part of the reasoning loop, not a one-shot pre-processing step.","content":"$c","tags":["Agentic RAG","Architecture","Research"],"author":"$undefined","authorId":"$undefined","authorIds":["alper-yilmaz","osman-homek"],"authorImage":"$undefined","readingTime":1,"featured":true,"coverImage":"$undefined"}]}]

Updates