Complex questions can require the same decomposition more than once. We added a decomposition cache so the system can reuse intermediate planning work when the request and context are compatible.
This keeps sequential conversations cheaper without hiding the decomposition step. The system still needs to expose what sub-queries were used and why they were reused.


