Handling "fast-routing" in mastra?
I’ve noticed Mastra can feel a bit slow for smaller or simpler tasks (which I assume is just part of having an agent in the loop). I’m exploring a setup where lightweight requests get handled directly through an NLP or heuristic route, and only more complex ones fall back to the full Mastra agent.
The issue is that these “fast-routed” messages and their quick responses don’t get threaded through Mastra, so they don’t appear in the conversation history or memory context when the user continues chatting.
Has anyone tackled this before or found a good workaround to keep fast responses visible in Mastra’s chat/memory context? Curious if there’s a recommended approach here.
2 Replies
📝 Created GitHub issue: https://github.com/mastra-ai/mastra/issues/8969
GitHub
[DISCORD:1428691262017372160] Handling "fast-routing" in mastra? ·...
This issue was created from Discord post: https://discord.com/channels/1309558646228779139/1428691262017372160 I’ve noticed Mastra can feel a bit slow for smaller or simpler tasks (which I assume i...
You’re in uncharted territory but you can add messages manually to an Agents memory via the memory class.