I'd be glad to hear more. I'm not certain what I would even ask, as the space is really fuzzy (prompting and all that).
I've got an Ollama instance (24GB VRAM) I want to leverage to try and reduce dependency on Claude Code. Even the tech stack seems unapproachable. I've considered LiteLLM, router agents, micro-agents (smallest slice of functionality possible), etc. I haven't wrapped my head around it all the way, though.
Ideally, it would be something like:
UI <--> LiteLLM
^
|
v
Agent Shim
Where the UI is probably aider or something similar. Claude Code muddies the differentiation between UI and agent (with all the built in system-prompt injection). I imagine I would like to move system-prompt injection / agent CRUD into the agent shim.I'm just spitballing here.
Thoughts? (my email is in my profile if you would prefer to continue there)