

Do you have a GitHub and would you be willing to share it with me so I could credit you with the name? No worries if not, I can just link to your Lemmy profile instead of you prefer. I just don’t want to change it without giving credit.


Do you have a GitHub and would you be willing to share it with me so I could credit you with the name? No worries if not, I can just link to your Lemmy profile instead of you prefer. I just don’t want to change it without giving credit.


Looking forward to it! Heads up in case you missed it: I had settled on renaming it to Coyote, so sometime this week will be a breaking change and release to get that done.
Biggest pains are just going to be updating the repo tokens for Crates.io and renaming the homebrew repo.


Works for me. I’ll refactor that and rename it tomorrow and hopefully have a new minor release sometime this week. It’ll be another baking change release so I’ll need to attach a couple commands to the release notes to make it easy to migrate.


Looking at Llama-swap, since it says it supports OpenAI-compatible API, it should just work natively already. Just set up the client to be type: openai-compatible and fill in the URL and provide the models. Should work out of the box!


After sitting with Coyote for a while, I’m really liking the name. Before I get too attached, any other ideas? (Just to make sure I stay objective 😛)


OpenCode is specific to coding workflows. Loki is built to be a general LLM runtine/workflow engine for any problem domain, not just code. An example use I have for it is a cron job that runs at boot to


When it comes to writing code, OpenCode is my go-to as well. It’s my ultimate benchmark for how well optimized and reliable I can make local models function in Loki.


Ooh I like Coyote! That’s definitely in the running now. Not to mention that’s really a really cool allusion to Native American mythology!


It was just the one gif I had available and also the model that worked fast enough to fit into a gif without taking forever between prompts so I could demo Loki well. You make a good point though. It’s an old build and is slightly outdated. I’ll update that. Thanks for pointing that out.


Yes it does. By default, any of the execute_command or fs_write/fs_patch/etc. tools all have guards around them that prompt for user confirmation before doing things. They can be disabled via the AUTO_APPROVE environment variable if necessary (like they are when using the sisyphus agent). For bash tools, I’ve included functions that can help do this when you write your own tools. For Python tools, you can use the usual input methods.


Unfortunately, yes. It’s one reason I’m trying to figure out a good mechanism to maybe do something like multiple ollama hosts. So like: you can specify what model to use specifically in an agent. But if an agent delegates to a sub-agent, it unloads that model and loads the new one. I’m trying to figure out if there’s a way to “alternate” between multiple hosts (say, ollama running locally and one running on your server), so that when a switch happens, it does it on the secondary host while also looking ahead to see what needs to be switched, if anything, on the primary host.
It supports multiple Ollama hosts right now as-is so what I’ve honestly been doing for the time being is specify which model on which host each agent uses so there’s only loading of one model at the beginning of a session. Then there’s no unloading/loading/etc. The other thing I’ve been trying is to see how small I can get the models to be without losing performance. While the tricks implemented in Loki help dramatically, I know there’s still a lot more I can do to improve it further.


I’ve been thinking about integrating LSP into it but I can’t think of a great way to do it. I’ve been meaning to look at OpenCode and see how they do it. Maybe I’ll work that into the next release!


I’m using a ton of different ones but the main ones I use daily are
gemma4:26bdeepseek-coderdeepseek-r1:32bdevstral:24bgranite-code:34bopenthinker:latestphi4:latestqwen3:30bmixtral:8x22bI’m also going to use this opportunity to plug an amazing project to help figure out which models will work well on my hardware: https://github.com/AlexsJones/llmfit Is amazing!


Yeah… 😅 I originally named it Loki because, well…if you leave LLMs unsupervised they just create mischief. Any ideas of a good rename? I’ve gotten this comment before and I just couldn’t think of anything good.


So actually, this was the original purpose of it. But all the help I tried to get on it didn’t really have much interest in doing anything outside of the usual big model providers, so I tried advertising a more general use case to attract more input. I can’t deny that agnostic support for even the big providers is helpful when you’re trying to stay current with the rapid advances in LLMs.
After that, I kind of gave up on getting feedback on local-first models. So, instead, I just dove in head-first the way I wanted;Trying new things, building new agents to try and rival Claude Code, adding features as I found them useful and necessary to improve that reliability, etc., and iterating. Then, with the most recent release on Friday, I had done so many changes and improvements specifically for local models that I thought I finally had a strong enough tool to maybe pique enough people’s interest to get some feedback and input. 🙂
Oh, and the config example shows how to add Ollama models here


That’s exactly the reason why I made it only work with user-defined aliases. That way it forces you to think of how to cover the most ground command-wise without needing anything interactive or too much response data. It’s slow but it’s functional and that’s really all I wanted from it. I can have scripts on my server that fix things and just invoke them with this.


They’re encrypted with a PSK encrypted with either AES128 or AES256. It’s built into the Meshtastic firmware so that’s not something I have to handle directly, thankfully!
By popular demand, Loki is now Coyote!
I’ve just released a new version with the full rename.
Thanks to @minticecream for the suggestion!
All the docs are updated except for the screenshots and gifs. I’ll update those tomorrow (since they don’t do any harm right now and don’t require another release to be updated).
For almost a year I’ve been using Loki as the name but it never really satisfied me. Not to mention it’s already the name of an extremely popular logging system by Grafana, which makes Loki hard to discover.
Got some great feedback and the rename was born! Thanks again everyone who gave suggestions and for all the positive feedback!