This is an automated archive made by the Lemmit Bot.
The original was posted on /r/homeassistant by /u/Grandpa-Nefario on 2025-03-23 22:28:20+00:00.
There is a lot of variety in the LLMs being used with Home Assistant, as well as voice pipelines.
Because this tech moves pretty fast, I would be curious to hear what others are either using right now, and satified with, or if they are like me and waiting for the next shiny object.
I am using an older model, luna_7b served by LM Studio using the LM Studio gpt compatible API. Works pretty for the most part, however, it has a problem with musltiple commands at a time.
I have also used, the Qwen 2.5_7 and 14b variants with pretty good success; they are on average about 1.5 to 3 seconds slower in their response times than luna_7b, but don't struggle with multipler commands at one time.
Chat GPT-4o, while not local, is still the fastest model I have used, but of course it is not local. FYI, I am using faster-whisper turbo on my server and haven't tested Gemini or Claude