overview for smiletolerantly

Intent recognition for HomeAssistant without an LLM? in c/selfhosted@lemmy.world

[–] smiletolerantly@awful.systems 10 points 5 months ago

Yep, that's the idea! This post basically boils down to "does this exist for HASS already, or do I need to implement it?" and the answer, unfortunately, seems to be the latter.

Intent recognition for HomeAssistant without an LLM? in c/selfhosted@lemmy.world

[–] smiletolerantly@awful.systems 4 points 5 months ago* (last edited 5 months ago) (2 children)

Thanks, had not heard of this before! From skimming the link, it seems that the integration with HASS mostly focuses on providing wyoming endpoints (STT, TTS, wakeword), right? (Un)fortunately, that's the part that's already working really well 😄

However, the idea of just writing a stand-alone application with Ollama-compatible endpoints, but not actually putting an LLM behind it is genius, I had not thought about that. That could really simplify stuff if I decide to write a custom intent handler. So, yeah, thanks for the link!!

Intent recognition for HomeAssistant without an LLM? in c/selfhosted@lemmy.world

[–] smiletolerantly@awful.systems 5 points 5 months ago (2 children)

Thanks for your input! The problem with the LLM approach for me is mostly that I have so many entities, HASS exposing them all (or even the subset of those I really, really want) is already big enough to slow everything to a crawl, and to get bad results from all models I've tried. I'll give the model you mentioned another shot though.

However, I really don't want to use an LLM for this. It seems brittle and like overkill at the same time. As you said, intent classification is a wee bit older than LLMs.

Unfortunately, the sentence template matching approach alone isn't sufficient, because quite frequently, the STT is imperfect. With HomeAssistant, currently the intent "turn off all lights" is, for example, not understood if STT produces "turn off all light". And sure, you can extend the template for that. But what about

turn of all lights
turn off wall lights
turnip off all lights
off all lights
off all fights
...

A human would go "huh? oh, sure, I'll turn off all lights". An LLM might as well. But a fuzzy matching / closest Levensthein distance approach should be more than sufficient for this, too.

Basically, I generally like the sentence template approach used by HASS, but it just needs that little bit of additional robustness against imperfections.

Intent recognition for HomeAssistant without an LLM? in c/selfhosted@lemmy.world

[–] smiletolerantly@awful.systems 5 points 5 months ago

Thanks for sharing your experience! I have actually mostly been testing with a good desk mic, and expect recognition to get worse with room mics... The hardware I bought are seeed ReSpeaker mic arrays, I am somewhat hopeful about them.

Adding a lot of alternative sentences does indeed help, at least to a certain degree. However, my issue is less with "it should recognize various different commands for the same action", and more "if I mumble, misspeak, or add a swear word on my third attempt, it should still just pick the most likely intent", and that's what's currently missing from the ecosystem, as far as I can tell.

Though I must conceit, copying your strategy might be a viable stop-gap solution to get rid of Alexa. I'll have to pay around with it a bit more.

That all said, if you find a better intent matcher or another solution, please do report back as I am very interested in an easier solution that does not require me to think of all possible sentence ahead of time.

Roger.

Intent recognition for HomeAssistant without an LLM? in c/selfhosted@lemmy.world

[–] smiletolerantly@awful.systems 3 points 5 months ago* (last edited 5 months ago) (1 children)

Never heard about willow before - is it this one? Seems there is still recent activity in the repo - did the creator only recently pass away? Or did someone continue the project?

How's your experience been with it?

And sure, will do!

New trailer for Star Trek: Voyager - Across the Unknown has a rather bald Borg crew member in c/gaming@lemmy.zip

[–] smiletolerantly@awful.systems 2 points 5 months ago

Amazing. She's a great role model.

New trailer for Star Trek: Voyager - Across the Unknown has a rather bald Borg crew member in c/gaming@lemmy.zip

[–] smiletolerantly@awful.systems 4 points 5 months ago (2 children)

Ah! Finally! Something where I can look up at the sky and go:

What would Janeway do...?

without people looking at me like I'm a weirdo and that wasn't what everyone does when faced with difficult questions.

Oh the humanity in c/microblogmemes@lemmy.world

[–] smiletolerantly@awful.systems 7 points 5 months ago

Yeah. Back left is the only burned in the right size for my pasta pot. Back right is a copy of front left and thus uniquely useless.

ich🐧🎮iel in c/ich_iel@feddit.org

[–] smiletolerantly@awful.systems 9 points 5 months ago* (last edited 5 months ago)

...benutzt Wurzel-ebene Gegentäuschung. Damit hat eine Amerikanisch-Saudische Firma eine Wurzel-Werkzeugkiste auf deinem Rechner. Ich weiß ja nicht, wie begehrenswert das ist.

Can we all agree that CSS is a massive pile of shit? in c/webdev@programming.dev

[–] smiletolerantly@awful.systems 6 points 5 months ago

Disagree. CSS allows you to do whatever you want with it, usually with just a handful of lines. The "it's so difficult to center things!" meme is, well, a meme.

$5,000 grant from Slate Star Codex to get an AI to write 5,000 novels about AI going well, to be fed back into AI training corpuses. This is the most Effective possible Altruism. in c/sneerclub@awful.systems

[–] smiletolerantly@awful.systems 12 points 5 months ago

Ironic. Every AI who would be worthy of that name, would also be capable of understanding the context of "AI-negativity" and thus clearly not "hyperstition itself into existence".

Loops Joins the Fediverse in c/fediverse@lemmy.world

[–] smiletolerantly@awful.systems 13 points 5 months ago (1 children)

Yeah but why would I make myself dependent on Cloudflare.