this post was submitted on 14 Feb 2024
546 points (97.4% liked)
Even ignoring privacy arguments, I think that voice control is a great use case for running services locally - lower latency from not having to upload your audio sample, plus the option of having it learn your accent, is very attractive.
That said, voice control is irritatingly error-prone and seems to be slower than just reaching for the remote control. I agree that automatic stuff would be best, but some stuff you can't have rules for.
Something that would be interesting is a more eye- and gesture-based system: I'm thinking something like you look at the camera and slice across your throat for stop or squeeze fingers together to reduce volume. This is definitely one to run locally, for privacy and performance reasons.
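The gesture-to-command mapping described above could be sketched as a simple dispatcher. This is a minimal, hypothetical sketch: it assumes some upstream recognizer (e.g. a local vision model) emits gesture labels like `"throat_slice"` or `"finger_pinch"`; those names, the `GestureController` class, and the handlers are all made up for illustration.

```python
from typing import Callable, Dict

class GestureController:
    """Maps recognized gesture labels to playback commands."""

    def __init__(self) -> None:
        self._handlers: Dict[str, Callable[[], str]] = {}

    def on(self, gesture: str, handler: Callable[[], str]) -> None:
        """Register a handler for a gesture label."""
        self._handlers[gesture] = handler

    def dispatch(self, gesture: str) -> str:
        # Unknown gestures are ignored rather than raising,
        # since real recognizers are noisy.
        handler = self._handlers.get(gesture)
        return handler() if handler else "ignored"

# Toy playback state for the example.
player_volume = 50

def stop() -> str:
    return "stopped"

def volume_down() -> str:
    global player_volume
    player_volume = max(0, player_volume - 10)
    return f"volume={player_volume}"

controller = GestureController()
controller.on("throat_slice", stop)         # slice across throat -> stop
controller.on("finger_pinch", volume_down)  # squeeze fingers -> lower volume

print(controller.dispatch("throat_slice"))  # stopped
print(controller.dispatch("finger_pinch"))  # volume=40
print(controller.dispatch("wave"))          # ignored
```

The hard part, of course, is the recognizer itself, not the dispatch; but keeping the mapping this explicit makes it easy to run entirely on-device.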
Assistive technology has been focused on this for a while.
My brother had severe cerebral palsy and for years (80s-90s) communicated via analog technology, a literal alpha/iconography communication board, which he could tap on with a head wand. By 2000 he had a digital voice, but still had to use a wand.
Stephen Hawking demonstrated eye-sensing technology almost as soon as it was invented, and that was over a decade ago.
In most cases there is a definite aspect of "bespokeness" to implementing assistive consumer communication technology, but the barriers to implementing the same for an able-bodied audience would appear much lower.
But where do you put the camera? If you're sitting in front of the TV, then near the TV makes sense. What if you're sitting facing a different direction with a book though? What if your hands are full?
A camera based system would be much more limited, and probably wouldn't work in the dark.
You're assuming that we can't have both. Why not have it as a complementary input?
I think looking at a device and talking is better than saying hey $brandname before everything, but having both would be better still.