I have an android phone which means I use android auto fairly often. The sheer quality destruction it's experienced since transitioning to Gemini is incredible.
I experience this mostly when asking for music. Before gemini, mistakes were common but deterministic. It was easy to understand where the query had gone wrong and so how to fix it. Example:
"Hey google, play Blackstar"
(Plays the album blackstar by David Bowie, not what I wanted)
"Hey google, play "Blackstar by Radiohead"
(Plays the right thing).
Now:
"Hey Google, play Blackstar by Radiohead" can result in playing... something vaguely semantically related with no way to course correct. In this exact instance (happened yesterday!) it played an album by the hip hop due Black Star.
I will admit that there are some superpowers hidden in Gemini that were not present in the previous AI assistant. I recently discovered that Gemini can manipulate the navigation app, and a prompt like "Mute alerts" works, which is kind of cool. However like OP said, it's incredibly verbose, which is super annoying.
dhruvmittal 36 minutes ago [-]
Before:
"Hey Google, call my wife"
------> Immediately calls wife
With Gemini:
"Hey Google, call my wife"
--50%-> "Is that XXXXXX?"
--25%-> Immediately calls wife
--20%-> "I'm sorry, I couldn't find a contact saved as 'wife'"
--05%-> Immediately calls someone different from my wife
fhdkweig 2 hours ago [-]
I don't really use Google products except Youtube, but their auto-subtitler (closed captions) have really taken a weird turn lately. It used to try sounding out the words so even if it didn't pick the right word, it was at least close to right one. But now it substitutes entire phrases and sentences based on what is most likely to appear in a conversation even if it has no bearing or similarities to what was said.
bsimpson 2 hours ago [-]
There was an Amazon UXR study that floated around ~8 years ago that said the only things people care about from a voice assistant are music, weather, and alarms.
PMs keep trying to make them "smarter," and it just makes the core user journeys worse.
Surely they think they're inventing cars when we're griping about buggy whips. But it really feels like voice assistants peaked ~10y ago for the things people actually want them for.
Xeoncross 2 hours ago [-]
Considering the decade of poor results from doing anything other than music, weather and alarms I think we've all learned to avoid using them for anything else.
esafak 43 minutes ago [-]
Don't forget recipes. They went and bungled up the only good things about it. Every day I suggest Google fire the whole division responsible for the product when it cheerfully says "I'm still learning" and asks me for feedback.
RajT88 53 minutes ago [-]
For a bit, our Amazon Echo's were pretty decent at doing things other than music, weather and alarms. Then there was a big regression, and we stopped trying to make it do more. I guess the exception is turning the lights on and off.
My wife and I both would love for the voice assistants to do more. They just won't. Even with weather, anything more than "what's the weather like today" will usually not get a good result. "When will it rain today?" gets OK responses.
As soon as I figure out how to put a decent GPU into my old rack server(s), I'll see how far I can get with HomeAssistant. I suspect it'll be some effort, but it'll be better at the end of the day.
cfiggers 18 minutes ago [-]
I have a crystal clear memory of the day I stopped trying to make voice assistants work for me.
I use a timer every day to brew my coffee. With a voice assistant I can set a timer, but with the lack of a screen I can't see how much time is left. One day I thought, "I'm going to finally get around to digging into this voice interface and see what the options are," hoping for something like, "Hey Dingus, set a five minute timer and notify me when there's 10 seconds left."
Or better yet, "Hey Dingus, five minute timer, with notify at 10 seconds."
Notice that this almost maps 1:1 onto a shell command with option flags, just verbally interfaced: "$ timer 5m --with-notify 10s"
Notice also a complete willingness on my part to learn how the thing works and change how I'm using it accordingly. This is the opposite of end user laziness. I'm willing to invest effort in becoming a "Power User" of my voice assistant.
So I looked for documentation, ready to read and use my brain to understand it and do what it tells me in order to start and stop my timers with greater proficiency.
...I found none.
Literally, there isn't any. They don't have documentation. Nowhere is there, even for someone motivated and willing to learn, the ability to do so.
Ok, well that is understandable if these things are changing rapidly. Maybe there's the equivalent to a "$ timer --help" baked into the assistant itself. Maybe it can tell me what options exist so I can use them. I ask the assistant to explain itself. "What are my options when setting a timer?"
It can't parse my question. Literally, it doesn't know what I'm asking for. Because nobody ever considered that a user might ask that question or even want to know that information.
At that point, on the spot, I gave up. Clearly this thing is not designed or intended to be a thing that one could gain skill with. It's an utterly unserious product.
I would very happily learn an entire verbal DSL, a whole pidgin dialect of English, purely for interacting with my voice assistant. "Hey Dingus, five minute staged timer: thirty seconds, two minutes, one minute, one minute, remainder, with countdown from five seconds" is not "natural language" anymore. But you can bet you'd hear me saying it, if that's all it took to make the voice assistant run my coffee brewing recipe with nothing but my voice. And then, hey bonus, let me bind that to a personal shortcut so all I need to say is "Hey Dingus, coffee timer" and I don't even need to reach for my phone.
But you can't do that. It literally does not support being taken seriously. Or, if it can, the design is utterly hostile to me discovering how. So I've never even tried to do anything, since that very day, other than turning smart lights on and off.
My point is: I didn't fail the technology. I came to the table with willingness to tinker and experiment, willingness to change my expectations to suit the design as I discovered it, willingness to work to make it a part of my routine. The technology failed me.
jimbokun 2 hours ago [-]
And knowing how old Geena Davis is.
themythfable 2 hours ago [-]
Same thing with Google homes.
Pre-gemini, you knew what you would get, basically the structured snippets that would appear at the top of the search results.
Now it's much more verbose.
My biggest gripe is that it basically stopped listening to me, since "upgrading" to Gemini, which is frustrating because I've used it to control the Hue lights for the past decade.
It listens to my partner though, so after it fails to listen to me, I have to ask her to ask Google to adjust the fing lights.
Welp
jimbokun 2 hours ago [-]
Like being the one who brought the dog home from the shelter and feeds it, and the dog still prefers your partner.
ack210 1 hours ago [-]
Since android auto transitioned to Gemini, whenever I try to reply to a message by tapping the reply button, it now states "Sorry, but to do that, you'll need permission from your Google Workspace administrator." The stupid part is, if I listen to the message first, then say yes when it asks me if I'd like to reply, I'm then allowed to reply. There's countless articles and posts about this but no one seems to have found a reliable solution yet. Has been going on since at least December.
It also now seems to trigger its own barge-in about 50% of the time. It'll start reading the first syllable of a message, apparently confuse itself talking for me saying something, then just follow that with silence "listening" for my response until I physically have to hit the back button on the car.
elAhmo 2 hours ago [-]
Similarly, Siri was arguably better a decade ago when it has deterministic intents that you could trigger with certainty, now it is all seemingly random at time.
RobRivera 2 hours ago [-]
I wonder how much of it is them trying to minmax their revenue generating marketing algos for content discovery. (Willing to bet it's somewhat relative)
jimbokun 2 hours ago [-]
Soon it will be like Jack Black in High Fidelity, preventing you from listening to music that’s currently too hip for you.
2 hours ago [-]
daveshistory 2 hours ago [-]
Perhaps Gemini's been training on sources too young to listen to Radiohead.
2 hours ago [-]
Hugsbox 2 hours ago [-]
I Fired theartofdoingstuff.com for extremely obnoxious ads - particularly the "Would you like to save this stuff?" one that greys out the rest of the article you're trying to read to... checks notes... ask if you want the article emailed to you? Who the hell thought that was a good idea?
9dev 2 hours ago [-]
Why would you even browse the web without using an ad blocker?
omoikane 28 minutes ago [-]
So that you would notice sites that are serving obnoxious ads, and remember not to visit them in the future.
Maybe you are saying "the whole internet is like that now, it's impossible to find good sites without obnoxious ads", but I don't think it's that bad yet (hacker news is a good counterexample). But if everyone keep visiting user hostile sites, the site operators will see no incentive to change.
Hugsbox 2 hours ago [-]
The "ad" as described in my comment isn't really an ad in the typical sense, it's baked into the website. But the real reason is, I'm on my work computer and unable to install browser extensions.
9dev 1 hours ago [-]
You should urgently contact your IT department. Not enforcing ad blocking extensions is a security liability.
Hugsbox 47 minutes ago [-]
I'm in the IT department. Unfortunately massive global corporations routinely make extremely misguided decisions and don't particularly give a shit about what people like me have to say on the matter.
xnx 2 hours ago [-]
No good options on Android right now. You can use Firefox, but there are a lot of tradeoffs.
radiorental 2 hours ago [-]
I'm curious what the tradeoffs are, I exclusively use FF and any time Chrome auto opens via a link I immediately baulk at the experience.
xnx 10 minutes ago [-]
Save page, speed, battery life, bookmark sync, tab sync, and password sync are all slightly better in Chrome.
I used Firefox on Android for probably 11 years before switching to Dolphin (RIP) and then stock Chrome when Firefox made it a huge pain to install extensions. I keep waiting for someone to fully enable extensions on Chrome for Android.
leosanchez 1 hours ago [-]
I use FF exclusively and disabled chrome on my Android phone.
Lucky me :)
sqquima 2 hours ago [-]
And a clearly AI-generated or at least AI-assisted post.
tomasphan 2 hours ago [-]
It’s not clear to me. What makes you sure it’s AI assisted?
shaky-carrousel 2 hours ago [-]
> Google Home used to be one of my favourite gadgets.
> Not because it was smart.
> Because it was useful.
I was half expecting "and that's bold" after that.
deafpolygon 2 hours ago [-]
It might be quietly bold in a world of stunning declarations.
warpfactor 2 hours ago [-]
and load bearing
input_sh 2 hours ago [-]
I promise you that she went through the effort of putting less ads than what Adsense does by default.
intrasight 2 hours ago [-]
It is ironic that so many articles calling out enshittification are themselves examples of the same.
pwython 2 hours ago [-]
I had Claude whip up a local solution for me using Gemma 4 26b-a4b on my Mac and a Raspberry Pi Zero 2 W. It can do web search (valyu), file reading -- I have business & personal context stored in many markdown files -- weather, Apple reminders, and has cross-session memory. Streamline but capable agent that has been pieced together over a few weeks after using Karpathy's LLM wiki pattern, with bits of Hermes logic. Orpheus-TTS streams the spoken reply back with the first word usually landing in half a second. Voice input is openWakeWord for the wake word plus faster-whisper for speech-to-text, all on-device. I can run it straight on the Mac but I use it with the pi satellite and a cheap USB speakerphone (ConfCall MS13B). You can barge in by just talking over it rather than having to say the wake word again. Pretty handy Google Home replacement.
ravetcofx 2 hours ago [-]
Do you have a GitHub or something for the project? Sounds lovely.
pwython 2 hours ago [-]
Not yet, I have to decouple some of the specific/personal logic. When I do push it, it'll be here: https://github.com/alisorcorp
jimbokun 2 hours ago [-]
Sounds like you could have a successful product.
itodd 3 hours ago [-]
> Then I went directly to Amazon and ordered an Alexa.
Dear lord.
psanford 2 hours ago [-]
I got rid of my Alexa devices after they would not shut up. "Alexa what is the weather for today?" "The weather for today will be ... By the way, did you know that you can set alarms? Just say ..."
I do not care about whatever stupid feature you want to build engagement around. Do what I asked you to do and then shut up.
And don't get me started on the Alexa Show that had the audacity to display ads.
jorts 2 hours ago [-]
I threw all my Alexa devices away due to that as well.
goopthink 2 hours ago [-]
> I ordered it because when one superpower fails you, eventually you have to try the other one before they eventually ruin the thing that was working perfectly well before they tried to improve it in order to make enough money to be a trio of planets.
When your options are a few competing BigCos and you don’t have incentive to try to build it for yourself because it straddles the annoying in-between space of “frustrating enough to do something” and “ not frustrating or valuable enough to actually solve the problem.”
reilly3000 2 hours ago [-]
She sounds addicted to toxic relationships.
kwanbix 2 hours ago [-]
I sadly did the same as I didn't know best. The Google speakers started to have issues identifying the orders, so we bought some alexas, which we also use as speakers for our tvs.
Do you have a better recommendation of a smart speaker that can play spotify, youtube music, or tidal?
nativeit 2 hours ago [-]
Well, now when she wants to know the symptoms of dehydration, she can purchase some sort of Gatorade really convenient like.
daveguy 2 hours ago [-]
And how about this:
> The closest I could get to punching Google in the ear and ripping out its nose hairs.
As if giving more money to another shitty megacorp for another frustrating device is going to make any difference to the first shitty megacorp.
krabizzwainch 2 hours ago [-]
I think that this is a really good article for a reason that none of us are mentioning. This is not a tech blog saying this. This is a cooking/gardening/home blog. I use these terms jokingly, but we are nerds on hacker news and this blog writer is a normie. Regular people are getting fed up with how bad google is getting, not just the nerds.
Companies don't care when nerds complain. We always complain. But when their normal user base starts jumping ship, then they could very well start listening.
chymerax 2 hours ago [-]
The whole upgrade thing is stupid. For years I was able to tell my phone in the car: "Play this song on spotify" and it did it. Not any more.
"Ok google! Set a timer for 20 minutes." -> "Sorry Bob, I don't have permission to do that".
"Ok google, navigate home" -> "Navigating to Store@Home24!" (half a country away).
I agree. I wasn't smart but it was useful in certain cases. Now it's just lobotomized.
cmoski 2 hours ago [-]
and it takes sooooo looooong to write a text message with your voice now.
data-ottawa 2 hours ago [-]
> For years I could ask Google what song was playing and it would identify it. It was one of the most useful features it had. You'd hear a song in a store, on television or drifting over from a neighbour's backyard and Google would identify it in seconds.
A huge pet peeve of mine is when I’m in the car and want to know what song is playing on the radio. I run Shazam and my phone mutes the stereo to activate a microphone. I have to disconnect from CarPlay then run Shazam, then reconnect — it’s a passenger only operation.
Song recognition is built into both iOS and Android, the device should always use the internal mic instead of a CarPlay/Android Auto microphone over Bluetooth.
Side note: is there a good “dumb smart speaker” I can have run with a wake word connected to my own API? Speech to Text and Speech to Speech are fairly well supported for local AI workflows now, it would be great to have my own Home device without worrying about where the audio goes.
I’m sure it’s a very niche audience today, but I imagine giving this thing MCP for Wikipedia, a music app, and my recipes would be perfect.
CephalopodMD 10 minutes ago [-]
i really like Gemini Google home. The old software felt lobotomized by comparison.
trilogic 1 hours ago [-]
Google is following facebook, they got an expiration date, born together, expired together, RIP
Gemini made me turn google off completely. There is no provision to not allow your chat data from being shared send for training - even if you pay. Clearly they don’t care about end users and I am guessing they are losing money on paid cunaumer tier as well. But that is not acceptable to me. I just turned off all Google services and finally switched to ddg as the default search engine.
Hard_Space 2 hours ago [-]
This is a chronic LLM style, particularly the multipart dramatic flourishes and the Haiku paragraphs. But at this stage, there's a good chance that LLM re-diffusion of this style is feeding back into human culture. God save us.
hylaride 2 hours ago [-]
The only things these "smart" speakers are good for is timers, adding to lists (especially grocery lists in the kitchen), and playing music. And even then they often stutter...
francisofascii 2 hours ago [-]
Agree, but they are very good at playing music. They are still useful even if the don't do anything else, just like a microwave just needs to cook food.
mrhottakes 2 hours ago [-]
They can't do any of those three things anymore
ihaveone 1 hours ago [-]
FYI; you can give Gemini default preferences which helps some of this. I use the caveman instructions and it applies anywhere I'm logged in, like Android auto. This fixed so many issues.
giancarlostoro 2 hours ago [-]
> Nobody asked for New Coke.
I know nobody did, but seeing as I was too young (and maybe not even alive?) I have always wanted to try it. I'm a Coca Cola enthusiast after all. I wish they'd release a "Throwback Experimental Coke" batch out. I assume it was their attempt to flavor coke without the coca leaves?
Henchman21 1 hours ago [-]
It was a ploy to use a cheaper form of sugar — HFCS — instead of sugar.
fastforwardius 2 hours ago [-]
Good to stop repeating actions making you unhappy, but how about going without assistants alltogether?
From the examples given I haven't seen any meaningful life improvement with them.
netsharc 1 hours ago [-]
"Remind me [at datetime/in x hours] about Y." is very useful for handsfree reminders. I have an alarm clock that reacts to "Hey Google" so I just say "alarm at xyz" to set alarms. With the phone, "Navigate home" saves a few seconds compared to opening Google Maps and hitting the "Home" shortcut.
I'm just waiting for Google to send the email with the title like "An update about Assistant's future" to make the clock e-waste. On the phone I tried Gemini but that's slower because it needs a brain for simple text parsing..
aliasxneo 2 hours ago [-]
I don't really use our Google Home devices, but my wife does. Ever since the "update" I've noticed now that Gemini likes to ask her 2-3 follow up questions after each query. The first might be something like, "Would you like to know more about <X>?" My wife would answer "No" and Gemini would respond like, "Is there anything else you would like to discuss on the subject of <X>?" It's gotten to the point where she will start yelling "cancel" and "stop" to get it to shut up.
delichon 2 hours ago [-]
I talk to chatbots while doing yardwork, often with both hands full. Yesterday I tried to get it to shut up mid conversation and just could not do it with voice only. I had to wait through several minutes of blathering, restarted by every random noise, before I could put the chainsaw down and turn it off.
I was amazed at my own level of anger at that. It was just a voice in my ears but I reacted to it viscerally like it was an assault. It didn't help that it was in the middle of a sequence of it telling one lie after another, like "yes, I can disconnect this conversation." Maybe what I had is a natural reaction to having a lying clueless asshole refuse to go away or shut up, which I haven't otherwise had to deal with lately.
dminik 2 hours ago [-]
This is kind of funny, but very much expected.
The interface into the LLM is tokens in and out (text, images, audio). And the harness generally doesn't understand what you're passing in. The LLM has nothing to do other than to respond with tokens and empty responses (eg. just a stop token) have been aggressively trained out of it.
aNapierkowski 1 hours ago [-]
i know it "has" to respond the way its set up
but it feels like a robot trying to get the last word when youre just trying to stop it
speak_plainly 2 hours ago [-]
The quality of Gemini varies so much throughout the day, the week and the month that it’s hard to rely on it for anything. It feels like Google is trying to throttle costs but has completely missed the mark.
bighead1 2 hours ago [-]
> Nobody asked for cars that require IT support and 3 sub-menus to lower the air conditioning. (Screens are cheaper to install than buttons and knobs.)
perhaps, but people did ask for cheaper cars.
neogodless 2 hours ago [-]
This doesn't really track. Sibling comment pointed out that the average new car transaction price is massively higher than before knobs were replaced with screens.
But more importantly... screens were put in more expensive cars first, and slowly trickled down to budget cars. It's a very weak argument that it was done for cost reasons. Screens are flashy and impress people during their 5 minute test drive. "Wow! Think of all the things I can do in my car that I couldn't do with a knob for changing fan speed." Sure, living with those screens tends to be a bit less enjoyable than those first impressions lead you to believe, but bright colorful animated screens helped to sell cars. If they're actually less expensive than knobs and buttons, that's just a bonus for the manufacturers.
Also keep in mind, when screens first appeared in (expensive) cars, they weren't actually cheaper than the knobs and buttons they replaced. Technology is, sorry was getting cheaper per unit of performance over time. Screens became commonplace and inexpensive to put in cars, but I suspect they were ten times more expensive than all the knobs and levers they replaced when they started appearing in luxury cars.
fastforwardius 2 hours ago [-]
I saw cost savings trickle down to customers only in an economy textbook.
csunbird 2 hours ago [-]
and we ended up with new cars being 2x expensive even after adjusting with inflation somehow
runarberg 2 hours ago [-]
But nobody got cheaper cars. The shareholders on the other hand asked for a third yacht, and they got exactly what they wanted (after the car manufacturer fired the team which used to do usability testing for their interface)
mrhottakes 2 hours ago [-]
Did they? No one buys cheap new cars, at least in the US.
bruki 1 hours ago [-]
I thought OP had learnt something and then I got to the Alexa part and closed the tab.
jimbokun 2 hours ago [-]
Fun article, but the sea of pop ups and advertisements battling to distract you from the text ironically exemplifies the point being made about Google Home.
orn 2 hours ago [-]
And her site spams you with soo many ads, I had to shut it down
loloquwowndueo 2 hours ago [-]
Hey now you know how she paid for all those home assistant gimmicks.
VBprogrammer 2 hours ago [-]
I wondered how long it would be until someone took the obvious step of wiring a voice assistant up to a full blown LLM model. Seems like the thing I failed to consider was whether that was actually a good idea.
adithyassekhar 2 hours ago [-]
> Award winning actress Geena Davis is 70 years old.
Thank you!
rogerrogerr 2 hours ago [-]
This would be great if it wasn't written so breathlessly.
markbnj 2 hours ago [-]
I feel like the author has another disappointment coming. Alexa has been annoying in this way for a long time now. Still very useful as well, but yeah, annoying. It has always been too proactive about suggesting things I don't want or need in response to simple questions like "What is the outside temperature?" ("It's currently 46 degrees, with a low of 32 degrees expected overnight. Did you know you can turn your lights on and off according to a schedule? Would you like to schedule turning some lights on or off?") But recent AI-oriented updates have just made it worse. Now its chatty and full of attitude, and having raised a few teenagers let me tell you, that's what we want more of in our lives. Chat. And attitude.
freediddy 2 hours ago [-]
Sorry but this page and site have the most obnoxious number of ads on almost any page I've ever seen. After reading a couple of paragraphs I closed the window, no content is worth subjecting myself to that many ads and trying to navigate which is real content and which isn't.
cwnyth 2 hours ago [-]
Firefox + ublock origin and voilà, no more ads.
krabizzwainch 2 hours ago [-]
Don't forget a pihole! Sometimes I really do forget how obnoxious the internet is until I try to look at a website when I'm outside of my ad free setup at home. Which in a way keeps me from browsing around websites on my phone and wasting time in that way.
cwnyth 3 minutes ago [-]
ublock Origin works decently well on Firefox on Android. I think I saw that something worked on iPhones, but I never investigated it myself.
mrhottakes 2 hours ago [-]
I can't fathom how anyone uses the web without an ad blocker these days. Especially shocking to hear on HN.
deafpolygon 2 hours ago [-]
“An adblocker is preventing this page from loading.”
No…, you are.
fiatjaf 2 hours ago [-]
Apparently you haven't fired Cloudflare, since this website forces me to go through a captcha in order to read (and I'm not even using a VPN or Tor).
trimethylpurine 2 hours ago [-]
Hey Google, write an article called "I Fired Google."
(Although, based on the tone, I think it's Grok.)
complianceowll 3 hours ago [-]
This was so spot on I felt like standing up and yelling, "Yes! Exactly!"
ncr100 2 hours ago [-]
I have a similar experience.
Having used Google home assistant since it came out for all the things that it's good for, and watched its quality fluctuate, I find I really have to be more careful nowadays when I ask it questions because it can go overboard more easily.
Is there a term in AI research where it underappreciates the specificity that is being asked for?
Perhaps the AI could be default prompted, "you are a kitchen AI assistant and should tend to answer in facts and details and that are relevant to the current moment."
nyeah 2 hours ago [-]
They've spent a lot of money on this technology, and what users want is just not that high a priority.
2 hours ago [-]
pipeline_peak 2 hours ago [-]
Never understood why anyone would use Google Home or Amazon Echo when we all have phones.
A DSLR camera is better than a phones camera, a voice assisting device seems replaceable however.
advisedwang 48 minutes ago [-]
Basically
a) because some of them have semi-decent speakers for music
b) because it's nice not to have to pull out your phone (especially while cooking)
basisword 2 hours ago [-]
I don't understand how anyone bothers with these things. Other than timers in the kitchen (where hands might be greasy) I've always found it faster to just pull out my phone and Google it.
P.S. I hope that dehydration/headaches question was a poorly chosen example and not something someone over the age of 5 seriously needs an answer to.
They rolled it back though afaik because the whole idea was a comically evil idea.
gostsamo 2 hours ago [-]
BMW actually piloted such a program, but the outcry was brutal.
2 hours ago [-]
mrhottakes 2 hours ago [-]
It's not a false claim, it's a true claim
krabizzwainch 2 hours ago [-]
I stopped reading your false claim about their claim being false.
philipallstar 2 hours ago [-]
> I don't need a four-minute explanation. I don't need context. I don't need a balanced discussion that considers multiple viewpoints and concludes with a summary.
Karen, you mean you don't want those things. Stop confusing want and need.
- The Manager.
Rendered at 17:14:48 GMT+0000 (Coordinated Universal Time) with Vercel.
I experience this mostly when asking for music. Before gemini, mistakes were common but deterministic. It was easy to understand where the query had gone wrong and so how to fix it. Example:
"Hey google, play Blackstar"
(Plays the album blackstar by David Bowie, not what I wanted)
"Hey google, play "Blackstar by Radiohead"
(Plays the right thing).
Now:
"Hey Google, play Blackstar by Radiohead" can result in playing... something vaguely semantically related with no way to course correct. In this exact instance (happened yesterday!) it played an album by the hip hop due Black Star.
I will admit that there are some superpowers hidden in Gemini that were not present in the previous AI assistant. I recently discovered that Gemini can manipulate the navigation app, and a prompt like "Mute alerts" works, which is kind of cool. However like OP said, it's incredibly verbose, which is super annoying.
PMs keep trying to make them "smarter," and it just makes the core user journeys worse.
Surely they think they're inventing cars when we're griping about buggy whips. But it really feels like voice assistants peaked ~10y ago for the things people actually want them for.
My wife and I both would love for the voice assistants to do more. They just won't. Even with weather, anything more than "what's the weather like today" will usually not get a good result. "When will it rain today?" gets OK responses.
As soon as I figure out how to put a decent GPU into my old rack server(s), I'll see how far I can get with HomeAssistant. I suspect it'll be some effort, but it'll be better at the end of the day.
I use a timer every day to brew my coffee. With a voice assistant I can set a timer, but with the lack of a screen I can't see how much time is left. One day I thought, "I'm going to finally get around to digging into this voice interface and see what the options are," hoping for something like, "Hey Dingus, set a five minute timer and notify me when there's 10 seconds left."
Or better yet, "Hey Dingus, five minute timer, with notify at 10 seconds."
Notice that this almost maps 1:1 onto a shell command with option flags, just verbally interfaced: "$ timer 5m --with-notify 10s"
Notice also a complete willingness on my part to learn how the thing works and change how I'm using it accordingly. This is the opposite of end user laziness. I'm willing to invest effort in becoming a "Power User" of my voice assistant.
So I looked for documentation, ready to read and use my brain to understand it and do what it tells me in order to start and stop my timers with greater proficiency.
...I found none.
Literally, there isn't any. They don't have documentation. Nowhere is there, even for someone motivated and willing to learn, the ability to do so.
Ok, well that is understandable if these things are changing rapidly. Maybe there's the equivalent to a "$ timer --help" baked into the assistant itself. Maybe it can tell me what options exist so I can use them. I ask the assistant to explain itself. "What are my options when setting a timer?"
It can't parse my question. Literally, it doesn't know what I'm asking for. Because nobody ever considered that a user might ask that question or even want to know that information.
At that point, on the spot, I gave up. Clearly this thing is not designed or intended to be a thing that one could gain skill with. It's an utterly unserious product.
I would very happily learn an entire verbal DSL, a whole pidgin dialect of English, purely for interacting with my voice assistant. "Hey Dingus, five minute staged timer: thirty seconds, two minutes, one minute, one minute, remainder, with countdown from five seconds" is not "natural language" anymore. But you can bet you'd hear me saying it, if that's all it took to make the voice assistant run my coffee brewing recipe with nothing but my voice. And then, hey bonus, let me bind that to a personal shortcut so all I need to say is "Hey Dingus, coffee timer" and I don't even need to reach for my phone.
But you can't do that. It literally does not support being taken seriously. Or, if it can, the design is utterly hostile to me discovering how. So I've never even tried to do anything, since that very day, other than turning smart lights on and off.
My point is: I didn't fail the technology. I came to the table with willingness to tinker and experiment, willingness to change my expectations to suit the design as I discovered it, willingness to work to make it a part of my routine. The technology failed me.
Pre-gemini, you knew what you would get, basically the structured snippets that would appear at the top of the search results.
Now it's much more verbose.
My biggest gripe is that it basically stopped listening to me, since "upgrading" to Gemini, which is frustrating because I've used it to control the Hue lights for the past decade.
It listens to my partner though, so after it fails to listen to me, I have to ask her to ask Google to adjust the fing lights.
Welp
It also now seems to trigger its own barge-in about 50% of the time. It'll start reading the first syllable of a message, apparently confuse itself talking for me saying something, then just follow that with silence "listening" for my response until I physically have to hit the back button on the car.
Maybe you are saying "the whole internet is like that now, it's impossible to find good sites without obnoxious ads", but I don't think it's that bad yet (hacker news is a good counterexample). But if everyone keep visiting user hostile sites, the site operators will see no incentive to change.
I used Firefox on Android for probably 11 years before switching to Dolphin (RIP) and then stock Chrome when Firefox made it a huge pain to install extensions. I keep waiting for someone to fully enable extensions on Chrome for Android.
Lucky me :)
> Not because it was smart.
> Because it was useful.
I was half expecting "and that's bold" after that.
Dear lord.
I do not care about whatever stupid feature you want to build engagement around. Do what I asked you to do and then shut up.
And don't get me started on the Alexa Show that had the audacity to display ads.
When your options are a few competing BigCos and you don’t have incentive to try to build it for yourself because it straddles the annoying in-between space of “frustrating enough to do something” and “ not frustrating or valuable enough to actually solve the problem.”
Do you have a better recommendation of a smart speaker that can play spotify, youtube music, or tidal?
> The closest I could get to punching Google in the ear and ripping out its nose hairs.
As if giving more money to another shitty megacorp for another frustrating device is going to make any difference to the first shitty megacorp.
Companies don't care when nerds complain. We always complain. But when their normal user base starts jumping ship, then they could very well start listening.
I agree. I wasn't smart but it was useful in certain cases. Now it's just lobotomized.
A huge pet peeve of mine is when I’m in the car and want to know what song is playing on the radio. I run Shazam and my phone mutes the stereo to activate a microphone. I have to disconnect from CarPlay then run Shazam, then reconnect — it’s a passenger only operation.
Song recognition is built into both iOS and Android, the device should always use the internal mic instead of a CarPlay/Android Auto microphone over Bluetooth.
Side note: is there a good “dumb smart speaker” I can have run with a wake word connected to my own API? Speech to Text and Speech to Speech are fairly well supported for local AI workflows now, it would be great to have my own Home device without worrying about where the audio goes.
I’m sure it’s a very niche audience today, but I imagine giving this thing MCP for Wikipedia, a music app, and my recipes would be perfect.
https://futurism.com/artificial-intelligence/google-ceo-sund...
I know nobody did, but seeing as I was too young (and maybe not even alive?) I have always wanted to try it. I'm a Coca Cola enthusiast after all. I wish they'd release a "Throwback Experimental Coke" batch out. I assume it was their attempt to flavor coke without the coca leaves?
From the examples given I haven't seen any meaningful life improvement with them.
I'm just waiting for Google to send the email with the title like "An update about Assistant's future" to make the clock e-waste. On the phone I tried Gemini but that's slower because it needs a brain for simple text parsing..
I was amazed at my own level of anger at that. It was just a voice in my ears but I reacted to it viscerally like it was an assault. It didn't help that it was in the middle of a sequence of it telling one lie after another, like "yes, I can disconnect this conversation." Maybe what I had is a natural reaction to having a lying clueless asshole refuse to go away or shut up, which I haven't otherwise had to deal with lately.
The interface into the LLM is tokens in and out (text, images, audio). And the harness generally doesn't understand what you're passing in. The LLM has nothing to do other than to respond with tokens and empty responses (eg. just a stop token) have been aggressively trained out of it.
perhaps, but people did ask for cheaper cars.
But more importantly... screens were put in more expensive cars first, and slowly trickled down to budget cars. It's a very weak argument that it was done for cost reasons. Screens are flashy and impress people during their 5 minute test drive. "Wow! Think of all the things I can do in my car that I couldn't do with a knob for changing fan speed." Sure, living with those screens tends to be a bit less enjoyable than those first impressions lead you to believe, but bright colorful animated screens helped to sell cars. If they're actually less expensive than knobs and buttons, that's just a bonus for the manufacturers.
Also keep in mind, when screens first appeared in (expensive) cars, they weren't actually cheaper than the knobs and buttons they replaced. Technology is, sorry was getting cheaper per unit of performance over time. Screens became commonplace and inexpensive to put in cars, but I suspect they were ten times more expensive than all the knobs and levers they replaced when they started appearing in luxury cars.
Thank you!
No…, you are.
(Although, based on the tone, I think it's Grok.)
Having used Google home assistant since it came out for all the things that it's good for, and watched its quality fluctuate, I find I really have to be more careful nowadays when I ask it questions because it can go overboard more easily.
Is there a term in AI research where it underappreciates the specificity that is being asked for?
Perhaps the AI could be default prompted, "you are a kitchen AI assistant and should tend to answer in facts and details and that are relevant to the current moment."
A DSLR camera is better than a phones camera, a voice assisting device seems replaceable however.
a) because some of them have semi-decent speakers for music
b) because it's nice not to have to pull out your phone (especially while cooking)
P.S. I hope that dehydration/headaches question was a poorly chosen example and not something someone over the age of 5 seriously needs an answer to.
Related from the last month:
Google changes its search box
https://news.ycombinator.com/item?id=48197370
Google Declaring War on the Web
https://news.ycombinator.com/item?id=48214449
Search engines alternatives now that Google isn't Google anymore
https://news.ycombinator.com/item?id=48266051
Google Hates You
https://news.ycombinator.com/item?id=48313538
You can no longer Google the word 'disregard'
https://news.ycombinator.com/item?id=48238351
The IBM-ification of Google?
https://news.ycombinator.com/item?id=48230049
https://www.thedrive.com/news/bmw-commits-to-subscriptions-e...
https://www.theverge.com/2022/7/12/23204950/bmw-subscription...
They rolled it back though afaik because the whole idea was a comically evil idea.
Karen, you mean you don't want those things. Stop confusing want and need.
- The Manager.