this post was submitted on 24 May 2024

161 points (98.8% liked)

Technology

34891 readers

203 users here now

This is the official technology community of Lemmy.ml for all news related to creation and use of technology, and to facilitate civil, meaningful discussion around it.

Ask in DM before posting product reviews or ads. All such posts otherwise are subject to removal.

Rules:

1: All Lemmy rules apply

2: Do not post low effort posts

3: NEVER post naziped*gore stuff

4: Always post article URLs or their archived version URLs as sources, NOT screenshots. Help the blind users.

5: personal rants of Big Tech CEOs like Elon Musk are unwelcome (does not include posts about their companies affecting wide range of people)

6: no advertisement posts unless verified as legitimate and non-exploitative/non-consumerist

7: crypto related posts, unless essential, are disallowed

founded 5 years ago

MODERATORS

MinutePhrase@lemmy.ml

161

‘Selective hearing’ headphones: Hear clearly in a crowd with one look (newatlas.com)

submitted 5 months ago by floofloof@lemmy.ca to c/technology@lemmy.ml

16 comments fedilink hide all child comments

all 17 comments

sorted by: hot top controversial new old

[–] Gerudo@lemm.ee 49 points 5 months ago (1 children)

Damn this could help me every day. i have trouble understanding people if there is background noise.

[–] zero_spelled_with_an_ecks@programming.dev 5 points 5 months ago (1 children)

Me, too. I carry earplugs and that helps a little in crowded places. Communicating with shower buddies is always tough, though. Especially when there's a crowd.

[–] WeirdGoesPro@lemmy.dbzer0.com 10 points 5 months ago (1 children)

How many people are typically in your shower?

[–] zero_spelled_with_an_ecks@programming.dev 2 points 5 months ago

Two to four.

[–] onlinepersona@programming.dev 32 points 5 months ago (1 children)

Amazing. And they even made it opensource! I'm amazed at how readable it is, even though I don't get most of it. Code written by people with 20 years of C experience looks leagues worse than what this repo looks like. Bravo!

Anti Commercial-AI license

[–] thejevans@lemmy.ml 31 points 5 months ago* (last edited 5 months ago) (1 children)

That's a non-commercial license. It's not open-source, just source-available.

https://github.com/vb000/LookOnceToHear/blob/main/LICENSE

[–] corsicanguppy@lemmy.ca 6 points 5 months ago

Thank you for not jamming the 'open' and 'source' together like a schmuck.

[–] saigot@lemmy.ca 15 points 5 months ago* (last edited 5 months ago) (1 children)

If this application is legit it's going to get snatched up by Apple/Amazon/Google to make their voice assistants better, right now they can't handle cross talk at all.

[–] BlameThePeacock@lemmy.ca 7 points 5 months ago

This particular implementation doesn't really apply to those situations, there are already existing technologies which can pre-train on specific voices they could be using for that since the target is known. The main "improvement" from this system is that you can train it on any target subject, even with background noise, in only a few seconds.

It's most useful in scenarios they've outlined in their study, like using it with your friend you ran into on the bus, your tour guide, etc.

[–] morriscox@lemmy.world 6 points 5 months ago

The CIA thanks you for your service.

[–] MHLoppy@fedia.io 4 points 5 months ago

Cool idea, though I was surprised by the level of fidelity loss in the fountain example. I would've expected that to be a good case scenario for noise cancellation so maybe it just needs some more time to iterate and improve on its level of "false positive" removal.

[–] lemmeout@lemm.ee 3 points 5 months ago (2 children)

The video fails to explain what about this is "AI" as opposed to active noise cancelling with some regular old signal processing.

[–] floofloof@lemmy.ca 10 points 5 months ago (1 children)

I think once it has taken a profile of the voice it no longer requires you to be facing the person because it can now recognize that voice among the noise. The AI but is taking an imprint of the voice and then extracting it.

[–] SkybreakerEngineer@lemmy.world 4 points 5 months ago (1 children)

So, some tracking layered on top of basic beamforming

[–] Numenor@lemmy.world 7 points 5 months ago

I legume so

[–] blindsight@beehaw.org 5 points 5 months ago

To add to what the other poster said:

I'm not an expert, but my understanding is that noise cancellation works by inverting sounds waves to deaden the sound. So, like, if you add sin(x) and –sin(x) you get 0.

This system is actively adding inverted sound waves to cancel most sounds. What makes this system unique is that it samples the voice and uses the unique "voice print" to selectively not invert the sound waves from the targeted voice.

Or that's what I'm getting from reading this, as a layman.