University of Washington AI technology lets headphone wearers pick specific sounds to hear

University of Washington AI headphone technology allows only specific sounds through. (Image source: Paul G. Allen School at YouTube)

A University of Washington led team has developed AI headphone technology allowing wearers to pick specific sounds to hear while blocking all others. This advanced noise-cancelling targets any sound from animals to machines creating a new kind of sound targeting called semantic hearing.

David Chien, Published 11/15/2023

AI Audio Wearable

A team led by University of Washington (UofW) computer science researchers has created AI software for headphones that allow wearers to select specific sounds to hear. Unlike noise-cancelling headphones that simply filter out everything except voices, the new neural network allows users to select specific sounds such as the chirp of a bird.

Prior headphones such as the Sony INZONE buds (available on Amazon) use DSEE Extreme, Speak-to-Chat, and AI DNN AI technology to improve music and speech quality while automatically letting voices though noise cancelling when conversations begin. The UofW work advances upon this by allowing listeners to pick from 20 different types of sounds to hear, such as birds chirping, ocean, door knock, and toilet flush, while filtering out all else. Called semantic hearing, this allows users to enjoy the chirping of birds at a park without hearing people talk or cars motoring by.

Currently, the UofW app utilizes binaural microphones to capture the real-time position of external sounds before sending filtered sounds to headphones. Because this software runs on smartphones, their app can leverage more powerful CPUs than found in headphones, however, it is only a matter of time before noise-cancelling headphones come with semantic hearing built-in.

UofW AI semantic hearing allows only specifc sounds through such as a knock. (Image source: Paul G. Allen School at YouTube)

UofW AI noise cancelling filters out 20 different types of sounds. (Image source: UofW research article published on ACM)

UofW AI headphone technology uses neural networks to filter sounds. (Image source: Paul G. Allen School at YouTube)

Source(s)

University of Washington, ACM, and Paul G. Allen School (YouTube)

▶ ▼ Press Release

November 9, 2023

New AI noise-canceling headphone technology lets wearers pick which sounds they hear

Stefan Milne

UW News

Most anyone who’s used noise-canceling headphones knows that hearing the right noise at the right time can be vital. Someone might want to erase car horns when working indoors, but not when walking along busy streets. Yet people can’t choose what sounds their headphones cancel.

Now, a team led by researchers at the University of Washington has developed deep-learning algorithms that let users pick which sounds filter through their headphones in real time. The team is calling the system “semantic hearing.” Headphones stream captured audio to a connected smartphone, which cancels all environmental sounds. Either through voice commands or a smartphone app, headphone wearers can select which sounds they want to include from 20 classes, such as sirens, baby cries, speech, vacuum cleaners and bird chirps. Only the selected sounds will be played through the headphones.

The team presented its findings Nov. 1 at UIST ’23 in San Francisco. In the future, the researchers plan to release a commercial version of the system.

“Understanding what a bird sounds like and extracting it from all other sounds in an environment requires real-time intelligence that today’s noise canceling headphones haven’t achieved,” said senior author Shyam Gollakota, a UW professor in the Paul G. Allen School of Computer Science & Engineering. “The challenge is that the sounds headphone wearers hear need to sync with their visual senses. You can’t be hearing someone’s voice two seconds after they talk to you. This means the neural algorithms must process sounds in under a hundredth of a second.”

Because of this time crunch, the semantic hearing system must process sounds on a device such as a connected smartphone, instead of on more robust cloud servers. Additionally, because sounds from different directions arrive in people’s ears at different times, the system must preserve these delays and other spatial cues so people can still meaningfully perceive sounds in their environment.

Tested in environments such as offices, streets and parks, the system was able to extract sirens, bird chirps, alarms and other target sounds, while removing all other real-world noise. When 22 participants rated the system’s audio output for the target sound, they said that on average the quality improved compared to the original recording.In some cases, the system struggled to distinguish between sounds that share many properties, such as vocal music and human speech. The researchers note that training the models on more real-world data might improve these outcomes.

Additional co-authors on the paper were Bandhav Veluri and Malek Itani, both UW doctoral students in the Allen School; Justin Chan, who completed this research as a doctoral student in the Allen School and is now at Carnegie Mellon University; and Takuya Yoshioka, director of research at AssemblyAI.

For more information, contact [email protected].

The Inzone Buds will be available in two colour options. (Image source: @MysteryLupin)

Sony Inzone Buds to launch soon for US$199 as new gaming wireless earbuds with low latency USB adapter 10/10/2023

PlayStation Earbuds will launch soon with noise cancellation and obligatory USB adapter 07/29/2023

Status Audio Between 3 ANC TWS hands-on: Premium earbuds with better noise-cancelling than Apple's AirPods Pro 06/10/2023

The new generation of WH-1000X headphones retails for US$50 more than the WH-1000XM4. (Image source: Sony)

Sony WH-1000XM5: Revised WH-1000X headphones arrive nearing the US$400 mark with better noise cancellation, more microphones and a bulkier design 05/12/2022

No comments for this article

Got questions or something to add to our article? Even without registering you can post in the comments!

No comments for this article / reply

Loading Comments

Comment on this article

New Garmin Fenix 7 and Forerunner 2...

David Chien - News Writer - 3 articles published on Notebookcheck since 2023

Having worked at Activision, UCLA, Anime Expo and more, I've seen technology being used to save lives, create games, and create fantastic 3D VR/AR worlds. There's always something fun in emerging technology that I want to get my hands on and all my friends turn to me to find the best for their needs, so I'm glad to bring my experience to Notebookcheck.

Please share our article, every link counts!

> Notebook / Laptop Reviews and News > News > News Archive > Newsarchive 2023 11 > University of Washington AI technology lets headphone wearers pick specific sounds to hear

David Chien, 2023-11-15 (Update: 2023-11-15)

University of Washington AI technology lets headphone wearers pick specific sounds to hear

Source(s)

Related Articles

No comments for this article