Adobe is developing an audio app that literally lets you put words into someone's mouth

Shawn Knight · Nov 6, 2016

Adobe at its recent MAX 2016 conference for creative professionals demonstrated an experimental technology that’s sure to stir up its fair share of controversy.

Known internally as VoCo and currently in development with Princeton University, the technology in question can be best described as Photoshop for audio. As Adobe developer Zeyu Jin showcases in the clip above, you can rearrange the order of spoken words and literally put words in someone’s mouth to make it sound as if they said something that never actually happened.

The current iteration of the technology needs roughly 20 minutes of dialog from a person in order to recreate their voice. Given the proliferation of audiobooks, podcasts, vlogs and so on, finding enough material to feed the program – especially if the target is a celebrity, public figure or social media influencer – would be trivial.

It’s easy to imagine how technology of this nature could be used in all sorts of unethical or nefarious manners but according to Jin, the development team has researched how to prevent forgery (likening it to watermarking for images).

As for legitimate uses, Adobe said in a companion blog post that when doing voiceover, dialogue and narration work, it’d be nice to have the option to edit or insert a few words without the hassle of recreating the recording environment or bringing the artist back in for a follow-up session.

Adobe hasn’t yet said when or even if the technology will one day make its way into a consumer-facing product. If it does, however, we’ll have to condition ourselves to be skeptical of any audio we listen to (just as we do today with images thanks to Photoshop).

Permalink to story.

https://www.techspot.com/news/66941-adobe-developing-audio-app-literally-you-put-words.html

Uncle Al · Nov 6, 2016

Interesting and yet another way to allow criminals to successfully fool those that are not tech savvy .....

Kenrick · Nov 6, 2016

Better fix flash first before releasing another crime tool.

Evernessince · Nov 7, 2016

This is one of those tools that has far more bad uses than good.

cliffordcooley · Nov 7, 2016

I'm sure we will all continue to believe everything we hear. This will change nothing because naturally we want to believe lies!

fktech · Nov 7, 2016

This isn't new.

mbrowne5061 · Nov 7, 2016

fktech said:
This isn't new.

Cutting in word someone's already spoken? No.
But this takes words they never said, generates how it would have sounded had they said it, and then cuts it in.

Greg S · Nov 7, 2016

This is pretty cool and I hope that VoCo becomes apart of Soundbooth/Audition

Moneyd623 · Nov 7, 2016

Cool, maybe now we'll be able to preserve the best story telling voices and use them for as long as we'd like. Would be cool if I could set it up to have someone like Morgan Freeman be the voice for all my audio books or something like that.

rvnwlfdroid · Nov 7, 2016

Moneyd623 said:
Cool, maybe now we'll be able to preserve the best story telling voices and use them for as long as we'd like. Would be cool if I could set it up to have someone like Morgan Freeman be the voice for all my audio books or something like that.

I'd pay to be able to do that. I can't wait to hear a longer generated clip.

Igrecman · Nov 8, 2016

The watermark to be effective would need to be some kind of frequency the human ear can't hear all over the modified words. If it's something else, I guess it would be easy to bypass by re-recording the result with Audacity.

Adobe is developing an audio app that literally lets you put words into someone's mouth

Shawn Knight

Posts: 15,284 +192

Uncle Al

Posts: 10,154 +9,634

Kenrick

Posts: 631 +401

Evernessince

Posts: 5,469 +6,158

cliffordcooley

Posts: 13,141 +6,441

fktech

Posts: 542 +147

mbrowne5061

Posts: 2,157 +1,362

Greg S

Posts: 1,607 +442

Moneyd623

Posts: 19 +5

rvnwlfdroid

Posts: 193 +49

Igrecman

Posts: 300 +180

Similar threads

Latest posts