Page 32 of 32

Re: AI Thread

Posted: Fri Apr 19, 2024 2:22 pm
by Xeno
https://arstechnica.com/information-tec ... dio-track/

https://www.microsoft.com/en-us/researc ... ct/vasa-1/

On Tuesday, Microsoft Research Asia unveiled VASA-1, an AI model that can create a synchronized animated video of a person talking or singing from a single photo and an existing audio track. In the future, it could power virtual avatars that render locally and don't require video feeds—or allow anyone with similar tools to take a photo of a person found online and make them appear to say whatever they want.

"It paves the way for real-time engagements with lifelike avatars that emulate human conversational behaviors," reads the abstract of the accompanying research paper titled, "VASA-1: Lifelike Audio-Driven Talking Faces Generated in Real Time." It's the work of Sicheng Xu, Guojun Chen, Yu-Xiao Guo, Jiaolong Yang, Chong Li, Zhenyu Zang, Yizhong Zhang, Xin Tong, and Baining Guo.

The VASA framework (short for "Visual Affective Skills Animator") uses machine learning to analyze a static image along with a speech audio clip. It is then able to generate a realistic video with precise facial expressions, head movements, and lip-syncing to the audio. It does not clone or simulate voices (like other Microsoft research) but relies on an existing audio input that could be specially recorded or spoken for a particular purpose.


The new Clippy?

The videos are definitely a little off but if this is the early stages then things may well be gooseberry fool for a lot of us.

Re: AI Thread

Posted: Mon May 13, 2024 7:29 pm
by Grumpy David

twitter.com/OpenAI/status/1790072174117613963


Re: AI Thread

Posted: Mon May 13, 2024 8:12 pm
by Garth
Lots of videos up on their YouTube channel too: https://www.youtube.com/@OpenAI/videos

Re: AI Thread

Posted: Mon May 13, 2024 8:12 pm
by Knoyleo
How much time and effort has been wasted making sure it talks in quirky Joss Whedon speak?

Re: AI Thread

Posted: Mon May 13, 2024 10:50 pm
by Monkey Man

twitter.com/tomwarren/status/1790074556981403997


Re: AI Thread

Posted: Mon May 13, 2024 11:13 pm
by Grumpy David

twitter.com/skirano/status/1790080937361027408




Knoyleo wrote:How much time and effort has been wasted making sure it talks in quirky Joss Whedon speak?


twitter.com/bayeslord/status/1790159728460415331


Re: AI Thread

Posted: Tue May 14, 2024 3:38 pm
by Ironhide
Hideous robotic American voice aside, this is incredible.


Re: AI Thread

Posted: Tue May 14, 2024 4:41 pm
by rinks
Amazing - it made the taxi indicate that it was pulling over before he even signalled that he wanted it to stop.