AI Thread

Xeno
Member
Joined in 2008

Re: AI Thread
by Xeno » Fri Apr 19, 2024 2:22 pm

https://arstechnica.com/information-tec ... dio-track/

https://www.microsoft.com/en-us/researc ... ct/vasa-1/

On Tuesday, Microsoft Research Asia unveiled VASA-1, an AI model that can create a synchronized animated video of a person talking or singing from a single photo and an existing audio track. In the future, it could power virtual avatars that render locally and don't require video feeds—or allow anyone with similar tools to take a photo of a person found online and make them appear to say whatever they want.

"It paves the way for real-time engagements with lifelike avatars that emulate human conversational behaviors," reads the abstract of the accompanying research paper titled, "VASA-1: Lifelike Audio-Driven Talking Faces Generated in Real Time." It's the work of Sicheng Xu, Guojun Chen, Yu-Xiao Guo, Jiaolong Yang, Chong Li, Zhenyu Zang, Yizhong Zhang, Xin Tong, and Baining Guo.

The VASA framework (short for "Visual Affective Skills Animator") uses machine learning to analyze a static image along with a speech audio clip. It is then able to generate a realistic video with precise facial expressions, head movements, and lip-syncing to the audio. It does not clone or simulate voices (like other Microsoft research) but relies on an existing audio input that could be specially recorded or spoken for a particular purpose.


The new Clippy?

The videos are definitely a little off, but if this is the early stages then things may well be gooseberry fool for a lot of us.

Grumpy David
Member
Joined in 2008
AKA: Cubeamania

Re: AI Thread
by Grumpy David » Mon May 13, 2024 7:29 pm

twitter.com/OpenAI/status/1790072174117613963


Garth
Emeritus
Joined in 2008
Location: Norn Iron

Re: AI Thread
by Garth » Mon May 13, 2024 8:12 pm

Lots of videos up on their YouTube channel too: https://www.youtube.com/@OpenAI/videos

Knoyleo
Member
Joined in 2008

Re: AI Thread
by Knoyleo » Mon May 13, 2024 8:12 pm

How much time and effort has been wasted making sure it talks in quirky Joss Whedon speak?

pjbetman wrote:That's the stupidest thing ive ever read on here i think.
Monkey Man
Member
Joined in 2008

Re: AI Thread
by Monkey Man » Mon May 13, 2024 10:50 pm

twitter.com/tomwarren/status/1790074556981403997


Grumpy David
Member
Joined in 2008
AKA: Cubeamania

Re: AI Thread
by Grumpy David » Mon May 13, 2024 11:13 pm

twitter.com/skirano/status/1790080937361027408




Knoyleo wrote: How much time and effort has been wasted making sure it talks in quirky Joss Whedon speak?


twitter.com/bayeslord/status/1790159728460415331


Ironhide
Fiend
Joined in 2008
Location: Autobot City

Re: AI Thread
by Ironhide » Tue May 14, 2024 3:38 pm

Hideous robotic American voice aside, this is incredible.


rinks
Member
Joined in 2008
Location: Aboard the train that goes around the world

Re: AI Thread
by rinks » Tue May 14, 2024 4:41 pm

Amazing - it made the taxi indicate that it was pulling over before he even signalled that he wanted it to stop.

Outrunner
Member
Joined in 2008

Re: AI Thread
by Outrunner » Sun May 26, 2024 8:24 pm

We had to do a creative output for one of my modules. I wrote the lyrics (very amateur hour-ish) but they're allowing me to use AI to add music to it since I'm only being judged on the lyrics and getting the theme of my essay across via song.

https://www.udio.com/songs/emXgwUJwdeLfazHFXdezkU

Please do not post this in the "No Context" thread
