Microsoft VASA tech can create realistic deepfakes using a single photo and one audio track [TechSpot]

April 19, 2024 Peter Pezaris New Relic

The Visual Affective Skills Animator, or VASA, is a machine-learning framework that analyzes a single facial photo and animates it to match a voice track, syncing lip and mouth movements to the audio. It also simulates facial expressions, head movements, and even unseen body movements.
