Microsoft VASA tech can create realistic deepfakes using a single photo and one audio track [TechSpot]
View Article on TechSpot
The Visual Affective Skills Animator, or VASA, is a machine-learning framework that analyzes a facial photo and then animates it to a voice, syncing the lips and mouth movements to the audio. It also simulates facial expressions, head movements, and even unseen body movements.