Scale Is All You Need for Lip-Sync?
Alibaba’s EMO and Microsoft’s VASA-1 are crazy good. Let’s break down how they work.
Jack Saunders · Published in Towards Data Science · 11 min read · 9 hours ago
It’s no secret that the pace of AI research is accelerating. One of the biggest trends of the past couple of years has been using transformers to exploit huge-scale datasets. It looks like this trend has finally reached the field of lip-sync