Inlay

A great collaboration with W. Menapace, A. Siarohin, I. Skorokhodov, A. Canberk, K. S. Lee, V. Ordonez, and S. Tulyakov. Please repost to support our work and check out our arXiv preprint and webpage.
arXiv: arxiv.org/abs/2412.15191
Webpage: snap-research.github.io/AVLink/

We propose AV-Link, a unified framework for Video-to-Audio and Audio-to-Video generation that leverages the activations of frozen video and audio diffusion models for temporally aligned cross-modal co...