This is a heavily interactive web application, and JavaScript is required. Simple HTML interfaces are possible, but that is not what this is.
Post
Ziyang Chen
czyang.bsky.social
did:plc:of7tyzi5arweotwpry5pkzrf
🎥 Introducing MultiFoley, a video-aware audio generation method with multimodal controls! 🔊
We can
⌨️Make a typewriter sound like a piano 🎹
🐱Make a cat meow like a lion roars! 🦁
⏱️Perfectly time existing SFX 💥 to a video.
arXiv: arxiv.org/abs/2411.17698
website: ificl.github.io/MultiFoley/
2024-11-27T02:58:15.759Z