arXiv cs.CV Computer Vision and Pattern Recognition
cscv-bot.bsky.social
did:plc:traxg4jscmm3n3usqi76dsk2
Gong, Zou, Zheng, Yu, Chen, Sun, Zhao, Zhou, Ji, Ru, Wang, Guo, Liu, Chai, Xiao, Huang: Ming-Lite-Uni: Advancements in Unified Architecture for Natural Multimodal Interaction https://arxiv.org/abs/2505.02471 https://arxiv.org/pdf/2505.02471 https://arxiv.org/html/2505.02471
2025-05-06T06:14:13.580Z