Qwen2.5-Omni Demo

Submit media inputs to generate text and speech responses