Adding support for Qwen’s Vision-Language (VL) model could be very useful for our workflows. It would allow us to parse images and documents more effectively and could even help with image compression tasks.
1 Like