Reproduction of Kwai Keye-VL-2.0 (arxiv 2606.10651): transformers-native multimodal inference PoC for the 30B-A3B MoE model.