Reproduce VibeThinker-3B frontier reasoning claim (arXiv 2606.16140): minimal vLLM eval of the released 3B model on AIME25.