This repository contains the official code for "FlexAttention for Efficient High-Resolution Vision-Language Models".

torchrun --nproc_per_node 3 scripts/evaluation/eval ...
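The `--nproc_per_node 3` flag in the command above asks `torchrun` to start three worker processes on the local machine, and each worker receives its index and the total worker count through the `RANK` and `WORLD_SIZE` environment variables. The sketch below illustrates one common way an evaluation script can use those variables to split a dataset across workers. It is not taken from this repository: the function name `shard_for_this_process` and the strided split are assumptions made purely for illustration, and the actual evaluation script may divide work differently.

```python
import os


def shard_for_this_process(items, rank=None, world_size=None):
    """Return the subset of `items` that this worker should evaluate.

    Hypothetical helper, not part of the FlexAttention codebase.
    """
    # torchrun exports RANK and WORLD_SIZE to every worker it spawns.
    # The fallbacks let the function run as an ordinary single process too.
    if rank is None:
        rank = int(os.environ.get("RANK", "0"))
    if world_size is None:
        world_size = int(os.environ.get("WORLD_SIZE", "1"))
    # Strided split: worker r takes items r, r + world_size, r + 2 * world_size, ...
    return items[rank::world_size]
```

With three workers, each example is assigned to exactly one process, and shard sizes differ by at most one. Per-worker results would still need to be gathered (for instance, written to per-rank files and merged) before computing a final metric.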