Abstract: To effectively reduce the visual tokens in Visual Large Language Models (VLLMs), we propose a novel approach called Wi ndow Token Co ncatenation (WiCo). Specifically, we employ a sliding ...