Abstract: Vision-Language Models (VLMs) have recently shown promising advancements in sequential decision-making tasks through task-specific fine-tuning. However, common fine-tuning methods, such as ...