This repository accompanies our study on how to teach Large Multimodal Models (LMMs) new skills without degrading existing capabilities. We analyze where to tune (vision encoder, projector, LLM) and, ...
Some results have been hidden because they may be inaccessible to you
Show inaccessible results