Optimizing Multimodal Large Language Models for Scientific VQA through Caption-Aware Supervised Training

Published in AAAI-2025 AI4EDU Workshop, 2024
