Optimizing Multimodal Large Language Models for Scientific VQA through Caption-Aware Supervised Training

Published in AAAI-2025 AI4EDU Workshop, 2024