2306.05240 Dealing with Semantic Underspecification in Multimodal NLP