Re-evaluating the Need for Multimodal Signals in Unsupervised Grammar Induction