Re-evaluating the Need for Visual Signals in Unsupervised Grammar Induction