Yu, Y., Eshghi, A., & Lemon, O. (2016). Interactively learning visually grounded word meanings from a human tutor. In Proceedings of the 5th Workshop on Vision and Language (48-53)