MIT Uses Video to Train Machine Vision System


Researchers at MIT’s Computer Science and Artificial Intelligence Laboratory reported this week that they have developed a deep learning algorithm that could help machines using predictive vision anticipate human interactions. The approach uses unlabeled YouTube videos as its source material, training deep networks to predict what people in a scene will do next.

In a paper titled “Anticipating Visual Representations from Unlabeled Video,” the researchers said they applied recognition algorithms to the trained network’s predictions to forecast future actions. That predictive capability could be used in applications such as training robots to understand that a greeting in the form of a wave could lead to a handshake or an embrace.

