Hierarchical Attention Model

Applications such as human-robot interaction (HRI) and autonomous driving requires the agent to focus at high resolution on subtle visual cues for achieving proficiency. Existing vision systems have hardware limitations to achieve such high resolution and perform poorly in real-world environments. Human vision overcomes this problem by relying on a foveal view coupled with an…