Search

Workshop

Information flow in sensation & action and the emergence of [reverse] hierarchies

  • Naftali Tishby (The Hebrew University, Jerusalem, Israel)
E1 05 (Leibniz-Saal)

Abstract

Starting form Large Deviation Theory (Sanov's theorem) we can obtain the connection between the reward rate and the control and sensing information capacities, for systems in "metabolic information equilibrium" with stationary stochastic environments (Tishby & Polani, 2010). This result can be considered as an equilibrium characterisation for systems that achieved a certain value through interactions with the environment, but have no new learning (e.g. "stupid" cleaning robots). The affect of learning can be considered by revisiting the sub-extensivity of predictive information in stationary environments (Bialek, Nemenman & Tishby 2002) and combining it with the requirement of computational tractability of planning. We argue that planning is possible if the information flow terms remain proportional to the reward terms on the one hand, but still bounded by the sub-extensive predictive information on the other hand.

I will discuss the possible implications of this new computational principle to the emergence of hierarchical representations and discounting of rewards in our generalised Bellman equation.

Links

Antje Vandenberg

Max Planck Institute for Mathematics in the Sciences Contact via Mail

Nihat Ay

Max Planck Institute for Mathematics in the Sciences

Ralf Der

Max Planck Institute for Mathematics in the Sciences

Keyan Ghazi-Zahedi

Max Planck Institute for Mathematics in the Sciences

Georg Martius

Max Planck Institute for Mathematics in the Sciences