Research Topic

Design of Learning Systems

This project aims to identify ways of reducing the search space in learning systems as one route to improving the corresponding learning processes. To this end, we study the geometric properties of various connectionist models from machine learning using information geometry and algebraic statistics. Our goal is to find distinguished architectures of learning systems based on their expressive power and learning performance.
This kind of model selection is motivated by experimental and theoretical work on restricted Boltzmann machines and deep belief networks, popular learning systems that increasingly demand a profound mathematical investigation. In particular, this project targets the development of design principles for our embodied AI project.


Selection Criteria for Neuromanifolds of Stochastic Dynamics

In many formal models of neuronal systems, individual neurons are modelled as nodes that receive inputs from other nodes in a network and generate an output which, in general, can be stochastic. In this way, the dynamics of the whole network can be described as a stochastic transition in each time step, mathematically formalized in terms of a stochastic matrix. Well-known models of this kind are Boltzmann machines, their generalizations, and policy matrices in reinforcement learning. In order to study such learning systems, it is helpful to consider not just a single stochastic matrix but a parametrized family of matrices, which forms a geometric object referred to as a neuromanifold within information geometry. Learning crucially depends on the shape of this neuromanifold. The information-geometric view, proposed by Amari, suggests selecting appropriate neuromanifolds and defining the corresponding learning processes as gradient flows on these manifolds. We focus not only on manifolds that are directly induced by a neuronal model, but also study general sets that satisfy natural optimality conditions.
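As a rough numerical sketch of this picture (our illustration, not code from any of the publications listed below), the snippet constructs a parametrized family of row-stochastic matrices via a softmax, estimates the Fisher information metric of this family under a fixed input distribution, and performs one natural gradient step. The function names and the ridge-regularized inverse are illustrative choices of ours, not an API from the cited works.

```python
# Minimal sketch of a "neuromanifold": a parametrized family of
# stochastic matrices p(y|x; theta), its Fisher metric, and one
# natural gradient step.  Illustrative only.
import numpy as np

def softmax_policy(theta):
    """Row-stochastic matrix p(y|x) from a real parameter matrix theta."""
    z = np.exp(theta - theta.max(axis=1, keepdims=True))
    return z / z.sum(axis=1, keepdims=True)

def fisher_matrix(theta, p_x):
    """Fisher information of the conditional model under input law p_x.

    G_ij = sum_x p(x) sum_y p(y|x) d_i log p(y|x) d_j log p(y|x),
    computed here by finite differences of log p for transparency.
    """
    eps = 1e-5
    base = np.log(softmax_policy(theta))
    grads = []
    for k in range(theta.size):
        d = np.zeros_like(theta)
        d.flat[k] = eps
        grads.append((np.log(softmax_policy(theta + d)) - base) / eps)
    grads = np.stack([g.reshape(-1) for g in grads])        # (n_params, |X||Y|)
    w = (p_x[:, None] * softmax_policy(theta)).reshape(-1)  # weights p(x) p(y|x)
    return (grads * w) @ grads.T

def natural_gradient_step(theta, euclidean_grad, p_x, lr=0.1, ridge=1e-8):
    """theta <- theta + lr * G^{-1} grad (ridge-regularized, since G is singular
    for the overparametrized softmax family)."""
    G = fisher_matrix(theta, p_x)
    step = np.linalg.solve(G + ridge * np.eye(G.shape[0]),
                           euclidean_grad.reshape(-1))
    return theta + lr * step.reshape(theta.shape)

# Example: 3 input states, 2 output states, uniform input distribution.
rng = np.random.default_rng(0)
theta = rng.normal(size=(3, 2))
p_x = np.full(3, 1 / 3)
# Euclidean gradient of some objective; here just a placeholder direction.
g = rng.normal(size=(3, 2))
print(softmax_policy(natural_gradient_step(theta, g, p_x)))
```

Any other parametrization of stochastic matrices can be plugged into the same scheme; the family, together with the Fisher metric it inherits, is what the text above calls a neuromanifold.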

Two-dimensional sets containing all deterministic policies

Deterministic or nearly deterministic policies are optimal for a variety of reinforcement learning problems; they represent dynamics with maximal predictive information, as considered in robotics, as well as dynamics of neural networks with maximal network information flow. It is always possible to construct a two-dimensional set that reaches all deterministic policies and on which natural gradient optimization works very efficiently.
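The following toy sketch is our own illustration, not the construction from the cited work, which handles arbitrarily many states and actions with only two parameters. It uses the trivial special case of two states and two actions, where one sigmoid parameter per state already gives a two-dimensional family whose closure contains all four deterministic policies; natural gradient ascent on a made-up reward table then drives the policy toward the optimal deterministic one.

```python
# Toy illustration: a two-parameter policy family for two states and two
# actions whose closure contains all four deterministic policies.
# The reward table is hypothetical and chosen only for this example.
import numpy as np

def policy(theta):
    """pi(a=1 | s) for states s = 0, 1 under parameters theta = (t0, t1)."""
    return 1.0 / (1.0 + np.exp(-theta))

def expected_reward(theta, reward, p_s):
    """J(theta) = sum_s p(s) sum_a pi(a|s) r(s, a)."""
    p1 = policy(theta)
    pi = np.stack([1 - p1, p1], axis=1)   # rows: states, columns: actions
    return float(np.sum(p_s[:, None] * pi * reward))

def natural_gradient_step(theta, reward, p_s, lr=0.5):
    """One natural gradient ascent step on J.

    For this factorized sigmoid policy the Fisher matrix is diagonal with
    entries p(s) * pi(1|s) * (1 - pi(1|s)), so it is inverted entrywise.
    """
    p1 = policy(theta)
    bernoulli_var = p1 * (1 - p1)
    # Euclidean gradient: dJ/dtheta_s = p(s) * var_s * (r(s,1) - r(s,0))
    grad = p_s * bernoulli_var * (reward[:, 1] - reward[:, 0])
    fisher_diag = p_s * bernoulli_var
    return theta + lr * grad / (fisher_diag + 1e-12)

# Hypothetical reward r(s, a): best action is a=1 in state 0, a=0 in state 1.
reward = np.array([[0.0, 1.0],
                   [1.0, 0.0]])
p_s = np.array([0.5, 0.5])               # uniform state distribution
theta = np.zeros(2)                      # start at the uniform policy
print("initial expected reward:", expected_reward(theta, reward, p_s))
for _ in range(20):
    theta = natural_gradient_step(theta, reward, p_s)
print("final policy pi(a=1|s):", policy(theta))                   # approaches (1, 0)
print("final expected reward:", expected_reward(theta, reward, p_s))  # approaches 1
```

In this toy example the inverse Fisher factor cancels the vanishing sigmoid derivative, so the parameters move at a constant rate even as the policy approaches determinism; this is one way to see why natural gradient optimization remains efficient near deterministic policies on such sets.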

inBook
2020 Repository Open Access
Johannes Müller and Marius Zeinhofer

Deep Ritz revisited

In: ICLR 2020 workshop on integration of deep neural models and differential equations : Millennium Hall, Addis Ababa, Ethiopia ; 26th April 2020
[S. L.] : ICLR, 2020.
inJournal
2022 Repository Open Access
Csongor Várady, Riccardo Volpi, Luigi Malagò and Nihat Ay

Natural reweighted wake-sleep

In: Neural networks, 155 (2022), pp. 574-591
MiS Preprint
2020 Repository Open Access
Csongor Várady, Nihat Ay, Riccardo Volpi and Luigi Malagò

Natural Wake-Sleep Algorithm

inJournal
2023 Journal Open Access
Nihat Ay

On the locality of the natural gradient for learning in deep Bayesian networks

In: Information geometry, 6 (2023) 1, pp. 1-49
inBook
2020 Repository Open Access
Johannes Müller

On the space-time expressivity of ResNets

In: ICLR 2020 workshop on integration of deep neural models and differential equations : Millennium Hall, Addis Ababa, Ethiopia ; 26th April 2020
[S. L.] : ICLR, 2020.
inBook
2019 Repository Open Access
Nihat Ay, Johannes Rauh and Guido Montúfar

A continuity result for optimal memoryless planning in POMDPs

In: RLDM 2019 : 4th multidisciplinary conference on reinforcement learning and decision making ; July 7-10, 2019 ; Montréal, Canada
Montréal, Canada : University, 2019. - pp. 362-365
inBook
2019 Repository Open Access
Guido Montúfar, Johannes Rauh and Nihat Ay

Task-agnostic constraining in average reward POMDPs

In: Task-agnostic reinforcement learning : workshop at ICLR, 06 May 2019, New Orleans
[S. L.] : ICLR, 2019.
Preprint
2018 Repository Open Access
Guido Montúfar

Illustration of maxout layer upper bound [Suppl. to: On the number of linear regions of deep neural networks]

inBook
2018 Repository Open Access
Guido Montúfar, Johannes Rauh and Nihat Ay

Uncertainty and stochasticity of optimal policies

In: Proceedings of the 11th workshop on uncertainty processing WUPES '18, June 6-9, 2018 / Václav Kratochvíl (ed.)
Praha : MatfyzPress, 2018. - pp. 133-140
inJournal
2017 Journal Open Access
Guido Montúfar and Jason Morton

Dimension of marginals of Kronecker product models

In: SIAM journal on applied algebra and geometry, 1 (2017) 1, pp. 126-151
inBook
2017 Repository Open Access
Guido Montúfar and Johannes Rauh

Geometry of policy improvement

In: Geometric science of information : Third International Conference, GSI 2017, Paris, France, November 7-9, 2017, proceedings / Frank Nielsen... (eds.)
Cham : Springer, 2017. - pp. 282-290
(Lecture notes in computer science ; 10589)
inBook
2015 Repository Open Access
Guido Montúfar and Johannes Rauh

Hierarchical models as marginals of hierarchical models

In: Proceedings of the 10th workshop on uncertainty processing WUPES '15, Moninec, Czech Republic, September 16-19, 2015 / Václav Kratochvíl (ed.)
Praha : Oeconomica, 2015. - pp. 131-145
inBook
2017 Repository Open Access
Guido Montúfar

Notes on the number of linear regions of deep neural networks

In: 2017 international conference on sampling theory and applications (SampTA) / Gholamreza Anbarjafari... (eds.)
Piscataway, NJ : IEEE, 2017. - pp. 156-159
inJournal
2017 Repository Open Access
Guido Montúfar, Jason Morton and Johannes Rauh

Restricted Boltzmann machines [In: Algebraic statistics ; 16 April - 22 April 2017 ; report no. 20/2017]

In: Oberwolfach reports, 14 (2017) 2, pp. 1241-1242
inBook
2015 Repository Open Access
Guido Montúfar and Johannes Rauh

Mode poset probability polytopes

In: Proceedings of the 10th workshop on uncertainty processing WUPES '15, Moninec, Czech Republic, September 16-19, 2015 / Václav Kratochvíl (ed.)
Praha : Oeconomica, 2015. - pp. 147-154
inBook
2015 Repository Open Access
Guido Montúfar

A comparison of neural network architectures

In: Deep Learning Workshop, ICML '15, Vauban Hall at Lille Grand Palais, France, July 10 and 11, 2015
2015.
inJournal
2015 Journal Open Access
Guido Montúfar, Keyan Ghazi-Zahedi and Nihat Ay

A theory of cheap control in embodied systems

In: PLoS computational biology, 11 (2015) 9, e1004427
inBook
2015 Repository Open Access
Guido Montúfar

Deep narrow Boltzmann machines are universal approximators

In: Third international conference on learning representations - ICLR 2015 : May 7-9, 2015, San Diego, CA, USA
San Diego : ICLR, 2015.
inJournal
2015 Journal Open Access
Guido Montúfar and Jason Morton

Discrete restricted Boltzmann machines

In: Journal of machine learning research, 16 (2015), pp. 653-672
inJournal
2015 Journal Open Access
Nihat Ay

Geometric design principles for brains of embodied agents

In: Künstliche Intelligenz : KI, 29 (2015) 4, pp. 389-399
Preprint
2015 Repository Open Access
Guido Montúfar, Keyan Ghazi-Zahedi and Nihat Ay

Geometry and determinism of optimal stationary control in partially observable Markov decision processes

inJournal
2015 Journal Open Access
Guido Montúfar, Nihat Ay and Keyan Ghazi-Zahedi

Geometry and expressive power of conditional restricted Boltzmann machines

In: Journal of machine learning research, 16 (2015), pp. 2405-2436
inJournal
2017 Repository Open Access
Guido Montúfar and Johannes Rauh

Hierarchical models as marginals of hierarchical models

In: International journal of approximate reasoning, 88 (2017), pp. 531-546
Preprint
2015 Repository Open Access
Guido Montúfar

Universal approximation of Markov kernels by shallow stochastic feedforward networks

inJournal
2015 Repository Open Access
Guido Montúfar and Jason Morton

When does a mixture of products contain a product of mixtures?

In: SIAM journal on discrete mathematics, 29 (2015) 1, pp. 321-347
inBook
2014
Guido Montúfar and Jason Morton

Geometry of hidden-visible products of statistical models

In: Algebraic Statistics 2014 : May 19-22
Chicago, IL : Illinois Institute of Technology, 2014.
inJournal
2014 Journal Open Access
Guido Montúfar, Johannes Rauh and Nihat Ay

On the Fisher metric of conditional probability polytopes

In: Entropy, 16 (2014) 6, pp. 3207-3233
inBook
2014 Repository Open Access
Razvan Pascanu, Guido Montúfar and Yoshua Bengio

On the number of inference regions of deep feed forward networks with piece-wise linear activations

In: Second international conference on learning representations - ICLR 2014 : 14-16 April 2014, Banff, Canada
Banff : ICLR, 2014.
inBook
2014 Repository Open Access
Guido Montúfar, Razvan Pascanu, Kyunghyun Cho and Yoshua Bengio

On the number of linear regions of deep neural networks

In: NIPS 2014 : Proceedings of the 27th international conference on neural information processing systems - volume 2 ; Montreal, Quebec, Canada, December 8th-13th
Cambridge, MA : MIT Press, 2014. - pp. 2924-2932
inJournal
2014 Repository Open Access
Johannes Rauh and Nihat Ay

Robustness, canalyzing functions and systems design

In: Theory in biosciences, 133 (2014) 2, pp. 63-78
inJournal
2014 Journal Open Access
Guido Montúfar and Johannes Rauh

Scaling of model approximation errors and expected entropy distances

In: Kybernetika, 50 (2014) 2, pp. 234-245
inJournal
2014 Repository Open Access
Guido Montúfar

Universal approximation depth and errors of narrow belief networks with discrete units

In: Neural computation, 26 (2014) 7, pp. 1386-1407
inBook
2013 Repository Open Access
Guido Montúfar, Johannes Rauh and Nihat Ay

Maximal information divergence from statistical models defined by neural networks

In: Geometric science of information : first international conference, GSI 2013, Paris, France, August 28-30, 2013. Proceedings / Frank Nielsen... (eds.)
Berlin [u. a.] : Springer, 2013. - pp. 759-766
(Lecture notes in computer science ; 8085)
inJournal
2013 Journal Open Access
Guido Montúfar

Mixture decompositions of exponential families using a decomposition of their sample spaces

In: Kybernetika, 49 (2013) 1, pp. 23-39
inBook
2013 Repository Open Access
Nihat Ay, Guido Montúfar and Johannes Rauh

Selection criteria for neuromanifolds of stochastic dynamics

In: Advances in cognitive neurodynamics III : proceedings of the 3rd International Conference on Cognitive Neurodynamics 2011 ; [June 9-13, 2011, Hilton Niseko Village, Hokkaido, Japan] / Yoko Yamaguchi (ed.)
Dordrecht : Springer, 2013. - pp. 147-154
(Advances in cognitive neurodynamics)
Academic
2012
Guido Montúfar

On the expressive power of discrete mixture models, restricted Boltzmann machines, and deep belief networks - a unified mathematical treatment

Dissertation, Universität Leipzig, 2012
inBook
2012 Repository Open Access
Guido Montúfar and Johannes Rauh

Scaling of model approximation errors and expected entropy distances

In: Proceedings of the 9th workshop on uncertainty processing WUPES '12 : Marianske Lazne, Czech Republic ; 12-15th September 2012
Praha : Academy of Sciences of the Czech Republic / Institute of Information Theory and Automation, 2012. - pp. 137-148
inBook
2011 Repository Open Access
Guido Montúfar, Johannes Rauh and Nihat Ay

Expressive power and approximation errors of restricted Boltzmann machines

In: Advances in neural information processing systems 24 : NIPS 2011 ; 25th annual conference on neural information processing systems 2011, Granada, Spain December 12th - 15th / John Shawe-Taylor (ed.)
La Jolla, CA : Neural Information Processing Systems, 2011. - pp. 415-423
inJournal
2011 Repository Open Access
Guido Montúfar and Nihat Ay

Refinements of universal approximation results for deep belief networks and restricted Boltzmann machines

In: Neural computation, 23 (2011) 5, pp. 1306-1319
inBook
2010 Repository Open Access
Guido Montúfar

Mixture models and representational power of RBM's, DBN's, and DBM's

In: NIPS 2010 : Deep learning and unsupervised feature learning workshop ; December 19, 2010, Hilton, Vancouver, Canada
[s. l.] : NIPS, 2010. - pp. 1-9
Academic
2010 Repository Open Access
Thomas Kahle

On boundaries of statistical models

Dissertation, Universität Leipzig, 2010
inJournal
2006 Journal Open Access
Nihat Ay and Andreas Knauf

Maximizing multi-information

In: Kybernetika, 42 (2006) 5, pp. 517-538
Academic
2001
Nihat Ay

Aspekte einer Theorie pragmatischer Informationsstrukturierung [Aspects of a theory of pragmatic information structuring]

Dissertation, Universität Leipzig, 2001