good regulator theorem

{{Short description|Theorem in cybernetics}}

The good regulator theorem is a theorem conceived by Roger C. Conant and W. Ross Ashby that is central to cybernetics. It was originally stated as "every good regulator of a system must be a model of that system".R. C. Conant and W. R. Ashby, "[http://pespmc1.vub.ac.be/books/Conant_Ashby.pdf Every good regulator of a system must be a model of that system]", Int. J. Systems Sci., 1970, vol 1, No 2, pp. 89–97 That is, any regulator that is maximally simple among optimal regulators must behave as an image of that system under a homomorphism.

More accurately, every good regulator must contain or have access to a model of the system it regulates. And while the authors sometimes say the regulator and regulated are 'isomorphic', the mapping they construct is only a homomorphism, meaning the model can lose information about the entity that is modeled. So, while the system that is regulated is a pattern of behavior in the world, it is not necessarily the only pattern of behavior observable in a regulated entity.

Theorem

This theorem is obtained by considering the entropy of the variation of the output of the controlled system, and shows that, under very general conditions, that the entropy is minimized when there is a (deterministic) mapping h:S\to R from the states of the system to the states of the regulator. The authors view this map h as making the regulator a 'model' of the system.

With regard to the brain, insofar as it is successful and efficient as a regulator for survival, it must proceed, in learning, by the formation of a model (or models) of its environment.

The theorem is general enough to apply to all regulating and self-regulating or homeostatic systems.

File:Good regulator.png

Five variables are defined by the authors as involved in the process of system regulation. D as primary disturbers, R as a set of events in the regulator, S as a set of events in the rest of the system outside of the regulator, Z as the total set of events (or outcomes) that may occur, G as the subset of Z events (or outcomes) that are desirable to the system.

The principal point that the authors present with this figure is that regulation requires of the regulator to conceive of all variables as it regards the set S of events concerning the system to be regulated in order to render in satisfactory outcomes G of this regulation. If the regulator is instead not able to conceive of all variables in the set S of events concerning the system that exist outside of the regulator, then the set R of events in the regulator may fail to account for the total variable disturbances D which in turn may cause errors that lead to outcomes that are not satisfactory to the system (as illustrated by the events in the set Z that are not elements in the set G).

The theorem does not explain what it takes for the system to become a good regulator. Moreover, although highly cited, some concerns have been raised that the formal proof does not actually fully support the statement in the paper title.{{cite web |last1=Baez |first1=John |date=27 January 2016 |title=The Internal Model Principle |url=https://johncarlosbaez.wordpress.com/2016/01/27/the-good-regulator-theorem/ |url-status= live|archive-url=https://web.archive.org/web/20231005204511/https://johncarlosbaez.wordpress.com/2016/01/27/the-good-regulator-theorem/ |archive-date=5 October 2023 |access-date=6 June 2024 |website=Azimuth |language=en}}

Related theorems

In cybernetics, the problem of creating ethical regulators is addressed by the ethical regulator theorem.M. Ashby, [https://www.mdpi.com/2079-8954/8/4/53 "Ethical Regulators and Super-Ethical Systems"]. Systems, 2020; 8(4):53. The construction of ethical regulators is a general problem for any system (e.g., an automated information system) that regulates some domain of application.

When restricted to the ordinary differential equation (ODE) subset of control theory, it is referred to as the internal model principle, which was first articulated in 1976 by B. A. Francis and W. M. Wonham.B. A. Francis and W. M. Wonham, "[https://www.researchgate.net/profile/Walter_Wonham/publication/223996193_The_internal_model_principle_of_control_theory/links/5488b5610cf289302e30b828.pdf The internal model principle of control theory]", Automatica 12 (1976) 457–465. In this form, it stands in contrast to classical control, in that the classical feedback loop fails to explicitly model the controlled system (although the classical controller may contain an implicit model).Jan Swevers, "[https://people.mech.kuleuven.be/~jswevers/h04x3a/lecture_c9_c10.pdf Internal model control (IMC)] {{Webarchive|url=https://web.archive.org/web/20170830033243/https://people.mech.kuleuven.be/~jswevers/h04x3a/lecture_c9_c10.pdf |date=2017-08-30 }}", 2006

Echoing the spirit of the Good Regulator Theorem, recent research has also shown that agents modelled as goal-conditioned policies in environments described by a fully observed Markov processes can recover an approximation of the environment transition function (a world model) from the agent’s policy alone (under some assumptions on its regret). . In other words, all the information required to accurately simulate the environment is contained in the agent’s policy. This result paralells the Good Regulator Theorem by demonstrating that any agent capable of generalizing across a sufficiently diverse set of goal-directed tasks must necessarily learn an implicit predictive model of its environment.

Building on the same idea of the good regulator theorem, recent research in reinforcement learning has claimed that agents, modeled as goal-conditioned policies in environments governed by fully observable Markov processes, inherently encode a predictive model of their environment within their policy—provided certain assumptions about their performance hold.{{Citation |last=Richens |first=Jonathan |title=General agents need world models |date=2025-06-02 |url=http://arxiv.org/abs/2506.01622 |access-date=2025-06-04 |publisher=arXiv |doi=10.48550/arXiv.2506.01622 |id=arXiv:2506.01622 |last2=Abel |first2=David |last3=Bellot |first3=Alexis |last4=Everitt |first4=Tom}} In other words, all the information necessary to accurately simulate the environment is embedded in the agent’s policy. This finding parallels the Good Regulator Theorem by formally showing that any agent capable of generalizing across a sufficiently diverse set of goal-directed tasks must learn an implicit predictive model of its environment.

See also

References