Reservoir computing

{{Short description|Type of recurrent neural network with random and non-trainable internal structure}}

Reservoir computing is a framework for computation derived from recurrent neural network theory that maps input signals into higher dimensional computational spaces through the dynamics of a fixed, non-linear system called a reservoir.{{Cite journal|last1=Tanaka|first1=Gouhei|last2=Yamane|first2=Toshiyuki|last3=Héroux|first3=Jean Benoit|last4=Nakane|first4=Ryosho|last5=Kanazawa|first5=Naoki|last6=Takeda|first6=Seiji|last7=Numata|first7=Hidetoshi|last8=Nakano|first8=Daiju|last9=Hirose|first9=Akira|title=Recent advances in physical reservoir computing: A review|journal=Neural Networks|volume=115|pages=100–123|doi=10.1016/j.neunet.2019.03.005|pmid=30981085|issn=0893-6080|year=2019|doi-access=free|arxiv=1808.04962}} After the input signal is fed into the reservoir, which is treated as a "black box," a simple readout mechanism is trained to read the state of the reservoir and map it to the desired output. The first key benefit of this framework is that training is performed only at the readout stage, as the reservoir dynamics are fixed. The second is that the computational power of naturally available systems, both classical and quantum mechanical, can be used to reduce the effective computational cost.{{Cite journal|last1=Röhm|first1=André|last2=Lüdge|first2=Kathy|date=2018-08-03|title=Multiplexed networks: reservoir computing with virtual and real nodes|journal=Journal of Physics Communications|volume=2|issue=8|pages=085007|bibcode=2018JPhCo...2h5007R|doi=10.1088/2399-6528/aad56d|issn=2399-6528|doi-access=free|arxiv=1802.08590}}
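In a common discrete-time formulation of this scheme (the usual echo state network equations; the notation below is conventional rather than tied to a particular reference), the reservoir state evolves under fixed random weights and only the readout weights are learned:

<math display="block">\mathbf{x}(t) = f\!\left(W^{\text{res}}\,\mathbf{x}(t-1) + W^{\text{in}}\,\mathbf{u}(t)\right), \qquad \mathbf{y}(t) = W^{\text{out}}\,\mathbf{x}(t),</math>

where <math>\mathbf{u}(t)</math> is the input signal, <math>f</math> is an element-wise nonlinearity (typically <math>\tanh</math>), <math>W^{\text{in}}</math> and <math>W^{\text{res}}</math> are fixed random matrices, and only <math>W^{\text{out}}</math> is trained.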

History

The first examples of reservoir neural networks demonstrated that randomly connected recurrent neural networks could be used for sensorimotor sequence learning,{{Cite journal |last=Dominey |first=Peter |last2=Arbib |first2=Michael |last3=Joseph |first3=Jean-Paul |date=1995-07-01 |title=A Model of Corticostriatal Plasticity for Learning Oculomotor Associations and Sequences |url=https://direct.mit.edu/jocn/article-abstract/7/3/311/3172/A-Model-of-Corticostriatal-Plasticity-for-Learning?redirectedFrom=fulltext |journal=Journal of Cognitive Neuroscience |volume=7 |issue=3 |pages=311–336 |doi=10.1162/jocn.1995.7.3.311 |issn=0898-929X|url-access=subscription }} and simple forms of interval and speech discrimination.{{cite journal |last1=Buonomano |first1=Dean |last2=Merzenich |first2=Michael |title=Temporal information transformed into a spatial code by a neural network with realistic properties |journal=Science |date=1995 |volume=267 |issue=5200 |pages=1028–30 |doi=10.1126/science.7863330|pmid=7863330 }}{{cite journal |last1=Maass |first1=Wolfgang |last2=Natschläger |first2=Thomas |last3=Markram |first3=Henry |title=Real-time computing without stable states: a new framework for neural computation based on perturbations |journal=Neural Computation |date=2002 |volume=14 |issue=11 |pages=2531–2560 |doi=10.1162/089976602760407955 |pmid=12433288 }} In these early models, the memory in the network took the form of both short-term synaptic plasticity and activity mediated by recurrent connections. In other early reservoir neural network models, the memory of the recent stimulus history was provided solely by the recurrent activity.{{cite journal |last1=Buonomano |first1=Dean |last2=Mauk |first2=Michael |title=Neural network model of the cerebellum: temporal discrimination and the timing of motor responses |journal=Neural Computation |date=1994 |volume=6 |pages=38–55|doi=10.1162/neco.1994.6.1.38 }}{{cite journal |last1=Jaeger |first1=Herbert |last2=Haas |first2=Harald |title=Harnessing nonlinearity: predicting chaotic systems and saving energy in wireless communication |journal=Science |date=2004 |volume=304 |issue=5667 |pages=78–80|doi=10.1126/science.1091277 |pmid=15064413 }} Overall, the general concept of reservoir computing stems from the use of recursive connections within neural networks to create a complex dynamical system.Benjamin Schrauwen, David Verstraeten, and Jan Van Campenhout. "An overview of reservoir computing: theory, applications, and implementations." Proceedings of the European Symposium on Artificial Neural Networks ESANN 2007, pp. 471–482. It is a generalisation of earlier neural network architectures such as recurrent neural networks, liquid-state machines and echo-state networks. Reservoir computing also extends to physical systems that are not networks in the classical sense, but rather continuous systems in space and/or time: e.g. a literal "bucket of water" can serve as a reservoir that performs computations on inputs given as perturbations of the surface.{{Cite book|last1=Fernando|first1=C.|last2=Sojakka|first2=Sampsa|title=Advances in Artificial Life|chapter=Pattern Recognition in a Bucket|date=2003 |series=Lecture Notes in Computer Science|volume=2801|pages=588–597|doi=10.1007/978-3-540-39432-7_63|isbn=978-3-540-20057-4|s2cid=15073928}} The resultant complexity of such recurrent neural networks was found to be useful in solving a variety of problems, including language processing and dynamic system modeling. However, training of recurrent neural networks is challenging and computationally expensive. Reservoir computing reduces those training-related challenges by fixing the dynamics of the reservoir and training only the linear output layer.

A large variety of nonlinear dynamical systems can serve as a reservoir that performs computations. In recent years, semiconductor lasers have attracted considerable interest, as computation with them can be fast and energy-efficient compared to electrical components.

Recent advances in both AI and quantum information theory have given rise to the concept of quantum neural networks. These hold promise in quantum information processing, a task that is challenging for classical networks, but they can also find application in solving classical problems.{{Cite journal|last1=Ghosh|first1=Sanjib|last2=Opala|first2=Andrzej|last3=Matuszewski|first3=Michał|last4=Paterek|first4=Tomasz|last5=Liew|first5=Timothy C. H.|date=December 2019|title=Quantum reservoir processing|arxiv=1811.10335|journal=npj Quantum Information|volume=5|issue=1|pages=35|doi=10.1038/s41534-019-0149-8|bibcode=2019npjQI...5...35G|s2cid=119197635|issn=2056-6387}}{{cite arXiv|last1=Negoro|first1=Makoto|last2=Mitarai|first2=Kosuke|last3=Fujii|first3=Keisuke|last4=Nakajima|first4=Kohei|last5=Kitagawa|first5=Masahiro|date=2018-06-28|title=Machine learning with controllable quantum dynamics of a nuclear spin ensemble in a solid|class=quant-ph|eprint=1806.10910}} In 2018, a physical realization of a quantum reservoir computing architecture was demonstrated in the form of nuclear spins within a molecular solid. However, those nuclear spin experiments did not demonstrate quantum reservoir computing per se, as they did not involve the processing of sequential data. Rather, the data were vector inputs, which makes the work more accurately a demonstration of a quantum implementation of the random kitchen sinks{{cite journal|last1=Rahimi|first1=Ali|last2=Recht|first2=Benjamin|date=December 2008|title=Weighted Sums of Random Kitchen Sinks: Replacing minimization with randomization in Learning|journal=NIPS'08: Proceedings of the 21st International Conference on Neural Information Processing Systems|url=http://papers.nips.cc/paper/3495-weighted-sums-of-random-kitchen-sinks-replacing-minimization-with-randomization-in-learning.pdf|pages=1313–1320}} algorithm (also going by the name of extreme learning machines in some communities). In 2019, another possible implementation of quantum reservoir processors was proposed in the form of two-dimensional fermionic lattices. In 2020, realization of reservoir computing on gate-based quantum computers was proposed and demonstrated on cloud-based IBM superconducting near-term quantum computers.{{cite journal|last1=Chen|first1=Jiayin|last2=Nurdin|first2=Hendra|last3=Yamamoto|first3=Naoki|title=Temporal Information Processing on Noisy Quantum Computers|journal=Physical Review Applied|volume=14|pages=024065|date=2020-08-24|issue=2|doi=10.1103/PhysRevApplied.14.024065|arxiv=2001.09498|bibcode=2020PhRvP..14b4065C|s2cid=210920543|url=https://doi.org/10.1103/PhysRevApplied.14.024065}}

Reservoir computers have been used for time-series analysis purposes. In particular, some of their usages involve chaotic time-series prediction,{{Cite journal|last1=Pathak|first1=Jaideep|last2=Hunt|first2=Brian|last3=Girvan|first3=Michelle|last4=Lu|first4=Zhixin|last5=Ott|first5=Edward|date=2018-01-12|title=Model-Free Prediction of Large Spatiotemporally Chaotic Systems from Data: A Reservoir Computing Approach|journal=Physical Review Letters|volume=120|issue=2|pages=024102|doi=10.1103/PhysRevLett.120.024102|pmid=29376715|bibcode=2018PhRvL.120b4102P|doi-access=free}}{{Cite journal|last1=Vlachas|first1=P.R.|last2=Pathak|first2=J.|last3=Hunt|first3=B.R.|last4=Sapsis|first4=T.P.|last5=Girvan|first5=M.|last6=Ott|first6=E.|last7=Koumoutsakos|first7=P.|date=2020-03-21|title=Backpropagation algorithms and Reservoir Computing in Recurrent Neural Networks for the forecasting of complex spatiotemporal dynamics|url=http://dx.doi.org/10.1016/j.neunet.2020.02.016|journal=Neural Networks|volume=126|pages=191–217|doi=10.1016/j.neunet.2020.02.016|pmid=32248008|issn=0893-6080|arxiv=1910.05266|s2cid=211146609}} separation of chaotic signals,{{Cite journal|last1=Krishnagopal|first1=Sanjukta|last2=Girvan|first2=Michelle|last3=Ott|first3=Edward|last4=Hunt|first4=Brian R.|date=2020-02-01|title=Separation of chaotic signals by reservoir computing|url=https://aip.scitation.org/doi/10.1063/1.5132766|journal=Chaos: An Interdisciplinary Journal of Nonlinear Science|volume=30|issue=2|pages=023123|doi=10.1063/1.5132766|pmid=32113243|issn=1054-1500|arxiv=1910.10080|bibcode=2020Chaos..30b3123K|s2cid=204823815}} and link inference of networks from their dynamics.{{Cite journal|last1=Banerjee|first1=Amitava|last2=Hart|first2=Joseph D.|last3=Roy|first3=Rajarshi|last4=Ott|first4=Edward|date=2021-07-20|title=Machine Learning Link Inference of Noisy Delay-Coupled Networks with Optoelectronic Experimental Tests|journal=Physical Review X|volume=11|issue=3|pages=031014|doi=10.1103/PhysRevX.11.031014|arxiv=2010.15289|bibcode=2021PhRvX..11c1014B|doi-access=free}}
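As an illustration of chaotic time-series prediction, the following is a minimal, self-contained echo state network sketch in Python (NumPy only). The choice of the logistic map as the target, the reservoir size, the spectral radius, and the regularization value are illustrative assumptions, not parameters taken from the cited studies.

<syntaxhighlight lang="python">
import numpy as np

rng = np.random.default_rng(0)

# Chaotic logistic map as the target time series
u = np.empty(3000)
u[0] = 0.5
for t in range(2999):
    u[t + 1] = 3.9 * u[t] * (1.0 - u[t])

N = 200                                        # reservoir size (illustrative)
w_in = rng.uniform(-0.5, 0.5, N)               # fixed random input weights
W = rng.normal(0.0, 1.0, (N, N))
W *= 0.9 / np.max(np.abs(np.linalg.eigvals(W)))  # spectral radius below 1

# Drive the fixed reservoir and collect its states
x = np.zeros(N)
states = np.empty((len(u) - 1, N))
for t in range(len(u) - 1):
    x = np.tanh(W @ x + w_in * u[t])
    states[t] = x

washout = 100                                  # discard initial transient
X = states[washout:]
Y = u[washout + 1:]                            # one-step-ahead targets

# Train only the linear readout, by ridge regression, on the first part
lam, split = 1e-6, 2000
W_out = np.linalg.solve(X[:split].T @ X[:split] + lam * np.eye(N),
                        X[:split].T @ Y[:split])

pred = X[split:] @ W_out
print("held-out one-step MSE:", np.mean((pred - Y[split:]) ** 2))
</syntaxhighlight>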

Classical reservoir computing

= Reservoir =

The 'reservoir' in reservoir computing is the internal structure of the computer, and must have two properties: it must be made up of individual, non-linear units, and it must be capable of storing information. The non-linearity describes the response of each unit to input, which is what allows reservoir computers to solve complex problems. Reservoirs are able to store information by connecting the units in recurrent loops, where the previous input affects the next response. This dependence of the response on past inputs is what allows such computers to be trained to complete specific tasks.{{Cite journal|last=Soriano|first=Miguel C.|date=2017-02-06|title=Viewpoint: Reservoir Computing Speeds Up|url=https://physics.aps.org/articles/v10/12|journal=Physics|language=en|volume=10|page=12 |doi=10.1103/Physics.10.12|doi-access=free|hdl=10261/173181|hdl-access=free}}
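A minimal sketch of these two properties in Python (the reservoir size, weight scaling, and random seed are illustrative assumptions): each unit responds non-linearly, and the recurrent connections carry an echo of past inputs forward in time.

<syntaxhighlight lang="python">
import numpy as np

rng = np.random.default_rng(1)
N = 100
W = rng.normal(0, 1, (N, N))
W *= 0.8 / np.max(np.abs(np.linalg.eigvals(W)))  # echo-state-style scaling
w_in = rng.uniform(-1, 1, N)

x = np.zeros(N)
norms = []
for t in range(30):
    u = 1.0 if t == 0 else 0.0                   # single input impulse
    x = np.tanh(W @ x + w_in * u)                # non-linear unit response
    norms.append(np.linalg.norm(x))

# The impulse reverberates through the recurrent loops and fades gradually,
# giving the reservoir a short-term memory of past inputs.
print([f"{n:.3f}" for n in norms[:10]])
</syntaxhighlight>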

Reservoirs can be virtual or physical. Virtual reservoirs are typically randomly generated and are designed like neural networks. Virtual reservoirs can be designed to have non-linearity and recurrent loops, but, unlike conventional neural networks, the connections between units are randomized and remain unchanged throughout computation. Physical reservoirs are possible because of the inherent non-linearity of certain natural systems. The interaction between ripples on the surface of water contains the nonlinear dynamics required of a reservoir, and a pattern-recognition reservoir computer was developed by first creating ripples with electric motors and then recording and analyzing the ripples at the readout.

= Readout =

The readout is a neural network layer that performs a linear transformation on the output of the reservoir. The weights of the readout layer are trained by analyzing the spatiotemporal patterns of the reservoir after excitation by known inputs, using a training method such as linear regression or ridge regression. Because the implementation depends on the reservoir's spatiotemporal patterns, the details of readout methods are tailored to each type of reservoir. For example, the readout for a reservoir computer using a container of liquid as its reservoir might entail observing spatiotemporal patterns on the surface of the liquid.
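For example, if the reservoir states observed during training are stacked as the rows of a matrix <math>X</math> and the corresponding targets as the rows of <math>Y</math>, the ridge-regression readout has the standard closed form

<math display="block">W^{\text{out}} = \left(X^{\mathsf{T}} X + \lambda I\right)^{-1} X^{\mathsf{T}} Y,</math>

where <math>\lambda \ge 0</math> is a regularization parameter (<math>\lambda = 0</math> recovers ordinary linear regression).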

= Types =

== Context reverberation network ==

An early example of reservoir computing was the context reverberation network.Kevin Kirby. "Context dynamics in neural sequential learning." Proceedings of the Florida Artificial Intelligence Research Symposium FLAIRS (1991), 66–70. In this architecture, an input layer feeds into a high-dimensional dynamical system which is read out by a trainable single-layer perceptron. Two kinds of dynamical system were described: a recurrent neural network with fixed random weights, and a continuous reaction–diffusion system inspired by Alan Turing’s model of morphogenesis. At the trainable layer, the perceptron associates current inputs with the signals that reverberate in the dynamical system; the latter were said to provide a dynamic "context" for the inputs. In the language of later work, the reaction–diffusion system served as the reservoir.

== Echo state network ==

{{main|Echo state network}}

The Tree Echo State Network (TreeESN) model represents a generalization of the reservoir computing framework to tree-structured data.{{Cite journal|last1=Gallicchio|first1=Claudio|last2=Micheli|first2=Alessio|year=2013|title=Tree Echo State Networks|journal=Neurocomputing|volume=101|pages=319–337|doi=10.1016/j.neucom.2012.08.017|hdl=11568/158480|hdl-access=free}}
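A minimal sketch of the core idea in Python, assuming a sum over child states in the state transition (as in TreeESN) and with illustrative sizes and scaling: the state of each tree node is computed bottom-up from its label and its children's states by the same fixed, untrained reservoir.

<syntaxhighlight lang="python">
import numpy as np

rng = np.random.default_rng(2)
N, D = 50, 3                                   # reservoir size, label size
W_in = rng.uniform(-0.5, 0.5, (N, D))
W = rng.normal(0, 1, (N, N))
W *= 0.9 / np.max(np.abs(np.linalg.eigvals(W)))

def node_state(label, child_states):
    """State of a tree node from its label and its children's states."""
    s = sum(child_states, np.zeros(N))         # aggregate children by summing
    return np.tanh(W_in @ label + W @ s)

# A tiny tree: a root with two leaves; labels are random 3-vectors here
leaf1 = node_state(rng.normal(size=D), [])
leaf2 = node_state(rng.normal(size=D), [])
root = node_state(rng.normal(size=D), [leaf1, leaf2])
# A trained linear readout would map `root` (or a pooling over all node
# states) to the output for the whole tree.
</syntaxhighlight>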

== Liquid-state machine ==

{{main|Liquid-state machine}}

'''Chaotic Liquid State Machine'''

The liquid (i.e. reservoir) of a Chaotic Liquid State Machine (CLSM),{{Cite book|last1=Aoun|first1=Mario Antoine|last2=Boukadoum|first2=Mounir|title=2014 IEEE 13th International Conference on Cognitive Informatics and Cognitive Computing |chapter=Learning algorithm and neurocomputing architecture for NDS Neurons |date=2014|chapter-url=http://dx.doi.org/10.1109/icci-cc.2014.6921451|pages=126–132|publisher=IEEE|doi=10.1109/icci-cc.2014.6921451|isbn=978-1-4799-6081-1|s2cid=16026952}}{{Cite journal|last1=Aoun|first1=Mario Antoine|last2=Boukadoum|first2=Mounir|date=2015|title=Chaotic Liquid State Machine|url=http://dx.doi.org/10.4018/ijcini.2015100101|journal=International Journal of Cognitive Informatics and Natural Intelligence|volume=9|issue=4|pages=1–20|doi=10.4018/ijcini.2015100101|issn=1557-3958|url-access=subscription}} or chaotic reservoir, is made from chaotic spiking neurons that stabilize their activity by settling to a single hypothesis describing the trained inputs of the machine. This is in contrast to general types of reservoirs, which do not stabilize. The stabilization of the liquid occurs via synaptic plasticity and chaos control mechanisms that govern the neural connections inside it. The CLSM showed promising results in learning sensitive time series data.

== Nonlinear transient computation ==

This type of information processing is most relevant when time-dependent input signals depart from the mechanism’s internal dynamics. These departures cause transients, or temporary alterations, which are reflected in the device’s output.{{cite journal |last1=Crook |first1=Nigel |title=Nonlinear Transient Computation |journal=Neurocomputing |date=2007 |volume=70 |issue=7–9 |pages=1167–1176 |doi=10.1016/j.neucom.2006.10.148}}

== Deep reservoir computing ==

The extension of the reservoir computing framework towards deep learning, with the introduction of Deep Reservoir Computing and of the Deep Echo State Network (DeepESN) model,{{cite thesis |type=PhD thesis |last=Pedrelli |first=Luca |date=2019 |title=Deep Reservoir Computing: A Novel Class of Deep Recurrent Neural Networks |publisher=Università di Pisa |url=https://etd.adm.unipi.it/t/etd-02282019-191815/}}{{Cite journal|last1=Gallicchio|first1=Claudio|last2=Micheli|first2=Alessio|last3=Pedrelli|first3=Luca|title=Deep reservoir computing: A critical experimental analysis|journal=Neurocomputing|volume=268|pages=87–99|doi=10.1016/j.neucom.2016.12.089|date=2017-12-13|hdl=11568/851934|hdl-access=free}}{{Cite journal|last1=Gallicchio|first1=Claudio|last2=Micheli|first2=Alessio|date=2017-05-05|title=Echo State Property of Deep Reservoir Computing Networks|journal=Cognitive Computation|volume=9|issue=3|pages=337–350|doi=10.1007/s12559-017-9461-9|issn=1866-9956|hdl=11568/851932|s2cid=1077549|hdl-access=free}}{{Cite journal|last1=Gallicchio|first1=Claudio|last2=Micheli|first2=Alessio|last3=Pedrelli|first3=Luca|date=December 2018|title=Design of deep echo state networks|journal=Neural Networks|volume=108|pages=33–47|doi=10.1016/j.neunet.2018.08.002|pmid=30138751|issn=0893-6080|hdl=11568/939082|s2cid=52075702|hdl-access=free}} makes it possible to efficiently train models for the hierarchical processing of temporal data, while at the same time enabling investigation of the inherent role of layered composition in recurrent neural networks.
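A sketch of the layered construction in Python, with illustrative (not prescribed) dimensions and scaling: each fixed reservoir layer is driven by the states of the layer below it, and the trained linear readout sees the concatenation of all layer states.

<syntaxhighlight lang="python">
import numpy as np

rng = np.random.default_rng(3)

def make_layer(n_in, n_res):
    """One fixed, untrained reservoir layer."""
    W_in = rng.uniform(-0.5, 0.5, (n_res, n_in))
    W = rng.normal(0, 1, (n_res, n_res))
    W *= 0.9 / np.max(np.abs(np.linalg.eigvals(W)))
    return W_in, W

layers = [make_layer(1, 100), make_layer(100, 100), make_layer(100, 100)]
states = [np.zeros(100) for _ in layers]

def step(u):
    """One time step of a deep reservoir: layer l is driven by layer l-1."""
    drive = np.atleast_1d(u)
    for l, (W_in, W) in enumerate(layers):
        states[l] = np.tanh(W_in @ drive + W @ states[l])
        drive = states[l]
    return np.concatenate(states)      # readout input: all layers' states

feature = step(0.3)
print(feature.shape)                   # (300,) -> linear readout trained on this
</syntaxhighlight>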

Quantum reservoir computing

Quantum reservoir computing may use the nonlinear nature of quantum mechanical interactions or processes to form the characteristic nonlinear reservoirs,{{Cite journal|last1=Chen|first1=Jiayin|last2=Nurdin|first2=Hendra|date=2019-05-15|title=Learning nonlinear input–output maps with dissipative quantum systems|url=https://link.springer.com/article/10.1007%2Fs11128-019-2311-9|journal=Quantum Information Processing|volume=18|issue=7|page=198|doi=10.1007/s11128-019-2311-9|arxiv=1901.01653|bibcode=2019QuIP...18..198C|s2cid=57573677}} but it may also be realized with linear reservoirs when the injection of the input into the reservoir creates the nonlinearity.{{cite journal|last1=Nokkala|first1=Johannes|last2=Martínez-Peña|first2=Rodrigo|last3=Giorgi|first3=Gian Luca|last4=Parigi|first4=Valentina|last5=Soriano|first5=Miguel C.|last6=Zambrini|first6=Roberta|title=Gaussian states of continuous-variable quantum systems provide universal and versatile reservoir computing|journal=Communications Physics|year=2021|volume=4|issue=1|page=53|doi=10.1038/s42005-021-00556-w|arxiv=2006.04821|bibcode=2021CmPhy...4...53N|s2cid=234355683}} The marriage of machine learning and quantum devices is leading to the emergence of quantum neuromorphic computing as a new research area.{{cite journal|last1=Marković|first1=Danijela |last2=Grollier|first2=Julie|title=Quantum Neuromorphic Computing|journal=Applied Physics Letters|volume=117|pages=150501|date=2020-10-13|issue=15|doi=10.1063/5.0020014|arxiv=2006.15111|bibcode=2020ApPhL.117o0501M |s2cid=210920543|url=https://doi.org/10.1063/5.0020014}}

= Types =

== Gaussian states of interacting quantum harmonic oscillators ==

Gaussian states are a paradigmatic class of states of continuous variable quantum systems.{{cite arXiv|last1=Ferraro|first1=Alessandro|last2=Olivares|first2=Stefano|last3=Paris|first3=Matteo G. A.|date=2005-03-31|title=Gaussian states in continuous variable quantum information|eprint=quant-ph/0503237}} Although they can nowadays be created and manipulated in, e.g., state-of-the-art optical platforms,{{Cite journal|last1=Roslund|first1=Jonathan|last2=de Araújo|first2=Renné Medeiros|last3=Jiang|first3=Shifeng|last4=Fabre|first4=Claude|last5=Treps|first5=Nicolas|date=2013-12-15|title=Wavelength-multiplexed quantum networks with ultrafast frequency combs|url=https://www.nature.com/articles/nphoton.2013.340|journal=Nature Photonics|language=en|volume=8|issue=2|pages=109–112|doi=10.1038/nphoton.2013.340|arxiv=1307.1216|s2cid=2328402|issn=1749-4893}} and are naturally robust to decoherence, it is well known that they are not sufficient for, e.g., universal quantum computing, because transformations that preserve the Gaussian nature of a state are linear.{{Cite journal|last1=Bartlett|first1=Stephen D.|last2=Sanders|first2=Barry C.|last3=Braunstein|first3=Samuel L.|last4=Nemoto|first4=Kae|date=2002-02-14|title=Efficient Classical Simulation of Continuous Variable Quantum Information Processes|url=https://link.aps.org/doi/10.1103/PhysRevLett.88.097904|journal=Physical Review Letters|volume=88|issue=9|pages=097904|doi=10.1103/PhysRevLett.88.097904|pmid=11864057|arxiv=quant-ph/0109047|bibcode=2002PhRvL..88i7904B|s2cid=2161585}} Normally, linear dynamics would not be sufficient for nontrivial reservoir computing either. It is nevertheless possible to harness such dynamics for reservoir computing purposes by considering a network of interacting quantum harmonic oscillators and injecting the input by periodic state resets of a subset of the oscillators. With a suitable choice of how the states of this subset of oscillators depend on the input, the observables of the rest of the oscillators can become nonlinear functions of the input, suitable for reservoir computing; indeed, thanks to the properties of these functions, even universal reservoir computing becomes possible by combining the observables with a polynomial readout function. In principle, such reservoir computers could be implemented with controlled multimode optical parametric processes;{{Cite journal|last1=Nokkala|first1=J.|last2=Arzani|first2=F.|last3=Galve|first3=F.|last4=Zambrini|first4=R.|last5=Maniscalco|first5=S.|last6=Piilo|first6=J.|last7=Treps|first7=N.|last8=Parigi|first8=V.|date=2018-05-09|title=Reconfigurable optical implementation of quantum complex networks|url=https://doi.org/10.1088%2F1367-2630%2Faabc77|journal=New Journal of Physics|language=en|volume=20|issue=5|pages=053024|doi=10.1088/1367-2630/aabc77|arxiv=1708.08726|bibcode=2018NJPh...20e3024N|s2cid=119091176|issn=1367-2630}} however, efficient extraction of the output from the system is challenging, especially in the quantum regime, where measurement back-action must be taken into account.
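The following is a heavily simplified, two-mode classical simulation of this idea in Python, tracking only the covariance matrix of the quadratures. The beam-splitter coupling, the squeezing-based input encoding, and the choice of second moments as observables are illustrative assumptions in the spirit of the scheme, not the actual network of the cited proposal.

<syntaxhighlight lang="python">
import numpy as np

# Quadrature ordering (x1, p1, x2, p2), hbar = 1, so vacuum covariance = I/2.
# Mode 1 is the input mode that is reset each step; mode 2 is the reservoir.
c, s = np.cos(0.7), np.sin(0.7)
I2 = np.eye(2)
S = np.block([[c * I2, s * I2], [-s * I2, c * I2]])  # beam splitter (symplectic)

def run(inputs):
    sigma = np.eye(4) / 2                   # start from the vacuum state
    feats = []
    for u in inputs:
        r = 0.5 * u                         # encode input in a squeezing strength
        # Reset mode 1 to squeezed vacuum: var(x1)=e^{-2r}/2, var(p1)=e^{2r}/2
        sigma[:2, :] = 0.0
        sigma[:, :2] = 0.0
        sigma[0, 0] = 0.5 * np.exp(-2 * r)
        sigma[1, 1] = 0.5 * np.exp(+2 * r)
        sigma = S @ sigma @ S.T             # linear (Gaussian) evolution
        # Second moments of mode 2 are nonlinear (via exp) in the input history
        feats.append([sigma[2, 2], sigma[3, 3], sigma[2, 3]])
    return np.array(feats)

F = run(np.sin(np.arange(200) * 0.5))
# A polynomial readout on F, trained by linear regression, completes the scheme.
</syntaxhighlight>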

== 2-D quantum dot lattices ==

In this architecture, randomized coupling between lattice sites grants the reservoir the “black box” property inherent to reservoir processors. The reservoir is then excited by an incident optical field, which acts as the input. Readout occurs in the form of occupation numbers of lattice sites, which are naturally nonlinear functions of the input.

== Nuclear spins in a molecular solid ==

In this architecture, quantum mechanical coupling between spins of neighboring atoms within the molecular solid provides the non-linearity required to create the higher-dimensional computational space. The reservoir is then excited by radiofrequency electromagnetic radiation tuned to the resonance frequencies of relevant nuclear spins. Readout occurs by measuring the nuclear spin states.

== Reservoir computing on gate-based near-term superconducting quantum computers ==

The most prevalent model of quantum computing is the gate-based model, where quantum computation is performed by sequential applications of unitary quantum gates on the qubits of a quantum computer.{{Citation|last1=Nielsen|first1=Michael|last2=Chuang|first2=Isaac|title=Quantum Computation and Quantum Information|publisher=Cambridge University Press Cambridge|date=2010|edition=2}} A theory for the implementation of reservoir computing on a gate-based quantum computer, with proof-of-principle demonstrations on a number of IBM superconducting noisy intermediate-scale quantum (NISQ) computers, has been reported.John Preskill. "Quantum Computing in the NISQ era and beyond." Quantum 2, 79 (2018).
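A schematic, classically simulated sketch of the gate-based idea in Python follows (a density-matrix simulation with NumPy rather than an actual quantum device; the circuit structure, the reset of one qubit to obtain fading memory, and all parameters are illustrative assumptions): each input is encoded as a single-qubit rotation, a fixed random unitary plays the role of the reservoir circuit, and Pauli-Z expectation values serve as features for a trained linear readout.

<syntaxhighlight lang="python">
import numpy as np

rng = np.random.default_rng(4)
n = 3                                     # number of qubits (illustrative)
dim = 2 ** n

# Fixed random "reservoir" unitary from the QR decomposition of a complex
# Gaussian matrix (Haar-like; stands in for a fixed entangling circuit).
A = rng.normal(size=(dim, dim)) + 1j * rng.normal(size=(dim, dim))
U, _ = np.linalg.qr(A)

def rx_on_qubit0(theta):
    """Input-encoding rotation RX(theta) acting on qubit 0."""
    rx = np.array([[np.cos(theta / 2), -1j * np.sin(theta / 2)],
                   [-1j * np.sin(theta / 2), np.cos(theta / 2)]])
    return np.kron(rx, np.eye(dim // 2))

def reset_qubit0(rho):
    """Trace out qubit 0 and re-prepare it in |0> (gives fading memory)."""
    r = rho.reshape(2, dim // 2, 2, dim // 2)
    reduced = r[0, :, 0, :] + r[1, :, 1, :]      # partial trace over qubit 0
    out = np.zeros((dim, dim), dtype=complex)
    out[:dim // 2, :dim // 2] = reduced          # qubit 0 back in |0>
    return out

Z = np.diag([1.0, -1.0])
def z_expectations(rho):
    """<Z_q> for every qubit q, used as the reservoir's output features."""
    vals = []
    for q in range(n):
        op = np.eye(1)
        for k in range(n):
            op = np.kron(op, Z if k == q else np.eye(2))
        vals.append(np.real(np.trace(op @ rho)))
    return vals

rho = np.zeros((dim, dim), dtype=complex)
rho[0, 0] = 1.0                                  # initial state |0...0>
features = []
for u in np.sin(np.arange(50) * 0.3):            # toy input sequence
    rho = reset_qubit0(rho)                      # dissipative step
    R = rx_on_qubit0(np.pi * u)                  # encode input on qubit 0
    rho = U @ (R @ rho @ R.conj().T) @ U.conj().T  # fixed reservoir dynamics
    features.append(z_expectations(rho))
# A linear readout trained on `features` completes the reservoir computer.
</syntaxhighlight>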

References

{{Reflist|30em}}

Further reading

* [http://www.nature.com/ncomms/journal/v2/n9/full/ncomms1476.html?WT.ec_id=NCOMMS-20110913 Reservoir Computing using delay systems], Nature Communications, 2011
* [http://www.nature.com/srep/2012/120227/srep00287/full/srep00287.html Optoelectronic Reservoir Computing], Scientific Reports, February 2012
* [http://www.opticsinfobase.org/oe/abstract.cfm?uri=oe-20-3-3241 Optoelectronic Reservoir Computing], Optics Express, 2012
* [http://www.nature.com/ncomms/journal/v4/n1/full/ncomms2368.html All-optical Reservoir Computing], Nature Communications, 2013
* [http://www.mitpressjournals.org/doi/10.1162/NECO_a_00694#.WL4P9iHyvIo Memristor Models for Machine learning], Neural Computation, 2014 ([https://arxiv.org/abs/1406.2210 arXiv])

Category:Artificial neural networks