1 Introduction
Recurrent neural networks, and reservoir computing models in particular, are studied in neuroscience and neuromorphic computing to model and process inputs with spatiotemporal dynamics. A reservoir model is a randomly connected system of neurons that creates a complex, high dimensional dynamic representation of an input. The patterns of activity generated by the reservoir are then processed by a layer of linear readout neurons to perform pattern recognition tasks (Fig. 1). The intrinsic recurrence of such systems gives them a form of ‘memory’ that stores patterns of correlated activity in a sequential input. Reservoirs also exhibit ‘chaos’ on account of the recurrence, which makes it challenging to understand the nonlinear dynamics of such networks theoretically and to exploit them. However, understanding the dynamics of these networks during training has large implications for advancing reservoir computing toward a broader range of artificial intelligence and computational neuroscience applications.
Most of the existing literature tends to bypass ‘chaos’ and theorizes the dynamics of reservoir models under designed control settings for random, untrained networks (Sompolinsky et al., 1988; Rajan et al., 2010b; Stern et al., 2014; Sussillo & Barak, 2013). Rivkind et al. (Rivkind & Barak, 2017) presented the first analysis of the effect of training on reservoir dynamics. However, the analysis was restricted to models already trained on fixed point targets. In this work, we present a study to understand the dynamics of reservoir models during training. As opposed to previous works, we use the ‘chaotic’ activity of the reservoir to gauge and analyze the stability of the model. We consider reservoir networks that have feedback connections from the readout to the reservoir (Fig. 1). This closed loop setting enables the reservoir to perform complex tasks, but it also makes the stability of these networks difficult to analyze. We break the readout-feedback loop during training by unrolling the network over time, a methodology we call Breaking the Closed Loop (BCL). Then, we analyze the activity of the auxiliary open loop system at intermediate time steps at different stages of training. The key contributions of this work are as follows:

We show that the network tends to become less chaotic with training, which reflects the success of training. We evaluate the nonlinear dynamics of the reservoir using Eigenvalue (EV) spectra and observe that the spectral circle shrinks as training progresses, indicating a decrease in random chaotic projections.

Furthermore, we extend our analysis to fixed point and time varying targets and observe similar stability behavior. For time varying targets, we use the first order reduced and controlled error (FORCE) training method (Sussillo & Abbott, 2009), while we use standard least squares regression to train fixed point targets.

Additionally, we use Principal Component Analysis (PCA) to further examine the model’s activity during training. Rajan et al. (Rajan et al., 2010b, a) demonstrated that reservoirs operate in a dynamic regime, wherein stable input driven periodic activity and chaotic activity tend to coincide. Interestingly, we observe that our ‘Breaking the Closed Loop (BCL)’ analysis preserves the interaction between the chaotic spontaneous activity and the nonchaotic input driven state of the reservoir, supporting the results from prior works (Rajan et al., 2010b, a; Abbott et al., 2011).
To validate the effectiveness of our BCL methodology, we compared our analysis to the prior work of Rivkind et al. (Rivkind & Barak, 2017), who analyze trained reservoir dynamics with closed loop theory. We verify that the EV spectra obtained from our proposed BCL after training a reservoir coincide with the EV spectra obtained from the closed loop theory (under the same operating conditions).
In summary, we analyze the dynamics of a reservoir during training and formulate a stability criterion, while substantiating the results from prior work by Rivkind et al. (Rivkind & Barak, 2017) and Rajan et al. (Rajan et al., 2010b, a) on trained recurrent dynamics. It is worth mentioning that the BCL method of analysing a feedback system can seem like a trivial approach. However, the EV spectra and PCA results obtained from analysing the auxiliary open loop systems yield interesting and novel statistics about the reservoir dynamics and stability, which can be applied to complex dynamical systems for large-scale analysis.
2 Model Description and Motivation for BCL
The dynamics of the reservoir model are given by
$$\dot{x} = -x + J r + w_{FB}\, z + w_{in}\, u \tag{1}$$
where $x$ represents the internal state of the reservoir at a given time and $r = \phi(x)$ is the neuronal firing rate, where $\phi$ denotes a nonlinear function ($\tanh$ in this work). $z = w_{out}^{T} r$ is the output activity of the linear readout neurons, which is fed back into the reservoir with feedback weights $w_{FB}$. Input $u$ is fed into the reservoir with input weights $w_{in}$. $J$ represents an $N \times N$ recurrent weight matrix (with $N$ equal to the number of neurons in the reservoir) whose entries are chosen randomly and independently from a Gaussian distribution with mean $0$ and variance $g^2/N$, where $g$ is the synaptic gain parameter. $g$ regulates the overall chaotic activity in a system. Previous studies have shown that for large recurrent networks, values of $g > 1$ generate increasingly complex and chaotic patterns of spontaneous activity (Rajan & Abbott, 2006; Sompolinsky et al., 1988). In our simulations, we vary $g$ to understand the behavior of the system under different chaotic conditions. For simplicity in simulations, we follow previous works (Rajan & Abbott, 2006; Rivkind & Barak, 2017) and set the input $u = 0$ for all our experiments. In all our simulations, we start by randomly initializing the reservoir state $x$ from a Normal distribution and then continue with our training experiments. Note, reservoirs are generally categorised under recurrent neural networks due to the weight matrix $J$ that imparts recurrency to such models (Rivkind & Barak, 2017; Sompolinsky et al., 1988).
The objective of training the readout weights $w_{out}$ is to ensure that the activity of the output neurons ($z$) matches some predefined target function $f(t)$, i.e. $z(t) = f(t)$. In case of a fixed point target, $f(t) = A$, where $A$ is a constant value. In case of a time varying target, $f(t)$ is, say, a sinusoid, where the objective is to train the reservoir to generate sinusoidal activity. Learning $w_{out}$ for fixed point targets is simple: it involves solving a least squares regression task using standard linear algebra methods. For time varying targets, solving for $w_{out}$ becomes slightly more complicated. Thus, we use FORCE training devised by Sussillo et al. (Sussillo & Abbott, 2009), a widely used algorithm to train reservoirs for generating sequential patterns.
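The model described above can be sketched numerically as follows. This is a minimal illustration, not the authors' code: it assumes the rate dynamics take the standard form dx/dt = -x + J r + w_fb z with r = tanh(x) and zero input, integrates them with a forward Euler step, and draws the feedback weights from a uniform distribution. The step size `dt`, the feedback weight distribution, and all function names are assumptions made for the example.

```python
import numpy as np

def make_reservoir(n, g, seed=0):
    """Draw J with i.i.d. Gaussian entries of mean 0 and variance g^2 / n."""
    rng = np.random.default_rng(seed)
    J = rng.normal(0.0, g / np.sqrt(n), size=(n, n))
    w_fb = rng.uniform(-1.0, 1.0, size=n)  # assumed feedback weight distribution
    x0 = rng.normal(0.0, 1.0, size=n)      # random initial reservoir state
    return J, w_fb, x0

def simulate(J, w_fb, w_out, x0, steps, dt=0.1):
    """Forward Euler integration of dx/dt = -x + J r + w_fb z, with input u = 0."""
    x = x0.copy()
    outputs = np.empty(steps)
    for t in range(steps):
        r = np.tanh(x)       # firing rates
        z = w_out @ r        # linear readout
        x = x + dt * (-x + J @ r + w_fb * z)
        outputs[t] = z
    return x, outputs

J, w_fb, x0 = make_reservoir(n=200, g=1.5)
w_out = np.zeros(200)        # untrained readout: no feedback reaches the reservoir
x_final, z = simulate(J, w_fb, w_out, x0, steps=500)
```

With an all-zero readout the loop is effectively open, so the trajectory shows only the spontaneous activity set by the gain g.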
A noteworthy observation from Eqn. (1) is that the reservoir model is a closed loop system (due to the feedback term) with perpetual feedback from the output, which affects the internal state of the reservoir at every time step. Since training occurs within these closed loop dynamics, analyzing the stability of the reservoir is difficult given the continuous temporal activity of the system: the feedback affects the training, which in turn affects the stability of the system. Several prior works therefore analyzed the behavior of random networks without feedback and without any training consideration (Sussillo & Barak, 2013; Rajan et al., 2010b; Stern et al., 2014). Such works theorized interesting results pertaining to the stability and robustness criteria for chaotic networks. However, those results cannot be extended to a feedback-based reservoir performing realistic tasks. In fact, Rivkind et al.’s analysis (Rivkind & Barak, 2017) of feedback-based reservoirs hinges on the fact that the network has already been trained, so the reservoir dynamics have converged to a unique stable state. This allows them to use constant feedback that does not affect the reservoir’s state. For a fixed point target, Rivkind et al. describe the behavior of the trained reservoir model (already at a stable state) as
(2) 
It is clear from Eqn. (2) that the feedback no longer influences the temporal dynamics, since it is constant. Analyzing the stability of a reservoir during training requires us to incorporate feedback that changes at every time step. Hence, we unroll the reservoir dynamics over time without affecting the properties of the system, as described below.
3 Breaking the Closed Loop (BCL): Unrolling the Reservoir over Time
The basic idea of BCL is illustrated in Fig. 2 (a), where the feedback system can be viewed as a series of interconnected cascaded systems unrolled over multiple time steps. As , the cascaded system’s dynamics will converge to the closed loop dynamics. This broken loop theory is widely used in decision and control to analyze the robustness of feedback systems. Here, we apply this concept to analyzing reservoir dynamics during training. Fig. 2 (b) shows the unrolled reservoir dynamics over time. Here, the output from each state at a given time step is multiplied by the feedback weights before being fed as input to the network in the next time step. This preserves the temporal property and the feedback dynamics of the reservoir, which continually affect the reservoir’s internal state at every time step. Consequently, the reservoir dynamics in the unrolled state can be described as
(3)  
Eqn. (3) bears resemblance to Eqn. (1). The difference arises from the inclusion of a term that accounts for the intrinsic state of the system from the previous time step in the unrolled reservoir, which is needed to obtain the current state of the reservoir. Note, the input has not been shown in Fig. 2 and Eqn. (3) for convenience in representation. Now, let us consider training of a network in an unrolled state. The basic idea of training is to learn the readout weights so that the output matches the target at each time step. Therefore, the readout weights in the unrolled reservoir system will change over time. For a fixed point target, this implies solving the linear equation for the readout weights at each state before proceeding to calculate the network dynamics for the next state. This simple analysis allows us to track the reservoir activity during training without interfering with or disrupting the overall dynamics.
One might argue that unrolling over time will be computationally expensive and time consuming, since we need to unroll the system for each time step throughout the entire period of simulation to observe the behavior. However, the interval between unrolling instances can vary: we can integrate the state of the system for several time steps before we unroll. Fig. 2 (c) shows the system unrolled at intervals of several time steps. In this case, the next integration uses the last reservoir state of the previous interval as the unrolled feedback input. This approximation helps us circumvent the computational issue while still allowing us to perform the stability analysis for each unrolled reservoir state. Note, for integrated unrolling, the last reservoir state after every unrolling (see Fig. 2 (c)) is used to gauge the stability of the system. Next, we describe the eigenvalue (EV) spectra method that measures the stability of the unrolled reservoir dynamics.
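One way to picture the difference between unrolling at every step and integrated unrolling is the sketch below. It is an interpretation rather than the authors' implementation: it assumes forward Euler dynamics with tanh units, and it models integrated unrolling by refreshing the feedback input only at the unrolling instants and holding it fixed in between. The names `closed_loop`, `unrolled`, and `unroll_every` are hypothetical.

```python
import numpy as np

def closed_loop(J, w_fb, w_out, x0, steps, dt=0.1):
    """Reference closed loop run: the output is fed back at every step."""
    x = x0.copy()
    traj = np.empty((steps, x.size))
    for t in range(steps):
        r = np.tanh(x)
        z = w_out @ r
        x = x + dt * (-x + J @ r + w_fb * z)
        traj[t] = x
    return traj

def unrolled(J, w_fb, w_out, x0, steps, unroll_every=1, dt=0.1):
    """Open loop blocks: the feedback input is refreshed only at unrolling instants."""
    x = x0.copy()
    fb_in = 0.0
    traj = np.empty((steps, x.size))
    for t in range(steps):
        if t % unroll_every == 0:
            fb_in = w_out @ np.tanh(x)  # output of the previous block feeds the next one
        x = x + dt * (-x + J @ np.tanh(x) + w_fb * fb_in)
        traj[t] = x
    return traj

rng = np.random.default_rng(0)
n, g = 100, 1.5
J = rng.normal(0.0, g / np.sqrt(n), size=(n, n))
w_fb = rng.uniform(-1.0, 1.0, size=n)
w_out = rng.normal(0.0, 1.0 / np.sqrt(n), size=n)
x0 = rng.normal(size=n)

reference = closed_loop(J, w_fb, w_out, x0, steps=200)
deviation = {k: np.max(np.abs(unrolled(J, w_fb, w_out, x0, 200, k) - reference))
             for k in (1, 10)}
```

Under these assumptions, unrolling at every step reproduces the closed loop trajectory, while holding the feedback over ten steps makes the trajectory drift away from it.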
4 Eigenvalue (EV) Spectra: Stability Evaluation Criteria
EV spectra are a powerful tool of random matrix theory that allows one to examine the complex behavior of reservoir networks with random recurrent connections. By diagonalizing the synaptic weight matrix of the reservoir, we obtain complex modes that represent the activity (specifically, the frequency of oscillation) of each neuron in the reservoir (Rajan & Abbott, 2006; Rajan, 2009). Each mode is denoted by a complex eigenvalue, where the real part denotes the decay rate of the neuronal oscillation and the imaginary part denotes the frequency of oscillation. It has been shown in several works that a neuron or mode whose eigenvalue has a real part greater than 1 exhibits long-lasting oscillatory behavior representative of chaos (Rajan & Abbott, 2006; Rajan, 2009). The authors in (Panda & Srinivasa, 2018; Panda & Roy, 2017; Rajan, 2009) have shown that a reservoir model with good ‘memory’ must operate in a region between singular fixed point activity and complete chaos. In the context of learning, the reservoir’s activity, which is generally chaotic at the beginning of training, must converge to stable states as training proceeds. That is, there should be fewer modes with a real part greater than 1 as training progresses. This suggests that the EV spectral circle must shrink to ensure the success of training. In fact, recent works that use novel plasticity rules to train the recurrent weights of the reservoir have demonstrated the effectiveness of their learning methodology with a shrinking EV spectral circle (Panda & Srinivasa, 2018; Panda & Roy, 2017).
We take this EV spectra evaluation criterion and apply it to analyze the activity of the reservoir at different unrolled instances. Linearizing the reservoir’s dynamics models the diagonalization of the synaptic weights, which in turn determines the EV spectra. Linearizing Eqn. (1), which is a closed loop system, gives
(4) 
In fact, Rivkind et al. analyzed the EV spectra of the trained reservoir using Eqn. (4). In our case, as mentioned earlier, we analyse the network activity after every unrolling to gauge the stability of the system. Linearizing the unrolling dynamics (Eqn. (3)) yields
(5) 
We would like to emphasize that is the reservoir activity from the previous time step or unrolling instance. In case of integrated unrolling (refer Fig. 2 (c)), the EV spectra of the reservoir state after the second unrolling at uses value from last time step of the previous integrated unrolled instance. Note, in our simulations done in MATLAB, we use the available tool to plot the EV spectra. This translates Eqn. (5) to in the simulation framework (note, proper usage of and functions (not shown here) are necessary to maintain the dimensionality of the matrices during implementation).
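As a rough illustration of how an EV spectrum and its radius can be computed numerically, the sketch below uses NumPy rather than MATLAB. The helper `linearized_matrix` is an assumption: it uses the generic form (J + w_fb w_out^T) diag(phi'(x)) for tanh units, which is a common way to linearize such a feedback network and is not claimed to reproduce Eqn. (4) or Eqn. (5) exactly.

```python
import numpy as np

def spectral_radius(M):
    """Largest eigenvalue magnitude, i.e. the radius of the EV spectrum."""
    return np.max(np.abs(np.linalg.eigvals(M)))

def linearized_matrix(J, w_fb, w_out, x):
    """Assumed linearization: (J + w_fb w_out^T) diag(phi'(x)) with phi = tanh."""
    gain = 1.0 - np.tanh(x) ** 2              # derivative of tanh at the current state
    return (J + np.outer(w_fb, w_out)) * gain[np.newaxis, :]

rng = np.random.default_rng(0)
n = 400
radii = {}
for g in (0.9, 1.2, 1.5):
    J = rng.normal(0.0, g / np.sqrt(n), size=(n, n))
    radii[g] = spectral_radius(J)             # close to g for large n

# Saturated units have a small tanh derivative, which scales the spectrum down.
x_saturated = np.full(n, 2.0)
shrunk = spectral_radius(linearized_matrix(J, np.zeros(n), np.zeros(n), x_saturated))
```

For a purely random J the radius sits near the gain g, and in this linearization the radius falls as units move into the saturated part of the nonlinearity, which is one way a spectral circle can shrink.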
5 Results
5.1 EV spectra to evaluate training stability
We conducted reservoir training with the unrolling BCL methodology and gauged the stability of training by evaluating the EV spectra. First, we trained a reservoir of 1000 neurons to generate a fixed point target with least squares regression training for varying values of $g$. Since the fixed point target is a simple target, the reservoir could be trained easily. As a result, we did not have to use integrated unrolling. The results are shown in Fig. 3. Note, the accuracy of the reservoir undergoing BCL unrolling is the same as that obtained by training a Closed Loop (CL) reservoir system for all experiments in Fig. 3.
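The least squares step for a fixed point target can be sketched as below. The firing rates here are synthetic stand-ins rather than states from a simulated reservoir, and the target value `A`, the sample count, and the function name `fit_readout` are all hypothetical choices for illustration.

```python
import numpy as np

def fit_readout(R, target):
    """Least squares readout so that R @ w_out approximates target.

    R holds one firing-rate vector per row (shape: samples x neurons).
    """
    w_out, *_ = np.linalg.lstsq(R, target, rcond=None)
    return w_out

rng = np.random.default_rng(1)
R = np.tanh(rng.normal(size=(50, 200)))  # 50 sampled states of a 200-neuron reservoir
A = 1.2                                   # hypothetical fixed point target value
w_out = fit_readout(R, np.full(50, A))
outputs = R @ w_out
```

With fewer samples than neurons the system is underdetermined, so `lstsq` returns the minimum-norm weights that hit the target exactly on the sampled states.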
Fig. 3 (Column 1) illustrates the EV spectra measured from the reservoir dynamics at different time steps. The spectral radius decreases with increasing time, implying that the training algorithm succeeds in converging the reservoir’s chaotic state to stable fixed point activity. To quantify this further, we measured the radius of the spectrum at the initial and final time step for each case. Table 1 lists the radii, which further demonstrate the shrinking of the EV spectra during training. In Fig. 3 (Column 2), we plot the EV spectra of the reservoir system using Rivkind et al.’s Closed Loop (CL) dynamics (Eqn. (2)). We use the open source code available from the authors to perform this analysis, with the same operating conditions across our method and theirs for a like-for-like comparison. In Fig. 3 (Column 3), we plot the EV spectrum obtained at the final time step or unrolling instance from our BCL methodology and compare it with the spectrum of Rivkind et al.’s CL method. The EV spectra coincide, validating our methodology and stability analysis. Rivkind et al. use the EV spectrum to show that the dynamics of the reservoir are in a stable regime after training. We get the same spectrum with BCL unrolling at the final step (with a slowly evolving spectrum in the intermediate time steps). This establishes the effectiveness of BCL to gauge the stability of a reservoir undergoing training.
Furthermore, the coincidence of the final time step EV spectrum from our analysis with that of Rivkind et al. implies that our stability analysis at each unrolling time step of an open loop system provides a rigorous assessment of the whole closed loop feedback system. We observe similar behavior across all values of $g$. An interesting observation here is that the total time for convergence increases with increasing $g$. This is expected as $g$ determines chaotic activity: a reservoir with abundant chaotic projections will take more time to converge during training. Please note, due to the numerical nature of the simulations, we imposed a stopping criterion consisting of an upper bound on the maximum time period of convergence or an upper bound on the weight difference between consecutive time steps.
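Table 1: Radius of the EV spectrum at the initial and final time steps of training.
Target         g     Initial radius   Final radius
Fixed Point    0.9   0.779            0.587
Fixed Point    1.2   0.963            0.708
Fixed Point    1.5   1.176            0.814
Time-Varying   1.5   1.266            1.036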
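To make the shrinkage in Table 1 easier to compare across rows, the short calculation below converts each pair of radii into a percentage reduction. The reading of the two numeric columns as initial and final radius follows the description in the text; the dictionary layout is only an illustrative choice.

```python
# (target, g) -> (initial radius, final radius), copied from Table 1
radii = {
    ("fixed point", 0.9): (0.779, 0.587),
    ("fixed point", 1.2): (0.963, 0.708),
    ("fixed point", 1.5): (1.176, 0.814),
    ("time varying", 1.5): (1.266, 1.036),
}

# Percentage by which the spectral radius shrinks between the two time steps
shrinkage = {
    key: round(100.0 * (1.0 - final / initial), 1)
    for key, (initial, final) in radii.items()
}

for (target, g), pct in shrinkage.items():
    print(f"{target:12s} g={g}: radius reduced by {pct}%")
```

The reductions come out at roughly 24.6%, 26.5%, and 30.8% for the fixed point targets and 18.2% for the time varying target.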
Next, we analyzed a reservoir being trained on a time varying target to generate a sinusoidal pattern. We simulated a reservoir of 1000 neurons (with $g = 1.5$) with FORCE training. Fig. 4 shows the EV spectra observed at several time steps during training. The spectrum shrinks over time (also quantified by the radii in Table 1). In fact, the spectra at the final time steps match completely, signifying the convergence of the training algorithm. We also see that the output activity during training and testing matches the target, empirically indicating the success of the training rule. In the analysis of Fig. 4, the network was unrolled at each time step. As a result, the simulation time was longer (2 minutes of wall-clock time). In contrast, using integrated unrolling over time (as discussed in Section 3) decreases the overall simulation time at the cost of degraded convergence.
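The readout update at the core of FORCE training is a recursive least squares step, sketched below following the update rule of Sussillo & Abbott (2009). The example applies it to a single synthetic firing-rate vector rather than to a running reservoir, and the regularization constant `alpha`, the target value, and the function name `force_step` are assumptions for illustration.

```python
import numpy as np

def force_step(w_out, P, r, target):
    """One recursive least squares update of the readout weights."""
    Pr = P @ r
    P = P - np.outer(Pr, Pr) / (1.0 + r @ Pr)  # update the inverse correlation estimate
    err = w_out @ r - target                    # error before the weight update
    w_out = w_out - err * (P @ r)
    return w_out, P, err

rng = np.random.default_rng(0)
n = 50
alpha = 1.0                                     # assumed regularization constant
r = np.tanh(rng.normal(size=n))                 # synthetic firing-rate vector
w_out = np.zeros(n)
P = np.eye(n) / alpha

w_out, P, err_before = force_step(w_out, P, r, target=0.5)
err_after = w_out @ r - 0.5
```

For a repeated firing-rate vector, one update divides the error by 1 + r^T P r, which is why the output is pulled close to the target from the first steps of training.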
Fig. 5 compares the output activity and target for varying integrated unrolling intervals. We see that as the unrolling takes place at increasingly interleaved time steps, the output activity fails to match the target activity during testing. While the curves match during training, the testing fails, with the output activity shifting significantly away from the target as the integrated time between unrolling increases. The shift during testing is indicative of a decline in accuracy. This is expected, as the unrolling behavior approximates the network dynamics, and with interleaved unrolling certain significant aspects of the temporal dynamics might get affected. Please note, the EV spectra for the reservoir dynamics corresponding to Fig. 5 (not shown), in particular for the longer unrolling intervals, do not exhibit shrinking behavior, which further corroborates the empirical results seen in Fig. 5. We, therefore, recommend unrolling at each time step to attain reliable results.
We would like to emphasize that the presented analysis is the first work to show the stability of training a recurrent model with a time varying target. We would also like to note that our experiments were restricted to simple fixed-point or time-varying targets due to the intensive numerical nature of the simulations as well as the limitations of reservoir training methods. We believe that our analysis can be extended to complex problems on real-world datasets, given that the reservoir training in such cases can be done effectively.
5.2 PCA to analyse BCL methodology
Rajan et al. (Rajan et al., 2010a) demonstrated that reservoirs (that have converged to a given state) exhibit chaotic as well as stable periodic activity. They use PCA to analyze the network activity and visualize the stable and chaotic trajectories. The network state at any given time instant can be described by a point in the N-dimensional space with coordinates corresponding to the firing rates of the N neuronal units. With time, the network activity traverses a trajectory in this N-dimensional space and PCA can be used to visualize the trajectory. To conduct PCA, we diagonalize the equal-time cross-correlation matrix of the firing rates of the N units as
$$D_{ij} = \left\langle \left(r_i(t) - \langle r_i \rangle\right)\left(r_j(t) - \langle r_j \rangle\right) \right\rangle \tag{6}$$
where the angle brackets, $\langle \cdot \rangle$, denote time average. The eigenvalues of the matrix $D$ (specifically, $\lambda_a / \sum_b \lambda_b$, where $\lambda_a$ is the eigenvalue corresponding to principal component $a$) indicate the contribution of different Principal Components (PCs) toward the fluctuations/total variance in the spontaneous activity of the network. Rajan et al. observed that the network activity shows fluctuating patterns and irregular trajectories in the higher order PCs, characteristic of chaos. In contrast, the trajectories for lower order PCs are more regular and nonfluctuating, characteristic of stability. To further corroborate the effectiveness of BCL for analyzing the training stability of reservoir models, we plotted the PCs for different components (PC 1, 2, 3, 41, 42) in Fig. 6 for a model trained with a fixed point target (corresponding to Fig. 5 (a)). Note, the network activity was observed after training using the BCL unrolling methodology. We observe slowly fluctuating patterns for PC 1, 2, 3, and extensively fluctuating patterns for PC 41, 42.
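The PCA procedure described here can be sketched as follows, assuming the cross-correlation matrix is the time-averaged covariance of mean-subtracted firing rates. The data are synthetic (one slow sinusoidal component plus small noise) rather than reservoir activity, and the function name `pca_variance_fractions` is hypothetical.

```python
import numpy as np

def pca_variance_fractions(rates):
    """rates has shape (time steps, neurons).

    Returns the fraction of variance per principal component (descending)
    and the matching principal directions as columns.
    """
    centered = rates - rates.mean(axis=0)
    D = centered.T @ centered / rates.shape[0]   # time-averaged covariance D_ij
    evals, evecs = np.linalg.eigh(D)             # D is symmetric
    order = np.argsort(evals)[::-1]
    evals = np.clip(evals[order], 0.0, None)     # guard against tiny negative round-off
    return evals / evals.sum(), evecs[:, order]

rng = np.random.default_rng(0)
t = np.linspace(0.0, 20.0 * np.pi, 1000)
pattern = rng.normal(size=20)
rates = np.outer(np.sin(t), pattern) + 0.05 * rng.normal(size=(1000, 20))

fractions, components = pca_variance_fractions(rates)
pc1_trajectory = (rates - rates.mean(axis=0)) @ components[:, 0]
```

Projecting the centered activity onto a column of `components` gives the trajectory along that PC, which is the kind of curve plotted per component in a figure such as Fig. 6.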
Plotting the PC curves for the reservoir trained with CL dynamics as proposed by Rivkind et al., we observe a nonfluctuating pattern of activity across both higher and lower order PCs. This suggests an absence of chaotic activity in the system. However, as Rajan et al. have demonstrated, there will always be chaos in a reservoir coinciding with stable patterns of activity. We believe that the assumption Rivkind et al. make regarding the convergence of the system to a stable state before analyzing the network activity (Eqn. (2)) causes this discrepancy. The PCA results further establish the correctness of our methodology in preserving the integrity of the system and its temporal dynamics.
6 Conclusion
We present a first-of-its-kind methodology to analyze the stability of reservoir models (with feedback) during training. Essentially, we unroll the reservoir dynamics over time and analyze the eigenvalue spectra of the reservoir. The shrinking spectra during training underscore the success of the training methodology and signify the convergence of the reservoir’s chaotic activity to stable states. To minimize the number of unrolling time steps for large scale simulations, we also presented integrated unrolling over time. However, we observed that the accuracy of the system degrades when unrolling is done over longer time intervals. We showed the effectiveness of our proposed methodology for training reservoir models on fixed point as well as time varying targets. Our analysis establishes eigenvalue spectra as a reliable metric, and breaking the closed loop as a reliable technique, to gauge the stability of training in reservoir models. In the future, we would like to extend this proposal to analyze the robustness of the network prediction and examine the interpretability of the network’s behavior during and after training.
Acknowledgement
The work was supported in part by the Center for Brain-inspired Computing Enabling Autonomous Intelligence (C-BRIC), a DARPA sponsored JUMP center, by the SRC, the NSF, Intel Corporation, the DoD Vannevar Bush Fellowship and by the U.S. Army Research Laboratory and the U.K. Ministry of Defense under Agreement Number W911NF-16-3-0001.
References
Abbott et al. (2011) Abbott, L. F., Rajan, K., and Sompolinsky, H. Interactions between intrinsic and stimulus-evoked activity in recurrent neural networks. The Dynamic brain: an exploration of neuronal variability and its functional significance (Ding M, Glanzman DL, eds), pp. 65–82, 2011.
Panda & Roy (2017) Panda, P. and Roy, K. Learning to generate sequences with combination of Hebbian and non-Hebbian plasticity in recurrent spiking neural networks. Frontiers in neuroscience, 11:693, 2017.
Panda & Srinivasa (2018) Panda, P. and Srinivasa, N. Learning to recognize actions from limited training examples using a recurrent spiking neural model. Frontiers in neuroscience, 12:126, 2018.
Rajan (2009) Rajan, K. Spontaneous and stimulus-driven network dynamics. Columbia University, 2009.
Rajan & Abbott (2006) Rajan, K. and Abbott, L. Eigenvalue spectra of random matrices for neural networks. Physical review letters, 97(18):188104, 2006.
Rajan et al. (2010a) Rajan, K., Abbott, L., and Sompolinsky, H. Inferring stimulus selectivity from the spatial structure of neural network dynamics. In Advances in Neural Information Processing Systems, pp. 1975–1983, 2010a.
Rajan et al. (2010b) Rajan, K., Abbott, L., and Sompolinsky, H. Stimulus-dependent suppression of chaos in recurrent neural networks. Physical Review E, 82(1):011903, 2010b.
Rivkind & Barak (2017) Rivkind, A. and Barak, O. Local dynamics in trained recurrent neural networks. Physical review letters, 118(25):258101, 2017.
Sompolinsky et al. (1988) Sompolinsky, H., Crisanti, A., and Sommers, H.-J. Chaos in random neural networks. Physical review letters, 61(3):259, 1988.
Stern et al. (2014) Stern, M., Sompolinsky, H., and Abbott, L. Dynamics of random neural networks with bistable units. Physical Review E, 90(6):062710, 2014.
Sussillo & Abbott (2009) Sussillo, D. and Abbott, L. F. Generating coherent patterns of activity from chaotic neural networks. Neuron, 63(4):544–557, 2009.
Sussillo & Barak (2013) Sussillo, D. and Barak, O. Opening the black box: low-dimensional dynamics in high-dimensional recurrent neural networks. Neural computation, 25(3):626–649, 2013.