1 Introduction
The conceptualization of systems within a network framework has become popular within the last decades, see Kolaczyk (2009) for a broad overview. This is mostly because network models provide useful tools for describing complex dependence structures and are applicable to a wide variety of research fields. In the network approach, the mathematical structure of a graph is utilized to model network data. A graph is defined as a set of nodes and relational information (ties) between them. Within this concept, nodes can represent individuals, countries or general entities, while ties are connections between those nodes. Dependent on the context, these connections can represent friendships in a school (Raabe et al., 2019), transfers of goods between countries (Ward et al., 2013), sexual relations between people (Bearman et al., 2004) or hyperlinks between websites (Leskovec et al., 2009) to name just a few. Given a suitable data structure for the system of interest, the conceptualization as a network enables analyzing dependencies between ties. A central statistical model that allows this is the Exponential Random Graph Model (ERGM, Robins and Pattison, 2001). This model permits the inclusion of monadic, dyadic and hyperdyadic features within a regressionlike framework.
Although the model allows for an insightful investigation of withinnetwork dependencies, most realworld systems are typically more complex. This is especially true if a temporal dimension is added, which is relevant, as most systems commonly described as networks evolve dynamically over time. It can even be argued that most static networks are de facto not static but snapshots of a dynamic process. A friendship network, e.g., typically evolves over time and influences like reciprocity often can be found to follow a natural chronological order.
Of course, this is not the first paper concerned with reviewing temporal network models. Goldenberg et al. (2010) wrote a general survey covering a wide range of models. The authors laid the foundation for further articles and postulated a soft division of statistical network models into latent space (Hoff et al., 2002) and models (Holland and Leinhardt, 1981), all originating in the EdösRényiGilbert random graph models (Erdös and Rényi, 1959). Kim et al. (2018) give a contemporary update on the field of dynamic models building on latent variables. Snijders (2005) discusses continuous time models and reframes the independence and reciprocity model as a Stochastic Actor oriented Model (SAOM, Snijders, 1996). Block et al. (2018) provide an indepth comparison of the Temporal Exponential Random Graph Model (TERGM, Hanneke et al., 2010) and the SAOM with special focus on the treatment of time. Further, the ERGM and SAOM for networks which are observed at single time points are contrasted by Block et al. (2019), deriving theoretical guidelines for model selection based on the differing mechanics implied by each model.
In the context of this compendium of articles, the scope is to give an update on the dynamic variant of the second strand of models relating to models. We therefore extend the summarizing diagram of Goldenberg et al. (2010) as depicted in Figure 1. Generally, we divide temporal models into two sections, by differentiating between discrete and continuous time network models.
Statistical models for time discrete data rely on a Markov chain assumption and condition the state of the network at time point
on previous states. This includes the TERGM and the Separable TERGM (STERGM, Krivitsky and Handcock, 2014). There exists a wide range of recent applications of the TERGM. White et al. (2018) use a TERGM for modeling epidemic disease outcomes and Blank et al. (2017) investigate interstate conflicts. In He et al. (2019) Chinese patent trade networks are inspected and Benton and You (2017) use a TERGM for analyzing shareholder activism. Applications of STERGMs are given for example by Stansfield et al. (2019) that model sexual relationships and Broekel and Bednarz (2019) that study the network of research and development cooperation between German firms.In case of timecontinuous data, the model regards the network as a continuously evolving system. Although this evolution is not necessarily observed in continuous time, the process is taken to be latent and explicitly models the evolution from the state of the network at time point to (Block et al., 2018). In this paper we discuss the relational event model (REM, Butts, 2008) for the analysis of event data. Eventually, the REM is adapted to timediscrete observations of networks. That is, we observe the timecontinuous developments of the network at discrete observation times only. Applications of the REM for nonclustered observations are manifold and range from explaining the dynamics of health behavior sentiments via Twitter (Salathé et al., 2013), interhospital patient transfers (Vu et al., 2017), online learning platforms (Vu et al., 2015), and animal behavior (Tranmer et al., 2015) to structures of project teams (Quintane et al., 2013).
The paper is structured as follows. In Section 2 we present the international arms trade network of major conventional weapons (MCW) that will be analyzed as an illustrative example and give basic definitions that are used throughout the paper. After that, Section 3 introduces timediscrete and Section 4 timecontinuous network models. In Section 5 further models are shortly discussed and differences between the proposed models are exhibited.
2 Definitions and Data Description
Descpription  Year  

Time  2016  2017  
Number of countries included  148  148  
Number of possible ties  21 756  21 756  
Density  0.020  0.019  
Transitivity  0.202  0.207  
Reciprocity  0.085  0.087 
As a running example throughout this paper, we use data on international arms trading. The arms trading data are provided in a comprehensive database by the Stockholm International Peace Research Institute (SIPRI, 2018) and includes data on the exchange of major conventional weapons (MCW) together with the volume of each transfer. Since this article only regards binary network models, the trade network is discretized with a threshold of zero. This means that a tie from actor to actor indicates that the sender country traded with a receiver country in the respective year. This information can then be represented in an adjacency matrix , where represents the set of all possible networks with nodes, in our example countries. The entry of is ”1” if country sold MCW to country in year and ”0” otherwise. Further, the discrete time points of the observations of are denoted as . For demonstration purposes, we restrict our analysis to two time points only and consider the years 2016 and 2017. Hence we look at annual changes of the network structure and set . In many networks including our running example self loops are meaningless. We therefore fix throughout the article. Further, all subscripted indices () are assumed to be discrete and and all indices in brackets () continuous. The temporal indicator denotes the observation times of the network and to notationally differ this from timecontinuous model we write for continuous time.
Table 1 gives some descriptive measures and Figure 2 visualizes the arms trade network. There are no compositional changes of the involved countries, whereby the number of possible ties stays the same as well. The density of a network is the proportion of realized edges out of all possible edges and is similar in both years, indicating the sparsity of the modeled network. Clustering can be expressed by the transitivity measure, providing the percentage of connected triplets out of all possible triplets. The reciprocity of a graph is the ratio of reciprocated ties in a graph and is similar in both years, see Annex A for a description of the degree distribution (Csardi and Nepusz, 2006).
Additionally, information on different kinds of exogenous covariates may be controlled for in statistical network models. In the given example we use the logarithmic Gross Domestic Product (GDP) (World Bank, 2017) as monadic covariates in respect to the sender and receiver of weapons. We also include the absolute difference of the so called polity IV index (Center for systemic Peace, 2017), ranging from 20 (highest ideological distance) to zero (no ideological distance), as a dyadic exemplary covariate. These covariates are assumed to be nonstochastic and we denote them by .
3 Dynamic Exponential Random Graph Models
3.1 Temporal Exponential Random Graph Model
The Exponential Random Graph Model (ERGM) is certainly among the most popular models for the analysis of static network data. Holland and Leinhardt (1981) introduced the model class, which was subsequently extended with respect to fitting algorithms and network statistics (see Lusher et al., 2012, Robins et al., 2007). Spurred by the popularity of ERGMs, dynamic extensions of this model class emerged, pioneered by Robins and Pattison (2001) who developed timediscrete models for temporally evolving social networks. Before we start with a description of the model, we want to highlight that the TERGM as well as the STERGM are most appropriate for equidistant time points. That is, we observe the networks at discrete and equidistant time points . Only in this setting the parameters allow for a meaningful interpretation. See Block et al. (2018) for a deeper discussion.
Hanneke et al. (2010) is the main reference for the TERGM, a model class that utilizes the Markov structure and, thereby, assumes that the transition of a network from time point to time point
can be explained by exogenous covariates as well as structural components of preceding networks. We assume a first order Markov dependence structure that applies to probability distributions
with parameter vector
. Conditioning on the first network, the resulting dependence structure of the model can be factorized into(1) 
Depending on the phenomenon of interest, it is also possible to allow for different parameter vectors for each transition probability (i.e. , ). Given the dependence structure (1), the TERGM assumes that the transition from to is generated according to an exponential random graph distribution with the parameter :
(2) 
Generally, specifies a dimensional function of sufficient network statistics which may depend on the previous network as well as on covariates. These network statistics can include static components, designed for crosssectional dependence structures, e.g., outdegree, indegree, reciprocity or transitivity (see Morris et al., 2008 for more examples). However, the formulation explicitly allows for temporal interactions, e.g. delayed reciprocity
(3) 
This statistic governs the tendency whether a tie in will be reciprocated in . Another important temporal statistic is stability
(4) 
In this case, the first product in the sum measures whether existing ties in persist in and the second term is one if nonexistent ties in remain nonexistent in . The proportionality sign is used since in many cases the network statistics are scaled into a specific interval (e.g. or ). Such a standardization is especially sensible for networks where the actor set changes with time. Additionally, exogenous covariates can be included, e.g., timevarying dyadic covariates
(5) 
There exists an abundance of possibilities for defining interactions between ties in and . From this discussion and equation (2) it also becomes obvious, that in a situation where the interest lies in the transition between two periods , a TERGM can be modelled simply as an ERGM, including lagged network statistics (for example by incorporating as explanatory variable).
Concerning the estimation of the model, maximum likelihood appears to be a natural candidate due to the simple exponential family form (
2). However, the normalization constant in the denominator of model (2) often poses an inhibiting obstacle when estimating (T)ERGMs. This can be seen by inspecting the normalization constant , that requires summation over all possible networks . This task is virtually infeasible, except for very small networks. Therefore, Markov Chain Monte Carlo (MCMC) methods have been proposed in order to approximate the logarithmic likelihood function (see Geyer and Thompson (1992) for Monte Carlo maximum likelihood and Hummel et al. (2012) for its adaption to ERGMs). A notable special case arises if the network statistics are restricted such that they decompose to(6) 
with being a function that is evaluated only at the lagged network and covariates for tie . With this restriction, we impose that the ties in are independent, conditional on the network structures in
. This greatly simplifies the estimation procedure and allows to fit the model as a logistic regression model (see for example
Almquist and Butts, 2014).3.2 Separable Temporal Exponential Random Graph Model
A useful extension of the TERGM model (2) is the STERGM proposed by Krivitsky and Handcock (2014). This model can be motivated by the fact that the stability term leads to an ambiguous interpretation of its corresponding parameter. Given that we include (4) in a TERGM and obtain a positive coefficient after fitting the model it is not clear whether the network can be regarded as ”stable” because existing ties are not dissolved (i.e. ) or because no new ties are formed (i.e. ). To disentangle this, the authors propose a model that allows for the separation of formation and dissolution.
Krivitsky and Handcock (2014) define the formation network as , being the network that consists of the initial network together with all ties that are newly added in . The dissolution network is given by and contains exclusively ties that are present in and . Given the network in together with the formation and the dissolution network we can then uniquely reconstruct the network in , since . Define as the joint parameter vector that contains the parameters of the formation and the dissolution model. Building on that, Krivitsky and Handcock (2014) define their model to be separable in the sense that the parameter space of is the product of the parameter spaces of and together with conditional independence of formation and dissolution given the network in :
(7) 
The structure of the model is visualized in Figure 3. On the left hand side the state of the network is given, consisting of two ties and . In the formation network (top in the middle plot) all ties that could possibly be formed are shown in dashed and the actual formation in this example is shown in solid. On the bottom, the two ties that could possibly be dissolved are shown and in this example () persists while is dissolved. On the right hand side of Figure 3 the resulting network at time point is displayed.
Given this structure and the separability assumption (3.2), it is assumed that a TERGM structure (2) is appropriate for both, the formation and the dissolution process. For practical reasons it is important to understand that the term ”dissolution” model is somewhat misleading since a positive coefficient in the dissolution model implies that nodes (or dyads) with high values for this statistic are less likely to dissolve. This is also the standard implementation in software packages, but can simply be changed by switching the signs of the parameters in the dissolution model.
3.3 Software and Application
When it comes to software, there exist essentially two main R packages that are designed for fitting TERGMs and STERGMs. Most important is the extensive statnet library (Goodreau et al., 2008) that allows for simulationbased fitting of ERGMs (which can be interpreted as TERGMs when including lagged network statistics). The library contains the package tergm with implemented methods for fitting STERGMs using conditional maximum likelihood. However, currently the package tergm (version 3.5.2) does not allow for fitting STERGMs with timevarying dyadic covariates for more than two time periods jointly. The package btergm (Leifeld et al., 2018) is designed for fitting TERGMs as shown in equations (2) using either maximum pseudolikelihood (often regarded as unreliable, see vanduijn2009) or MCMC maximum likelihood estimation routines.
In order to ensure comparable estimates we estimate the TERGM as well as the STERGM with the statnet library, using MCMC based likelihood inference techniques. We use the package ergm and include the lagged previous network as a dyadic covariate, which is in fact equivalent to the stability term (4) after some reformulation (see Block et al., 2018). The STERGM is fitted using the tergm package.
TERGM  STERGM  
Formation  Dissolution  
Lagged Network 
    
    
Edges  
Reciprocity  
OutDegree Sender (Geometrically weighted)  
InDegree Receiver (Geometrically weighted)  
Edgewise Shared Partners (Geometrically weighted)  
log(GDP) Sender  
log(GDP) Receiver  
Polity Score (Absolute Difference)  
Log Likelihood 
945.282  663.287  258.293 
AIC  1908.564  1342.574  532.585 
AIC  1908.564  1875.159 
Comparison of parameters obtained from the TERGM (first column) and the STERGM (Formation in the second column, Dissolution in the third column). Standard errors in brackets and stars according to
values smaller than (), () and (). Decay parameter of the geometrically weighted statistics is set to .The results obtained for the arms trading data discussed in the previous section are displayed in Table 2. For a detailed interpretation of effects focusing on political, social, and economic aspects we refer to the relevant literature (see e.g. Thurner et al., 2018). Here we want to comment on a few aspects only. First of all, concerning the general interpretation, note that the STERGM coefficients are implicitly dynamic because the corresponding statistics are evaluated either on the formation or the dissolution network and both are formed with the networks in and . In contrast to that, in the TERGM (first column), except the lagged stability term all network statistics are evaluated on the network in . Note further, that the TERGM coefficients try to explain the network structure in based on , while the STERGM coefficients provide information either on the formation or the dissolution.
Given that, it is not surprising that the coefficients can substantially differ in terms of significance and sign of the coefficients. For example, the statistic geometrically weighted Indegree of the receiver has a coefficient that is high in absolute terms and a low value in the TERGM. However, the effect is mainly driven by the formation, which can be seen by a weak and insignificant effect in the dissolution model but an even stronger and significant effect in the formation model. Hence, the TERGM suggests that nodes with a relatively high indegree are overall rather unlikely. However, this does not apply for the persistence of ties, where a high indegree of the receiver only slightly weakens the probability of retaining the import relationship.
Similarly, we find that sending countries with high outdegrees are rare in the network, see the significant and low values for the geometrically weighted outdegree. However, although it is unlikely that a country with a high outdegree adds a new edge in the formation process, the effect is insignificant in the dissolution model. This indicates that in the dissolution model, the persistence of ties is not strongly driven by the receivers’ outdegree.
Comparable effects can be found for the exogenous covariates.Consider, for instance, the coefficient of the logarithmic GDP of the importing country. The TERGM assigns a higher probability to observing ingoing ties to countries with a high GDP. However, disentangling the model towards formation and dissolution we see highly significant coefficients in the dissolution model while the effect for the formation model is insignificant.
Overall we observe that the STERGM allows to decompose the dynamics, which can also be quantified by the AIC as a model selection criterion. Based on the independence assumption in (3.2) we can sum up the two AIC values and see that the AIC value of the STERGM is smaller than of the TERGM.
4 Relational Event Model
4.1 TimeContinuous Event Processes
The second type of dynamic network models results by comprehending network changes as a continuously evolving process (see Girardin and Limnios, 2018 as a basic reference for stochastic processes). The idea was originally introduced by Holland and Leinhardt (1977). According to their view, tie changes are not occurring at discrete time points but as a continuously evolving process, where only one tie can occur at a time. This framework was extended by Butts (2008) to model behavior, which is understood as a directed event at a specific time, that potentially depends on the past. For instance, country sending weapons to at a given time point is a behavior, hereinafter called event. The overall aim is to understand the dynamic structure of events conditional on the information of the past (Lerner et al., 2013).
To model the event based approach, we leverage results from the field of timetoevent analysis, or survival analysis respectively (see, e.g., Kalbfleisch and Prentice, 2002 for an overview of timetoevent models). The central concept of this framework can be motivated by the introduction of a multivariate timecontinuous Poisson counting process
(8) 
where counts how often actors and interacted in . Note that we indicate continuous time with a tilde to distinguish from the discrete time setting with assumed in the previous section. Process (8) is characterized by an intensity function for , which is defined as:
This is the instantaneous probability of observing a jump of size ”1” in , which indicates observing the event at time . Since we assume that there are no selfloops holds.
4.2 TimeContinuous Observations
Butts (2008) introduced the Relational Event Model (REM) to analyze the intensity when timestamped data on the events are available. He assumed that the intensity is constant over time but depends on timevarying relational information of past events and exogenous covariates. Vu et al. (2011) extended the model by postulating a semiparametric intensity similar to Cox (1972):
(9) 
where is an arbitrary baseline intensity, the parameter vector and a statistic that depends on the (possibly timecontinuous) covariate process and the counting process just prior to . Examples for are the out and indegree of countries and .
To understand the relational nature of the observed events, model (9) takes a local timecontinuous point of view, whereby all global structural effects are assumed to originate on the dyadic level and become global by aggregation of multiple similar dyadic effects (Stadtfeld, 2018). This differing level of modeling necessitates defining the statistics on a dyadic level. To give an example, the dyadic version of reciprocity for the event now regards, whether already having observed the event prior to has an effect on , in comparison to the network level version (3) that counted the number of reciprocated ties between and . Therefore, the mathematical formulation is straightforward:
where is the indicator function. Since the effect of a past event at time , say, on a present event at time may vary according to the elapsed time , Stadtfeld and Block (2017) introduced windowed effects, which only regard events that occurred in a prespecified time window, e.g. a year. We will come back to this point in the next section.
In case of survival data, Cox (1972) introduced the partial likelihood to estimate without having to specify a parametric form of the baseline hazard nor a distribution on the times between events. In the same way, can be estimated with a Nelson Aalen estimator (see Kalbfleisch and Prentice, 2002 for further details on the estimation).
Extensions of this model building on already well established methods in social network and timetoevent analysis were numerously proposed. Perry and Wolfe (2013) used a stratified Cox model in (9) and allowed multicastevents, which are events that are possibly directed at multiple receivers. Stadtfeld et al. (2017) adopted the Stochastic Actor oriented Model (SAOM) to events. DuBois and Smyth (2010) and DuBois et al. (2013) extended the Stochastic Block Model (SBM) for timestamped relational events. Further, DuBois et al. (2013) adopted a Bayesian hierarchical model to event data when information is only available in smaller groups.
4.3 TimeClustered Observations
Generally, the approach discussed above requires timestamped network data, meaning that we observe the precise time points of all events. For the running example this means that we need the exact time point of an arms trade between country and . Often, such exact timestamped data are not available and, in fact, trading between states can hardly be stamped with a single time point . Indeed, we often only observe the timecontinuous network process at discrete time points . In such setting, we may assume some kind of Markov structure in that we do not look at the entire history of the process but just model the intensity (9) in the time frame between and . Let therefore be adapted to and for . We then reframe (9) as:
(10) 
In other words, we assume that the intensity of events between and does not depend on states of the multivariate counting process prior to . The history of the counting process is reset after each time interval. This is a reasonable assumption, if one is primarily interested in shortterm dependencies between the individual counting processes.
If we observe the continuous process at discrete time points it is inevitable that we observe time clustered observations, meaning that two or more events happen at the same time point. This is a to some extend inherent problem, as motivated on the basis of the arms trading above. Under the term tied observations this phenomenon is well known in timetoevent analysis and treated with several approximations. We make use of the so called Breslow approximation (see Peto, 1972; Breslow, 1974). Let therefore
where element is replicated times in , that is if an event between and occurred multiple times in the interval from to then appears respective times in . Given that we have not observed the exact time point of an event we also get no information on the baseline intensity in (9) for so that the model simplifies to a discrete choice model structure (see, e.g., Train, 2009) which resembles the partial likelihood by Cox (1972). Let therefore denote the set of all possible ties between countries that may be observed at time point , so that the partial likelihood is defined as:
(11) 
where .
Alternatively, one can replace the denominator in (11) by considering all possible orders of the unobserved events in . Since this can be a combinatorial and hence numerical challenge, some random sampling of time point orders among observations, that are time clustered, can be used as well with subsequent averaging, which we call KalbfleischPrentice approximation (see Kalbfleisch and Prentice, 2002).
4.4 Software and Application
Marcum and Butts (2015) implemented the R package to estimate the REM for timestamped data. It was followed by the package by Stadtfeld and Hollway (2018) for generally modeling timestamped data. The latter package is highly customizable in terms of endogenous user terms and will be used in the following application to the arms trade network.
As mentioned before, we do not have time stamps for the arms trades. While this is an slight misuse of the timecontinuous model, we apply this analysis here for demonstration purposes and to allow for a comparison with the results in the previous chapter. In other words, we either observe an arms trade (i.e. ) or no trade ().
The estimates are shown in Table 3, the first column represents the estimates of the Breslow, whereas the second column regards the estimation via the KalbfleischPrentice approximation with random orders. Regarding the significant terms, the estimates lead to similar conclusions. Only the estimates concerning transitivity, are slightly singnificant in the KalbfleischPrentice approximation but not in the Breslow approximation.
It should be noted that the general interpretation is now on the dyadic level, in comparison to the global interpretation of the effects in section 3.3. Therefore, e.g., the positive effect of the outdegree of the sender translates to a higher intensity of observing if had a higher outdegree.
Breslow  KalbfleischPrentice  

Reciprocity  0.029  0.154 
(0.189)  (0.176)  
Outdegree Sender  0.037  0.032 
(0.004)  (0.004)  
Indegree Receiver  0.153  0.142 
(0.188)  (0.0159 )  
Transitivity  0.033  0.062 
(0.031)  (0.031)  
log(GDP) Sender  0.479  0.441 
(0.037)  (0.038)  
log(GDP) Receiver  0.221  0.184 
(0.03)  (0.031)  
Polity Score (Absolute Difference)  0.033  0.030 
(0.009)  (0.008)  
Log Likelihood  3621.419  3581.731 
AIC  7256.84  7177.46 
Similar to the application of Section 3.3 reciprocated ties are not more likely to occur than nonreciprocated ones judged by their significance. The degreerelated covariates concern the role of centrality in respect to the intensity of an event. Apparently, both a high outdegree of the sender and indegree of the receiver result in a higher intensity, thus spur trade relations. Consequentially, countries that have a high outdegree are more likely to send weapons and countries with a high indegree to receive weapons. It is notable that this interpretation is different but not inconsistent with the findings regarding the geometrically weighted in and outdegree in the TERGM, both having negative coefficients indicating that the network exhibits a general tendency of having rather low out and indegrees. Both estimates indicate an asymmetric degree structure, yet the estimates from Section 3.3 are to be understood on the global level and translate to less countries with high in and outdegree than expected under under a completely random graph. In the REM, on the other hand, the estimates indicate that already having been highly involved in the network makes future trade activity more probable. In contrast, a country that was never active is less likely to send weapons, which again results in the asymmetric degree structure mentioned above.
Local clustering as indicated by the significantly positive parameter of transitivity can not clearly be detected. The respective estimate indicates, that having common trade partners is not a catalyzing factor in trading among countries. Additionally, this effect was not found on the global level of the analysis in Section 3.3, where the analog statistic is called geometrically weighted edgewise shared partners.
Further, we find additional confirmation on the influence of the logarithmic GDP of the sender and receiver on the intensity of a trade, which is in line with Section 3.3. For instance, the economic power of the exporter country has a strong effect on the intensity of receiving weapons.
Lastly, it should be mentioned that the indicated AIC values cannot be compared to the models in Section 3.3.
5 Discussion
5.1 Further models
Snijders (1996) formulated a twostage process model operating in a continuous time framework. The dynamics are considered to evolve according to unobserved microsteps. At first, a sender out of all eligible actors gets the opportunity to change the state of all his outgoing ties. Consecutively, the actor needs to evaluate the probability of changing the present configuration with each possible receiver, which entails each actors knowledge of the complete graph whenever he has the possibility to toggle one of his ties. Lastly, the decision is randomly drawn relative to the probabilities of all possible actions. In general, the SAOM is a well established model for the analysis of social networks, that was successfully applied to a wide array of network data, e.g., in Sociology (Agneessens and Wittek, 2012; de Nooy, 2002), Political Science (Kinne, 2016; Bichler and Franquez, 2014), Economics (Castro et al., 2014), and Psychology (Jason et al., 2014).
Another notable model that can be regarded as a bridge between the ERGM and continuous time models is the Longitudinal ERGM (LERGM, Snijders and Koskinen, 2013; Koskinen et al., 2015). In contrast to the TERGM, the LERGM assumes that the network evolves in microsteps as a continuous time Markov process with an ERGM being its limiting distribution. Similar as in the SAOM, the model builds on randomly assigning the opportunity to change, followed by a function that governs the probability of a tie change.
5.2 Resume
In this article, we put emphasis on two popular dynamic network models, the TERGM and the REM. Comparisons between these models can be drawn on the level at which each implied generating mechanism works, how time perceived, and to what extend withinnetwork and betweennetwork dependence can be analyzed.
The overall aim in the TERGM is to find an adequate distribution of the adjacency matrix including information on previous realizations of the network. In the separable extension the aim remains unchanged, only splitting into two smaller subnetworks that include all possible ties that were and were not present in separately. Contrasting to this aim, the REM tackles the intensity on a dyadic level. Therefore, models from Section 3 take a global and models from Section 4 a local pointofview, which results in substantially different interpretations of the estimates as seen in Sections 3.3 and 4.4.
The most apparent difference is the perception of time in the respective models. Where the TERGM can be framed as a Markov chain model in discrete time, the REM is operating in continuous time, although it is discretized due to the sampling scheme of the international arms trade network.
As a result from viewing the network as either evolving in continuous or discrete time, the possibilities to differentiate between withinnetwork and betweennetwork dependencies are affected. The only model that can clearly isolate these two dependencies is the TERGM, where the withinnetwork dependence is captured by all terms of that are only concerned with , and betweennetwork dependence is controlled for by the terms that only depend on . Due to the separability assumption, whereby all these statistics partially depend on and , this clear cut is not any more possible, as already noted in Section 3.3. Lastly, the model framework in continuous time does not allow this distinction, because the model is solely concerned with the effect of covariates on the intensity of observing the event .
Acknowledgement
The project was supported by the European Cooperation in Science and Technology [COST Action CA15109 (COSTNET)]. We also gratefully acknowledge funding provided by the German Research Foundation (DFG) for the project KA 1188/101: International Trade of Arms: A Network Approach
. Furthermore we like to thank the Munich Center for Machine Learning (MCML) for funding.
References
 Agneessens and Wittek (2012) Agneessens, F. and R. Wittek (2012). Where do intraorganizational advice relations come from? the role of informal status and social capital in social exchange. Social Networks 34(3), 333 – 345.
 Almquist and Butts (2014) Almquist, Z. W. and C. T. Butts (2014). Logistic Network Regression for Scalable Analysis of Networks with joint Edge/Vertex Dynamics. Sociological methodology 44(1), 273–321.
 Bearman et al. (2004) Bearman, P., J. Moody, and K. Stovel (2004). Chains of Affection: The Structure of Adolescent Romantic and Sexual Networks. American Journal of Sociology 110(1), 44–91.
 Benton and You (2017) Benton, R. A. and J. You (2017). Endogenous dynamics in contentious fields: Evidence from the shareholder activism network, 2006–2013. Socius 3, 2378023117705231.
 Bichler and Franquez (2014) Bichler, G. and J. Franquez (2014). Conflict Cessation and the Emergence of Weapons Supermarkets, pp. 189–215. Cham: Springer International Publishing.
 Blank et al. (2017) Blank, M., M. Dincecco, and Y. M. Zhukov (2017). Political regime type and warfare: evidence from 600 years of european history. Available at SSRN 2830066.
 Block et al. (2018) Block, P., J. Koskinen, J. Hollway, C. Steglich, and C. Stadtfeld (2018). Change we can believe in: Comparing longitudinal network models on consistency, interpretability and predictive power. Social Networks 52, 180 – 191.
 Block et al. (2019) Block, P., C. Stadtfeld, and T. Snijders (2019). Forms of Dependence: Comparing SAOMs and ERGMs From Basic Principles. Sociological Methods & Research 48(1).
 Breslow (1974) Breslow, N. (1974). Covariance Analysis of Censored Survival Data. Biometrics 30(1), 89–99.
 Broekel and Bednarz (2019) Broekel, T. and M. Bednarz (2019, Jan). Disentangling link formation and dissolution in spatial networks: An application of a twomode stergm to a projectbased r&d network in the german biotechnology industry. Networks and Spatial Economics.
 Butts (2008) Butts, C. (2008). A Relational Event Framework for Social Action. Sociological Methodology 38(1), 155–200.
 Castro et al. (2014) Castro, I., C. Casanueva, and J. L. Galán (2014). Dynamic evolution of alliance portfolios. European Management Journal 32(3), 423 – 433.
 Center for systemic Peace (2017) Center for systemic Peace (2017). Polity IV Annual TimeSeries, 18002015, Version 3.1. http://www.systemicpeace.org. Accessed: 20170602.
 Cox (1972) Cox, D. (1972). Regression Models and LifeTables. Journal of the Royal Statistical Society. Series B (Methodological) 34(2), 187–220.
 Csardi and Nepusz (2006) Csardi, G. and T. Nepusz (2006). The igraph software package for complex network research. InterJournal, Complex Systems 1695(5), 1–9.
 de Nooy (2002) de Nooy, W. (2002). The dynamics of artistic prestige. Poetics 30(3), 147 – 167.
 DuBois et al. (2013) DuBois, C., C. Butts, D. McFarland, and P. Smyth (2013). Hierarchical models for relational event sequences. Journal of Mathematical Psychology 57(6), 297 – 309. Social Networks.

DuBois
et al. (2013)
DuBois, C., C. Butts, and P. Smyth (2013).
Stochastic blockmodeling of relational event dynamics.
In C. M. Carvalho and P. Ravikumar (Eds.),
Proceedings of the Sixteenth International Conference on Artificial Intelligence and Statistics
, Volume 31 of Proceedings of Machine Learning Research, Scottsdale, Arizona, USA, pp. 238–246. PMLR.  DuBois and Smyth (2010) DuBois, C. and P. Smyth (2010). Modeling Relational Events via Latent Classes. In Proceedings of the 16th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, KDD ’10, New York, NY, USA, pp. 803–812. ACM.
 Erdös and Rényi (1959) Erdös, P. and A. Rényi (1959). On Random Graphs i. Publicationes Mathematicae Debrecen 6, 290.
 Geyer and Thompson (1992) Geyer, C. J. and E. A. Thompson (1992). Constrained Monte Carlo maximum likelihood for dependent data. J. R. Statist. Soc. B, 657–699.
 Girardin and Limnios (2018) Girardin, V. and N. Limnios (2018). Applied Probability (2 ed.). Heidelberg: Springer.
 Goldenberg et al. (2010) Goldenberg, A., A. X. Zheng, S. E. Fienberg, and E. M. Airoldi (2010). A Survey of Statistical Network Models. Foundations and Trends® in Machine Learning 2(2), 129–233.
 Goodreau et al. (2008) Goodreau, S. M., M. S. Handcock, D. R. Hunter, C. T. Butts, and M. Morris (2008). A statnet Tutorial. Journal of statistical software 24(9), 1.
 Hanneke et al. (2010) Hanneke, S., W. Fu, E. P. Xing, et al. (2010). Discrete temporal models of social networks. Electronic Journal of Statistics 4, 585–605.
 He et al. (2019) He, X., Y. bo Dong, Y. ying Wu, G. rui Jiang, and Y. Zheng (2019). Factors affecting evolution of the interprovincial technology patent trade networks in china based on exponential random graph models. Physica A: Statistical Mechanics and its Applications 514, 443 – 457.
 Hoff et al. (2002) Hoff, P. D., A. E. Raftery, and M. S. Handcock (2002). Latent Space Approaches to Social Network Analysis. Journal of the American Statistical Association 97(460), 1090–1098.
 Holland and Leinhardt (1977) Holland, P. and S. Leinhardt (1977). A dynamic model for social networks. The Journal of Mathematical Sociology 5(1), 5–20.
 Holland and Leinhardt (1981) Holland, P. W. and S. Leinhardt (1981). An exponential family of probability distributions for directed graphs. J. Am. Statist. Ass. 76(373), 33–50.
 Hummel et al. (2012) Hummel, R. M., D. R. Hunter, and M. S. Handcock (2012). Improving simulationbased algorithms for fitting ERGMs. Journal of Computational and Graphical Statistics 21(4), 920–939.
 Jason et al. (2014) Jason, L. A., J. M. Light, E. B. Stevens, and K. Beers (2014). Dynamic Social Networks in Recovery Homes. American Journal of Community Psychology 53(34), 324–334.
 Kalbfleisch and Prentice (2002) Kalbfleisch, J. and R. Prentice (2002). The Statistical Analysis of Failure Time Data. WileyBlackwell.
 Kim et al. (2018) Kim, B., K. H. Lee, L. Xue, and X. Niu (2018). A review of dynamic network models with latent variables. Statist. Surv. 12, 105–135.
 Kinne (2016) Kinne, B. J. (2016). Agreeing to arm: Bilateral weapons agreements and the global arms trade. Journal of Peace Research 53(3), 359–377.
 Kolaczyk (2009) Kolaczyk, E. D. (2009). Statistical analysis of network data. Methods and Models. New York: Springer Science & Business Media.
 Koskinen et al. (2015) Koskinen, J., A. Caimo, and A. Lomi (2015). Simultaneous modeling of initial conditions and time heterogeneity in dynamic networks: An application to Foreign Direct Investments. Network Science 3(1), 58–77.
 Krivitsky and Handcock (2014) Krivitsky, P. N. and M. S. Handcock (2014). A separable model for dynamic networks. J. R. Statist. Soc. B 76(1), 29–46.

Leifeld
et al. (2018)
Leifeld, P., S. J. Cranmer, and B. A. Desmarais (2018).
Temporal exponential random graph models with btergm: estimation and bootstrap confidence intervals.
Journal of Statistical Software 83(6), doi: 10.18637/jss.v083.i06.  Lerner et al. (2013) Lerner, J., M. Bussmann, T. Snijders, and U. Brandes (2013). Modeling frequency and type of interaction in event networks. Corvinus journal of sociology and social policy 4(1), 3–32.
 Leskovec et al. (2009) Leskovec, J., K. J. Lang, A. Dasgupta, and M. W. Mahoney (2009). Community Structure in Large Networks: Natural Cluster Sizes and the Absence of Large WellDefined Clusters. Internet Mathematics 6(1), 29–123.
 Lusher et al. (2012) Lusher, D., J. Koskinen, and G. Robins (2012). Exponential random graph models for social networks: Theory, methods, and applications. Cambridge: Cambridge University Press.
 Marcum and Butts (2015) Marcum, C. and C. Butts (2015). Constructing and Modifying Sequence Statistics for relevent Using informR in R. Journal of Statistical Software, Articles 64(5), 1–36.
 Morris et al. (2008) Morris, M., M. S. Handcock, and D. R. Hunter (2008). Specification of exponentialfamily random graph models: terms and computational aspects. Journal of statistical software 24(4), 1548.
 Perry and Wolfe (2013) Perry, P. and P. Wolfe (2013). Point process modelling for directed interaction networks. Journal of the Royal Statistical Society: Series B (Statistical Methodology) 75(5), 821–849.
 Peto (1972) Peto, R. (1972). Contribution to the discussion of the paper by Dr. Cox. Journal of the Royal Statistical Society. Series B (Methodological) 34(2), 187–220.
 Quintane et al. (2013) Quintane, E., P. Pattison, G. Robins, and J. Mol (2013). Short and longterm stability in organizational networks: Temporal structures of project teams. Social Networks 35(4), 528 – 540.
 Raabe et al. (2019) Raabe, I. J., Z. Boda, and C. Stadtfeld (2019). The Social Pipeline: How Friend Influence and Peer Exposure Widen the STEM Gender Gap. Sociology of Education 92(2), 105–123.
 Robins and Pattison (2001) Robins, G. and P. Pattison (2001). Random graph models for temporal processes in social networks. Journal of Mathematical Sociology 25(1), 5–41.
 Robins et al. (2007) Robins, G., P. Pattison, Y. Kalish, and D. Lusher (2007). An introduction to exponential random graph (p*) models for social networks. Social Networks 29(2), 173–191.

Salathé et al. (2013)
Salathé, M., D. Q. Vu, S. Khandelwal, and D. Hunter (2013).
The dynamics of health behavior sentiments on a large online social
network.
EPJ Data Science
2(1), 4.  Sarkar and Moore (2006) Sarkar, P. and A. W. Moore (2006). Dynamic Social Network Analysis using Latent Space Models. In Y. Weiss, B. Schölkopf, and J. C. Platt (Eds.), Advances in Neural Information Processing Systems 18, pp. 1145–1152. MIT Press.
 SIPRI (2018) SIPRI (2018). Arms Transfers Database. https://www.sipri.org/databases/armstransfers. Accessed: 20190301.
 Snijders (1996) Snijders, T. (1996). Stochastic actor‐oriented models for network change. The Journal of Mathematical Sociology 21(12), 149–172.
 Snijders (2005) Snijders, T. (2005). Models for Longitudinal Network Data, pp. 215–247. Structural Analysis in the Social Sciences. Cambridge University Press.
 Snijders and Koskinen (2013) Snijders, T. and J. Koskinen (2013). Longitudinal Models, pp. 130–140. Structural Analysis in the Social Sciences. Cambridge University Press.
 Stadtfeld (2018) Stadtfeld, C. (2018). The MicroMacro Link in Social Networks. Emerging Trends in the Social and Behavioral Sciences, Forthcoming.
 Stadtfeld and Block (2017) Stadtfeld, C. and P. Block (2017). Interactions, Actors, and Time: Dynamic Network Actor Models for Relational Events. Sociological Science 4(14), 318–352.
 Stadtfeld and Hollway (2018) Stadtfeld, C. and J. Hollway (2018). goldfish: Goldfish – Statistical network models for dynamic network data. R package version 1.2.
 Stadtfeld et al. (2017) Stadtfeld, C., J. Hollway, and P. Block (2017). Dynamic Network Actor Models: Investigating Coordination Ties through Time. Sociological Methodology 47(1), 1–40.
 Stansfield et al. (2019) Stansfield, S. E., J. E. Mittler, G. S. Gottlieb, J. T. Murphy, D. T. Hamilton, R. Detels, S. M. Wolinsky, L. P. Jacobson, J. B. Margolick, C. R. Rinaldo, et al. (2019). Sexual role and hiv1 set point viral load among men who have sex with men. Epidemics 26, 68–76.
 Thurner et al. (2018) Thurner, P. W., C. S. Schmid, S. J. Cranmer, and G. Kauermann (2018). Network Interdependencies and the Evolution of the International Arms Trade. Journal of Conflict Resolution.
 Train (2009) Train, K. (2009). Discrete Choice Methods with Simulation. Cambridge University Press.
 Tranmer et al. (2015) Tranmer, M., C. S. Marcum, B. Morton, D. Croft, and S. de Kort (2015). Using the relational event model (rem) to investigate the temporal dynamics of animal social networks. Animal Behaviour 101, 99 – 105.
 Vu et al. (2017) Vu, D., L. Alessandro, M. Daniele, and P. Francesca (2017). Relational event models for longitudinal network data with an application to interhospital patient transfers. Statistics in Medicine 36(14), 2265–2287.
 Vu et al. (2011) Vu, D., D. Hunter, P. Smyth, and A. Asuncion (2011). ContinuousTime Regression Models for Longitudinal Networks. In J. ShaweTaylor, R. S. Zemel, P. L. Bartlett, F. Pereira, and K. Q. Weinberger (Eds.), Advances in Neural Information Processing Systems 24, pp. 2492–2500. Curran Associates, Inc.
 Vu et al. (2015) Vu, D., P. Pattison, and G. Robins (2015). Relational event models for social learning in MOOCs. Social Networks 43, 121–135.
 Ward et al. (2013) Ward, M., J. S. Ahlquist, and A. Rozenas (2013). Gravity’s Rainbow: A Dynamic Latent Space Model for the World Trade Network. Network Science 1.
 White et al. (2018) White, L. A., J. D. Forester, and M. E. Craft (2018). Covariation between the physiological and behavioral components of pathogen transmission: Host heterogeneity determines epidemic outcomes. Oikos 127(4), 538–552.
 World Bank (2017) World Bank (2017). World Bank Open Data, Real GDP. http://data.worldbank.org/. Accessed: 20170401.
Appendix A Annex: Additional Descriptives
Figure 4 depicts the distribution of in and outdegrees in the network. This is the number of in and outwards directed ties each country had in a specific year. A strongly asymmetric relation is revealed, indicating that about 70 of the countries do not export any weapons, while a small percentage of countries accounts for the major share of trade relations. The distribution of the indegree is not that extreme but still we have roughly one third of all countries not importing at all. These measure were calculated with the package in R (Csardi and Nepusz, 2006).
Comments
There are no comments yet.