Log In Sign Up

Model-based Pre-clinical Trials for Medical Devices Using Statistical Model Checking

by   Haochen Yang, et al.

Clinical trials are considered as the golden standard for medical device validation. However, many sacrifices have to be made during the design and conduction of the trials due to cost considerations and partial information, which may compromise the significance of the trial results. In this paper, we proposed a model-based pre-clinical trial framework using statistical model checking. Physiological models represent disease mechanism, which enable automated adjudication of simulation results. Sampling of the patient parameters and hypothesis testing are performed by statistical model checker. The framework enables a broader range of hypothesis to be tested with guaranteed statistical significance, which are useful complements to the clinical trials. We demonstrated our framework with a pre-clinical trial on implantable cardioverter defibrillators.


Improving clinical trial interpretation with Acceptability Curve Estimation (ACE)

Effective decision making from randomised controlled clinical trials rel...

A framework for the pre-specification of statistical analysis strategies in clinical trials (Pre-SPEC)

Bias can be introduced into clinical trials if statistical methods are c...

Designing an exploratory phase 2b platform trial in NASH with correlated, co-primary binary endpoints

Non-alcoholic steatohepatitis (NASH), an inflammatory and more progressi...

Towards More Flexible False Positive Control in Phase III Randomized Clinical Trials

Phase III randomized clinical trials play a monumentally critical role i...

Interim recruitment prediction for multi-centre clinical trials

We introduce a general framework for monitoring, modelling, and predicti...

Analyzing Basket Trials under Multisource Exchangeability Assumptions

Basket designs are prospective clinical trials that are devised with the...

1 Introduction

With the development of science and technologies, modern medical devices become more ”intelligent”, such that critical diagnosis and therapeutic decisions can be made by the device algorithm without interventions from medical professionals. These Medical Cyber-Physical Systems (MCPS) [20, 19] usually aim to deliver timely therapies during high risk or life-threatening conditions, so that patients with the devices can have improved life quality without the need to stay in hospital. For instance, Implantable Cardio-verter Defibrillator (ICD) detects irregular heart rhythms, and deliver therapies like pacing and defibrillation to restore normal heart rhythm [11]. Responsive NeuroStimulation (RNS) systems detect abnormal brain activities and deliver high-frequency electrical stimulation to terminate seizures or Parkinson syndromes [24].

The medical device industry is highly regulated. Due to the autonomy and high risk therapies, MCPSs are classified as Class III devices, and the safety and efficacy of MCPSs have to be rigorously validated in clinical settings before the devices can enter the market

[25]. Due to the large variability of patient conditions, diagnosis in these MCPSs cannot be 100% accurate. Therapies may be absent during high-risk conditions (false-negative), and therapies may be delivered during normal or low-risk conditions (false-positive). Device manufacturers and regulators need clinical evidence that MCPSs can reduce the number of false-positives and false-negatives as low as possible [1].

Cost is a primary consideration when designing clinical trials. A clinical trial takes many years to complete and costs millions of dollars [21]. Therefore the experiments in these clinical trials are carefully designed to achieve statistical significance with the least amount of enrolled patients. In order to strike a balance between statistical significance and the size of enrolled patients, assumptions are made regarding the device under evaluation and the target population. For example, the new device is X% better in terms of safety/efficacy compared to the standard care. These assumptions are made based on prior knowledge from previous clinical studies, animal tests or clinical literature, which may not reflect the ground truth. Sacrifices are also made to the endpoint of the clinical trials, so that the duration of the trials and the number of patients enrolled are financially feasible. For example, instead of choosing ”survival rate of patients” as endpoint of a clinical trial, the trial designer may choose ”first time for inappropriate therapy”, which has much shorter expected trial duration. As the result, although clinical trials are the golden standard for clinical evidence, their results may not be directly useful to guide clinical decisions due to the trade-offs. If wrong assumptions are made during the design of clinical trials, the trial may fail to generate result with statistical significance, essentially wasting all the money and time invested [23].

Figure 1: Overview of Our Proposed Framework

Physiological models have been proposed to provide safety and efficacy evidence for medical devices. Closed-loop model checking of medical device software can provide formal and rigorous evidence, but may not be scalable to more complex algorithms [15]. Model-based (in-silico) pre-clinical trials use physiological models as virtual patient, and have been used to complement and support clinical trials [13, 18]. However, these pre-clinical trials do not fully align with the statistical approaches in clinical trials, and may not be able to generate clinically-relevant results that can guide clinical decisions.

In this paper, we propose a pre-clinical trial framework for medical devices using statistical model checking. The framework is illustrated in Fig. 1. A Disease Mechanism module captures physiology of disease mechanism which can 1) interact with device algorithm in closed-loop, and 2) provide physiological context for adjudication. A Patient Mode Switching module captures the switching between patient conditions by changing the parameters of the physiological model. A Parameter Sampling module randomly samples a set of physiological parameters for the physiological model as well as the patient mode switching module. Two virtual patients with the same set of parameters are used to evaluate two different devices for better comparability. A Correctness Adjudication module provides quantified evaluation of a simulation. A Comparison Adjudication module compares the quantified evaluation of the two device algorithm, and the sequential hypothesis testing block inside statistical model checker UPPAAL SMC [8] then determines whether the hypothesis has been accepted with statistical significance after each iteration. The framework can be used to test hypothesises regarding the difference between a proposed device algorithm and an older algorithm with guaranteed statistical significance. These hypothesises include not only the ones tested in clinical trials, but also the ones interesting but not feasible in clinical trials. The result of these hypothesis testings can provide guidance to future clinical trials, including predicting trial results as well as validating assumptions during clinical trials.

The contribution of this paper is 3-folds: 1) We proposed a framework of pre-clinical trials for MCPSs using statistical model checking; 2) we proposed a set of clinically-relevant hypothesises that can be tested with our proposed framework; 3) we performed a series of pre-clinical trials on two algorithms for implantable cardioverter-defibrillator.

The rest of this paper is arranged as follows: in Section 2, the statistical approaches used in clinical trials are introduced, and the limitations of clinical trials are demonstrated using an example trial; in Section 3, our proposed pre-clinical trial framework using statistical model checking is introduced; in Section 4, we demonstrate our proposed framework using a case study; we then end the paper with discussion and future work.

2 Clinical Trials for MCPSs

In this section, we introduce the statistical approaches used in clinical trials and use an example to illustrate their limitations.

2.1 Clinical Trial Basics

There are several key properties of a clinical trial, which can affect its cost and clinical significance. These properties are determined before the conduction of the trial.

2.1.1 Endpoint

An endpoint of a clinical trial is a well-accepted, easily-observable and quantifiable criterion that reflects the purpose of the trial [10, 22]. For example, if a device is designed to treat a life-threatening disease, the death of patients due to the disease would be an appropriate endpoint for the clinical trial. The endpoint affects the duration of the trial which is closely related to the cost of the trial. For instance, if the death of patients is the endpoint of a trial and the patients in the target population are relatively young and healthy, the duration of the trial may be too long to be feasible. Therefore sacrifices have to be made when choosing the endpoint in order to ensure reasonable trial duration.

2.1.2 Randomized Control Trials

The objective of a clinical trial is to determine whether the device under evaluation is better in certain aspects compared to the standard care device. Since it is infeasible to evaluate two devices on the same patient, the evaluation has to be performed on different patients, which are not directly comparable. Therefore the recruited patients are assigned into two groups: the treatment group with the new device, and the control group with the standard care device. In order to make the two groups comparable, patients are randomly assigned into the groups to avoid bias [5].

2.1.3 Hypothesis Testing in Clinical Trials

In clinical trials, hypothesis testing is used as the primary statistical method, and superiority hypothesis testing is usually used for medical device trials [17]

. The null hypothesis

for superiority trial claims that the new device and standard care device have no difference, and the corresponding alternative hypothesis claims that the treatment group is superior than the control group in terms of the trial endpoint. Hypothesis testing aims to prove the correctness of the alternative hypothesis by rejecting the null hypothesis. The significance level of the test is first determined such that:

Based on the test statistics from the significance test, the value and distribution of the probability for

to be accepted (the p value) can be determined. If the p value is larger than , there is not enough evidence to reject ; and if the p value is less or equal to , is accepted. Traditionally is set to 0.05 in hypothesis testings.

2.1.4 Sample Size for Clinical Trials

In clinical trials, the number of patients required to guarantee statistical significant results (the sample size) is determined before the trial. For superiority trials, the desired sample size is related to how much better the new device is compared to the old device. Since this information is not available before the trial, trial designers have to make an assumption (the new device is better than the old device by ) base on prior knowledge like literature review. is referred to as the effect size

, which can significantly affect the sample size estimation

[17]. For example, reducing the effect size by half will increase the sample size by 4 times [9]. Sample size is also related to the significance level and power of the trial, as well as the estimated drop-off rate of the patients. The sample size is the most important parameter of a clinical trial, as it directly determines the cost of the trial and the statistical significance of the trial result.

2.2 The RhythmID Head-to-Head Trial (RIGHT)

We use a previous clinical trial on implantable cardioverter defibrillator to illustrate the limitations of clinical trials.

2.2.1 Implantable Cardioverter-Defibrillators

Coordinated contractions of the heart is governed by electrical signals. Anomalies in generation and conduction of electrical signals can cause irregular heart rhythm, which is referred to as arrhythmia. The severity of arrhythmia depends on the locations of the anomalies. Fast ventricular rate is referred to as ventricular tachycardia (VT), which can be fatal. On the other hand, fast atrial rate is referred to as Supra-Ventricular Tachycardia (SVT), which is non-fatal. Implantable Cardioverter-Defibrillators (ICD) are designed to detect and treat VT with anti-tachycardia pacing or defibrillation [11]. Treatments during SVT are considered as inappropriate therapies. Modern ICDs are equipped with SVT-VT discrimination algorithms to reduce the number of inappropriate therapies while ensuring all VTs are treated.

2.2.2 Background

In 2009, the ICD manufacturer Guidant has developed a new SVT-VT discrimination algorithm called RhythmID. Based on estimations before the clinical trial, RhythmID should have less risk of inappropriate therapies compared to PRLogic, which is an algorithm developed by Medtronic. Therefore Guidant proposed the RhythmID Head-to-Head Trial (RIGHT) [3] to validate this hypothesis, which is a randomized, unblinded, two-arm, prospective clinical trial.

2.2.3 Endpoint

The primary endpoint of RIGHT is the time of the first inappropriate therapy. It can be categorized as a ”Event-free Survival” endpoint, which generally results in relatively shorter trial duration and smaller sample size. However, patients receiving inappropriate therapy once are considered as dead when using this endpoint, which is not the case for inappropriate therapies of ICD and may compromise the clinically-relevance of the result. ”The rate/chance of inappropriate therapy over X years” is a clinically more interesting endpoint, but maybe infeasible for a clinical trial due to longer trial duration and larger sample size.

2.2.4 Sample Size

1962 patients were enrolled in RIGHT. Based on device manufacturer (Guidant (GDT) vs. Medtronics (MDT)) and type of device (Single chamber vs. Dual chamber), patients enrolled in RIGHT are divided into 4 separate groups: 507 for GDT Single, 478 for GDT Dual, 503 for MDT Single and 474 for MDT Dual. The sample size was calculated based on the assumption that ”the GDT algorithm has 25% less risk for inappropriate therapies compared to the MDT algorithm”. The relatively large assumed difference results in smaller sample size, but may risk the statistical significance if the assumption is wrong.

2.2.5 Hypothesis Testing of RIGHT

Since the endpoint for RIGHT is related to how long a patient can survive (with no inappropriate therapies), the hypothesis testing is based on survival analysis. The survival function is a function of the number of patients who received inappropriate therapy before time t. In RIGHT, the alternative hypothesis is .

2.2.6 Adjudication of Recorded Traces

The saved execution traces of the ICDs are analyzed by three physicians to determine whether and when an inappropriate therapy has happened. An adjudication is made if two out of three physicians have an agreement, otherwise the trace is discarded. The lack of physiological context in the execution traces affects the accuracy of adjudication, which can further affect the trial results.

2.2.7 Result

At the end of RIGHT, results showed that the MDT group has longer survival time compared to the GDT group. The GDT device has 34% higher risk for inappropriate therapies compared to the MDT algorithm (P=0.003),which is the opposite to the initial assumption [12].

2.3 Limitations of Clinical Trials

From the example above, the limitations of clinical trials can be summarized as follows:

2.3.1 Cost-sensitive:

Clinical trials are very costly and many sacrifices have to be made in order for a trial to be feasible. The endpoint sacrifices physiological-relevance for trial duration. Bold superiority assumptions are made in order to maintain reduced but feasible sample size. All these sacrifices may compromise the information we learn from the trial.

2.3.2 Sampling from Target Population:

In clinical trials, the patients recruited may not be a good representation of the target population. Patients with rarer conditions may not be recruited, and will not yield statistical significant result due to the rarity. Sampling is usually performed on meta-data like gender and age, instead of physiological parameters, which may affect the comparability between groups.

2.3.3 Comparison:

Since it’s not feasible to evaluate two devices on the same patient at the same time, the comparisons between devices are made indirectly using statistical methods.

2.3.4 Adjudication:

Adjudications of clinical trials on MCPSs are made by the physicians on the recorded execution traces of the devices. Due to the embedded nature of most MCPSs, only segments of device executions can be recorded. The recorded traces also lack physiological context, which may affect the adjudication results.

3 Pre-clinical Trials with Statistical Model Checking

The limitations of clinical trials can be complemented with model-based approaches. In this section, we introduce our proposed model-based framework for pre-clinical trials using statistical model checking.

The overview of the framework is illustrated in Fig. 1. Physiological model of the patient provides physiological context which enables automated and accurate adjudication. The cost of physiological model simulation is negligible compared to the clinical trials. Target population can be sampled base on physiological parameters of the physiological model, which can better cover rare patient conditions. Moreover, devices can be evaluated on the same ”patient”, or ”patient” with the same physiological parameters. Finally, statistical significance of the results can be guaranteed using sequential statistical model checking.

3.1 Patient Model

The physiological model of the patient can be separated into two modules. The disease mechanism module captures physiological mechanism of the disease to interact with the device algorithm. The mode switching module captures the different conditions or modes of the patient which correspond to correct/incorrect diagnosis and therapy for easy adjudication. The parameters of the model should reflect physiological variability of the patients.

3.2 Parameter Sampling

The parameters of the patient model should be given as distributions, and statistical model checker samples the parameters independently to create a virtual patient in each iteration. Two virtual patients with the same parameters evaluate both the new device and the standard care device to provide better comparability than in clinical trials.

3.3 Adjudication

The adjudication module has access to the state of the patient model, so that quantitative and qualitative assessment of the execution traces can be performed. The adjudication module contains two parts. The correction adjudication evaluates each execution trace from one device, and comparison adjudication compares the evaluations of execution traces from the two devices. The result of the adjudication is used by the statistical model checker to determine whether the trial has achieved desired statistical significance.

3.4 Statistical Model Checking

Statistical Model Checker performs hypothesis testing on stochastic systems. The key idea is to infer whether the system satisfies the quantitative properties by monitoring simulations of system, and use hypothesis testing to provide statistical evidence.

In this project, we use the UPPAAL SMC [8] to implement our framework. UPPAAL SMC use an extension of the Timed Automata (TA) [2] formalism. Stochastic behaviors can be modelled using probabilistic transitions (Fig. 2.(1)), as well as rate of exponential on locations (Fig. 2.(2)).

Figure 2: An example of timed automaton in UPPAAL SMC

UPPAAL SMC can perform statistical model checking using several queries. In this paper, we primarily use the following query:

which compares the probability for given system to satisfy the property within bound , with the probability for given system to satisfy the property within bound .

For given two properties . The null and alternate hypothesis are shown as following.

Probability comparison can be reduced to testing the hypothesis against .

In UPPAAL SMC, Extend Wald’s Sequential Hypothesis Testing [7] is used to test the hypothesis. Sequential Hypothesis Tesing is an iterative process. In each iteration, the system executes for two independent simulations and , and are evaluated in their corresponding runs. The result for this iteration is a pair , in which if is satisfied in , and otherwise. The cumulative result is then analyzed by SMC. The system enters another iteration if neither nor is accepted, and terminates when either or is accepted.

Compared to the hypothesis testing approach used in clinical trials, the sequential hypothesis testing used in UPPAAL SMC does not need to decide the sample size beforehand, and can provide guaranteed statistical significance. However, it is usually infeasible to use sequential statistical testing in clinical trials for medical devices, in which the duration for each patient’s enrollment usually takes many years.

4 Case Study: Pre-clicnial Trials for RIGHT Using Statistical Model Checking

4.1 Physiological Model

A dual chamber ICD receive three channels of electrical signals from the heart. Through the sensing circuit and algorithm, the Atrial channel and the Ventricular channel report whether there are atrial and ventricular contractions within the heart, which correspond to the and events. The Shock channel senses far-field ventricular electrogram, and through the morphology comparison algorithm, the ICD can judge whether a event correspond to a VT event (). The event from the ICD resets the states of the heart model, mimicking the effect of defibrillation. The interface between the heart model and the ICD algorithm is shown in Fig. 3.

Figure 3: Interface between the heart and the ICD
Figure 4: Timed automata heart model

In this case study, we have adopted the Virtual Heart Model (VHM) [14] to model the interaction between the heart and the ICD, as well as disease mechanism. The VHM is a timed-automata-based heart model which is capable of simulating the electrical activities of various heart conditions, and has been used to model the electrical activities of the heart during model checking and closed-loop testing of implantable cardiac devices [15, 13]. The VHM model used in this case study is illustrated in Fig. 4. The two node automata and model the generation of the three input events to the ICDs. The two path automata and model the conduction between node automata. The node automaton models the blocking property of the AV node. The heart model is capable of illustrating the origin and conduction pattern of electrical signals within the heart, which provides enough physiological context for the adjudication.

Figure 5: Mode Switching model for atria and ventricles

4.2 Patient Mode Switching Model

The switching between tachycardia and normal heart rhythm is relatively quick, and tachycardia can occur in either atria or ventricles independently. Therefore we created two mode switch models and to switch the parameters of the heart model, so that combinations of atrial and ventricular tachycardia can be modeled (Fig. 5). Each mode has a duration, which is governed by the rate of exponential of the locations. Switching among modes are governed by probabilistic transitions.

4.3 Parameter Sampling

The mode switching model and the physiological model have 68 parameters in total, and are independently sampled from normal distributions. The distributions of parameters were obtained from literature review

[16]. Since the patients are in Normal Sinus Rhythm (NSR) 99% of the time, we shortened the duration of the NSR state which can significantly increase the effective duration of the trial.

4.4 Adjudication

There are two adjudication modules in this case study: The correctness adjudication module determines whether a therapy is appropriate. The comparison adjudication module determines which algorithm is better in various criteria, such that hypothesis testing can be performed.

Figure 6: Correction adjudication module determines whether a therapy is appropriate

The correctness adjudication module for this study is shown in Fig. 6. It utilizes the state of the patient mode switch model to determine whether there has been a persistent ventricular tachycardia, which cannot be obtained just from the execution traces recorded by the devices. The number of inappropriate therapies and the time for first inappropriate therapy are saved for hypothesis testing.

Figure 7: Comparison adjudication module compares the evaluation of the two algorithms

The comparison adjudication module is shown in Fig. 7. After the simulation finishes, a function evaluates whether there exists difference between the two algorithms. Then the following query was checked:

4.5 The VT-SVT Discrimination Algorithms

In this case study, we use the RhythmID algorithm from GDT and PRLogic algorithm from MDT presented in [13], which were inferred from algorithm descriptions in open literature. The algorithms are converted from Matlab code to timed automata [2], which are attached in the Appendix for interested readers. Since saving clock values is not allowed in timed automata, we had to sacrifice the symbolic state representation within timed automata in order to model the algorithm. As the result, traditional model checking on these algorithms is not feasible due to state explosion problems.

4.6 Pre-clinical Trials for RIGHT

A series of pre-clinical trials are conducted using the framework we proposed. All trials are conducted on the dual-chamber version of the ICD algorithms.

4.6.1 Trial 1: Compare Chance of Inappropriate Therapy

The chance of inappropriate therapy on any patient within a prolonged period of time is a clinically-relevant criterion to compare two algorithms, but is not feasible in clinical trials as two devices cannot be evaluated on the same patient. The hypothesis under evaluation are:

In which ms and and represent the number of inappropriate therapies when the same patient has GDT and MDT devices, respectively. In UPPAAL SMC, the comparison adjudication module is programmed such that:

After 582 simulations, was rejected and was accepted with 95% confidence and 95% power, suggesting that the new algorithm from GDT has a higher chance of inappropriate therapy compared to the MDT algorithm, which aligns with the result of RIGHT.

4.6.2 Trial 2: Event-free Survival Time

Event-free survival time is a common endpoint for clinical trials. The survival time for a patient is defined as:

in which is the total simulation time, and is the time of the first inappropriate therapy. We compare whether GDT has better survival time compared to MDT with the following hypothesis:

The comparison adjudication module is programmed such that:

After 582 simulations, was rejected and was accepted with 95% confidence and 95% power, suggesting that the new algorithm from GDT has shorter event-free survival time compared to the MDT algorithm.

4.6.3 Trial 3: Mean Survival Time Comparison

The survival function can be used to describe the survival probability of a patient at different time. The survival functions for GDT and MDT devices can be defined as follow:

in which

are the random variables representing the survival time of a patient in the GDT group and the MDT group. The survival function can be estimated using the Kaplan-Meier Method

[4]. From the survival function, the Mean Survival Time can be calculated as:

The comparison of the mean survival time is clinically-relevant, but in an actual clinical trial, the comparison does not have statistical significance.

In this trial, we aim to provide statistical significance to the comparison of mean survival time. The protocol of this pre-clinical trial is different from the previous two trials. virtual patients are created for each simulation instead of one patient in other trials. The virtual patients are duplicated and are assigned to the GDT and MDT groups. The following hypothesis compares the mean survival time of the GDT and MDT groups.

patients are simulated in each iteration and their mean survival times are calculated. Effectively, each iteration is a RIGHT trial on its own. The comparison adjudication module is programmed such that:

In this trial we set , and after 582 iterations, was rejected and was accepted with 95% confidence and 95% power, suggesting that the new algorithm from GDT has shorter mean event-free survival time compared to the MDT algorithm. Effectively we have conducted 582 trials with sample size 25, which is infeasible in actual clinical trials.

4.6.4 Trial 4: Hazard Ratio Comparison

The Hazard Ratio (HR) of the two groups is define as the ratio of two hazard functions .

The HR reflects the comparative risks for inappropriate therapies between two devices, which can be estimated using the Cox Proportional Hazards Model [6].

In RIGHT, the hazard ratio was calculated based on the trial result, but its validity cannot be claimed with statistical significance. In this pre-clinical trial, we aim to compare whether GDT has higher hazard compared to MDT with statistical significance using the following hypothesis:

The comparison adjudication module is programmed such that:

The result shows that was rejected and was accepted with 95% confidence and 95% power, suggesting that the new algorithm from GDT has higher hazard compared to the MDT algorithm.

5 Summary and Future Work

Clinical trials for medical devices are costly, and many sacrifices have to be made for the trial to be feasible. Simulations of ”virtual patients” cannot replace clinical trials, but can provide useful information that can guide clinical decisions and help the design of clinical trials. In this paper, we proposed a model-based pre-clinical trial framework using statistical model checking. Physiological models can provide ground truth for automated adjudication. Statistical model checker was used to sample physiological parameters, and test clinically-relevant hypothesises with guaranteed statistical significance. With the proposed framework, we can test hypothesises that are not feasible in clinical trials, or provide statistical significance to the the analysis results during clinical trials.

The next step for this project is to formally prove and quantify the statistical advantage of this framework, and develop templates that can be easily used by researchers in other domains.


  • [1] S. Abdel-Aleem (2009) Design, execution, and management of medical device clinical trials. Wiley Online Library. Cited by: §1.
  • [2] R. Alur and D. L. Dill (1994) A theory of timed automata. Theoretical Computer Science 126, pp. 183–235. Cited by: §3.4, §4.5.
  • [3] R. D. Berger, D. R. Lerew, J. M. Smith, C. Pulling, and M. R. Gold (2006) The rhythm id going head to head trial (right): design of a randomized trial comparing competitive rhythm discrimination algorithms in implantable cardioverter defibrillators. J Cardiovasc Electrophysiol 17 (7), pp. 749–53. External Links: ISSN 1045-3873 (Print) 1045-3873 (Linking), Document, Link Cited by: §2.2.2.
  • [4] J. M. Bland and D. G. Altman (1998) Survival probabilities (the kaplan-meier method). British Medical Journal 317 (7172), pp. 1572–1580. Cited by: §4.6.3.
  • [5] C. J. Bulpitt (1996) Randomised controlled clinical trials. 2 edition, , Vol. , Springer US. External Links: ISBN 978-1-4613-7915-7,978-1-4615-6347-1 Cited by: §2.1.2.
  • [6] D. R. Cox (1972) Regression models and life-tables. Journal of the Royal Statistical Society: Series B (Methodological) 34 (2), pp. 187–202. Cited by: §4.6.4.
  • [7] A. David, K. G. Larsen, A. Legay, M. Mikučionis, D. B. Poulsen, J. Van Vliet, and Z. Wang Statistical model checking for networks of priced timed automata. Conference Proceedings In International conference on formal modeling and analysis of timed systems, pp. 80–96. Cited by: §3.4.
  • [8] A. David, K. G. Larsen, A. Legay, M. Mikučionis, and D. B. Poulsen (2015) Uppaal smc tutorial. International Journal on Software Tools for Technology Transfer 17 (4), pp. 397–415. External Links: ISSN 1433-2787 Cited by: §1, §3.4.
  • [9] P. Fayers and D. Machin (1995) Sample size: how many patients are necessary?. British journal of cancer 72 (1), pp. 1–9. Cited by: §2.1.4.
  • [10] U.S. FDA (2018) Clinical trial endpoints for the approval of cancer drugs and biologics guidance for industry. Book, Food and Drug Administration. Cited by: §2.1.1.
  • [11] M. Glikson and P. A. Friedman (2001) The implantable cardioverter defibrillator. The Lancet 357 (9262), pp. 1107–1117. External Links: ISSN 0140-6736 Cited by: §1, §2.2.1.
  • [12] M. R. Gold, S. Ahmad, K. Browne, K. C. Berg, L. Thackeray, and R. D. Berger (2012) Prospective comparison of discrimination algorithms to prevent inappropriate icd therapy: primary results of the rhythm id going head to head trial. Heart Rhythm 9 (3), pp. 370–7. External Links: ISSN 1556-3871 (Electronic) 1547-5271 (Linking), Document, Link Cited by: §2.2.7.
  • [13] Z. H. Jiang, H. Abbas, K. J. Jang, M. Beccani, J. Liang, S. Dixit, and R. Mangharam (2016) In-silico pre-clinical trials for implantable cardioverter defibrillators. 2016 38th Annual International Conference of the IEEE Engineering in Medicine and Biology Society (Embc), pp. 169–172. Cited by: §1, §4.1, §4.5.
  • [14] Z. H. Jiang, M. Pajic, and R. Mangharam (2012) Cyber-physical modeling of implantable cardiac medical devices. Proceedings of the IEEE 100 (1), pp. 122–137. External Links: ISSN 0018-9219, Document Cited by: §4.1.
  • [15] Z. Jiang, M. Pajic, R. Alur, and R. Mangharam (2014) Closed-loop verification of medical devices with model abstraction and refinement. International Journal on Software Tools for Technology Transfer 16 (2), pp. 191–213. External Links: ISSN 1433-2787, Document Cited by: §1, §4.1.
  • [16] M. E. Josephson (2015) Josephson’s clinical cardiac electrophysiology. Lippincott Williams & Wilkins. Cited by: §4.3.
  • [17] S. A. Julious (2009) Sample sizes for clinical trials. CRC Press. Cited by: §2.1.3, §2.1.4.
  • [18] B. P. Kovatchev, M. Breton, C. D. Man, and C. Cobelli (2009) In silico preclinical trials: a proof of concept in closed-loop control of type 1 diabetes. Journal of Diabetes Science and Technology 3 (1), pp. 44–55. Cited by: §1.
  • [19] I. Lee, O. Sokolsky, S. Chen, J. Hatcliff, E. Jee, B. Kim, A. King, M. Mullen-Fortino, S. Park, and A. Roederer (2011) Challenges and research directions in medical cyber–physical systems. Proceedings of the IEEE 100 (1), pp. 75–90. External Links: ISSN 0018-9219 Cited by: §1.
  • [20] I. Lee and O. Sokolsky Medical cyber physical systems. Conference Proceedings In Design automation conference, pp. 743–748. External Links: ISBN 1450300022 Cited by: §1.
  • [21] L. Martin, M. Hutchens, C. Hawkins, and A. Radnov (2017) How much do clinical trials cost?. Nature Reviews Drug Discovery 16 (6), pp. 381. External Links: ISSN 1474-1784 Cited by: §1.
  • [22] C. Mcleod, R. Norman, E. Litton, B. R. Saville, S. Webb, and T. L. Snelling (2019) Choosing primary endpoints for clinical trials of health care interventions. Contemporary Clinical Trials Communications 16. Cited by: §2.1.1.
  • [23] S. J. Pocock and G. W. Stone (2016) The primary outcome fails—what next?. New England Journal of Medicine 375 (9), pp. 861–870. External Links: ISSN 0028-4793 Cited by: §1.
  • [24] T. L. Skarpaas, B. Jarosiewicz, and M. J. Morrell (2019) Brain-responsive neurostimulation for epilepsy (rns system). Epilepsy research 153, pp. 68–70. External Links: ISSN 0920-1211 Cited by: §1.
  • [25] WHO (2003) Medical device regulations: global overview and guiding principles. Book, World Health Organization. Cited by: §1.

Appendix 0.A VT-SVT Discrimination Algorithms

0.a.1 RhythmID from Guidant (GDT)

Figure 8: The control flow of the RhythmID algorithm
Figure 9: RhythmID algorithm in UPPAAL SMC

0.a.2 PRLogic from Medtronics (MDT)

Figure 10: The control flow of the PRLogic algorithm
Figure 11: PRLogic algorithm in UPPAAL SMC