Methodology of this study including the emulator development and the evaluation. The simulator is visualized by a schematic human ventricular cardiomyocyte that includes all currents considered for the emulator training. Inputs of the emulator (see Figure 3) are the corresponding maximum conductances (G) but for the sake of illustration, only three representatives are shown. Output is the AP (Vm(t)). Training and validation data (maximum conductances and APs m(t)) were generated by the simulator. The evaluation was performed for forward and inverse problems and to this end, the pharmacological parameter s was introduced. This describes the interaction between the drug and a given target and was defined as scaling factor of the respective maximum conductance in control conditions (s & 1: block, s = 1: no effect, s > 1: enhancement). Synthetic data (control maximum conductances and drugged maximum conductances obtained through scaling, and control and drugged APs) and experimental data (control and drugged APs) were used for the evaluation [Orvos et al., 2019]. The forward problem was only solved for synthetic data, whereas the inverse problem was solved for both synthetic and experimental data. When experimental data were used, estimated pharmacological parameters were compared to distributions derived from data published within the CiPA initiative [Li et al., 2017, Chang et al., 2017].

Processed APs used for training and validation (left and center). Additionally we show the excluded APs on the right (see text for description of the exclusion criteria).

AP biomarkers and their experimental ranges used to generate the population of synthetic cardiomyocytes. These were adopted from [Passini et al., 2017]. Experimental data were collected at 37 °C in small right ventricular trabeculae and papillary tissue preparations obtained from healthy human hearts during pacing with a cycle length of 1000 ms [Britton et al., 2017, O’Hara et al., 2011].

Pharmacological parameter samples (synthetic drugs) with scaling factors for GKr and PCa to generate the drug data of data set #3.

Summary of the data used in this study, along with their usage and the number of valid samples. Note that each AP is counted individually, also in cases of control/drug pairs.

Conceptual architecture of the neural network emulator. The maximum conductances x are encoded into depolarization parameters di and a latent space representation ϑ that uniquely defines the time series functional fϑ : R R. The time is normalized and encoded in polynomials up to degree 8 (tp for p ∈ {1, 2, . . . 8}), before being appended to the latent code ϑ. fϑ is then used in conjunction with the depolarization helper tanh to approximate the AP Vm(t).

Comparison of an averaged raw and an averaged filtered experimental AP. One dofetilide control AP is shown as example.

Analysis of solution accuracy of the forward problem on synthetic data including normal APs (drug data of data set #2). Left: histogram of RMSEs for the APs, right: APs with the largest RMSEs. The RMSE is given above each subplot.

Analysis of solution accuracy of the forward problem on synthetic data including normal APs (drug data of data set #2) with respect to AP biomarkers. Histograms of mismatches for each biomarker are shown and the RMSE is given in the upper left corner. The number in the right upper corner denotes the number of outliers of the 10,000 samples which lie outside the shown ranges.

Analysis of solution accuracy of the forward problem on synthetic data including abnormal APs exhibiting EADs (subset of data set #3). Left: histogram of RMSEs for the APs, right: APs with the largest RMSEs. 124 of the 171 emulated APs exhibit the expected EADs (based on the criterion outlined in Appendix A). The RMSE is given above each subplot. All emulated APs are shown in Appendix C

Analysis of solution accuracy of the inverse problem on synthetic data (data set #2). Left: boxplot of errors between normalized estimated and ground truth control maximum conductances, middle: boxplot of errors between normalized estimated and ground truth drugged maximum conductances, right: boxplot of errors between estimated and ground truth pharmacological parameters. Error definitions are given in (4) and (5). The RMSE over all data is given below each parameter.

Analysis of fit quality of the inverse problem on experimental data. Comparison of the fitted APs (solid lines) and the experimental APs (dashed lines) at control (red) and after drug administration (blue) for all drugs.

Average RMSE over control and drugged APs measured in all preparations per drug. All values in mV.

Analysis of solution accuracy of the inverse problem using experimental data. The histograms compare the estimated pharmacological parameters (dashed vertical lines) from data of multiple preparations with the CiPA distributions (blue; see Section 2.3.3). The black dash dotted vertical lines are the borders of the range (grey) that was used to determine if the estimation of the given pharmacological parameter was successful. The range is spanned up by µ ± (0.15 + σ) of the CiPA distribution. Successfully estimated parameters are shown as green lines and unsuccessfully estimated parameters are shown as red lines. The number in the upper left corner indicates for how many preparations the parameters could be successfully estimated in relation to the total number of preparations for the given drug.

Pharmacological parameters related to maximum conductances that were considered successfully or unsuccessfully estimated across all preparations and drugs. For each channel, the drugs are stated for which respective data from the CiPA initiative were available. C, D, S, T, V, A mark cisapride, dofetilide, sotalol, terfenadine, verapamil, all drugs respectively.

Global sensitivity analysis of the ToR-ORd simulator. Sobol’ sensitivity indices are shown for each maximum conductance relative to each AP biomarker. Left: first-order (S1), right: total-effect (ST) Sobol’ sensitivity coefficient.

Emulated APs based on the pharmacological parameters of data set #3. See also Section 3.1.1 and Figure 7. From left to right and top to bottom, the plot shows the true positive, false negative, false positive and true negative samples. The number next to the title specifies the number of samples belonging to each category. The classification criterion is outlined in Appendix A.

Same as Figure 12, but showing the simulated APs.