Data mining methodology for response to hypertension symptomology—application to COVID-19-related pharmacovigilance

  1. Xuan Xu
  2. Jessica Kawakami
  3. Nuwan Indika Millagaha Gedara
  4. Jim E Riviere
  5. Emma Meyer
  6. Gerald J Wyckoff
  7. Majid Jaberi-Douraki  Is a corresponding author
  1. 1DATA Consortium, www.1DATA.life, United States
  2. Department of Mathematics, Kansas State University, United States
  3. Kansas State University Olathe, United States
  4. School of Pharmacy, Division of Pharmacology and Pharmaceutical Sciences, University of Missouri-Kansas City, United States
  5. Molecular Biology and Biochemistry, School of Biological and Chemical Sciences, University of Missouri-Kansas City, United States
  6. Department of Business Economics, University of Colombo, Sri Lanka
  7. Kansas State University and North Carolina State University, United States
5 figures, 4 tables and 12 additional files

Figures

Workflow of data-driven methodology for pulmonary symptomology in hypertension using machine learning models from preprocessing and dictionary creation to storing tables in the database an analysis.

As a part of data cleaning, we were also challenged by multiple technical issues when combining drugs: (i) there were many drugs’ names that did not track a specific standard. (ii) Formulations of the same active ingredient with different generic or brand names for different routes of administration created confusion in collecting data (for instance, Revatio, Viagra, sildenafil, sildenafil citrate, APO sildenafil, sildenafil film-coated tablet, sildenafil citrate Aurobindo pharma, sildenafil Amneal Pharmaceuticals, Teva sildenafil, sildenafil Pfizer, sildenafil Greenstone, sildenafil Hormosan Filmtabletten, Revathio, sildenafil SUP, etc.). For this purpose, we combined drugs with or without salt, alcohol, etc. from different generic names and brand names.

Figure 2 with 1 supplement
Principal component analysis of the expected count for 134 drugs (from 12 Anatomical Therapeutic Chemical [ATC] drug classes) in 2D (A) and 3D (B) spaces using the log expected value of RR, logE.

In Panel B, individual drugs are (significantly) separated on the extreme edges are marked by (1), amlodipine, (2) quinapril, (3) trandolapril, (4) nilvadipine, (5) azosemide, (6) azelnidipine, and (7) treprostinil. An interactive figure can be found on the 1DATA home page. Click the following URL to see the figure: https://1data.life/pages/publication/figure1B.html.

Figure 2—figure supplement 1
Principal component analysis of expected count for all the 117 drugs when excluding antihypertensive drugs in 2D (A) and 3D (B) space using empirical Bayes geometric mean (EBGM) for angiotensin-converting enzymes (ACEIs), angiotensin-II receptor blockers (ARBs), beta-blocking agents (BBAs), calcium channel blockers (CCBs), and TDs.

In Panel B, individual drugs are (significantly) separated on the extreme edges are marked by (1) amlodipine, (2) quinapril, (3) trandolapril, (4) nilvadipine, (5) azosemide, (6) azelnidipine, and (7) treprostinil.

Figure 3 with 2 supplements
Two layouts of Circos plot for 22 hypertensive drugs selected by graphical least absolute shrinkage and selection operator (GLASSO).

Circos plots of drugs were obtained based on the empirical Bayes geometric mean (EBGM) matrix after applying GLASSO. Edge bundling linkages for better visualization and drugs were selected by GLASSO with edge bundling. Grouped drugs based on their classes were assigned the same color based on their classes (A). Applying reverse Cuthill-McKee (RCM) reordering and edge bundling for grouping drugs based on the Anatomical Therapeutic Chemical (ATC) class and edge bundling (B).

Figure 3—figure supplement 1
Two layouts of Circos plot for 22 hypertensive drugs selected by graphical least absolute shrinkage and selection operator (GLASSO) when excluding antihypertensives agents (AHAs).

Circos plots of drugs were obtained based on the empirical Bayes geometric mean (EBGM) matrix after applying GLASSO. Edge bundling linkages for better visualization and drugs were selected by GLASSO with edge bundling. Grouped drugs in high-level terms were assigned the same color based on their classes (A). Applying reverse Cuthill-McKee (RCM) reordering and edge bundling for grouping drugs based on the Anatomical Therapeutic Chemical (ATC) class and edge bundling (B).

Figure 3—figure supplement 2
Arc diagram visualization of 22 drugs selected from graphical least absolute shrinkage and selection operator (GLASSO) and associated pulmonary adverse drug event (ADE)-drug combination.

Three inset maps as examples of highly correlated and complex interactions are shown. Large-filled circles show drugs and small circles are used for ADEs.

Figure 4 with 1 supplement
Pairwise Wilcoxon signed-rank test between different Anatomical Therapeutic Chemical (ATC) classes.

No pairwise significant comparison was found similar to Supplementary file 7. But the group comparison was highly significant, p-value = 0.00072.

Figure 4—figure supplement 1
Pairwise Wilcoxon signed-rank test between different Anatomical Therapeutic Chemical (ATC) classes.

No pairwise significant comparison was found similar to Supplementary file 7. But the group comparison was very significant, p-value = 0.044.

Figure 5 with 1 supplement
Pairwise Wilcoxon signed-rank test between different classes defined by graphical least absolute shrinkage and selection operator (GLASSO) (A) and pairwise Wilcoxon signed-rank test between different classes defined by GLASSO excluding tadalafil (B).
Figure 5—figure supplement 1
Pairwise Wilcoxon signed-rank test for different classes defined by graphical least absolute shrinkage and selection operator (GLASSO) (A) and the same test for different classes defined by GLASSO excluding warfarin (B) similar to Supplementary file 8.

Tables

Table 1
Drug class after applying first the two filtering rules to obtain 44 drugs and then the elimination process from the penalized regression graphical least absolute shrinkage and selection operator (GLASSO) to obtain 22 drugs.
Drug class# Reports(Total 612,733)# Drugs after initial filtering(total 134)# Drugs correspond to ≥2 ADEs in HLT codes when EB05 > 1 (total 44)Drugs using GLASSO (total 22)
ACEIs69,3271331
ARBs87,415853
Other RAS agents3,471100
Other antihypertensive120,4251474
Antithrombotic agents67,7671073
Beta blocking agents74,5741331
Calcium channel blockers86,39918106
Diuretics29,3941431
Lipid modifying agents2,634400
Urologicals18,186422
Vasoprotectives909100
Combinations52,2323441
Table 2
The number of pulmonary adverse drug events (ADEs) when relative reporting ratio (RR) larger than two or the fifth quantile of empirical Bayes geometric mean (EBGM), EB05, larger than two after graphical least absolute shrinkage and selection operator (GLASSO) filtering process implemented in Table 1.
Drug# Pulmonary ADEsOrder by EBGM# Pulmonary ADEsOrder by RR
Macitentan161102
Bosentan142511
Epoprostenol11494
Selexipag105102
Sildenafil10676
Tadalafil107344
Beraprost710131
Nifedipine513511
Candesartan416334
Althiazide/Spironolactone320418
Bisoprolol321#N/A#N/A
Imidapril324511
Azelnidipine230423
Azilsartan Kamedoxomil231332
Bendroflumethiazide232333
Benidipine233511
Cilnidipine234511
Doxazosin236336
Lercanidipine239190
Nicardipine240511
Rilmenidine242#N/A#N/A
Telmisartan243430
Table 3
Comparative analysis of each drug and associated pulmonary adverse drug events (ADEs) based on the new classification from different graphical least absolute shrinkage and selection operator (GLASSO) (GL) Clusters.
DrugDrug classADEs for EB05 > 1 (n) *GL Cluster
MacitentanAHAs1–15,17 (16)1
BosentanAHAs1,2,4–15 (14)1
EpoprostenolATAs1,2,4–9,11,12,15 (11)1
SelexipagATAs2,4–12 (10)1
SildenafilUAs1,2,4–12 (10)1
TadalafilUAs1,2,4–12 (10)1
BeraprostATAs1,2,5–9 (7)1
NifedipineCCBs1–3,15,16 (5)2
CandesartanARBs1,3,14,16 (4)2
Althiazide\SpironolactoneCOMBs4,10,11 (3)3
RilmenidineAHAs4,10 (2)3
BisoprololBBAs1,2,14 (3)4
LercanidipineCCBs1,14 (2)4
ImidaprilACEs1–3 (3)5
AzelnidipineCCBs1,3 (2)5
Azilsartan KamedoxomilARBs1,3 (2)5
BenidipineCCBs1,2 (2)5
CilnidipineCCBs1,2 (2)5
TelmisartanARBs1,3 (2)5
BendroflumethiazideTDAs3,13 (2)6
DoxazosinAHAs3,13 (2)6
NicardipineCCBs3,13 (2)6
  1. *Below ADEs can be found corresponding to each drug:

  2. 1. Parenchymal lung disorders NEC.

  3. 2. Pneumothorax and pleural effusions NEC.

  4. 3. Lower respiratory tract inflammatory and immunologic conditions.

  5. 4. Respiratory tract disorders NEC.

  6. 5. Breathing abnormalities.

  7. 6. Lower respiratory tract signs and symptoms.

  8. 7. Pulmonary oedemas.

  9. 8. Respiratory failures (Excl Neonatal).

  10. 9. Vascular pulmonary disorders NEC.

  11. 10. Bronchospasm and obstruction.

  12. 11. Coughing and associated symptoms.

  13. 12. Respiratory syncytial viral infections.

  14. 13. Bronchial conditions NEC.

  15. 14. Pulmonary thrombotic and embolic conditions.

  16. 15. Lower respiratory tract infections NEC.

  17. 16. Fungal lower respiratory tract infections.

  18. 17. Pleural infections and inflammations.

Table 4
The Friedman test for drugs in Anatomical Therapeutic Chemical (ATC) class and graphical least absolute shrinkage and selection operator (GLASSO) class.
ATC classp-value (44 drugs)p-value (22 drugs)GL ClusterThe p-value for 22 drugs
ACEIs0.2711<0.001 (0.199, when excluding tadalafil)
ARBs<0.001<0.00120.110
AHAs<0.001<0.00130.884
ATAs<0.001<0.00140.346
BBAs0.023250.127
CCBs0.0010.00160.0522
COMBs0.236
TDAs0.0329
UAs0.1270.127
  1. The p-value for statistical significance is <0.05.

Additional files

Supplementary file 1

30 pulmonary ADEs.

https://cdn.elifesciences.org/articles/70734/elife-70734-supp1-v2.docx
Supplementary file 2

Contribution of Pulmonary ADEs in 2D and 3D PCAs.

https://cdn.elifesciences.org/articles/70734/elife-70734-supp2-v2.docx
Supplementary file 3

Frequency of pulmonary ADEs when RR larger than two or the 5th quantile of EBGM, EB05, large than two.

https://cdn.elifesciences.org/articles/70734/elife-70734-supp3-v2.docx
Supplementary file 4

Description of arc diagram visualization.

https://cdn.elifesciences.org/articles/70734/elife-70734-supp4-v2.docx
Supplementary file 5

Comparative analysis of drug and associated pulmonary ADEs in different GLASSO Clusters.

https://cdn.elifesciences.org/articles/70734/elife-70734-supp5-v2.docx
Supplementary file 6

Friedman test for drugs in ATC class and GLASSO class.

https://cdn.elifesciences.org/articles/70734/elife-70734-supp6-v2.docx
Supplementary file 7

A. Multiple comparisons of different ATC classes together with the adjusted p-value using the rigorous paired Wilcoxon signed-rank test with Bonferroni correction.

B. Multiple comparisons of different ATC classes excluding AHAs and UAs.

https://cdn.elifesciences.org/articles/70734/elife-70734-supp7-v2.docx
Supplementary file 8

A. Multiple comparisons of drugs from GL Clusters and multiple comparisons of drugs from GL clusters excluding Tadalafil.

B. Multiple comparisons of drugs from GLASSO clusters and multiple comparisons of drugs from GLASSO clusters excluding Warfarin.

https://cdn.elifesciences.org/articles/70734/elife-70734-supp8-v2.docx
Supplementary file 9

Dose distribution related to tadalafil and sildenafil and ADE.

https://cdn.elifesciences.org/articles/70734/elife-70734-supp9-v2.docx
Transparent reporting form
https://cdn.elifesciences.org/articles/70734/elife-70734-transrepform1-v2.pdf
Source data 1

This folder contains the data for R programs in the Source Code files.

https://cdn.elifesciences.org/articles/70734/elife-70734-supp10-v2.zip
Source code 1

This folder contains R codes for the manuscript: Data-Driven Methodology COVID19 Related Pharmacovigilance.

https://cdn.elifesciences.org/articles/70734/elife-70734-supp11-v2.zip

Download links

A two-part list of links to download the article, or parts of the article, in various formats.

Downloads (link to download the article as PDF)

Open citations (links to open the citations from this article in various online reference manager services)

Cite this article (links to download the citations from this article in formats compatible with various reference manager tools)

  1. Xuan Xu
  2. Jessica Kawakami
  3. Nuwan Indika Millagaha Gedara
  4. Jim E Riviere
  5. Emma Meyer
  6. Gerald J Wyckoff
  7. Majid Jaberi-Douraki
(2021)
Data mining methodology for response to hypertension symptomology—application to COVID-19-related pharmacovigilance
eLife 10:e70734.
https://doi.org/10.7554/eLife.70734