Approximating missing epidemiological data for cervical cancer through Footprinting: A case study in India

  1. Irene Man  Is a corresponding author
  2. Damien Georges
  3. Maxime Bonjour
  4. Iacopo Baussano
  1. Early Detection, Prevention and Infections Branch, International Agency for Research on Cancer (IARC/WHO), France
3 figures, 3 tables and 2 additional files

Figures

Hierarchical structure of availability of cervical cancer epidemiological data.
Figure 1—source data 1

Overview of availability of cervical cancer epidemiological data by state.

https://cdn.elifesciences.org/articles/81752/elife-81752-fig1-data1-v1.docx
Figure 2 with 2 supplements
Identified clusters of registry-specific cervical cancer incidence.

Clusterings under (A) 2, (B) 3, and (C) 4 prefixed clusters. Each panel within a row corresponds to a cluster within a k-clustering, with the cluster label given on top of the panel. The cervical cancer incidence data were extracted from volume XI of Cancer Incidence in Five Continents (CI5) (Bray et al., 2017) and the 2012–2016 report by the Indian National Centre for Disease Informatics and Research (NCDIR) (Report of National Cancer Registry Programme, 2020). Black: cluster mean of cervical cancer incidence; dark grey: registry incidence assigned to the cluster; light grey: registry incidence assigned to other clusters.

Figure 2—source data 1

Registry-specific cervical cancer incidence data from Cancer Incidence in Five Continents (CI5) and National Centre for Disease Informatics and Research (NCDIR).

https://cdn.elifesciences.org/articles/81752/elife-81752-fig2-data1-v1.xlsx
Figure 2—source data 2

Estimated model parameters under Poisson regression models.

https://cdn.elifesciences.org/articles/81752/elife-81752-fig2-data2-v1.docx
Figure 2—figure supplement 1
Registry-specific cervical cancer incidence data from Cancer Incidence in Five Continents (CI5) and National Centre for Disease Informatics and Research (NCDIR).

See Figure 1—source data 1 for whether registries belong to CI5 or NCDIR.

Figure 2—figure supplement 2
Mean age-specific cervical cancer incidence by cluster.
Figure 3 with 1 supplement
Sexual behaviour data from National AIDS Control Organization (NACO) by Indian state.

Indian state-specific data on (A) median age of first sex, (B) proportion of respondents reporting sex with non-regular partners in the last 12 months, (C) proportion of male respondents reporting sex with commercial partners in the last 12 months, and (D) proportion of male respondents by number of commercial partners in the last 12 months. Each violin plot and the associated cloud of circles correspond to a sexual behaviour variable. Each circle corresponds to the data of a state (or group of states). The data were extracted from the 2006 National Behaviour Surveillance Survey of the National AIDS Control Organization of India (National Behavioural Surveillance Survey: General Population, 2006). Blue and red: Indian states identified in the high and low cervical cancer incidence clusters. Grey: states without cervical cancer incidence data and therefore unknown cluster.

Figure 3—source data 1

Indian state-specific sexual behaviour data from National AIDS Control Organization (NACO).

https://cdn.elifesciences.org/articles/81752/elife-81752-fig3-data1-v1.xlsx
Figure 3—source data 2

Predictive values of the sexual behavior variables for cervical cancer incidence cluster.

https://cdn.elifesciences.org/articles/81752/elife-81752-fig3-data2-v1.docx
Figure 3—figure supplement 1
Indian state-specific sexual behaviour data from National AIDS Control Organization (NACO).

States or groups of states as reported in the 2006 National Behaviour Surveillance Survey of the National AIDS Control Organization of India. Other North Eastern states include Arunachal Pradesh, Nagaland, Meghalaya, Mizoram, and Tripura.

Tables

Table 1
Estimated parameters of clusters of cervical cancer incidence patterns.
Number of prefixed clustersBIC*Cluster label iNumber (%) of registries in clusterMaximum incidenceMaximum incidence patternMaximum incidence age groupMaximum incidence age group pattern
26933127 (82%)47 casesLow60–64 yearsLate
26 (18%)91 casesHigh55–59 yearsEarly
35700119 (58%)38 casesLow60–64 yearsLate
25 (15%)92 casesHigh55–59 yearsEarly
39 (27%)64 casesIntermediate60–64 yearsLate
45532118 (55%)39 casesLow60–64 yearsLate
25 (15%)92 casesHigh55–59 yearsEarly
39 (27%)64 casesIntermediate60–64 yearsLate
41 (3%)20 casesVery low60–64 yearsEarly
  1. *

    Bayesian information criterion for evaluating the goodness-of-fit of obtained clustering.

  2. Maximum incidence given in cases per 100,000 women-years.

  3. Five-year age group in which the maximum incidence is located.

Table 2
Clustering of cervical cancer incidence of Indian states based on clustering of registries.
State/group of states*
2-Clustering3-Clustering4-Clustering
1(low, late)2(high, early)1(low, late)2(high, early)3(interm., late)1(low, late)2(high, early)3(interm., late)4(very low, early)
Andhra Pradesh
Assam●●●●●●●●
Delhi
Gujarat+Dadra and Nagar Haveli
Karnataka
Kerala +Lakshadweep●●●●●●
Madhya Pradesh
Maharashtra●●●●●●●●●●●●●●●●●●●●
Manipur●●●●●●
Other North Eastern states
●●●●●●●●●●●●●●●●●●●●●●●●●
Punjab +Chandigarh
Sikkim
Tamil Nadu +Puducherry
West Bengal +Andaman and Nicobar Islands
  1. Each circle represents the count of one registry being assigned to the corresponding cluster. Grey shading represents the cluster including the highest number of registries, either exclusively or in a draw with another cluster.

  2. Cluster labels and the corresponding patterns of maximum incidence and maximum incidence age group given in the second row were defined in the third, sixth, and eighth columns of Table 1, respectively.

  3. *

    States/or groups of states were defined as reported in the 2006 National Behaviour Surveillance Survey of the National AIDS Control Organization of India (National Behavioural Surveillance Survey: General Population, 2006).

  4. Other North Eastern states included Arunachal Pradesh, Nagaland, Meghalaya, Mizoram, and Tripura.

Table 3
Identified and classified cluster of cervical cancer incidence pattern by Indian state.
Cervical cancer incidence data*State/group of statesIdentified clusterClassified cluster§Probability of belonging to the low-incidence cluster
AvailableAndhra PradeshLowLow0.60
AssamLowLow0.69
DelhiHighHigh0.42
Gujarat+Dadra and Nagar HaveliLowLow0.69
KarnatakaHighLow0.63
Kerala+LakshadweepLowLow0.60
Madhya PradeshHighHigh0.44
MaharashtraLowLow0.57
ManipurLowLow0.65
Other North Eastern statesHighLow0.53
Punjab+ChandigarhHighHigh0.41
SikkimLowLow0.63
Tamil Nadu+PuducherryHighHigh0.38
West Bengal+Andaman and Nicobar IslandsLowLow0.71
UnavailableBihar-Low0.67
Chhattisgarh-Low0.66
Goa+Daman and Diu-Low0.54
Haryana-Low0.66
Himachal Pradesh-Low0.58
Jammu and Kashmir-Low0.63
Jharkhand-Low0.71
Orissa-Low0.68
Rajasthan-Low0.66
Uttar Pradesh-Low0.64
Uttarakhand-Low0.69
  1. *

    Availability of cervical cancer incidence data was based on the incidence data from volume XI of Cancer Incidence in Five Continents (CI5) (Bray et al., 2017) and the 2012–2016 report of the National Centre for Disease Informatics and Research (NCDIR) (Report of National Cancer Registry Programme, 2020).

  2. States/groups of states were defined as reported in the 2006 National Behaviour Surveillance Survey of the National AIDS Control Organization of India (National Behavioural Surveillance Survey: General Population, 2006).

  3. Identified clusters derived in the Clustering step.

  4. §

    Classified clusters derived in the Classification step. A given state was classified to the low-incidence cluster if the probability of belonging to the low-incidence cluster (given in the next column) was above 0.50. For the Indian states with available cervical cancer incidence data and hence already in an identified cluster, classification was done for the purpose of validation.

  5. Other North Eastern states included Arunachal Pradesh, Nagaland, Meghalaya, Mizoram, and Tripura.

Additional files

Download links

A two-part list of links to download the article, or parts of the article, in various formats.

Downloads (link to download the article as PDF)

Open citations (links to open the citations from this article in various online reference manager services)

Cite this article (links to download the citations from this article in formats compatible with various reference manager tools)

  1. Irene Man
  2. Damien Georges
  3. Maxime Bonjour
  4. Iacopo Baussano
(2023)
Approximating missing epidemiological data for cervical cancer through Footprinting: A case study in India
eLife 12:e81752.
https://doi.org/10.7554/eLife.81752