Abstract
Respiratory syncytial virus is the leading cause of lower respiratory tract infection among infants. RSV is a priority for vaccine development. In this study, we investigate the potential effectiveness of a twovaccine strategy aimed at motherstobe, thereby boosting maternally acquired antibodies of infants, and their household cohabitants, further cocooning infants against infection. We use a dynamic RSV transmission model which captures transmission both within households and communities, adapted to the changing demographics and RSV seasonality of a lowincome country. Model parameters were inferred from past RSV hospitalisations, and forecasts made over a 10year horizon. We find that a 50% reduction in RSV hospitalisations is possible if the maternal vaccine effectiveness can achieve 75 days of additional protection for newborns combined with a 75% coverage of their birth household coinhabitants (~7.5% population coverage).
Introduction
Respiratory syncytial virus (RSV) is the most common viral cause of acute lower respiratory infection (Nair et al., 2010). A large majority of children contract RSV by the age of two (Glezen et al., 1986; Ohuma et al., 2012), but the chance of developing severe disease from a RSV infection is much greater amongst young infants (6 months) (Hall et al., 2009) and decreases rapidly with the age of the infected child. Vaccine development aimed at protecting young children against RSV disease has become a global health priority (World Health Organization, 2017). As of December 2018, there are over 40 RSV vaccines in development (PATH, 2018). In particular, two vaccination approaches have been identified as potentially effective: a single dose vaccine aimed at motherstobe leading to antibody transfer across the placenta thereby boosting maternally acquired immunity among newborns, and paediatric vaccination aimed directly at infants (Modjarrad et al., 2016; World Health Organization, 2017). Moreover, it is possible that a prophylactic extended halflife monoclonal antibody could act as a vaccine surrogate whilst replicating the desired effect of a maternal vaccine (Zhu et al., 2017; Domachowske et al., 2018). A serious complication in RSV vaccine development has historically been the risk of causing enhanced disease amongst the immunologically naive (Chin et al., 1969), therefore it might be more prudent to target a paediatric vaccine at older children with better developed immune systems rather than young infants most at risk of RSV disease (Anderson et al., 2013). Epidemiological data suggests older individuals (elder siblings, parents) are potential sources of infection for the infant of the household (Graham, 2014), for whom temporary boosted immunity might best be achieved using a subunit vaccine (Anderson et al., 2013).
The desired effect of vaccinating older children is twofold: the vaccine both decreases the risk of morbidity in the vaccinated child and reduces the risk of transmission from the older child to any young infant the vaccinated child contacts (Anderson et al., 2013). Molecular analysis of nasopharyngeal samples collected from a semirural community in Kenya has identified that the majority of RSV infections among young infants originated from within their household rather than the wider community, with older siblings being the usual household index case (Munywoki et al., 2014), echoing a previous household study of RSV transmission (Hall et al., 1976), although it should also be noted that the young infant was herself the index case on a significant number of occasions. This finding emphasises that reducing transmission to young infants within the household could be an effective way of reducing RSV disease in low and middleincome countries (LMICs). However, the significant number of young infant index cases within households suggest that ‘cocooning’ young infants from transmission by vaccinating others in their household may not be sufficient by itself. Ideally, cocoon protection should be achieved in conjunction with directly protecting the young infants using a maternal vaccine.
At this time, the only reported phase III trial on RSV vaccine effectiveness is for the maternally targeted ResVax, which failed to meet its primary objective but nonetheless showed partial effectiveness at reducing hospitalisations due to RSV (NovaVax, 2019). The possibility that a vaccine for only one target population might be only partially effective, and the importance of RSV transmission within the household, motivates our modelling approach. In this paper, we assess the efficacy of a mixed vaccination strategy in a LMIC setting, Kilifi county Kenya. In our scenarios, there was at least one maternal vaccine and one paediatric vaccine available as per WHO priority (World Health Organization, 2017). In Kenya, there are very high rates of prenatal contact between pregnant women and health professionals (97.5% in Kilifi county; KNBS, 2015). This suggested targeting pregnant women as part of their prenatal contact, and then offering the paediatric vaccine to all over one year olds, including adults, cohabiting with the pregnant mother. The essential idea was to leverage prenatal contact to achieve a very high coverage of a maternal antibody boosting (MAB) vaccine, and also to target her household cohabitants with an immune response provoking (IRP) vaccine. The IRP vaccine elicits an immune response and, therefore, a temporary reduction in susceptibility to RSV for the vaccinated individual. We follow (Yamin et al., 2016) in assuming that the elicited period of immunity to RSV from receiving the IRP vaccine would be similar to that of a natural infection.
Predictions of vaccine effect are derived from a dynamic transmission model designed to capture the demographic structure of the population, the seasonality of RSV transmission and how rapidly, and to whom, RSV is transmitted in both households and the wider community. Unknown model parameters were inferred using data from the largescale longrunning Kilifi Health and Demographic Surveillance System (KHDSS; Scott et al., 2012), and hospitalisation admissions at Kilifi county hospital (KCH) confirmed as due to RSV since 2002. It should be noted that targeting vaccination in this way is not an approach that one would expect to greatly reduce RSV infections under the assumptions of simple compartmental models of RSV transmission because the rate of vaccination deployment would be too low (see Box 1). However, we shall see that these vaccines are efficiently targeted at creating protection for the young infants most at risk of hospitalisation if they caught RSV.
Box 1.
Vaccination predictions from a simple unstructured RSV epidemic model.
The essential idea in this paper is to use prenatal contact between motherstobe and health professionals to deploy two separate vaccines: first, a vaccine targeting the motherstobe which boosts the duration of protection her newborn will have against RSV (MAB vaccine), and second, a vaccine aimed at the motherstobe’s household cohabitants giving each a period of RSV immunity, equivalent to that of a natural infection (IRP vaccine). As a baseline for understanding RSV transmission we can use a simple mechanistic model which captures the essential biology of RSV infection; newborns are born with a period of immunity to RSV infection which is lost during their first year of life, after contracting RSV the individual is infectious for a period before gaining temporary waning immunity to RSV reinfection. Assuming homogeneous transmission the dynamics of the simple RSV transmission model can be described using four dynamic variables describing the numbers of currently maternally protected individuals (M), susceptibles (S), infecteds (I) and immune/recovereds (R). The evolution of the epidemic, after vaccination, can be given as a standard ODE:
where each term above describes the rate of events that change the epidemic state: Births ($B$), loss of maternally derived protection after MAB vaccination, (${\alpha}_{vac}$), mortality (µ), RSV force of infection ($\beta I/N$), recovery ($\gamma $), reversion to susceptibility ($\nu $), as standard in the literature (Anderson and May, 1991; Keeling and Rohani, 2008). The rate at which IRP vaccines successfully vaccinate susceptibles is $B\u27e8H\u27e9{V}_{cov}S/(S+I+R)$; that is the mean size of a pregnant woman’s household ($\u27e8H\u27e9$) times the effective coverage of the vaccine ($0\le {V}_{cov}\le 1$) time the likelihood of selecting a susceptible and not wasting the vaccine assuming that we are only targeting those who have definitely lost their maternal protection to RSV ($S/(S+I+R)$). For simplicity, we can treat the duration of maternal protection as very short compared to the typical person’s lifetime (i.e. ${\alpha}_{vac}\gg \mu $). The equilibrium of the simple RSV model is analytically tractable (see appendix 2):
where ${R}_{0}=\beta /(\gamma +\mu )$ is the reproductive ratio of RSV, and we are assuming that the birth rate is at replacement $B=\mu N$. The simple RSV model makes some general predictions about the efficacy of IRP vaccination:
Therefore, a naive simple model of RSV transmission is pessimistic about the joint vaccination strategy. However, in this study, we also account for more detailed social structure, differential susceptibility, infectiousness, and risk of disease dependent on the age of the individual and seasonality in transmission. We will see that targeting vaccines socially close to young infants is much more effective than the simple model predicts.
The MAB vaccine does not significantly effect transmission in the general population.
The efficiency of the IRP vaccine (avoided infections per effective dose) should not change with coverage.
Using parameters typical of the study population at Kilifi (see appendix 2), the reduction in RSV transmission due to IRP vaccination can be modest because the deployment rate is too low; for ${R}_{0}=2$ the maximum achievable reduction in transmission is < 4% compared to no vaccination.
The modelling approach used in this paper differs from the majority of RSV modelling approaches extant in the literature, which largely focus on deterministic age structured transmission models (Pitzer et al., 2015; Kinyanjui et al., 2015; Yamin et al., 2016; Hogan et al., 2016). In contrast, we explicitly model the social clustering of individuals into households. The advantage of explicit inclusion of household structure in the model is that the social contacts within the household are persistent over multiple RSV seasons, whereas agestructured models implicitly assume random mixing; that is all people of a given age group are equally likely to be contacted by any individual at any instant and therefore the chance of repeated contact become zero as the population size becomes large. In the specific case of modelling highly seasonal RSV transmission, it is likely that capturing the networklike transmission structure of the population is important for representing the relevant epidemiology. Most people have caught RSV by the age of two, and will have multiple repeated episodes during their lifetime. The time between recovery from an episode and reversion back to at least partial susceptibility is estimated to be 6 months (Ohuma et al., 2012). In Kilifi county, there are sharp annual peaks of RSV hospitalisation at each seasonal RSV epidemic, and so one should expect the population to consist of large numbers of entirely susceptible individuals, who have never caught RSV before and are primarily in their first 2 years of life, and partially susceptible individuals, who have caught RSV at least once before, due to the interepidemic period being longer than the typical time over which loss of immunity to RSV occurs. These general considerations suggest that (i) RSV seasonal epidemics will be akin to repeated invasions of a nearly susceptible population, that is closer to an epidemic scenario than an endemic scenario, and (ii) RSV transmission is much closer to a SIS rather than a SIR paradigm. Social network effects in epidemiological forecasting are most important during an epidemic invasive growth phase and are typically more important for SIStype dynamics with persistent contacts (Miller, 2009; Sun et al., 2015). Both these features appear to be important for seasonal RSV transmission in Kilifi and therefore provide strong motivation for the networktype epidemic model we have used.
Two possible explanations for the comparative lack of using household structure in RSV modelling are: first, accounting for the interplay of demography and household structure remains a significant modelling challenge (Glass et al., 2011; Geard et al., 2015), and second, the dynamics of age structured transmission models can be predicted using a comparatively small set of deterministic rate equations (Keeling and Rohani, 2008). Moreover, whenever natural immunity is longlasting and/or high levels of effective vaccination coverage exist for the population, household structure is less important and can be captured using simple approximations, for example, the motherchild contact approximation (Atkins et al., 2016). As a possible alternative modelling framework stochastic individualbased models (IBMs) for epidemics benefit from additional realism and flexibility compared to deterministic models, and there does exist at least one modelling study considering the effect of social structure on RSV transmission using a nonseasonal approximation within a stochastic individualbased model (IBM) (Poletti et al., 2015). However, rigorous inference of model parameters for stochastic IBMs of epidemics is highly challenging because, along with other difficulties, the random infection times of each case will not typically be known (O’Neill and Roberts, 1999). The model used in this paper required a rate equation for each possible household configuration (House and Keeling, 2008). Specifically for RSV modelling it has been noted that this could lead to thousands of rate equations that must be simulated simultaneously (Kinyanjui, 2014), effectively rendering the model impractical for regression against data due to slow integration time. Nonetheless, this work demonstrates that by making appropriate simplifications, and using numerical solvers adapted to large systems (in this case ~2000 variables), it was possible to both include realistic household structure and rigorously infer model parameters for a model of RSV transmission in a LMIC setting.
Results
The RSV transmission model parameters were either drawn from the RSV literature or inferred from agestratified weekly hospitalisations at Kilifi county hospital (KCH) between 2002 and 2016. The underlying biology of the transmission model was similar to a simple compartmental model of RSV infection and waning immunity (see Box 1) with two main differences: (i) the age of the individuals affected their susceptibility to RSV, infectiousness after contracting RSV, duration of RSV infectiousness, and likelihood of developing severe disease and being hospitalised after contracting RSV, partly because of agespecific effects, and partly because we assumed that every person had caught RSV at least once after their first year of life, and (ii) infectious contacts were distributed at two levels of social mixing differentiating between persistent contacts between household cooccupants and randomly assigned contacts within the community of Kilifi county based on the ages of the infected and infectee (Figure 1 and Materials and methods). The joint age and household distribution of the population accessing KCH was chosen to match the ongoing findings of the Kilifi Health and Demographic surveillance system (KHDSS; Scott et al., 2012). The seasonality of RSV hospitalisations at KCH has historically been erratic with peak months for RSV hospitalisation varying as widely as November to April (appendix 1). Moreover, over the 15year period we are studying in this paper, there was demographic change in the underlying population both in age profile and household size distribution. We addressed these modelling challenges: first, by rejecting the typical epidemiological modelling assumption that population demographic structure is at equilibrium in favour of directly modelling demographic change, and second, by treating the shifting seasonality of RSV transmission in Kilifi as being driven by an underlying latent random process to be jointly inferred with model parameters. The goal was to account for factors influencing the rate of hospitalisations that changed over the 15 years of study so as to get an unbiased estimate of parameters we assumed were static over the period, such as the persontoperson rate of transmission within a household. We were able to broadly capture the yeartoyear variation in hospitalisation, and age profile of the hospitalised, with only six free parameters (Figure 2, Materials and methods, and appendix 1). The 2005/2006 RSV year (see appendix 1 for RSV year definition) was anomalous in that there were three peaks in RSV hospitalisation separated by at least a month: two smaller peaks on 11th Dec 2005 and 24th Mar 2006 around a larger peak on 24th Feb 2006. The model was unable to explain this unusual year, other years having solitary peaks. Outside of the 2005/2006 RSV year there were 2174 hospitalisations during the period of study compared to a model prediction of 2147 hospitalisations ([2057, 2238] 95% prediction interval ). We were unable to jointly identify the rate of school children contacting other school children with the rate of homogeneous contact among all over one year olds, therefore we considered a range of within school contact rates, and for each value inferred the other six free model parameters and assessed the efficacy of vaccination for a range of MAB vaccine effectiveness values and IRP vaccine coverage values. Each scenario gave similar results for the efficacy of household targeted vaccination (see appendix 3), therefore we have only presented results in the main Results section for the scenario with the highest rate of within school mixing. At KCH all RSV hospitalisations occurred in the under five year olds with 84% of hospitalisations occurring in the under 1 year olds (Figure 2B). This finding is consistent with the much higher rates of hospitalisation per RSV infection for younger infants (Kinyanjui et al., 2015). However, the hospitalisation time series has to also be understood in the context of dynamic RSV transmission and demographic change in the study population. A general trend of increasing hospitalisations between 2002–2009 is at least partially explained by a 16% increase in under ones in the population over that period. The rest of yeartoyear variation in hospitalisation was explained by seasonal epidemic dynamics, themselves driven by shifting seasonality (Figure 2A; 1).
We found that, prevaccination, school age children suffered on average the highest force of infection, that is the percapita rate of infectious contacts, from outside of the household followed by under 1 year olds (Figure 3A). This finding was dependent on assuming that we had a high degree of homophily in the social contacts of schoolage children (the high within school transmission scenario mentioned above). Other scenarios were considered with lower levels of ingroup preference for schoolage children to contact other schoolage children; in the alternate scenarios, the parameter imputation process found slightly higher rates of contacts within the household and homogeneously outside of the household but lead to very similar results (appendix 3 ). The infectious contacts outside the household were distributed predominantly to individuals within households of size 2–5 (Figure 3). This reflected the household distribution of the population; school children and under ones who were most at risk of making social contact with those infected with RSV outside the household tended to live in households of this size (Figure 3B).
Force of infection is a less natural concept for measuring within household infection due to small numbers of individuals per household, and intense frequent contacts. Instead, we measured the true rate of RSV transmission between individuals cohabiting a household. The highest percapita rates of infection within households were for 7 year olds (Figure 3C); this reflected the typical age of individuals within the households most at risk of RSV introduction and with severest transmission rates after introduction. The infection rate among under ones increased rapidly until it plateaued at ~6 months old. The rapid increase in percapita infection rate was due to waning of maternally acquired immunity to RSV, which we inferred as lasting on average 21.6 days ([17.2, 26.1] 95% CI; see Table 3 for all inferred parameters). The total infection rate within households was greatest in size 5 and 6 households (Figure 3D). This differed from the household size where each person was at most risk of contracting RSV outside the household. Two factors shifted the burden of RSV infection to larger households: first, there are more people in larger households therefore risk of RSV introduction can be higher even if the perperson rate is lower, and second, the intensity of transmission within households is higher for larger households.
We evaluated a series of scenarios where a combination of a maternal antibody boosting (MAB) and an immune response provoking (IRP), vaccine were targeted at, respectively, motherstobe in their third trimester, and their household cohabitants upon the birth of the newborn. Between scenarios we varied (i) the effectiveness of the MAB vaccine, (ii) the coverage of the MAB vaccine, and (iii) the household coverage of the IRP vaccine, see Table 1 for a list of all vaccination scenarios modelled in this paper. The protective effect of the vaccines on individuals was the same as for the unstructured population model presented in Box 1: the MAB vaccine increased the period over which a newborn was protected from RSV by maternally acquired antibodies, and the IRP vaccine, given to all household cohabitants of some participating motherstobe, initiated an immune response in the vaccinated which gave a period of protection from acquiring RSV similar to that following a natural infection. The high prenatal contact levels in Kilifi county suggested that vaccination coverage of motherstobe had the potential to be very high, especially if maternal immunisation to boost newborn immunity became an established method for a range of vaccines including influenza and Group B Streptococcus. However, an available MAB vaccine might only be effective if delivered in the third trimester of pregnancy and, whilst having at least one prenatal contact is very common for pregnant women in Kilifi county, it is not clear that prenatal contact always occurs at the relevant stage of pregnancy. Therefore, we consider both an optimistic scenario (100% MAB coverage), and a more conservative uptake (50% MAB coverage). The number of days of additional maternally derived protection donated to the newborns by MAB vaccinated mothers was uncertain, we considered a range of MAB protection 0–90 days. We assumed that if the pregnant mother’s household cohabitants agreed to receive an immune response provoking vaccine then all were vaccinated at the birth of the newborn to maximise the overlap between the protection period of the cohabitants and the first months of life of the newborn. As is common in vaccine strategy analysis, we combine coverage and effectiveness into one effective coverage (coverage times effectiveness c.f. Keeling and Rohani, 2008), although in this case effective coverage could be considered both within and between households.
We assumed that the maximum coverage of the vaccine would be reached within a year, and considered 10 years of RSV transmission after this implementation. When inferring model parameters we took care to account for the known changes in demography over the study period, both in the age and the household occupancy distributions of the population. However, for the 10year forecasting in this paper, we assumed that the total birth rate was constant (8601 per year), and that the population age and household occupancy distributions remained static. The model inference stage included inferring the statistics of yearly variation in RSV seasonality. The decrease in rates of RSV hospitalisation and infection due to vaccination over ten years presented are median improvements over 500 independent realisations of random future seasonal patterns compared to a baseline of no intervention. If the MAB vaccine was unavailable or ineffective (0 days MAB protection), we found that it was still possible to reduce RSV hospitalisations by up to 25% using only the IRP vaccine on the household members of young infants at time of birth (Figure 4A and B). If 100% maternal vaccination could be achieved then the MAB vaccine was more successful as a sole vaccine option compared to IRP vaccination; in the sense that 90 days of additional protection from RSV delivered a 45% reduction in hospitalisation even with no IRP vaccine coverage. Nonetheless, even with an effective MAB vaccine there was added benefit to also using a IRP vaccine; a greater than 50% reduction in hospitalisations was achieved with a MAB vaccine that gave 75 additional days of RSV protection and a 75% coverage of the pregnant womens’ households (Figure 4A; a colorblindfriendly version of this plot can be found as appendix 4 Fig D). If only 50% maternal vaccination coverage could be achieved then unsurprisingly also using the IRP vaccine became relatively more important. The mixed vaccination strategy that achieved better than 50% hospitalisation reduction with 100% maternal coverage achieved 38% reduction in hospitalisations with 50% maternal coverage (Figure 4B); halving the maternal coverage didn’t necessarily halve the success of the vaccination programme so long as IRP vaccine was also available. Improving the effectiveness of the MAB vaccine caused a significant improvement in hospitalisations, but had an almost negligible effect on the total infections in the population (Figure 4C and D). IRP vaccination was more effective at reducing total RSV infections, but even at 75% coverage of the households of women giving birth the reduction in infections was $<4$% (Figure 4C and D). That IRP vaccination had a modest effect on the true infection rate, and that MAB vaccination has a negligible effect on the true infection rate, was in line with the prediction of the simple nonseasonal RSV model (Box 1). However, the simple model cannot predict that the percentage reduction in hospitalisations would be significantly greater than for total infections because of the direct and indirect protection of those most at risk of disease. For the mixed strategy achieving a 50% reduction in RSV hospitalisations described above (75 days direct MAB protection at 100% MAB coverage with 75% IRP household coverage), the seasonal dynamics of hospitalisations postvaccination equilibrated rapidly (Figure 5A). There was a reduction in median hospitalisations in every age group, but predominantly in 0–3 month years old (who are nearly all protected by the MAB vaccine) and 3–6 month year olds (Figure 5B). However, targeting pregnant women and their cohabitants did not prevent sufficient RSV infections as to significantly disrupt RSV transmission within the population at large, which may explain the rapid approach to new RSV hospitalisation dynamics. Nonetheless, those who were protected were overwhelmingly among those at most risk of disease if they had caught RSV.
Each vaccine used decreased the expected number of RSV infections and hospitalisations. As well as measuring the overall effectiveness of RSV vaccination (see above), we also measured the efficiency of vaccination, defined as number of infections or hospitalisations averted per vaccine (of either type). Unsurprisingly, as the duration of protection given by the MAB vaccine increased the efficiency of vaccination also increased; significantly for hospitalisations (Figure 6A) and marginally for infections (Figure 6B). This was true whether an IRP vaccine was used, or not. If there is no MAB vaccine available then the efficiency of using only IRP vaccination doesn’t change with coverage; that is that when increasing IRP household coverage the improvement per vaccine used stayed static, in line with what one might expect from a homogeneous mixing RSV model (see Box 1). However, when MAB and IRP vaccines were used in conjunction there was an efficiency penalty due to redundancy in the each vaccine’s protective effect. For example, if a MAB vaccine was available that gave 90 days protection the marginal benefit in terms of decreased hospitalisations of having an IRP vaccine was decreased because most atrisk infants were already protected by the MAB vaccine (Figure 6A). Using two types of vaccine always decreased infections and hospitalisations (see above), but the total reduction was always less than simply adding the reductions of each vaccine in the absence of the other.
Discussion
Our modelling analysis suggested that a highcoverage vaccination campaign of motherstobe with a vaccine inducing elevated levels of transplacenta RSV antibody transfer to her newborn, alongside targeting the newborn’s cohabitants with a generic vaccine that provoked a period of immunity to RSV can achieve greater than 50% reduction in hospitalisations due to RSV. This combined vaccination strategy suggested itself due to the high prenatal contact rates between motherstobe and health professionals in Kilifi county, Kenya (97.5% KNBS, 2015). We found that the combined vaccination strategy was efficient at targeting effort towards directly protecting young infants most at risk of developing RSV disease with boosted antibodies, and filling in any gap in protection with indirect cocoon protection within the household using a vaccine aimed at older cohabitants. Even at maximum effective household coverage for the IRP vaccination only ~10% of the population were vaccinated each year with a modest reduction in the RSV infection rate of ~5%. Nonetheless, at that coverage IRP vaccination alone achieved a 25% reduction in hospitalisations at KCH even without an effective MAB vaccine to provide direct protection to young infants. This demonstrated that although we were vaccinating at a low rate compared to population size, with only a modest reduction in infection rate, those people we did vaccinate were efficient at cocooning young infants from transmission and therefore risk of severe disease. If an effective MAB vaccine was also available the reduction in hospitalisations was greater, although the additional protection due to cocooning was relatively less since young infants were also protected from contracting RSV at the age when they were at most risk of severe disease.
We constructed the model used in this paper with the purpose of estimating the efficacy of targeting pregnant women and their households for vaccination. In order to make predictions mechanistic models of disease transmission must approximate the social structure of the population being modelled, and hence the contact rates between individuals. The focus on household transmission in this paper necessitated including households into the modelled social structure; this represented significant additional effort in model construction, computational resource and inference compared to simpler models. A more common approach in the literature is to treat the contact rates between individuals as being determined only by their respective ages. This approach has the benefit of being conceptually straightforward and draws on a number of recent and highquality studies which quantify social contact patterns by age stratification (Mossong et al., 2008; Kiti et al., 2014; Prem et al., 2017). However, the fundamental theory of agestructured transmission models for endemic diseases was developed mainly with reference to diseases that induce very long term or lifelong immunity (Anderson and May, 1991). For diseases provoking longlasting immunity, one would expect most older household members to be immune and therefore household structure to be a relatively less important factor in predicting risk of transmission compared to the agestructured transmission outside of the household. Indeed, simulation study of a generic strongly immunizing infection with realistic demography found limited difference in predicted incidence rate by age for people at schooling age or older between models with household structure and age structure compared to models with only age structure (Geard et al., 2015). However, it is not clear that neglecting household structure is a good approximation for modelling seasonal RSV transmission for two reasons: first, previously infected people lose effective immunological protection to RSV rapidly enough that each season could be closer to an ’epidemic’ scenario rather than an ’endemic’ scenario. Second, every hospital admission at KCH confirmed as due to RSV was a preschool aged child; in contrast to predicted incidence rates for school age and older individual, the simulation study cited above (Geard et al., 2015) predicted that incidence was lower for 0–5 year olds, especially so for under 1 year olds, once household structure was taken into account. It would be of great interest to have a more general theoretical understanding of which epidemiological questions require household structure, or a more general metapopulation structure, for epidemiological modelling, and which don’t. This remains an active area of research (Ball et al., 2015).
A cocooning protective effect of households could explain the big discrepancy between our estimate of the mean period of protection against RSV after birth due to transplacental transfer of antibodies from mother to baby in the the womb (21.6 days of natural protection on average) compared to a RSV transmission modelling study by Kinyanjui et al on the same population using an agestructured model (Kinyanjui et al., 2015) (2.3 months of natural protection if the age mixing was based on diary estimates of contacts (Kiti et al., 2014) or 4 months of natural protection if the age mixing was based on household cooccupancy and schooling ages). The agestructured model used in the Kinyanjui et al study reported high or very high reproductive ratios: 7.08 for the diary based contact patterns, and 25.60 for the household cooccupancy and schooling age based contact pattern. Therefore, to fit the KCH hospitalisation data the age structured model necessarily predicted a very high level of natural protection due to maternal antibodies to compensate for the predicted high force of infection on young infants. In our model, we included household structure and we fit to the same KCH data but with a much lower level of natural protection from RSV. This in turn changes the guidance modelling gives to vaccination strategy; some age structured RSV transmission models have emphasized reducing force of infection by vaccinating infants directly (Kinyanjui et al., 2015), and find that maternal vaccination is likely to be of limited impact (PanNgum et al., 2017), because they have inferred that the RSV reproductive ratio is high and, therefore, natural protection to RSV is also inferred to be high. In contrast, we infer that natural protection to RSV is low and therefore find that maternal vaccination in combination with elevating the cocoon protection to young infants provided by vaccinating household coinhabitants is a highly efficient strategy. Another agestructured RSV transmission model (Yamin et al., 2016) has found that vaccinating underfives to RSV along with their influenza vaccination was highly efficient because of the large number of secondary cases generated per infected underfive year old. Again, it is not clear whether this result extends to a population structured into households where it is known that clustering in contacts has a complex interplay with disease dynamics, either reducing spread because infectious contacts are ‘trapped’ in the local cluster (e.g. the household) or promoting spread by enhancing persistence (Miller, 2009; Sun et al., 2015).
This was a modelling study and, as ever, there are factors that we have neglected in our analysis that could be addressed in future work. First, we treated coverage of the maternal vaccine and the IRP vaccine as independent. In reality, the simplest and cheapest scenario whereby the household cohabitants of pregnant mothers are recruited to the vaccination programme is if they attend prenatal contact with the mothertobe. The percentage of pregnant women for have at least one prenatal contact in Kilifi county is high (97.5%; KNBS, 2015), however it is not clear that prenatal contact always occurs in the mothertobe’s third trimester. Both the MAB and IRP vaccines are likely to be best deployed late in the pregnancy, in order to maximise direct protection from the MAB vaccine and the duration of indirect protection from the IRP vaccine for the newborn. This means that if the only prenatal contact with the mothertobe is relatively early in her pregnancy then both the MAB and IRP vaccines might fail; that is the households outside of MAB coverage are also likely to be those outside of IRP coverage violating our independent deployment assumption. Our results suggest that a MAB vaccine at a high coverage sharply reduces RSV hospitalisation even when the amount of additional protection is low (15 days) and if the MAB vaccination coverage is reduced to 50% IRP coverage becomes relatively more important to reducing hospitalisations. To avoid having many household unprotected by both MAB and IRP vaccination, it could be cost effective to devote extra resources towards encouraging pregnant women, and their cohabitants, who present early in the pregnancy to return for vaccination later in the pregnancy. Second, the cost per vaccine remains unknown and we have not considered any measurement of the burden of disease other than hospitalisations at KCH. RSV hospitalisations have been identified as a crude proxy for the true disease burden; the passive reporting of RSV hospitalisation can vary for reasons completely independent of RSV epidemiology (Modjarrad et al., 2016). Third, despite accounting for demographic change in our inference of model parameters we neglect demographic change in our forecasting, concentrating instead on predicting the reduction in hospitalisations compared to a baseline of a static population without intervention. Including demographic change in our parameter inference step allowed us to disentangle seasonal variation in hospitalisation from simply changing numbers of atrisk children. The demography in Kilifi will continue to change in the future, the crude birth rate in Kilifi has followed a declining trend in line with the rest of Kenya. However, this leads to a total birth rate which is much closer to static (~8500 births per year), and therefore the number of atrisk underones has been approximately static since 2009. We avoided exploring complications such as the effect increased crowding within households might have on the risk pernewborn in this paper by assuming that the rest of the population was also static over the 10 years of forecasting. Further exploring more detailed issues around shifting patterns of household cohabitancy would be an interesting avenue to explore in future work. Our primary goal in this paper has been to establish the importance of thinking jointly about hospitalisation risk, population structure (in particular household cooccupancy) and future vaccination programmes. We have demonstrated that, all other things be equal, combining partially effective vaccines can be complementary in a householdstructured setting. These issues would suggest that RSV vaccination policy would benefit from further costbenefit analyses tailored to LMIC settings, possibly using more flexible stochastic IBMs with the model parameters inferred in this study.
In conclusion, in this paper, we have analysed the performance of a joint maternal and household targeting RSV vaccination strategy measuring both reduction in hospitalisations and the true population incidence rate. We drew our conclusions based on rigorous inference of underlying transmission parameters and the inherent protection to RSV newborns received from their mothers, taking into account potential confusing factors such as variable seasonality and demography. Two central insights from our study were that the duration of natural protection to RSV that newborns inherit from their mother was likely to be much shorter than previously estimated and that RSV attack rates within the household were significant in maintaining RSV transmission. Therefore, targeting pregnant women and their households for RSV vaccination is likely to be an effective and efficient strategy under a wide range of different scenarios.
Materials and methods
The dynamical RSV model used in this paper simulated infection and transmission of RSV among a population described by the Kilifi Demographic and Health surveillance system (KHDSS Scott et al., 2012) between September 2001 and September 2016. The population was assumed to mix and transmit RSV at two social levels: within their household and outside their household among the wider community. RSV infection was modelled using a modified version of the classic susceptible, infected, recovered (SIR) compartmental framework (Anderson and May, 1991; Keeling and Rohani, 2008). The main modifications were consistent with previous RSV transmission models; we assumed that: (i) individuals were born with a temporary immunity to RSV which faded over time, and (ii) RSV infection episodes provide individuals with only temporary protection from reinfection (mean 6 months Scott et al., 2006; White et al., 2007; Moore et al., 2014; Pitzer et al., 2015; Kinyanjui et al., 2015; Yamin et al., 2016). The high dimensionality of the ODE model (see below) used in this paper necessitated a relatively simple compartmental structure for RSV infection progression, therefore the population is only crudely age stratified into underone year olds (U1s) and overone year olds (O1s). However, more detailed information about the age of the individuals in the model was available by considering their age distributions conditional on their crude age category and the type of household they inhabited (see below). After an initial RSV infection there is evidence that individuals retain reduced susceptibility to subsequent RSV infection (Henderson et al., 1979; Hall et al., 1991), and will potentially have less infectious asymptomatic episodes if infected (Hall et al., 2001; Yamin et al., 2016). Some RSV transmission models, using simpler social structures, therefore allow individuals to be characterised by both their age and their number of previous RSV infections (Kinyanjui et al., 2015; Yamin et al., 2016). In the model used in this paper, we assumed that all U1 individuals susceptible to RSV were at risk of their first RSV episode and that all O1 individuals had already been infected at least once, since reinfection within the same yearly epidemic is unlikely but nearly everyone has caught RSV by the age of two years old (Glezen et al., 1986).
Joint distributions of age and household occupancy
Request a detailed protocolAs mentioned above, the high dimensionality of the RSV transmission model with two levels of social mixing was a limiting factor on the possible complexity of the compartmental framework representing the possible combinations of age and disease state (see appendix 2). In order to both capture the structure of the population in households and incorporate finergrained information about the ages of the modelled individuals, we calculated empirical joint distributions for the proportion of individuals of different ages in various household sizes, and whether that household contained an underone year old. We did not restrict the age categories of this joint ageandhousehold distribution to just underone or overone, instead preferring finergrained age categories: (i) each month of first year of life, (ii) each year of life aged 1–18 and (iii) 18+ years old. We used the Kilifi health and demographic surveillance system (KHDSS; Scott et al., 2012) to construct the joint distributions, which records for each individual a unique person ID, a birth date, immigration into the KDHSS date(s), outmigration from the KHDSS date(s), and a unique building ID for where they live during their time in the KHDSS. By combining this data we could calculate,
where ${N}_{t}(a,n,U)$ was the number of individuals on day $t$ who were jointly in age category $a$, lived in a household of size $n$, which either contained at least one under one year old ($U=1$) or not ($U=0$), and ${N}_{t}$ was the total population size on day $t$. The joint distribution changed over time, we calculated ${\mathbb{P}}_{t}(a,n,U)$ for a series of yearstart days t = 1 st Jan 2000, 2001,…, 2016. We then used ${\mathbb{P}}_{t}$ as representative for the rest of the year. Because the exact birth dates where missing for a large number of people, and for model simplicity, we assumed that all U1 individuals aged to become O1 individuals at a constant rate 1 per year, which was equivalent to assuming that given that the exact age of an U1 individual was uniformly distributed between 0 and 1 years old, independently of the U1’s household configuration.
Conditional age of individuals
Request a detailed protocolThe dynamic model of transmission tracks whether individuals are underone or overone years old; however, for estimating the risk of disease per infection it was useful to use the conditional age distribution for the finergrained age category of an individual based on her dynamic model age category $a<1\text{year}$ or $a>1\text{year}$, her household size and whether the household contained an U1 or not, for example,
The conditional distributions for an individual’s household size and whether they lived in a household containing an U1 based on their age were constructed similarly. The reason we included a variable indicating whether the household of the individual contained an under one or not was because it was important to capture the pathway to transmission to the underone year olds most at risk of disease due to contracting RSV.
Model dynamics, forces of infection and susceptibility to RSV
Request a detailed protocolThe fundamental unit of the RSV transmission model developed for this paper was the household. Each household was described by the number of each type of individual inhabiting it, which we call the household configuration. The type of individual within each household was identified by her RSV disease state and age category. The RSV transmission model described the dynamics of the number of households that were in each possible household configuration using an approach introduced by House and Keeling, 2008. Mathematically, the number of households in a given household configuration at time $t$ was denoted ${H}_{{s}_{1},{i}_{1},{r}_{1},{s}_{2},{i}_{2},{r}_{2}}(t)$, referring to the household configuration with exactly ${s}_{1}$ U1 susceptibles, ${i}_{1}$ U1 infecteds, ${r}_{1}$ U1 recovered, ${s}_{2}$ O1 susceptibles, ${i}_{2}$ O1 infecteds, and ${r}_{2}$ O1 recovereds. In order to limit the number of possible household states, we included only households of total size ten or less with two or fewer under ones. We chose these limits on the household size based on capturing ≈99% of the U1s in the population, and therefore the pathway to them catching RSV (appendix 2). There were 1926 possible household configurations in the RSV transmission model. The vector $\mathit{\bm{H}}(t)$ of number of households in each possible household configuration evolved according to the semilinear ODE:
Each term describing the vector field of Equation (3) corresponded to a dynamic component of the model:
RSV transmission within households, recovery of infected individuals, loss of immunity of recovered individuals, aging from U1 to O1 and turnover in household occupancy due to births and individuals leaving the household (${A}_{t}\mathit{\bm{H}}(t)$).
RSV transmission between households due to agegroup specific mixing (${\mathit{\bm{f}}}_{t}(\mathit{\bm{H}}(t))$).
Change in household numbers due to population flux, (${\mathit{\bm{\rho}}}_{t}(\mathit{\bm{H}}(t))$).
See appendix 2 for further details. The force of infection due to transmission within a household of generic configuration (${s}_{1},{i}_{1},{r}_{1},{s}_{2},{i}_{2},{r}_{2}$) was density dependent; that is the persontoperson infection rate in the household did not depend on household size,
where $\tau $ is the basic withinhousehold transmission rate, ${\iota}_{2}$ is the relative decrease in infectiousness of O1s compared to U1s, and $\beta (t)$ is the seasonal variation in the transmission rate of RSV (see appendix 1). Transmission outside of the household within the wider community was assumed to be based on the finergrained age categories introduced above. The conditional age distributions of the individuals allowed us to construct matrices (${P}_{H\to A,t}$) to convert between the household configuration vector into a vector of number of infected individuals in each age category, weighted by their relative infectiousness, for any time $t$ during the simulation: $\mathit{\bm{I}}(t)={P}_{H\to A,t}\mathit{\bm{H}}(t)$ (appendix 2). The force of infection on each individual due to agebased mixing in the community was,
where $T$ was the community infection rate matrix and $N(t)$ was the total population size at time $t$. In this formulation, the rate at which an infected in age group $b$ creates infectious contacts in the community with individuals of age group $a$ is ${T}_{ab}N(a,t)/N(t)$ where $N(a,t)$ is the number of individuals in age group $a$ at time $t$(Keeling and Rohani, 2008). The force of infection on an individual within a given household was calculated using matrices constructed from the conditional distribution of an individual’s household type given her age, ${\lambda}_{com}={P}_{A\to H,t}{\mathit{\bm{\lambda}}}_{age}$. The total force of infection on each individual was the sum of her infectious contact rates within the household and within the community, $\lambda ={\lambda}_{hh}+{\lambda}_{com}+{\lambda}_{ext}$. Where ${\lambda}_{ext}=\u03f5\beta (t)/N(t)$ was the force of infection from outside KHDSS.
The actual infection rate for each individual was the force of infection ‘felt’ by the individual times the susceptibility of the individual. The susceptibility of underone year olds (${\sigma}_{U1}$) depended on whether or not the U1 individual was still protected from RSV by maternally acquired antibodies, which we modelled as giving a random $M$ days of protection; that is for an individual of age $A$ days, ${\sigma}_{U1}=0$ if $M>A$ and ${\sigma}_{U1}=1$ otherwise. In general, the infection status of an individual correlates with her age. However, because RSV is strongly seasonal we do not treat the age of an U1 as correlated with her susceptibility arguing that every U1 is facing her first RSV season irrespective of whether she is 1month old or 11 months old. Therefore, the mean susceptibility for underones was ${\overline{\sigma}}_{U1}=\mathbb{P}(M\le A)$. The susceptibility of overone year olds was chosen as if the individual had definitely received at least one RSV infection in the past, and definitely had no chance of being maternally protected. We modelled the duration of maternal protection $M$ as a truncated exponential distribution conditioned on being less than 1 year in duration; that is $M\sim \mathrm{exp}(\alpha )(M\le 1\text{year})$ (appendix 2).
Hospitalisation rates
Request a detailed protocolThe chance of an infected individual becoming severely diseased after contracting RSV, and then seeking care at hospital, depended on that person’s age and number of infections (Nokes et al., 2008; Ohuma et al., 2012). When an U1 was infected in the model her age at infection was given by conditioning on the age of the U1 being greater than her maternal protection period,
Which was calculated exactly (see appendices 2 and 4). This took into account that increasing the duration of maternal protection would increase the age at infection and therefore reduce the risk of disease. O1s were assumed to have no maternal protection but their conditional age depended on their household type [Equation (2)]. We used these conditional distributions to convert the incidence rate of U1s and O1s in each household type into dynamic incidence rates in each age category, ${\mathcal{I}}_{a}(t)$. By assuming that all O1s had been infected at least once we could use previously published agedependent hospitalisation odds per infection ${h}_{a}$ (Kinyanjui et al., 2015 and appendix 3) to determine the cumulative hospitalisations predicted by the model for each age category $a$ and week interval ${w}_{i}=({t}_{i,1},{t}_{i,2})$,
Parameter inference
Request a detailed protocolThe majority of the parameters for the RSV transmission model were drawn from the RSV literature (see Table 2 and appendix 3) leaving four parameters, and the five hyperparameters of a normal distribution describing the random yearly variation in logseasonality, to be inferred from hospitalisation data (see Table 3 for parameter estimates and appendix 1 for further details on seasonality model). The free parameters and distribution of the RSV transmission model were:
Community infection rate outside the household between U1s and all others in the community accessing KCH (${b}_{U1}$).
Community infection rate outside the household among all O1s in community (${b}_{O1}$).
Infectious contact rate within the household to all other household members ($\tau $).
Mean duration of maternally derived immunity to RSV ($M$).
The joint normal distribution of the yearly logseasonality amplitude and phase ($[\xi ,\varphi ]\sim \mathcal{N}(\mathit{\bm{\mu}},\mathbf{\mathbf{\Sigma}}$)).
We also included an infectious contact rate for children of schooling age (5–18 years old; ${b}_{S}$) which acted additionally to ${b}_{O1}$; that is children of schooling age were at additional risk of contracting RSV on top of the risk due to mixing in the community. This meant that the mixing matrix in Equation (5) was in block form,
where the blocks represented respectively underone age categories, overones at school age categories and overones above school age categories. Unfortunately, we were unable to reliably identify ${b}_{S}$ parameter jointly with the other parameters. Investigating a range of ${b}_{S}$ values gave similar results for model fit and predictions for vaccine efficacy, the results in the main paper were for the highest value of ${b}_{S}$ considered which was mildly pessimistic compared to ${b}_{S}=0$ (see appendix 3).
The data for parameter inference was RSVconfirmed, agespecific weekly admissions to Kilifi county hospital (KCH) hospitalisation data from September 2001 until September 2016 (see Nokes et al., 2009 for study details). KCH serves as the primary care facility for the KHDSS population, and we assumed that all KHDSS members who accessed urgent hospital treatment due to RSV disease accessed their treatment at KCH. However, a significant number of admissions were from people not within the KHDSS survey leading to data rescaling (see appendix 3). The loglikelihood for a particular simulation corresponded to Poisson errors,
where ${\mathcal{D}}_{i,a}$ was the cumulative number of hospitalisation observed at KCH in age category $a$ on week ${w}_{i}$ and ${f}_{poi}(x\mu )$ is the probability mass function for a Poisson distribution with mean µ.
If the yearly realisations of the random seasonality (see appendix 1) were known, then the entire model would be deterministic and $\mathrm{ln}\mathcal{L}$ would be a function of the unknown parameters. Therefore, we treated the yearly variation in seasonality as missing data and used the Expectationmaximisation (EM) algorithm (Dempster et al., 1977) to converge onto maximum likelihood estimates for the four free parameters, and the two hyperparameters of the logseasonality model, 95% confidence intervals were constructed using the likelihood profile technique (e.g. King et al., 2008 and appendix 3).
Modelling vaccination
Request a detailed protocolThere were two vaccines used in this modelling study, which were deployed as part of the prenatal contact between pregnant women and skilled health professionals. We assumed that the maternal vaccine was delivered as one injection to the pregnant women in her third trimester. This achieved some unknown additional period of maternal protection, $P$ days, on top of the random period $M$, that is after maternally vaccinating the period of protection became ${M}_{vac}=M+P$. Achieving an effective maternal vaccination coverage of ${V}_{cov}$ shifted the mean susceptibility of U1s to ${\overline{\sigma}}_{U1}=\mathbb{P}({M}_{vac}<A){V}_{cov}+\mathbb{P}(M<A)(1{V}_{cov})$, a linear increase in ${V}_{cov}$. The change in distribution of age at infection was nonlinear in ${V}_{cov}$ because, conditional on an U1 being infected, it was more likely that the U1’s mother had not been vaccinated than the unconditional probability of nonvaccination, $1{V}_{cov}$ (see appendix 4). We also assumed that there was a vaccine available that provoked an immune response in the vaccinated individuals similar to a natural infection; that is a susceptible $O1$ who is vaccinated immediately becomes ‘recovered’ and immune to RSV infection until her immunity waned. Immune response provoking vaccination was offered to all O1s in households when a birth occurred, as an addendum to the prenatal contact between motherstobe and health professionals. In principle, there were three dimensions to the coverage of the immunity provoking vaccine: (i) coverage of households, (ii) coverage within households, and (iii) vaccine effectiveness. For simplicity, we bundled these dimensions together, and vaccinated whole households at an effective vaccination coverage (the product of the three dimensions of coverage). Over 10 years of forecasted RSV epidemics if a MAB vaccine was available, and given to every pregnant mother, 8601 MAB vaccines were deployed each year. 0–24,095 IRP vaccines were deployed each year depending on household coverage. It should be noted that by 2016 the KHDSS population was around 240,000 people, hence 100% effective coverage of the households where births occurred corresponded to ~10% effective coverage of the total population.
Model simulations
Request a detailed protocolWe simulated the model by numerically solving the high dimensional ODE [Equation (3)] simultaneously with the ongoing cumulative hospitalisations in each age category, ${\dot{\mathscr{H}}}_{a}={h}_{a}{\mathcal{I}}_{a}(t)$, which allowed us to solve for the model predicted weekly hospitalisations [Equation 7]. The initial state of the model was unknown. We initialised the model by starting with a completely susceptible population with the population demography set to mimic that of the KHDSS on 1st Jan 2000. We then simulated RSV transmission for 10 years, with demographic rates (e.g. birth rates) chosen to match those of KHDSS in year 2000 and the seasonal amplitude and phase of $\mathrm{ln}\beta $ set to their latest mean estimate, in order to provide an initial state of the household model. Finally, we ran the model from 1st Jan 2000 until 1st September 2001. This provided the initial point for comparison to hospitalisation data. Numerical solutions were provided using the Sundials CVODE solver (Cohen et al., 1996) implemented within the DifferentialEquations package for Julia 0.6 (Rackauckas and Nie, 2017). For retrospective simulations comparing model predictions to data (Figure 2), we used the most probable values of the yearly seasonality. For forecast simulations, we generated 500 realisations of yearly seasonality over 10 years from the distribution inferred in model inference, this gave 500 predictions for the time series of future hospitalisations. We typically presented medians of these predictions (e.g. Figure 4). The code for the RSV household model used in this paper, and the data used for parameter inference, is available from https://github.com/SamuelBrand1/RSVHouseholdModel (Brand, 2020; copy archived at https://github.com/elifesciencespublications/RSVHouseholdModel).
Appendix 1
Modelling seasonality in RSV transmission among KHDSS
RSV is a seasonal virus, in temperate climates the peak month for RSV incidence tends to be consistent yearonyear. Therefore, modelling approaches aimed at understanding RSV transmission in temperate climates have used an annually periodic deterministic function, with the timing of peak infectiousness of RSV being either a model parameter (Yamin et al., 2016) or itself a function of climatic variable to be fitted using regression methods (Pitzer et al., 2015).
The seasonal drivers of RSV transmission in the tropics are less clear (Paynter, 2015). At KCH the most common trough month for RSV hospitalisations was September, which lead us to define the RSV ‘year’ as September  September. The most common month for peak hospitalisation in each RSV year was January, however there was significant variation in peak month between RSV seasons with peaks occurring in each month November  April between 2002 and 2016 (Appendix 1—figure 1).
The yearonyear variation in peak month for RSV hospitalisation means that naively inferring a single fixed peak infectiousness parameter would not be a successful inference strategy. However, determining the precise mechanistic reason for shifting seasonality was challenging for the KHDSS population. RSV has been positively associated with the rainy season in some tropical settings (Paynter et al., 2013; Paynter, 2015); however, this is not obviously the case in Kilifi county where the rainy season is April to June with short rains October to December. There have been many proposed mechanisms for erratic periodicity in transmission (for a wide variety of infectious pathogens) which could be relevant to RSV transmission in Kilifi, for example, dynamical attractor switching (Keeling et al., 2001), or the effect of species/strain interaction (Bhattacharyya et al., 2018). In particular, strain competition between RSV A and RSV B has been identified a mechanism for generating complex seasonal dynamics (White et al., 2005).
In this paper, we took an agnostic view and rather than choosing a mechanistic hypothesis for erratic seasonality from the many possible, we assume that the timevarying infectiousness of RSV alters randomly (but from a common distribution) year to year:
where the RSV infectiousness (${\xi}_{n}$) and seasonal peak timing (${\varphi}_{n}$) for each RSV year $n$ are drawn jointly from a normal distribution common to each year $({\xi}_{n},{\varphi}_{n})\sim \mathcal{N}(\mathit{\bm{\mu}},\mathbf{\mathbf{\Sigma}})$. During model inference the yearly ${\xi}_{n}$ and ${\varphi}_{n}$ realisations are treated as latent variables; their mean and covariance matrix are imputed along with other model parameters.
Appendix 2
Household and agestructured RSV transmission model details
As described briefly in the main text, we developed a dynamic model for simulating the spread of RSV through the KHDSS population. The model was a hybrid between a mechanistic ODE approach, this included detailed household structure but only a simplified set of ageanddisease states for individuals within the households, and a datadriven empirical model, this used the observed joint distributions of KHDSS individuals’ household occupancy and ages to generate conditional predications of individual detail beyond that of the mechanistic part of the model.
Brief comparison to agestructured RSV transmission models
A commonly used conceptual framework for modelling epidemic transmission with a population is the compartmental model (Anderson and May, 1991; Keeling and Rohani, 2008); each person’s disease state is described as being one of a finite number of possibilities, for example susceptible, infectious, recovered, which define that person’s risk of contracting the infectious pathogen or transmissibility whilst infected with the pathogen. Additionally, it is usually important to capture the heterogeneity of the population, also called the population structure, in contrast to unstructured populations where every individual is treated as interchangeable. Therefore, each person will be described by their position in the population with sufficient detail that a rate of contact can be modelled between any pairs of individuals, see Diekmann and Heesterbeek for a more detailed discussion on modelling population structure (Diekmann and Heesterbeek, 2000). RSV transmission models have most commonly used age structure to describe heterogeneity in the population; each individual is described jointly by their disease state and which age interval (from some predetermined set of intervals) they occupy (Pitzer et al., 2015; Kinyanjui et al., 2015; Yamin et al., 2016). For agestructured RSV transmission models, there are two dynamical elements: the transmission of disease and the demographic turnover of the population (births, deaths and ageing). At the level of the individual these are modelled as discrete random events occurring at some percapita rate (Rock et al., 2014). However, for large populations, there will be a very large number of individuals in each ageanddisease state, and the flux of population density in each ageanddisease state converges in probability onto the solution of a set of ordinary differential equations (ODEs) as the population size is treated as converging to infinite size (Kurtz, 1970; Kurtz, 1971; Diekmann and Heesterbeek, 2000). The limiting ODE model has as many degrees of freedom as there are ageanddisease state combinations in the epidemic model. In most epidemic modelling studies, it is the deterministic evolution of the solution to these ODEs that is usually given as the transmission model description.
In this paper, the essential modelling concept was to shift the focus away from numbers of individuals in each ageanddisease state and towards the number of households in each possible household configuration. A household configuration describes the number of individuals in each ageanddisease state who cohabit within a single household. Including households within the model adds a potentially relevant layer of realism; the social contacts within a household are persistent, therefore pairs of individuals that cohabit will repeatedly have the opportunity to infect one another if RSV enters the household but be relatively cocooned from infection if RSV has not entered the household. Agestructured transmission models implicitly assume that no two individuals contact one another more than once. To see this consider a population size of $N$; the rate of any individual contacting another single individual is $\mathcal{O}(1/N)$ therefore the probability that an individual selects the same other individual twice for contact over any finite time horizon goes to zero as $N\to \mathrm{\infty}$ (which is also the limit at which the ODE model is valid). For household models the discrete random events that change the state of individuals (infection, death etc.) also change the household configuration. When the number of households is very large, there will be a large number of households in each possible household configuration and, as with agestructured models, there is convergence onto a set of ODEs with as many degrees of freedom as the number of possible household configurations.
The possible household configurations, or state space, of a household and agestructured RSV transmission model is considerably larger than it would be for the equivalent agestructured model. If there are $m$ possible ageanddisease states then the number of possible household configurations for a household of size $n$ is given by a standard combinatorial identity, $\left(\genfrac{}{}{0pt}{}{n+m1}{n}\right)$. In this paper, we consider a range of household sizes up to a maximum size ${n}_{max}$, therefore the number of household configurations was,
The number of possible household configurations grows very rapidly (Appendix 2—figure 1). Therefore, having a sufficiently large ${n}_{max}$ to capture the target population required using a relatively simple compartmental ageanddisease state model for RSV infection.
Derivations for equilibrium behaviour of unstructured RSV transmission models
The ageandhousehold structured model we used in the main paper to make predictions of potential vaccine effectiveness in a population with persistent social structure. However, it can be useful to compare comparatively complex simulation studies to simpler models which are at least partially analytically tractable; this comparison identifies which features of a model are generic as opposed to emerging from more complicated factors (like seasonality or social structure).
A simple unstructured compartmental model of RSV transmission with two types of vaccine in a population of size $N$ was presented in the main paper (Box 1). Individuals are born into the population at rate $B$ and are initially protected against RSV by maternal antibodies ($M$). All individuals die at rate µ. They lose maternal protection at rate ${\alpha}_{vac}$ (the rate associated with the maternal vaccine) and become susceptible to RSV infection ($S$). Each susceptible is infected at a rate $\beta I/N$ where $\beta $ is the product of the contact rate and the probability of transmission per contact and $I$ is the number of infected individuals in the population. Infected individuals clear their infection and become recovered and are temporarily immune to reinfection ($R$) at rate $\gamma $. Recovered individuals lose their temporary immunity to reinfection at rate $\nu $. A vaccine aimed at provoking an immune response akin to a natural infection (IRP vaccine) is also used to control RSV. This is given to individuals in the population at effective rate $V$ (rate of delivery times probability the vaccine dose is successful). For simplicity, we assume that the IRP vaccine is not given to children so young they are likely to be in the $M$compartment, but their isn’t memory of which individuals have been vaccinated recently, therefore the chance that an individual selected for vaccination is actually susceptible is $S/(S+I+R)$. If a susceptible individual is vaccinated she transitions to becoming temporarily immune to RSV, this temporary immunity being lost at rate $\nu $.
The ODE equations for the dynamics of the basic unstructured model are:
We solve for the equilibrium state of this simple model, denoted $({M}^{*},{S}^{*},{I}^{*},{R}^{*})$, assuming that the population has reached a steady size of $N$, with replacement birth rate $B=\mu N$. For the simple RSV model, we use a mortality rate µ that corresponds to a life expectancy of 65 years, the Kenyan average. The reproductive ratio for the model is ${R}_{0}=\beta /(\gamma +\mu )$.
Since, the rate of loss of maternal immunity is fast compared to the mortality $({\alpha}_{vac}\gg \mu )$ nearly all the population survive their $M$ period and become available for infection,
We use ${S}^{*}+{I}^{*}+{R}^{*}=N$ below to simplify the notation, but $N$ could be replaced with ${N}_{eff}=\frac{{\alpha}_{vac}}{{\alpha}_{vac}+\mu}N$. Note that the maternal vaccine does not alter the incidence rate for the simple RSV model at equilibrium, it simply delays the typical infection time. Equation (13) implies that either ${I}^{*}=0$ (disease free state), or,
Therefore,
Combining Equations (12), (15), (16), (17) gives that if RSV is endemic then,
Equation (18) implies that for the simple RSV model the critical rate at which an IRP vaccine eliminates RSV is ${V}_{c}=(\mu +\nu )N({R}_{0}1)$.
At an endemic equilibrium, the RSV incidence rate with vaccination rate $V$, denoted ${\iota}_{V}^{*}$, is therefore,
Equation (19) implies the two results which are presented in Box 1 of the main text:
The relative reduction in incidence due to IRP vaccination compared to no vaccination is,
 (20) $\frac{{\iota}_{0}^{*}{\iota}_{V}^{*}}{{\iota}_{0}^{*}}=\mathrm{min}\{\frac{V}{N(\mu +\nu )({R}_{0}1)},1\}.$
In this paper, we model a scenario where cohabitants of newborn children each receive an IRP vaccine. This fixes $V$ to be proportional to the birth rate, $V=\mu N\u27e8H\u27e9{V}_{cov}$, where $\u27e8H\u27e9$ is the average number of cohabitants that a newborn has and ${V}_{cov}$ is the effective IRP coverage of households. This gives,
 (21) $\mathrm{R}\mathrm{e}\mathrm{l}\mathrm{a}\mathrm{t}\mathrm{i}\mathrm{v}\mathrm{e}\text{}\mathrm{r}\mathrm{e}\mathrm{d}\mathrm{u}\mathrm{c}\mathrm{t}\mathrm{i}\mathrm{o}\mathrm{n}\text{}\mathrm{i}\mathrm{n}\text{}\mathrm{t}\mathrm{r}\mathrm{a}\mathrm{n}\mathrm{s}\mathrm{m}\mathrm{i}\mathrm{s}\mathrm{s}\mathrm{i}\mathrm{o}\mathrm{n}\text{}\mathrm{d}\mathrm{u}\mathrm{e}\text{}\mathrm{t}\mathrm{o}\text{}\mathrm{v}\mathrm{a}\mathrm{c}\mathrm{c}\mathrm{i}\mathrm{n}\mathrm{a}\mathrm{t}\mathrm{i}\mathrm{o}\mathrm{n}=min\{\frac{\mu \u27e8H\u27e9{V}_{cov}}{(\mu +\nu )({R}_{0}1)},1\}.$
Whilst RSV is not eliminated the reduction in incidence rate due to IRP vaccination is linear in $V$, with the improvement per extra vaccine used being a constant
 (22) $\mathrm{R}\mathrm{e}\mathrm{d}\mathrm{u}\mathrm{c}\mathrm{t}\mathrm{i}\mathrm{o}\mathrm{n}\text{}\mathrm{i}\mathrm{n}\text{}\mathrm{t}\mathrm{r}\mathrm{a}\mathrm{n}\mathrm{s}\mathrm{m}\mathrm{i}\mathrm{s}\mathrm{s}\mathrm{i}\mathrm{o}\mathrm{n}\text{}\mathrm{p}\mathrm{e}\mathrm{r}\text{}\mathrm{I}\mathrm{R}\mathrm{P}\text{}\mathrm{v}\mathrm{a}\mathrm{c}\mathrm{c}\mathrm{i}\mathrm{n}\mathrm{e}=\frac{(\gamma +\mu )}{(\gamma +\mu +\nu ){R}_{0}}.$
The mean number of overone year olds living in households with at least one underone year old in the KHDSS (see below) fluctuated yearly, but was never greater than five ($\u27e8H\u27e9<5$). Therefore, using a reversion to susceptibility rate $\nu =2$ per year (see Table 2) with Equation (21) suggests that if, say, ${R}_{0}=2$ then the maximum achievable relative reduction in RSV incidence using this strategy with a Kilifi like population implied by the simple RSV model is 3.8%.
Ageanddisease states for the household model
A literature review of mechanistic RSV transmission models revealed a number of critical common features:
At birth newborns are born protected against RSV infection due to antibodies gained from their mother via transplacental transfer. This is typically modelled as a maternally protected disease state $M$ e.g. (Yamin et al., 2016).
The probability of developing severe disease and being hospitalised depends on a person’s age, and number of times infected in the past, e.g. (Kinyanjui et al., 2015).
The susceptibility to RSV infection per infectious contact, their infectiousness after infection, and the expected time taken to become recovered from RSV depend on number of times previously infected, e.g. (Kinyanjui et al., 2015).
The high dimensionality of household and agestructured models necessitated using the most minimal ageanddisease state model possible for RSV (see above). To do this we use an extremely parsimonious approach. The possible ageanddisease state for individuals are: susceptible or maternally protected and under the age of one (${S}_{1}$), infectious and under the age of one (${I}_{1}$), recovered and under the age of one (${R}_{1}$), susceptible and over the age of one (${S}_{2}$), infectious and over the age of one (${I}_{2}$) and recovered and over the age of one (${R}_{2}$). An underone year old (U1) experiencing some force of infection $\lambda $ becomes infected (${S}_{1}\to {I}_{1}$) and infectious to RSV at a rate ${\sigma}_{U1}\lambda $ where ${\sigma}_{U1}$ is the average susceptibility of an U1 year old to RSV. After becoming infected the U1 ceases to become infectious at a rate ${\gamma}_{1}$ (${I}_{1}\to {R}_{1}$) and then is immune to reinfection to RSV for a period of time. The immunity derived from natural infection is lost at a rate $\nu $, and the U1 revert to susceptibility but in the ${S}_{2}$ category (${R}_{1}\to {S}_{2}$). The reason we transition recovered U1s to a susceptible overone year old (O1) is that due to the seasonality of RSV it is very rare for a person to be infected more than once in one epidemic season, therefore functionally by the time an individual is facing the risk of their second RSV lifetime infection they will very likely be over one. All U1s age at the rate $\eta =1/365.25$ days_{1} becoming individuals in the same disease state but overone (${S}_{1}\to {S}_{2}$, ${I}_{1}\to {I}_{2}$, ${R}_{1}\to {R}_{2}$). An O1 individual experiencing a force of infection $\lambda $ becomes infected and infectious (${S}_{2}\to {I}_{2}$) with RSV at a rate ${\sigma}_{O1}\lambda $ where ${\sigma}_{O1}$ is the relative susceptibility of O1s compared to an U1 no longer protected by maternal antibodies. Infectious O1s cease being infectious (${I}_{2}\to {R}_{2}$) at a faster rate than U1s, ${\gamma}_{2}>{\gamma}_{1}$, but revert to susceptibility (${R}_{2}\to {S}_{2}$) at the same rate $\nu $ (Appendix 2—figure 2).
As mentioned in the main document we relate this simple ageanddisease state model to more complicated RSV models by (i) using the conditional age distribution of individuals to address questions that required a more complicated age structure than a simple under/overone binary choice, for example whether susceptible under ones were still protected by maternal antibodies, and (ii) by assuming that all overones have been infected at least once and all susceptible U1s have never been infected and might still be protected by maternal antibodies.
Household and agestructured model dynamics
A household configuration is a tuple of the number of individuals in each ageanddisease state who cohabit a household. The generic household configuration is denoted $h=({s}_{1},{i}_{1},{r}_{1},{s}_{2},{i}_{2},{r}_{2})$, indicating that the household has precisely ${s}_{1}$ individuals in state ${S}_{1}$, ${i}_{1}$ individuals in state ${I}_{1}$ etc. The household size is the number of people living in the household (i.e. ${s}_{1}+{i}_{1}+{r}_{1}+{s}_{2}+{i}_{2}+{r}_{2}$). We denote the space of possible household configurations $\mathrm{\Sigma}$ and number of households in the state $h$ at time $t$ as ${H}_{h}(t)$. It is useful to consider a vector quantity over all possible household configurations such as $\mathit{\bm{H}}(t)=({H}_{h}(t)h\in \mathrm{\Sigma})$ where we have generated some ordering for elements $h\in \mathrm{\Sigma}$. It is clear that the knowledge of $(\mathit{\bm{H}}(t),t\ge 0)$ would allow us to reconstruct the dynamics of individuals. For example, using the function $f(h)={s}_{1}$ for each $h\in \mathrm{\Sigma}$ in a vectorised form $\mathit{\bm{f}}=(f(h)h\in \mathrm{\Sigma})$ allows us to track the dynamics of numbers of individuals: $(f\cdot \mathbf{H}(t),t\ge 0)$.
As mentioned above, agestructured models are constructed by considering the per capita rate of events affecting the state of individuals. Household and agestructured models are constructed by considering the per household rate of events that affect the household configuration (see House and Keeling, 2009 for further mathematical details). In the following we list the events that change the household model divided into three groups: events due to transmission within the household, events due to transmission between households and events due to demographic turnover.
Events due to RSV transmission within the household
Infection of susceptibles from within the household:
 (23) $\displaystyle \mathrm{F}\mathrm{o}\mathrm{r}\phantom{\rule{thinmathspace}{0ex}}\mathrm{U}1\mathrm{s}:\text{}[{\mathrm{s}}_{1},{\mathrm{i}}_{1},{\mathrm{r}}_{1},{\mathrm{s}}_{2},{\mathrm{i}}_{2},{\mathrm{r}}_{2}]\to [{\mathrm{s}}_{1}1,{\mathrm{i}}_{1}+1,{\mathrm{r}}_{1},{\mathrm{s}}_{2},{\mathrm{i}}_{2},{\mathrm{r}}_{2}]\text{}\text{}\mathrm{a}\mathrm{t}\phantom{\rule{thinmathspace}{0ex}}\mathrm{r}\mathrm{a}\mathrm{t}\mathrm{e}:\text{}{\sigma}_{\mathrm{U}1}\beta (\mathrm{t})\tau {\mathrm{s}}_{1}({\mathrm{i}}_{1}+{\iota}_{2}{\mathrm{i}}_{2}),$
 (24) $\displaystyle \mathrm{F}\mathrm{o}\mathrm{r}\phantom{\rule{thinmathspace}{0ex}}\mathrm{O}1\mathrm{s}:\text{}[{\mathrm{s}}_{1},{\mathrm{i}}_{1},{\mathrm{r}}_{1},{\mathrm{s}}_{2},{\mathrm{i}}_{2},{\mathrm{r}}_{2}]\to [{\mathrm{s}}_{1},{\mathrm{i}}_{1},{\mathrm{r}}_{1},{\mathrm{s}}_{2}1,{\mathrm{i}}_{2}+1,{\mathrm{r}}_{2}]\text{}\text{}\mathrm{a}\mathrm{t}\phantom{\rule{thinmathspace}{0ex}}\mathrm{r}\mathrm{a}\mathrm{t}\mathrm{e}:\text{}{\sigma}_{\mathrm{O}1}\beta (\mathrm{t})\tau {\mathrm{s}}_{2}({\mathrm{i}}_{1}+{\iota}_{2}{\mathrm{i}}_{2}).$
$\tau $ is the household infection rate, τ2 is the reduction in infectiousness due to being an O1, $\beta (t)$ is the seasonally varying component to the transmission rate and ${\sigma}_{O1}$ is the reduction in susceptibility due to being O1. Note that the true infection rate for U1s is ${\sigma}_{U1}{\lambda}_{hh}$ and for O1s is ${\sigma}_{O1}{\lambda}_{hh}$ as defined in main text. ${\sigma}_{U1}$ is the probability that an U1 individual is no longer protected by maternal antibodies, calculated by integrating over the individuals conditional age distribution as follows. Maternal protection was assumed to be 100% effective but only for a random duration per newborn of $M$ days, therefore using the uniform age distribution conditional on the individual being under one years old (see above),
 (25) ${\sigma}_{U1}={\displaystyle \frac{1}{T}}{\displaystyle {\int}_{0}^{T}}\mathbb{P}(M\le a)\mathrm{d}a.$
where $T$ is the duration of a year expressed in the units of the simulation (we used days so $T=365.25$ days). The probabilistic model for the duration of maternal protection was $P\sim \mathrm{exp}(\alpha )M\le T$ days , where $\alpha $ is the waning maternal immunity rate. The distribution function for $M$ is
 (26) $\mathbb{P}(M\le a)=\{\begin{array}{cc}\hfill (1\mathrm{exp}(a/\overline{M}))/(1\mathrm{exp}(T/\overline{M}))\hfill & \hfill 0\le a\le T\hfill \\ \hfill 1\hfill & \hfill \text{otherwise}\hfill \end{array}$
where $\overline{M}=1/\alpha $ is the mean period of maternal protection without conditioning on $M\le T$, the true mean period of protection is $\mathbb{E}[M]=\overline{M}T/({e}^{T/\overline{M}}1)$ but this turns out to be a very small correction to $\overline{M}$ since we fit to $\overline{M}$ being less than 30 days (see below), therefore for simplicity we call $\overline{M}$ the mean duration of maternal protection to RSV. Substituting into Equation (25) and direct integration gives,
 (27) ${\sigma}_{U1}={\displaystyle \frac{1}{1{e}^{T/\overline{M}}}}{\displaystyle \frac{\overline{M}}{T}}.$
Note that ${\sigma}_{U1}\approx 1\overline{M}/T$ when $\overline{M}\ll T$.
Recovery of infecteds:
 (28) $\displaystyle \mathrm{F}\mathrm{o}\mathrm{r}\phantom{\rule{thinmathspace}{0ex}}\mathrm{U}1\mathrm{s}:\text{}[{\mathrm{s}}_{1},{\mathrm{i}}_{1},{\mathrm{r}}_{1},{\mathrm{s}}_{2},{\mathrm{i}}_{2},{\mathrm{r}}_{2}]\to [{\mathrm{s}}_{1},{\mathrm{i}}_{1}1,{\mathrm{r}}_{1}+1,{\mathrm{s}}_{2},{\mathrm{i}}_{2},{\mathrm{r}}_{2}]\text{}\text{}\mathrm{a}\mathrm{t}\phantom{\rule{thinmathspace}{0ex}}\mathrm{r}\mathrm{a}\mathrm{t}\mathrm{e}:\text{}{\gamma}_{1}{\mathrm{i}}_{1},$
 (29) $\displaystyle \mathrm{F}\mathrm{o}\mathrm{r}\phantom{\rule{thinmathspace}{0ex}}\mathrm{O}1\mathrm{s}:\text{}[{\mathrm{s}}_{1},{\mathrm{i}}_{1},{\mathrm{r}}_{1},{\mathrm{s}}_{2},{\mathrm{i}}_{2},{\mathrm{r}}_{2}]\to [{\mathrm{s}}_{1},{\mathrm{i}}_{1},{\mathrm{r}}_{1},{\mathrm{s}}_{2},{\mathrm{i}}_{2}1,{\mathrm{r}}_{2}+1]\text{}\text{}\mathrm{a}\mathrm{t}\phantom{\rule{thinmathspace}{0ex}}\mathrm{r}\mathrm{a}\mathrm{t}\mathrm{e}:\text{}{\gamma}_{2}{\mathrm{i}}_{2}.$
Where ${\gamma}_{1}$ and ${\gamma}_{2}$ are the recovery rates of U1s and O1s.
Reversion to susceptibility:
 (30) $\displaystyle \mathrm{F}\mathrm{o}\mathrm{r}\phantom{\rule{thinmathspace}{0ex}}\mathrm{U}1\mathrm{s}:\text{}[{\mathrm{s}}_{1},{\mathrm{i}}_{1},{\mathrm{r}}_{1},{\mathrm{s}}_{2},{\mathrm{i}}_{2},{\mathrm{r}}_{2}]\to [{\mathrm{s}}_{1},{\mathrm{i}}_{1},{\mathrm{r}}_{1}1,{\mathrm{s}}_{2}+1,{\mathrm{i}}_{2},{\mathrm{r}}_{2}]\text{}\text{}\mathrm{a}\mathrm{t}\phantom{\rule{thinmathspace}{0ex}}\mathrm{r}\mathrm{a}\mathrm{t}\mathrm{e}:\text{}\nu {\mathrm{r}}_{1},$
 (31) $\displaystyle \mathrm{F}\mathrm{o}\mathrm{r}\phantom{\rule{thinmathspace}{0ex}}\mathrm{O}1\mathrm{s}:\text{}[{\mathrm{s}}_{1},{\mathrm{i}}_{1},{\mathrm{r}}_{1},{\mathrm{s}}_{2},{\mathrm{i}}_{2},{\mathrm{r}}_{2}]\to [{\mathrm{s}}_{1},{\mathrm{i}}_{1},{\mathrm{r}}_{1},{\mathrm{s}}_{2}+1,{\mathrm{i}}_{2},{\mathrm{r}}_{2}1]\text{}\text{}\mathrm{a}\mathrm{t}\phantom{\rule{thinmathspace}{0ex}}\mathrm{r}\mathrm{a}\mathrm{t}\mathrm{e}:\text{}\nu {\mathrm{r}}_{2}.$
Where v is the reversion to susceptibility/waning immunity rate.
Events due to RSV transmission from without the household
In a purely agestructured transmission model, the number of RSV infecteds in each age category, $\mathit{\bm{I}}(t)={({I}_{a}(t))}_{a\in \mathcal{A}}$, is a dynamic model variable which evolves according to a set of ODEs. For the household and agestructured model we derived $\mathit{\bm{I}}(t)$ from the household configuration dynamics and the conditional age distributions as the expected number of infecteds in each category given the distribution of household configurations $\mathit{\bm{H}}(t)$. Note that knowing a household configuration specifies both the household size $n={s}_{1}+{i}_{1}+{r}_{1}+{s}_{2}+{i}_{2}+{r}_{2}$ and the underone occupant boolean $U=\mathrm{\U0001d7cf}({s}_{1}+{i}_{1}+{r}_{1}>0)$. Therefore, we could define a $\mathcal{A}\times \mathrm{\Sigma}$ conversion matrix to convert between the dynamic $\mathit{\bm{H}}(t)$ variables into the implied $\mathit{\bm{I}}(t)$ variables,
The agedependent force of infection on each individual in age category $a$, ${\lambda}_{age}(a)$ depends on a community age mixing matrix $T={(T(a,b))}_{a\in \mathcal{A},b\in \mathcal{A}}$,
where $N(t)$ is the total population size at time $t$. This is a standard formulation for force of infection between different age groups (see Keeling and Rohani, 2008). In principle any agemixing matrix can be used as $T$; however, we use a simple matrix in block form that differentiated only between U1s, O1s of school age, and all other O1s (see main text). The force of infection on U1 and O1 individuals within households was calculated using a $\mathrm{\Sigma}\times \mathcal{A}$ conversion matrix, and a small force of infection from outside the KHDSS was added, $\u03f5$,
The external infection event changes the household configuration:
Infection of susceptibles from outside the household:
Events due to demographic change in the population
In the householdandagestructured RSV model, we track demographic change both by using the yearly updated joint distributions of age and household size and by the dynamics of the household configurations $\mathit{\bm{H}}(t)$. The number of households of each size $n$ changed over time due to the effect of people leaving home, births, deaths, outmigration from KHDSS and inmigration into KHDSS. Moreover, the mean number of U1s per household of each size evolved over time. Rather than track all the possible events that change the demography of the KHDSS, we focus on (i) the ageing of the U1s becoming O1s, (ii) capturing the household size dependent birth rate, and (iii) capturing the change in household numbers for each household size.
The recorded birth rate that can be inferred from the KHDSS data set included newborns who outmigrate, neglected newborns that inmigrate at a very young age, and obviously some newborns die whilst very young. As mentioned above, we did not mechanistically track every possible demographic event, but instead calculated the effective birth rate that arrived at the correct mean number of U1s for each household size. For simplicity, we assumed that the effective birth rate was a turnover rate for households; that is each birth is associated with a percapita rate of an O1 leaving the household. This arrived at the correct density of U1s in the population, and in each size group of households, at the cost of assuming that events occurred at the same time rather than at the same rate.
The number of households of each size changed over time as the overall population size changed and individuals left households in order to form new households. As with the demographic turnover rate, there were multiple different mechanisms whereby new individuals entered the population and formed new houses or individuals and groups left the population, for example whole groups arrived and formed a new house, individuals arrived and joined houses etc. Moreover, the RSV infection status of the new entrants to the population were unknown. We assumed that new entrants arrived as households with the same distribution of household configurations as already observed in the population; that is that new arrivals didn’t have a net effect on the proportion of individuals in each ageanddisease state just by arriving, although obviously as the population grew this has an effect of the number of hospitalisations we expected.
The demographic events that changed the household configurations were:
Aging:
where $\eta =1/T$ is the aging rate at which U1s become O1s. $T$ is the duration of a year expressed in the units of the simulation (we used days so $T=365.25$ days).
Demographic turnover due to births and O1s leaving their household:
If there is at least one O1 left in the household, the birth/turnover rate is zero for households with only 1 O1; that is there are never any households of only U1s. $\mu (n,t)$ is the turnover rate per O1 household member in a household of size $n$ at time $t$ replacing them with susceptible U1s for households of size $n$. The turnover rates for each year were chosen so that the correct density of U1s per household was achieved (approximately). Following is a description of the fitting process so that the turnover rate lead to this household demography:
Collect the empirical distribution of U1s per household size. For each household size $n=1,\mathrm{\dots},{n}_{max}$ we calculated the mean number of U1s per household at y = 1st jan 20002017, this was denoted: ${\overline{N}}_{U1}(n,y)$.
Calculate the implied distribution of U1s per household size for any given birth/turnover rate. For any given birth/turnover rate, µ, the equilibrium probability of finding $k$ U1s in a household of size $n$ is
 (46) $\pi (kn,\mu )\propto {\left({\displaystyle \frac{\mu}{\eta}}\right)}^{k}\left({\displaystyle \genfrac{}{}{0pt}{}{n}{k}}\right)\mathit{}k=0,\mathrm{\dots},n1,$
 (47) $\pi (nn,\mu )=0.$
Equation (46) is just the equilibrium distribution of a birthdeath process (Grimmett and Stirzaker, 2001).
Matching the empirical distribution to the implied distribution. We used a rootfinder to find the turnover rate that matches the simulation’s mean number of U1s per household of each size to the empirical data, for the next year:
 (48) $\displaystyle \mu (n,t)\text{}\mathrm{i}\mathrm{s}\text{}\mathrm{t}\mathrm{h}\mathrm{e}\text{}\mathrm{s}\mathrm{o}\mathrm{l}\mathrm{u}\mathrm{t}\mathrm{i}\mathrm{o}\mathrm{n}\text{}\mathrm{t}\mathrm{o}\sum _{k=0}^{n1}k\pi (kn,\mu (n,t))={\overline{N}}_{U1}(n,y+1)\text{}\mathrm{f}\mathrm{o}\mathrm{r}\text{}\mathrm{a}\mathrm{l}\mathrm{l}\text{}t\text{}\mathrm{i}\mathrm{n}\text{}\mathrm{y}\mathrm{e}\mathrm{a}\mathrm{r}\text{}y.$
Change in number of households due to population flux
where ${\mathrm{\Sigma}}_{n}=\{h=[{s}_{1},{i}_{1},{r}_{1},{s}_{2},{i}_{2},{r}_{2}]{s}_{1}+{i}_{1}+{r}_{1}+{s}_{2}+{i}_{2}+{r}_{2}=n\}$ was the set of household configurations of households of size $n$. $r(n,t)$ was the daily rate of change of number of households of size $n$ interpolated between the empirical distribution dates.
Simulating the model
The model above could in principle have an infinite number of states if the household size was not limited (see above). We chose limits on the household size based on capturing ≈99% of the U1s in the population, and therefore the pathway to them catching RSV. The limits were: (i) no household is bigger than size 10, and (ii) no household has more than 2 U1s. This also covers the big majority of the total numbers of households (see Appendix 2—figure 3). The ${n}_{max}=10$ limit was imposed by initialising the model without households of size $>10$, and setting $r(n,t)=0$ for all $n>10$. The ≤2 U1 limit was imposed by setting the birth/turnover rate to zero for all households with 2 U1s. Putting the limits in reduces the dimensionality of the system to 1926 different household configurations.
Note that the events that either change a household’s configuration or change the number of households described above can be divided into two categories: (Nair et al., 2010) those with rates that only depended on the household’s configuration, e.g. infection within the household, or ageing of U1s, and, (Glezen et al., 1986) those with rates that depended on the configurations of other households, e.g. transmission between households or the rate of change of household numbers. The events in category (Nair et al., 2010) translate to linear dynamics for $\mathit{\bm{H}}(t)$, events in category (Glezen et al., 1986) translate to nonlinear dynamics (House and Keeling, 2008). Overall, the dynamics of $\mathit{\bm{H}}(t)$ obey the semilinear dynamical system,
A_{t} is a matrix which encodes the dynamics of events in category (Nair et al., 2010), ${\mathit{\bm{f}}}_{t}(\mathit{\bm{H}}(t))$ encodes the transmission between households, and ${\mathit{\bm{\rho}}}_{t}(\mathit{\bm{H}}(t))$ encodes the rate of change of numbers of households in each configuration. We initialised the dynamics of Equation (51) by starting with a completely susceptible population on 1 st Jan 1990, allowing RSV to be introduced via the external force of infection and running for 10 years (see main text).
Equation (51) has two properties that are important to note:
The change rate in households of size $n$ is independent of the transmission dynamics:
 (52) ${\partial}_{t}\left({\displaystyle \sum _{h\in {\mathrm{\Sigma}}_{n}}}{H}_{h}(t)\right)=r(n,t),n=1,\mathrm{\dots},10.$
The dynamics of the proportion of households in a given state ${P}_{h}(t)={H}_{h}(t)/{\sum}_{{h}^{\prime}}{H}_{{h}^{\prime}}(t)$ is not directly affected by the change rates (${\mathit{\bm{\rho}}}_{t}$) in households:
 (53) ${\partial}_{t}{\mathit{\bm{P}}}_{t}={A}_{t}{\mathit{\bm{P}}}_{t}+{\displaystyle \frac{{\mathit{\bm{f}}}_{t}\left({\mathit{\bm{H}}}_{t}\right)}{{\sum}_{{h}^{\prime}}{H}_{{h}^{\prime}}(t)}}$
Equations (52) and (53) guarantee the desired modelling features discussed above. Equation (52) gives that the change in the number of households of each size matches the empirical rate of change for each year, we also verified this by numerical solution of Equation (51) (Appendix 2—figure 4). Equation (53) shows that the rate of change of household numbers doesn’t directly effect the proportion of households in any given configuration. We also verified that the number of U1s and O1s was close to their empirical values (Appendix 2—figure 5).
Equation (51) was difficult to solve efficiently because it is both numerically stiff and high dimensional. We numerically solved Equation (51) using the Julia DifferentialEquations package implementation of the CVODE solver, with an efficient Krylov method (GMRES) to solve the implicit timestepping (see main text). We also used the DifferentialEquations efficient event handling which allowed us to change parameters (like the household change rate) at specific times without damaging the performance of the solver, or having to restart simulations.
Appendix 3
Parameters for the household and agestructured RSV transmission model
The parameters for the household and agestructured transmission model were drawn from four sources:
A literature review of infectiousness duration and other epidemiological quantities; main Table 2.
Calculated from the empirical joint distributions (see above Appendix 3—table 1).
Agedependent hospitalisation probability per RSV infection derived from Kinyanjui et al., 2015; Appendix 3—table 2. Hospitalisation probability was the probability that an infected individual would develop severe disease, multiplied by the probability that severely diseased individuals would require hospitalisation. The probability that an infected individual became diseased depended on whether it was the individual’s primary infection episode or not. The underlying data for estimating these probabilities was drawn from cohort studies on RSV disease rates (Ohuma et al., 2012; Nokes et al., 2008). We adapted these probabilities for our model using our assumption that all infected underones were experiencing their first RSV episode, and all overones were experiencing their second or subsequent infection.
Inferred from the KCH hospitalisation data set (see below).
Parameter inference for the household and age model
As mentioned in the main text we used the EM algorithm (Dempster et al., 1977) to estimate parameters for the model. Again, as described in the main text the parameters we chose for inference were:
Infectious contact rate outside the household between U1s and all others in the community accessing KCH (${b}_{U1}$).
Infectious contact rate outside the household among all O1s in community (${b}_{O1}$).
Infectious contact rate within the household ($\tau $).
Rate of loss of maternally derived immunity to RSV ($\alpha $).
The joint normal distribution of the yearly logseasonality amplitude and phase ($[\xi ,\varphi ]\sim \mathcal{N}(\mathit{\bm{\mu}},\mathbf{\mathbf{\Sigma}}$)).
where the community age mixing matrix $T(a,b)$ was in block form:
The loglikelihood for our model [Equation (8) main text] was defined using the incidence rates ${\mathcal{I}}_{a}(t)$ predicted by solving the model. The incidence rate for all the households in the generic household configuration was,
where the household force of infection for the generic household configuration was ${\lambda}_{hh}=\tau ({i}_{1}+{\iota}_{2}{i}_{2})$. We converted the household incidence rate into an age structured incidence rate by using conditional age distributions, and this allowed us to calculate the cumulative hospitalisations in age category $a$, predicted by a given set of parameters and yearly seasonality realisations, in weekly intervals ${w}_{i}=({t}_{i,1},{t}_{i,2})$ using the agedependent hospitalisation rates per infection ${h}_{a}$ (see Table 2)
Here, $K(t)$ is a timevarying scale factor that accounted for the fact that whilst we were modelling RSV infection for the KHDSS population, other individuals were accessing KCH for treatment of RSVinduced severe disease. To fit $K(t)$, we first performed a polynomial regression $R(t)$ against the ratio of KHDSS members using KCH against nonKHDSS members (Appendix 3—figure 1) t = 0 (days) is 22nd April 2002 fitted curve is R(t) = 1.24+ 0.00224 $t$  2.45e6t^{2}+ 9.45e10t^{3}  1.55e13t^{4} + 9.10e18t^{5} for $t<0$, and $R(t)$ took its final value for times after 1st sept 2016. Having fitted the ratio, the scale factor was $K(t)=(1+R(t))/R(t)$, which we derived by assuming that nonresidents were experiencing RSV hospitalisations at proportionally the same rate as residents.
The conditional age category of an U1 who has definitely been infected, where $a=({a}_{0},{a}_{1})$,
An implication of expression (Equation (60)) is that if ${a}_{0}$ and ${a}_{1}$ are both significantly less than $\overline{M}=1/\alpha $ then $\mathbb{P}(A\in aM<A,A\le 1\text{}\mathrm{y}\mathrm{e}\mathrm{a}\mathrm{r})\approx 0$; that is that, although we have assumed that the conditional age of an U1 is distributed evenly over the first year of life, the conditional age distribution of an U1 who has been infected is typically older than $\overline{M}$. This allowed us to extract information for inferring $\alpha $ from the age distribution of hospitalised children at KCH despite only using a crude U1/O1 age distinction in the mechanistic formulation of the householdandage model. The loglikelihood $l(\theta ,\mathit{\bm{\xi}},\mathit{\varphi})$ [Equation(59)] could be determined for a given set of parameters and realisations of the yearly seasonal amplitude and phase by solving the full ODE system numerically [Equation (51)], and thereby also calculating the weekly hospitalisations. $\theta $ represented the model parameters to be inferred, $\mathit{\bm{\xi}}$ and $\mathit{\varphi}$ were the vectors of the seasonal transmission model Equation (10), and ${\mathcal{D}}_{i},a$ was the KCH hospitalisation data for the ith week in the $a$ age category.
The main difficulty in the inference for the unknown parameters $\theta $ was that the actual realisations of $\mathit{\bm{\xi}}$ and $\mathit{\varphi}$ are not observed, therefore $l(\theta ,\mathit{\bm{\xi}},\mathit{\varphi})$ could not be calculated directly. Instead, we use the EM algorithm to converge onto a maximiser of the marginal likelihood, $\mathcal{L}(\theta )=\int \mathbb{P}(\mathcal{D},\mathit{\bm{\xi}},\mathit{\varphi}\theta )d\mathit{\bm{\xi}}d\mathit{\varphi}$. The EM algorithm converges a sequence of parameter estimates ${({\theta}^{(n)})}_{n\ge 0}$ towards a local maximum of the marginal likelihood by alternatively, (1) calculating the expected value of the loglikelihood over the conditional distribution of $\mathit{\bm{\xi}}$ and $\mathit{\varphi}$ given the observed data $\mathcal{D}$ and the current estimate of the parameters, which we dub the $Q$ function [E step], and, (2) finding the parameters which maximised the $Q$ function [M step]. We now give details of how this was implemented for the specific model developed in this paper:
E step: The conditional distribution of $\mathit{\bm{\xi}}$ and $\mathit{\varphi}$ given the $n$th parameter estimate ${\theta}^{(n)}$, from the previous Mstep, and $\mathcal{D}$ could not be calculated in closed form. In principle, this distribution could have estimated numerically (e.g. by using a particle filter method), however, because the household and agestructured RSV transmission model was comparatively slow to integrate (~ 40 secs per simulation) we resorted to saddlepoint integration. Our argument is that because nearly every year has a sharply peaked hospitalisation rate then, given a parameter estimate ${\theta}^{(n)}$, the conditional probability of $(\mathit{\bm{\xi}},\mathit{\varphi})$ should be concentrated around a particular value, making saddlepoint integration an appropriate approximation (see [Hinch, 1991] for further details on saddlepoint integration). Using the saddlepoint approximation, we could solve for the $Q$ function,
The approximation step in Equation (61) is the saddlepoint integration approximation of the average, and the quadratic form is due to our assumption that the seasonal amplitude and phases are distributed jointly normally. Saddlepoint integration is equivalent to assuming that the full mass of the conditional distribution of $(\mathit{\bm{\xi}},\mathit{\varphi})$ was concentrated at its most probable value,
We determined $({\mathit{\bm{\xi}}}^{*},{\mathit{\varphi}}^{*})$ by sequentially optimising Equation (62) over each season by simulating the model repeated and using the NelderMead algorithm implemented within the Optim package for Julia 0.6. Note that saddle point integration has converted solving for the function $Q$ into a regularised maximum likelihood problem where the regularisation was provided by the mean and covariance matrix for logseasonal amplitude and phase derived in the previous M step.
M step: Having constructed the $Q$ function associated with the $n$th parameter iteration [Equation (61)], we maximised $Q$ over $\theta $. The maximum point of $Q$ being ${\theta}^{(n+1)}$ for the next Estep. Maximisation proceeded in three stages:
The maximising values for the mean and covariance matrix of the random seasonal amplitude and phase were given by maximum likelihood using $({\mathit{\bm{\xi}}}^{*},{\mathit{\varphi}}^{*})$ derived in the Estep. This was performed using the fit_mle function provided by the Julia Distributions package.
We performed a global optimisation for $Q$ over a box in parameter space defined by limits $[0,1]$ for transmission parameters and $1/\alpha =\overline{M}\in [10,120]$ days for the inverse rate of loss of maternal immunity. Global optimisation was performed by running 600 iterations of a differential evolution optimiser (Storn and Price, 1997) with 50 agents. The differential evolution optimiser was implemented by the adaptive_de_rand_1_bin_radiuslimited optimiser from the Julia BlackBoxOptim package. The purpose of the global optimisation step was to reduce the dependence on choosing an initial guess about $\theta $ since the whole plausibility space of the parameters was explored at each iteration of the EM algorithm. We called the best performing agent’s parameter set on the $(n+1)$ th step, ${\stackrel{~}{\theta}}^{(n+1)}$.
We used ${\stackrel{~}{\theta}}^{(n+1)}$ as the starting point for a further local optimisation of $Q$ using the NelderMead algorithm implemented by the Julia Optim package. This step provided ${\theta}^{(n+1)}$ for the next Estep.
We iterated EM algorithm until no further improvement in the value of ${Q}^{*}={\mathrm{max}}_{\theta}Q$ was achieved, and then retained ${\theta}^{*}=\mathrm{arg}{\mathrm{max}}_{\theta}Q$ as the maximum likelihood estimator for the parameters. 95% confidence intervals were estimated by using univariate profile likelihood for $Q$; that is varying one parameter at a time whilst keeping others fixed until a ${\chi}^{2}$ region was determined around the maximum of $Q$ (see King et al., 2008 for a description of 95% CIs for dynamical systems).
School mixing scenarios and inference results
We were unable to identify a mixing rate within schools ${b}_{S}$, see Equation (54), therefore we considered four values of ${b}_{S}$ each determined by what a baseline reproductive value for RSV would be if only school children mixed together and the seasonality was just $\beta (t)=1$, ${R}_{S}$, using the simple formula,
These four scenarios were: zero schools transmission (${R}_{S}=0$), low schools transmission (${R}_{S}=0.5$), medium schools transmission (${R}_{S}=1$), and, high schools transmission (${R}_{S}=1.5$). We saw that once maximum likelihood estimation was performed on the free parameters: $\theta =({b}_{U1},{b}_{O1},\tau ,\alpha ,\mathit{\bm{m}},{\mathrm{\Sigma}}_{\xi \varphi})$ the resultant fits to the data were very similar visually (see Appendix 3—figure 2). We noticed that the outcomes of vaccination were also similar for each four scenarios (see below and Figure 1). Therefore, for robustness of conclusion we used the most pessimistic scenario within the main body of the paper, which was high schools transmission ${R}_{S}=1.5$. The maximum likelihood estimates for parameters using the high schools transmission scenario are given in main Appendix 3—table 3, and the maximum likelihood estimates for all scenarios summarised in Appendix 3—figure 3.
Appendix 4
Modelling vaccination in the household and agestructured RSV transmission model
As described in the main paper we modelled the use of two different vaccines: a vaccine deployed to boost the period during which a newborn is protected from RSV by an unknown period $P$ with coverage $V}_{cov$ [MAB vaccine], and a vaccine deployed to O1 household members of the newborn which provokes a period of protection to RSV infection similar to the immunity period of a natural infection at household coverage $H}_{cov$ [IRP vaccine]. Already infected or recovered O1s were not affected by the IRP vaccine. We assumed that the MAB and IRP vaccines were deployed independently, which is useful for gauging potential effectiveness, but unrealistic. In reality, any reason a mothertobe might miss being MAB vaccinated would also be a reason that the household O1s wouldn’t get vaccinated.
The IRP vaccine altered the effective birth events by also provoking transitions to $R}_{2$ state at the point of birth,
Demographic turnover due to births with vaccination:
The MAB vaccine altered both the probability that an U1 is protected, and the age distribution of those who are infected. We denote the random period of time a newborn born to a MAB vaccinated mother is protected from RSV as ${M}_{vac}=M+P$, which has distribution function,
The mean susceptibility of U1s after MAB vaccination has been applied to the population was,
The conditional age category of an U1 who has definitely been infected, where $a=({a}_{0},{a}_{1})$, after MAB vaccine has been deployed at coverage $V}_{cov$ was,
where $\stackrel{~}{M}$ is the random maternal protection duration of a newborn before we observe whether the newborn’s mother had been MAB vaccinated. The function $f(a,P)$ completes Equation (71) by giving the age distribution of U1s who had boosted maternal protection to RSV but was nonetheless infected,
Note that because $\sigma}_{U1,vac$ depended on $V}_{cov$ the age distribution of infected U1s depended on $V}_{cov$ in a nonlinear fashion.
We considered a range of values for $P$ and $H}_{cov$ for each of the schools transmission scenarios; using the maximum likelihood estimators for the inferred parameters for each scenario. In each scenario, at ${V}_{cov}=1$ the median reduction in hospitalisations was similar, although for the high school transmission scenario vaccination was slightly less effective (Appendix 4—figure 1 and Appendix 4—figure 2 colorblindfriendly version ). Therefore, we used this scenario in the main paper as a pessimistic/robust example. As mentioned in main text we simulated 10 years into the future over 500 independent realisations of the random seasonality. Presented are medians of % reduction in hospitalisations at KCH compared to no intervention.
Data availability
All data generated or analysed during this study are included in the manuscript, supporting files or on the cited Github Repository. Source data files have been provided for Figures 26.
References

Genetic relatedness of infecting and reinfecting respiratory syncytial virus strains identified in a birth cohort from rural KenyaThe Journal of Infectious Diseases 206:1532–1541.https://doi.org/10.1093/infdis/jis570

BookInfectious Diseases of Humans: Dynamics and ControlNew York: Oxford University Press.

CostEffectiveness of pertussis vaccination during pregnancy in the united statesAmerican Journal of Epidemiology 183:1159–1170.https://doi.org/10.1093/aje/kwv347

CVODE, A stiff/Nonstiff ODE solver in CComputers in Physics 10:138–143.https://doi.org/10.1063/1.4822377

Maximum likelihood from incomplete data via the EM algorithmJournal of the Royal Statistical Society: Series B (Methodology) 39:1–22.

BookMathematical epidemiology of infectious diseases: model building, analysis, and interpretationWiley.

Safety, tolerability and pharmacokinetics of MEDI8897, an extended Halflife Singledose respiratory syncytial virus prefusion Ftargeting monoclonal antibody administered as a single dose to healthy preterm infantsThe Pediatric Infectious Disease Journal 37:886–892.https://doi.org/10.1097/INF.0000000000001916

“Risk of primary infection and reinfection with respiratory syncytial virus,”American Journal of Diseases of Children 140:543–546.https://doi.org/10.1001/archpedi.1986.02140200053026

Protecting the family to protect the child: vaccination strategy guided by RSV transmission dynamicsJournal of Infectious Diseases 209:1679–1681.https://doi.org/10.1093/infdis/jiu075

BookProbability and Random ProcessesNew York: Oxford University Press.https://doi.org/10.1007/9781475720242_1

Respiratory syncytial virus infections within familiesNew England Journal of Medicine 294:414–419.https://doi.org/10.1056/NEJM197602192940803

Immunity to and frequency of reinfection with respiratory syncytial virusJournal of Infectious Diseases 163:693–698.https://doi.org/10.1093/infdis/163.4.693

Respiratory syncytial virus infections in previously healthy working adultsClinical Infectious Diseases 33:792–796.https://doi.org/10.1086/322657

The burden of respiratory syncytial virus infection in young childrenNew England Journal of Medicine 360:588–598.https://doi.org/10.1056/NEJMoa0804877

Respiratorysyncytialvirus infections, reinfections and immunity A prospective, longitudinal study in young childrenThe New England Journal of Medicine 300:530–534.https://doi.org/10.1056/NEJM197903083001004

Exploring the dynamics of respiratory syncytial virus (RSV) transmission in childrenTheoretical Population Biology 110:78–85.https://doi.org/10.1016/j.tpb.2016.04.003

Deterministic epidemic models with explicit household structureMathematical Biosciences 213:29–39.https://doi.org/10.1016/j.mbs.2008.01.011

Household structure and infectious disease transmissionEpidemiology and Infection 137:654–661.https://doi.org/10.1017/S0950268808001416

Seasonally forced disease dynamics explored as switching between attractorsPhysica D: Nonlinear Phenomena 148:317–335.https://doi.org/10.1016/S01672789(00)001871

Modeling infectious diseases in humans and animalsClinical Infectious Diseases 47:864–865.https://doi.org/10.1086/591197

BookModelling the Transmission Dynamics of RSV and the Impact of Routine VaccinationThe Open University.

ReportKenya Demographic and Health Survey 2014Demographic and Health Surveys Program (DHS).

Solutions of ordinary differential equations as limits of pure jump markov processesJournal of Applied Probability 7:49–58.https://doi.org/10.2307/3212147

Limit theorems for sequences of jump markov processes approximating ordinary differential processesJournal of Applied Probability 8:344–356.https://doi.org/10.2307/3211904

Spread of infectious disease through clustered populationsJournal of the Royal Society Interface 6:1121–1134.https://doi.org/10.1098/rsif.2008.0524

The source of respiratory syncytial virus infection in infants: a household cohort study in rural KenyaThe Journal of Infectious Diseases 209:1685–1692.https://doi.org/10.1093/infdis/jit828

Incidence and severity of respiratory syncytial virus pneumonia in rural kenyan children identified through hospital surveillanceClinical Infectious Diseases 49:1341–1349.https://doi.org/10.1086/606055

WebsiteNovavax announces topline results from phase 3 PrepareTM trial of ResVax(TM) for prevention of RSV disease in infants via maternal immunizationNovavax. Accessed February 1, 2019.

The natural history of respiratory syncytial virus in a birth cohort: the influence of age and previous infection on reinfection and diseaseAmerican Journal of Epidemiology 176:794–802.https://doi.org/10.1093/aje/kws257

Bayesian inference for partially observed stochastic epidemicsJournal of the Royal Statistical Society: Series A 162:121–129.https://doi.org/10.1111/1467985X.00125

Sunshine, rainfall, humidity and child pneumonia in the tropics: timeseries analysesEpidemiology and Infection 141:1328–1336.https://doi.org/10.1017/S0950268812001379

Humidity and respiratory virus transmission in tropical and temperate settingsEpidemiology and Infection 143:1110–1118.https://doi.org/10.1017/S0950268814002702

Projecting social contact matrices in 152 countries using contact surveys and demographic dataPLOS Computational Biology 13:e1005697.https://doi.org/10.1371/journal.pcbi.1005697

DifferentialEquations.jl – A Performant and FeatureRich Ecosystem for Solving Differential Equations in JuliaJournal of Open Research Software 5:15.https://doi.org/10.5334/jors.151

Dynamics of infectious diseasesReports on Progress in Physics 77:026602.https://doi.org/10.1088/00344885/77/2/026602

Molecular analysis of respiratory syncytial virus reinfections in infants from coastal KenyaThe Journal of Infectious Diseases 193:59–67.https://doi.org/10.1086/498246

Profile: the Kilifi health and demographic surveillance system (KHDSS)International Journal of Epidemiology 41:650–657.https://doi.org/10.1093/ije/dys062

Differential evolution – A Simple and Efficient Heuristic for global Optimization over Continuous SpacesJournal of Global Optimization 11:341–359.https://doi.org/10.1023/A:1008202821328

Contrasting effects of strong ties on SIR and SIS processes in temporal networksThe European Physical Journal B 88:414–418.https://doi.org/10.1140/epjb/e2015605684

A highly potent extended halflife antibody as a potential RSV vaccine surrogate for all infantsScience Translational Medicine 9:eaaj1928.https://doi.org/10.1126/scitranslmed.aaj1928
Decision letter

Anna AkhmanovaSenior and Reviewing Editor; Utrecht University, Netherlands

Katherine AtkinsReviewer
In the interests of transparency, eLife publishes the most substantive revision requests and the accompanying author responses.
Acceptance summary:
Brand and colleagues present an agentbased model of respiratory syncytial virus transmission and vaccination and use it to explore potential vaccination schedules in pregnant women. They find up to 50% reductions in infant RSV infections with fairly high coverage of the prenatal vaccine. This work is important for informing future vaccination policy when RSV vaccines become available.
Decision letter after peer review:
Thank you for submitting your article "Reducing RSV hospitalisation in a lowerincome country by vaccinating motherstobe and their households" for consideration by eLife. Your article has been reviewed by Neil Ferguson as the Senior Editor, a Reviewing Editor, and two reviewers. The following individuals involved in review of your submission have agreed to reveal their identity: Katherine Atkins (Reviewer #2).
The reviewers have discussed the reviews with one another and the Reviewing Editor has drafted this decision to help you prepare a revised submission.
Summary:
In the current study Brand et al. use a mathematical model to estimate reductions in RSV hospitalizations in children following a theoretical maternal and >1 y/o vaccines. The study builds on much previous work conducted by members of the same group and adds an important dimension to the question surrounding RSV epidemiology and the potential for vaccination.
The authors find modest reductions in overall hospitalizations but a sizable reduction in hospitalizations in children <1 y/o – the most vulnerable group. It is encouraging to see the modelling being conducted with an LMIC setting too, as most previous RSV modelling efforts have been HICbased. The modeling methods are sound (and elegant) but the manuscript would benefit from some clarification of both methods and results.
Essential revisions:
1) I think it would be helpful if more motivation were to be provided on the reasoning behind the household model choice. If I understand correctly, there is a serious computational downside of solving these types of models, at the expense of some epidemiological realism (with respect to neglecting exposuredependent parameters, for instance). I'm not advocating the authors conduct a comparison, but I'm interested to know whether the choice of household model can ultimately reflect the impact of householdbased strategies more accurately than a more epidemiologicallyrealistic model can using approximations for the motherchild contact (for this, see again, Atkins et al., 2016).
2) The paper needs to be grounded in relation to vaccines currently being developed. There is no discussion as of now of potential future vaccines and there should be to motivate the work.
3) From what I understand, the contact within the household is assumed to be density dependent, whereas the contact outside of the household is assumed to be frequency dependent – is this right? Could you mention this?
4) Hasn't there been mixing matrices conducted in Kilifi that could be used?
5) It is a big assumption that the demographics are fixed in the 10 years of prediction. The birthrate has been declining in Kenya over the past 15 years (see https://www.indexmundi.com/g/g.aspx?c=ke&v=25). Thus, the reported reductions may be overestimated.
6. I'd like to have a comprehensive parameter table that describes all parameters used (vaccine duration, uptake etc. – including fitted parameters). This would help in understand the base case scenario – which was difficult to find as well as the reliability of the model and deviation from previous work.
7) Figure 2 suggests to me that the model underestimates the number of hospitalisations. It would be useful to see the absolute numbers stratified by age (rather than just by% ).
8) Figure 5A: It is extremely surprising to me that the postvaccine dynamics immediately equilibrate – do the age distribution infection also equilibrate immediately? Presumably, but this is also surprising.
9) Figure 6: I can't decipher what these combined strategies are – we need more information in the caption accompanied with a separate table that spells out which vaccine strategies are being considered. It is very difficult to interpret the differences between the strategies otherwise.
10) Does Figure 6 really report avoided hospitalizations? If so, why do avoided hospitalizations go down with increasing coverage? Also, I would suggest reversing the order of the legend to match the order of the lines.
11) Introduction: I'm not sure I agree with this assessment. Admittedly in the context of pertussis, but nevertheless, our modelling study suggested the benefit of cocooning in the presence of direct protection of the infant was extremely marginal (Atkins et al., 2016). That is, in terms of impact, there was no point in cocooning when substantial direct protection had been achieved.
12) Results section: Where are the derivations for the equations in box 1? They are not immediately obvious.
13) Results section: I'm a little sceptical at both the values of R0 and the conclusions drawn, namely that community transmission has an R0 < 1 (on average an infection initiated at random) produces <1 other case in the community). This is because when R0 is calculated from a model, the structure of the model, as the authors note, can substantially impact the value of R0 calculated. While the authors note the difference between age and household related structure, the number of exposure classes will also make a difference I think. Perhaps the authors can comment on this.
14) Materials and methods section: Originally you said you split up <1 and >1 years, but there is discussion of finer age groups here, this needs to be explained more clearly as I'm confused with the age stratification, its parameterisation and how it was implemented in the model.
15) Subsection “Conditional age of individuals”: When you say 'we calculated empirical distributions', it's not clear how you constructed these distributions, or how they were 'empirical' – more information please. Presumably there are parameterised from the KDHSS survey, but it's not clear. More link between the data and the distributions needed I think.
16) Subsection “Hospitalisation rates”: I think I understand what the authors are trying to do here – there needs a little bit of a fudge to account for the exposuredependent nature of RSV infection that is missing from the model. However, there are two forces at play here, which I think need to be captured independently. First, is the agespecific nature of infection – evidence points to severe infection / hospitalisation being necessarily age dependent, when the lung pathways are not fully developed and infected infants are more at risk of bronchiolitis than their older counterparts. (arguably very young infants are also more likely to be picked up in surveillance through increased testing and reporting). There is then the exposuredependent nature of infection, that is the higher chance of asymptomatic infection with increasing exposures. Thus, with passive protection from either extended life monoclonals or by maternal vaccination, the idea is to push infants out of their most risky period (agedependent severe infection), with the tradeoff that no vaccine or naturalimmunity is elicited and they still have the same risk of symptomatic infection as younger individuals. If I understand correctly, the model captures the latter mechanism, but not the first. More clarity is needed on distinguishing these phenomenons I think.
https://doi.org/10.7554/eLife.47003.sa1Author response
Essential revisions:
1) I think it would be helpful if more motivation were to be provided on the reasoning behind the household model choice. If I understand correctly, there is a serious computational downside of solving these types of models, at the expense of some epidemiological realism (with respect to neglecting exposuredependent parameters, for instance). I'm not advocating the authors conduct a comparison, but I'm interested to know whether the choice of household model can ultimately reflect the impact of householdbased strategies more accurately than a more epidemiologicallyrealistic model can using approximations for the motherchild contact (for this, see again, Atkins et al., 2016).
We have now expanded on this motivation in the main text (Introduction).
The additional computational complexity of simulating the ageandhousehold model used in this paper, compared to an agestructured model using an agespecific contact matrix, was a significant challenge. The advantage of explicit inclusion of household structure in the model is that the social contacts within the household are persistent over multiple RSV seasons, whereas agestructured models implicitly assume random mixing within the confines of the mixing matrix (that is all people of a given age group are equally likely to be contacted by any individual at any instant and therefore the chance of repeated contact become zero as the population size becomes large). In short, the ageandhousehold model used in this paper is a special type of contact network model whereas the standard agestructured model is effectively a random mixing model. Therefore, the usefulness of the household model depends on whether the network clustering of social contacts is important for simulating RSV transmission, and in particular for estimating the risk of transmission to under one year olds.
In the specific case of modelling highly seasonal RSV transmission, we would argue that a networklike transmission structure is important for capturing the relevant epidemiology. Most people have caught RSV by the age of two and will have multiple repeated episodes during their lifetime. The time between recovery from an episode and reversion back to at least partial susceptibility is estimated to be ~6 months. In Kilifi county, there are sharp annual peaks of RSV hospitalisation at each seasonal RSV epidemic, and so one should expect the population to consist of large numbers of entirely susceptible and partially susceptible individuals due to the interepidemic period being longer than the typical time over which loss of immunity to RSV occurs. These general considerations suggest that (i) RSV seasonal epidemics will be akin to repeated invasions of a nearly susceptible population, i.e. closer to the “epidemic” scenario than an “endemic” scenario, and (ii) RSV transmission is much closer to the “SIS” than the “SIR” paradigm. Network effects are most important during an epidemic invasive growth phase (Miller 2009) and are typically more important for SIStype dynamics with persistent contacts (Sun, Baronchelli and Perra, 2015). Both these features appear to be important for seasonal RSV transmission in Kilifi and therefore provide strong motivation for the networktype epidemic model we have used.
Another motivation for using detailed household structure in an epidemic model for Kilifi county was the availability of detailed demographic data covering the period over which we also have hospitalisation data. Our model was parametrised using the joint distribution of household type and age of household inhabitants; this allowed us to include agerelated effects (such as the agedependent probability of hospitalisation conditional on being infected) by reference to the age distribution for the type of household in which infection occurs. This would not be an effective modelling approach for infectious diseases with long periods of immunity (i.e. decades) where the agedistribution of susceptibility is a highly important factor. However, for the specific case of highly seasonal RSV we argue that this sufficiently captures the age distribution of susceptibles whilst allowing us to account for networklike social structure.
We consider the motherchild approximation used in Atkins et al. very appropriate for modelling pertussis in a highincome country with household sizes typically smaller than Kilifi, but believe it would be insufficient for modelling RSV in our setting. In a household containing a newborn one might also expect one or two parents, and possibly older siblings of the newborn. The length of time over which immunity to pertussis wanes after either a natural infection or routine childhood vaccination is believed to be in the order of decades. Therefore, the older siblings cohabiting with the newborn will have a reasonable probability of being immune (after either vaccination or natural infection) and not contribute to transmission within the household. Therefore, ignoring social clustering in the household is reasonable since a number of members of the household cluster cannot contract or transmit. This argument is supported by simulation studies comparing SIR type transmission both with and without explicit household structure (Glass et al., 2013); that is that the differences in longterm incidence between the two model types were marginal for most age groups when parametrised using the same demographic data. However, as we have argued above, specifically for modelling seasonal RSV this approximation is inadequate because most households will have a number of at least partially susceptible members, and therefore, social clustering is a relevant factor in epidemic prediction.
2) The paper needs to be grounded in relation to vaccines currently being developed. There is no discussion as of now of potential future vaccines and there should be to motivate the work.
The opening paragraph of the Introduction refers the reader to WHO goals for potential RSV vaccines and a summary document of current vaccines and their development stage. We have also presented our results in light of the partially successful ResVax trail.
These provide justification for investigating the option of boosting infant antibody through maternal immunization (or longlasting monoclonal at birth). Motivation for the option of vaccinating household cohabitants of motherstobe, arises in part from the same sources, which identify the paediatric population as the second key target group for vaccine development. We are further influenced by epidemiological studies suggesting that elder family members such as school age children or parents are possible sources introducing RSV into households leading to infant infection (Anderson et al., 2013; Graham, 2014). Of the current vaccines under development, temporary immunity to RSV in these older (seropositive) household individuals would most likely be achieved by a subunit vaccine (Anderson et al., 2013). We have updated the text and included the additional reference to Graham, 2014.
3) From what I understand, the contact within the household is assumed to be density dependent, whereas the contact outside of the household is assumed to be frequency dependent – is this right? Could you mention this?
This is correct, we assume density dependent transmission within the household. We have corrected the paper to mention this explicitly (subsection “Model Dynamics, forces of infection and susceptibility to RSV”).
4) Hasn't there been mixing matrices conducted in Kilifi that could be used?
The main paper that has used mixing matrices for Kilifi was Kinyanjui et al., 2015. Two different mixing matrices were considered each derived from a separate data source: (i) a contact diary study, and (ii) cooccupancy as recorded in the Kilifi health and demographic surveillance system (KHDSS) over a snapshot of Sept 2010 – Jan 2011 (Scott et al. 2012). We couldn’t construct a reliable estimate of the full social contact graph from 568 diary responses, therefore we used the KHDSS cooccupancy data to construct a social contact graph of household cohabitants, and assumed that other social contacts, for example at schools, occurred as agestructured random mixing.
Our approach to using the same KHDSS data differed from Kinyanjui et al., 2015 in two ways: first, they used household cooccupancy to create an agestructured mixing matrix, effectively deconstructing the social contact network, whereas we maintain the social contact network for households, and second, they used a single snapshot of the KHDSS sample population, whereas we used multiple snapshots and dynamically evolved the household structure between snapshots.
We have made our approach clearer in a new subsection “Joint distributions of age and household occupancy”.
5) It is a big assumption that the demographics are fixed in the 10 years of prediction. The birthrate has been declining in Kenya over the past 15 years (see https://www.indexmundi.com/g/g.aspx?c=ke&v=25). Thus, the reported reductions may be overestimated.
We have been sensitive to incorporating known historic changes in demography into our inference method, both in terms of numbers of individuals available to be infected and their joint ageandhousehold type distribution. This had the benefit that we could disentangle seasonal variation in hospitalisation from simply having a larger number of individuals at risk of hospitalisation.
However, for forecasting, we felt that it was reasonable to consider an idealised demographically static population with results presented in terms of percentage reduction of hospitalisation. Our reasoning was that although the crude percapita birth rate has declined in Kenya as the population size has increased, the size of the critically atrisk under oneyearold population in Kilifi has stabilised at roughly 8500. Also, in forecasting demographic change in our model we would need to consider combinations of factors. For example, if there was a declining number of critically atrisk newborns but also an increase in the typical number of individuals per household due to increasing absolute population size we might expect hospitalisations to increase due to increasing transmission risk per newborn, despite the number of newborns atrisk decreasing. This would be an interesting avenue of investigation for future work, but not appropriate for this paper. We have added an explicit discussion on this into the Discussion section.
6. I'd like to have a comprehensive parameter table that describes all parameters used (vaccine duration, uptake etc. – including fitted parameters). This would help in understand the base case scenario – which was difficult to find as well as the reliability of the model and deviation from previous work.
We have now added a table of vaccination scenarios (Table 1) and moved both the literature derived parameter table (Table 2) and the inferred parameters table (Table 3) into the main text from the supporting information.
7) Figure 2 suggests to me that the model underestimates the number of hospitalisations. It would be useful to see the absolute numbers stratified by age (rather than just by% ).
The prediction of the model, after parameter inference, is that the total number of hospitalisations would be Poisson (2271) distributed whereas the true number of hospitalisations was 2382 (4.7% relative error compared to mean prediction).
However, most of the error is concentrated in the outlier RSV epidemic during 20052006. This was the only year which had three pronounced peaks in hospitalisations at least a month apart: two smaller peaks on 11th Dec 2005 and 24th Mar 2006 around a larger peak on 24th Feb 2006. The model was unable to capture this unusual temporal pattern in hospitalisation.
With the 20052006 RSV season excluded (all datapoints from 4th Nov 2005 to 9th June 2006) the model predicts an average of 2147 hospitalisations over the rest of the period compared to 2174 actual hospitalisations (1.2% relative error). This is now discussed in Results section.
Figure 2 has been altered to present absolute numbers.
8) Figure 5A: It is extremely surprising to me that the postvaccine dynamics immediately equilibrate – do the age distribution infection also equilibrate immediately? Presumably, but this is also surprising.
We were initially surprised by the rapid shift in dynamics as well (it is also true for the age distribution). However, in the context of seasonal RSV transmission and the specific vaccination strategies considered in the paper, we find the rapid shift understandable. First, even with both vaccination types used at the maximum effectiveness and coverage considered in the paper the reduction in total RSV infections is slight (<4% reduction compared to no vaccination in Figure 5A). Therefore, despite a significant reduction in hospitalisation, the overall epidemic should not be thought of as having been pushed far from its previous dynamics. Second, as mentioned above, seasonal RSV is closer to the “SIS” paradigm of disease rather than the “SIR” paradigm. Therefore, a perturbation away from equilibrium does not necessarily cause additional oscillatory dynamics (typical of perturbed SIR models) unless we get close to eradication. We noted that in Kinyanjui et al., 2015for their agestructured model based on household cooccupancy, which can be thought of as the agestructured “version” of our model, there is also rapid transition from high rates of hospitalisation prevaccination to lower levels postvaccination. We mention the rapid change as notable on Results section, as well as discussing the more SIStype dynamics in the Introduction.
9) Figure 6: I can't decipher what these combined strategies are – we need more information in the caption accompanied with a separate table that spells out which vaccine strategies are being considered. It is very difficult to interpret the differences between the strategies otherwise.
10) Does Figure 6 really report avoided hospitalizations? If so, why do avoided hospitalizations go down with increasing coverage? Also, I would suggest reversing the order of the legend to match the order of the lines.
Reply to (9) and (10): The purpose of Figure 6 was to demonstrate the efficiency of vaccination at different levels of coverage; that is either reduced numbers of hospitalisations or infections per vaccine. However, we agree that its current format leaves the reader unsure about the message of the plot. We have reworked this plot for additional clarity. We have redone Figure 6 to remove the dashed lines and give the efficiency results in stacked bar plot format, as well as redoing the caption to reference Table 1. We have also substantially rewritten the paragraph explaining these findings (Results section).
11) Introduction: I'm not sure I agree with this assessment. Admittedly in the context of pertussis, but nevertheless, our modelling study suggested the benefit of cocooning in the presence of direct protection of the infant was extremely marginal (Atkins et al., 2016). That is, in terms of impact, there was no point in cocooning when substantial direct protection had been achieved.
We agree that if substantial direct protection for infants from RSV could be achieved then additional measures would not be necessary, e.g. a vaccine that gave substantial protection from birth for first two years of life would dramatically reduce RSV hospitalisations irrespective of other factors.
However, it is not yet clear whether such a highly effective vaccine will become available; the results of the Novavax ResVax maternal vaccine were mixed indicating only partial success in direct protection of infants. One outcome of our modelling study is to demonstrate that a mixed strategy of partially effective vaccines could be complementary. We have adjusted our Discussion section to make this clearer. This is mentioned in the Introduction.
We would caution against drawing too much analogy with pertussis in the USA. Essentially, ‘cocooning’ has already occurred in that context since high coverage and effectiveness of childhood pertussis vaccination already exists. As argued above, in a typical household containing a newborn and one or two parents in the USA the most likely other household cohabitants would be older siblings of the newborn, and due to (DTaP) vaccination coverage these older siblings have a high probability of being immune to transmission. In this context, one might expect 6 months of effective direct protection given to a newborn by a booster (Tdap) vaccine for the mother, as per Atkins et al., 2016 to be sufficient.
We have reworded this section to make the comparison (and differences) with pertussis clear (Introduction).
12) Results section: Where are the derivations for the equations in box 1 (Results section)? They are not immediately obvious.
This was an oversight; we have added a complete derivation to the supporting information. Also, there was a typo in the relative reduction result as presented in Box 1 which has been corrected.
13) Results section: I'm a little sceptical at both the values of R0 and the conclusions drawn, namely that community transmission has an R0 < 1 (on average an infection initiated at random) produces <1 other case in the community). This is because when R0 is calculated from a model, the structure of the model, as the authors note, can substantially impact the value of R0 calculated. While the authors note the difference between age and household related structure, the number of exposure classes will also make a difference I think. Perhaps the authors can comment on this.
This section was misjudged. We calculated the R0 value from a nextgeneration matrix based on the inferred transmission rates. However, the transmission rates were inferred jointly with the seasonality parameters so analysing them independently was a poor decision. We have now removed this paragraph it did not add substantially to the main message of the paper.
The relationship between exposure classes and R0 for RSV is interesting because it is currently believed that individuals are typically less susceptible, less infectious and have a shorter infectious duration on their second and subsequent exposure to RSV infection. However, R0 is calculated with reference to an idealised completely naïve population; that is that everyone in the idealised population has never been exposed. Therefore, threshold R0 only depends on the susceptibility and infectiousness of the first exposure class.
In principle, this could mean that the first exposure class (effectively newborns and young children) is critical to maintaining RSV in the population, since it is possible that RSV could not be sustainable amongst only people in multiple exposure classes. However, it seems unlikely that a virus with seemingly high prevalence in every population is being sustained by such a small group. Whilst this is an interesting aspect, we haven’t added to the main manuscript to reflect this discussion because we feel it would dilute from focus of this paper: estimating transmission rates and therefore the potential impact of a specific set of vaccination strategies.
14) Materials and methods section: Originally you said you split up <1 and >1 years, but there is discussion of finer age groups here, this needs to be explained more clearly as I'm confused with the age stratification, its parameterisation and how it was implemented in the model.
This was a poorly described aspect of the model. From a dynamical point of view we use a simple <1 and >1 year old age classification, however within those groups we get richer detail by considering what age a <1/>1 year old is likely to be given the household they live within.
In brief, age stratification is implicit within our modelling framework. We know the agestructure within all households in the KHDSS area with children under and over one – therefore we can relate any infection in a household to potential ages. This is used to both derive agedependent infection probabilities and agestructured mixing between households.
In this modelling study, we invert the usual approach to averaging over epidemiologically important quantities for “SIR” type infection models. For infectious pathogens that provoke longlasting or permanent immunity the agedistribution of susceptibility is a critical dynamic component, and therefore, it makes sense to model age structure dynamically and average over social structure (i.e. household cohabitation). For seasonal RSV, we expect a large fraction of the population will be at least partially susceptible at the beginning of each epidemic. In this setting, we believe that age structure is less important than social structure to the dynamics of transmission. Consequently, we model household structure explicitly and consider the age of individuals, conditional on their household and whether they are known to be underone or overone. This allowed us to estimate agedependent quantities such as the risk of hospitalisation without having to include them as explicit dynamic model variables.
We feel that this was an innovative aspect to our modelling approach, which we have not explained clearly. We have added an extra subsection on this aspect in the model description (subsection “Joint distributions of age and household occupancy”).
15) Subsection “Conditional age of individuals”: When you say 'we calculated empirical distributions', it's not clear how you constructed these distributions, or how they were 'empirical' – more information please. Presumably there are parameterised from the KDHSS survey, but it's not clear. More link between the data and the distributions needed I think.
As the reviewer guessed this was derived by linking individuals presents in the KHDSS surveys by their house ID number. We have substantially increased our description of this in the paper subsection “Joint distributions of age and household occupancy” and Appendix.
16) Subsection “Hospitalisation rates”: I think I understand what the authors are trying to do […] If I understand correctly, the model captures the latter mechanism, but not the first. More clarity is needed on distinguishing these phenomenons I think.
It is correct that the main protective effect of the maternal/extended life monoclonal vaccine is to shift the age of first infection away from the critical early months of life. However, we argue that our model is better at capturing agespecific effects of infection, and therefore possible hospitalisation compared to exposuredependent effects. Because a significant percentage of children have caught RSV by the age of one, and a large majority by the age of two, it is hard to disentangle exposure vs age as risk factors for severe disease due to RSV. However, there is some evidence that age is the more critical factor (Ohuma et al., 2012).
As mentioned above, we assume that individuals, within the crude underone/overone dynamic groups, have an age distribution which depends on their household size and whether the household contains an underone year old or not. This allowed us to capture social structure aspects such as households with an underone also typically having school age children (if the household size is > 3) which potentially alters the risk of RSV introduction.
The exposuredependent nature of infection is covered by the approximation that all overones have been infected at least once by RSV. A large majority of all people over the age of two have contracted RSV, so only the 12 year olds are likely to be significantly misrepresented by this assumption. If a truly effective RSV vaccine became available, this approximation would become increasingly problematic as the average age of first infection increased, however for the scenarios investigated in this paper we consider this approximation reasonable.
https://doi.org/10.7554/eLife.47003.sa2Article and author information
Author details
Funding
The Wellcome Trust (102975)
 David James Nokes
The funders had no role in study design, data collection and interpretation, or the decision to submit the work for publication.
Acknowledgements
This work was funded by the Wellcome Trust (Grant ref 102975), and was published with permission of the Director of KEMRI. Kat Rock kindly supplied some clipart for plotting.
Senior and Reviewing Editor
 Anna Akhmanova, Utrecht University, Netherlands
Reviewer
 Katherine Atkins
Publication history
 Received: March 19, 2019
 Accepted: March 26, 2020
 Accepted Manuscript published: March 27, 2020 (version 1)
 Version of Record published: September 10, 2020 (version 2)
Copyright
© 2020, Brand et al.
This article is distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use and redistribution provided that the original author and source are credited.
Metrics

 669
 Page views

 89
 Downloads

 5
 Citations
Article citation count generated by polling the highest count across the following sources: PubMed Central, Crossref, Scopus.