 Research
 Open Access
 Published:
The crosssectional average length of healthy life (HCAL): a measure that summarizes the history of cohort health and mortality
Population Health Metrics volume 18, Article number: 21 (2020)
Abstract
Background
Healthy life years have superseded life expectancy (LE) as the most important indicator for population health. The most common approach to separate the total number of life years into those spent in good and poor health is the Sullivan method which incorporates the health dimension to the classic period life table, thus transforming the LE indicator into the health expectancy (HE) indicator. However, life years derived from a period life table and health prevalence derived from survey data are based on different conceptual frameworks.
Method
We modify the Sullivan method by combining the health prevalence data with the conceptually better fitting crosssectional average length of life (CAL). We refer to this alternative HE indicator as the “crosssectional average length of healthy life” (HCAL). We compare results from this alternative indicator with the conventional Sullivan approach for nine European countries. The analyses are based on EUSILC data in three empirical applications, including the absolute and relative level of healthy life years, changes between 2008 and 2014, and the extent of the gender gap.
Results
HCAL and conventional HE differ in each of these empirical applications. In general, HCAL provides larger gains in healthy life years in recent years, but at the same time greater declines in the proportion of healthy life years. Regarding the gender gap, HCAL provides a more favourable picture for women compared to conventional HE. Nonetheless, the extent of these differences between the indicators is only of minor extent.
Conclusions
Albeit the differences between HE and HCAL are small, we found some empirical examples in which the two indicators led to different conclusions. It is important to note, however, that the measurement of health and the data quality are much more important for the healthy life years indicator than the choice of the variant of the Sullivan method. Nonetheless, we suggest to use HCAL in addition to HE whenever possible because it widens the spectrum of empirical analyses and serves for verification of results based on the highly sensitive HE indicator.
Background
Healthy life years have superseded life expectancy as the most important indicator for population health. It enables researchers to investigate, e.g. the proportion of life years spent in good/poor health, trends in life years spent in good respective poor health (the “compressionexpansiondebate”), and differences between women and men [1,2,3,4]. In order to estimate the quality dimension of life years, the health expectancy indicator (HE) has been developed, which combines mortality and morbidity in a single indicator by incorporating the health dimension into the life table [5]. Even though several methods have been proposed for this purpose, the approach developed by Sullivan [6] is the most prominent one up to now [7]. It uses agespecific prevalence (proportions) of the population in the (un)healthy state, usually obtained from crosssectional survey data, to apportion the life table personyears lived between the states of good and poor health [8].
In the application of the Sullivan method, it is frequently overlooked that life years derived from a period life table and health prevalence derived from survey data are based on different conceptual frameworks. Whereas the former reflects the life span of a hypothetical population constructed on the basis of current agespecific death rates, the latter reflects the actual health condition of real individuals [9, 10]. To overcome this conceptual mismatch between health and mortality information, it was suggested to base the Sullivan method on the “CrossSectional Average Length of Life” (CAL) instead of conventional period LE [11,12,13]. To our knowledge, this approach has not been applied empirically so far. We aim at closing this research gap by using this variant of the Sullivan approach, to which we refer as “CrossSectional Average Length of Healthy Life” (HCAL). Our central research question is to what extent the underlying mortality indicator, i.e. LE vs. CAL, affects the resulting estimates for healthy life years, i.e. HE vs. HCAL.
The paper is structured as follows: We start with a conceptual description of CAL in comparison to period and cohort LE to demonstrate that CAL is a combination of these two approaches. Then, we construct the HCAL indicator and discuss the difference between the mortality information in CAL and conventional LE with respect to its applicability to the Sullivan method. The empirical section starts with a description of data, followed by the presentation of our results. Here, we compare HCAL and conventional HE for nine European populations with regard to the absolute and relative level of healthy years, changes between 2008 and 2014, and differences between women and men. Finally, we discuss the advantages and disadvantages of HCAL as an alternative to HE.
Methods
Life expectancy and crosssectional average length of life
Longevity measures usually follow a period or cohort concept. In a cohort life table, the observed agespecific survival probabilities define the survivorship function for a particular cohort born in time t. Integrating across all ages yields cohort LE at birth (\( {e}_0^c \)), i.e. the mean age at death for this particular cohort. Formally, \( {e}_0^c \) can be written as
with p_{c}(x, t) being the probability for individuals born in time t to survive until age x. Because \( {e}_0^c \) can only be calculated for extinct cohorts, and thus reflects past mortality conditions, period LE is a more convenient summary measure for current mortality levels and for tracking recent mortality trends. In this concept, the agespecific survival probabilities do not correspond to one particular birth cohort but to one particular period, i.e. constructed from the observed agespecific death rates of this calendar year. Integrating the resulting periodspecific survival probabilities over all ages leads to period LE at birth (\( {e}_0^p \)) for year t
with p(x, t) being the probability for individuals to survive until age x if they had been exposed to the survival probabilities prevailing at time t throughout their lives from birth to age x. Consequently, period LE reflects the average age at death of a hypothetical cohort under the assumption that the periodspecific death rates remain unchanged over their entire life course.
The CAL concept combines the two classic concepts in the sense that it (1) refers to actual cohort mortality (i.e. it is based on longitudinal survival probabilities) and (2) corresponds to all cohorts alive in a given period (resulting in a crosssectional summary measure of mortality experiences). CAL was originally introduced by Brouard [14] and further elaborated by Guillot [15] and CanudasRomo and Guillot [16]. In the literature, this mixed period cohort concept has been labelled the “wedgeperiod perspective” [17], “crosssectional cohort average” [18], or the “crosssectional cohort mortality index” [19]. CAL is based on cohort survival probabilities (proportion of survivors) from birth until the last age reached at time t. Integrating this function across all ages yields CAL(t) as
with p_{c}(x, t − x) being the probability that a member of the cohort born at time t − x survives until age x. CAL(t) can be interpreted as a period longevity measure in the sense that it “[...] refers to a particular period t, but takes into account the actual mortality conditions to which cohorts present in the population at time t have been subject” ([15], p. 42).
Figure 1 illustrates the three demographic concepts using the example of French males in 2015, all constructed with data of the Human Mortality Database [20]. The upper panel shows the basic concepts in a Lexis surface. While the classic life table concept summarizes the mortality experiences of one single cohort (real or hypothetical) over its life course, CAL includes all mortality experiences experienced by cohorts alive in a given period. The lower panel shows the empirical survivorship functions corresponding to the three concepts. The areas under the curves yield cohort LE, period LE, and CAL, respectively, being 53.13 years for the cohort born in 1915, 79.02 years for the calendar year 2015, and 73.85 years for CAL in 2015. The 1915 birth cohort experienced relatively high mortality over all ages. Periods LE and CAL show similar survivorship patterns up to age 30. Then, the p(x, t) function of period LE is more rectangular compared to the p_{c}(x, t − x) function of CAL. This is because the p_{c}(x, t − x) function corresponds to actual cohorts of which many experienced higher mortality than current conditions which are reflected in period LE. The specific construction of CAL is also the reason why p_{c}(x, t − x) is not a monotonically decreasing function. Whenever a cohort has been exposed to higher mortality conditions compared to the mortality experience of the previous (older) cohort, p_{c}(x, t − x) will increase. This can be seen in Fig. 1 for French males born in 1945 (who reached age 70 in 2015). The proportion of cohort survivors of the 1944 birth cohort (who reached age 71 in 2015) is higher compared to the 1945 birth cohort even though they were born earlier, and therefore were longer exposed to the risk of dying.
Whether an indicator should be based on the cohort, period, or crosssectional cohort average perspective depends on the purpose of its use. The synthetic cohort approach is a powerful tool if the aim is to examine period mortality, e.g. to investigate changes in period death rates. However, the actual survival trajectory of individuals is usually poorly captured in the hypothetical cohort scenario. For that reason, the cohort perspective is essential to analyse the real mortality experience of people in the framework of age and calendar time. However, a population at a certain time is composed of a number of cohorts. Therefore, the experience of one cohort is not representative for the entire population. This is the central advantage of the crosssectional cohort average concept. It refers to the population as a whole by taking into account the complete mortality history of cohorts, i.e. how the actual population alive results from the cohorts’ past mortality experiences. Nonetheless, the empirical application of CAL is rare, mainly because of its high demand on the data. It has been used to study the impact of mortality on population size and growth [15], to evaluate population momentum [21], and to compare populations in terms of their mortality history [16].
Derivation of HE and HCAL with the Sullivan method
The Sullivan method divides the total number of life years into those spent in good and in poor health. Using the life table notation, life years are expressed as personyears lived (_{n}L_{x}). The _{n}L_{x} function allows to define e_{o}^{p}(t) as the sum of all agespecific personyears lived (divided by the period life table radix l_{0}^{p}):
with \( {}_n{L}_x^p \) being the number of personyears lived between age x and x + n in a life table for the period t and \( {l}_0^p \) the corresponding number of people alive at age 0 (i.e. the number of newborns). CAL can be constructed from personyears lived as well. In fact, CAL is the sum of the age and cohortspecific personyears lived divided by the cohort life table radix \( {l}_0^c \), i.e. the number of newborns to which all cohorts are standardized:
with \( {}_n{L}_x^c \) being the number of personyears lived between age x and x + n in the life table for the cohort born between (t – x − n) and (t − x) and \( {l}_0^c \) the corresponding number of people alive at age 0.
The Sullivan method is based on the idea of applying the agespecific prevalence (proportions) of the population in an (un)healthy state to the agespecific personyears lived. In this way, the total life years in each age interval can be divided into those spent in good and in poor health. Summing up only the healthy personyears lived across all ages gives HE and HCAL, respectively, from:
with _{n}π_{x} being the agespecific prevalence (proportion) of poor health in the age interval x to x + n at time t. The proportion of healthy life years on total life years is given by the ratios HE/LE and HCAL/CAL, respectively. Alternatively, HE and HCAL can be derived directly from the corresponding survivorship functions by weighting the survival probabilities with the population proportions of individuals being in good health. Integrating the derived functions across all ages yields HE and HCAL in continuous time from
Equations 8 and 9 demonstrate that HE and HCAL solely differ in terms of the underlying survivorship function (p(x, t) vs. p_{c}(x, t − x)), while the π(x, t) function remains the same for both measures. The combination of p_{c}(x, t − x) with π(x, t), i.e. HCAL, is illustrated in Fig. 2 with data for French males in 2015. Each of the vertical lines in the right panel corresponds to a proportion of cohort survivors. The left shows the proportion of poor health according to the EUSILC data [22]. For example, about 80% of the 1955 birth cohort survived up to 2015 (i.e. reached age 60) and approximately 30% of the same birth cohort reported to be mildly or strongly limited in 2015. Combining these two quantities gives the probability of being both healthy and alive in 2015: 0.8 · (1 – 0.3) = 0.56. The agespecific survival in good health (i.e. free of limitations) is shaded in dark grey.
Since both the p_{c}(x, t − x) and the π(x, t) function correspond to the same group of individuals, HCAL creates a consistent combination of mortality and health quantities. By contrast, combining the p(x, t) function with the π(x, t) function, the conventional Sullivan method procedure (Eq. 8) combines survival probabilities corresponding to one hypothetical cohort with the health statespecific prevalence of several real cohorts. Strictly speaking, this results in a probability, which is neither reflecting healthy survival in a synthetic cohort fashion nor in a real cohort perspective. A detailed description of the particular implications for HE and HCAL resulting from the different conceptual approaches can be found in the Appendix.
Data sources for single agespecific mortality and prevalence
We estimate HE and HCAL with data for Denmark, Finland, France, Germany, Italy, the Netherlands, Norway, Sweden, and the UK for the years 2008 to 2014. To test whether the indicators provide different results, we compare the corresponding countryspecific estimates with regard to the total level of healthy life years, genders differences, and changes over time. The HE and HCAL estimation requires single age, period, and cohortspecific death rates and single agespecific proportions of the (un)healthy population observed in the given country and year.
Agespecific death rates for estimating LE and CAL were taken from the Human Mortality Database (HMD). For the UK, HMD data is available from 1922 onwards. However, calculating (H)CAL in 2005 requires data beginning in 1905 (defining age 100 as the highest age). To obtain mortality rates before the year 1922, we combined HMD data for the UK with HMD data for England and Wales which is available from 1841 onwards. For Germany, we used cohort mortality data from Destatis [23] because HMD provides German mortality data only from 1956 onwards. While Destatis publishes cohort life tables, the HMD provides cohort death rates only for cohorts that have lived at least 30 years (from age 0 to 29). Yet, period life tables are available also for more recent years. Therefore, we reconstructed the cohort survivorship for the (H)CAL calculation for all countries but Germany by combining the agespecific death rates of the period life tables longitudinally along the cohorts’ life course (see supplementary material). This technique has already been used in previous empirical estimations of CAL [16].
Agespecific prevalence data was taken from the European Union Statistics on Income and Living Conditions (EUSILC) [22]. We defined “being healthy” on the basis of the “Global Activity Limitation Indicator” (GALI). GALI has been developed for providing a harmonized health indicator for monitoring population health in Europe [24] and refers to the question: “For at least the past six months, to what extent have you been limited because of a health problem in activities people usually do?” with the three answer categories “strongly limited”, “limited, but not strong”, and “not limited”. We defined being in the healthy state if respondents reported to be “not limited”. Unfortunately, the harmonization of GALI is still imperfect, hampering the comparison of HE estimates between countries and over time [25]. Previous research found that health indicators are sensitive with respect to the mode of data collection [26], the choice of the survey [27], and the wording of the health survey question [28]. Our selection of countries and time span was therefore driven by avoiding any substantial breaks in the time series and by choosing countries with the required mortality data available in the HMD (besides Germany for which data was taken from Destatis).
The countryspecific sample sizes and the prevalence of being unhealthy using GALI are presented in Table 1 (separated by gender). Some data problems become apparent in these figures. For example, the prevalence of being unhealthy decreased strongly in Sweden between 2013 and 2014, and we, therefore, excluded Sweden from the time trend analysis. In Norway and Finland, we find extreme outliers in the prevalence values in 2011 and 2013, respectively, while for Italy, no EUSILC data is available in 2010. These breaks and outliers are also mentioned in the Eurostat database [29], indicating that we cannot analyse the full time span 2008 to 2014, but at least we can compare the years 2008 and 2014. Because the health data is highly fluctuating between single age groups, we applied the R package “MortalitySmooth”. The package has been developed for smoothing count data, which can be assumed to be Poissondistributed [30] and provides two smoothing functions: “Mort1Dsmooth” assumes smoothness in a onedimensional (over age) way and “Mort2Dsmooth” for a twodimensional setting (over ages and years). We applied “Mort2Dsmooth” to the data for countries without a break or an outlier between 2008 and 2014 (France, the UK, Denmark, the Netherlands, and Germany). The remaining countries (Finland, Norway, Italy, and Sweden) were smoothed in a onedimensional way. Single agespecific proportions of being unhealthy were derived from the smoothed health data. In order to take into account also the uncertainty from the survey sample size, we approximated single agespecific standard errors by using the approximation formula ([8], p. 27), i.e. applying the smoothed proportions of being unhealthy to the observed number of persons in the corresponding age intervals. These standard errors were used to approximate 95% confidence intervals for HE and HCAL estimates which are presented in Tables 4 and 5. As to be expected, the uncertainty in single agespecific prevalence data is substantial, and we do not find statistically significant differences between HE and HCAL in an any of our empirical analyses. Therefore, we compare the two indicators for healthy life years without confidence intervals in the following section. We come back to this issue at the end of the paper when we discuss the properties of HE and HCAL.
Results
Level of healthy life years estimated with HCAL and conventional HE
Table 2 shows the estimates LE, HE, CAL, and HCAL for the nine European countries, separately for females and males. As expected, CAL is lower than LE in each country and for each gender group. This results from the fact that CAL includes also (higher) historical death rates, whereas LE is solely build up from recently observed (relatively lower) death rates. Interestingly, the differences between HE and HCAL is smaller than the differences between LE and CAL. This relationship reverses in relative terms, however. The ratio HCAL/CAL is slightly higher than the ratio HE/LE in all nine countries (and for both genders). This is due to the relative difference between the p(x, t) function and the p_{c}(x, t − x) function. In relative terms, the p_{c}(x, t − x) function is higher at young ages and lower at older ages compared to the p(x, t) function. In other words, the relative number of deaths is higher at young ages and lower at older ages on the basis of CAL. Since the prevalence of individuals in the unhealthy state is usually low in younger ages but increases with age, HCAL provides higher proportions of healthy life years than conventional HE (see Appendix for more details). Especially, Italian, French, and German males show a (comparatively) large gap between the ratios HE/LE and HCAL/CAL because these populations experienced high mortality in the past, particularly during the World War II. For example, Italian males spend 81.06% of their total life years in good health on the basis of HCAL, whereas the proportion of healthy life years is only 78.44%. Including the mortality history of cohorts results also in a different country ranking on the basis of CAL. While LE ranks Italy (for both genders) relatively high, CAL favours Sweden and Norway. However, HE and HCAL rank the nine analysed countries similar for females and males, the ranking changes only slightly. This is because the prevalence of activity limitations varies substantially between countries, compensating most mortality differences between LE and CAL.
Changes in healthy life years over time: compression vs. expansion of morbidity
Figure 3 shows changes in LE, HE, CAL, and HCAL from 2008 to 2014 for France, the Netherlands, Denmark, and Germany, separated by gender. While CAL increases in a more or less linear fashion, LE shows some fluctuations over time. The robust trend in CAL results from the fact that it is based on a large number of agespecific death rates, and thus, it is not much affected by shortterm fluctuations in period mortality (for more details see [15]). Nonetheless, the trend in HCAL is not as linearly increasing as the trend in CAL. Instead, it follows the trend in HE, indicating that prevalence data is the driving force in the corresponding time trend in healthy life years and that the choice of the basic survival function does not matter significantly.
Moreover, CAL is increasing faster than LE over time. While LE for French males increases between 2008 and 2014 by 1.67 years, CAL rises by about 2 years in the same period (see Table 3). Consequently, the increase in HCAL is also higher than the increase in HE (0.52 years vs. 0.29 years).
Note that differences in the increase in the total number of life years according to CAL and LE affect also the trend in the proportion of healthy life years, i.e. the ratio HCAL/CAL and the ratio HE/LE. In general, the increase in CAL exceeds the increase in LE between 2008 and 2014, resulting in higher decreases in the HCAL/CAL ratio. Denmark appears as a special case because both males and females show higher gains in LE compared to CAL. Accordingly, the reduction in the proportion of healthy life years is more pronounced on the basis of conventional HE.
Nonetheless, the proportions HCAL/CAL and HE/LE largely agree on the direction of the trend in healthy life years, i.e. whether we observe an expansion or compression of morbidity. The only exception is Italian females. Whereas the HE/LE ratio indicates a relative increase in healthy life years, i.e. relative compression of morbidity, the HCAL/CAL ratio suggests a slight trend in the direction of relative morbidity expansion.
Gender differences in healthy life years
Both mortality indicators (LE and CAL) show a female advantage in the total number of life years for all analysed populations (see Fig. 4). This gender gap is larger for CAL, indicating that the difference between male and female mortality was higher in the past compared to recent years. As a consequence, the gender gap in healthy life years according to HCAL is weighted stronger in the direction of females, i.e. HCAL provides either a larger female advantage or a smaller female disadvantage compared to HE. The differences between HCAL and HE in the extent of the gender gap differ, therefore, depending on whether females or males have a higher number of healthy life years. In Finland, Norway, Sweden, Denmark, and the Netherlands, males show a lower agespecific prevalence of activity limitation compared to women. Consequently, the gender gap is larger based on conventional HE in these populations. Yet, in countries where the agespecific prevalence of activity limitation is lower among females (Italy, the UK, and France), the gender gap is larger according to HCAL. In Germany, we find a specific situation in which HE and HCAL appear as contradictive in terms of the direction of the gender gap. The female survival advantage in CAL is large enough to compensate their higher agespecific prevalence of activity limitations, resulting in more healthy life years for females in HCAL, whereas conventional HE gives more healthy life years for males.
In relative terms, the proportion of healthy life years on total life years is higher for males in all nine countries. This male advantage is larger according to HCAL than according to HE (see Table 2). As mentioned above, differences between HE and HCAL in relative terms result from the relative difference between the p(x, t) function and the p_{c}(x, t − x) function, leading to an (un)favourable agespecific weighting scheme. The comparatively high mortality for males measured with CAL promotes a situation in which higher weights are assigned to the young ages with low prevalence of being unhealthy (see Fig. 6).
Discussion
We started this paper with a description of three different concepts for measuring longevity in a population. We have concluded that only cohort LE is an appropriate choice if the aim is to examine the reallife course experience of people in the framework of age and calendar time. However, the concepts of period LE and CAL are more convenient for monitoring health and mortality on the population level as they summarize the mortality information of an entire population, instead of focusing exclusively on a specific group of individuals. In addition, CAL and period LE are more timely than cohort LE because they can be estimated also for recent periods. However, CAL includes a high proportion of historic death rates, while period LE solely reflects recent mortality rates (see [18] for more details).
Whereas period LE reflects the life course of one hypothetical cohort by linking together a set of agespecific death rates observed in a given period, CAL summarizes the complete mortality experience of all actual living cohorts from theirs birth until the current period. A specific feature of CAL is that it takes into account the natural process of survival, i.e., the overall mortality in a certain year or period is conceptualized as the product of past exposures and health behaviour that have accumulated over the entire life span of the currently living cohorts [31]. Therefore, an observed increase in CAL between two points in time corresponds to the factual longevity gains experienced by individuals present in the given population. By contrast, trends in period LE are more difficult to interpret and might be distorted by several effects such as cohort and tempo effects or heterogeneity [19, 32]. For example, the stagnation and rise seen in Danish women’s period LE has been attributed to specific cohorts rather than to changes in period mortality conditions [33], and public health researchers are currently investigating to which extent the recent observed stalling in period LE in the UK and Europe represents a “real” deterioration of population health [34].
Period LE and CAL can be extended to HE and HCAL by applying the Sullivan method. While HCAL links the proportions of healthy individuals observed in a given population to the corresponding proportions of cohort survivors, conventional HE combines the health information of real cohorts with the survival trajectory of a hypothetical cohort. These features make CAL a more appropriate basis for the estimation of healthy life years with the Sullivan method. However, CAL is less affected by changes in recent death rates and increases also in years where period LE decreases. Therefore, one could argue that conventional HE estimates are more timely than HCAL estimates.
It is important to note that the Sullivan method itself has been criticized for producing misleading results regarding monitoring changes in population health [35,36,37]. The main argument of these critiques focused on using prevalence instead of incidence data. In terms of health, prevalence reflects the proportion of individuals in the unhealthy state at a given point in time. This includes individuals who transitioned from healthy to unhealthy in the observation period as well as those who experienced this transition already in the past. The incidence of being unhealthy, however, refers exclusively to individuals who experienced transitions during the given calendar year (or period). As a consequence, health indicators estimated with the Sullivan method cannot capture a sudden shortterm change in population health regardless of the choice of the mortality information [38]. Nevertheless, prevalencebased indicators such as HE and HCAL are convenient for measuring the current health composition of a population, i.e. the actual proportion of healthy/unhealthy individuals in a population [39].
The use of health prevalence data makes HCAL the conceptually more coherent indicator for monitoring population health. The data demands are, however, somewhat higher for HCAL than for HE. This leads to the question, whether results between HCAL and HE differ to an extent that justifies this higher data demand. Therefore, we compared the HCAL indicator to conventional HE in three empirical applications for nine European countries. In general, HCAL is lower in absolute terms and slightly larger in relative terms due to incorporating (higher) historical death rates. Our examination of the gender gap in health and mortality shows that HCAL provides a larger female advantage in healthy life years than conventional HE. This finding illustrates the implication of using period mortality (in LE) instead of cohort mortality information (in CAL). The genderspecific differences in the prevalence of being unhealthy can be (partly) attributed to the different health risks and exposures, i.e. the mortality, experienced by women and women over their life span [40, 41]. In the case of HCAL, prevalence is related to the corresponding mortality history, which had considerable higher levels for men compared to women. Conventional HE, on the other hand, relates the prevalence to current period death rates, which show lower gender differences in mortality than actual cohorts have been experienced over their life courses.
The probably most discussed question in health research is, whether gains in longevity are spent primarily in good or poor health, in the context of the socalled “expansion vs. compression of morbidity debate”. The ratio of LE/HE respective CAL/HCAL is particularly relevant in this context because it shows the relative share of healthy life years on total life years. In general, we found larger gains in CAL compared to period LE between 2008 and 2014, resulting in a slightly faster decreasing HCAL/CAL ratio compared to the LE/HE ratio. This can be interpreted as a stronger relative expansion of morbidity on the basis of HCAL, i.e. gains in longevity are mostly spent in poor health.
Nonetheless, the overall trend in healthy life years is very similar for both measures, indicating that the prevalence is the driving force in HCAL as well as in conventional HE. Albeit we observed gains in total life years for all analysed countries between 2008 and 2014, regardless if measured with period LE or CAL. Many populations still experienced declines in healthy life years in the same period. Naturally, these declines are attributed to increases in the agespecific prevalence of being in the unhealthy state. This finding demonstrates once more the great relevance for accurate health data for analysing healthy life years. Therefore, our results suggest that the health data incorporates large problems as we have shown by unexplainable jumps in the data or outliers. One example for this is the relatively large decrease of about 4 years in conventional HE for Finish females between 2008 and 2014 which is more attributed to random fluctuations in the health data than to real health deteriorations in the Finish population [42]. Last but not the least, the statistical insecurity related to the health data is so high that confidence intervals are not helpful for the analysis. This can be seen from the data presented in Tables 4 and 5. Taking into account these 95% confidence intervals, there are no statistically significant differences between HCAL and conventional HE as well as in the changes in healthy life years over time. This uncertainty results almost entirely from the health data rather than from the mortality data. For this reason, the lack of statistical difference does not imply that the two conceptual approaches are indistinguishable.
We used the GALI health indicator for our empirical applications which refers to a selfreported survey question about longstanding limitations in daily activities. In general, GALI has been validated positively, i.e. GALI is strongly associated with limitations in activities of daily living (such as washing, getting dressed or out of the bed), intermediately associated with limitations in instrumental activity of daily living (such as the need of assistance in doing light housework or managing medication), and somewhat lower association with physical limitations (biting, chewing, or kneeling) in most European countries [43]. Nevertheless, selfreported survey questions might still be influenced by age, culture, and social background of the respondent [44]. For these reasons, the presented trends should be interpreted with caution. In this paper, they served primarily the purpose of demonstrating differences between HCAL and conventional HE with respect to the underlying mortality information. All the discussed conceptual and health datarelated issues would apply likewise to other selfreported health indicators from surveys such as EUSILC.
Conclusions
HCAL is a summary measure of health and mortality based on the Sullivan method. Using proportions of cohort survivors instead of period mortality rates requires a long time series of mortality data. We have demonstrated several advantages of HCAL which suggest it is an attractive measure for population health. First, HCAL yields a coherent quantity, i.e., combining the health of the real cohort survivors with mortality of the real cohorts. Second, HCAL offers an alternative perspective on health and mortality. Previous approaches have focused either on one single period or on one single cohort. HCAL, on the other hand, is the sum over all cohorts, of the probability surviving and being in good health at the time of observation. In this sense, HCAL is also a measure of population dynamics and, thus, provides new insights into the evolution of the healthy/unhealthy shares in a population.
The empirical analysis suggests that the quality of health data is much more important than the decision between CAL and LE as basis for the total number of life years. We have shown that the prevalence of being in the (un)healthy state varies notably between populations and across time. These differences have by far the strongest impact on healthy life years derived with the Sullivan method. The overall trend in healthy life years is similar in conventional HE and HCAL and taking into account the uncertainty stemming from the health data does not result in statistically significant differences between both indicators.
Nevertheless, conventional HE and HCAL should not be treated as interchangeable as they correspond to two different concepts. The analysis of the gender gap in healthy life years demonstrates that the choice of the survivorship function can indeed affect the result. By taking into account the past mortality experiences of males and females, HCAL gives a more favourable picture for women compared to conventional HE. Also regarding the compressionexpression debate we found that the two indicators can suggest different trends as in the case among Italian females. Accordingly, researchers should consider using HCAL especially in applications where period mortality differs strongly from the actual cohort experience. It is important to note, however, that we do not argue that conventional HE should be replaced by HCAL. Yet, given that HCAL is the conceptually more coherent approach, it is worth to be used in addition to conventional HE whenever it is possible to estimate both indicators because it widens the spectrum of empirical analyses and serves for verification of results based on the highly sensitive HE indicator.
Availability of data and materials
All data used in this paper is publicly available. Mortality data can be found at www.mortality.org and on www.destatis.de for Germany. Health data is available on https://ec.europa.eu/eurostat/web/microdata/europeanunionstatisticsonincomeandlivingconditions.
Abbreviations
 CAL:

Crosssectional average length of life
 EUSILC:

European Union Statistics on Income and Living Conditions
 GALI:

Global Activity Limitation Indicator
 HCAL:

Crosssectional average length of healthy life
 HMD:

Human Mortality Database
 LE:

Life expectancy
 WWII:

World War II
References
 1.
Verbrugge LM. Longer life but worsening health? Trends in health and mortality of middleaged and older persons. Milbank Memorial Fund Quart. 1984;62:475–519.
 2.
Christensen K, Doblhammer G, Rau R, Vaupel JW. Ageing populations: the challenges ahead. Lancet. 2009;374:1196–12083.
 3.
Jagger C. Compression or expansion of morbidity  what does the future hold? Age Ageing. 2000;29:93–4.
 4.
Luy M, Yuka M. Gender gapslife expectancy and proportion of life in poor health. Health Rep. 2014;25:12–9.
 5.
Mathers CD. Health expectancies: an overview and critical appraisal. In: CJL M, Salomon JA, Mathers CD, Lopez AD, editors. Summary measures of population health: concepts, ethics, measurement and applications. Paris: John Libbey Eurotext; 2002. p. 177–204.
 6.
Sullivan DF. A single index of mortality and morbidity. HSMHAHealth Rep. 1971;86:347–54.
 7.
Laditka SB, Hayward MD. The evolution of demographic methods to calculate health expectancies. In: Robine JM, Jagger C, Mathers CD, Crimmins EM, Suzman RM, editors. Determining health expectancies. England: Wiley; 2003. p. 221–34.
 8.
Jagger C, Hauet E, Brouard N. Health expectancy calculation by the Sullivan method: a practical guide. REVES Paper. 2001:1–29.
 9.
Goldstein JR, Wachter KW. Relationships between period and cohort life expectancy: gaps and lags. Popul Stud. 2006;60:257–69.
 10.
Wilmoth JR. On the relationship between period and cohort mortality. Demogr Res. 2005;13:231–80.
 11.
Brouard N, Robine JM. A method for calculating of health expectancy applied to longitudinal surveys of the elderly in France. In: Robine JM, Mathers CD, Bone MR, Romieu I, editors. Calculation of health expectancies: harmonization, consensus achieved and future perspectives. Paris: John Libbey Eurotext; 1992. p. 87–97.
 12.
Brouard N. Theory and applications of backward probabilities and prevalences in crosslongitudinal surveys. In: ASRS R, Rao CR, editors. Handbook of statistics: integrated population biology and modeling, Part B. Volume 40. NorthHolland, Amsterdam: Elsevier; 2019. p. 435–86.
 13.
Guidici C, Arezzo MF, Brouard N. Estimating health expectancy in presence of missing data: an application using HID survey. Stat Methods Appl. 2013;22:517–34.
 14.
Brouard N. Structure et dynamique des populations: La pyramide des années à vivre, aspects nationaux et exemples régionaux. Espaces Populations Soc. 1986:157–68.
 15.
Guillot M. The crosssectional average length of life (CAL): a crosssectional mortality measure that reflects the experience of cohorts. Popul Stud. 2003;57:41–54.
 16.
CanudasRomo V, Guillot M. Truncated crosssectional average length of life: a measure for comparing the mortality history of cohorts. Popul Stud. 2015;69:147–59.
 17.
Schoen R. Dynamic population models. New York: Springer; 2006.
 18.
Luy M. Tempo effects and their relevance in demographic analysis. Comparative Population Studies  Zeitschrift für Bevölkerungswissenschaft. 2010;35:415–46.
 19.
Guillot M. Period versus cohort life expectancy. In: Rogers RG, Crimmins EM, editors. International handbook of adult mortality. Dordrecht: Springer; 2011. p. 533–51.
 20.
Human Mortality Database, University of California, Berkeley (USA), and Max Planck Institute for Demographic Research (Germany). Available at www.mortality.org. Accessed 20 June 2019.
 21.
Guillot M. The momentum of mortality change. Popul Stud. 2005;59:283–94.
 22.
EUSILC: Eurostat, EU statistics on income and living conditions. 2019.
 23.
Destatis, Statistisches Bundesamt, available at https://www.destatis.de/DE/Themen/GesellschaftUmwelt/Bevoelkerung/SterbefaelleLebenserwartung/Publikationen/_publikationeninnenkohortensterbetafel.html. Accessed 9 June 2019.
 24.
Berger N, Robine JM, Ojima T, Madans J, Van Oyen H. Harmonising summary measures of population health using global survey instruments. Epidemiol Commun Health. 2016;70.
 25.
Jagger C, Robine JM. Healthy life expectancy. In: Rogers RG, Crimmins EM, editors. International handbook of adult mortality. Dordrecht: Springer; 2011. p. 551–68.
 26.
Ekholm O, Brrønnum H. Crossnational comparisons of nonharmonized indicators may lead to more confusion than clarification. Scand J Public Health. 2009;37:661–3.
 27.
ValverdeRubio JR, Nusselder WJ, Mackenbach JP. Educational inequalities in Global Activity Limitation Indicator disability in 28 European countries: does the choice of the survey matter? Int J Public Health. 2019;64:461–74.
 28.
Saito Y, Robine JM, Crimmins EM. The methods and materials of health expectancy. Stat J IAOS. 2014;30:209–23.
 29.
Eurostat database: Healthy life years (from 2004 onwards), available at https://appsso.eurostat.ec.europa.eu/nui/show.do?dataset=hlth_hlye&lang=en. Accessed 20 June 2019.
 30.
Camarda CG. MortalitySmooth: an R package for smoothing Poisson counts with Psplines. J Stat Softw. 2012;50.
 31.
Guillot M, CanudasRomo V. Revisiting life expectancy rankings in countries that have experienced fast mortality decline. In: Schoen R, editor. Dynamic demographic analysis: the Springer series on demographic methods and population analysis. Volume 39. Cham: Springer; 2016. p. 51–69.
 32.
Luy M, Di Giulio D, Di Lego V, Lazarevič P, Sauerberg M. Life expectancy: frequently used, but hardly understood. Gerontology. 2019;66:95–104.
 33.
LindahlJacobsen R, Rau R, Jeune B, CanudasRomo V, Lenart A, Christensen K, Vaupel J. Rise, stagnation, and rise of Danish women’s life expectancy. Proc Nat Acad Sci U S A. 2016;113:4015–20.
 34.
Murphy M, Luy M, Torrisi O. Mortality change in the United Kingdom and Europe. In: Social Policy Working Paper 1119. London: LSE Department of Social Policy; 2019.
 35.
Barendregt JJ, Bonneux L, Van der Maas PJ. How good is Sullivan’s method for monitoring changes in population health expectancies? J Epidemiol Community Health. 1997;51:578–9.
 36.
Barendregt JJ, Bonneux L, Van der Maas PJ. Health expectancy: an indicator for change? J Epidemiol Community Health. 1994:482–7.
 37.
Mathers CD, Robine JM. How good is Sullivan’s method for monitoring changes in population health expectancies. J Epidemiol Community Health. 1997;51:80–6.
 38.
Murray CJ. Quantifying the burden of disease: the technical basis for disabilityadjusted life years. Bull WHO. 1994;72:429–45.
 39.
Barendregt JJ. Incidence and prevalencebased SMPH: making the twain meet. In: CJL M, Salomon JA, Mathers CD, Lopez AD, editors. Summary measure of population health: concepts, ethics, measurement and applications. Paris: John Libbey Eurotext; 2002. p. 221–33.
 40.
Verbrugge LM. Gender and health: an update on hypotheses and evidence. J Health Soc Behav. 1985;26:156–82.
 41.
Cambois E, SoléAuró A, Robine JM. Gender differences in disability and economic hardship in older Europeans. Eur J Popul. 2019;35:777–93.
 42.
EHLEIS Country Reports: Health expectancy in Finland. In European Health and Life Expectancy Information System  EHLEIS, available at http://wwweurohexeu (accessed 26 Feb 2020); 2018.
 43.
Berger N, Van Oyen H, Cambois E, Fouweather T, Jagger C, Nusselder WJ, Robine JM. Assessing the validity of the Global Activity Limitation Indicator for fourteen European countries. BMC Med Res Methodol. 2015;15.
 44.
Spitzer S, Weber D. Reporting biases in selfassessed physical and cognitive health status of older Europeans. PLoS ONE. 2019;14.
 45.
Guillot M, Yu Y. Estimating health expectancies from two crosssectional surveys: the intercensal method. Demogr Res. 2009;21:503–34.
 46.
Rogers A, Rogers RG, Belanger A. Longer life but worse health? Measurement and dynamics. The Gerontologist. 1990;30:640–9.
 47.
Crimmins EM, Hayward MD, Saito Y. Changing mortality and morbidity rates and the health status and life expectancy of the older population. Demography. 1994;31:159–75.
 48.
Lièvre A, Brouard N, Heathcote C. The estimation of health expectancies from crosslongitudinal surveys. Math Popul Stud. 2003;10:211–48.
 49.
Crimmins EM, Saito Y, Hayward MD. Sullivan and multistate methods of estimating active life expectancy: two methods, two answers. In: Robine JM, Mathers CD, Bone MR, Romieu I, editors. Calculation of health expectancies: harmonization, consensus achieved and future perspectives. Paris: John Libbey Eurotext; 1992. p. 155–60.
 50.
Guillot M. Multistate transition models in demography. In: Wright JD, editor. International Encyclopedia of the Social & Behavioral Sciences, 2nd edition. Oxford: Elsevier; 2015. p. 109–15.
 51.
Mathers CD, Robine JM. Health expectancy indicators: a review of the work of REVES to date. In: Robine JM, Mathers CD, Bone MR, Romieu I, editors. Calculation of health expectancies: harmonization, consensus achieved and future perspectives. Paris: John Libbey Eurotext; 1992. p. 1–21.
Acknowledgements
The authors thank Vladimir CanudasRomo for providing helpful comments to an earlier version of this manuscript and Werner Richter for language editing.
Funding
This project has received funding from the European Research Council under the EU’s Horizon 2020 Research and Innovation Programme, grant agreement no. 725187 (LETHE).
Author information
Affiliations
Contributions
ML developed the research idea and designed the study together with MS. MS carried out the analyses and wrote the paper. ML supervised the analyses and gave inputs to the manuscript. MG contributed to the interpretation of the data and commented on all parts of the paper with corresponding inputs to the text. The authors read and approved the final manuscript.
Corresponding author
Ethics declarations
Ethics approval and consent to participate
Not applicable.
Consent for publication
Not applicable.
Competing interests
The authors have no competing interests.
Additional information
Publisher’s Note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Supplementary information
Additional file 1.
R code example for estimating HE and HCAL.
Appendix
Appendix
Crosssectional prevalence rates from a cohort perspective
The proportions of cohort survivors, p_{c}(x, t − x), can be derived on the basis of cohort and agespecific survival probabilities. Since death is irreversible, this proportion is monotonically decreasing with age. In other words, the stock of cohort survivors at a given point in time is the product of past survival probabilities. The proportions of the unhealthy population, π(x, t), on the other hand, are more complex in the sense that they are a function of all the past transitions between the states “healthy”, “unhealthy”, and “death” [36]. Unfortunately, detailed data on such transitions are not available for a long time series. Still, crosssectional health data provides information about the proportion of healthy individuals for a given cohort at a particular point in time as a result of these transitions. For example, a representative health survey conducted in the year 2015 allows us to estimate the fraction of healthy individuals over all individuals for each birth cohort alive in 2015, i.e. cohorts born between 1915 and 2015 (assuming the survey includes the population at age zero to 100). In this way, the exact health trajectories for cohorts remain unobserved but have been implicitly included in the π(x, t) function. This is illustrated in Fig. 5 for the 1935 and 1965 birth cohorts. Guillot and Yu [45] provide an equation for calculating the population proportion of unhealthy individuals at a particular point in time on the basis of transition probabilities between health states and death ([45], p. 508). This equation has been applied to simulated data for the sake of demonstrating the relationship between in and outflows, i.e. transitions to the (un)healthy state, and the proportion of the unhealthy population as a stock. Please note that assuming an exponential trend for health transition probabilities is in line with previous research [46,47,48]. The transitions (left panel) produce the prevalence of being unhealthy (right panel). This is 96 percent for the 1935 birth cohort at age 80 (in 2015). The younger 1965 birth cohort turns 50 in 2015 and reaches a proportion of 77% unhealthy individuals. The younger cohort has been exposed to favourable health and mortality conditions (indicated by favourable transition probabilities). This is why a higher share has been found unhealthy at age 50 for the 1935 birth cohort compared to the 1965 birth cohort (over 80% vs. 77%). The observed crosssectional prevalence at time t, i.e. the π(x, t) function, does not provide any information about the exact morbidity and mortality trajectories of cohorts but gives an estimate of the unhealthy population stock for the cohorts reaching age x in time t (born in time t − x) in accordance with their underlying multistate process.
HCAL and HE in relative terms
Differences in the pattern of the p_{c}(x, t − x)function compared to the p(x, t) affect the outcome of the measure. First, the less rectangular pattern of the p_{c}(x, t − x) function causes lower absolute values for HCAL. Obviously, the lower agespecific survival probabilities result in a lower total number of personyears lived and therefore, HE will exceed HCAL in absolute terms. Interestingly, the choice of the survivorship function also affects the measure in relative terms, i.e. the ratio HCAL/CAL vs. the ratio HE/LE. The reason for that can be revealed by looking at personyears lived in relative terms (dividing the agespecific count of personyears lived by the total number of personyears lived). As already mentioned, the total number of personyears lived is lower in CAL. However, at young ages, both functions show similar number of personyears lived. This is why dividing the healthy life years at young ages by the total number of life years yields greater ratios for HCAL/CAL compared to HE/LE (Fig. 6). This relationship inverses at older ages so that the ratios for HE now exceed the ones for HCAL. In other words, the p_{c}(x, t − x) function (in relative terms) gives more weights on young ages and less weights on old ages compared to the p(x, t) function. Since individuals are mostly healthy at young ages and become increasingly more unhealthy at older ages, the share of healthy life years on total life years is higher in HCAL.
Interpreting HE and HCAL results
The main purpose of HE is to combine the health and mortality conditions prevailing in one particular period to one single measure. The mortality information in the p(x, t) function refers solely to time t so that LE is a pure period mortality measure [10]. What about the π(x, t) function? This function refers to the agespecific prevalence of the population in healthy and unhealthy states at time t and, therefore, can be seen as a period estimate as well. In this sense, HE(t) is a period measure that reflects the health composition of the real population at time t adjusted for period mortality [49]. However, it is not a pure period indicator in a synthetic cohort fashion [5]. We have illustrated that the health statespecific prevalence depends on past mortality rates and health transitions rates. Therefore, HE cannot be interpreted as an average number of healthy life years, lived by currently newborns under the assumption that they are solely exposed to the health and mortality conditions observed in one single period. In order to interpret HE in the same manner as LE, one would need to replace the observed prevalence of being in the (un)healthy state with a constructed “period” or “equilibrium” prevalence of being in the (un)healthy state [50]. This is a synthetic prevalence calculated from transitions between health states and death observed in a given period. Lièvre et al. [48] have shown that this modelled prevalence is currently lower than the observed prevalence in the USA. Only if health and mortality conditions are constant over time, i.e. constant transition rates, the period prevalence equals the observed prevalence [50]. In this scenario, the Sullivan method yields a period HE estimate, which is equal to the results based on the multistate life table method [51].
CAL differs from period and cohort LE because it does not refer to one single cohort. While period and cohort life tables give an estimate of the average number of personyears lived for one cohort (real or synthetic), CAL is the average number of personyears, which has been lived by all cohorts alive in a given period (assuming a closed population with a constant inflow of annual births). Thus, HCAL refers to the average number of healthy personyears lived by these cohorts. In other words, HCAL reflects the health and mortality conditions that the cohorts present in a population at a given point in time have been exposed to during their past life course. Alternatively, CAL(t) can also be seen as the relative population size at time t in a constantbirth population [15]. Imagine a population where all cohorts have the same initial size, i.e. a population with a constant number of births each year. Formally, this model population is expressed as follows:
with N(t) being the total population at time t and B the number of the annual inflow of births. Rewriting the equation and substituting CAL(t) for integrated p_{c}(x, t − x) function yields the population size at time t as the product of CAL(t) and B.
Let us assume that the initial size of each cohort is 100,000 (B = 100,000). By applying the cohort survivor probabilities (given by the p_{c}(x, t − x) function) to this modelled population, each cohort is exposed to the mortality regime, which actually took place during its past life course (from birth up to time t). Since CAL(t) summarizes the cohort survivor proportions in time t, multiplying the CAL value by 100,000 yields the population size in time t. Applying the same population model to HCAL allows to interpret HCAL(t) as the relative size of the healthy population in time t.
In this perspective, looking at HCAL estimates can be seen as looking at population data, while controlling for fluctuations in births. Comparing CAL and HCAL estimates for a given population over time indicates clearly how the healthy and total share of individuals in a population has evolved. In the case of an increasing HCAL, changes in transition rates promote a situation in which the healthy population is increasing in size over time. This is the clearest sign that people are indeed living longer and healthier lives.
Rights and permissions
Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated in a credit line to the data.
About this article
Cite this article
Sauerberg, M., Guillot, M. & Luy, M. The crosssectional average length of healthy life (HCAL): a measure that summarizes the history of cohort health and mortality. Popul Health Metrics 18, 21 (2020). https://doi.org/10.1186/s12963020002205
Received:
Accepted:
Published:
Keywords
 Sullivan method
 Crosssectional average length of life
 Healthy life years
 Longevity
 Population health