Measuring the health of the Indian elderly: evidence from National Sample Survey data

Background Comparable health measures across different sets of populations are essential for describing the distribution of health outcomes and assessing the impact of interventions on these outcomes. Self-reported health (SRH) is a commonly used indicator of health in household surveys and has been shown to be predictive of future mortality. However, the susceptibility of SRH to influence by individuals' expectations complicates its interpretation and undermines its usefulness. Methods This paper applies the empirical methodology of Lindeboom and van Doorslaer (2004) to investigate elderly health in India using data from the 52nd round of the National Sample Survey conducted in 1995-96 that includes both an SRH variable as well as a range of objective indicators of disability and ill health. The empirical testing was conducted on stratified homogeneous groups, based on four factors: gender, education, rural-urban residence, and region. Results We find that region generally has a significant impact on how women perceive their health. Reporting heterogeneity can arise not only from cut-point shifts, but also from differences in health effects by objective health measures. In contrast, we find little evidence of reporting heterogeneity due to differences in gender or educational status within regions. Rural-urban residence does matter in some cases. The findings are robust with different specifications of objective health indicators. Conclusions Our exercise supports the thesis that the region of residence is associated with different cut-points and reporting behavior on health surveys. We believe this is the first paper that applies the Lindeboom-van Doorslaer methodology to data on the elderly in a developing country, showing the feasibility of applying this methodology to data from many existing cross-sectional health surveys.


Background
Improving and maintaining population health are considered important and agreed-upon objectives of health systems [1,2]. Not only is the average level of health important, the issue of health inequalities has also been prominent on the policy agendas of national governments and international organizations [3][4][5][6]. In particular, with a growing share of the elderly in developing countries such as China and India, the health of older populations constitutes an issue of growing policy importance.
A reliable and comparable measure of health is necessary for undertaking health inequality studies or any investigation of the impact of policy interventions on population health. Mortality indicators, such as life expectancy and infant mortality rate, do not adequately capture the morbidity aspect of health. Thus, many surveys rely on aggregate measures of self-reported health (SRH), usually as categorical responses, which are easy to administer and have been shown to have good predictive power for subsequent mortality [7,8]. However, the interpersonal, intertemporal, and interregional comparability of SRH is questionable if people understand and respond to questions on the state of their overall health in different ways. Specifically, consider a mapping from an underlying latent health variable (that reflects true health) to an appropriate SRH response category, where the cut-point thresholds at which individuals transit from one categorical response to the next systemically vary across populations [9,10]. Without correcting for those cut-point differences, comparisons based on SRH will lead to misleading conclusions.
This paper assesses the health of the elderly in India using data from the National Sample Survey (NSS) collected in 1995-96, a nationally representative survey of nearly 120,000 households, including about 34,000 elderly persons aged 60 years and above. Our analysis investigates the role of reporting heterogeneity in selfassessed measures of health and tests whether reporting patterns vary by individual characteristics, using responses to additional questions on disability, hospitalization, and chronic disease status in the survey. For this purpose, we use a method developed by Lindeboom and van Doorslaer, who devised tests for assessing whether variations in SRH responses are due to differences in objective health or reflect reporting heterogeneity due to individual attributes such as age, gender, and socioeconomic status [11].
Our paper adds to the literature that aims to better interpret SRH indicators commonly collected in healthrelated surveys and make the best use of existing information instead of proposing methods to be employed in future surveys, often at enormous expense. For instance, in their classic work, King and colleagues proposed an "anchoring vignettes methodology" to assess and correct for the cut-point shift in individual responses [12], an approach adopted by the World Health Organization in pilot studies of the World Health Survey [9]. Though intuitively appealing and theoretically promising, the anchoring vignettes approach is likely to be difficult to administer in large-scale surveys. Nor is this approach helpful to researchers analyzing and interpreting results already collected from existing surveys, often conducted at great cost in resource-constrained settings. For the vignette approach to yield an unbiased assessment, one still has to assume that respondents use the same sets of cut-points in evaluating the fictitious vignettes and their own health. This paper also contributes to research on the health of the elderly in India by analyzing a large, nationally representative household survey. The few existing studies on the health of the Indian elderly that we were able to identify focused on small samples and local populations [13][14][15]. There is one exception, which also used the National Sample Survey data of 1995-96 to assess the relationship between a range of individual attributes and an indicator variable showing whether an individual had a disability or a chronic condition, using a probit regression model [16]. Our approach is differentiated by our focus on assessing the relationship between SRH and more objective indicators of health status available in the survey, and the role that socioeconomic characteristics play in modifying this relationship. Our study also contributes to understanding interregional differences in how individuals report their own health status, such as the paradox highlighted by Amartya Sen, who noted that the people of Kerala reported worse health than the people of Bihar, India's poorest region, even though the former enjoyed much higher levels of life expectancy [17].

Methods
Following Lindeboom and van Doorslaer [11], we modeled the link between true health status H* and a vector of objective measures of health H o as follows: For any individual "i", let Z denote a collection of explanatory variables that influence an individual's true health state in addition to any objective indicators of health. Note that θ and a are (vectors of) parameters to be estimated, and ε is an error term.
; , * when n with and Here, τ k are the cut-points, with k = 1...n, with cutpoints depending on individual characteristics X and parameter vector β. Combining (1) and (2), we get ; We assume that ε i is distributed as N H i ( , ) * 1 . Equation (3) then describes a standard hierarchical ordered probit model, characterized by cut-points τ that depend on individual characteristics, but not on an individual's objective health measures. The likelihood function is given by ℓ(ε/b,θ,a), the product of the probabilities of observing categorical responses for individuals. Without exclusion restrictions that force Z to be different from X and without a specific functional form for g(.), there is no way to identify the β parameters separately from the θ parameters. When X and Z are the same, all we can measure are b 01 -θ 0 , b 02 -θ 0 ,... b 0n-1 -θ 0 for the intercept terms, b 11 -θ 1 ,... b 1n-1 -θ 1 for the coefficients of the first term of the vector X, b 21 -θ 2 ,... b 2n-1 -θ 2 for the coefficients of the second term of the vector X, and so on. Thus, as specified, it is not possible to separate shifts in cut-points and the effect of X variables on the true health status H*.
Lindeboom and van Doorslaer [11] described a test for cut-point shifts, by testing the model in (1)  If the null hypothesis of restrictions was to be rejected, differences in reported health could be explained as arising at least partly from nonparallel shifts in cut-points across individuals as a result of different X. However, imposing these restrictions in the above specification does not affect parameter estimates if Z and X are the same, and there is no difference in estimates based on the restricted and the unrestricted likelihood functions. Moreover, there is the problem of what happens if we cannot reject the null, given our interest in separating the relative impact of cut-points versus true health effects stemming from the X variables. This means obtaining the estimates of β parameters (cut-point effect) and θ (true health effect) separately, instead of the feasible β-θ.
To address these concerns, Lindeboom and van Doorslaer suggested an approach that relies on stratifying the data so that all intervening variables that influence the relationship between the true health status H* and objective measures of health H o are eliminated from the specification. This means drilling down to fine subgroups where objective measures of health can be treated as being the sole determinants of the true indicator of health, and the cut-points are the same for all members within a subgroup. Thus, if gender, place of residence (rural versus urban), and age-group influence H* separately from H o , we can limit our attention to subgroups such as women over age 60 living in urban areas in the Northern region.
Consider one such subgroup 'm', such that Alternately, Undertaking the estimation exercise -maximizing a likelihood function resulting from (3) -for each group, we end up with separate cut-points τ m , and separate estimates of the parameters of the g(.) function that generates the categorical responses, namely a m , except for any intercept terms that cannot be separated from parallel shifts in the cut-points. A normalization that constrains the value of any intercept term in g(.) to be zero could address this issue, given that it will not affect the actual categorical responses of the household.
Lindeboom and van Doorslaer suggested the following tests for differences in cut-points and a. Specifically, consider two groups m andm . In the first instance, one can estimate τ and a separately for the two groups and calculate their likelihood function values using a standard ordered probit model, given that cut-points τ m and a m are identical for all individuals within each group. Let L U be the sum of the log likelihood values of the unrestricted models. Next, one can construct a joint likelihood function by combining data for the 2 subgroups, using the restriction that   When working with data pooled from both subgroups, we could work with either one of two specifications in (5).
D is a dummy variable indicating membership of the two subgroups. Note that (5b) is more general than (5a) in that it allows for an independent effect of social group membership on true health status. Therefore, we used specification (5b) in our analysis. Effectively, estimating the restricted model means combining the datasets for the two groups m andm , and then carrying out the necessary maximum likelihood estimation exercise.
If the null hypothesis is rejected via the abovementioned Likelihood Ratio test when specification 5(b) is used, there could be one (or both) of two possible causes -differences in τ (nonparallel changes in the cutpoints) and/or differences in a (differences in the function g(.) across the two subgroups). Note that we cannot test for a parallel cut-point shift, as this is not separately identifiable from a shift in the true health effect of the specific subgroup, as captured by the parameter θ. To assess whether it is differences in τ or differences in a (or both) that drive the observed results, Lindeboom and van Doorslaer suggested additional LR tests. To fix ideas, let the log likelihood function for the case where the restriction is solely   =ˆis valid, we could use the test statistic -2(L Ra -L U ), which assesses the difference between no restrictions at all and the case where the g(.) function is the same. In specification (5b), to allow for an independent health effect (θ) by the stratifier, dummy variables indicating membership are included. The degrees of freedom of the test statistic are the number of a parameters that are free under the no restrictions case times (n -1) minus the number of dummy variable indicating the membership, where n is the number of groups over which we are testing the differences. An alternative approach is to consider the test statistic -2 (L R -L Rτ ), where we compare the (log) likelihood ratio of the fully restricted likelihood function and the partially restricted likelihood function (with cut-points held constant).

Data
The data are from the Indian National Sample Survey of 1995-96. The survey collected information on SRH for all elderly (aged 60 years and above) in the households surveyed. It also collected information on what we designate objective measures of health of the elderly members of the household -such as information on events resulting in hospitalization, chronic illness, and disability. We consider hospitalization an objective measure of health because it is a discrete event unlikely to be easily forgotten or influenced by cultural and other factors driving the reporting of less serious types of illness, and reflects a significant shock to the health of the individual. Of course, income, timing, and the distance to health care facilities might lead to a seriously ill person foregoing hospitalization, and limiting the focus solely to hospital stays in the last one year as per the survey instrument would rule out individuals hospitalized in previous years. One way to address the concern above using information on hospitalization would be to include information on chronic ailments such as heart disease, hypertension, cancers, diabetes, arthritis, and so forth that are of a longer-term nature. The National Sample Survey included this information in two forms, starting with conditions that were manifested in acute illness resulting in either hospitalization in the last one year or illness in the two weeks preceding the survey. A separate section for the elderly specifically inquired whether elderly respondents currently experienced these conditions, whether or not with a recent acute manifestation. In the paper, we report results using the second set of indicators of chronic conditions. Our results are not dependent, however, on whether we use indicators of chronic conditions with a recent acute manifestation or without.
We also included information on four indicators of disability among household members related to movement, sight, hearing, and speech. In general, indicators of disability, especially in self-assessments based on how difficult respondents find it to move, see, hear, and so forth, can be problematic and subject to the same difficulties in comparative assessments [10,11]. However, in the case of the National Sample Survey, indicators of disability can be treated as objective indicators of specific dimensions of health of the elderly population. There are two reasons for doing so. First, the survey inquiries relate to features of disability that go beyond a purely subjective assessment. For instance, the NSS defines disability in terms of impairments. Locomotive disability is defined as physical deformity even if it does not influence mobility. The definition also includes loss of activity of part of the hand or leg due to amputation, paralysis, or deformity. This can be directly observed by the interviewer. Visual disability in the survey is defined as when a person has no light perception and cannot correctly count the fingers of the hand from a distance of 10 feet in broad daylight (with/without spectacles). Second, a simple (1 if disabled, 0 if not) cut-off rule that respondents used ought to further eliminate any reporting biases at the upper and lower ends of the disability range.
Even when impairments are self-reported, as is likely in the case of visual acuity in the NSS, it is worth reflecting on the findings of the labor economics literature that commonly uses self-reported disability indicators in empirical estimates of labor supply models. Typically in this literature, the addition of other objective (clinically defined) health measures adds very little to the explanatory power of analyses [18]. Some researchers have criticized the validity of self-reported disability measures and argued that people may overreport disability to justify their difficulties in the labor market [19][20][21][22]. Other studies, however, have found little evidence of endogeneity of self-reported disability measures and labor force participation [23,24]. In our study, moreover, the validity of disability measures is not compromised by this justification hypothesis as the National Sample Survey for India did not explore questions of health and retirement in the same survey. Given the extremely limited nature of social protection programs, there is no real incentive for individuals to overreport disability.
The use of self-reported measures of specific disabilities to circumvent issues of rationalization in global measures of health is well-supported in the literature. For instance, even though Bound and colleagues argued against the use of global questions such as "How would you rate your health?" they assessed measures of limitation in physical function to be less susceptible to measurement and endogeneity problems [21]. Researchers have found that self-reported disability, when assessed by independent verification, provides a good description of true status [18,25,26]. Guralnik et al. also showed that self-report of disability was strongly associated with performance in a physical test among the elderly [27]. Moreover, Bailis et al., using data from various rounds of the longitudinal population health survey in Canada in the 1990s, found that self-reported aggregate health measures responded not only to changes in individuals' physical and mental health, but also to their intentions/ expectations about their health behaviours in the future [28], a finding supported by [29] that used longitudinal data on health among adolescents for the United States. For these reasons, we expect disability measures in the NSS to provide a closer proxy to the true health of the elderly than the SRH. Moreover, generic instruments commonly used in measuring individual health status, such as Health Utility Index and EQ-5D, are all based on reporting of specific domains of functional limitations, such as mobility, hearing, vision, etc. These instruments are employed in various economic studies, clinical trials, and cost-effectiveness analyses to indicate objective health status and their validity, reliability, and comparability are widely accepted.
In addition to the objective indicators of health H o used in this paper, we also assumed the region of residence, sex, educational attainment, and rural-urban residence to influence the mapping from H o to self-assessed health H S and the cut-points used by individuals. We categorized states into four regions -North, East, South, and West based on geographic location (Table 1). Sex and ruralurban residence are defined as binary variables. Current location was used rather than individuals' original place of birth, both because the NSS data do not collect such information and also because of strong neighborhood effects on self-reported health documented in the literature [30,31]. Our indicator of educational attainment dichotomizes respondents into those with at least primary education and those with less than primary education. We chose the threshold of having completed primary education because several studies have shown that primary schooling offers the highest economic returns at the margin [32] and presumably the largest impact on attitudes and behavior. Unfortunately, experimenting with higher thresholds for education proved difficult because doing so drastically reduced the number of observations in some of the subgroups. Where the number of observations was sufficient to achieve statistical power, our findings remained unaffected.

Results
Descriptive statistics related to the elderly (60 years and above) are presented in Table 2. The sample size of the elderly in the 52 nd round of NSS was 33,940. The mean age of the sample elderly was 68 years, and the number of females in the sample is slightly more than the number of males. More than 70 percent of the elderly resided in rural areas. About one-fifth of the elderly in the sample had completed their primary education, with the elderly in the South and West regions having higher educational attainment than in other regions. Their average annual per capita consumption expenditure was INR 4,700 (about US$144), ranging from INR 4,000 rupees in the Eastern region to INR 5,400 in the West. Ties within the family and filial support are generally strong: about 78% of the elderly lived with their children, and only about 14% of the elderly lived alone (or with their spouse). Even among those living alone or with their spouse, a majority had children, grandchildren, or siblings staying in the same village or town, suggesting that (at least in 1995-96) family support systems for the Indian elderly were uniformly strong.
The main variable of interest in our study is responses to the question on elderly perceptions of their own current health status. The response categories were excellent (SRH = 4), good (SRH = 3), fair (SRH = 2), and poor (SRH = 1). In terms of the notation used in the statistical model of the previous section, this measure corresponds to H S . We see that the large majority of the elderly reported being in fair health (71.7 percent) and another 19.3 percent in poor health. Our sample data also confirm the findings of Amartya Sen [17], who observed that the population residing in the Southern region (including Kerala) had a lower proportion of individuals self-reporting to be in excellent or good health (7.8%) than the Eastern region (9.5%), which includes Bihar and is characterized by much lower levels of economic status and educational attainment. The prevalence of hospital stays, chronic conditions, and disabilities are also summarized in Table 2.

Table 1 List of states by region
The empirical strategy of the previous section was applied to data on the elderly from the 52nd round of the National Sample Survey. We chose four factors to stratify our sample into homogeneous groups: gender, education, rural-urban residence, and region. In a developing country like India, where the life experience and social expectations of the two sexes are different, it is possible that males and females do not have the same expectations when assessing their health. The rationale behind education as a stratification device is also rather straightforward. Education likely shapes people's attitudes and perceptions toward their surroundings and themselves. Even though it has not been shown to affect cut-point differences in previous analyses [11], the substantial impact of primary education on health knowledge in developing countries documented in the literature warrants its inclusion. Hence, we divided the elderly into two groups, based on whether or not they completed their primary education. Economic status (in the form of consumption expenditure per capita) is not used for stratification because income and education were highly correlated, and using both income and education would have resulted in overstratification and limited sample sizes. Lastly, we used rural-urban residence and region as factors that would affect expectations of health, mainly because they are likely to be closely correlated with the living environment and other unobservable factors in the local context. In India, this is especially important owing to the considerable cultural differences (including language) that exist across regions.
Our results are presented in Tables 3, 4, 5 and 6. First, we tested whether the region of residence affected people's reporting behavior. Again, we should emphasize that what is likely at work is not the region per se but other commonly unobservable factors that are closely associated with where individuals live, such as ethnicity, language, and cultural practices. We separated the sample into homogeneous subgroups by gender, education, rural-urban residence, and region. As shown in Table 3, region generally has a significant impact on how women perceive their health (as indicated by SRH).
For males, the region of residence made a difference in health reporting behavior among two groups of men: rural men who had not completed their primary education and urban men who had completed their primary education. Robustness checks conducted by using only disability indicators as objective health indicators (results not shown) confirm these basic findings. Again, the underlying factor is the cut-point differences.
To visualize the cut-point differences and their effect on reporting heterogeneity among residents of different regions, we illustrate the cut-points of urban male and female elderly with at least primary education as an example in Figure 1.
Note in Figure 1 that cut-points can differ across groups in two ways: in the level and in the distance between cut-points within each group. Take the South and East regions as an example. Men and women in the Southern region had higher cut-points 2 and 3 than those in the Eastern region, meaning that at a given health status of fair or better, the elderly in the East would report a better health status than those living in the South. At the same time, the distances between cutpoints are also wider in Southern region -it has a lower cut-point 1 compared to the Eastern region; therefore, compared with the elderly in the Eastern region, the former would tend not to report extreme values when assessing their own health. Reporting heterogeneity in SRH can arise not only from cut-point shifts, but also due to differences in parameters of the g(.) function; that is, due to differences in the influence of objective health measures (the coefficients α in the above model specification). This appears to be the case for women in our sample.
We also tested for differences in cut-points and a across gender and by educational achievement (see Tables 4 and 5). Given our previous finding that cutpoints and the parameters of the H* equation vary by region, we did not aggregate the observations across different regional groups. In fact, in the remainder of this analysis, the sample remained disaggregated as 16 subgroups. Lindeboom and van Doorslaer tested for reporting heterogeneity across language groups and found that language made a difference in only a few subgroups. Hence, they aggregated the sample across language groups in the later tests for the impact of other factors. In order to ensure that we did not falsely reject the null hypothesis because of the aggregation of subgroups, we chose the conservative strategy and kept the sample in homogeneous disaggregated groups. As seen in Tables 4 and 5, we found little evidence of differences in cut-points and a by gender or educational status.
In Table 6 we report the results of tests for differences by rural and urban residence. In some cases, namely elderly that had completed primary education in the Eastern region and elderly women with less than primary education in the Southern region, there are rural-urban differences. In general, though, there is no evidence of reporting heterogeneity between rural and urban areas. Table 7 presents estimates of the vector α of parameters that characterize the relationship between objective indicators of health and latent true health H* for a specific case, rural females with less than primary education: estimates from ordered probit models for different regions, with and without the restriction of equal cutpoint and true health coefficients. These estimates are consistent with what one might expect -the coefficients are of the right sign (indicators of chronic conditions, hospitalization, and disability are negatively related to the indicator of true health H*) and statistically significant except for speech disability. The fixed-effect estimates of the regional dummy variables in the restricted specification illustrate the regional impact on the distribution of elderly SRH via some combination of a direct impact on true health and a parallel shift in cut-points. On average, the regional impact was highest among elderly SRH in the West region, followed by the South and the North. Note from the results on the unrestricted models that the effect of disability measures on true health varied across disability types, with motor function having a larger effect than, say, speech, and that there was variation in these coefficients across the different regions.
As a further robustness check, we sought to assess whether sample sizes in our subgroups for education and gender were large enough to allow for sufficient power in the detection of differences in cut-points and coefficients on our measures of objective health. Because no closed-form solutions for assessing the power of the test statistic were available, we performed Monte Carlo simulations to do so. Our results do not lend support to the idea that statistical insignificance across subgroups based on gender and education was the result of inadequate sample size. As an illustration, to detect a relatively small (10%) difference between the lowest cut-point for males and that of females at the 5 percent level of significance, we estimated the power of the test to be approximately 50% with a sample size of 200. The power increased to 76% with a sample size of 400 and more than 80% with a sample size of 600. This was satisfied in all of the stratified groups, except for one case -individuals who had completed primary education in the rural West.
As another robustness check of our findings, we also conducted a series of analyses using current economic status (whether above/below per capita consumption expenditure) and caste status (scheduled castes/tribes and other) as stratifiers for socioeconomic status. We also applied the same empirical testing procedure to a different dataset, the 60 th round of the National Sample Survey. Similar to the 52 nd round, the 60 th round of NSS, which was conducted in 2004, also collected information on socioeconomic status, health status, and service utilization, as well as a special section on the elderly. Unfortunately, several questions on elderly disability and chronic conditions are missing in the 2004 survey, and thus do not allow a direct comparison of the results from the two surveys. Qualitatively, though, the results are similar: In most of the homogeneous subgroups, reporting behavior differs by the region of residence, and excepting a few groups, there was no statistical evidence to suggest that gender and education divisions influence the relationship between H* and SRH.

Discussion and Conclusions
Comparable health measures across different populations are essential in understanding the distribution of health outcomes. It is also instrumental in assessing the impact of health policy interventions. SRH is a commonly used indicator of health in existing household surveys and has been shown to be predictive of future mortality. However, the susceptibility of SRH to influence by individuals' expectations complicates its interpretation and undermines its usefulness. Our paper supports the thesis that the region of residence is associated with different cut-points and reporting behavior on health survey questions, although gender, education, and rural-urban residence do not appear to systematically compromise the comparability of self-reported health measures. The cut-points can differ in two ways: they can be systematically higher or lower, and the distance between cut-points, which reflects the degree of aversion to reporting extreme values, can vary as well. These findings are robust with different specifications of socioeconomic status or when we restrict the objective health indicators to only functional disabilities. In this, our finding lends limited support to the work of Sen [17] on differences in reported health between Kerala (located in South India) and Bihar (located in East India). As we demonstrated, the cutpoints among elderly of different regions were different, and hence the regional distribution of health based on self-assessed indicators may not adequately capture true health differentials. This is illustrated by our findings in Figures 1 showing that the elderly with fair or better  Figure 1 SRH cut-points among urban population with at least primary education. Note: Cut-point 1 is the cut-point between "poor" and "fair" health; cut-point 2 is the cut-point between "fair" and "good" health; and cut-point 3 is the cut-point between "good" and "excellent" health. Upper and lower bars indicate the upper and lower limits of the 95% confidence intervals of the estimates, respectively.
health status in the Southern region had higher cutpoints and tended to underrate their health status This might help explain why the difference in health between the East and South regions was not larger despite the much better development and health achievement in the South region documented in the literature. Whether intraregional comparisons of SRH can be made is unclear, however. Further testing of our model via the construction of within-region subgroups was limited by the size of our sample. There is little doubt that the populations in the regions we chose are culturally more similar to each other than their counterparts in other regions in terms of language, colonial history, and Islamic influence. But it is difficult to argue in favor of complete within-region homogeneity and comparability owing to differences that are sometimes evident even across neighboring districts within a region [33]. There are several limitations to our study. First, the findings may be specific to the case of Indian elderly, and one needs to be careful in generalizing the findings. There might well be interactions between region and other elements of socioeconomic status; that is, education could affect health expectations in one place but not another. Second, the validity of the method employed hinges on the choice of observable objective health indicators (H 0 ). The variables we used as objective indicators of health are self-reported, even if NSS defines disability in terms of specific impairments. In this, our analysis is similar to instruments that have previously been used to assess objective health status in the literature, such as the Health Utility Index. Of course, biomarkers could serve as ideal objective H 0 , but unless they cover a wide range, they would reflect a very specific and narrowed domain of health. To the extent that any health impacts associated with biomarkers are unobserved by the individuals in question, we should not expect them to change an individual's own understanding of true health. Capturing data on a huge array of biomarkers is also extremely difficult to execute in a large-scale survey. Finally, one could come up with a long list of potential factors that result in reporting heterogeneity. However, to stratify across finer groups would have reduced subgroup sample sizes to impractically small. These shortcomings apart, our study suggests that local contexts matter in people's expectations relating to their health. Even within the same country, people can be quite dissimilar in their health perceptions, and thus attempts to use SRH to compare the population health of different regions within countries ought to be conducted with caution.  The estimates are based on analysis of the rural females with less than primary education.