Skip to main content

Characteristics of hemoglobin distributions in preschool children and non-pregnant women of reproductive age and their implications for establishing quality control criteria for hemoglobin data in field surveys: evidence from 483 surveys conducted in refugee settings worldwide

Abstract

Background

Currently, there is a lack of clear guidance on hemoglobin (Hb) data quality parameters and plausible flagging ranges for population-representative surveys. There is a need to determine which properties of Hb data indicate lower data quality and increased measurement error and which represent intrinsic statistical properties of Hb distributions rather than quality problems.

Methods

We explored statistical characteristics of Hb distributions and plausible exclusion ranges in population-representative surveys of non-pregnant women of reproductive age (WRA) (15–49 years, n = 401 surveys) and children (6–59 months, n = 461 surveys) conducted in refugee settings by the United Nations High Commissioner for Refugees (UNHCR). Hb distribution characteristics [standard deviation (SD), skewness and kurtosis] were compared to those from Demographic and Health Surveys (DHS).

Results

Overall, 0.08% of child and 0.14% of WRA Hb values were outside of the previously proposed 4.0–18.0 g/dL plausible range. Surveys conducted in Uganda tended to have unusually high SD compared with surveys from other settings, possibly an indication of problematic measurement quality. We therefore used summary results on SD, skewness and kurtosis excluding surveys from Uganda when comparing with DHS results or proposing plausible ranges. Both WRA and child Hb distributions tended to be left-skewed and had excess positive kurtosis. Mean survey-level SD was greater, mean skewness more negative, and mean kurtosis more positive in WRA surveys compared to child surveys. All these findings were broadly similar to those from DHS surveys. Mean SD in DHS surveys was higher than that in our data for both children (1.48 vs. 1.34) and WRA (1.58 vs. 1.43).

Conclusions

We observed several statistical characteristics of Hb distributions that may not necessarily be indicative of data quality problems and bear strong similarities with the characteristics found in DHS surveys. Hb distributions tended to be negatively skewed and positively kurtotic, and SD in many surveys exceeded 1.5 (previously proposed upper plausible range). Based on our empirical evidence, surveys with skewness above + 0.2 and kurtosis below -0.5 or Hb SD outside the range of 1.1–1.55 g/dL for children (6–59 mo) or 1.1–1.65 g/dL for non-pregnant WRA (15–49 y) may require further quality investigation.

Peer Review reports

Background

Anemia, as determined by hemoglobin (Hb) concentration, is a problem of significant public health concern. In 2019, the global estimated prevalence of anemia was 30% among women of reproductive age (15–49 years, WRA) and 40% among children aged 6–59 months, with similar estimates found among displaced women and children [1, 2]. Progress toward international reduction targets has stagnated, and the World Health Organization (WHO) has called for a better understanding of the burden through more reliable assessments of Hb concentration [1, 3]. In field settings, portable photometric point-of-care analyzers (e.g., HemoCue devices) represent the current standard of care for hemoglobin determination [4, 5]. However, these devices are subject to numerous preanalytic and analytic factors that introduce bias and imprecision [4, 6,7,8].

In order to assess the anemia burden, there is a need for valid and robust Hb estimates from population-representative surveys. To date, limited research has explored Hb distributions in nationally representative population-based surveys [9, 10], and there is a further dearth of research among displaced persons. Currently, no clear guidance exists for the quality control of Hb data in the postanalytic phase [4, 8, 11]. This is especially true for population-representative surveys, where two important metrics of measurement quality, the standard deviation (SD) of Hb distribution and the percentage of implausible extreme Hb values (flags), lack definitive guidance on acceptable ranges. Indeed, commonly cited values for acceptable SDs of Hb distribution (1.1–1.5 g/dL range), as determined by the HemoCue system in cross sectional surveys, are based on the authors’ empirical experience (with no supporting data presented) [12]. Multiple ranges for minimum and maximum plausible Hb values for non-pregnant women and children have been described and used in recent literature without further justification, for example, 2.0 g/dL to 30.0 g/dL [7], 2.5 g/dL to 20.0 g/dL [1], 4.0 to 18.0 g/dL [12], and 4.0 g/dL to 21.0 g/dL [13].

The consideration of these two metrics is important as measurement error can lead to increased dispersion (SD) of the distribution and, thus, the overestimation of the proportion of the population with anemia [9]. A 2017 report exploring the quality of Hb data in Demographic and Health Surveys (herein referred to for brevity as the “DHS Report”) found important differences between Hb distributions in WRA and children. This report analyzed various parameters of Hb distributions in 80 DHS surveys from multiple countries conducted between 2000 and 2016. All 80 of these surveys had child Hb data and 65 had WRA Hb data. The authors found that Hb SDs above 1.5 g/dL are quite common for both children (46% of surveys, excluding implausible values outside of the 4.0 to 18.0 g/dL range) and WRA (71% of surveys, excluding implausible values in the same range) [9]. Important peculiarities in Hb distribution shape were also noted for both WRA and children: Hb distributions were left-skewed and positively kurtotic. Both SD, left skew and positive kurtosis tended to be larger in WRA Hb distributions than in child Hb distributions [9]. Following the exclusion of implausible values, the overall mean SDs for children and WRA across all analyzed surveys were 1.48 g/dL and 1.58 g/dL, respectively—suggesting a potential need to revisit the commonly cited upper acceptable SD value of 1.5 g/dL [9, 12].

Given the lack of clear guidance on Hb data quality parameters and acceptable ranges, there remains a need to determine which properties of Hb data indicate lower data quality and increased measurement error and which represent intrinsic statistical properties of population Hb distributions and not necessarily quality problems [8,9,10]. This study aimed to explore statistical characteristics of Hb distributions, plausible exclusion ranges, and the probability of observing extreme outliers using data from population-representative surveys of non-pregnant women aged 15–49 years and children aged 6–59 months in refugee settings. We also compared these distribution characteristics with those found in large national surveys as described in the DHS report [9].

Methods

Surveys were conducted in refugee and emergency settings from 2013 to 2021 by the United Nations High Commissioner for Refugees (UNHCR) and its implementing partners. Questionnaires, tools, and sampling methods were based on the UNHCR Standardized Expanded Nutrition Survey (SENS) guidelines; survey designs consisted of two-stage cluster samples, systematic or simple random samples, or exhaustive samples [14, 15].

For surveys that included the anemia module, Hb measurements were taken using the HemoCue Hb 301 Analyzer [5, 16]. Per the manufacturer’s specification, values outside of 0 to 25.6 g/dL were considered invalid for this analysis. Altitude-adjusted Hb values were used in the provided data, which followed guidelines provided by UNHCR SENS [5]. Anemia was classified as < 11.0 g/dL for children aged 6 to 59 months and < 12.0 g/dL for non-pregnant women aged 15–49 years, as described by WHO guidelines [5, 11]. Surveys were included in the analysis if Hb measurements were taken and provided for children aged 6 to 59 months, non-pregnant women aged 15 to 49 years, or both groups. We limited our analysis to surveys with at least 30% valid Hb values to include surveys that may have conducted sub-sampling of respondents for anemia analyses. According to SENS guidelines, either all or half of the households selected in the survey are used to conduct Hb measurements in eligible household members [5]. Within each survey, we excluded duplicated observations prior to analysis.

First, to broadly describe the dataset, we classified the included surveys by country and by year and described the total number of children and non-pregnant WRA, the number of surveys, and the number of unique sites in each of these groupings. One of the goals of the present analysis was to assess the frequency of extreme recorded Hb measurements. To this end, we calculated the number of extreme values for each survey and, across all surveys, the overall percentage of valid Hb observations below 4.0, 5.0, and 6.0 g/dL and above 16.0, 17.0, and 18.0 g/dL. Another goal was to assess the shape of Hb distributions. Thus, for each survey, the mean, median, standard deviation (SD), skewness, and kurtosis of Hb and anemia prevalence were estimated based on Hb values in the range of 4.0 to 18.0 g/dL, following the existing literature on ranges [12]. For each of these summary statistics, we then calculated the mean, SD, and various quantiles (minimum, 2.5%, 10%, 25%, 50%, 75%, 90%, 97.5%, maximum). We then produced histograms of the distributions of SD, kurtosis, and skewness across surveys for both WRA and children. Among both WRA and children, we examined the relationship between Hb survey-level means and SDs with scatter plots and estimates of the Spearman correlation coefficient. For all analyses including kurtosis, we presented excess kurtosis (subtracting 3 from kurtosis), which is centered at zero under a normal distribution.

To determine whether Hb has different distributional properties in WRA versus children, among the surveys where both children and WRA with Hb data were measured, we summarized and compared the average SDs, skewness, and kurtosis of Hb measurements. Among these paired observations, we also plotted SDs, kurtosis, and skewness estimates for WRA and children to better understand their relationship within the same survey. Finally, we compared several key results from our analysis with those from the DHS Methodological Reports No.18 which analyzed Hb distributions in national-level DHS surveys [9].

Data aggregation and cleaning were done in SAS and R 4.0.3 [17, 18]. Survey-level summary statistics were calculated using R version 4.0.3; skewness and kurtosis were estimated using the G1 and G2 methods, respectively [19, 20]. Plots and quantiles were performed in Stata version 15 [21]. This study constituted a secondary analysis of routinely collected de-identified programmatic data, it was reviewed by the Centers for Disease Control and Prevention (CDC) and determined to be conducted consistent with applicable federal law and CDC policy (45 C.F.R. part 46, 21 C.F.R. part 56; 42 U.S.C. Section 241(d); 5 U.S.C. Section 552a; 44 U.S.C. Section 3501 et seq.).

Results

Data for these analyses come from 483 surveys conducted by UNHCR in refugee and emergency settings from 2013–2021. There were no surveys conducted in 2020 due to the COVID-19 pandemic. In total, 461 surveys included valid Hb data for children aged 6–59 months, 401 surveys included valid Hb data for non-pregnant WRA aged 15–49 years, and 379 surveys included data for both WRA and children (Table 1). Median sample size of child surveys was 400 (5th percentile: 175; 95th percentile: 649), median sample size of WRA surveys was 226 (5th percentile: 99; 95th percentile: 406). Overall, 187,641 child and 93,731 WRA records were included in the analysis. Data were available from refugee settings located in 26 countries across four UNICEF regions [22], with Ethiopia, Chad and Uganda having the largest number of surveys. Full information on the numbers of surveys, unique cites and sample sizes per country and region is presented in Table 1.

Table 1 Included hemoglobin surveys, by country, and year

Table 2 describes the Hb outlier values for WRA and children across all surveys. Approximately 0.08% of all child and 0.14% of WRA records were outside of the 4.0–18.0 g/dL range. Overall, only 2.2% of surveys in children and 3.2% of surveys in WRA had more than 1% of outliers outside of the 4.0–18.0 g/dL range. On the other hand, 88.3% of child and 90.3% of WRA surveys had no Hb values outside of the 4.0–18.0 g/dL range.

Table 2 Hemoglobin outlier values for non-pregnant women and children across all surveys, by exclusion cutoff

Overall, the average survey-level mean Hb in WRA was much higher than in children (12.6 g/dL vs. 11.1 g/dL) (Table 3). Consequently, the mean prevalence of total anemia was higher (43.5%) in child surveys than in WRA surveys (30.2%). Mean survey-level SD was greater (1.47 vs. 1.39), mean skewness was more negative (− 0.53 vs. − 0.37), and mean kurtosis was more positive (1.15 vs. 0.54) in WRA surveys compared with child surveys. On further investigation, we noted that a large proportion of the surveys with very large (above the 90th percentile across all surveys) SD both in children and WRA were from Uganda: 36 out of 46 surveys in children and 20 out of 40 in WRA. We therefore repeated the analysis of SD in Table 3 excluding all Uganda surveys in children and WRA. In this analysis, the 90th percentile of SD across all remaining surveys in children decreased from 1.67 to 1.53, and in WRA from 1.74 to 1.66. We also include in Table 3 skewness and kurtosis analysis excluding Uganda surveys, which did not meaningfully differ before and after exclusion.

Table 3 Survey-level hemoglobin distribution parameters and prevalence of total anemia in children and women

Survey-level distributions of standard deviation, skewness, and kurtosis are presented in Fig. 1, with surveys from Uganda marked in red for visibility. For both children and WRA, there was a negative survey-level correlation between the mean and SD of Hb (Spearman’s rho = -0.33, p < 0.0001 and Spearman’s rho = -0.15, p = 0.003, respectively) (Fig. 2). After excluding Uganda surveys correlations for both children and WRA increased to -0.49 and -0.25, respectively. Among surveys that measured Hb in both children and WRA, both mean SD, skewness, and kurtosis were significantly different (p < 0.0001) (Table 4). The difference followed the same direction as described for all surveys: WRA had higher SD, more positive kurtosis, and more negative skewness. Hb SD in WRA and children in the same surveys were highly correlated (Spearman’s rho = 0.52, p < 0.0001), while correlations of skewness and kurtosis between WRA and children were much smaller (Spearman’s rho = 0.15, p = 0.003, and Spearman’s rho = 0.19, p = 0.0002, respectively) (Fig. 3). These correlations did not change meaningfully after excluding Uganda surveys.

Fig. 1
figure 1

Survey-level distributions of standard deviation, skewness, and kurtosis among 461 child surveys and 401 WRA surveys. Distributions of survey-level standard deviation, skewness, and kurtosis among a children (6–59 mo) and b non-pregnant WRA (15–49 y) using a Hb exclusion range of 4.0–18.0 g/dL. Red bars indicate the distribution of SD, skewness, and kurtosis for surveys conducted in Uganda (n = 56 child surveys and n = 39 WRA surveys). Hb Hemoglobin; SD standard deviation; WRA Women of reproductive age

Fig. 2
figure 2

Correlation between survey-level hemoglobin means and standard deviations in 461 child surveys and 401 women surveys. Correlation of a child (6–59 mo) survey-level Hb SD and Hb means and b non-pregnant WRA (15–49 y) survey-level Hb SD and Hb means using a Hb exclusion range of 4.0–18.0 g/dL. Red circles represent data from surveys conducted in Uganda (n = 56 child surveys and n = 39 WRA surveys). Spearman’s rho was calculated both including and excluding Uganda data. Hb Hemoglobin; SD Standard deviation; WRA Women of reproductive age

Table 4 Comparison of survey-level standard deviations, skewness, and kurtosis of hemoglobin distributions in the 379 overlapping surveys
Fig. 3
figure 3

Correlation of survey-level standard deviations, skew, and kurtosis in the 379 overlapping surveys. Legend: Correlation of a children and WRA survey-level SDs, b children and WRA survey-level skewness, and c children and WRA survey-level kurtosis using a Hb exclusion range of 4.0–18.0 g/dL. Red circles represent data from surveys conducted in Uganda (n = 38 overlapping surveys). Spearman’s rho was calculated both including and excluding Uganda data. Hb Hemoglobin; SD Standard deviation; WRA Women of reproductive age

Table 5 presents the comparison of SD, skewness and kurtosis in our analysis (excluding Uganda surveys as described above) to those from the DHS report. In comparison with DHS surveys, UNHCR surveys had lower mean survey-level SD for both WRA (1.43 vs. 1.58) and children (1.34 vs. 1.48), as well as a lower percentage of surveys with SD > 1.5 for both WRA and children. Mean skew for children was similar between UNHCR and DHS surveys (− 0.38 vs. − 0.39) while it was slightly more negative among DHS surveys for WRA (− 0.61 vs. − 0.55). Mean kurtosis was similar between UNHCR and DHS surveys for both children (0.55 vs. 0.54) and WRA (1.22 vs. 1.25).

Table 5 Comparison of survey-level standard deviations, skewness, and kurtosis of hemoglobin distributions between UNHCR and DHS Surveys

Discussion

Our analysis, which was based on a large number of recent field surveys conducted in refugee settings worldwide that followed the SENS survey guidelines and used standard equipment and procedures for Hb collection, produced several remarkable findings that have important implications for formulating criteria for quality assessment of Hb survey data in the post-data collection phase. Below, we will discuss, in turn, the findings related to SD, skewness, kurtosis, and implausible outliers. We compare, where possible, our findings from small-scale refugee surveys to those from the large national DHS surveys (Table 5) [9] and formulate the potential implications of these findings for defining quality assessment criteria. We found that surveys conducted in refugee settings in Uganda tended to have an unusually high SD compared with surveys from other settings, which may be an indication of problematic measurement quality. We, therefore, take a conservative approach in our interpretation and use, in this discussion summary, results for SD, skewness, and kurtosis that exclude surveys from Uganda when comparing to DHS results or proposing plausible ranges. There is a multitude of possible problems leading to inaccurate Hb measurements that can arise in the field, such as excessive squeezing (“milking”) the finger, overfilling of the cuvette, using the first instead of the third drop of capillary blood, the presence of air bubbles in the cuvette, inappropriate cuvette storage, using of expired cuvettes, among many others described in more detail elsewhere [4, 23]. It is therefore of utmost importance to conduct proper training of measurers, implement thorough supervision of the field work, and closely follow all standard procedures and quality assurance steps as described in the Module 3 “Anemia” of the SENS guidelines [5].

Regarding the SD of Hb distributions, we note two important findings. First, SD in WRA tends to be somewhat higher on average than SD in children, with a difference of about 0.08 g/dL. This is remarkably similar to what was found in DHS surveys [9] where SD in WRA was also higher on average by about the same value (0.1 g/dL) (Table 5). This leads us to hypothesize that these differences in SD magnitude between WRA and children are due to inherent properties of Hb distributions in these demographic groups rather than a result of differences in data quality. We also note that SD observed, on average, in DHS data were slightly higher (by about 0.14 g/dL) than those observed in our analysis: 1.58 vs. 1.43 in WRA and 1.48 vs. 1.34 in children, respectively. Rather than attributing this difference to the differences in Hb data quality between DHS and UNHCR surveys, we hypothesize that this difference may be explained by the fact that smaller refugee populations are often homogenous ethnically and have very similar access to food, health services, shelter, water and sanitation, among other factors, and may have slightly lower Hb variability than the Hb variability found in larger, nationally representative populations.

The second important finding for SD is that even after excluding surveys from Uganda, a substantial percentage of both WRA (33%) and child (13%) surveys had SDs exceeding 1.5 g/dL, the upper limit of the plausible 1.1–1.5 g/dL SD range previously proposed by Sullivan et al. [12]. The same phenomenon albeit to a much larger degree is also seen in DHS surveys where SD of Hb distribution exceeded 1.5 g/dL in 71% of surveys among WRA and 46% of surveys among children [9]. This similarity leads us to suggest that the upper plausible range of SD can be raised both for children and for WRA, and more so for WRA since, as described above, WRA tend to have, on average, higher SD than children. We can provisionally suggest the values that approximately correspond to the 90th percentile of SD in our empirically observed distributions excluding Uganda surveys (rounded to the nearest 0.05): 1.55 for children and 1.65 for WRA. Interestingly, the analysis of DHS surveys found a large negative correlation (Pearson’s r = -0.68) between the mean and SD of Hb in child data, implying that in populations with higher anemia prevalence, the SD of Hb distributions tends to be markedly larger [9]. We found a similarly high negative correlation between Hb mean and SD in children (excluding Uganda surveys): Spearman’s rho = -0.49 (Pearson’s r = -0.43). Unfortunately, the DHS report only reported the mean SD correlation in children and not in WRA [9]. One possible explanation of this phenomenon is that settings with lower resources may be more likely to have both higher anemia (thus lower mean Hb) and lower capacity to produce accurate Hb measurements (thus higher SD of Hb).

For skewness, we note two important phenomena. First, both WRA and child Hb distributions tend to be negatively skewed, and second, WRA distributions tend to be more skewed to the left, on average, than child distributions (mean skewness across all surveys -0.55 and -0.38, respectively). Positive skewness was quite unusual—less than 2% of all child or WRA surveys had skewness above + 0.2. Remarkably, DHS surveys showed the same phenomenon (Table 5)—most of the child and WRA Hb distributions were skewed to the left and WRA distributions were on average more skewed than those in children (− 0.61 vs. − 0.39, respectively).

Regarding kurtosis, we also note two important phenomena. First, both WRA and child Hb distribution across all surveys tend to have positive kurtosis (indicative of relatively large tails and a small body of the distribution), and second, WRA distributions tend to be more positively kurtotic than child distributions (mean kurtosis across all surveys 1.22 vs. 0.55, respectively). Negative kurtosis was minimal—less than 1% of all child or WRA surveys had kurtosis below -0.5. DHS surveys demonstrated the same phenomenon (Table 5)—most of the child and WRA Hb distributions were positive and WRA distributions were, on average, more positively kurtotic than the child ones: 1.25 vs. 0.54 [9]. Such notable consistency in skewness and kurtosis tendencies, in both small-scale refugee SENS surveys and large national DHS surveys, leads us to suggest that these distinctive features of distribution shape are inherent characteristics of child and WRA Hb distributions and not artifacts of low data quality. This consistency also leads to a suggestion for data quality checks based on our empirical evidence: surveys with highly unusual shapes of distributions (defined as skewness above + 0.2 and/or kurtosis below -0.5) may be flagged for further investigation of possible quality problems.

In surveys with available Hb data for both WRA and children, excluding Uganda, SD of Hb were highly correlated (Spearman’s rho = 0.43), suggesting that in surveys with lower quality (more dispersion) of child Hb data WRA data also tends to have a lower quality as indicated by larger dispersion. The correlations of skewness and kurtosis were low, indicating no relationship between the degree of skew or kurtosis in Hb distributions of WRA and children in the same population.

Finally, our data largely agrees with the previously proposed 4.0–18.0 g/dL flagging range for plausible Hb values used in several recent publications, including the DHS report [9, 10, 12]. In our data, only 0.08% of all observations in children and 0.14% of all observations in WRA were outside of this range. Given that only 2.2% of all surveys in children and 3.2% of surveys in WRA had more than 1% of flags defined by this exclusion range, defining > 1% of flags as a quality problem seems logical.

A major strength of this study is the large number of surveys conducted using standard sampling approaches and consistent standard equipment and procedures for data collection. Our data encompasses the period of the last 10 years and, thus, reflects the most recent field practices from a variety of refugee settings in 26 countries. This study, however, has several notable limitations. First and foremost, Hb in SENS surveys is measured via drops of capillary blood using HemoCue 301 devices, which remains a standard practice in field surveys across world regions and survey platforms [5]. Therefore, our findings and suggested quality assessment ranges are not generalizable to Hb data measured by methods other than HemoCue 301 photometry, and even to data measured by HemoCue 301 using venous as opposed to capillary blood. For instance, one recent study showed marked differences in mean, SD, skewness and kurtosis of Hb distributions in data measured by HemoCue in venous versus capillary blood in both children and WRA [10]. Second, the surveys we analyzed were conducted predominantly in populations with high levels of anemia, including more surveys with lower anemia prevalence may produce slightly different results. The distribution of the prevalence of anemia in analyzed surveys in Table 3 shows that over 50% of all surveys in children and close to 25% of all surveys in WRA had prevalence of anemia above 40%. At the same time, in Fig. 2 we demonstrate that there is a substantial negative correlation between Hb mean and SD, especially pronounced in child surveys. Therefore, surveys with low anemia (and therefore higher Hb mean) tend to have lower SD than those with high anemia and low Hb mean. Hence, including more surveys with lower anemia prevalence that would potentially have lower SD could lower overall mean, median and percentile estimates of Hb SD across surveys. Lastly, the current analysis did not include children aged 0–6 months, as this age group is not routinely included in small-scale anthropometric surveys collected in refugee and other humanitarian settings.

Conclusions

Experts have called for the establishment of postanalytical quality controls for Hb determination—with a specific emphasis on cleaning, adjusting, and analyzing Hb results [1, 3, 5]. From our analyses of population Hb distributions in non-pregnant WRA and children in refugee settings, we observed several intrinsic statistical characteristics that were not necessarily indicative of data quality problems and bear remarkable similarities with the characteristics found in large national DHS surveys. We found that Hb distributions tend to be negatively skewed and positively kurtotic and that distributions in WRA tend to have more left skew and positive kurtosis than child Hb distributions. Moreover, WRA Hb SDs tend to be larger than child Hb SDs, and a considerable proportion of surveys in both groups had SD that exceeded the commonly cited upper acceptable SD value of 1.5 g/dL [12].

Based on our empirical data and on that described in the DHS report [9], we suggest that survey Hb distributions (as determined by the HemoCue 301 system with a single drop of capillary blood) displaying any of the following characteristics be flagged for additional quality investigation: skewness above + 0.2 and/or kurtosis below -0.5, Hb. SD in children aged 6–59 months below 1.1 or above 1.55 g/dL, and Hb SD in non-pregnant women aged 15–49 years below 1.1 and above 1.65 g/dL. The commonly cited minimum and maximum plausible Hb values, 4.0 to 18.0 g/dL, appear to be appropriate [12]. The presence of more than 1% implausible Hb values outside of this range in a given survey may indicate a quality problem. Further investigation is required to determine the reproducibility of our results when Hb is measured using other methods or in blood other than capillary.

Availability of data and materials

The underlying data that support the findings of this study are available from the United Nations High Commissioner for Refugees, but restrictions apply to the availability of these data, which were used under license for the current study and are not publicly available. Data are, however, available from the authors upon reasonable request and with permission of the United Nations High Commissioner for Refugees.

Abbreviations

CDC:

Centers for Disease Control and Prevention

CI:

Confidence interval

DHS:

Demographic and Health Surveys

Hb:

Hemoglobin

SD:

Standard deviation

SENS:

Standardized Expanded Nutrition Survey

UNHCR:

United Nations High Commissioner for Refugees

WHO:

World Health Organization

WRA:

Women of reproductive age

References

  1. Stevens GA, Paciorek CJ, Flores-Urrutia MC, Borghi E, Namaste S, Wirth JP, et al. National, regional, and global estimates of anaemia by severity in women and children for 2000–19: a pooled analysis of population-representative data. Lancet Glob Health. 2022;10:627–39.

    Article  Google Scholar 

  2. Kay A, Leidman E, Lopez V, Wilkinson C, Tondeur M, Bilukha O. The burden of anaemia among displaced women and children in refugee settings worldwide, 2013–2016. BMJ Glob Health. 2019;4:e001837.

    Article  PubMed  PubMed Central  Google Scholar 

  3. World Health Organization. Accelerating anaemia reduction: a comprehensive framework for action. Geneva: World Health Organization; 2023.

    Google Scholar 

  4. Whitehead RD, Mei Z, Mapango C, Jefferds MED. Methods and analyzers for hemoglobin measurement in clinical laboratories and field settings. Ann N Y Acad Sci. 2019;1450:147–71.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  5. UNHCR. Standardised Expanded Nutrition Survey (SENS) Guidelines for Refugee Populations, Module 3: Anaemia, a Practical Step-by-step Guide, Version 3. https://www.unhcr.org/sens/wp-content/uploads/sites/155/2021/01/UNHCR_SENS_Module_3_ANAEMIA_v3_EN.pdf (2018). Accessed 30 Aug 2023.

  6. Neufeld LM, Larson LM, Kurpad A, Mburu S, Martorell R, Brown KH. Hemoglobin concentration and anemia diagnosis in venous and capillary blood: biological basis and policy implications. Vol. 1450, Annals of the New York Academy of Sciences. Blackwell Publishing Inc., New York; 2019. p. 172–189.

  7. Larson LM, Braat S, Hasan MI, Mwangi MN, Estepa F, Hossain SJ, et al. Preanalytic and analytic factors affecting the measurement of haemoglobin concentration: impact on global estimates of anaemia prevalence. BMJ Glob Health. 2021;6:e005756. https://doi.org/10.1136/bmjgh-2021-0057568.

    Article  PubMed  PubMed Central  Google Scholar 

  8. Garcia-Casal MN, Dary O, Jefferds ME, Pasricha S. Diagnosing anemia: Challenges selecting methods, addressing underlying causes, and implementing actions at the public health level. Ann N Y Acad Sci. 2023;1524:37–50.

    Article  PubMed  Google Scholar 

  9. Pullum T, Kortso Collison D, Sorrel Namaste S, Garrett D. Hemoglobin Data in DHS Surveys: Intrinsic Variation and Measurement Error. DHS Methodological Reports No. 18. Rockville, Maryland, USA: ICF; 2017. https://dhsprogram.com/publications/publication-mr18-methodological-reports.cfm

  10. Hruschka DJ, Williams AM, Mei Z, Leidman E, Suchdev PS, Young MF, et al. Comparing hemoglobin distributions between population-based surveys matched by country and time. BMC Public Health. 2020;20:422. https://doi.org/10.1186/s12889-020-08537-4.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  11. World Health Organization. Haemoglobin concentrations for the diagnosis of anaemia and assessment of severity. Geneva: World Health Organization; 2011.

    Google Scholar 

  12. Sullivan KM, Mei Z, Grummer-Strawn L, Parvanta I. Haemoglobin adjustments to define anaemia. Tropical Med Int Health. 2008;13:1267–71.

    Article  CAS  Google Scholar 

  13. Kanu FA, Jefferds MED, Williams AM, Addo OY, Suchdev PS, Sharma AJ. Association between hemoglobin and elevation among school-aged children: a verification of proposed adjustments. Am J Clin Nutr. 2023;118:114–20.

    Article  PubMed  Google Scholar 

  14. UNHCR. Standardised Expanded Nutrition Survey. https://www.unhcr.org/sens/ (2018). Accessed 30 Aug 2023.

  15. UNHCR. Standardised Expanded Nutrition Survey (SENS) Guidelines for Refugee Populations, Pre-Module: Survey Steps and Sampling, a Practical Step-by-step Guide, Version 3. https://www.unhcr.org/sens/introduction/pre-module-survey-steps-and-sampling/ (2018). Accessed 30 Aug 2023.

  16. HemoCue America. HemoCue® Hb 301 System. https://www.hemocue.us/hb-301/ (2022). Accessed 30 Aug 2023.

  17. SAS Institute. SAS version 9.4. Cary, NC: SAS Institute Inc.

  18. R Core Team. R Foundation for Statistical Computing, Vienna, Austria. R: The R Project for Statistical Computing. https://www.r-project.org/ (2020). Accessed 30 Aug 2023.

  19. Dimitriadou E, Hornik K, Leisch F, Meyer D, Weingessel A, Leisch MF. e1071: Misc Functions of the Department of Statistics, Probability Theory Group, TU Wien. R Package version 1.7–9. https://cran.r-project.org/web/packages/e1071/index.html (2021). Accessed 30 Aug 2023.

  20. Joanes DN, Gill CA. Comparing measures of sample skewness and kurtosis. J R Stat Soc Ser D (Stat). 1998;47:183–9.

    Google Scholar 

  21. StataCorp. Stata Statistical Software: Release 15. College Station, TX: StataCorp LLC.; 2017.

  22. UNICEF. Regional Classifications. https://data.unicef.org/regionalclassifications/ (2023). Accessed 30 Aug 2023.

  23. Nkrumah B, Blay Nguah S, Sarpong N, Dekker D, Idriss A, May J, Adu-Sarkodie Y. Hemoglobin estimation by the HemoCue®® portable hemoglobin photometer in a resource poor setting. BMC Clin Pathol. 2011;11:5.

    Article  CAS  PubMed Central  Google Scholar 

Download references

Acknowledgements

The authors thank the United Nations High Commissioner for Refugees for providing the data and Eva Leidman for her help and technical involvement in the SENS initiative, including the collection and processing of many of the survey data files.

Disclaimer

The findings and conclusions in this report are those of the authors and do not necessarily represent the official position of the US Centers for Disease Control and Prevention.

Funding

No additional funding was provided for this study.

Author information

Authors and Affiliations

Authors

Contributions

OB designed the study. BK and KLIS analyzed the data and produced tables/figures. OB, BK, and KLIS drafted and critically revised the manuscript for important intellectual content. All authors read and approved the final manuscript.

Corresponding author

Correspondence to Oleg Bilukha.

Ethics declarations

Ethics approval and consent to participate

Not applicable. The analysis was a secondary analysis of anonymized data. Approval was received from the Institutional Review Board of the Centers for Disease Control and Prevention, Atlanta, USA. Permission to use the data was obtained from the United Nations High Commissioner for Refugees, who collected the data. The findings and conclusions in this report are those of the authors and do not necessarily represent the official position of the Centers for Disease Control.

Consent for publication

Not applicable.

Competing interests

All authors declare no competing interests.

Additional information

Publisher's Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated in a credit line to the data.

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Bilukha, O., Kianian, B. & Samson, K.L.I. Characteristics of hemoglobin distributions in preschool children and non-pregnant women of reproductive age and their implications for establishing quality control criteria for hemoglobin data in field surveys: evidence from 483 surveys conducted in refugee settings worldwide. Popul Health Metrics 21, 20 (2023). https://doi.org/10.1186/s12963-023-00315-9

Download citation

  • Received:

  • Accepted:

  • Published:

  • DOI: https://doi.org/10.1186/s12963-023-00315-9

Keywords