Skip to main content

Table 3 Summary statistics of categorical variables in the original confidential Dutch Health Monitor sample and in the synthetic population for individuals aged 18 + 

From: Constructing synthetic populations in the age of big data

Variables

DPHM (age > 18)

Synthetic population (age > 18)

Frequencies

Frequencies

Smoking

Never smoker

41.6

26.9

Past smoker

41.4

50.7

Light smoker

13.3

16.3

Heavy smoker

3.7

6.1

Physical activity (complies with norms)

No

34.1

37.1

Yes

65.9

62.9

Education (completed) [SOI level]

Primary or less [1,2]

12.8

8.4

Lower secondary [3–6]

18.8

13.5

Higher secondary [7–10]

41.8

53.3

Lower Tertiary [11–13]

18.2

16.3

Higher tertiary [14+]

8.4

8.3

Diabetes present

11.5

5.8

COPD present

6.6

7.9

CHD present

10.5

2.7

Stroke present

5.4

4.4