Skip to article frontmatterSkip to article content
Site not loading correctly?

This may be due to an incorrect BASE_URL configuration. See the MyST Documentation for reference.

Gender Gaps in Life Expectancy

Introduction

In most countries, women live longer than men. This difference is often assumed to be natural and inevitable -- and sometimes desirable. For example, in the World Economic Forum’s Global Gender Gap Report, a smaller gap is interpreted as evidence of discrimination against women.

But the gap varies substantially between countries and has changed over time, which suggests that it might not be entirely natural, or if it is, it can be mitigated. For example, in the Netherlands the gap in healthy life expectancy is now close to zero.

The goal of this investigation is to explore differences in life expectancy and health-adjusted life expectancy (HALE) between countries, to identify the factors that contribute to the observed gender gaps, and to estimated the changes needed to close those gaps by improving health outcomes for both men and women.

We use Elastic Net regression to model the gender gap in life expectancy and HALE as a function of cause-specific mortality indicators. This approach handles the high correlation among predictors and identifies patterns of mortality most strongly associated with the life expectancy gap.

Data

The analysis uses data from two sources:

  1. WHO Global Health Observatory (GHO) API: Provides HALE and life expectancy. Data is accessed programmatically via the GHO OData API.

  2. IHME Global Burden of Disease: Provides most cause-specific mortality indicators with better temporal coverage than WHO, including cardiovascular disease, diabetes, chronic respiratory disease, neoplasms (cancer), alcohol use disorders, self-harm (suicide), interpersonal violence (homicide), road injuries, unintentional injuries, liver disease, and drug use disorders.

For each indicator, we use the most recent available data from 2000-2019. We exclude 2020 and later years to avoid distortions from the COVID-19 pandemic, which had significant impacts on mortality patterns that may not reflect underlying health factors. Using 2019 or earlier data provides a more stable baseline for understanding the gender gap.

The analysis focuses on OECD countries (38 countries) to ensure consistent data quality and comparability. For each country, we compute gender gaps by taking the difference between female and male values for each indicator.

Early-life mortality indicators (infant and under-five mortality) were considered but excluded from the final model because they had very low importance and limited temporal coverage. The IHME alternative (all-cause deaths under 5, per 100,000 population) was also considered but excluded because it is confounded with age structure and fertility rates, making it methodologically inappropriate for cross-country comparison.

Target Variables

The analysis focuses on two target variables:

The following table shows the median, minimum, and maximum values for HALE and Life Expectancy across OECD countries.

IndicatorMedianMinMax
HALE70.165.573.5
Life Expectancy81.475.584.4

The following table shows the median, minimum, and maximum gender gaps (Female - Male) for HALE and Life Expectancy across OECD countries.

IndicatorMedian GapMin GapMax Gap
HALE1.390.01866.06
Life Expectancy4.562.89.36

The following table shows the HALE and Life Expectancy gaps for all OECD countries, sorted by HALE gap:

CountryHALE_GapLifeExpectancy_Gap
Lithuania6.069.36
Latvia5.698.92
Poland4.937.92
Estonia4.797.55
Slovakia3.936.67
Hungary3.726.54
South Korea3.566.09
Mexico3.16.01
Czechia3.095.77
Japan2.935.63
Slovenia2.915.47
Colombia2.895.45
Portugal2.395.38
Costa Rica2.265.22
Finland2.245.12
France1.825.04
United States1.85.04
Spain1.524.78
Chile1.414.68
Austria1.364.45
Greece1.334.44
Canada1.254.33
Türkiye1.174.21
Australia1.164.15
Italy1.073.98
Belgium0.9413.92
Luxembourg0.9153.85
Denmark0.8953.77
Germany0.8623.58
United Kingdom0.6953.57
Israel0.6113.42
New Zealand0.4873.36
Ireland0.4543.33
Switzerland0.3383.27
Norway0.2393.07
Iceland0.163.04
Sweden0.1342.91
Netherlands0.01862.8

In several countries, mostly in Northern Europe, the HALE gap is effectively zero, and the gap in life expectancy is three years or less. These countries are evidence that there is nothing inevitable about these gaps, and no unavoidable reason for gaps as high as 5 years in the United States or 9 years in Latvia and Lithuania.

Predictors

The following tables summarize the predictors used to explain variation in the HALE and Life Expectancy gender gaps. Each predictor includes:

For each predictor, the following table shows overall rates in deaths per 100,000 people (computed as the average of male and female rates).

IndicatorMedian RateMin RateMax RateCorr HALECorr LE
Neoplasms26580.33650.2140.228
Cardiovascular2411137460.6890.721
ChronicRespiratory43.717.185.4-0.537-0.517
UnintentionalInjury26.510.746.40.220.259
Diabetes18.57.4610.3380.377
LiverDisease15.53.0936.50.7340.74
Suicide134.229.50.5380.511
RoadTraffic5.682.4616.20.4450.418
Alcohol3.090.193150.5590.545
DrugDisorder2.140.21220.8-0.194-0.232
Homicide1.130.45730.50.2560.213

The highest death rates are from cancer (neoplasms) and cardiovascular disease, followed by chronic respiratory disease. Death rates due to alcohol use disorders are much smaller than the highest rates, but as we’ll see, cancer-related deaths and other factors are the primary contributors to gender gaps in life expectancy.

Several predictors are strongly correlated with life expectancy gaps, notably Neoplasms and Unintentional Injury. Most of the correlations are positive, indicating that countries with higher death rates also have larger gender gaps.

The following table shows gender gaps (Male - Female) in death rates for each predictor.

IndicatorMedian GapMin GapMax GapCorr HALECorr LE
Neoplasms57.6-0.7881320.4540.519
Suicide13.84.5140.20.7480.733
LiverDisease9.651.2932.60.7240.733
ChronicRespiratory8.03-19.334.10.4750.55
RoadTraffic5.62.04220.420.402
UnintentionalInjury4.95-11.237.10.8330.848
Alcohol3.570.30623.60.6180.602
DrugDisorder1.660.022616-0.0636-0.0932
Homicide0.887-0.062247.50.2330.19
Diabetes-0.173-9.835.5-0.478-0.564
Cardiovascular-19-12425.6-0.596-0.633

Many of the death rates gaps are strongly correlated with life expectancy gaps, which is not surprising -- in a country where more men suffer from alcohol-related disease, for example, we expect a larger gap in both HALE and life expectancy.

Correlations

Many of these predictors are also related to each other. The following table shows the top correlations between the overall rates of different indicators.

Rate 1Rate 2Correlation
UnintentionalInjuryNeoplasms0.721
RoadTrafficHomicide0.687
CardiovascularNeoplasms0.633
CardiovascularLiverDisease0.631
AlcoholCardiovascular0.61
HomicideNeoplasms-0.601
UnintentionalInjurySuicide0.591
AlcoholUnintentionalInjury0.543
UnintentionalInjuryCardiovascular0.536
AlcoholSuicide0.534

The following table shows the top correlations between the gender gaps of different indicators.

Gap 1Gap 2Correlation
ChronicRespiratoryNeoplasms0.734
RoadTrafficHomicide0.723
AlcoholCardiovascular-0.688
AlcoholSuicide0.675
UnintentionalInjurySuicide0.633
CardiovascularSuicide-0.613
UnintentionalInjuryLiverDisease0.577
CardiovascularNeoplasms-0.564
ChronicRespiratoryDiabetes-0.564
AlcoholLiverDisease0.542

The following table shows the correlation between the overall rate and the gender gap for each predictor. This identifies indicators where countries with higher overall rates also tend to have larger gender gaps.

IndicatorCorrelation
Homicide0.999
Alcohol0.982
RoadTraffic0.971
DrugDisorder0.957
LiverDisease0.955
Suicide0.9
Neoplasms0.685
UnintentionalInjury0.0346
ChronicRespiratory-0.0787
Diabetes-0.325
Cardiovascular-0.804

So there are clusters of indicators that move together, both in their overall rates and in their gender gaps.

Methodology

If we put all of these predictors into a single ordinary least squares (OLS) regression, the model is forced to divide the explanatory “credit” among highly correlated variables. In that setting:

As a result, an OLS model with all indicators included does not give a stable or interpretable answer to the question “which factors matter most?”.

Elastic Net regression can help. It combines two kinds of regularization:

The model is tuned by cross-validation, so the amount of regularization is chosen to maximize out-of-sample predictive accuracy, not to fit the particular noise pattern in the dataset.

In practice, this means:

That does not mean that every non-zero coefficient in the regression can be interpreted as a causal effect. But it does mean:

So when Elastic Net assigns substantial weight to a predictor like Alcohol or Neoplasms, we can take that as evidence that these factors are causative, which suggests that efforts to close gaps in these death rates would also close gaps in life expectancy.

Results

Life expectancy gap

We fit three regularized regression models (Ridge, Lasso, and Elastic Net) to predict the Life Expectancy gender gap as a function of cause-specific death rates and gender gaps in those rates. All models use 5-fold cross-validation for model selection and evaluation.

The following table compares the three models using cross-validation R² and mean absolute error:

ModelCV_R2_ScoreCV_MAE_MeanCV_MAE_Std
Elastic Net0.8790.3260.0748
Lasso0.8780.3370.0818
Ridge0.8690.3080.117

Elastic Net performs best with a cross-validation R² of 0.879, meaning it explains about 88% of the variance in the life expectancy gap across OECD countries. The mean absolute error is 0.326 years, indicating that on average, the model’s predictions are off by about 4 months.

Elastic Net selects 13 out of 23 predictors as having non-zero coefficients, effectively performing feature selection while maintaining good predictive performance. The following table shows how many predictors each model uses:

ModelTotal_PredictorsNon_Zero_CoefficientsZero_Coefficients
Ridge22220
Lasso221012
Elastic Net22139

To understand which factors contribute most to the life expectancy gap, we calculate feature importance as the absolute value of the coefficient multiplied by the standard deviation of the predictor. This measures each predictor’s contribution to gap variation on the original scale.

The following figure shows all predictors with non-zero coefficients by importance:

Predictors by importance for Life Expectancy gap

Predictors by importance.

The Neoplasms gap is the most important predictor, with an importance of 11.4. This means that differences in cancer death rates between men and women are the strongest driver of the life expectancy gender gap across OECD countries.

When we aggregate importance by indicator (combining Mid and Gap predictors), we can see which health indicators matter most overall:

IndicatorMid_ImportanceGap_ImportanceTotal_Importance
Neoplasms011.411.4
UnintentionalInjury04.934.93
ChronicRespiratory1.420.8022.22
LiverDiseaseDeathRate0.4181.672.08
Homicide0.8690.9661.84
Suicide01.821.82
Alcohol0.2811.251.53
Diabetes00.8480.848
RoadTraffic0.44200.442
Cardiovascular000
DrugDisorder000

The following figure visualizes indicator-level importance:

Indicator importance for Life Expectancy gap

Indicator-level importance, showing which health indicators contribute most to explaining the life expectancy gender gap.

The top indicators are:

  1. Neoplasms (total importance: 11.4) — Cancer death rates are the dominant factor, with importance coming entirely from the gap component, not the overall rate. Some part of this gap is due to past smoking patterns, as men historically smoked more than women. As smoking rates continue to decline, this gap will likely shrink without further intervention, though efforts to reduce smoking should continue.

  2. Unintentional Injury (total importance: 4.93) — Unintentional injuries are the second most important factor, with importance coming entirely from the gap component. These injuries often show gender differences due to occupational exposures, risk-taking behaviors, and activity patterns.

  3. Chronic Respiratory disease (total importance: 2.22) — Chronic respiratory disease contributes moderately, with contributions from both the overall rate and the gender gap. Like neoplasms, some part of the Chronic Respiratory disease gap is due to past smoking patterns, so it will likely shrink as smoking rates continue to decline.

  4. Liver Disease (total importance: 2.08) — Liver disease death rates contribute moderately, with importance coming mostly from the gap component. Liver disease is often related to alcohol consumption, but also includes non-alcoholic causes such as viral hepatitis and non-alcoholic fatty liver disease.

  5. Homicide (total importance: 1.84) — Homicide contributes moderately, with contributions from both the overall rate and the gender gap. Homicide rates are typically much higher in men than women across most countries. Homicide gained substantial importance after removing the childhood mortality indicator, suggesting it may have been capturing some shared variance.

  6. Suicide (total importance: 1.82) — Suicide rates contribute moderately, with importance coming entirely from the gap component. Suicide rates are typically much higher in men than women across most countries.

  7. Alcohol (total importance: 1.53) — Alcohol use disorder death rates contribute moderately, with importance coming mostly from the gap component. The importance is lower than in previous analyses because the current model uses IHME data, which defines alcohol-related mortality more narrowly than WHO’s broader “alcohol-attributable” definition.

Cardiovascular disease has zero importance in this model, meaning it was not selected by Elastic Net as a predictive factor for the life expectancy gap.

Residuals

The following table shows country-level predictions and residuals:

CountryActual_HALE_GapPredicted_HALE_GapResidualAbs_Residual
Sweden3.073.66-0.5840.584
France5.384.830.5510.551
Greece4.214.74-0.5260.526
Belgium4.154.66-0.5110.511
Austria4.334.8-0.470.47
South Korea6.015.580.4250.425
Türkiye5.044.640.3960.396
Costa Rica4.785.17-0.3860.386
Latvia8.928.550.3680.368
Colombia5.224.90.320.32
Estonia7.927.610.3130.313
Finland5.045.33-0.2860.286
Spain5.124.850.2730.273
United States4.454.220.2340.234
Hungary6.546.310.2340.234
Germany4.444.64-0.2080.208
Chile4.684.88-0.2030.203
Poland7.557.370.1880.188
New Zealand3.273.45-0.1780.178
Canada3.773.60.1720.172
Iceland2.82.96-0.1580.158
Slovakia6.676.530.1390.139
Czechia5.635.76-0.1270.127
Netherlands2.913.03-0.1160.116
Switzerland3.363.47-0.1110.111
Luxembourg3.923.810.110.11
Portugal5.775.87-0.09780.0978
Denmark3.573.480.09520.0952
United Kingdom3.423.51-0.08760.0876
Mexico6.096.010.07930.0793
Australia3.853.780.06530.0653
Lithuania9.369.310.04920.0492
Ireland3.333.290.04050.0405
Israel3.583.540.03420.0342
Italy3.984.01-0.03340.0334
Japan5.455.420.03020.0302
Slovenia5.475.49-0.01950.0195
Norway3.043.05-0.01380.0138

The model shows good fit with residuals distributed around zero across countries.

The model shows good fit with no obvious systematic patterns in the residuals. The following figure shows residuals plotted against predicted values:

Residuals vs predicted values

Residuals plotted against predicted life expectancy gap values.

The points are scattered around zero with no obvious patterns, suggesting the model captures the relationship well.

Comparison with Ordinary Least Squares

For comparison, we also fit an ordinary least squares (OLS) model using only the 17 predictors selected by Elastic Net. The following table compares performance:

ModelAdjusted R²MAENumber of PredictorsNon-Zero Coefficients
Elastic Net0.9710.2172213
OLS (Selected Predictors)0.9750.9620.2031313

Both models perform similarly, with OLS achieving a slightly higher R² and lower MAE on the training data. However, the cross-validation R² of 0.872 for Elastic Net is a better estimate of out-of-sample performance. The difference between in-sample and cross-validation R² indicates some overfitting, which is expected with a small sample size (38 countries) and many predictors.

Counterfactual Analysis

To understand what it would take to close the life expectancy gender gap, we perform a counterfactual analysis that asks: “What would happen to a country’s predicted life expectancy gap if we adjusted each gap predictor to the best attainable value?”

For each gap predictor, we identify the best attainable value by comparing the current gap to observed gaps in other countries:

If the target gap has the opposite sign of the current gap, we infer that it is possible for the gap to be zero, so we set the target to zero.

To achieve the target gap, we adjust the underlying male or female values:

After adjusting the male and female values, we recompute the Mid and Gap values, then use the model to generate a counterfactual prediction. The difference between the original and counterfactual predictions shows how much the life expectancy gap would change if that predictor gap were reduced to the best attainable level (assuming that the relationship is causative).

This approach is conservative in two ways:

  1. We only adjust gap variables, not overall rates, because we’re more confident that gap variables have a causal relationship with the life expectancy gap.

  2. We use other countries as evidence of what’s attainable. If no country has closed or reversed the gap, we assume that the lowest observed gap is the lowest attainable.

The following table shows counterfactual results for the United States.

IndicatorCurrent gapTarget gapTarget CountryChange in LE gap
Neoplasms25.70-0.214
UnintentionalInjury5.260-0.301
ChronicRespiratory-5.8200.0572
LiverDiseaseDeathRate9.051.29Iceland-0.203
Homicide7.290-0.135
Suicide17.34.51Türkiye-0.537
Alcohol5.540.306Colombia-0.241
Diabetes5.500.432
RoadTraffic11.22.04Iceland-0.156
Cardiovascular19.400
DrugDisorder160.0226Japan0

The table is sorted by importance, which indicates in general how effective it is to reduce a particular gap. The counterfactual results (the “Change in LE gap” column) indicate how much reducing that gap would specifically affect the life expectancy gap in the United States, which depends on how far the United States is from the target gap for that indicator.

The results show that reducing the Suicide gap from 17.3 to 4.51 (the level observed in Türkiye) would reduce the predicted life expectancy gap by 0.537 years, the largest single impact. Reducing the Unintentional Injury gap to zero would reduce the LE gap by an additional 0.301 years. Reducing the Alcohol gap from 5.54 to 0.306 (the level observed in Colombia) would reduce the gap by 0.241 years. Reducing the Neoplasms gap to zero would reduce the gap by 0.214 years.

Most indicators show negative changes (gap-closing effects), but a few show positive changes (gap-widening effects). The positive changes occur when reducing a gap predictor would increase the life expectancy gap, which can happen when the relationship between predictors and the outcome is complex due to correlations among indicators.

When we sum the effects across all indicators, we can see the total potential impact:

The net effect of closing all gaps to their target levels would be a reduction in the predicted life expectancy gap. This represents a substantial part of the current gap.

Healthy Life Expectancy (HALE) Gap

We fit the same three regularized regression models (Ridge, Lasso, and Elastic Net) to predict the HALE gender gap. The following table compares the three models using cross-validation R² and mean absolute error:

ModelCV_R2_ScoreCV_MAE_MeanCV_MAE_Std
Ridge0.790.3960.105
Elastic Net0.7780.4220.119
Lasso0.6840.50.229

Elastic Net performs best with a cross-validation R² of 0.778, meaning it explains about 78% of the variance in the HALE gap across OECD countries. The mean absolute error is 0.422 years, indicating that on average, the model’s predictions are off by about 5 months.

The following table shows how many predictors each model uses:

ModelTotal_PredictorsNon_Zero_CoefficientsZero_Coefficients
Ridge22220
Lasso221210
Elastic Net22202

Elastic Net selects 20 out of 22 predictors as having non-zero coefficients. The following figure shows all predictors with non-zero coefficients by importance:

Predictors by importance for HALE gap

Predictors by importance.

The Neoplasms overall rate is the most important predictor, with an importance of 15.3. The Neoplasms gap is next, with an importance of 12.8.

When we aggregate importance by indicator (combining Mid and Gap predictors), we can see which health indicators matter most overall:

IndicatorMid_ImportanceGap_ImportanceTotal_Importance
Neoplasms15.312.828.2
UnintentionalInjury3.372.315.69
Cardiovascular5.240.3065.55
ChronicRespiratory4.420.8015.22
Homicide1.442.483.92
Suicide1.051.92.96
LiverDiseaseDeathRate1.261.262.52
Alcohol0.4911.231.73
Diabetes0.9340.7551.69
RoadTraffic0.46800.468
DrugDisorder0.36300.363

The following figure visualizes indicator-level importance:

Indicator importance for HALE gap

Indicator-level importance, showing which health indicators contribute most to explaining the HALE gender gap.

The top indicators are:

  1. Neoplasms (total importance: 28.2) — Cancer is the dominant factor for HALE, with contributions from both the overall rate and the gender gap. Some part of the neoplasms gap is due to past smoking patterns, as men historically smoked more than women. Neoplasms gained importance after removing the childhood mortality indicator, suggesting it better captures its relationship with the HALE gap without that indicator.

  2. Unintentional Injury (total importance: 5.69) — Unintentional injuries are the second most important factor, with contributions from both the overall rate and the gender gap. These injuries often show gender differences due to occupational exposures, risk-taking behaviors, and activity patterns.

  3. Cardiovascular disease (total importance: 5.55) — Cardiovascular disease is the third most important factor, with contributions from both the overall rate and the gender gap. In many countries, cardiovascular disease rates are higher for women because cardiovascular risk increases with age, and women are more likely to live long enough to develop cardiovascular disease. Cardiovascular importance decreased after removing childhood mortality, suggesting some interaction between these indicators.

  4. Chronic Respiratory disease (total importance: 5.22) — Chronic respiratory disease contributes substantially, with contributions from both the overall rate and the gender gap. Like neoplasms, some part of the Chronic Respiratory disease gap is due to past smoking patterns.

  5. Homicide (total importance: 3.92) — Homicide contributes moderately, with contributions from both the overall rate and the gender gap. Homicide rates are typically much higher in men than women across most countries. Homicide gained substantial importance after removing the childhood mortality indicator, suggesting it may have been capturing some shared variance.

  6. Suicide (total importance: 2.96) — Suicide contributes moderately, with contributions from both the overall rate and the gender gap. Suicide rates are typically much higher in men than women across most countries.

  7. Liver Disease (total importance: 2.52) — Liver disease death rates contribute moderately, with contributions from both the overall rate and the gender gap. Liver disease is often related to alcohol consumption, but also includes non-alcoholic causes.

  8. Alcohol (total importance: 1.73) — Alcohol use disorder death rates contribute moderately, with importance coming mostly from the gap component. The importance is lower than in previous analyses because the current model uses IHME data, which defines alcohol-related mortality more narrowly than WHO’s broader “alcohol-attributable” definition.

  9. Diabetes (total importance: 1.69) — Diabetes contributes moderately, with contributions from both the overall rate and the gender gap. Diabetes gained importance after removing the childhood mortality indicator.

Residual

The following table shows country-level predictions and residuals:

CountryActual_HALE_GapPredicted_HALE_GapResidualAbs_Residual
Germany0.8621.67-0.8030.803
Sweden0.1340.706-0.5710.571
Czechia3.092.550.5370.537
Austria1.361.84-0.4720.472
Chile1.411.81-0.3990.399
Slovakia3.933.570.3590.359
Belgium0.9411.29-0.3530.353
Colombia2.892.560.3290.329
Canada1.250.9380.3170.317
South Korea3.563.270.2950.295
Switzerland0.3380.63-0.2920.292
Hungary3.723.430.2920.292
Japan2.932.640.290.29
Portugal2.392.68-0.290.29
Norway0.239-0.03970.2780.278
Latvia5.695.430.2670.267
Greece1.331.58-0.2570.257
Denmark0.8950.6510.2440.244
France1.821.60.2220.222
Lithuania6.066.27-0.2060.206
Australia1.160.9940.1660.166
Mexico3.13.26-0.1620.162
Poland4.934.790.1360.136
New Zealand0.4870.607-0.1210.121
United Kingdom0.6950.5820.1120.112
Italy1.070.9640.1080.108
Luxembourg0.9150.8120.1020.102
Netherlands0.0186-0.08150.10.1
Iceland0.160.26-0.09970.0997
Türkiye1.171.26-0.08870.0887
Spain1.521.61-0.0840.084
Israel0.6110.5480.06220.0622
Costa Rica2.262.32-0.05880.0588
Slovenia2.912.860.05190.0519
Estonia4.794.83-0.04330.0433
Ireland0.4540.4190.0350.035
Finland2.242.27-0.02750.0275
United States1.81.780.02560.0256

The following figure shows residuals plotted against predicted values:

Residuals vs predicted values for HALE

Residuals plotted against predicted HALE gap values.

The model shows good fit with no obvious systematic patterns in the residuals.

Comparison with Ordinary Least Squares

For comparison, we also fit an ordinary least squares (OLS) model using only the 20 predictors selected by Elastic Net. The following table compares performance:

ModelAdjusted R²MAENumber of PredictorsNon-Zero Coefficients
Elastic Net0.9680.2282220
OLS (Selected Predictors)0.9740.9430.1962020

Both models perform similarly, with OLS achieving a slightly higher R² and lower MAE on the training data. However, the cross-validation R² of 0.78 for Elastic Net is a better estimate of out-of-sample performance and predictive validity.

Counterfactual Analysis

The methodology here is the same as for Life Expectancy.

The following table shows counterfactual results for the United States.

IndicatorCurrent gapTarget gapTarget CountryChange in HALE gap
Neoplasms25.70-0.28
UnintentionalInjury5.260-0.0421
Cardiovascular19.400.00179
ChronicRespiratory-5.8200.0869
Homicide7.290-0.281
Suicide17.34.51Türkiye-0.801
LiverDiseaseDeathRate9.051.29Iceland-0.205
Alcohol5.540.306Colombia-0.273
Diabetes5.500.409
RoadTraffic11.22.04Iceland-0.165
DrugDisorder160.0226Japan-0.206

The table is sorted by importance, which indicates in general how effective it is to reduce a particular gap. The counterfactual results (the “Change in HALE gap” column) indicate how much reducing that gap would specifically affect the HALE gap in the United States, which depends on how far the United States is from the target gap for that indicator.

The results show that reducing the Suicide gap from 17.3 to 4.51 (the level observed in Türkiye) would reduce the predicted HALE gap by 0.801 years, the largest single impact. Reducing the Neoplasms gap to zero would reduce the HALE gap by an additional 0.28 years. Reducing the Alcohol gap from 5.54 to 0.306 (the level observed in Colombia) would reduce the gap by 0.273 years. Reducing the Homicide gap to zero would reduce the gap by 0.281 years.

Most indicators show negative changes (gap-closing effects), but a few show positive changes (gap-widening effects). The positive changes occur when reducing a gap predictor would increase the HALE gap, which can happen when the relationship between predictors and the outcome is complex due to correlations among indicators.

When we sum the effects across all indicators, we can see the total potential impact:

The net effect of closing all gaps to their target levels would be a reduction in the predicted HALE gap. This represents a substantial part of the current gap.

Comparison of Life Expectancy and HALE Results

The Life Expectancy model performs better than the HALE model:

The lower performance for HALE may reflect the additional complexity of modeling healthy years, which depends on both mortality and morbidity patterns, whereas Life Expectancy depends entirely on mortality.

Both models select a similar number of predictors:

The HALE model uses more predictors, suggesting that more factors are relevant for explaining healthy life expectancy than overall life expectancy.

The relative importance of indicators differs between the two models:

Life Expectancy:

  1. Neoplasms (11.4) — entirely from the gap component

  2. Unintentional Injury (4.93) — entirely from the gap component

  3. Chronic Respiratory disease (2.22) — from both overall rate and gap

  4. Liver Disease (2.08) — mostly from the gap component

  5. Homicide (1.84) — from both overall rate and gap

  6. Suicide (1.82) — entirely from the gap component

  7. Alcohol (1.53) — mostly from the gap component

HALE:

  1. Neoplasms (28.2) — from both overall rate and gap

  2. Cardiovascular disease (5.55) — from both overall rate and gap

  3. Unintentional Injury (5.69) — from both overall rate and gap

  4. Chronic Respiratory disease (5.22) — from both overall rate and gap

  5. Homicide (3.92) — from both overall rate and gap

  6. Suicide (2.96) — from both overall rate and gap

  7. Liver Disease (2.52) — from both overall rate and gap

  8. Alcohol (1.73) — mostly from the gap component

  9. Diabetes (1.69) — from both overall rate and gap

Neoplasms (cancer) is the most important factor in both models, but its relative importance is much higher for HALE. Cardiovascular disease is important for HALE but was not selected for Life Expectancy. Alcohol has lower importance in both models than in previous analyses, reflecting the use of IHME data which defines alcohol-related mortality more narrowly.

The counterfactual analysis shows that Suicide has the largest single impact in both models, followed by Neoplasms and Alcohol. The patterns are similar but the magnitudes differ between the two outcomes.

Implications

These differences suggest that:

  1. Cancer prevention and treatment is the most important factor for both outcomes, but particularly critical for healthy life expectancy, as neoplasms have much higher importance for HALE (28.2) than for Life Expectancy (11.4).

  2. Cardiovascular disease affects healthy years (importance 5.55 for HALE) but was not selected as a predictive factor for total life expectancy, suggesting it may have different relationships with mortality versus healthy years.

  3. Suicide prevention has substantial importance in both models and shows the largest counterfactual impact, suggesting it is a critical intervention target.

  4. Unintentional injuries are important for both outcomes, ranking second for Life Expectancy and third for HALE.

  5. Alcohol-related interventions remain important but have lower importance than in previous analyses, reflecting the use of IHME data which defines alcohol-related mortality more narrowly than WHO’s broader “alcohol-attributable” definition.