Forecaster.Health

Forecast
Surveillance

Forecast issue time:

Group category:

Region:

Forecast
Surveillance

Forecast issue time:

Variable:

Group category:

Region:

Forecast
Surveillance

Forecast issue time:

Region level:

Variable:

Region:

Forecast
Surveillance

Forecast issue time:

Region level:

Variable:

Region:

Summary

We combined (i) ambient temperatures and air pollution concentrations with (ii) mortality registers in each region to estimate the location-specific empirical relation between the environment and human health. These so-called “epidemiological associations” thus quantify, for each location, the actual risk of death at any given temperature or air pollution concentration based on real data. Every day, we download and process the new set of updated temperature forecasts for the next 15 days, and the new set of updated air pollution forecasts for the next 4 days, and use the above epidemiological associations to transform them into predictions of temperature related mortality and air pollution related mortality, respectively.

These predictions are grouped into 5 warning categories: a baseline warning state (“none”), when the risk of death is minimum, and 4 categories of heat, cold or air pollution warnings (“low”, “moderate”, “high” and “extreme”), corresponding to increasing levels of risk of death. On the one hand, the risk of death due to ambient temperatures is generally minimum twice every year: at the beginning and end of the summer season. In general, the risk of death increases with increasing temperatures in summer (“heat warnings”), and with decreasing temperatures in autumn, winter and spring (“cold warnings”). On the other hand, the risk of death due to air pollution is generally minimum once per year: in winter for ozone (O₃), and in summer for nitrogen dioxide (NO₂) and particulate matter with an average aerodynamic diameter of up to 2.5 (PM_2.5) or 10 (PM₁₀) micrometres. In general, the risk of death increases with air pollution concentrations, with the highest concentrations of O₃ in summer, and the highest values of NO₂, PM_2.5 and PM₁₀ in winter (“air pollution warnings”).

A key aspect of the system is that, in each location, we estimated separate epidemiological associations for each sex and age group. These sex-specific and age-specific epidemiological associations were exclusively estimated with mortality records of the respective sex and age group, and therefore, they quantify, for each location, the actual risk of death of the population subgroup at any given temperature or air pollution concentration based on real data. This means that we issue independent health warnings for each population subgroup based on the corresponding sex-specific and age-specific epidemiological association. In general, the risk of death from heat is higher in women than in men, and consequently, heat warnings in summer are expected to be more frequent and of higher category in women than in men. Similarly, the risk of death from both heat, cold and air pollution increases with age, and therefore, heat, cold and air pollution warnings are expected to be more frequent and of higher category in the elderly all year round.

We analysed here and here how far in advance we can reliably forecast temperatures and their associated health effects. Ambient temperatures and temperature related mortality risks and health emergencies can generally be forecast with some degree of confidence up to two weeks in advance. We must however strongly emphasise that the reliability of these forecasts and warnings decreases as we predict more distant dates, with very high reliability a few days ahead only. We generally recommend being cautious with temperature forecasts and associated health warnings issued more than 7 days in advance.

Author contributions

Joan Ballester Claramunt: original idea, conceptualisation, overall methodology design, project funding, team creation/coordination, temperature/mortality epidemiological modelling, website descriptions, supervision of all steps.
Mireia Beas-Moix: project management, mortality data acquisition, website licensing.
Nadia Beltrán-Barrón: mortality data processing, website creation/design.
Zhao-yue Chen: air pollution/mortality epidemiological modelling.
Raúl Fernando Méndez Turrubiates: temperature data processing, temperature population weighting.
Fabien Peyrusse: temperature data processing, temperature population weighting.
Marcos Quijal-Zamorano: temperature/mortality epidemiological modelling, temperature/mortality predictability assessment, temperature bias-correction.

Recommended citation

Ballester J, Beas-Moix M, Beltrán-Barrón N, Chen ZY, Méndez Turrubiates RF, Peyrusse F, Quijal-Zamorano M. Forecaster.Health. Available at https://forecaster.health/ (2024).

Mortality records

We used the spatiotemporally-homogeneous daily regional mortality database of the project EARLY-ADAPT. As of September 2024, the database contains over 164 million counts of deaths from 654 contiguous NUTS regions in 32 European countries, representing their entire urban and rural population of over 541 million people.

Temperature observations and forecasts

Every day, we obtain the most recent available hourly gridded (0.1° x 0.1°) 2-meter temperatures from the ERA5-Land reanalysis, here considered as a proxy for observations. We also obtain gridded (0.25° x 0.25°) 2-meter temperature forecasts issued at 00 UTC from ECMWF. Forecasts include 51 ensemble members with data every 3 hours at hourly lead times 0 to 144 (i.e. days 1 to 6), and every 6 hours from hourly lead times 144 to 354 (i.e. days 7 to 15). We compute the daily regional temperature observations and forecasts by weighting gridded temperatures with gridded population data for year 2018 from GISCO.

Then, we post-process the ensemble of temperature forecasts to bias-correct them against the temperature observations used in the epidemiological models. We apply a bias-correction method considering the most recent N = 30 pairs of observations and forecasts with respect to each forecast start date (BC-30). Thus, for any given region $r$, observation date or forecast start date $s$, and forecast lead time $l$ (expressed in days), we calculate the correction $c$ of the forecast ensemble members as

$$ c(r,s,l)=\frac{1}{N} \sum_{n=1}^{N}o(r,s-n)-f(r,s-n-l\mathit{+1},l) , $$

where $o(r,s-n)$ and $f(r,s-n-l\mathit{+1},l)$ are the pairs of temperature observations and ensemble mean forecasts for all cases in the training dataset, respectively. We then add this correction individually to each of the forecast ensemble members to obtain the ensemble of bias-corrected temperature forecasts.

We used a time-series quasi-Poisson regression model in each region to derive estimates of region-specific temperature-lag-mortality risks with data from the period 2000-2019, following the methodology described here and here. The equation is as follows

$$ log(E(mort))=intercept+S(\textit{time, 8 df per year} )+dow+cb, $$

where $mort$ denotes the daily time series of mortality counts; $E$ corresponds to its expected value; $S$ is a natural cubic spline of time with 8 degrees of freedom per year to adjust for the seasonal and longer-term trends; $dow$ corresponds to a categorical variable to control for the day of the week; and $cb$ is the cross-basis function produced by a distributed lag non-linear model combining the exposure-response and lag-response associations. The exposure-response association was modelled with a natural cubic spline, with three internal knots placed at the 10th, 75th and 90th percentiles of the observed distribution of daily regional temperatures. The lag-response association was modelled with a natural cubic spline, with an intercept and three internal knots placed at equally-spaced intervals in the logarithmic scale, with lags ranging between 0 and 21 days. We then performed a multivariate multilevel meta-analysis, modelling dependencies of regions within countries through structured random effects, and including the location-specific temperature average and interquartile range as meta-predictors. The fitted meta-analytical model was used to derive the best linear unbiased predictions of the cumulative temperature-mortality association in each region, from which we estimated the regional minimum mortality temperature.

Every day, we transform the temperature observations and bias-corrected temperature forecasts into temperature related mortality (TRM) estimates and predictions, respectively. TRM represents the fraction of deaths attributable to non-optimal temperatures, calculated as

$$ TRM(d)=1-\frac{1}{RR(T(d))}, $$

where $RR(T(d))$ is the relative risk at temperature $T(d)$ of a given observed or forecast date $d$. The relative risk is computed from the respective regional cumulative temperature-mortality association, centred at its minimum mortality temperature. We created five warning categories, for temperature related mortality values smaller than 5% (“none”), between 5% and 10% (“low”), between 10% and 15% (“moderate”), between 15% and 20% (“high”) and higher than 20% (“extreme”). “Cold” and “heat” warnings correspond to days with temperatures colder or warmer than the respective regional minimum mortality temperature, respectively.

Air pollution observations and forecasts

As described here and here, we used quantile machine learning models to estimate daily concentrations of particulate matter with an average aerodynamic diameter of up to 2.5 (PM_2.5) or 10 (PM₁₀) micrometres, nitrogen dioxide (NO₂) and the maximum daily 8-hour average of ozone (O₃) at a 0.1° x 0.1° spatial resolution across Europe. The models were trained on ground-monitoring data and multiple spatiotemporal predictors, including satellite retrievals, land-use and meteorological and atmospheric reanalysis variables. These estimates were here considered as a proxy for observations, and used to fit the epidemiological models of air pollution, together with data of 2-meter temperature and relative humidity from the ERA5-Land reanalysis (see next section).

Every day, we obtain gridded (0.1° x 0.1°) surface air pollution forecasts of PM_2.5, PM₁₀, NO₂ and O₃ issued at 00 UTC from CAMS, with data every hour at hourly lead times 0 to 96 (i.e. days 1 to 4). We use the median value of the available 11-member ensemble, here considered as the most reliable and stable forecasts.

We compute the daily regional air pollution observations and forecasts by weighting gridded temperatures with gridded population data for year 2018 from GISCO.

We used a time-series quasi-Poisson regression model in each region to derive estimates of region-specific air pollution-lag-mortality risks with data from the period 2003-2019. The equation is as follows

$$ log(E(mort)) = intercept + S(\textit{time, 8 df per year} ) + S(\textit{temperature, lags 0-3 days, 6 df}) + $$ $$ S(\textit{relative humidity, lag 0 days, 3 df}) + dow + cb(\textit{air pollutant}), $$

where $mort$ denotes the daily time series of mortality counts; $E$ corresponds to its expected value; $S$ is a natural cubic spline; $dow$ corresponds to a categorical variable to control for the day of the week; and $cb$ is the cross-basis function produced by a distributed lag non-linear model combining the exposure-response and lag-response associations. The exposure-response association was modelled with a linear term, and the lag-response association with integer values, ranging from lag 0 to a maximum lag of 2 days for PM_2.5, PM₁₀ and NO₂, and 3 days for O₃. The model was controlled for a natural cubic spline of time with 8 degrees of freedom per year to adjust for the seasonal and longer-term trends, a natural cubic spline of temperature averaged over lags 0-3 days with 6 degrees of freedom, and a natural cubic spline of relative humidity at lag 0 with 3 degrees of freedom. We then performed a multivariate multilevel meta-analysis, modelling dependencies of regions within countries through structured random effects, and including the following location-specific meta-predictors: (i) average and (ii) interquartile range of the corresponding air pollutant, (iii) temperature average, (iv) relative humidity average, (v) rate of elderly residents (65 years or older) and (vi) natural logarithm of gross domestic product per capita. The fitted meta-analytical model was used to derive the best linear unbiased predictions of the cumulative air pollution-mortality association in each region.

Every day, we transform the air pollution observations and forecasts into air pollution related mortality (APRM) estimates and predictions, respectively. APRM represents the fraction of deaths attributable to air pollution, calculated as

$$ APRM(d) = 1-\frac{1}{RR(AP(d))}, $$

where $RR(AP(d))$ is the relative risk at air pollution concentration $AP(d)$ of a given observed or forecast date $d$. The relative risk is computed from the respective regional cumulative air pollution-mortality association, centred by subtracting a reference level of 0 µg/m³ for PM_2.5, PM₁₀ and NO₂, and 70 µg/m³ for O₃. We created five warning categories, for air pollution related mortality values smaller than 1% (“none”), between 1% and 2% (“low”), between 2% and 3% (“moderate”), between 3% and 4% (“high”) and higher than 4% (“extreme”).

A. Terms of Use

Use of the site forecaster.health (the “Website”) is governed by the Terms of Use set forth, without prejudice to the application of any other legal provision. The access and use of the Website is subject to the following Terms of Use, implying that the user has read and accepted without reservation these conditions.

1. Ownership

The “Owner” of all intellectual, property and any other rights over the Website is the Fundación Privada Instituto de Salud Global Barcelona (ISGlobal), with VAT number G65341695, domicile in C/ Rosselló 132, 7th floor, 08036 Barcelona, Spain, and registered in the Foundations Registry of the Directorate General of Law and Legal Entities of the Government of Catalonia under number 2,634.

2. Object

The aim of the Website is to create an operational, fit-for-purpose early warning system representing the health risks and impacts that environmental variables have on the exposed population, with particular focus on vulnerable groups. The environmental forecasts and health predictions that appear in the Website are automatically downloaded, processed and uploaded every day for the purpose of issuing updated health early warnings.

3. Website Content

The Website provides content solely for information purposes, and may not reflect the most up-to-date data on the matters addressed. This material may be amended, extended or updated without notice. The access to this content does not create or imply any professional or trust relationship between you (the “User”) and the Owner and/or “Authors” (see the tab “Authorship”) of the Website.

The information and any materials available in the Website cannot in any circumstances be considered or used as a substitute for medical or other advice. For that reason, the User of the Website must not act on the basis of the information contained in the Website without prior obtaining the appropriate professional advice.

The links included in the Website may take the user to other websites or applications managed by third parties over which the Owner and Authors have no control. The Owner and Authors are not responsible for the content or the state of these external websites and applications. Being able to access them via our Website does not imply that the Owner and Authors recommend or approve their content.

4. Restrictions

Any unauthorized use of the information on the Website may violate copyright, trademark and any other applicable laws. Any rights not expressly granted herein are reserved.

The User may view, download and copy information and materials available on the Website solely for research and non-commercial purposes and as long as attribution is given to the Owner and Authors. As a condition of use, the User agrees not to modify or revise any of the materials in any manner, and to retain any copyright or proprietary notice as contained in the original or copied materials. No other use of the information or the materials is authorized.

For the above purposes, the Owner and Authors grant a royalty-free, non-exclusive, non-assignable and non-transferable license without the right to sublicense, to use the Website and the information and materials contained, which will be subject to this Terms of Use and any other applicable laws.

The Terms of Use herein contained do not imply any assignment or transfer to the User of any intellectual or industrial property right over the Website nor its parts thereof, including but not limited to copyrights, trademarks, designs and/or any other rights.

5. Limitation of liability

The use of the Website is at your own risk and expense. The Owner and Authors of the Website are not responsible for any errors or omissions in its content, or for the content of other websites or applications the user may access via the Website. The Owner and Authors of the Website are not liable for any damage arising from its use, or any action performed based on information provided on it.

The Owner and Authors of the Website do not guarantee the absence of viruses or other harmful elements that could damage or alter your computing system, electronic documents or files. As a result, we do not accept any liability for any loss or damage that such elements could cause to the User or to a third party.

6. Governing law and dispute resolution

Accessing and using the Website implies that the User has read and accepted, without reservation, its Terms of Use. If the User does not agree with them, please do not access nor use the Website. Any discrepancies that may arise in relation to the Terms of Use herein contained shall be governed by the Law of Spain and expressly submitted to the courts and tribunals of Barcelona, Spain, with an express waiver of any other jurisdiction to which they may be entitled.

The Website does not use cookies to collect information. A cookie is a file downloaded onto your computer or mobile device for storing data that may be updated and retrieved by the entity that installed it.

C. Privacy Policy

No personal data is obtained when accessing or using the Website.

D. Disclaimer

The designations and borders employed on the maps do not imply the expression of any opinion by the Owner and Authors concerning the legal status or border delimitation of any country or territory.

E. Trademark Information

All trademarks, brands and names are the property of the Owner. Any usage of these marks or trade names is forbidden.

F. Acknowledgments

Mortality data:

Statistics Austria
Agency for Statistics of Bosnia and Herzegovina (BHAS)
Directorate-General Statistics - Statistics Belgium (Statbel)
National Statistics Institute of Bulgaria (NSI)
Federal Statistical Office of Switzerland (FSO): “Federal Statistical Office, data from causes of death statistics 1969-1994 / 1995-2020 / 2020 and so on”
Health Monitoring Unit of the Ministry of Health of Cyprus: “The data used in this study was collected by the Health Monitoring Unit of the Ministry of Health of Cyprus. The ideas and opinions expressed herein are those of the author. Endorsement of these ideas and opinions by the Ministry of Health of Cyprus is not intended nor should it be inferred”
Institute of Health Information and Statistics of the Czech Republic (UZIS)
German Federal Statistical Office (Destatis): “German Federal Statistical Office (Destatis). Data licence Germany – attribution – Version 2.0. Statistischer Bericht - Sterbefälle nach Tagen, Wochen und Monaten - endgültige Daten - 2000 bis 2019 and Statistischer Bericht - Sterbefälle nach Tagen, Wochen und Monaten - 2020 bis 2024”
Statistics Denmark (DST)
National Institute for Health Development of Estonia (TAI): “National Institute for Health Development of Estonia (TAI) - Estonian Causes of Death Registry”
Hellenic Statistical Authority (ELSTAT)
National Statistics Institute of Spain (INE)
Statistics Finland
French National Institute for Statistics and Economic Studies (INSEE)
Croatian Bureau of Statistics (DZS)
Hungarian Central Statistical Office (HCSO): “Aggregated data on daily mortality for Hungary. Datafile prepared upon individual request by the Hungarian Central Statistical Office (www.ksh.hu)”
Central Statistics Office Ireland (CSO)
Italian National Institute of Statistics (ISTAT)
Institute of Hygiene of Lithuania
Ministry of Health of Luxembourg
Centre for Disease Prevention and Control of Latvia (SPKC): “Register of Causes of Death, Centre for Disease Prevention and Control of Latvia (SPKC)”
Statistical Office of Montenegro (MONSTAT)
Statistics Netherlands (CBS)
Norwegian Institute of Public Health (FHI): “Norwegian Cause of DeathRegistry / Norwegian Institute of Public Health (FHI)”
Statistics Poland
National Statistics Institute of Portugal (INE)
Instituto Nacional de Estatística - Portugal (Statistics Portugal): “Instituto Nacional de Estatística - Portugal (Statistics Portugal), Number of deaths by place of residence, sex and age group; 1980-2018”
National Institute of Statistics of Romania (NIS Romania): “National Institute of Statistics of Romania (NIS Romania), Statistical Survey on Mortality”
Statistical Office of the Republic of Serbia
National Board of Health and Welfare of Sweden (Socialstyrelsen): “Swedish Cause of Death Register, National Board of Health and Welfare”
National Institute of Public Health of Slovenia (NIJZ)
Statistical Office of the Slovak Republic
Office for National Statistics of the United Kingdom (ONS)
Northern Ireland Statistics and Research Agency (NISRA)
National Records of Scotland (NRS)

Other data:

European Centre for Medium-Range Weather Forecasts (ECMWF):
- Copyright statement: Copyright “This service is based on data and products of the European Centre for Medium-Range Weather Forecasts (ECMWF)”.
- Source: www.ecmwf.int
- Licence Statement: This ECMWF data is published under a Creative Commons Attribution 4.0 International (CC BY 4.0). https://creativecommons.org/licenses/by/4.0/
- Disclaimer: ECMWF does not accept any liability whatsoever for any error or omission in the data, their availability, or for any loss or damage arising from their use.
- Where applicable, an indication if the material has been modified and an indication of previous modifications
Copernicus Climate Change Service (C3S):
- “The website contains modified Copernicus Climate Change Service information 2024. Neither the European Commission nor ECMWF is responsible for any use that may be made of the Copernicus information or data it contains. ERA5-Land hourly data from 1950 to present. Copernicus Climate Change Service (C3S) Climate Data Store (CDS), DOI: 10.24381/cds.e2161bac”
GISCO, Eurostat and European Commission

Summary

Author contributions

Recommended citation

Mortality records

Temperature observations and forecasts

Temperature related mortality estimates and predictions

Air pollution observations and forecasts

Air pollution related mortality estimates and predictions

A. Terms of Use

1. Ownership

2. Object

3. Website Content

4. Restrictions

5. Limitation of liability

6. Governing law and dispute resolution

B. Cookie Policy

C. Privacy Policy

D. Disclaimer

E. Trademark Information

F. Acknowledgments