by Ross McKitrick
A number of authors, including the IPCC, have argued that climate models have systematically overstated the rate of global warming in recent decades. A recent paper by Millar et al. (2017) presented the same finding in a diagram of temperature change versus cumulative carbon emissions since 1870.
The horizontal axis is correlated with time, but by using cumulative CO2 instead the authors can draw a policy conclusion. The line with circles along it represents the CMIP5 ensemble-mean path. The vertical dashed line marks the carbon level at which two-thirds of the climate models say that much extra CO2 in the air translates into at least 1.5 °C of warming. The black cross shows the estimated historical cumulative CO2 emissions and the estimated observed warming. Notably, it lies below the model line: the models show more warming than observed, at lower emissions than have occurred. The vertical distance from the cross to the model line indicates that once the models have caught up with observed emissions they will have projected 0.3 °C more warming than has been seen, and will be very close (only seven years away) to the 1.5 °C level, which they associate with 615 GtC. With historical CO2 emissions adding up to 545 GtC, that means we can only emit another 70 GtC, the so-called “carbon budget.”
Extrapolating forward at the observed warming rate suggests that the 1.5 °C level would not be reached until cumulative emissions are more than 200 GtC above the current level, and possibly much higher. The gist of the article, therefore, is that because observations do not show the rapid warming seen in the models, there is more time to meet policy goals.
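To make the arithmetic concrete, here is a minimal sketch of the two budget calculations in Python. The 545 GtC and 615 GtC figures come from the article; the observed warming of roughly 0.9 °C is an illustrative assumption, used only to show how a lower observed warming rate stretches the budget under the same linear (TCRE-style) framing.

    # Carbon-budget arithmetic (a sketch). GtC figures are from the article;
    # the 0.9 C observed warming is an illustrative assumption, not a number
    # taken from the article.
    emitted = 545.0           # cumulative emissions to date, GtC
    model_1p5 = 615.0         # emissions at which 2/3 of models pass 1.5 C
    model_budget = model_1p5 - emitted            # 70 GtC under the model path

    obs_warming = 0.9                             # assumed observed warming, C
    obs_rate = obs_warming / emitted              # C per GtC, assuming linearity
    obs_budget = (1.5 - obs_warming) / obs_rate   # roughly 360 GtC remaining

    print(model_budget, round(obs_budget))        # 70.0 363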
As an aside, I dislike the “carbon budget” language because it implies the existence of an arbitrary hard cap on allowable emissions, which rarely emerges as an optimal solution in models of environmental policy, and never in mainstream analyses of the climate issue except under some extreme assumptions about the nature of damages. But that’s a subject for another occasion.
Were the Millar et al. authors right to assert that climate models have overstated recent warming? They are certainly not the first to make this claim. Fyfe et al. (2013) compared the Hadley Centre HadCRUT4 temperature series to the CMIP5 ensemble and showed that most models had higher trends over the 1998-2012 interval than were observed:
Original caption: a, 1993–2012. b, 1998–2012. Histograms of observed trends (red hatching) are from 100 reconstructions of the HadCRUT4 dataset. Histograms of model trends (grey bars) are based on 117 simulations of the models, and black curves are smoothed versions of the model trends. The ranges of observed trends reflect observational uncertainty, whereas the ranges of model trends reflect forcing uncertainty, as well as differences in individual model responses to external forcings and uncertainty arising from internal climate variability.
The IPCC’s Fifth Assessment Report also acknowledged model over-estimation of recent warming in their Figure 9.8 and accompanying discussion in Box 9.2. I have updated the IPCC chart as follows. I set the CMIP5 range to gray, and the thin white lines show the (year-by-year) central 66% and 95% of model projections. The chart uses the most recent version of the HadCRUT4 data, which goes to the end of 2016. All data are centered on 1961-1990.
Even with the 2016 El Niño event, the HadCRUT4 series does not reach the mean of the CMIP5 ensemble. Prior to 2000 the longest interval without a crossing between the red and black lines was 12 years; the current one now runs to 18 years.
This would appear to confirm the claim in Millar et al. that climate models display an exaggerated recent warming rate not observed in the data.
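For readers who want to reproduce the crossing statistic, here is a minimal sketch in Python. The array names are hypothetical, and both series are assumed to be annual means already centered on a common baseline:

    import numpy as np

    def longest_noncrossing_run(obs, model_mean):
        """Longest stretch of years in which the observed series stays on
        one side of the model mean. Inputs are annual-mean anomaly arrays
        on a common baseline (hypothetical names)."""
        sign = np.sign(obs - model_mean)   # +1 above the model mean, -1 below
        best = run = 1
        for i in range(1, len(sign)):
            run = run + 1 if sign[i] == sign[i - 1] else 1
            best = max(best, run)
        return best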
Not So Fast
Zeke Hausfather has disputed this in a posting for Carbon Brief. He presents a different-looking graph that seems to show HadCRUT4 and the other major data series lining up reasonably well with the CMIP5 (RCP4.5) runs.
How does he get this result?
Hausfather isn’t using the CMIP5 runs as shown by the IPCC; instead he is using data from a different archive that modifies the outputs in a way that tilts the post-2000 model trends down. Cowtan et al. (2015) argued that, for comparisons such as this, climate model outputs should be sampled the same way the HadCRUT4 (and other) surface data are sampled: using Surface Air Temperatures (SAT) over land, Sea Surface Temperatures (SST) over water, and with masking that simulates the treatment of areas with missing data and of areas with ice cover rather than open ocean.

Global temperature products like HadCRUT use SST data as a proxy for Marine Air Temperature (MAT) over the oceans, since MAT data are much less common than SST. Cowtan et al. note that in the models SST warms more slowly than MAT, but the CMIP5 output files used by the IPCC and others present averages constructed by blending MAT and SAT rather than SST and SAT. Using the latter blend, and taking into account that when Arctic ice coverage declines some areas that had been sampled with SAT are replaced with SST, Cowtan et al. found that the discrepancy between models and observations declines somewhat.
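A schematic of the blending idea is below. This is a sketch, not Cowtan et al.’s actual code; the field names and the simple cell-fraction weighting are assumptions for illustration.

    def blend_model_field(sat, sst, land_frac, ice_frac):
        """Schematic SAT/SST blend for one model time step. sat and sst are
        2-D gridded anomaly arrays; land_frac is the land fraction of each
        cell (0..1); ice_frac is the sea-ice fraction of the ocean part
        (0..1). Ice-covered ocean is sampled with SAT, mimicking HadCRUT4's
        treatment, so as Arctic ice retreats those cells switch from SAT
        to SST."""
        ocean_part = ice_frac * sat + (1.0 - ice_frac) * sst
        return land_frac * sat + (1.0 - land_frac) * ocean_part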
Figure 4 in Cowtan et al. shows that the use of SAT/SST (“blended”) model output data doesn’t actually close the gap by much: the majority of the reconciliation happens by using “updated forcings”, i.e. peeking at the answer post-2000.

Top: effect of applying Cowtan et al. blending method (change from red to green line)
Bottom: effect of applying updated forcings that use post-2000 observations
Hausfather also uses a slightly later baseline, 1970-2000. With the 2016 El Niño at the end of the record, the observations cross the modified CMIP5 mean.
In my version (using the unmodified CMIP5 data) the change to a 1970-2000 baseline would yield a graph like this:
The 2016 HadCRUT4 value still doesn’t match the CMIP5 mean, but they’re close. The Cowtan et al. method compresses the model data from above and below, so in Zeke’s graph the CMIP5 mean passes through the HadCRUT4 (and other observed series’) El Niño peak. That creates the visual impression of greater agreement between models and observations, but bear in mind that the models are brought down to the data, not the other way around. On a 1970-2000 centering the maximum value of the CMIP5 ensemble exceeds 1 °C in 2012, but in Hausfather’s graph that doesn’t happen until 2018.
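The mechanics of re-baselining are simple enough to show in a few lines. This is a sketch with hypothetical array names; the point is that moving from a 1961-1990 to a 1970-2000 base period shifts each curve vertically by a different amount, changing where the observations sit relative to the model mean without changing any trend.

    import numpy as np

    def recenter(series, years, base_start, base_end):
        """Express a series as anomalies from its mean over a chosen base
        period. Each series shifts by the mean of its own base-period
        values, so changing the base period moves the curves relative to
        one another."""
        in_base = (years >= base_start) & (years <= base_end)
        return series - series[in_base].mean()

    # e.g. compare recenter(hadcrut4, years, 1961, 1990)
    #      with    recenter(hadcrut4, years, 1970, 2000)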
Apples with Apples
The basic logic of the Cowtan et al. paper is sound: like should be compared with like. The question is whether their approach, as shown in the Hausfather graph, actually reconciles models and observations.
It is interesting to note that their argument relies on the premise that SST trends are lower than nearby MAT trends. This might be true in some places, but not in the tropics, at least prior to 2001. The linked paper by Christy et al. shows the opposite pattern to the one invoked by Cowtan et al.: marine buoys in the tropics recorded negative MAT trends even as the SST trended up, so a global data set using MAT would show less warming than one relying on SST, not more. In other words, if instead of apples-to-apples we did an oranges-to-oranges comparison, setting the customary CMIP5 model output composed of SAT and MAT against a modified HadCRUT4 series that used MAT rather than SST, the discrepancy would be even larger, since the modified HadCRUT4 series would have an even lower trend.
More generally, if the blending issues identified by Cowtan et al. explain the model-observation discrepancy, then comparisons using measures to which those issues don’t apply should show no discrepancy. But, as I will show, the discrepancies show up in those comparisons as well.
Extremes
Swanson (2013) compared the way CMIP3 and CMIP5 models generate extreme cold and warm events in each gridcell over time. In a warming world, towards the end of the sample each location should show a lower probability of a record cold event, and a higher probability of a record warm event, than would be expected in a stationary climate. Since the comparison uses only frequencies within individual gridcells, it requires no assumptions about blending the data. The expected pattern was found in both the observations and the models, but the models showed a warm bias. In CMIP3 the model pattern had enough dispersion to encompass the observed probabilities, but in CMIP5 the model pattern had a smaller spread and no overlap with the observations. In other words, the models had become more like each other but less like the observed data.
(Swanson Fig 2 Panels A and B)
The importance here is that this comparison is unaffected by the issues raised by Cowtan et al., so the discrepancy shouldn’t be there. But it is.
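In outline, a Swanson-style tally can be computed per gridcell with no blending or spatial averaging involved. The sketch below is an illustration of the idea, not Swanson’s code; under a stationary climate the expected record rate for the year following an n-year reference period is 1/(n+1), which is the benchmark against which observed and modeled rates are compared.

    import numpy as np

    def record_rates(field, n_base):
        """Fraction of gridcells setting record warm / cold values in the
        final year, relative to the first n_base years. field has shape
        (years, lat, lon) for a given calendar month (hypothetical names).
        Warming pushes the warm rate above 1/(n_base+1) and the cold rate
        below it."""
        base = field[:n_base]                      # reference years
        last = field[-1]                           # final year
        warm_rate = float((last > base.max(axis=0)).mean())
        cold_rate = float((last < base.min(axis=0)).mean())
        return warm_rate, cold_rate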
Lower Troposphere
Comparisons between model outputs for the Lower Troposphere (LT) and observations from weather satellites (using the UAH and RSS products) are not affected by the blending issues raised in Cowtan et al. Yet the LT discrepancy looks exactly like the one in the HadCRUT4/CMIP5 comparison.
The blue line is RSS, the black line is UAH, the red line is the CMIP5 mean and the grey bands show the RCP4.5 range. The thin white lines denote the central 66% and 95% ranges. The data are centered on 1979-2000. Even with the 2016 El Niño the discrepancy is visible and the observations do not cross the CMIP5 mean after 1999.
A good way to assess the discrepancy is to test for common deterministic trends using the HAC-robust Vogelsang-Franses test (see explanation here). Here are the trends and robust 95% confidence intervals for the lines shown in the above graph, including the percentile boundaries.
Series     Trend (°C/yr)   95% CI
UAHv6.0    0.0156          (0.0104, 0.0208)
RSSv4.0    0.0186          (0.0142, 0.0230)
GCM_min    0.0252          (0.0191, 0.0313)
GCM_025    0.0265          (0.0213, 0.0317)
GCM_165    0.0264          (0.0200, 0.0328)
GCM_mean   0.0276          (0.0205, 0.0347)
GCM_835    0.0287          (0.0210, 0.0364)
GCM_975    0.0322          (0.0246, 0.0398)
GCM_max    0.0319          (0.0241, 0.0397)
All trends are significantly positive, but the observed trends are below the model range. Next I test whether the trend in the CMIP5 mean is the same as, respectively, the trend in the mean of UAH and RSS, in UAH alone, and in RSS alone. The test scores are below; all three reject at less than 1%. Note the critical values for the VF scores are: 90%: 20.14, 95%: 41.53, 99%: 83.96.
H0: Trend in CMIP5 mean =    VF score
  Trend in mean obs          192.302
  Trend in UAH               405.876
  Trend in RSS                86.352
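The Vogelsang-Franses statistic itself is not available in standard statistical libraries, which is why the critical values above differ from the usual chi-square ones. As a rough stand-in, the sketch below estimates a trend with HAC (Newey-West) standard errors using statsmodels; applying it to a difference series such as (CMIP5 mean minus observations) gives a simple trend-equality comparison in the same spirit, though not with the VF fixed-b critical values.

    import numpy as np
    import statsmodels.api as sm

    def hac_trend(y, years, lags=5):
        """OLS trend with HAC (Newey-West) standard errors; a simplified
        stand-in for the VF test, which uses fixed-b critical values
        (the 20.14 / 41.53 / 83.96 above) rather than chi-square ones."""
        t = np.asarray(years, dtype=float)
        X = sm.add_constant(t - t.mean())
        fit = sm.OLS(np.asarray(y, dtype=float), X).fit(
            cov_type="HAC", cov_kwds={"maxlags": lags})
        return fit.params[1], fit.conf_int()[1]   # trend in C/yr, 95% CI

    # Applied to a difference series, e.g. hac_trend(cmip5_mean - obs, years):
    # a significantly positive trend means the model mean warms faster
    # than the observations.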
The Tropics
In addition to the above comparison: if the treatment of Arctic sea ice is the major problem, there should be no discrepancy when attention is confined to the tropics. Also, since models project the strongest response to GHG warming in the tropical LT, this is where models and observations ought to agree best.
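Restricting attention to the tropics amounts to an area-weighted average over a latitude band. A minimal sketch follows; the 20-degree band and the array names are assumptions for illustration (published comparisons variously use 20S-20N or 30S-30N).

    import numpy as np

    def tropical_mean(field, lats, band=20.0):
        """Area-weighted average of a (time, lat, lon) anomaly field over
        the tropical band |lat| <= band, using cos(latitude) weights to
        account for shrinking cell area toward the poles."""
        keep = np.abs(lats) <= band
        w = np.cos(np.deg2rad(lats[keep]))
        zonal = field[:, keep, :].mean(axis=2)      # average over longitude
        return (zonal * w).sum(axis=1) / w.sum()    # weighted latitude average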
Again the blue line is RSS, the black line is UAH, the red line is the CMIP5 mean, the grey bands show the RCP4.5 range and the thin white lines denote the central 66% and 95% ranges. The data are centered on 1979-2000.
Trends:

Series     Trend (°C/yr)   95% CI
UAHv6.0    0.0102          (0.0037, 0.0167)
RSSv4.0    0.0139          (0.0085, 0.0193)
GCM_min    0.0282          (0.0199, 0.0365)
GCM_025    0.0277          (0.0210, 0.0344)
GCM_165    0.0281          (0.0207, 0.0355)
GCM_mean   0.0289          (0.0209, 0.0369)
GCM_835    0.0296          (0.0210, 0.0382)
GCM_975    0.0320          (0.0239, 0.0401)
GCM_max    0.0319          (0.0230, 0.0408)
H0: Trend in CMIP5 mean =    VF score
  Trend in mean obs          229.683
  Trend in UAH               224.190
  Trend in RSS               230.100
All trends are significantly positive, and the equality of model and observed trends is strongly rejected. Interestingly, the UAH and RSS series each reject even against the (year-by-year) lower bound of the CMIP5 outputs (p<1%).
Finally, Tim Vogelsang and I showed a couple of years ago that the tropical LT (and MT) discrepancies are also present between models and the weather balloon series back to 1958.
Summary
Millar et al. attracted controversy for stating that climate models have shown too much warming in recent decades, even though others (including the IPCC) have said the same thing. Zeke Hausfather disputed this using an adjustment to model outputs developed by Cowtan et al. The combination of the adjustment and the recent El Niño creates a visual impression of coherence. But other measures unaffected by the issues raised in Cowtan et al. support the existence of a warm bias in models. Gridcell extreme-event frequencies in CMIP5 models do not overlap with observations. And satellite-measured temperature trends in the lower troposphere run below the CMIP5 rates in the same way the HadCRUT4 surface data do, including in the tropics. The model-observation discrepancy is real, and it needs to be taken into account, especially when models are used for policy guidance.
Moderation note: As with all guest posts, please keep your comments civil and relevant.