Does Sugar Consumption Drive Diabetes?

March 6, 2013

A recent article in the journal PLoS ONE by the anti-sugar crusader Robert Lustig and three other co-authors has created quite a stir by purporting to show that increased sugar consumption causes diabetes. In the paper, the authors hold up just shy of saying "cause" but that is the inference drawn by many in the media (see for example this story in Bloomberg among other places) who say things like:

Excessive sugar consumption may be the main driver of a global rise in diabetes,

Moreover, on Mark Bittman's NYT Blog, the author, Lustig, is cited as saying:

This study is proof enough that sugar is toxic. Now it’s time to do something about it.

There is no way a study like this (comparing differences across countries) can firmly establish causation. So, at a minimum the study indicates an interesting (and perhaps suggestive) correlation that might warrant a randomized control trial. Nonetheless, I was intrigued and wanted to check out the evidence for myself.

The evidence by Lustig and colleagues comes by linking data on diabetes prevalence rates across countries (which I was able to easily find online here) and data from the UN FAO on the availability of calories from different food stuffs in different countries (after a bit of digging, I was also able to find it online here - go the the "food balance sheets"). After a bit of effort, I downloaded both data sets for the most recent years available, merged them, and checked out the claims made in the paper.

At first blush, I find very similar results to the ones reported in the paper. Holding constant total calories available, a simple linear regression shows that for every 100 kcal increase in sugar availability, the prevalence of diabetes goes up by 1.3 percentage points (say from 8.5% (the sample mean) to 9.8%). The estimated equation is:

(% with diabetes)=1.067+0.013*(per-capita available sugar kcal)+0.001*(per capita total available kcal)

My estimate is a little higher than the one reported in the paper probably because I'm not controlling for other factors (like GDP, kcal intake from meat, etc.) as the authors did. Moreover, I'm using data on diabetes from 2012 whereas the authors used 2011 and older data (note: I use data from 174 countries in my estimates). The only coefficient significant at the p=0.05 level in the above equation is the 0.013 estimate associated with sugar.

So far so good - the correlation is confirmed.

But let's get to the nitty gritty of the interpretation. The data is at the country level. So, what this implies is that a country that increases per-capita sugar availability by 100kcal will tend to have a 1.3 percentage point increase in the percent of the population with diabetes.

But, we don't really care about countries per se. We care about people. There are a lot more people in some countries than others. In the data set, the range is from a low of 0.00066 million adults to 980 million adults. Shouldn't this factor into the analysis? If we care about how many people in the world have diabetes, we'd better pay a lot more attention to China than to Luxembourg.

We know from the mini-scandal associated with the claim that small schools outperform larger ones (see one account here) that outcomes from small schools (or small countries) tends to be a lot more variable (with more outliers) than data from large schools (or large countries). That's just basic statistics.

Intuitively, we should want a larger country to count more than a smaller one. After all, there are many more people in larger countries - so if we want to think about the prevalence of diabetes in the world (rather than the average prevalence rate across countries), we'd want to calculate a weighted average, where larger countries get more weight (because they have more people). The more people, the higher the weight.

Likewise, when we want to run analyses like the one above, we want to give more weight to countries with more people. We can do this by running a weighted regression, where each country gets a weight proportional to it's population size. This converts the equation to one about how countries differ to one about how individuals differ. Stated differently, the weighted regression places the estimates at the level of the individual (picked at random from any country) rather than the level of the country (picked at random from a group of countries).

Here is the equation I get when I weight by a country's adult population:

(% with diabetes)=0.692+0.002*(total available sugar kcal)+0.002*(total available kcal)

Now, the effect of sugar falls dramatically (and most importantly, it is no longer statistically significant at standard levels; the p-value is 0.074). A 100 kcal increase in per-capita sugar availability only increases the % with diabetes by 0.2 (rather than 1.3 as previously estimated). Moreover, total energy from all sources is now significant and roughly the same magnitude as sugar. Thus, what matters in this framework is total kcal from any food source. Moreover this regression suggests that a sugar calorie is roughly the same as any other calorie insofar as affecting diabetes.

The paper at PLoS ONE says "regressions are population weighted." But, I'm wondering that is indeed the case. It could be true. I don't have access to all their data and I'm not including all their controls.

I'm happy to share the data and SAS code with anybody who cares to see it.

********

Addendum

The nice thing about the web is that you get feedback. Here's an update. The source that reports diabetes prevalence actually reported three measures. In the regressions above, I used national prevalence (total number with diabetes divided by total population). However, as indicated at the data source here, they also report some sort of age adjusted measure that is likely more useful in comparing across countries that might have different mean ages.

When I use this "IGT comparative prevalence" measure, as they call it, then I get exactly the opposite of the results mentioned above. When the data are NOT weighted, the sugar coefficient is only 0.0019 (p-value 0.27). But, when the data ARE weighted by adult population, the sugar coefficient is 0.01277 (p-value < 0.001).

So, there is an interesting mix of things going on here between the population, weighting, and age adjustment. Just out of curiosity, and for some robustness checks, I did two things. First, I re-ran the "preferred" model with population weighting using "IGT comparative prevalence" diabetes but included population as an explanatory variable. When I do this, sugar is no longer statistically significant (the estimate is 0.00242 with a p-value of 0.107), but population is (the estimate suggests larger populations have lower diabetes prevalence). I can't quite figure out what is going on here but there has to be something weird going on in the sense that the model is weighting by population and the dependent variable (and independent variables) are per-capita (i.e., are divided by population), that might be producing some unexpected results.

Second, I ran a quantile regression to see how the results hold up at the median (rather than the mean, which is more sensitive to outliers), I find that (using IGT comparative prevalence and adult population as a weight with only sugar and total calories as explanatory vars) the sugar effect, at the median, is 0.0148 but the 95% confidence interval is (-0.0191, 0.0217) when using the SAS default rank method of calculating standard errors. The 95% confidence interval changes to (0.0041, 0.0254) when using an alternative resampling method. So, whether the median effect is statistically significant depends on which method of calculating standard errors is used.

Here is the plot of the "sugar effect" at each quantile. The first shows the 95% confidence intervals determined by the resampling method and the second uses the SAS default (I have to admit that I'm not sure which method is preferred in this case).

A Quasi-Paternalist Takes on Paternalism

March 5, 2013

Cass Sunstein has a really interesting review of Sarah Conly's new book in the New York Times Review of Books. Conly advocates strongly for paternalism in her book: Against Autonomy: Justifying Coercive Paternalism. The interesting thing about this review is that Sunstein had a very popular book promoting his own version of paternalism. Sunstein's version (libertarian paternalism) is admittedly among the least objectionable (though still found several reasons to object in my forthcoming book - the Food Police).

Here are some of Sunstein's key critiques of Conly's work:

in my view, she underestimates the possibility that once all benefits and all costs are considered, we will generally be drawn to approaches that preserve freedom of choice. One reason involves the bluntness of coercive paternalism and the sheer diversity of people’s tastes and situations. Some of us care a great deal about the future, while others focus intensely on today and tomorrow. This difference may make perfect sense in light not of some bias toward the present, but of people’s different economic situations, ages, and valuations. Some people eat a lot more than others, and the reason may not be an absence of willpower or a neglect of long-term goals, but sheer enjoyment of food. Our ends are hardly limited to longevity and health; our short-term goals are a large part of what makes life worth living.

and

Conly favors a paternalism of means, but the line between means and ends can be fuzzy, and there is a risk that well-motivated efforts to promote people’s ends will end up mischaracterizing them.

and

Freedom of choice is an important safeguard against the potential mistakes of even the most well-motivated officials. Conly heavily depends on cost-benefit analysis . . . Officials may well be subject to the same kinds of errors that concern Conly in the first place. If we embrace cost-benefit analysis, we might be inclined to favor freedom of choice as a way of promoting private learning and reflection, avoiding unjustified costs, and (perhaps more important) providing a safety valve in the event of official errors.

Assorted Links

March 4, 2013

Kids are eating fewer calories

Parke Wilde reports on some (not so) funny business with the pork checkoff (I am astounded that this hadn't been reported in the mainstream media)

82 potential causes of obesity (file this one under: "we have no idea what caused the rise in obesity")

Joe Queenan at the WSJ is fed up with studies telling us what is and isn't healthy to eat (my favorite lines were: "The settlers at Jamestown did not come here dreaming of a steady diet of walnuts and olive oil. Those guys loved fats and carbs and sugar. America was built by men and women who never ate arugula.")

Why is eating out more responsive to income than eating at home? Ori Heffetz has an interesting answer to the question in a paper in the Review of Economics and Statistics (an earlier ungated version is here). The answer: eating out is visible to others; eating at home isn't.

Libertarian paternalism at work in school cafeterias (I have to admit that when I read stories like this, I think "libertarian paternalism" is just a fancy way of saying "advertising and promotion")

No Need to Fear the Horse Meat Burger

March 1, 2013

Today, the Oklahoman (the largest newspaper in the state), ran an editorial I wrote on the European horse meat scandal. I also touched on the consequences of the end of horse slaughter in the US. Here are a few snippets:

An expanding European horse meat scandal has left many Americans wondering whether the same could happen here. Americans are unlikely to find a horse burger. Before celebrating, it might do some good to learn why.

Because horse slaughter ended in the US in 2007. The consequences?

Unable to find a home for aged or crippled horses, ranchers faced high prices for euthanasia and disposal. Many horses were abandoned and left to starve. Investigations into horse abuse, for example, increased 60 percent in Colorado following slaughter cessation. Our research suggests that slaughter cessation caused a 36 percent drop in horse prices at a major Oklahoma auction and resulted in losses of $4 million per year in the yearling quarter horse market.

and

Americans are unlikely to find horse meat on their plate because we no longer produce any. It's possible that mislabeled products could be imported, but about 90 percent of the beef eaten by Americans is homegrown. If mislabeled products were found here, the answer wouldn't be, as we've seen, to ban horse slaughter. However much we are culturally predisposed to abhor eating horse, the reality is that it's safe and perfectly tasty. Just ask the French

and:

. . . if a food retailer lies, there are legal remedies. The mere knowledge of liability, not to mention lost reputation, incentivizes truth telling. More vigilance might have stopped the faux beef sellers in Europe. But no government can prevent us from all harm. Nor should we want it to. Vigilance is costly and our governments are already doing too much.

in conclusion

The lesson from these equine scandals isn't necessarily that the government should have been doing more. Rather, politicians should learn what every good horse intuitively knows: Look before you leap.

The Food Police

February 27, 2013

My new book The Food Police: A Well Fed Manifesto about the Politics of Your Plate officially goes one sale April 15, 2013. You can pre-order a copy now in hardcover or kindle or nook.

To whet your appetite, the front and back covers of the book jacket are below

Here is an early review from Kirkus, the book review magazine:

Subscribe to this blog (receive new posts in your email)