<?xml version="1.0" encoding="UTF-8"?>
<rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:wfw="http://wellformedweb.org/CommentAPI/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:atom="http://www.w3.org/2005/Atom"
	xmlns:sy="http://purl.org/rss/1.0/modules/syndication/"
	xmlns:slash="http://purl.org/rss/1.0/modules/slash/"
	>

<channel>
	<title>Messy Matters &#187; Uncategorized</title>
	<atom:link href="http://messymatters.com/category/uncategorized/feed/" rel="self" type="application/rss+xml" />
	<link>http://messymatters.com</link>
	<description>Bring Your Own Data</description>
	<lastBuildDate>Fri, 03 Sep 2010 03:35:54 +0000</lastBuildDate>
	<generator>http://wordpress.org/?v=2.9.1</generator>
	<language>en</language>
	<sy:updatePeriod>hourly</sy:updatePeriod>
	<sy:updateFrequency>1</sy:updateFrequency>
			<item>
		<title>Birds of a Feather Shop Together</title>
		<link>http://messymatters.com/2010/09/01/birdshop/</link>
		<comments>http://messymatters.com/2010/09/01/birdshop/#comments</comments>
		<pubDate>Wed, 01 Sep 2010 04:01:48 +0000</pubDate>
		<dc:creator>Sharad Goel</dc:creator>
				<category><![CDATA[Uncategorized]]></category>
		<category><![CDATA[homophily]]></category>
		<category><![CDATA[influence]]></category>
		<category><![CDATA[social networks]]></category>

		<guid isPermaLink="false">http://messymatters.com/?p=959</guid>
		<description><![CDATA[
(Cross posted at Decision Science News.)
Do you know what the #$*! your social media strategy is? Perhaps it&#8217;s &#8220;to facilitate audience conversations and drive engagement with social currency&#8221;? Or maybe &#8220;to amplify word of mouth by motivating influencers&#8221;? Well, given all the lies and damned lies being told about social, fellow yahoo Dan Goldstein and I decided [...]]]></description>
			<content:encoded><![CDATA[<p><a href="http://messymatters.com/wp-content/uploads/2010/08/birds.jpg"><img class="aligncenter size-full wp-image-960" title="birds" src="http://messymatters.com/wp-content/uploads/2010/08/birds.jpg" alt="" width="369" height="271" /></a></p>
<p>(Cross posted at <a href="http://www.decisionsciencenews.com" target="_blank">Decision Science News</a>.)</p>
<p>Do you know <a href="http://whatthefuckismysocialmediastrategy.com/" target="_blank">what the #$*! your social media strategy is</a>? Perhaps it&#8217;s &#8220;to facilitate audience conversations and drive engagement with social currency&#8221;? Or maybe &#8220;to amplify word of mouth by motivating influencers&#8221;? Well, given all the lies and damned lies being told about social, fellow yahoo <a href="http://www.dangoldstein.com/" target="_blank">Dan Goldstein</a> and I decided to enter the fray with <a href="http://en.wikipedia.org/wiki/Lies,_damned_lies,_and_statistics" target="_blank">statistics</a>. <strong>We measured the extent to which your friends&#8217; behavior predicts your own, and found that in several consumer domains the effect is substantial, complementing traditional demographic and behavioral predictors.</strong></p>
<div class="sidebar">Where there is homophily, one can in principle predict an individual&#8217;s behavior based on the attributes and actions of his or her associates.</div>
<p>That friends are similar along a variety of dimensions is a long-observed empirical regularity&#8212;a pattern sociologists call <a href="http://en.wikipedia.org/wiki/Homophily" target="_blank">homophily</a>. As McPherson et al. write in their canonical <a href="http://arjournals.annualreviews.org/doi/abs/10.1146/annurev.soc.27.1.415" target="_blank">review</a> on the subject, &#8220;homophily limits people’s social worlds in a way that has powerful implications for the information they receive, the attitudes they form, and the interactions they experience.&#8221; Turning this statement around, <strong>where there is homophily, one can in principle predict an individual&#8217;s behavior based on the attributes and actions of his or her associates</strong>.</p>
<p>To assess the quality of such network-based predictions, we merged a large social network (based on email and IM exchanges) with offline sales data at an upscale, national department store chain. Thus, for each of over one million users, we had their past purchase amounts in dollars, and had the same information for each of their network contacts. Think about this for a minute: we not only know how much these individuals themselves spent at an <em>offline</em> retailer, but also how much their social contacts spent, a testament to how profoundly the Internet is changing the way we study human behavior. (Despite bolstering social science research, these newfound tools raise serious privacy issues. We left the matching to a third party that specializes in doing this securely, so neither we nor the department store had access to the other&#8217;s complete customer database.)</p>
<p>The plot below summarizes our findings. First, as indicated by the top line, consumers whose friends spent a lot, also spent a lot themselves, consistent with the hypothesis that homophily extends to consumer behavior. When friends (alters) on average spent $400 during the six-month observation period, the consumer herself (ego) spent nearly $600, more than twice the typical consumer (indicated by the dotted line). As our aim is prediction, however, the relevant question is not just whether friends are similar in their purchasing behavior, but rather how much information is conveyed by social ties relative to other attributes. One might conjecture that ties simply indicate demographic (i.e., age and sex) similarity, that those who spend a lot are more likely to be middle-aged women&#8212;the primary market segment for this department store&#8212;and that friends of middle-aged women tend also to be middle-aged women. To test this hypothesis, we first paired each individual with a randomly chosen consumer of identical age and sex. The bottom line shows that this demographically matched group is, perhaps surprisingly, pretty ordinary. In other words, looking only at age and sex, you can&#8217;t identify consumers whose friends spend a lot (and who we know spend a lot themselves).</p>
<p><a href="http://messymatters.com/wp-content/uploads/2010/08/matched_effects.png"><img class="aligncenter size-full wp-image-974" title="The predictive power of social ties persists even after adjusting for demographics and past purchases." src="http://messymatters.com/wp-content/uploads/2010/08/matched_effects.png" alt="" width="347" height="336" /></a></p>
<p>Though it&#8217;s standard marketing practice to target consumers based on their demographics, it&#8217;s an admittedly noisy profiling technique. So, to put social through the wringer, we next took the &#8220;socially select&#8221; group&#8212;consumers whose friends spent a lot&#8212;and matched them to random consumers with identical age, sex, and past purchases. Each social candidate, that is, was matched to a consumer not only of the same age and sex, but one who spent approximately the same amount as the social candidate during the previous six months. Even relative to this formidable baseline, social cues still provide considerable information. As the middle line indicates, knowing a consumer&#8217;s age, sex, and past purchases, but not that their friends are shopaholics, one would still underestimate their future sales.<a id="causation" href="#causation"><sup>[1]</sup></a></p>
<p>We repeated this analysis for two other domains&#8212;examining signups for <a href="http://football.fantasysports.yahoo.com/" target="_blank">Yahoo! Fantasy Football</a>, and clicks on ten online banner ads for movies, apparel, government programs, and beyond&#8212;again finding that <strong>the predictive power of social persists even after adjusting for age, sex, and past behavior</strong>. Lest you run off to rejigger your social strategy, I should mention a couple of caveats. First, we have shown that consumers with big-spending friends tend to spend a lot themselves&#8212;more, in fact, than their demographics or past purchases alone would suggest. But since most people, even premium customers, don&#8217;t have shopaholic friends, social cues do not substantially boost <em>average</em> predictive performance. Second, though social signals help predict <em>how much</em> consumers spend, they don&#8217;t always help identify <em>which</em> consumers will spend the most. Those who recently spent fifty grand on sartorial elegance are likely to be habitual top spenders, regardless of what you know about their friends.</p>
<p>Assessing the value of social, as with most things, is a messy affair. On the one hand, network ties convey information not captured by the usual egocentric metrics, a conclusion that at the very least I find scientifically interesting. On the other hand, it&#8217;s not immediately obvious how to use that knowledge to take over the world. Well, rest assured that an army of social strategy gurus are waiting in the wings with a game-changing, technology-disrupting way to, you know, <a href="http://whatthefuckismysocialmediastrategy.com/" target="_blank">leverage the social graph to deliver personalized experiences</a> or something.</p>
<p><em>N.B. </em>Thanks to Randall Lewis and <a href="http://www.davidreiley.com">David Reiley</a> for acquiring the sales data, <a href="http://jakehofman.com" target="_blank">Jake Hofman</a> for assembling the email data, and <a href="http://research.yahoo.com/Duncan_Watts" target="_blank">Duncan Watts</a> and <a href="http://ai.eecs.umich.edu/people/dreeves/" target="_blank">Dan Reeves</a> for comments. The plot above was generated with <a href="http://had.co.nz/ggplot2/">ggplot2</a>. For related work in the telecom domain, check out the paper, &#8220;<a href="http://projecteuclid.org/euclid.ss/1154979826" target="_blank">Network-Based Marketing: Identifying Likely Adopters via Consumer Networks</a>,&#8221; by <a href="http://www.wharton.upenn.edu/faculty/hill.cfm" target="_blank">Shawndra Hill</a>, <a href="http://pages.stern.nyu.edu/~fprovost/" target="_blank">Foster Provost</a>, and <a href="http://www2.research.att.com/~volinsky/" target="_blank">Chris Volinsky</a>.</p>
<p><em>Illustration by </em><em><a href="http://krsavage.com" target="_blank">Kelly Savage</a></em></p>
<h3>Footnotes</h3>
<p><a id="TIES" href="#causation">[1]</a> It&#8217;s perhaps tempting to conclude from these results that shopping is contagious (i.e., to assert causation where only correlation has been shown). Though there is probably some truth to that claim, establishing such is neither our objective nor justified from our analysis.</p>
]]></content:encoded>
			<wfw:commentRss>http://messymatters.com/2010/09/01/birdshop/feed/</wfw:commentRss>
		<slash:comments>3</slash:comments>
		</item>
		<item>
		<title>Things That Never Happen</title>
		<link>http://messymatters.com/2010/08/09/strangers/</link>
		<comments>http://messymatters.com/2010/08/09/strangers/#comments</comments>
		<pubDate>Mon, 09 Aug 2010 20:28:45 +0000</pubDate>
		<dc:creator>dreeves</dc:creator>
				<category><![CDATA[Uncategorized]]></category>
		<category><![CDATA[crime]]></category>
		<category><![CDATA[fallacies]]></category>
		<category><![CDATA[kids]]></category>
		<category><![CDATA[myths]]></category>
		<category><![CDATA[security]]></category>

		<guid isPermaLink="false">http://messymatters.com/?p=934</guid>
		<description><![CDATA[
I love this quote from security expert Bruce Schneier:

Remember, if it&#8217;s in the news don&#8217;t worry about it. The very definition of news is &#8220;something that almost never happens.&#8221; When something is so common that it&#8217;s no longer news &#8212; car crashes, domestic violence &#8212; that&#8217;s when you should worry about it.

The truth of that[1] [...]]]></description>
			<content:encoded><![CDATA[<p><a href="http://messymatters.com/wp-content/uploads/2010/08/flyingpigs1.png"><img src="http://messymatters.com/wp-content/uploads/2010/08/flyingpigs1.png" alt="The Counterfactual News Network?" title="The Counterfactual News Network?" width="389" height="275" class="aligncenter size-full wp-image-942" /></a></p>
<p>I love this <a href="http://www.schneier.com/essay-304.html">quote from security expert Bruce Schneier</a>:</p>
<blockquote><p>
Remember, if it&#8217;s in the news don&#8217;t worry about it. The very definition of news is &#8220;something that almost never happens.&#8221; When something is so common that it&#8217;s no longer news &#8212; car crashes, domestic violence &#8212; that&#8217;s when you should worry about it.
</p></blockquote>
<p>The truth of that<a id="refSCHNEIER" href="#SCHNEIER"><sup>[1]</sup></a> hit home recently when I saw a news feature on the abduction of a four-year-old girl from her front yard in Missouri. Candlelight vigils, nation-wide amber alert, police blockades where every single car was stopped and questioned, FBI agents swarming the house. I think the expected reaction from parents is &#8220;oh my god, I need to be so vigilant, even in my own front yard!&#8221;  My reaction was the opposite: Wow, this sort of thing really does essentially <em>never</em> happen.  <a href="http://freerangekids.com">Let the kids run free!</a></p>
<p>I&#8217;m saying this without a drop of irony.  Stranger abductions just don&#8217;t happen. You should worry more about your baby boy dying from complications of circumcision (true).  What you should actually worry about are the real killers of kids, like driving and drowning. Speaking of which, <a href="http://mariovittone.com/2010/05/154/">here&#8217;s a valuable public service announcement about what drowning looks like</a>. (It doesn&#8217;t look like drowning.)</p>
<p>By the way, if you didn&#8217;t hear the news story about the four-year-old girl, it had a happy ending.  She turned up unharmed.</p>
<h2>The Opposite Story: Things That Do Happen</h2>
<p>But speaking of child abductions and drowning, I have to bring up a <a href="http://news.bbc.co.uk/2/hi/uk_news/england/coventry_warwickshire/4837614.stm">brutally ironic news story</a> from several years ago.</p>
<p>In summary, a toddler got separated from her caregivers and wandered off, where a passerby saw her.  Seeing no one else around, his first instinct was to scoop her up and take her with him.  But he decided that that was a big risk to himself &#8212; what would people think? &#8212; and continued on, reporting it later.</p>
<p>The girl then fell in a pond and drowned.</p>
<p>Just to spell out the heart-wrenching irony:  Even though the man made the wrong decision (and should&#8217;ve known it at the time) he was correct that helping that girl was dangerous to him.  He could&#8217;ve been accused of abducting the girl and the accusation, sadly, and insanely, would have been taken very seriously.  That needs to change.  Remember, child abductions by strangers don&#8217;t happen!  Picking up a lost child should not be regarded with any suspicion!  Arrgh!</p>
<div class="sidebar">&#8220;That adult simply is not going to happen to be a child predator! Put that absurd thought out of your mind.&#8221;</div>
<p>This is also why I think these common guidelines parents give their young kids about which kinds of adults to seek help from if they&#8217;re lost (store employees, other moms) are misguided and dangerous.  The best person to get assistance from is the first adult they find.  That adult simply is not going to happen to be a child predator! Put that absurd thought out of your mind.</p>
<p>By the way, the above story happened in the UK which, strangely, seems to be the only place worse than the US with this out-of-control paranoia.   </p>
<p><a href="http://messymatters.com/wp-content/uploads/2010/08/free-candy-van-small.png"><img src="http://messymatters.com/wp-content/uploads/2010/08/free-candy-van-small.png" alt="This image defies comment." title="This image defies comment." width="300" height="112" class="aligncenter size-full wp-image-943" /></a></p>
<h2>But Think of the Children!</h2>
<p><em>Q: Isn&#8217;t some amount of paranoia justified when it comes to children? Shouldn&#8217;t they at least be taught the difference between good strangers and bad strangers?  Shouldn&#8217;t we be vigilant about any possible danger?</em></p>
<p>A: I disagree that children need to learn about &#8220;bad strangers&#8221;.  I teach my kids that strangers are good, period.  This is hard to talk about but it&#8217;s true:  a scary number of kids really do get abused and it&#8217;s rarely by strangers.  Your kids need to know about inappropriate touching but please don&#8217;t muddy that with talk of strangers.  (Can you imagine, &#8220;it&#8217;s ok, I&#8217;m not a stranger&#8230;&#8221; Shudder!)</p>
<p>As for vigilance, it&#8217;s a limited resource: focus your energy on the real dangers.</p>
<p>At risk of getting repetitive I&#8217;ll re-assert my strong belief:  No stranger will ever harm your child.  It&#8217;s a risk (and, fine, if you want to get technical, it&#8217;s a risk, with <em>some</em> positive probability) that you can completely ignore.  In fact, I&#8217;ll go further and say that maximum vigilance requires that you actively dismiss the idea that stangers can be dangerous.  Doing so could prevent, for example, something like the tragically ironic drowning of the girl in that news story.</p>
<p><em>Q: I heard that there were 5 attempted abductions in Central Park last week. What do you say to that?</em> (This was a real response when this came up on a parent list I&#8217;m on, though I doubt the veracity of the claim. The rest of these questions were real too, though most weren&#8217;t phrased as questions.)</p>
<p>A: It&#8217;s hard to even imagine this number but Central Park gets 25 million visitors every year.  With those numbers, impossibly unlikely things will have happened to <em>someone</em> but it doesn&#8217;t necessarily mean you have to worry about them happening to you.  Like when you hear about a meteor crashing through the roof of someone&#8217;s house (that&#8217;s for real, by the way).</p>
<div class="sidebar">&#8220;You have a 1 in 25 million chance of getting killed by a tree if you visit Central Park.&#8221;</div>
<p>Sticking with Central Park, here&#8217;s another real story from this summer: A woman and her baby were posing for a photo under a tree and a branch somehow fell off and killed the child. So that&#8217;s a 1 in 25 million chance of getting killed by a tree if you visit Central Park.  (The appliances in your house are more dangerous than that.)</p>
<p>More to the point though, what&#8217;s an attempted abduction?  Is it <a href="http://www.abc.net.au/news/stories/2010/04/08/2867625.htm">like this</a>?  A stranger talking to a child and the parents freaking out?  Even more to the point, if you hear official numbers on child abductions they sound horrifying, until you learn that they&#8217;re mostly divorced parents in custody battles.  Which should give you pause if you&#8217;re a divorced parent, but again we&#8217;re back to:  know where to focus your vigilance. Think about who has repeated contact with your child, not the stranger in the park.</p>
<p><em>Q: What if child abductions are so rare precisely because parents are so vigilant?</em></p>
<p>A: I&#8217;m not advocating any kind of extreme disregard for children&#8217;s safety.  In fact, I&#8217;m basically advocating raising kids more like how kids were raised a couple generations ago. </p>
<p><em>Q: But that was a different time!</em></p>
<p>A: Yes, it&#8217;s actually safer now.  You wouldn&#8217;t think so from watching the news but <a href="http://www.wolframalpha.com/input/?i=crime">crime</a> has been going down for a couple decades and is currently at 1960s levels in the US.</p>
<p><em>Q: Fine, but what&#8217;s the harm in being cautious?</em></p>
<p>A: Here&#8217;s how Lenore Skenazy of <a href="http://freerangekids.com">Free-Range Kids</a> (highly recommended) puts it:</p>
<blockquote><p>
Children, like chickens, deserve a life outside the cage. The overprotected life is stunting and stifling, not to mention boring for all concerned.
</p></blockquote>
<p><br/></p>
<h3>Straw Exit Poll</h3>
<p><center><br />
<script type="text/javascript" charset="utf-8" src="http://static.polldaddy.com/p/3581918.js"></script><br />
<noscript>[If you can read this, you're supposed to be seeing a straw poll. You probably need to <a href="http://messymatters.com/strangers" onclick="">click through to the post on messymatters</a>.]</noscript><br />
</center></p>
<p>Thanks to <a href="http://messymatters.com/sharad">Sharad Goel</a>, <a href="http://freerangekids.com">Lenore Skenazy</a>, Andrew Reeves, Laurie Reeves, <a href="http://blog.oddhead.com">David Pennock</a>, and many parents at <a href="http://www.harlem4kids.com/">Harlem4Kids</a> for reading drafts of this.  If you liked this article you may like my twitter stream for parents: <a href="http://twitter.com/parentips">twitter.com/parentips</a></p>
<p><em>Image by Kelly Savage</em></p>
<p><br/></p>
<h2>Footnotes</h2>
<p><a id="SCHNEIER" href="#refSCHNEIER">[1]</a> <a href="http://messymatters.com/sharad">Sharad</a> points out that Schneier&#8217;s quote is a little too glib.  The more correct way to say it would be something like &#8220;prominence in the news does not necessarily correlate with the magnitude of risk to you&#8221;.  As <a href="http://xkcd.com/369/">xkcd points out</a>, frequency of occurrence in the news overall probably does correlate with actual risk.  Sharad adds that there are actually prominent news features you should worry (though this is maybe a different sense of &#8220;worry&#8221;) about, for example, war and political/corporate corruption.</p>
]]></content:encoded>
			<wfw:commentRss>http://messymatters.com/2010/08/09/strangers/feed/</wfw:commentRss>
		<slash:comments>16</slash:comments>
		</item>
		<item>
		<title>Inbox Zeroer</title>
		<link>http://messymatters.com/2010/07/31/email/</link>
		<comments>http://messymatters.com/2010/07/31/email/#comments</comments>
		<pubDate>Sun, 01 Aug 2010 02:23:46 +0000</pubDate>
		<dc:creator>dreeves</dc:creator>
				<category><![CDATA[Uncategorized]]></category>
		<category><![CDATA[email]]></category>
		<category><![CDATA[productivity]]></category>

		<guid isPermaLink="false">http://messymatters.com/?p=905</guid>
		<description><![CDATA[
My email is dysfunctional.  I keep things in my inbox because I can&#8217;t afford for them to go out of sight, out of mind &#8212; but then that&#8217;s exactly what happens. They get buried deeper and deeper in my inbox by all the other messages I delusionally think I&#8217;m going to deal with. It&#8217;s [...]]]></description>
			<content:encoded><![CDATA[<p><a href="http://messymatters.com/wp-content/uploads/2010/07/inboxzeromouse.png"><img src="http://messymatters.com/wp-content/uploads/2010/07/inboxzeromouse-246x300.png" alt="A happy critter, impervious to a deluge of email. Note the Inbox Zero nerd merit badge." title="A happy critter, impervious to a deluge of email. Note the Inbox Zero nerd merit badge." width="246" height="300" class="aligncenter size-medium wp-image-925" /></a></p>
<p>My email is dysfunctional.  I keep things in my inbox because I can&#8217;t afford for them to go out of sight, out of mind &#8212; but then that&#8217;s exactly what happens. They get buried deeper and deeper in my inbox by all the other messages I delusionally think I&#8217;m going to deal with. It&#8217;s a given that some things will fall through the cracks.  I just need to make sure I have some control over which things those are.</p>
<p>Let&#8217;s start with a straw poll:<br />
<center><br />
<script type="text/javascript" charset="utf-8" src="http://static.polldaddy.com/p/3538994.js"></script><br />
<noscript>[If you can read this, you're supposed to be seeing a straw poll about the size of your inbox. You probably need to <a href="http://messymatters.com/email">click through to the post on messymatters</a>.]</noscript><br />
</center></p>
<p><a id="EXCUSES"></a>Consider the reasons a message sits in your inbox:</p>
<ul>
<li><em>The To-do Excuse.</em> You&#8217;re going to need to deal with it but you&#8217;re not ready to yet. (Some of this is procrastination, some is legit.)</li>
<li><em>The Waiting Excuse.</em> It&#8217;s something you&#8217;ve replied to but you don&#8217;t want to forget about it in case the recipient drops the ball.  I.e., you <em>might</em> need to deal with it later.</li>
<li><em>The Reference Excuse.</em> It has information that you&#8217;ll want to refer to. (Different than maybe needing to deal with it later in that there&#8217;s nothing for you to remember to do.)</li>
<li><em>The Laziness Excuse.</em> It&#8217;s one of many messages that has outlived its usefulness in your inbox but you haven&#8217;t yet slogged through and archived them all.</li>
</ul>
<p>In other words, it&#8217;s either serving as a to-do item, a might-need-to-do item, an item for future reference, or there&#8217;s no reason for it to be there at all. Or, of course, you haven&#8217;t read it yet.  Which brings us to&#8230;</p>
<p><a href="http://messymatters.com/wp-content/uploads/2010/07/inboxzero_medium.png"><img src="http://messymatters.com/wp-content/uploads/2010/07/inboxzero_medium.png" alt="The nerd merit badge for Inbox Zero, from nerdmeritbadges.com." title="The nerd merit badge for Inbox Zero, from nerdmeritbadges.com." width="150" height="150" class="alignright size-full wp-image-928" /></a></p>
<h2>Inbox Zero</h2>
<p><a id="INBOXZERO"></a><a href="http://jeremy.zawodny.com/blog/archives/009311.html">Inbox Zero</a> is a simple system devised by <a href="http://inboxzero.com">Merlin Mann</a> for keeping your email inbox empty. The idea is that your inbox should really only be for messages you haven&#8217;t actually read. So every time you check your email, do something with every single message.  Specifically, do one of the following:</p>
<ul>
<li>Archive: just get rid of it (originally Delete<a id="refDELETE" href="#DELETE"><sup>[1]</sup></a>).</li>
<li>Delegate: get someone else to deal with it.</li>
<li>Respond: if it takes just a few minutes, fire off a reply right then.<a id="refDELAY" href="#DELAY"><sup>[2]</sup></a></li>
<li>Defer: put it on a to-do list, archive it, and deal with it later.</li>
<li>Do: handle whatever the email actually needs you to do, then archive it.</li>
</ul>
<div class="sidebar">&#8220;If it&#8217;s ok for a message to go out of sight, out of mind &#8212; like because you&#8217;ll go looking for it when you need it &#8212; then archive it immediately.&#8221;</div>
<p>The first one, Archive/Delete, is huge and completely destroys <a href="#EXCUSES">the Reference Excuse</a> for a message to sit in your inbox. If it&#8217;s ok for a message to go out of sight, out of mind &#8212; like because you&#8217;ll go looking for it when you need it &#8212; then archive it immediately. Modern email clients make it easier to search among a million messages than to scroll and scan through a thousand. But even if you don&#8217;t have decent search, you should still get it out of your inbox. Just put it in a folder called Reference and hunt through that when you need to find it.<a id="refFOLDERS" href="#FOLDERS"><sup>[3]</sup></a></p>
<p>The last one, Do, is dandy. It helps with <a href="#EXCUSES">the To-do Excuse</a>. The procrastination part anyway.</p>
<div class="sidebar">&#8220;Delegate and Respond don&#8217;t get things out of your inbox if you need to make sure they get dealt with and follow up if not.&#8221;</div>
<p>It&#8217;s the middle three &#8212; Delegate, Respond, Defer &#8212; that aren&#8217;t actually so helpful for keeping your inbox small. None of them address very well the Waiting Excuse, the Laziness Excuse, or the other half of the To-do Excuse (legitimate need to defer dealing with an email). First, Delegate and Respond don&#8217;t get things out of your inbox if you need to make sure they get dealt with and follow up if not. And Defer also can&#8217;t get a message out of your inbox without some system for transferring it to a to-do list, which is often too much overhead. However easy it is to put something on your to-do list, you&#8217;ve still got to retrieve the email again when it&#8217;s time to deal with it. Chances are you&#8217;ll be inclined to just leave it in your inbox because you&#8217;ll be dealing with it Really Soon.</p>
<p>So of the four excuses for a message to sit in your inbox, only <a href="#EXCUSES">the Reference Excuse</a> has a satisfactory solution. What can be done about the others?</p>
<h2>Proposed Email Features</h2>
<p>What if you had a snooze button for email &#8212; a way to easily make a message disappear from your inbox for 24 hours (or any number of days)? <a href="#EXCUSES">The To-do Excuse</a>:  eliminated.</p>
<p>And what if, when you sent an email or replied to one, you could have your own message re-ping you, i.e., appear in your inbox after a certain amount of time if the recipient failed to follow up? This, by the way, is one of the <a href="http://en.wikipedia.org/wiki/Getting_Things_Done">Getting Things Done (GTD)</a> principles: have systems you can fully trust to remind you of things so you can put them out of your mind in the meantime. With luck the other person will deal with your message or respond in some way that helps move things forward.  If not, the conversation will reappear in your inbox and you&#8217;re back where you started. <a href="#EXCUSES">The Waiting Excuse</a>:  eliminated.</p>
<p>Finally, what if you had a simple way (just a couple keystrokes) to, while reading a message, give it an expiration date, i.e., specify a number of days after which it should automatically leave your inbox and archive itself? <a href="#EXCUSES">The Laziness Excuse</a>:  if not eliminated, at least alleviated.</p>
<h2>Nitty Gritty</h2>
<p>Even if I&#8217;ve convinced you in the abstract that these are good ideas, they hinge on the interface being very quick and easy. So how would they actually work?</p>
<h3>Snooze (eliminates <a href="#EXCUSES">the To-do Excuse</a>; makes it easier to <a href="#INBOXZERO">Defer</a>)</h3>
<p>For any message, type &#8220;sN<enter>&#8221; where N is a number and the message will disappear from your inbox and reappear in N days. If N is omitted the message is snoozed for 24 hours. (Fractional days are allowed, eg, &#8220;s.5&rdquo; snoozes for 12 hours; &#8220;s.1&rdquo; for 2.4 hours; etc.) This is also nice for reminders:  send yourself an email and then snooze it.</p>
<p><em>Anti-procrastination tweak</em>:  It&#8217;s possible you&#8217;ll just keep snoozing a message indefinitely.  Maybe that&#8217;s ok &#8212; at least it keeps being brought back to your attention, solving the out of sight, out of mind problem.  But if it&#8217;s a problem, the snooze button could force you to double the snooze length each time you snooze a message.  That way you will at some point not be able to snooze it again without admitting that you&#8217;re essentially never going to deal with it.</p>
<h3>Re-Ping (eliminates <a href="#EXCUSES">the Waiting Excuse</a>; makes it possible to <a href="#INBOXZERO">Archive/Delete</a> immediately after a <a href="#INBOXZERO">Delegate</a> or <a href="#INBOXZERO">Respond</a>)</h3>
<div class="sidebar">&#8220;By specifying a re-ping you can confidently archive the email after replying&#8230;&#8221;</div>
<p>Have an additional email header along with To, Cc, Bcc, and Subject for outgoing mail:</p>
<pre>
  Re-Ping: N
</pre>
<p>In N days, if there&#8217;s no reply yet, your own email shows up back in your inbox, and you can reply to it to nudge the person with a &#8220;just making sure you got this&#8221;. This could also be treated as just another form of snooze:  &#8220;Put this conversation back in my inbox in N days if no new replies show up before then.&#8221; Again, when you get an email on an important topic, relating to a work project perhaps, you tend not to archive it even after replying because you need to make sure the thread doesn&#8217;t get dropped. By specifying a re-ping you can confidently archive the email after replying since the thread is now guaranteed not to end up out of sight, out of mind.</p>
<p>I propose this as a header because I think it should be part of composing an email and should work identically whether it&#8217;s a reply in an existing thread or a brand new message.</p>
<h3>Auto-Expire (alleviates <a href="#EXCUSES">the Laziness Excuse</a>; less inbox slogging)</h3>
<p>Type &#8220;eN<enter>&#8221; and the message will be automatically archived in N days. This is roughly the inverse of Snooze. Alarm clocks sometimes call this a Sleep feature where you can have the radio play for a certain amount of time and then automatically turn off. Here, if you get an email about some event in a couple weeks you can say &#8220;auto-expire 14&rdquo; and it will stay in your inbox for 2 weeks, and then silently, automatically go away. It&#8217;s not that it takes much more time to clean up no-longer-relevant emails from your inbox once in a while. But when you get the email is when it has your attention. If you decide to leave it in your inbox you might as well schedule the archiving you&#8217;ll have to do in a couple weeks anyway. </p>
<p>It might even be valuable to cultivate the habit of setting an auto-expire for every message as you read it.<a id="refEXP" href="#EXP"><sup>[4]</sup></a>  Auto-expire 0 is the special case of archiving right now and auto-expire 999 (i.e., almost 3 years) would be the equivalent of choosing to just leave it in your inbox.  Most messages probably have a much shorter horizon of possible relevance. If done right, you might never have to do any painful inbox housekeeping.</p>
<h2>Conclusion</h2>
<p>These features &#8212; Snooze, Re-Ping, and Auto-Expire &#8212; offer a quick way to confidently get stuff out of your inbox. Archive+search is a huge step in that direction, but many of us clearly need more than that. You might complain that Snooze and Re-Ping only temporarily remove things from your inbox, but it&#8217;s better than that. First, if you snooze everything till closer to when you&#8217;ll be acting on it, that alone shrinks the average size of your inbox a lot. Second, it will often be the case that when the message reappears it will be much easier to decide to just archive it. Maybe you delegated something and it actually got done.  Or there was some event coming up but now that it&#8217;s closer you know that you have other plans.</p>
<div class="sidebar">&#8220;&#8230;it saves having to open up and think about a message twice.&#8221;</div>
<p>The Snooze and Re-Ping features allow you to almost instantly get a message out of your inbox and out of your mind, while still having full faith that it will re-appear in time for you to take necessary action. Auto-Expire spreads out the work of inbox housekeeping.  It helps keep down the size of your inbox to the extent that you put off the task of archiving messages that are no longer relevant. Whether this actually increases efficiency is an empirical question but the rationale is that it saves having to open up and think about a message twice. While you&#8217;re reading an email about a conference coming up in a couple months you can set it to auto-expire in 90 days. You&#8217;ve thus ensured that soon after the conference is over, that email will no longer be cluttering your inbox &#8212; with zero further action from you.</p>
<p>The only one of these I&#8217;ve actually tried so far is Snooze, which I hacked up in Pine (that&#8217;s right, I mainly use Pine for my personal email). I do sometimes find myself resnoozing indefinitely, so I probably need the anti-procrastination tweak mentioned above. But I believe (hope) that by adding Re-Ping and Auto-Expire I will <a href="http://beeminder.com/d/inbox">finally achieve</a> Inbox Zero Nirvana.</p>
<p><br/></p>
<p>Thanks to <a href="http://melzafit.com">Melanie Reeves</a>, <a href="http://twitter.com/dyng">David Yang</a>, <a href="http://www.google.com/profiles/michael.tiffany">Michael Tiffany</a>, <a href="http://www.patrickrjordan.com/">Patrick Jordan</a>, <a href="http://blog.oddhead.com">Dave Pennock</a>, <a href="http://kevinlochner.com/">Kevin Lochner</a>, <a href="http://www.mahdian.org">Mohammad Mahdian</a>, <a href="http://homepages.nyu.edu/~ef820/">Ed Fu</a>, <a href="http://messymatters.com/sharad">Sharad Goel</a>, and Samson Yeung for reading drafts of this.  I&#8217;ve also gotten some great <a href="http://news.ycombinator.com/item?id=1563923">comments on Hacker News</a>. Thank you!</p>
<p><em>Image by <a href="http://krsavage.com">Kelly Savage</a>.</em></p>
<p><br/></p>
<p><b>PS:</b> Since writing this I&#8217;ve learned of two relevant startups:  <a href="http://etacts.com">Etacts.com</a> is trying to solve the Re-Ping problem, among other things.  And <a href="http://hitmelater.com">HitMeLater.com</a> implements email snooze. I would prefer these to be simple client-side features though.</p>
<p><br/><br />
<br/></p>
<h2>Appendix: Notes on How to Hack Snooze, Re-Ping, and Auto-Expire into Email Clients</h2>
<h3>Snooze</h3>
<p>When you hit sN, where N is a real number, a client-side script applies the label &#8220;snooze-N&#8221; (or puts it in a folder called such) to the message and removes it from the inbox. A daemon checks all such labeled emails every so often and removes the label and puts it back in the inbox (applies the &#8220;Inbox&#8221; label) when the email&#8217;s date is >= N days in the past. Note that this means a proliferation of labels/folders of the form &#8220;snooze-N&#8221;. An open question is whether, when putting something back in the inbox, to also mark it unread. If you&#8217;re succeeding in keeping your inbox very small then this would be unnecessary.</p>
<h3>Re-Ping</h3>
<p>There are two parts to the hackery here: creating the additional field, &#8220;Re-Ping:&#8221;, in the compose headers, and keeping track of whether there has been a reply. It should be fine if the Re-Ping header is included in the actual outgoing message. For the actual re-pinging, a partial solution in Gmail is easy due to its thread-centricness.  Don&#8217;t actually check if there was a reply, just apply the Inbox label after the specified amount of time.  If a reply is already in the inbox, fine.  This works because a whole thread (conversation) is either in the inbox or not.  In fact, all labels are applied to threads, not individual messages.</p>
<p>The only thing missing with the above hack is that you may have archived the conversation after a reply came in but before the re-ping time.  In that case you don&#8217;t actually want the conversation to reappear in your inbox.  So to do this right you have to actually keep track of each outgoing message with a re-ping set and do the re-ping conditional on no reply.</p>
<h3>Auto-Expire</h3>
<p>Implementation-wise this is basically the inverse of Snooze.  Label the email &#8220;auto-archive-N&#8221; and remove it from the inbox at the appropriate time.</p>
<p><br/><br />
<br/></p>
<h2>Footnotes</h2>
<p><a id="DELETE" href="#refDELETE">[1]</a> I&#8217;ll always archive and never delete a message, simply because the cognitive cost of deciding that I&#8217;m certain that I&#8217;ll never need to look at it again is too high. Consider the economics:  How much time will you save, cumulatively, by just always hitting &#8220;archive&#8221; without thinking about whether you might ever need to look at it again? And how much will it cost you to have your archive folder full of junk? The additional storage probably costs nothing, and if your email client&#8217;s search feature is decent it won&#8217;t slow down your searches either. In other words, there&#8217;s only upside.  And that doesn&#8217;t even factor in the chance of accidentally deleting something important!</p>
<p><a id="DELAY" href="#refDELAY">[2]</a> If you sometimes delay responses so as not to seem overeager, or because you want your reply to arrive during business hours, or any number of other reasons, check out <a href="http://www.uncommonprojects.com/uplog/2007/09/24/scheduled-email-schemail/">this clever idea</a>. I think my proposed Snooze feature also partly solves this problem.</p>
<p><a id="FOLDERS" href="#refFOLDERS">[3]</a> I&#8217;m anti-folders for the same reason I never delete anything. I don&#8217;t want to waste time deciding what to do with a message that I might never look at again. Everything goes in one folder and that&#8217;s where I go to find anything.</p>
<p><a id="EXP" href="#refEXP">[4]</a> It would be even better if people set expiration dates on their own outgoing mail, but I can&#8217;t think of a way to get that convention to catch on.</p>
]]></content:encoded>
			<wfw:commentRss>http://messymatters.com/2010/07/31/email/feed/</wfw:commentRss>
		<slash:comments>8</slash:comments>
		</item>
		<item>
		<title>What Your Friends Don&#8217;t Know About You</title>
		<link>http://messymatters.com/2010/06/30/friendsense/</link>
		<comments>http://messymatters.com/2010/06/30/friendsense/#comments</comments>
		<pubDate>Thu, 01 Jul 2010 03:56:37 +0000</pubDate>
		<dc:creator>Sharad Goel</dc:creator>
				<category><![CDATA[Uncategorized]]></category>
		<category><![CDATA[homophily]]></category>
		<category><![CDATA[politics]]></category>
		<category><![CDATA[social networks]]></category>

		<guid isPermaLink="false">http://messymatters.com/?p=859</guid>
		<description><![CDATA[
How well do you know your friends&#8217; political views? According to recent work by Winter Mason, Duncan Watts, and myself, you probably don&#8217;t know them as well as you think. In particular, we found that when friends disagree on a political issue, they are unaware of that disagreement about 60% of the time. Even close friends who [...]]]></description>
			<content:encoded><![CDATA[<p><a href="http://messymatters.com/wp-content/uploads/2010/06/friends.gif"><img class="aligncenter size-full wp-image-866" title="Elephants and donkeys. This is a little subtle!" src="http://messymatters.com/wp-content/uploads/2010/06/friends.gif" alt="Elephants and donkeys. This is a little subtle!" width="362" height="269" /></a></p>
<p>How well do you know your friends&#8217; political views? According to <a href="http://messymatters.com/sharad/papers/friendsense.pdf">recent work</a> by <a href="http://smallsocialsystems.com/" target="_blank">Winter Mason</a>, <a href="http://research.yahoo.com/Duncan_Watts" target="_blank">Duncan Watts</a>, and myself, you probably don&#8217;t know them as well as you think. In particular, we found that <strong>when friends disagree on a political issue, they are unaware of that disagreement about 60% of the time. Even close friends who discuss politics are typically unaware of their differences in opinions.</strong></p>
<p><a href="http://en.wikipedia.org/wiki/Homophily" target="_blank">Homophily</a>, the tendency for individuals to associate with similar others, is one of the more consistently observed phenomena of the social world. While reports of homophily have focused on socio-demographic attributes&#8212;like age, gender, and race&#8212;it&#8217;s reasonable that homophily would extend to attitudes as well, in part because individuals likely seek out those who agree with them, and in part because social pressure often breeds conformity. However, <em>measuring</em> the extent to which friends agree with one another is a tough empirical problem, and quantifying the differences between actual and perceived homophily was, until recently, prohibitively difficult.</p>
<p>Enter Facebook, the social science experimentation platform with a subject pool of <a href="http://www.facebook.com/press/info.php?statistics" target="_blank">400 million strong</a> (and <a href="http://blog.comscore.com/2010/01/strong_year_for_facebook.html" target="_blank">growing</a>). To examine the differences between real and perceived attitude agreement, we built a Facebook game where players both answered a series of questions about their own attitudes, and attempted to guess how particular friends would respond. We collected 12,000+ responses in which we had a player&#8217;s answer to a given question, their friend&#8217;s answer to that same question, and the player&#8217;s guess as to what their friend would say.</p>
<p>The figure below shows that <strong>people consistently overestimate the likelihood that their friends agree with them on political issues</strong>. Notably, even though close friends (so-called strong ties<a id="refTIES" href="#TIES"><sup>[1]</sup></a>) are in reality more likely to agree with one another than distant friends, people do not appropriately adjust their perceptions. In other words, <strong>though we think close and distant friends are about equally likely to agree with us on political issues, in reality we are much more likely to agree with close friends</strong>.</p>
<p><a href="http://messymatters.com/wp-content/uploads/2010/06/agreement.gif"><img class="aligncenter size-full wp-image-862" title="agreement" src="http://messymatters.com/wp-content/uploads/2010/06/agreement.gif" alt="" width="300" height="300" /></a></p>
<p>Upon reflection, it&#8217;s perhaps not so surprising that we often don&#8217;t know where our friends stand on hot-button political issues. How many times have you actually asked your friends how they feel about abortion, capital punishment, or affirmative action? Inferring friends&#8217; attitudes is further complicated by the fact that political positions tend to be only weakly correlated across issues. For example, according to the 2008 <a href="http://www.norc.org/GSS+Website/" target="_blank">General Social Survey</a>, 65% of people who support abortion rights also support capital punishment, compared to 68% among those who oppose abortions. That is, someone&#8217;s position on abortion tells you nearly nothing about their view on capital punishment.</p>
<p>I think society would benefit from friends and neighbors engaging in more active political deliberation, discussions that at the very least leave participants aware of each other&#8217;s views. On the other hand, if your friends really knew what you thought, you might have a lot fewer friends.</p>
<p><em>N.B.</em> For more details, check out our <a href="http://messymatters.com/sharad/papers/friendsense.pdf">paper</a>.</p>
<p><em>Illustration by </em><em><a href="http://krsavage.com" target="_blank">Kelly Savage</a></em></p>
<h3>Footnotes</h3>
<p><a id="TIES" href="#refTIES">[1]</a> We define the tie strength between a pair of individuals by considering the number of mutual friends that they have, and whether or not the pair discusses politics.</p>
]]></content:encoded>
			<wfw:commentRss>http://messymatters.com/2010/06/30/friendsense/feed/</wfw:commentRss>
		<slash:comments>6</slash:comments>
		</item>
		<item>
		<title>Extremely Weak Cryptography: Rot13 for Numbers</title>
		<link>http://messymatters.com/2010/05/31/sealedbids/</link>
		<comments>http://messymatters.com/2010/05/31/sealedbids/#comments</comments>
		<pubDate>Tue, 01 Jun 2010 03:00:50 +0000</pubDate>
		<dc:creator>dreeves</dc:creator>
				<category><![CDATA[Uncategorized]]></category>
		<category><![CDATA[auctions]]></category>
		<category><![CDATA[crypto]]></category>
		<category><![CDATA[mechanism design]]></category>
		<category><![CDATA[sealed bids]]></category>
		<category><![CDATA[yootles]]></category>

		<guid isPermaLink="false">http://messymatters.com/?p=640</guid>
		<description><![CDATA[
The idea of rot13 is to obscure text, for example, to prevent spoilers. It&#8217;s not meant to be cryptographically secure but simply to make sure that only people who are sure they want to read something will read it.
&#8220;It&#8217;s handy when coordinating by text message when one or both parties are about to get on [...]]]></description>
			<content:encoded><![CDATA[<p><img src="http://messymatters.com/wp-content/uploads/2010/02/crypt-300x211.png" alt="Two people giving (encrypted) arguments about various encryption schemes. ... 'EBG13 vf terng!' ... 'Double ROT13 is better!'" title="Two people giving (encrypted) arguments about various encryption schemes. ... 'EBG13 vf terng!' ... 'Double ROT13 is better!'" width="300" height="211" class="aligncenter size-medium wp-image-641" /></p>
<p>The idea of <a href="http://rot13.com">rot13</a> is to obscure text, for example, to prevent spoilers. It&#8217;s not meant to be cryptographically secure but simply to make sure that only people who are sure they want to read something will read it.</p>
<div class="sidebar">&#8220;It&#8217;s handy when coordinating by text message when one or both parties are about to get on the subway.&#8221;</div>
<p>We, being pretty extreme nerds, sometimes find ourselves wanting to do something similar for numbers.  This is typically in the context of a sealed-bid auction where you want to submit a bid that the other party can unseal whenever they&#8217;re ready, with no further coordination needed.  Most recently, Sharad and I had to decide who would be presenting our <a href="http://messymatters.com/predmarkets">Prediction Without Markets</a> paper at the upcoming <a href="http://www.sigecom.org/ec10/">ACM Ecommerce conference</a>.  To determine who wanted to present more (i.e., who minded least), I sent him my sealed bid (how much I would pay him, at most, to present) and told him to unseal it when he had committed to his own bid.  (Eliminating the need for further coordination was especially useful here because Sharad was about to get on a plane.  It&#8217;s similarly handy when coordinating by text message when one or both parties are about to get on the subway.)</p>
<p>The outcome in that case was that Sharad bid lower and so I paid him that lower bid to be the one to present our paper.</p>
<div class="sidebar">&#8220;Note the assumption that the recipient is being trusted not to cheat.&#8221;</div>
<p>In general what we want is a way for me to send you a number and trust you to pick your own number, uninfluenced by mine. And then you should be able to reveal mine when you&#8217;re ready, without requiring further input from me or any third party. Note the assumption that the recipient is being trusted not to cheat.</p>
<p>So how do we do that?  It&#8217;s not as simple as rot13 because certain numbers, like 1 and 2, will recur often enough that you might remember that, say, &#8220;gjb&#8221; is really 2.</p>
<p>To be more specific, we seek a function, seal(), that maps a real number to a real number (or a string). It should <i>not</i> be deterministic &#8212; seal(7) should not map to the same thing every time. But the corresponding function, reveal(), should be deterministic &#8212; reveal(seal(x)) should equal x for all x.</p>
<p>For example: </p>
<pre>
   &gt; seal(7)              # Some random-seeming
   429964                 # number or string.

   &gt; seal(7)              # A completely different
   749932                 # random-seeming number.

   &gt; reveal(seal(7))      # Revealing always uncovers
   7                      # the original number.
</pre>
<p>Can you think of a way to implement seal() and reveal()?</p>
<h2>How to Seal and Reveal a Secret Number</h2>
<p>There are <a href="http://stackoverflow.com/questions/809137/rot13-for-numbers">many ways</a>, it turns out, to accomplish this but here, in lay terms, is an elegant scheme that can be carried out easily with a pocket calculator:</p>
<p>Pick a random number (from your head is fine) between 9 and 99 and multiply it by 9999, then add your secret number. This will yield a 5 or 6-digit number that encodes your secret number. To decode it, divide by 9999, subtract the part to the left of the decimal point, then multiply by 9999. (This is known to children and mathematicians as &#8220;finding the remainder when dividing by 9999&#8221; and &#8220;mod&#8217;ing by 9999&#8221;, respectively.)</p>
<p>This works for nonnegative numbers less than 9999 (if that&#8217;s not enough, use 99999 or as many nines as you want). If you want to allow negative numbers, then the magic 9999 number needs to be twice the biggest possible number. And when decoding, if the result is greater than half of 9999, i.e., 5000 or more, then subtract 9999 to get the actual (negative) number.</p>
<p>Again, this is not a true <a href="http://en.wikipedia.org/wiki/Commitment_scheme">commitment scheme</a>. But for cases where the honor system suffices and the only goal is to not be influenced by each other&#8217;s number, we find this quite handy.</p>
<h2>Pseudocode</h2>
<pre>
   M = 9999  # Numbers between -M/2 and M/2 can be sealed.

   seal(x): M * randInt(9, 99) + x

   reveal(x): m = mod(x, M);
              if m &gt; M/2 return m - M else return m
</pre>
</p>
<p><i>Image by <a href="http://www.moserware.com">Jeff Moser</a>.</i></p>
<p><i>Thanks to <a href="http://stackoverflow.com/users/31615/svante">Svante v. Erichsen</a> for the idea behind this scheme, and to a dozen other <a href="http://stackoverflow.com/questions/809137/rot13-for-numbers">Stack Overflow</a> users for alternative schemes.</i></p>
]]></content:encoded>
			<wfw:commentRss>http://messymatters.com/2010/05/31/sealedbids/feed/</wfw:commentRss>
		<slash:comments>8</slash:comments>
		</item>
		<item>
		<title>Misleading Means: On Drugs and Discrimination</title>
		<link>http://messymatters.com/2010/04/30/misleading-means/</link>
		<comments>http://messymatters.com/2010/04/30/misleading-means/#comments</comments>
		<pubDate>Fri, 30 Apr 2010 11:15:31 +0000</pubDate>
		<dc:creator>Sharad Goel</dc:creator>
				<category><![CDATA[Uncategorized]]></category>
		<category><![CDATA[bias]]></category>
		<category><![CDATA[discrimination]]></category>
		<category><![CDATA[RDS]]></category>
		<category><![CDATA[variance]]></category>

		<guid isPermaLink="false">http://messymatters.com/?p=745</guid>
		<description><![CDATA[
How do you estimate the prevalence of HIV among drug addicts? In a perfect (though somewhat Orwellian) world you&#8217;d have a giant list of all drug users, and you&#8217;d randomly select a subset of them to test for HIV. Alas, the world is messy, and we&#8217;re left to less rigorous methods, for example time-location sampling, [...]]]></description>
			<content:encoded><![CDATA[<p><a href="http://messymatters.com/wp-content/uploads/2010/04/diffusion.jpeg"><img class="aligncenter size-full wp-image-749" title="diffusion" src="http://messymatters.com/wp-content/uploads/2010/04/diffusion.jpeg" alt="" width="335" height="242" /></a></p>
<p>How do you estimate the prevalence of HIV among drug addicts? In a perfect (though somewhat Orwellian) world you&#8217;d have a giant list of all drug users, and you&#8217;d randomly select a subset of them to test for HIV. Alas, the world is messy, and we&#8217;re left to <a href="http://www.respondentdrivensampling.org/reports/AIDS_2005.pdf">less rigorous methods</a>, for example time-location sampling, where surveyors recruit drug users from the local &#8220;shooting gallery&#8221; (heroin users&#8217; version of a crack house) at randomly selected times.</p>
<p>A few years ago, sociologists <a href="http://www.soc.cornell.edu/faculty/heckathorn.html" target="_blank">Doug Heckathorn</a> and <a href="http://www.princeton.edu/~mjs3/" target="_blank">Matthew Salganik</a> proposed a new approach to surveying these so-called hard-to-reach or hidden populations: <a href="http://www.respondentdrivensampling.org/" target="_blank">respondent-driven sampling</a>. RDS is a variant of snowball sampling&#8212;where existing study participants recruit the next wave of participants, usually in exchange for payment&#8212;with a few clever tweaks. Matt and Doug realized that popular drug users (i.e., those with a lot of friends) are disproportionately likely to be in the final snowball sample, for the simple reason that popular users know more people who could potentially recruit them. In RDS, recruits are thus down-weighted by the number of <a href="http://pubs.amstat.org/doi/abs/10.1198/jasa.2009.ap08518" target="_blank">people they know</a>. In just a few short years, RDS has already been applied in more than 20 countries, and it is currently used by the <a href="http://www.cdc.gov/" target="_blank">Centers for Disease Control and Prevention</a> (CDC) to help track the HIV epidemic in the United States. Now for the bad news: Matt and I just wrote a <a href="http://messymatters.com/sharad/papers/assessing-rds.pdf">paper</a> showing that <strong>RDS may not be suitable for key aspects of public health surveillance where it is now extensively applied</strong>.</p>
<p>Our critique of RDS boils down to a simple empirical reality: <strong>outcomes are typically not typical</strong>. To give an example, while the mean <a href="http://en.wikipedia.org/wiki/Household_income_in_the_United_States" target="_blank">household income</a> in the United States is around $60K, half of households earn either less than $30K or more than $100K. In other words, households typically have incomes that are far from typical. So what does this have to do with RDS? The trick of down-weighting popular drug users guarantees&#8212;albeit under <a href="http://arxiv.org/abs/0904.1855" target="_blank">strong assumptions</a>&#8212;that RDS will on average yield the true infection level. (In statistical parlance, RDS is an asymptotically <a href="http://en.wikipedia.org/wiki/Bias_of_an_estimator" target="_blank">unbiased estimator</a>.) In the RDS community, this theoretical property was widely interpreted as meaning that RDS is a generally reliable method of estimation. What we show is that even in situations where RDS estimates are correct on average, these estimates are typically so far from the truth that meaningful statistical inference is difficult. To clarify the point, suppose we estimate the mean age in New York by   sampling one person uniformly at random. Even though our estimate would be perfect on  average (i.e., unbiased), it could hardly be called accurate. Since RDS participants tend to know, and hence recruit, people who are similar to themselves, this caricatured example is actually not so far from the <a href="http://messymatters.com/sharad/papers/assessing-rds.pdf">truth</a> about why RDS estimates are so volatile.</p>
<p>I suspect that this tendency to neglect the variance also plays a role in discriminatory attitudes. Consider the following statistic, culled from the 2008 <a href="http://www.norc.org/GSS+Website/" target="_blank">General Social Survey</a>: white Americans on average watch 2.8 hours of television daily, compared to 4.4 hours watched on average by black Americans. The difference in group means is large, both on a relative scale&#8212;blacks on average watch nearly 60% more television than whites&#8212;and on an absolute scale&#8212;1.6 hours a day. (Citing a similar statistic, conservative radio host, and Rush Limbaugh protege, <a href="http://www.michaelmedved.com/" target="_blank">Michael Medved</a> has <a href="http://www.wnd.com/index.php?pageId=13623" target="_blank">pontificated</a> on the subject.) Now, if you&#8217;re walking down the street and you see a random black person and a random white person, how likely is it that the black person watches more TV? Given the large group differences, you might be tempted to conclude it&#8217;s pretty likely. As it turns out, however, individual variation is so large that it&#8217;s basically a coin flip (about 60%) as to who watches more TV&#8212;in other words, race is a <a href="http://en.wikipedia.org/wiki/Cue_validity" target="_blank">poor cue</a>. The plot below makes this phenomenon visually apparent: despite the substantial difference in group means, whites and blacks have distributions of TV watching that are tough to distinguish. Just as in RDS, the means are  misleading.</p>
<p style="text-align: center;"><a href="http://messymatters.com/wp-content/uploads/2010/04/tv.png"><img class="aligncenter size-full wp-image-810" title="TV viewing habits for whites and blacks" src="http://messymatters.com/wp-content/uploads/2010/04/tv.png" alt="" width="570" height="285" /></a><em> </em></p>
<div id="magicdomid13">In  fictional <a href="http://en.wikipedia.org/wiki/Lake_Wobegon" target="_blank">Lake Wobegon</a>, all the children are above average. Though that&#8217;s not <a href="http://agoraphilia.blogspot.com/2007/01/more-than-50-percent-can-be-above.html" target="_blank">quite true</a> in the real world, it actually is the case that most people are far from average.</div>
<p>NB: For more details on RDS, check out our <a href="http://messymatters.com/sharad/papers/assessing-rds.pdf">paper</a>.</p>
<p><em>Illustration by <a href="http://krsavage.com" target="_blank">Kelly Savage</a>: A stylized depiction of an RDS recruitment tree, where nodes correspond to sample members and links indicate who recruited whom. Based on an RDS study of drug users in New York City, presented in <a href="http://www.ncbi.nlm.nih.gov/pmc/articles/PMC2527186/" target="_blank">Abdul-Quader, et al, 2006.</a><br />
</em></p>
]]></content:encoded>
			<wfw:commentRss>http://messymatters.com/2010/04/30/misleading-means/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Yes, You Are (Maybe) Overconfident</title>
		<link>http://messymatters.com/2010/03/31/calibration-results/</link>
		<comments>http://messymatters.com/2010/03/31/calibration-results/#comments</comments>
		<pubDate>Thu, 01 Apr 2010 03:57:57 +0000</pubDate>
		<dc:creator>dreeves</dc:creator>
				<category><![CDATA[Uncategorized]]></category>
		<category><![CDATA[calibration]]></category>
		<category><![CDATA[decision theory]]></category>
		<category><![CDATA[overcoming bias]]></category>
		<category><![CDATA[overconfidence]]></category>
		<category><![CDATA[prediction]]></category>
		<category><![CDATA[psychology]]></category>
		<category><![CDATA[rationality]]></category>
		<category><![CDATA[wisdom of crowds]]></category>

		<guid isPermaLink="false">http://messymatters.com/?p=715</guid>
		<description><![CDATA[
218 of you took our calibration quiz, not counting the 10% of submissions that had to be thrown out for not being complete or giving ranges with the min greater than the max or other sanity check failures. (Here&#8217;s the raw data.)
The bad news is that you&#8217;re terrible at making 90% confidence intervals. For example, [...]]]></description>
			<content:encoded><![CDATA[<p><a href="http://messymatters.com/wp-content/uploads/2010/03/stumped.jpg"><img src="http://messymatters.com/wp-content/uploads/2010/03/stumped-300x199.jpg" alt="" title="Confidence intervals are hard. Let&#039;s go shopping." width="300" height="199" class="aligncenter size-medium wp-image-716" /></a></p>
<p>218 of you took our <a href="http://messymatters.com/calibration">calibration quiz</a>, not counting the 10% of submissions that had to be thrown out for not being complete or giving ranges with the min greater than the max or other sanity check failures. (<a href="http://messymatters.com/calib-quiz/quiz-results.txt">Here&#8217;s the raw data.</a>)</p>
<p>The bad news is that you&#8217;re terrible at making 90% confidence intervals. For example, not a single person had all 10 of their intervals contain the true answer, which, if everyone were perfectly calibrated, should&#8217;ve happened by chance to 35% of you. Getting less than 6 good intervals should, statistically, not have happened to anyone. How many actually had 5 or fewer good intervals?  76% of you.</p>
<p>Here&#8217;s a histogram of the number of good intervals you got, out of 10:</p>
<p><a href="http://messymatters.com/wp-content/uploads/2010/03/histwi.png"><img src="http://messymatters.com/wp-content/uploads/2010/03/histwi.png" alt="" title="Histogram of number of good intervals out of 10." width="360" height="229" class="aligncenter size-full wp-image-717" /></a></p>
<div class="sidebar">When you give a purported 90% confidence interval there&#8217;s only a 41% chance that your interval contains the true answer.</div>
<p>The overlaid phantom histogram is what it would look like if it were really the case that every interval people gave had a 90% chance of containing the true answer. In other words, you should&#8217;ve made your intervals much wider. When we ask for a 90% confidence interval there&#8217;s in fact only a 41% chance that your interval contains the true answer.</p>
<p>We ran this quiz on <a href="http://mturk.com">Mechanical Turk</a> as well and you marginally outperformed the turkers. The histogram of turkers&#8217; good intervals is indicated by the red dots in the above graph. They failed our sanity checks at almost twice the rate (19%) of Messy Matters readers and of the remaining responses, the mean number of good intervals was 3.5 out of 10.</p>
<p>The more we&#8217;ve thought about (and read the literature on &#8212; or rather, consulted endlessly with <a href="http://www.dangoldstein.com/">Dan Goldstein</a>, who knows the literature on) these kinds of overconfidence results, however, the less clear it is that the moral of this quiz is simply &#8220;people are overconfident&#8221;. For one thing, overconfidence depends on the question. The fraction of good intervals in your responses ranged from 23% (the length of the Nile and the gestation period of an Asian elephant) to 75% (number of OPEC countries). Of course, even 75% is not the 90% that was asked for.</p>
<p>More interestingly, in an ongoing follow-up study on Mechanical Turk we&#8217;re finding that after you get people&#8217;s intervals, more than half of them realize in retrospect that too few of their intervals are good. This suggests that people can learn to perform much better at this task.</p>
<h3>Obligatory Wisdom of Crowds Demonstration</h3>
<p>It&#8217;s not a fair demonstration since people weren&#8217;t asked for their best guesses, but here&#8217;s a table of median lower bounds, upper bounds, and midpoints of everyone&#8217;s ranges. Interestingly, people&#8217;s upper bounds are overall most accurate.</p>
<style>
table {
  border:1px solid black;  
  border-collapse: collapse;
  font: 12px arial,helvetica,sans-serif;
}  
th {
  font-size: 1.1em;
}  
td,th {
  margin:0px;
  padding: 5px;
  border:1px solid black;
}
</style>
<table>
<tr>
<th></th>
<th>MLK</th>
<th>Nile</th>
<th>OPEC</th>
<th>Bible</th>
<th>Moon</th>
<th>747</th>
<th>Mozart</th>
<th>Elephant</th>
<th>Tokyo</th>
<th>Ocean</th>
</tr>
<tr>
<td><b>True</b></td>
<td>39</td>
<td>4132</td>
<td>12</td>
<td>39</td>
<td>2160</td>
<td>390000</td>
<td>1756</td>
<td>645</td>
<td>5959</td>
<td>35994</td>
</tr>
<tr>
<td><b>Min</b></td>
<td>35</td>
<td>900</td>
<td>6</td>
<td>8</td>
<td>1000</td>
<td>20000</td>
<td>1700</td>
<td>180</td>
<td>5000</td>
<td>13500</td>
</tr>
<tr>
<td><b>Mid</b></td>
<td>45</td>
<td>1750</td>
<td>13</td>
<td>15</td>
<td>3500</td>
<td>63250</td>
<td>1725</td>
<td>320</td>
<td>8000</td>
<td>30000</td>
</tr>
<tr>
<td><b>Max</b></td>
<td>55</td>
<td>3000</td>
<td>20</td>
<td>20</td>
<td>5000</td>
<td>100000</td>
<td>1790</td>
<td>400</td>
<td>10000</td>
<td>40000</td>
</tr>
</table>
<p><br/><br/></p>
<p><em>Thanks to <a href="http://www.cam.cornell.edu/~sharad/">Sharad Goel</a>, <a href="http://www.dangoldstein.com/">Dan Goldstein</a>, <a href="http://bethany.pirateship.org/">Bethany Soule</a>, <a href="http://twitter.com/dakami">Dan Kaminsky</a>, and <a href="http://www.google.com/profiles/michael.tiffany">Michael J.J. Tiffany</a>.</em></p>
<p><em>Image: Kelly Savage</em></p>
]]></content:encoded>
			<wfw:commentRss>http://messymatters.com/2010/03/31/calibration-results/feed/</wfw:commentRss>
		<slash:comments>3</slash:comments>
		</item>
		<item>
		<title>Are You Overconfident?</title>
		<link>http://messymatters.com/2010/02/28/calibration/</link>
		<comments>http://messymatters.com/2010/02/28/calibration/#comments</comments>
		<pubDate>Mon, 01 Mar 2010 04:52:20 +0000</pubDate>
		<dc:creator>dreeves</dc:creator>
				<category><![CDATA[Uncategorized]]></category>
		<category><![CDATA[calibration]]></category>
		<category><![CDATA[decision theory]]></category>
		<category><![CDATA[overconfidence]]></category>
		<category><![CDATA[prediction]]></category>
		<category><![CDATA[rationality]]></category>
		<category><![CDATA[wisdom of crowds]]></category>

		<guid isPermaLink="false">http://messymatters.com/?p=672</guid>
		<description><![CDATA[
We shall now find out if Messy Matters readers are smarter than Mechanical Turkers.  For each of the questions below, provide a numerical range that you are 90% sure contains the correct answer. In particular, if you have “no idea” then give a very wide range; and if you happen to be quite certain [...]]]></description>
			<content:encoded><![CDATA[<p><a href="http://messymatters.com/wp-content/uploads/2010/02/moon.jpg"><img class="aligncenter size-medium wp-image-673" title="How to estimate the distance from the Earth to the Moon, if you're the Little Prince." src="http://messymatters.com/wp-content/uploads/2010/02/moon-300x231.jpg" alt="How to estimate the distance from the Earth to the Moon, if you're the Little Prince." width="300" height="231" /></a></p>
<p>We shall now find out if Messy Matters readers are smarter than <a href="http://mturk.com">Mechanical Turkers</a>.  For each of the questions below, provide a numerical range that you are 90% sure contains the correct answer. In particular, if you have “no idea” then give a very wide range; and if you happen to be quite certain then give a narrow range.</p>
<p>If you are perfectly calibrated then 90% of your intervals (no more, no less) should contain the right answers.</p>
<p><strong>Please try your best, but don&#8217;t look up the answers!</strong></p>
<p><iframe src="http://messymatters.com/calib-quiz" width=470 height=700></iframe><br />
(If the quiz doesn&#8217;t show up in your news reader, <a href="http://messymatters.com/calibration">click through to the post on messymatters.com</a>.)</p>
<p>We&#8217;ll discuss the results in the next post.</p>
<p>PS: <a href="http://messymatters.com/2010/03/31/calibration-results/">Here are the results.</a></p>
<p><i>Quiz from <a href="http://www.amazon.com/Decision-Traps-J-Edward-Russo/dp/0385248350">Decision Traps</a> by Russo and Shoemaker.</i><br />
<i>Image by Kelly Savage.</i></p>
]]></content:encoded>
			<wfw:commentRss>http://messymatters.com/2010/02/28/calibration/feed/</wfw:commentRss>
		<slash:comments>10</slash:comments>
		</item>
		<item>
		<title>Prediction Without Markets</title>
		<link>http://messymatters.com/2010/01/14/prediction-without-markets/</link>
		<comments>http://messymatters.com/2010/01/14/prediction-without-markets/#comments</comments>
		<pubDate>Thu, 14 Jan 2010 06:28:44 +0000</pubDate>
		<dc:creator>Sharad Goel</dc:creator>
				<category><![CDATA[Uncategorized]]></category>
		<category><![CDATA[economics]]></category>
		<category><![CDATA[fallacies]]></category>
		<category><![CDATA[prediction]]></category>
		<category><![CDATA[sports]]></category>

		<guid isPermaLink="false">http://messymatters.com/?p=581</guid>
		<description><![CDATA[
In the 2008 Summer Olympics Usain Bolt ran 100 meters in 9.69 seconds, earning the gold medal and garnering the international attention that comes with being the &#8220;fastest man in the world.&#8221; While Bolt became a household name, his competitors didn&#8217;t fare nearly as well: far fewer people know that Richard Thompson and Walter Dix [...]]]></description>
			<content:encoded><![CDATA[<p><a href="http://messymatters.com/wp-content/uploads/2010/01/photo-finish.jpg"><img class="aligncenter size-full wp-image-625" title="photo-finish" src="http://messymatters.com/wp-content/uploads/2010/01/photo-finish.jpg" alt="photo-finish" width="388" height="273" /></a></p>
<p><span>In the 2008 Summer Olympics <a href="http://en.wikipedia.org/wiki/Usain_Bolt" target="_blank">Usain Bolt</a> ran 100 meters in <a href="http://sports.yahoo.com/olympics/beijing/track_field/results/ATM001/full" target="_blank">9.69 seconds</a>, earning the gold medal and garnering the international attention that comes with being the &#8220;<a href="http://en.wikipedia.org/wiki/100_metres" target="_blank">fastest man in the world</a>.&#8221; While Bolt became a household name, his competitors didn&#8217;t fare nearly as well: far fewer people know that <a href="http://en.wikipedia.org/wiki/Richard_Thompson_(athlete)" target="_blank">Richard Thompson</a> and <a href="http://en.wikipedia.org/wiki/Walter_Dix" target="_blank">Walter Dix</a> received silver and bronze, and I suspect that 8th place <a href="http://en.wikipedia.org/wiki/Darvis_Patton" target="_blank">Darvis Patton</a> is practically unknown outside the sprinting world. The 340 milliseconds that separated Bolt from Patton&#8212;the <a href="http://en.wikipedia.org/wiki/Blink">duration of a blink of the eye</a>&#8212;was the difference between celebrity and obscurity. <strong>While a fascination with rank is perhaps justifiable for sports, such focus&#8212;let&#8217;s call it the Gold Medal Syndrome&#8212;is often problematic in statistical analysis.</strong></span></p>
<p><span> </span>Consider the case of prediction markets. In these markets, participants buy and sell securities that realize a value based on the occurrence of some future outcome, such as the result of an election, the box office revenue of an upcoming film, or the market share of a new product. For example, the day before the 2008 U.S. presidential election you could have paid $0.92 for a contract in the <a href="http://www.biz.uiowa.edu/iem">Iowa Electronic Markets</a> that yielded $1 when Barack Obama won, implying a 92% market-estimated probability that Obama would win. There are compelling theoretical reasons to expect prediction markets to outperform all other available forecasting methods. As formalized by the <a href="http://en.wikipedia.org/wiki/Efficient-market_hypothesis" target="_blank">efficient-market hypothesis</a>, if there were some way to beat the market then at least one savvy trader would presumably exploit that advantage to make money; hence, market prices should update to eliminate any performance disparity.</p>
<p>Inspired by such theoretical arguments, and also by a growing body of empirical findings that show markets beat alternatives, several researchers have called for widespread application of prediction markets to real-world business strategy and policy development problems. In a 2007 <a href="http://online.wsj.com/article/SB117885086047199534.html" target="_blank">Wall Street Journal op-ed</a>, economists Robert Hahn and Paul Tetlock write:</p>
<blockquote><p><em>&#8220;Imagine the president had a crystal ball to predict more accurately the impact of broader prescription coverage on the Medicare budget, the effect of more frequent audits on tax compliance&#8212;or even the consequences of a political settlement in Iraq on oil prices. Now, stop imagining: Such crystal balls [prediction markets] are within our grasp.&#8221;</em></p></blockquote>
<p>The work on which these appeals are based, however, primarily addresses the relative ranking of prediction methods. By contrast, the magnitude of the differences in question has received much less attention, and as such, it remains unclear whether the performance improvement associated with prediction markets is meaningful from a practical perspective.</p>
<p><strong>In a new <a href="http://messymatters.com/sharad/papers/pred-wo-markets.pdf">study</a>, <a href="http://ai.eecs.umich.edu/people/dreeves/" target="_blank">Daniel Reeves</a>, <a href="http://research.yahoo.com/Duncan_Watts" target="_blank">Duncan Watts</a>, <a href="http://dpennock.com/">Dave Pennock</a> and I compare the performance of prediction markets to conventional means of forecasting, namely polls and statistical models. Examining thousands of sporting and movie events, we find that the relative advantage of prediction markets is remarkably small.</strong> For example, the Las Vegas market for professional football is only 3% more accurate in predicting final game scores than a simple, three parameter statistical model, and the market is only 1% better than a poll of football enthusiasts. The plot below shows how the three methods perform on the complementary task of estimating the probability the home team wins.</p>
<p><span><a href="http://messymatters.com/wp-content/uploads/2010/01/mm_calibration.png"><img class="aligncenter size-full wp-image-598" title="mm_calibration" src="http://messymatters.com/wp-content/uploads/2010/01/mm_calibration.png" alt="mm_calibration" width="504" height="186" /></a></span></p>
<p>Given that sports and entertainment markets are among the most mature and successful, our results challenge the view that prediction markets are substantively superior to alternative forecasting mechanisms. Nevertheless, it is certainly possible that there are forecasting applications where either the relative advantage of markets is larger, or that such differences in performance are consequential. <strong>Thus, while prediction markets may yet prove to be useful, it would seem the enthusiasm for their predictive prowess has outpaced the evidence.</strong></p>
<p>NB: Check out our <a href="http://messymatters.com/sharad/papers/pred-wo-markets.pdf" target="_blank">paper</a> for more details.</p>
<p><em>Illustration by Kelly Savage</em></p>
]]></content:encoded>
			<wfw:commentRss>http://messymatters.com/2010/01/14/prediction-without-markets/feed/</wfw:commentRss>
		<slash:comments>16</slash:comments>
		</item>
		<item>
		<title>Scroogenomics vs Ulterior Motives (and Other Justifications for Gift Giving)</title>
		<link>http://messymatters.com/2009/12/31/scrooge/</link>
		<comments>http://messymatters.com/2009/12/31/scrooge/#comments</comments>
		<pubDate>Fri, 01 Jan 2010 04:57:09 +0000</pubDate>
		<dc:creator>dreeves</dc:creator>
				<category><![CDATA[Uncategorized]]></category>
		<category><![CDATA[christmas]]></category>
		<category><![CDATA[economics]]></category>
		<category><![CDATA[social efficiency]]></category>

		<guid isPermaLink="false">http://messymatters.com/?p=552</guid>
		<description><![CDATA[
As a poser economist (as Jeff Ely calls me), I love to complain about the social inefficiency of gift giving. It&#8217;s a terrible idea, guys! We waste 13 billion dollars a year on it! [1] But I do appreciate that I have a tendency to pooh-pooh social conventions far too blithely.  Nonetheless, social conventions [...]]]></description>
			<content:encoded><![CDATA[<p><img class="aligncenter size-medium wp-image-564" src="http://messymatters.com/wp-content/uploads/2009/12/scrooge_v_santa-300x219.jpg" alt="scrooge vs santa" title="Scrooge and Santa fighting" width="300" height="219" /></p>
<p>As a <a href="http://cheeptalk.wordpress.com/2009/12/23/the-real-economics-of-holiday-gift-giving/">poser economist</a> (as Jeff Ely calls me), I love to complain about the social inefficiency of gift giving. It&#8217;s a terrible idea, guys! <a href="http://scroogenomics.com">We waste 13 billion dollars a year on it!</a> <a id="refONE" href="#ONE"><sup>[1]</sup></a> But I do appreciate that I have a tendency to <a href="http://xkcd.com/592/">pooh-pooh social conventions far too blithely</a>.  Nonetheless, social conventions change.  Drastically so, given enough time.  As <a href="http://marketdesigner.blogspot.com/2009/12/give-cash-next-year.html">Al Roth points out</a>, wedding registries, gift receipts, and, more <a href="http://www.nytimes.com/2009/12/12/your-money/credit-and-debit-cards/12money.html?_r=1">regrettably, gift cards</a> represent a shift towards cash as gifts, and thus greater efficiency.</p>
<p>But as much as I applaud this trend and agree that holiday gift giving makes us all poorer and generally worse off, I&#8217;m sure Joel Waldfogel&#8217;s estimate (that the value we get from gifts is 25 cents on the dollar) is overblown. So, in the spirit of the holiday season, here are some reasons why gift giving isn&#8217;t quite so bad as it seems:</p>
<ol>
<li> <strong>Ulterior Motives</strong>. When put that way it doesn&#8217;t exactly sound holiday spirited but I think ulterior motives is the most overlooked and perhaps most legitimate justification for gift giving.  If I give you a ticket to see a show with me because I really want to see it with you, then it really doesn&#8217;t matter how much you would&#8217;ve paid for that ticket yourself.  I wanted your company and this is a socially acceptable way for me to pay for it. <a id="refTWO" href="#TWO"><sup>[2]</sup></a> Homer Simpson giving Marge a bowling ball (or a power screwdriver or somesuch) is another example.  Nothing&#8217;s wasted if Homer gets enough use out of those gifts himself! A more common example is giving one&#8217;s girlfriend lingerie.  But the ulterior motives need not be selfish, exactly.  It might well be socially efficient for me to buy you a book you wouldn&#8217;t have bought, if I have a lot of value for discussing it with you.</li>
<p/>
<li> <strong>Signaling Value</strong>.  I don&#8217;t pretend to understand the intricacies of this, but presumably it&#8217;s socially useful to prove our thoughtfulness.  We may also find it socially valuable to be mutually indebted to each other (and in ways hard to quantify).</li>
<p/>
<li> <strong>The Ritual</strong>.  The process itself may have value to us, including the process of figuring out each other&#8217;s desires well enough to buy gifts.</li>
<p/>
<li> <strong>Extrospection</strong>.  In rare cases you might actually know the recipient&#8217;s utility function better than they do.  Usually this is ill-advised, from an efficiency standpoint.  Why not just recommend your new favorite music to your friend and let them decide whether to buy it?  The one case where extrospection is a slam dunk, of course, is giving gifts to small children (all the other excuses for gift giving &#8212; except perhaps number 2 &#8212; also apply in this case).</li>
<p/>
<li> <strong>Transactional Efficiency</strong>. Related to number 4, it may be easier for you to get your friend something than it is for them to get it.  Maybe you&#8217;re buying a book for yourself and it&#8217;s just as easy to get a second copy.  Or you may have access to something the recipient doesn&#8217;t &#8212; for example, if you&#8217;re visiting a foreign country.</li>
</ol>
<p>The last two don&#8217;t really count, of course, since Waldfogel&#8217;s analysis accounts for those. But the first three may put a legitimate dent in that 13 billion dollar deadweight loss.</p>
<p><strong>Added:</strong> <a href="http://www.cs.duke.edu/~conitzer">Vincent Conitzer</a> adds a great one in the comments:</p>
<ol start="6">
<li><strong>Plausible Deniability</strong>.  There are some things that you just have embarrassingly high utility for.  And &#8220;hey, it was a gift!&#8221; can be a nice airtight excuse.</li>
</ol>
<p/>
&nbsp;</p>
<p/>
<h3>Addendum: Other Inefficiencies of the Season</h3>
<p>Sharad pointed out that &#8220;after-Christmas sale&#8221; is a euphemism for &#8220;pre-Christmas price-gouging.&#8221;  This is not an inefficiency if you include the utility of the retailers but from consumers&#8217; perspective, it&#8217;s inefficient to concentrate demand around a single day of the year.  Bah humbug!</p>
<p>I recently encountered an inefficiency related to holiday travel.  Many outbound NYC flights were canceled the weekend before Christmas due to a snowstorm.  People were generally rebooked a few days later.  But on Monday everything was back to normal.  I know because I flew out of New York on Monday.  My aunt was supposed to fly out on Saturday but got stranded until Tuesday.  She clearly wanted to depart on Monday more than we did.  It&#8217;s a shame that there&#8217;s essentially no secondary market for airline tickets! (Though there is in a very limited sense: You can volunteer to be bumped and the airline will compensate you with travel vouchers, worth deceptively little. But naturally the airlines are not bumping willing travelers in order to rebook people sooner when their original flights were canceled.)</p>
<p>
&nbsp;
</p>
<h3>Footnotes</h3>
<p><a id="ONE" href="#refONE">[1]</a> That&#8217;s not a claim that we <em>spend</em> $13B/year on gifts.  We spend 17 billion, 13 billion of which is pure waste. That&#8217;s Joel Waldfogel&#8217;s estimate of the difference between what we spend on holiday gifts and what the recipients would have been willing to spend to buy those items themselves.</p>
<p><a id="TWO" href="#refTWO">[2]</a> Alright, it sounds much ickier when you put it that way!</p>
<p/>
<p><em>Thanks to <a href="http://paul.rutgers.edu/~strehl/">Alex Strehl</a>, Eleanor Strehl, <a href="http://www.davidreiley.com">David Reiley</a>, Jill Sazama, <a href="http://bethany.pirateship.org">Bethany Soule</a>, and <a href="http://www.cam.cornell.edu/~sharad/">Sharad Goel</a> for helpful discussion.</em></p>
<p><em>Illustration by Kelly Savage</em></p>
]]></content:encoded>
			<wfw:commentRss>http://messymatters.com/2009/12/31/scrooge/feed/</wfw:commentRss>
		<slash:comments>2</slash:comments>
		</item>
	</channel>
</rss>
