<?xml version="1.0" encoding="UTF-8"?>
<rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:wfw="http://wellformedweb.org/CommentAPI/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:atom="http://www.w3.org/2005/Atom"
	xmlns:sy="http://purl.org/rss/1.0/modules/syndication/"
	xmlns:slash="http://purl.org/rss/1.0/modules/slash/"
	xmlns:media="http://search.yahoo.com/mrss/"
	>

<channel>
	<title>What's new</title>
	<atom:link href="http://terrytao.wordpress.com/feed/" rel="self" type="application/rss+xml" />
	<link>http://terrytao.wordpress.com</link>
	<description>Updates on my research and expository papers, discussion of open problems, and other maths-related topics.  By Terence Tao</description>
	<lastBuildDate>Thu, 09 Jul 2009 16:28:17 +0000</lastBuildDate>
	<generator>http://wordpress.com/</generator>
	<language>en</language>
	<sy:updatePeriod>hourly</sy:updatePeriod>
	<sy:updateFrequency>1</sy:updateFrequency>
	<image>
		<url>http://www.gravatar.com/blavatar/9ecc6262507441727044ba5428d2d2b7?s=96&#038;d=http://s.wordpress.com/i/buttonw-com.png</url>
		<title>What's new</title>
		<link>http://terrytao.wordpress.com</link>
	</image>
			<item>
		<title>DHJ: Writing the second paper III.</title>
		<link>http://terrytao.wordpress.com/2009/07/09/dhj-writing-the-second-paper-iii/</link>
		<comments>http://terrytao.wordpress.com/2009/07/09/dhj-writing-the-second-paper-iii/#comments</comments>
		<pubDate>Thu, 09 Jul 2009 16:28:16 +0000</pubDate>
		<dc:creator>Terence Tao</dc:creator>
				<category><![CDATA[math.CO]]></category>
		<category><![CDATA[polymath1]]></category>

		<guid isPermaLink="false">http://terrytao.wordpress.com/?p=2398</guid>
		<description><![CDATA[This is a continuation of the preceding two threads here on the polymath1 project, which are now full.  We are closing on on the computation of the sixth Moser number &#8211; the size of the largest subset of the six-dimensional cube  that does not contain any lines: it is either 353 or 354, and [...]<img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=terrytao.wordpress.com&blog=817149&post=2398&subd=terrytao&ref=&feed=1" />]]></description>
			<content:encoded><![CDATA[<div class='snap_preview'><br /><p>This is a continuation of the <a href="http://terrytao.wordpress.com/2009/05/22/dhj-writing-the-second-paper/">preceding</a> <a href="http://terrytao.wordpress.com/2009/06/14/dhj-still-writing-the-second-paper/">two</a> threads here on the <a href="http://michaelnielsen.org/polymath1/index.php?title=Main_Page">polymath1 project</a>, which are now full.  We are closing on on the computation of the sixth Moser number &#8211; the size of the largest subset of the six-dimensional cube <img src='http://s3.wordpress.com/latex.php?latex=%5C%7B1%2C2%2C3%5C%7D%5E6&#038;bg=ffffff&#038;fg=545454&#038;s=0' alt='\{1,2,3\}^6' title='\{1,2,3\}^6' class='latex' /> that does not contain any lines: it is either 353 or 354, and 354 looks close to being eliminated soon.</p>
<p>Besides this, the nominal purpose of this thread is to coordinate the writing of the <a href="http://michaelnielsen.org/polymath1/index.php?title=Outline_of_second_paper">second paper</a> of the project.  In addition to incorporating whatever results we get from the six-dimensional Moser problem, a lot of work still has to be done on other sections of the paper, notably the higher k component and the component on Fujimura&#8217;s problem, as well as the appendices.</p>
  <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gocomments/terrytao.wordpress.com/2398/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/comments/terrytao.wordpress.com/2398/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/godelicious/terrytao.wordpress.com/2398/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/delicious/terrytao.wordpress.com/2398/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gostumble/terrytao.wordpress.com/2398/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/stumble/terrytao.wordpress.com/2398/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/godigg/terrytao.wordpress.com/2398/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/digg/terrytao.wordpress.com/2398/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/goreddit/terrytao.wordpress.com/2398/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/reddit/terrytao.wordpress.com/2398/" /></a> <img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=terrytao.wordpress.com&blog=817149&post=2398&subd=terrytao&ref=&feed=1" /></div>]]></content:encoded>
			<wfw:commentRss>http://terrytao.wordpress.com/2009/07/09/dhj-writing-the-second-paper-iii/feed/</wfw:commentRss>
		<slash:comments>5</slash:comments>
	
		<media:content url="http://1.gravatar.com/avatar/3c795880f3b73784a9b75fbff3772701?s=96&#38;d=identicon" medium="image">
			<media:title type="html">Terry</media:title>
		</media:content>
	</item>
		<item>
		<title>Benford&#8217;s law, Zipf&#8217;s law, and the Pareto distribution</title>
		<link>http://terrytao.wordpress.com/2009/07/03/benfords-law-zipfs-law-and-the-pareto-distribution/</link>
		<comments>http://terrytao.wordpress.com/2009/07/03/benfords-law-zipfs-law-and-the-pareto-distribution/#comments</comments>
		<pubDate>Sat, 04 Jul 2009 03:27:18 +0000</pubDate>
		<dc:creator>Terence Tao</dc:creator>
				<category><![CDATA[expository]]></category>
		<category><![CDATA[math.PR]]></category>
		<category><![CDATA[math.ST]]></category>
		<category><![CDATA[non-technical]]></category>
		<category><![CDATA[Benford's law]]></category>
		<category><![CDATA[Pareto distribution]]></category>
		<category><![CDATA[randomness]]></category>
		<category><![CDATA[Zipf's law]]></category>

		<guid isPermaLink="false">http://terrytao.wordpress.com/?p=2305</guid>
		<description><![CDATA[
 A remarkable phenomenon in probability theory is that of universality &#8211; that many seemingly unrelated probability distributions, which ostensibly involve large numbers of unknown parameters, can end up converging to a universal law that may only depend on a small handful of parameters. One of the most famous examples of the universality phenomenon is [...]<img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=terrytao.wordpress.com&blog=817149&post=2305&subd=terrytao&ref=&feed=1" />]]></description>
			<content:encoded><![CDATA[<div class='snap_preview'><br /><p>
 A remarkable phenomenon in probability theory is that of <em>universality</em> &#8211; that many seemingly unrelated probability distributions, which ostensibly involve large numbers of unknown parameters, can end up converging to a universal law that may only depend on a small handful of parameters. One of the most famous examples of the universality phenomenon is the <a href="http://en.wikipedia.org/wiki/Central_limit_theorem">central limit theorem</a>; another rich source of examples comes from <a href="http://en.wikipedia.org/wiki/Random_matrix_theory">random matrix theory</a>, which is one of the areas of my own research.
</p>
<p>
Analogous universality phenomena also show up in <em>empirical</em> distributions &#8211; the distributions of a statistic <img src='http://s2.wordpress.com/latex.php?latex=%7BX%7D&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='{X}' title='{X}' class='latex' /> from a large population of &#8220;real-world&#8221; objects. Examples include <a href="http://en.wikipedia.org/wiki/Benford&#37;27s_law">Benford&#8217;s law</a>, <a href="http://en.wikipedia.org/wiki/Zipf&#37;27s_law">Zipf&#8217;s law</a>, and the <a href="http://en.wikipedia.org/wiki/Pareto_distribution">Pareto distribution</a> (of which the <a href="http://en.wikipedia.org/wiki/Pareto_principle">Pareto principle</a> or <em>80-20 law</em> is a special case). These laws govern the asymptotic distribution of many statistics <img src='http://s3.wordpress.com/latex.php?latex=%7BX%7D&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='{X}' title='{X}' class='latex' /> which </p>
<ul>
<li> (i) take values as positive numbers; </li>
<li> (ii) range over many different orders of magnitude; </li>
<li> (iiii) arise from a complicated combination of largely independent factors (with different samples of <img src='http://s1.wordpress.com/latex.php?latex=%7BX%7D&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='{X}' title='{X}' class='latex' /> arising from different independent factors); and </li>
<li> (iv) have not been artificially rounded, truncated, or otherwise constrained in size.
</li>
</ul>
<p>
Examples here include the population of countries or cities, the frequency of occurrence of words in a language, the mass of astronomical objects, or the net worth of individuals or corporations. The laws are then as follows:
</p>
<p><ul>
<li> <b>Benford&#8217;s law:</b> For <img src='http://s2.wordpress.com/latex.php?latex=%7Bk%3D1%2C%5Cldots%2C9%7D&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='{k=1,\ldots,9}' title='{k=1,\ldots,9}' class='latex' />, the proportion of <img src='http://s3.wordpress.com/latex.php?latex=%7BX%7D&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='{X}' title='{X}' class='latex' /> whose first digit is <img src='http://s1.wordpress.com/latex.php?latex=%7Bk%7D&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='{k}' title='{k}' class='latex' /> is approximately <img src='http://s2.wordpress.com/latex.php?latex=%7B%5Clog_%7B10%7D+%5Cfrac%7Bk%2B1%7D%7Bk%7D%7D&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='{\log_{10} \frac{k+1}{k}}' title='{\log_{10} \frac{k+1}{k}}' class='latex' />. Thus, for instance, <img src='http://s3.wordpress.com/latex.php?latex=%7BX%7D&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='{X}' title='{X}' class='latex' /> should have a first digit of <img src='http://s1.wordpress.com/latex.php?latex=%7B1%7D&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='{1}' title='{1}' class='latex' /> about <img src='http://s2.wordpress.com/latex.php?latex=%7B30%5C%25%7D&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='{30\%}' title='{30\%}' class='latex' /> of the time, but a first digit of <img src='http://s3.wordpress.com/latex.php?latex=%7B9%7D&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='{9}' title='{9}' class='latex' /> only about <img src='http://s1.wordpress.com/latex.php?latex=%7B5%5C%25%7D&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='{5\%}' title='{5\%}' class='latex' /> of the time. </li>
<li> <b>Zipf&#8217;s law:</b> The <img src='http://s2.wordpress.com/latex.php?latex=%7Bn%5E%7Bth%7D%7D&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='{n^{th}}' title='{n^{th}}' class='latex' /> largest value of <img src='http://s3.wordpress.com/latex.php?latex=%7BX%7D&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='{X}' title='{X}' class='latex' /> should obey an approximate power law, i.e. it should be approximately <img src='http://s1.wordpress.com/latex.php?latex=%7BC+n%5E%7B-%5Calpha%7D%7D&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='{C n^{-\alpha}}' title='{C n^{-\alpha}}' class='latex' /> for the first few <img src='http://s2.wordpress.com/latex.php?latex=%7Bn%3D1%2C2%2C3%2C%5Cldots%7D&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='{n=1,2,3,\ldots}' title='{n=1,2,3,\ldots}' class='latex' /> and some parameters <img src='http://s3.wordpress.com/latex.php?latex=%7BC%2C+%5Calpha+%3E+0%7D&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='{C, \alpha &gt; 0}' title='{C, \alpha &gt; 0}' class='latex' />. In many cases, <img src='http://s1.wordpress.com/latex.php?latex=%7B%5Calpha%7D&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='{\alpha}' title='{\alpha}' class='latex' /> is close to <img src='http://s2.wordpress.com/latex.php?latex=%7B1%7D&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='{1}' title='{1}' class='latex' />. </li>
<li> <b>Pareto distribution:</b> The proportion of <img src='http://s3.wordpress.com/latex.php?latex=%7BX%7D&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='{X}' title='{X}' class='latex' /> with at least <img src='http://s1.wordpress.com/latex.php?latex=%7Bm%7D&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='{m}' title='{m}' class='latex' /> digits (before the decimal point), where <img src='http://s2.wordpress.com/latex.php?latex=%7Bm%7D&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='{m}' title='{m}' class='latex' /> is above the median number of digits, should obey an approximate exponential law, i.e. be approximately of the form <img src='http://s3.wordpress.com/latex.php?latex=%7Bc+10%5E%7B-m%2F%5Calpha%7D%7D&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='{c 10^{-m/\alpha}}' title='{c 10^{-m/\alpha}}' class='latex' /> for some <img src='http://s1.wordpress.com/latex.php?latex=%7Bc%2C+%5Calpha+%3E+0%7D&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='{c, \alpha &gt; 0}' title='{c, \alpha &gt; 0}' class='latex' />. Again, in many cases <img src='http://s2.wordpress.com/latex.php?latex=%7B%5Calpha%7D&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='{\alpha}' title='{\alpha}' class='latex' /> is close to <img src='http://s3.wordpress.com/latex.php?latex=%7B1%7D&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='{1}' title='{1}' class='latex' />.
</li>
</ul>
<p>
Benford&#8217;s law and Pareto distribution are stated here for base <img src='http://s1.wordpress.com/latex.php?latex=%7B10%7D&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='{10}' title='{10}' class='latex' />, which is what we are most familiar with, but the laws hold for any base (after replacing all the occurrences of <img src='http://s2.wordpress.com/latex.php?latex=%7B10%7D&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='{10}' title='{10}' class='latex' /> in the above laws with the new base, of course). The laws tend to break down if the hypotheses (i)-(iv) are dropped. For instance, if the statistic <img src='http://s3.wordpress.com/latex.php?latex=%7BX%7D&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='{X}' title='{X}' class='latex' /> concentrates around its mean (as opposed to being spread over many orders of magnitude), then the <a href="http://en.wikipedia.org/wiki/Normal_distribution">normal distribution</a> tends to be a much better model (as indicated by such results as the central limit theorem). If instead the various samples of the statistics are highly correlated with each other, then other laws can arise (for instance, the eigenvalues of a random matrix, as well as many empirically observed matrices, are correlated to each other, with the behaviour of the largest eigenvalues being governed by laws such as the <em>Tracy-Widom law</em> rather than Zipf&#8217;s law, and the bulk distribution being governed by laws such as the <a href="http://en.wikipedia.org/wiki/Wigner_semicircle_distribution">semicircular law</a> rather than the normal or Pareto distributions).
</p>
<p>
To illustrate these laws, let us take as a data set the populations of 235 countries and regions of the world in 2007 (using the <a href="http://www.umsl.edu/services/govdocs/wofact2007/index.html">CIA world factbook</a>); I have put the raw data <a href="http://spreadsheets.google.com/pub?key=rj_3TkLJrrVuvOXkijCHelQ&amp;output=html">here</a>. This is a relatively small sample (cf. <a href="http://terrytao.wordpress.com/2008/10/10/small-samples-and-the-margin-of-error/">my previous post</a>), but is already enough to discern these laws in action. For instance, here is how the data set tracks with Benford&#8217;s law (rounded to three significant figures):
</p>
<p><table align="center">
<tr>
<td align="left"> <img src='http://s1.wordpress.com/latex.php?latex=%7Bk%7D&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='{k}' title='{k}' class='latex' /> </td>
<td align="left"> Countries </td>
<td align="left"> Number </td>
<td align="left"> Benford prediction </td>
</tr>
<tr>
<td align="left"> 1 </td>
<td align="left"> Angola, Anguilla, Aruba, Bangladesh, Belgium, Botswana, Brazil, Burkina Faso, Cambodia, Cameroon, Chad, Chile, China, Christmas Island, Cook Islands, Cuba, Czech Republic, Ecuador, Estonia, Gabon, (The) Gambia, Greece, Guam, Guatemala, Guinea-Bissau, India, Japan, Kazakhstan, Kiribati, Malawi, Mali, Mauritius, Mexico, (Federated States of) Micronesia, Nauru, Netherlands, Niger, Nigeria, Niue, Pakistan, Portugal, Russia, Rwanda, Saint Lucia, Saint Vincent and the Grenadines, Senegal, Serbia, Swaziland, Syria, Timor-Leste (East-Timor), Tokelau, Tonga, Trinidad and Tobago, Tunisia, Tuvalu, (U.S.) Virgin Islands, Wallis and Futuna, Zambia, Zimbabwe </td>
<td align="left"> 59 (<img src='http://s2.wordpress.com/latex.php?latex=%7B25.1%5C%25%7D&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='{25.1\%}' title='{25.1\%}' class='latex' />) </td>
<td align="left"> 71 (<img src='http://s3.wordpress.com/latex.php?latex=%7B30.1%5C%25%7D&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='{30.1\%}' title='{30.1\%}' class='latex' />) </td>
</tr>
<tr>
<td align="left"> 2 </td>
<td align="left"> Armenia, Australia, Barbados, British Virgin Islands, Cote d&#8217;Ivoire, French Polynesia, Ghana, Gibraltar, Indonesia, Iraq, Jamaica, (North) Korea, Kosovo, Kuwait, Latvia, Lesotho, Macedonia, Madagascar, Malaysia, Mayotte, Mongolia, Mozambique, Namibia, Nepal, Netherlands Antilles, New Caledonia Norfolk Island, Palau, Peru, Romania, Saint Martin, Samoa, San Marino, Sao Tome and Principe, Saudi Arabia, Slovenia, Sri Lanka, Svalbard, Taiwan, Turks and Caicos Islands, Uzbekistan, Vanuatu, Venezuela, Yemen </td>
<td align="left"> 44 (<img src='http://s1.wordpress.com/latex.php?latex=%7B18.7%5C%25%7D&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='{18.7\%}' title='{18.7\%}' class='latex' />) </td>
<td align="left"> 41 (<img src='http://s2.wordpress.com/latex.php?latex=%7B17.6%5C%25%7D&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='{17.6\%}' title='{17.6\%}' class='latex' />)</td>
</tr>
<tr>
<td align="left"> 3 </td>
<td align="left"> Afghanistan, Albania, Algeria, (The) Bahamas, Belize, Brunei, Canada, (Rep. of the) Congo, Falkland Islands (Islas Malvinas), Iceland, Kenya, Lebanon, Liberia, Liechtenstein, Lithuania, Maldives, Mauritania, Monaco, Morocco, Oman, (Occupied) Palestinian Territory, Panama, Poland, Puerto Rico, Saint Kitts and Nevis, Uganda, United States of America, Uruguay, Western Sahara </td>
<td align="left"> 29 (<img src='http://s3.wordpress.com/latex.php?latex=%7B12.3%5C%25%7D&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='{12.3\%}' title='{12.3\%}' class='latex' />) </td>
<td align="left"> 29 (<img src='http://s1.wordpress.com/latex.php?latex=%7B12.5%5C%25%7D&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='{12.5\%}' title='{12.5\%}' class='latex' />)</td>
</tr>
<tr>
<td align="left"> 4 </td>
<td align="left"> Argentina, Bosnia and Herzegovina, Burma (Myanmar), Cape Verde, Cayman Islands, Central African Republic, Colombia, Costa Rica, Croatia, Faroe Islands, Georgia, Ireland, (South) Korea, Luxembourg, Malta, Moldova, New Zealand, Norway, Pitcairn Islands, Singapore, South Africa, Spain, Sudan, Suriname, Tanzania, Ukraine, United Arab Emirates </td>
<td align="left"> 27 (<img src='http://s2.wordpress.com/latex.php?latex=%7B11.4%5C%25%7D&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='{11.4\%}' title='{11.4\%}' class='latex' />) </td>
<td align="left"> 22 (<img src='http://s3.wordpress.com/latex.php?latex=%7B9.7%5C%25%7D&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='{9.7\%}' title='{9.7\%}' class='latex' />)</td>
</tr>
<tr>
<td align="left"> 5 </td>
<td align="left"> (Macao SAR) China, Cocos Islands, Denmark, Djibouti, Eritrea, Finland, Greenland, Italy, Kyrgyzstan, Montserrat, Nicaragua, Papua New Guinea, Slovakia, Solomon Islands, Togo, Turkmenistan </td>
<td align="left"> 16 (<img src='http://s1.wordpress.com/latex.php?latex=%7B6.8%5C%25%7D&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='{6.8\%}' title='{6.8\%}' class='latex' />) </td>
<td align="left"> 19 (<img src='http://s2.wordpress.com/latex.php?latex=%7B7.9%5C%25%7D&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='{7.9\%}' title='{7.9\%}' class='latex' />)</td>
</tr>
<tr>
<td align="left"> 6 </td>
<td align="left"> American Samoa, Bermuda, Bhutan, (Dem. Rep. of the) Congo, Equatorial Guinea, France, Guernsey, Iran, Jordan, Laos, Libya, Marshall Islands, Montenegro, Paraguay, Sierra Leone, Thailand, United Kingdom </td>
<td align="left"> 17 (<img src='http://s3.wordpress.com/latex.php?latex=%7B7.2%5C%25%7D&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='{7.2\%}' title='{7.2\%}' class='latex' />) </td>
<td align="left"> 16 (<img src='http://s1.wordpress.com/latex.php?latex=%7B6.7%5C%25%7D&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='{6.7\%}' title='{6.7\%}' class='latex' />)</td>
</tr>
<tr>
<td align="left"> 7 </td>
<td align="left"> Bahrain, Bulgaria, (Hong Kong SAR) China, Comoros, Cyprus, Dominica, El Salvador, Guyana, Honduras, Israel, (Isle of) Man, Saint Barthelemy, Saint Helena, Saint Pierre and Miquelon, Switzerland, Tajikistan, Turkey </td>
<td align="left"> 17 (<img src='http://s2.wordpress.com/latex.php?latex=%7B7.2%5C%25%7D&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='{7.2\%}' title='{7.2\%}' class='latex' />) </td>
<td align="left"> 14 (<img src='http://s3.wordpress.com/latex.php?latex=%7B5.8%5C%25%7D&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='{5.8\%}' title='{5.8\%}' class='latex' />)</td>
</tr>
<tr>
<td align="left"> 8 </td>
<td align="left"> Andorra, Antigua and Barbuda, Austria, Azerbaijan, Benin, Burundi, Egypt, Ethiopia, Germany, Haiti, Holy See (Vatican City), Northern Mariana Islands, Qatar, Seychelles, Vietnam </td>
<td align="left"> 15 (<img src='http://s1.wordpress.com/latex.php?latex=%7B6.4%5C%25%7D&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='{6.4\%}' title='{6.4\%}' class='latex' />) </td>
<td align="left"> 12 (<img src='http://s2.wordpress.com/latex.php?latex=%7B5.1%5C%25%7D&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='{5.1\%}' title='{5.1\%}' class='latex' />)</td>
</tr>
<tr>
<td align="left"> 9 </td>
<td align="left"> Belarus, Bolivia, Dominican Republic, Fiji, Grenada, Guinea, Hungary, Jersey, Philippines, Somalia, Sweden </td>
<td align="left"> 11 (<img src='http://s3.wordpress.com/latex.php?latex=%7B4.5%5C%25%7D&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='{4.5\%}' title='{4.5\%}' class='latex' />) </td>
<td align="left"> 11 (<img src='http://s1.wordpress.com/latex.php?latex=%7B4.6%5C%25%7D&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='{4.6\%}' title='{4.6\%}' class='latex' />) </td>
</tr>
</table>
<p>
Here is how the same data tracks Zipf&#8217;s law for the first twenty values of <img src='http://s2.wordpress.com/latex.php?latex=%7Bn%7D&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='{n}' title='{n}' class='latex' />, with the parameters <img src='http://s3.wordpress.com/latex.php?latex=%7BC+%5Capprox+1.28+%5Ctimes+10%5E9%7D&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='{C \approx 1.28 \times 10^9}' title='{C \approx 1.28 \times 10^9}' class='latex' /> and <img src='http://s1.wordpress.com/latex.php?latex=%7B%5Calpha+%5Capprox+1.03%7D&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='{\alpha \approx 1.03}' title='{\alpha \approx 1.03}' class='latex' /> (selected by log-linear regression), again rounding to three significant figures:
</p>
<p><table align="center">
<tr>
<td align="left"> <img src='http://s2.wordpress.com/latex.php?latex=%7Bn%7D&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='{n}' title='{n}' class='latex' /> </td>
<td align="left"> Country </td>
<td align="left"> Population </td>
<td align="left"> Zipf prediction </td>
<td align="left"> Deviation from prediction </td>
</tr>
<tr>
<td align="left"> 1 </td>
<td align="left"> China </td>
<td align="left"> 1,330,000,000	</td>
<td align="left"> 1,280,000,000 </td>
<td align="left"> <img src='http://s3.wordpress.com/latex.php?latex=%7B%2B4.1%5C%25%7D&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='{+4.1\%}' title='{+4.1\%}' class='latex' /> </td>
</tr>
<tr>
<td align="left"> 2	</td>
<td align="left"> India </td>
<td align="left">	1,150,000,000	</td>
<td align="left"> 626,000,000 </td>
<td align="left"> <img src='http://s1.wordpress.com/latex.php?latex=%7B%2B83.5%5C%25%7D&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='{+83.5\%}' title='{+83.5\%}' class='latex' /> </td>
</tr>
<tr>
<td align="left"> 3	</td>
<td align="left"> USA </td>
<td align="left"> 304,000,000 </td>
<td align="left"> 412,000,000 </td>
<td align="left"> <img src='http://s2.wordpress.com/latex.php?latex=%7B-26.3%5C%25%7D&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='{-26.3\%}' title='{-26.3\%}' class='latex' /> </td>
</tr>
<tr>
<td align="left"> 4	</td>
<td align="left"> Indonesia </td>
<td align="left">	238,000,000	</td>
<td align="left"> 307,000,000 </td>
<td align="left"> <img src='http://s3.wordpress.com/latex.php?latex=%7B-22.5%5C%25%7D&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='{-22.5\%}' title='{-22.5\%}' class='latex' /> </td>
</tr>
<tr>
<td align="left"> 5	</td>
<td align="left"> Brazil </td>
<td align="left"> 196,000,000 </td>
<td align="left"> 244,000,000 </td>
<td align="left"> <img src='http://s1.wordpress.com/latex.php?latex=%7B-19.4%5C%25%7D&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='{-19.4\%}' title='{-19.4\%}' class='latex' /> </td>
</tr>
<tr>
<td align="left"> 6	</td>
<td align="left"> Pakistan </td>
<td align="left"> 173,000,000 </td>
<td align="left"> 202,000,000 </td>
<td align="left"> <img src='http://s2.wordpress.com/latex.php?latex=%7B-14.4%5C%25%7D&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='{-14.4\%}' title='{-14.4\%}' class='latex' /> </td>
</tr>
<tr>
<td align="left"> 7 </td>
<td align="left">	Bangladesh </td>
<td align="left"> 154,000,000 </td>
<td align="left"> 172,000,000 </td>
<td align="left"> <img src='http://s3.wordpress.com/latex.php?latex=%7B-10.9%5C%25%7D&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='{-10.9\%}' title='{-10.9\%}' class='latex' /></td>
</tr>
<tr>
<td align="left"> 8	</td>
<td align="left"> Nigeria </td>
<td align="left"> 146,000,000 </td>
<td align="left"> 150,000,000 </td>
<td align="left"> <img src='http://s1.wordpress.com/latex.php?latex=%7B-2.6%5C%25%7D&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='{-2.6\%}' title='{-2.6\%}' class='latex' /> </td>
</tr>
<tr>
<td align="left"> 9	</td>
<td align="left"> Russia </td>
<td align="left"> 141,000,000 </td>
<td align="left"> 133,000,000 </td>
<td align="left"> <img src='http://s2.wordpress.com/latex.php?latex=%7B%2B5.8%5C%25%7D&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='{+5.8\%}' title='{+5.8\%}' class='latex' /> </td>
</tr>
<tr>
<td align="left"> 10 </td>
<td align="left"> Japan </td>
<td align="left"> 128,000,000 </td>
<td align="left"> 120,000,000 </td>
<td align="left"> <img src='http://s3.wordpress.com/latex.php?latex=%7B%2B6.7%5C%25%7D&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='{+6.7\%}' title='{+6.7\%}' class='latex' /></td>
</tr>
<tr>
<td align="left"> 11 </td>
<td align="left"> Mexico </td>
<td align="left"> 110,000,000 </td>
<td align="left"> 108,000,000 </td>
<td align="left"> <img src='http://s1.wordpress.com/latex.php?latex=%7B%2B1.7%5C%25%7D&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='{+1.7\%}' title='{+1.7\%}' class='latex' /> </td>
</tr>
<tr>
<td align="left"> 12 </td>
<td align="left"> Philippines </td>
<td align="left"> 96,100,000 </td>
<td align="left">	98,900,000 </td>
<td align="left"> <img src='http://s2.wordpress.com/latex.php?latex=%7B-2.9%5C%25%7D&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='{-2.9\%}' title='{-2.9\%}' class='latex' /> </td>
</tr>
<tr>
<td align="left"> 13 </td>
<td align="left"> Vietnam </td>
<td align="left"> 86,100,000 </td>
<td align="left"> 91,100,000 </td>
<td align="left"> <img src='http://s3.wordpress.com/latex.php?latex=%7B-5.4%5C%25%7D&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='{-5.4\%}' title='{-5.4\%}' class='latex' /> </td>
</tr>
<tr>
<td align="left"> 14 </td>
<td align="left"> Ethiopia </td>
<td align="left">	82,600,000 </td>
<td align="left"> 84,400,000 </td>
<td align="left"> <img src='http://s1.wordpress.com/latex.php?latex=%7B-2.1%5C%25%7D&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='{-2.1\%}' title='{-2.1\%}' class='latex' /> </td>
</tr>
<tr>
<td align="left"> 15 </td>
<td align="left"> Germany </td>
<td align="left"> 82,400,000 </td>
<td align="left">	78,600,000 </td>
<td align="left"> <img src='http://s2.wordpress.com/latex.php?latex=%7B%2B4.8%5C%25%7D&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='{+4.8\%}' title='{+4.8\%}' class='latex' /> </td>
</tr>
<tr>
<td align="left"> 16 </td>
<td align="left"> Egypt </td>
<td align="left"> 81,700,000 </td>
<td align="left">	73,500,000 </td>
<td align="left"> <img src='http://s3.wordpress.com/latex.php?latex=%7B%2B11.1%5C%25%7D&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='{+11.1\%}' title='{+11.1\%}' class='latex' /> </td>
</tr>
<tr>
<td align="left"> 17 </td>
<td align="left"> Turkey </td>
<td align="left">	71,900,000 </td>
<td align="left"> 69,100,000 </td>
<td align="left"> <img src='http://s1.wordpress.com/latex.php?latex=%7B%2B4.1%5C%25%7D&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='{+4.1\%}' title='{+4.1\%}' class='latex' /> </td>
</tr>
<tr>
<td align="left"> 18 </td>
<td align="left"> Congo </td>
<td align="left"> 66,500,000 </td>
<td align="left">	65,100,000 </td>
<td align="left"> <img src='http://s2.wordpress.com/latex.php?latex=%7B%2B2.2%5C%25%7D&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='{+2.2\%}' title='{+2.2\%}' class='latex' /> </td>
</tr>
<tr>
<td align="left"> 19 </td>
<td align="left"> Iran </td>
<td align="left">	65,900,000 </td>
<td align="left"> 61,600,000 </td>
<td align="left"> <img src='http://s3.wordpress.com/latex.php?latex=%7B%2B6.9%5C%25%7D&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='{+6.9\%}' title='{+6.9\%}' class='latex' /> </td>
</tr>
<tr>
<td align="left"> 20 </td>
<td align="left"> Thailand </td>
<td align="left"> 65,500,000 </td>
<td align="left"> 58,400,000 </td>
<td align="left"> <img src='http://s1.wordpress.com/latex.php?latex=%7B%2B12.1%5C%25%7D&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='{+12.1\%}' title='{+12.1\%}' class='latex' /> </td>
</tr>
</table>
<p>
As one sees, Zipf&#8217;s law is not particularly precise at the extreme edge of the statistics (when <img src='http://s2.wordpress.com/latex.php?latex=%7Bn%7D&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='{n}' title='{n}' class='latex' /> is very small), but becomes reasonably accurate (given the small sample size, and given that we are fitting twenty data points using only two parameters) for moderate sizes of <img src='http://s3.wordpress.com/latex.php?latex=%7Bn%7D&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='{n}' title='{n}' class='latex' />.
</p>
<p>
This data set has too few scales in base <img src='http://s1.wordpress.com/latex.php?latex=%7B10%7D&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='{10}' title='{10}' class='latex' /> to illustrate the Pareto distribution effectively &#8211; over half of the country populations are either seven or eight digits in that base. But if we instead work in base <img src='http://s2.wordpress.com/latex.php?latex=%7B2%7D&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='{2}' title='{2}' class='latex' />, then country populations range in a decent number of scales (the majority of countries have population between <img src='http://s3.wordpress.com/latex.php?latex=%7B2%5E%7B23%7D%7D&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='{2^{23}}' title='{2^{23}}' class='latex' /> and <img src='http://s1.wordpress.com/latex.php?latex=%7B2%5E%7B32%7D%7D&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='{2^{32}}' title='{2^{32}}' class='latex' />), and we begin to see the law emerge, where <img src='http://s2.wordpress.com/latex.php?latex=%7Bm%7D&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='{m}' title='{m}' class='latex' /> is now the number of digits in binary, the best-fit parameters are <img src='http://s3.wordpress.com/latex.php?latex=%7B%5Calpha+%5Capprox+1.18%7D&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='{\alpha \approx 1.18}' title='{\alpha \approx 1.18}' class='latex' /> and <img src='http://s1.wordpress.com/latex.php?latex=%7Bc+%5Capprox+1.7+%5Ctimes+2%5E%7B26%7D+%2F+235%7D&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='{c \approx 1.7 \times 2^{26} / 235}' title='{c \approx 1.7 \times 2^{26} / 235}' class='latex' />:
</p>
<p><table align="center">
<tr>
<td align="left"> <img src='http://s2.wordpress.com/latex.php?latex=%7Bm%7D&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='{m}' title='{m}' class='latex' /> </td>
<td align="left"> Countries with <img src='http://s3.wordpress.com/latex.php?latex=%7B%5Cgeq+m%7D&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='{\geq m}' title='{\geq m}' class='latex' /> binary digit populations </td>
<td align="left"> Number </td>
<td align="left"> Pareto prediction </td>
</tr>
<tr>
<td align="left"> 31 </td>
<td align="left"> China, India </td>
<td align="left"> 2 </td>
<td align="left"> 1 </td>
</tr>
<tr>
<td align="left"> 30 </td>
<td align="left"> &#8221; </td>
<td align="left"> 2 </td>
<td align="left"> 2 </td>
</tr>
<tr>
<td align="left"> 29 </td>
<td align="left"> &#8220;, United States of America </td>
<td align="left"> 3 </td>
<td align="left"> 5 </td>
</tr>
<tr>
<td align="left"> 28 </td>
<td align="left"> &#8220;, Indonesia, Brazil, Pakistan, Bangladesh, Nigeria, Russia </td>
<td align="left"> 9 </td>
<td align="left"> 8 </td>
</tr>
<tr>
<td align="left"> 27 </td>
<td align="left"> &#8220;, Japan, Mexico, Philippines, Vietnam, Ethiopia, Germany, Egypt, Turkey </td>
<td align="left"> 17 </td>
<td align="left"> 15</td>
</tr>
<tr>
<td align="left"> 26 </td>
<td align="left"> &#8220;, (Dem. Rep. of the) Congo, Iran, Thailand, France, United Kingdom, Italy, South Africa, (South) Korea, Burma (Myanmar), Ukraine, Colombia, Spain, Argentina, Sudan, Tanzania, Poland, Kenya, Morocco, Algeria </td>
<td align="left"> 36 </td>
<td align="left"> 27 </td>
</tr>
<tr>
<td align="left"> 25 </td>
<td align="left"> &#8220;, Canada, Afghanistan, Uganda, Nepal, Peru, Iraq, Saudi Arabia, Uzbekistan, Venezuela, Malaysia, (North) Korea, Ghana, Yemen, Taiwan, Romania, Mozambique, Sri Lanka, Australia, Cote d&#8217;Ivoire, Madagascar, Syria, Cameroon </td>
<td align="left"> 58 </td>
<td align="left"> 49</td>
</tr>
<tr>
<td align="left"> 24 </td>
<td align="left"> &#8220;, Netherlands, Chile, Kazakhstan, Burkina Faso, Cambodia, Malawi, Ecuador, Niger, Guatemala, Senegal, Angola, Mali, Zambia, Cuba, Zimbabwe, Greece, Portugal, Belgium, Tunisia, Czech Republic, Rwanda, Serbia, Chad, Hungary, Guinea, Belarus, Somalia, Dominican Republic, Bolivia, Sweden, Haiti, Burundi, Benin </td>
<td align="left"> 91 </td>
<td align="left"> 88 </td>
</tr>
<tr>
<td align="left"> 23 </td>
<td align="left"> &#8220;, Austria, Azerbaijan, Honduras, Switzerland, Bulgaria, Tajikistan, Israel, El Salvador, (Hong Kong SAR) China, Paraguay, Laos, Sierra Leone, Jordan, Libya, Papua New Guinea, Togo, Nicaragua, Eritrea, Denmark, Slovakia, Kyrgyzstan, Finland, Turkmenistan, Norway, Georgia, United Arab Emirates, Singapore, Bosnia and Herzegovina, Croatia, Central African Republic, Moldova, Costa Rica </td>
<td align="left"> 123 </td>
<td align="left"> 159 </td>
</tr>
</table>
<p> Thus, with each new scale, the number of countries introduced increases by a factor of a little less than <img src='http://s1.wordpress.com/latex.php?latex=%7B2%7D&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='{2}' title='{2}' class='latex' />, on the average. This approximate doubling of countries with each new scale begins to falter at about the population <img src='http://s2.wordpress.com/latex.php?latex=%7B2%5E%7B23%7D%7D&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='{2^{23}}' title='{2^{23}}' class='latex' /> (i.e. at around <img src='http://s3.wordpress.com/latex.php?latex=%7B4%7D&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='{4}' title='{4}' class='latex' /> million), for the simple reason that one has begun to run out of countries. (Note that the median-population country in this set, Singapore, has a population with <img src='http://s1.wordpress.com/latex.php?latex=%7B23%7D&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='{23}' title='{23}' class='latex' /> binary digits.)
</p>
<p>
These laws are not merely interesting statistical curiosities; for instance, Benford&#8217;s law is often used to help detect fraudulent statistics (such as those arising from accounting fraud), as many such statistics are invented by choosing digits at random, and will therefore deviate significantly from Benford&#8217;s law. (This is nicely discussed in Robert Matthews&#8217; New Scientist article &#8220;<a href="http://www.newscientist.com/article/mg16321944.600">The power of one</a>&#8220;; this article can also be found on the web at a number of other places.) In a somewhat analogous spirit, Zipf&#8217;s law and the Pareto distribution can be used to mathematically test various models of real-world systems (e.g. formation of astronomical objects, accumulation of wealth, population growth of countries, etc.), without necessarily having to fit all the parameters of that model with the actual data.
</p>
<p>
Being empirically observed phenomena rather than abstract mathematical facts, Benford&#8217;s law, Zipf&#8217;s law, and the Pareto distribution cannot be &#8220;proved&#8221; the same way a mathematical theorem can be proved. However, one can still <em>support</em> these laws mathematically in a number of ways, for instance showing how these laws are compatible with each other, and with other plausible hypotheses on the source of the data. In this post I would like to describe a number of ways (both technical and non-technical) in which one can do this; these arguments do not fully explain these laws (in particular, the empirical fact that the exponent <img src='http://s2.wordpress.com/latex.php?latex=%7B%5Calpha%7D&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='{\alpha}' title='{\alpha}' class='latex' /> in Zipf&#8217;s law or the Pareto distribution is often close to <img src='http://s3.wordpress.com/latex.php?latex=%7B1%7D&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='{1}' title='{1}' class='latex' /> is still quite a mysterious phenomenon), and do not always have the same universal range of applicability as these laws seem to have, but I hope that they do demonstrate that these laws are not completely arbitrary, and ought to have a satisfactory basis of mathematical support. <span id="more-2305"></span>
</p>
</p>
<p align="center"><b> &mdash;  1. Scale invariance  &mdash; </b></p>
<p>
One consistency check that is enjoyed by all of these laws is that of <em>scale invariance</em> &#8211; they are invariant under rescalings of the data (for instance, by changing the units).
</p>
<p>
For example, suppose for sake of argument that the country populations <img src='http://s1.wordpress.com/latex.php?latex=%7BX%7D&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='{X}' title='{X}' class='latex' /> of the world in 2007 obey Benford&#8217;s law, thus for instance about <img src='http://s2.wordpress.com/latex.php?latex=%7B30.7%5C%25%7D&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='{30.7\%}' title='{30.7\%}' class='latex' /> of the countries have population with first digit <img src='http://s3.wordpress.com/latex.php?latex=%7B1%7D&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='{1}' title='{1}' class='latex' />, <img src='http://s1.wordpress.com/latex.php?latex=%7B17.6%5C%25%7D&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='{17.6\%}' title='{17.6\%}' class='latex' /> have population with first digit <img src='http://s2.wordpress.com/latex.php?latex=%7B2%7D&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='{2}' title='{2}' class='latex' />, and so forth. Now, imagine that several decades in the future, say in 2067, all of the countries in the world double their population, from <img src='http://s3.wordpress.com/latex.php?latex=%7BX%7D&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='{X}' title='{X}' class='latex' /> to a new population <img src='http://s1.wordpress.com/latex.php?latex=%7B%5Ctilde+X+%3A%3D+2X%7D&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='{\tilde X := 2X}' title='{\tilde X := 2X}' class='latex' />. (This makes the somewhat implausible assumption that growth rates are uniform across all countries; I will talk about what happens when one omits this hypothesis later.) To further simplify the experiment, suppose that no countries are created or dissolved during this time period. What happens to Benford&#8217;s law when passing from <img src='http://s2.wordpress.com/latex.php?latex=%7BX%7D&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='{X}' title='{X}' class='latex' /> to <img src='http://s3.wordpress.com/latex.php?latex=%7B%5Ctilde+X%7D&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='{\tilde X}' title='{\tilde X}' class='latex' />?
</p>
<p>
The key observation here, of course, is that the first digit of <img src='http://s1.wordpress.com/latex.php?latex=%7BX%7D&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='{X}' title='{X}' class='latex' /> is linked to the first digit of <img src='http://s2.wordpress.com/latex.php?latex=%7B%5Ctilde+X+%3D+2X%7D&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='{\tilde X = 2X}' title='{\tilde X = 2X}' class='latex' />. If, for instance, the first digit of <img src='http://s3.wordpress.com/latex.php?latex=%7BX%7D&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='{X}' title='{X}' class='latex' /> is <img src='http://s1.wordpress.com/latex.php?latex=%7B1%7D&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='{1}' title='{1}' class='latex' />, then the first digit of <img src='http://s2.wordpress.com/latex.php?latex=%7B%5Ctilde+X%7D&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='{\tilde X}' title='{\tilde X}' class='latex' /> is either <img src='http://s3.wordpress.com/latex.php?latex=%7B2%7D&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='{2}' title='{2}' class='latex' /> or <img src='http://s1.wordpress.com/latex.php?latex=%7B3%7D&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='{3}' title='{3}' class='latex' />; conversely, if the first digit of <img src='http://s2.wordpress.com/latex.php?latex=%7B%5Ctilde+X%7D&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='{\tilde X}' title='{\tilde X}' class='latex' /> is <img src='http://s3.wordpress.com/latex.php?latex=%7B2%7D&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='{2}' title='{2}' class='latex' /> or <img src='http://s1.wordpress.com/latex.php?latex=%7B3%7D&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='{3}' title='{3}' class='latex' />, then the first digit of <img src='http://s2.wordpress.com/latex.php?latex=%7BX%7D&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='{X}' title='{X}' class='latex' /> is <img src='http://s3.wordpress.com/latex.php?latex=%7B1%7D&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='{1}' title='{1}' class='latex' />. As a consequence, the proportion of <img src='http://s1.wordpress.com/latex.php?latex=%7BX%7D&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='{X}' title='{X}' class='latex' />&#8217;s with first digit <img src='http://s2.wordpress.com/latex.php?latex=%7B1%7D&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='{1}' title='{1}' class='latex' /> is equal to the proportion of <img src='http://s3.wordpress.com/latex.php?latex=%7B%5Ctilde+X%7D&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='{\tilde X}' title='{\tilde X}' class='latex' />&#8217;s with first digit <img src='http://s1.wordpress.com/latex.php?latex=%7B2%7D&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='{2}' title='{2}' class='latex' />, plus the proportion of <img src='http://s2.wordpress.com/latex.php?latex=%7B%5Ctilde+X%7D&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='{\tilde X}' title='{\tilde X}' class='latex' />&#8217;s with first digit <img src='http://s3.wordpress.com/latex.php?latex=%7B3%7D&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='{3}' title='{3}' class='latex' />. This is consistent with Benford&#8217;s law holding for both <img src='http://s1.wordpress.com/latex.php?latex=%7BX%7D&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='{X}' title='{X}' class='latex' /> and <img src='http://s2.wordpress.com/latex.php?latex=%7B%5Ctilde+X%7D&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='{\tilde X}' title='{\tilde X}' class='latex' />, since </p>
<p align="center"><img src='http://s3.wordpress.com/latex.php?latex=%5Cdisplaystyle++%5Clog_%7B10%7D+%5Cfrac%7B2%7D%7B1%7D+%3D+%5Clog_%7B10%7D+%5Cfrac%7B3%7D%7B2%7D+%2B+%5Clog_%7B10%7D+%5Cfrac%7B4%7D%7B3%7D+%28+%3D+%5Clog_%7B10%7D+%5Cfrac%7B4%7D%7B2%7D+%29&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='\displaystyle  \log_{10} \frac{2}{1} = \log_{10} \frac{3}{2} + \log_{10} \frac{4}{3} ( = \log_{10} \frac{4}{2} )' title='\displaystyle  \log_{10} \frac{2}{1} = \log_{10} \frac{3}{2} + \log_{10} \frac{4}{3} ( = \log_{10} \frac{4}{2} )' class='latex' /></p>
<p> (or numerically, <img src='http://s1.wordpress.com/latex.php?latex=%7B30.7%5C%25+%3D+17.6%5C%25+%2B+12.5%5C%25%7D&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='{30.7\% = 17.6\% + 12.5\%}' title='{30.7\% = 17.6\% + 12.5\%}' class='latex' /> after rounding). Indeed one can check the other digit ranges also and that conclude that Benford&#8217;s law for <img src='http://s2.wordpress.com/latex.php?latex=%7BX%7D&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='{X}' title='{X}' class='latex' /> is compatible with Benford&#8217;s law for <img src='http://s3.wordpress.com/latex.php?latex=%7B%5Ctilde+X%7D&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='{\tilde X}' title='{\tilde X}' class='latex' />; to pick a contrasting example, a uniformly distributed model in which each digit from <img src='http://s1.wordpress.com/latex.php?latex=%7B1%7D&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='{1}' title='{1}' class='latex' /> to <img src='http://s2.wordpress.com/latex.php?latex=%7B9%7D&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='{9}' title='{9}' class='latex' /> is the first digit of <img src='http://s3.wordpress.com/latex.php?latex=%7BX%7D&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='{X}' title='{X}' class='latex' /> occurs with probability <img src='http://s1.wordpress.com/latex.php?latex=%7B1%2F9%7D&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='{1/9}' title='{1/9}' class='latex' /> totally fails to be preserved under doubling.</p>
<p>
One can be even more precise. Observe (through telescoping series) that Benford&#8217;s law implies that <a name="benfo">
<p align="center"><img src='http://s2.wordpress.com/latex.php?latex=%5Cdisplaystyle++%7B%5CBbb+P%7D%28+%5Calpha+10%5En+%5Cleq+X+%3C+%5Cbeta+10%5En+%5Chbox%7B+for+some+integer+%7D+n+%29+%3D+%5Clog_%7B10%7D+%5Cfrac%7B%5Cbeta%7D%7B%5Calpha%7D+%5C+%5C+%5C+%5C+%5C+%281%29&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='\displaystyle  {\Bbb P}( \alpha 10^n \leq X &lt; \beta 10^n \hbox{ for some integer } n ) = \log_{10} \frac{\beta}{\alpha} \ \ \ \ \ (1)' title='\displaystyle  {\Bbb P}( \alpha 10^n \leq X &lt; \beta 10^n \hbox{ for some integer } n ) = \log_{10} \frac{\beta}{\alpha} \ \ \ \ \ (1)' class='latex' /></p>
<p></a> for all integers <img src='http://s3.wordpress.com/latex.php?latex=%7B1+%5Cleq+%5Calpha+%5Cleq+%5Cbeta+%3C+10%7D&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='{1 \leq \alpha \leq \beta &lt; 10}' title='{1 \leq \alpha \leq \beta &lt; 10}' class='latex' />, where the left-hand side denotes the proportion of data for which <img src='http://s1.wordpress.com/latex.php?latex=%7BX%7D&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='{X}' title='{X}' class='latex' /> lies between <img src='http://s2.wordpress.com/latex.php?latex=%7B%5Calpha+10%5En%7D&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='{\alpha 10^n}' title='{\alpha 10^n}' class='latex' /> and <img src='http://s3.wordpress.com/latex.php?latex=%7B%5Cbeta+10%5En%7D&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='{\beta 10^n}' title='{\beta 10^n}' class='latex' /> for some integer <img src='http://s1.wordpress.com/latex.php?latex=%7Bn%7D&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='{n}' title='{n}' class='latex' />. Suppose now that we generalise Benford&#8217;s law to the <em>continuous Benford&#8217;s law</em>, which asserts that <a href="#benfo">(1)</a> is true for all <em>real</em> numbers <img src='http://s2.wordpress.com/latex.php?latex=%7B1+%5Cleq+%5Calpha+%5Cleq+%5Cbeta+%3C+10%7D&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='{1 \leq \alpha \leq \beta &lt; 10}' title='{1 \leq \alpha \leq \beta &lt; 10}' class='latex' />. Then it is not hard to show that a statistic <img src='http://s3.wordpress.com/latex.php?latex=%7BX%7D&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='{X}' title='{X}' class='latex' /> obeys the continuous Benford&#8217;s law if and only if its dilate <img src='http://s1.wordpress.com/latex.php?latex=%7B%5Ctilde+X+%3D2X%7D&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='{\tilde X =2X}' title='{\tilde X =2X}' class='latex' /> does, and similarly with <img src='http://s2.wordpress.com/latex.php?latex=%7B2%7D&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='{2}' title='{2}' class='latex' /> replaced by any other constant growth factor. (This is easiest seen by observing that <a href="#benfo">(1)</a> is equivalent to asserting that the fractional part of <img src='http://s3.wordpress.com/latex.php?latex=%7B%5Clog_%7B10%7D+X%7D&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='{\log_{10} X}' title='{\log_{10} X}' class='latex' /> is uniformly distributed.) In fact, the continuous Benford law is the <em>only</em> distribution for the quantities on the left-hand side of <a href="#benfo">(1)</a> with this scale-invariance property; this fact is a special case of the general fact that Haar measures are unique (see e.g. <a href="http://terrytao.wordpress.com/2009/04/06/the-fourier-transform/">these lecture notes</a>).
</p>
<p>
It is also easy to see that Zipf&#8217;s law and the Pareto distribution also enjoy this sort of scale-invariance property, as long as one generalises the Pareto distribution <a name="pareto">
<p align="center"><img src='http://s1.wordpress.com/latex.php?latex=%5Cdisplaystyle++%7B%5CBbb+P%7D%28+X+%5Cgeq+10%5Em+%29+%3D+c+10%5E%7B-m%2F%5Calpha%7D+%5C+%5C+%5C+%5C+%5C+%282%29&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='\displaystyle  {\Bbb P}( X \geq 10^m ) = c 10^{-m/\alpha} \ \ \ \ \ (2)' title='\displaystyle  {\Bbb P}( X \geq 10^m ) = c 10^{-m/\alpha} \ \ \ \ \ (2)' class='latex' /></p>
<p></a> from integer <img src='http://s2.wordpress.com/latex.php?latex=%7Bm%7D&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='{m}' title='{m}' class='latex' /> to real <img src='http://s3.wordpress.com/latex.php?latex=%7Bm%7D&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='{m}' title='{m}' class='latex' />, just as with Benford&#8217;s law. Once one does that, one can phrase the Pareto distribution law independently of any base as <a name="pareto">
<p align="center"><img src='http://s1.wordpress.com/latex.php?latex=%5Cdisplaystyle++%7B%5CBbb+P%7D%28+X+%5Cgeq+x+%29+%3D+c+x%5E%7B-1%2F%5Calpha%7D+%5C+%5C+%5C+%5C+%5C+%283%29&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='\displaystyle  {\Bbb P}( X \geq x ) = c x^{-1/\alpha} \ \ \ \ \ (3)' title='\displaystyle  {\Bbb P}( X \geq x ) = c x^{-1/\alpha} \ \ \ \ \ (3)' class='latex' /></p>
<p></a> for any <img src='http://s2.wordpress.com/latex.php?latex=%7Bx%7D&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='{x}' title='{x}' class='latex' /> much larger than the median value of <img src='http://s3.wordpress.com/latex.php?latex=%7BX%7D&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='{X}' title='{X}' class='latex' />, at which point the scale-invariance is easily seen.
</p>
<p>
One may object that the above thought-experiment was too idealised, because it assumed uniform growth rates for all the statistics at once. What happens if there are non-uniform growth rates? To keep the computations simple, let us consider the following toy model, where we take the same 2007 population statistics <img src='http://s1.wordpress.com/latex.php?latex=%7BX%7D&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='{X}' title='{X}' class='latex' /> as before, and assume that half of the countries (the &#8220;high-growth&#8221; countries) will experience a population doubling by 2067, while the other half (the &#8220;zero-growth&#8221; countries) will keep their population constant, thus the 2067 population statistic <img src='http://s2.wordpress.com/latex.php?latex=%7B%5Ctilde+X%7D&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='{\tilde X}' title='{\tilde X}' class='latex' /> is equal to <img src='http://s3.wordpress.com/latex.php?latex=%7B2X%7D&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='{2X}' title='{2X}' class='latex' /> half the time and <img src='http://s1.wordpress.com/latex.php?latex=%7BX%7D&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='{X}' title='{X}' class='latex' /> half the time. (We will assume that our sample sizes are large enough that the <a href="http://en.wikipedia.org/wiki/Law_of_large_numbers">law of large numbers</a> kicks in, and we will therefore ignore issues such as what happens to this &#8220;half the time&#8221; if the number of samples is odd.) Furthermore, we make the plausible but crucial assumption that the event that a country is a high-growth or a zero-growth country is <em>independent</em> of the first digit of the 2007 population; thus, for instance, a country whose population begins with <img src='http://s2.wordpress.com/latex.php?latex=%7B3%7D&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='{3}' title='{3}' class='latex' /> is assumed to be just as likely to be high-growth as one whose population begins with <img src='http://s3.wordpress.com/latex.php?latex=%7B7%7D&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='{7}' title='{7}' class='latex' />.
</p>
<p>
Now let&#8217;s have a look again at the proportion of countries whose 2067 population <img src='http://s1.wordpress.com/latex.php?latex=%7B%5Ctilde+X%7D&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='{\tilde X}' title='{\tilde X}' class='latex' /> begins with either <img src='http://s2.wordpress.com/latex.php?latex=%7B2%7D&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='{2}' title='{2}' class='latex' /> or <img src='http://s3.wordpress.com/latex.php?latex=%7B3%7D&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='{3}' title='{3}' class='latex' />. There are exactly two ways in which a country can fall into this category: either it is a zero-growth country whose 2007 population <img src='http://s1.wordpress.com/latex.php?latex=%7BX%7D&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='{X}' title='{X}' class='latex' /> also began with either <img src='http://s2.wordpress.com/latex.php?latex=%7B2%7D&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='{2}' title='{2}' class='latex' /> or <img src='http://s3.wordpress.com/latex.php?latex=%7B3%7D&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='{3}' title='{3}' class='latex' />, or it was a high-growth country whose population in 2007 began with <img src='http://s1.wordpress.com/latex.php?latex=%7B1%7D&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='{1}' title='{1}' class='latex' />. Since all countries have a probability <img src='http://s2.wordpress.com/latex.php?latex=%7B1%2F2%7D&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='{1/2}' title='{1/2}' class='latex' /> of being high-growth regardless of the first digit of their population, we conclude the identity <a name="px">
<p align="center"><img src='http://s3.wordpress.com/latex.php?latex=%5Cdisplaystyle++%7B%5CBbb+P%7D%28+%5Ctilde+X+%5Chbox%7B+has+first+digit+%7D+2%2C+3+%29+%3D+%5Cfrac%7B1%7D%7B2%7D+%7B%5CBbb+P%7D%28+X+%5Chbox%7B+has+first+digit+%7D+2%2C+3+%29+%5C+%5C+%5C+%5C+%5C+%284%29&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='\displaystyle  {\Bbb P}( \tilde X \hbox{ has first digit } 2, 3 ) = \frac{1}{2} {\Bbb P}( X \hbox{ has first digit } 2, 3 ) \ \ \ \ \ (4)' title='\displaystyle  {\Bbb P}( \tilde X \hbox{ has first digit } 2, 3 ) = \frac{1}{2} {\Bbb P}( X \hbox{ has first digit } 2, 3 ) \ \ \ \ \ (4)' class='latex' /></p>
<p></a> </p>
<p align="center"><img src='http://s1.wordpress.com/latex.php?latex=%5Cdisplaystyle++%2B+%5Cfrac%7B1%7D%7B2%7D+%7B%5CBbb+P%7D%28+X+%5Chbox%7B+has+first+digit+%7D+1+%29&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='\displaystyle  + \frac{1}{2} {\Bbb P}( X \hbox{ has first digit } 1 )' title='\displaystyle  + \frac{1}{2} {\Bbb P}( X \hbox{ has first digit } 1 )' class='latex' /></p>
<p> which is once again compatible with Benford&#8217;s law for <img src='http://s2.wordpress.com/latex.php?latex=%7B%5Ctilde+X%7D&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='{\tilde X}' title='{\tilde X}' class='latex' /> since
<p align="center"><img src='http://s3.wordpress.com/latex.php?latex=%5Cdisplaystyle++%5Clog_%7B10%7D+%5Cfrac%7B4%7D%7B2%7D+%3D+%5Cfrac%7B1%7D%7B2%7D+%5Clog_%7B10%7D+%5Cfrac%7B4%7D%7B2%7D+%2B+%5Cfrac%7B1%7D%7B2%7D+%5Clog+%5Cfrac%7B2%7D%7B1%7D.&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='\displaystyle  \log_{10} \frac{4}{2} = \frac{1}{2} \log_{10} \frac{4}{2} + \frac{1}{2} \log \frac{2}{1}.' title='\displaystyle  \log_{10} \frac{4}{2} = \frac{1}{2} \log_{10} \frac{4}{2} + \frac{1}{2} \log \frac{2}{1}.' class='latex' /></p>
<p> More generally, it is not hard to show that if <img src='http://s1.wordpress.com/latex.php?latex=%7BX%7D&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='{X}' title='{X}' class='latex' /> obeys the continuous Benford&#8217;s law <a href="#benfo">(1)</a>, and one multiplies <img src='http://s2.wordpress.com/latex.php?latex=%7BX%7D&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='{X}' title='{X}' class='latex' /> by some positive multiplier <img src='http://s3.wordpress.com/latex.php?latex=%7BY%7D&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='{Y}' title='{Y}' class='latex' /> which is independent of the first digit of <img src='http://s1.wordpress.com/latex.php?latex=%7BX%7D&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='{X}' title='{X}' class='latex' /> (and, <em>a fortiori</em>, is independent of the fractional part of <img src='http://s2.wordpress.com/latex.php?latex=%7B%5Clog_%7B10%7D+X%7D&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='{\log_{10} X}' title='{\log_{10} X}' class='latex' />), one obtains another quantity <img src='http://s3.wordpress.com/latex.php?latex=%7B%5Ctilde+X%3DXY%7D&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='{\tilde X=XY}' title='{\tilde X=XY}' class='latex' /> which also obeys the continuous Benford&#8217;s law. (Indeed, we have already seen this to be the case when <img src='http://s1.wordpress.com/latex.php?latex=%7BY%7D&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='{Y}' title='{Y}' class='latex' /> is a deterministic constant, and the case when <img src='http://s2.wordpress.com/latex.php?latex=%7BY%7D&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='{Y}' title='{Y}' class='latex' /> is random then follows simply by conditioning <img src='http://s3.wordpress.com/latex.php?latex=%7BY%7D&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='{Y}' title='{Y}' class='latex' /> to be fixed.)</p>
<p>
In particular, we see an absorptive property of Benford&#8217;s law: if <img src='http://s1.wordpress.com/latex.php?latex=%7BX%7D&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='{X}' title='{X}' class='latex' /> obeys Benford&#8217;s law, and <img src='http://s2.wordpress.com/latex.php?latex=%7BY%7D&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='{Y}' title='{Y}' class='latex' /> is any positive statistic independent of <img src='http://s3.wordpress.com/latex.php?latex=%7BX%7D&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='{X}' title='{X}' class='latex' />, then the product <img src='http://s1.wordpress.com/latex.php?latex=%7B%5Ctilde+X%3DXY%7D&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='{\tilde X=XY}' title='{\tilde X=XY}' class='latex' /> also obeys Benford&#8217;s law &#8211; <em>even if <img src='http://s2.wordpress.com/latex.php?latex=%7BY%7D&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='{Y}' title='{Y}' class='latex' /> did not obey this law</em>. Thus, if a statistic is the product of many independent factors, then it only requires a single factor to obey Benford&#8217;s law in order for the whole product to obey the law also. For instance, the population of a country is the product of its area and its population density. Assuming that the population density of a country is independent of the size of that country (which is not a completely reasonable assumption, but let us take it for the sake of argument), then we see that Benford&#8217;s law for the population would follow if just one of the area or population density obeyed this law. It is also clear that Benford&#8217;s law is the only distribution with this absorptive property (if there was another law with this property, what would happen if one multiplied a statistic with that law with an independent statistic with Benford&#8217;s law?). Thus we begin to get a glimpse as to why Benford&#8217;s law is universal for quantities which are the product of many separate factors, in a manner that no other law could be.
</p>
<p>
As an example: for any given number <img src='http://s3.wordpress.com/latex.php?latex=%7BN%7D&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='{N}' title='{N}' class='latex' />, the uniform distribution from <img src='http://s1.wordpress.com/latex.php?latex=%7B1%7D&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='{1}' title='{1}' class='latex' /> to <img src='http://s2.wordpress.com/latex.php?latex=%7BN%7D&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='{N}' title='{N}' class='latex' /> does not obey Benford&#8217;s law; for instance, if one picks a random number from <img src='http://s3.wordpress.com/latex.php?latex=%7B1%7D&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='{1}' title='{1}' class='latex' /> to <img src='http://s1.wordpress.com/latex.php?latex=%7B999%2C999%7D&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='{999,999}' title='{999,999}' class='latex' /> then each digit from <img src='http://s2.wordpress.com/latex.php?latex=%7B1%7D&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='{1}' title='{1}' class='latex' /> to <img src='http://s3.wordpress.com/latex.php?latex=%7B9%7D&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='{9}' title='{9}' class='latex' /> appears as the first digit with an equal probability of <img src='http://s1.wordpress.com/latex.php?latex=%7B1%2F9%7D&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='{1/9}' title='{1/9}' class='latex' /> each. However, if <img src='http://s2.wordpress.com/latex.php?latex=%7BN%7D&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='{N}' title='{N}' class='latex' /> is not fixed, but instead obeys Benford&#8217;s law, then a random number selected from <img src='http://s3.wordpress.com/latex.php?latex=%7B1%7D&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='{1}' title='{1}' class='latex' /> to <img src='http://s1.wordpress.com/latex.php?latex=%7BN%7D&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='{N}' title='{N}' class='latex' /> also obeys Benford&#8217;s law (ignoring for now the distinction between continuous and discrete distributions), as it can be viewed as the product of <img src='http://s2.wordpress.com/latex.php?latex=%7BN%7D&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='{N}' title='{N}' class='latex' /> with an independent random number selected from between <img src='http://s3.wordpress.com/latex.php?latex=%7B0%7D&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='{0}' title='{0}' class='latex' /> and <img src='http://s1.wordpress.com/latex.php?latex=%7B1%7D&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='{1}' title='{1}' class='latex' />.
</p>
<p>
Actually, one can say something even stronger than the absorption property. Suppose that the continuous Benford&#8217;s law <a href="#benfo">(1)</a> for a statistic <img src='http://s2.wordpress.com/latex.php?latex=%7BX%7D&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='{X}' title='{X}' class='latex' /> did not hold exactly, but instead held with some accuracy <img src='http://s3.wordpress.com/latex.php?latex=%7B%5Cvarepsilon+%3E+0%7D&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='{\varepsilon &gt; 0}' title='{\varepsilon &gt; 0}' class='latex' />, thus <a name="bof">
<p align="center"><img src='http://s1.wordpress.com/latex.php?latex=%5Cdisplaystyle++%5Clog_%7B10%7D+%5Cfrac%7B%5Cbeta%7D%7B%5Calpha%7D+-+%5Cvarepsilon+%5Cleq+%7B%5CBbb+P%7D%28+%5Calpha+10%5En+%5Cleq+X+%3C+%5Cbeta+10%5En+%5Chbox%7B+for+some+integer+%7D+n+%29+%5Cleq+%5Clog_%7B10%7D+%5Cfrac%7B%5Cbeta%7D%7B%5Calpha%7D+%2B+%5Cvarepsilon+%5C+%5C+%5C+%5C+%5C+%285%29&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='\displaystyle  \log_{10} \frac{\beta}{\alpha} - \varepsilon \leq {\Bbb P}( \alpha 10^n \leq X &lt; \beta 10^n \hbox{ for some integer } n ) \leq \log_{10} \frac{\beta}{\alpha} + \varepsilon \ \ \ \ \ (5)' title='\displaystyle  \log_{10} \frac{\beta}{\alpha} - \varepsilon \leq {\Bbb P}( \alpha 10^n \leq X &lt; \beta 10^n \hbox{ for some integer } n ) \leq \log_{10} \frac{\beta}{\alpha} + \varepsilon \ \ \ \ \ (5)' class='latex' /></p>
<p></a> for all <img src='http://s2.wordpress.com/latex.php?latex=%7B1+%5Cleq+%5Calpha+%5Cleq+%5Cbeta+%3C+10%7D&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='{1 \leq \alpha \leq \beta &lt; 10}' title='{1 \leq \alpha \leq \beta &lt; 10}' class='latex' />. Then it is not hard to see that any dilated statistic, such as <img src='http://s3.wordpress.com/latex.php?latex=%7B%5Ctilde+X+%3D+2X%7D&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='{\tilde X = 2X}' title='{\tilde X = 2X}' class='latex' />, or more generally <img src='http://s1.wordpress.com/latex.php?latex=%7B%5Ctilde+X%3DXY%7D&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='{\tilde X=XY}' title='{\tilde X=XY}' class='latex' /> for any fixed deterministic <img src='http://s2.wordpress.com/latex.php?latex=%7BY%7D&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='{Y}' title='{Y}' class='latex' />, also obeys <a href="#bof">(5)</a> with exactly the same accuracy <img src='http://s3.wordpress.com/latex.php?latex=%7B%5Cvarepsilon%7D&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='{\varepsilon}' title='{\varepsilon}' class='latex' />. But now suppose one uses a variable multiplier; for instance, suppose one uses the model discussed earlier in which <img src='http://s1.wordpress.com/latex.php?latex=%7B%5Ctilde+X%7D&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='{\tilde X}' title='{\tilde X}' class='latex' /> is equal to <img src='http://s2.wordpress.com/latex.php?latex=%7B2X%7D&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='{2X}' title='{2X}' class='latex' /> half the time and <img src='http://s3.wordpress.com/latex.php?latex=%7BX%7D&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='{X}' title='{X}' class='latex' /> half the time. Then the relationship between the distribution of the first digit of <img src='http://s1.wordpress.com/latex.php?latex=%7B%5Ctilde+X%7D&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='{\tilde X}' title='{\tilde X}' class='latex' /> and the first digit of <img src='http://s2.wordpress.com/latex.php?latex=%7BX%7D&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='{X}' title='{X}' class='latex' /> is given by formulae such as <a href="#px">(4)</a>. Now, in the right-hand side of <a href="#px">(4)</a>, each of the two terms <img src='http://s3.wordpress.com/latex.php?latex=%7B%7B%5CBbb+P%7D%28+X+%5Chbox%7B+has+first+digit+%7D+2%2C+3+%29%7D&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='{{\Bbb P}( X \hbox{ has first digit } 2, 3 )}' title='{{\Bbb P}( X \hbox{ has first digit } 2, 3 )}' class='latex' /> and <img src='http://s1.wordpress.com/latex.php?latex=%7B%7B%5CBbb+P%7D%28+X+%5Chbox%7B+has+first+digit+%7D+1+%29%7D&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='{{\Bbb P}( X \hbox{ has first digit } 1 )}' title='{{\Bbb P}( X \hbox{ has first digit } 1 )}' class='latex' /> differs from the Benford&#8217;s law predictions of <img src='http://s2.wordpress.com/latex.php?latex=%7B%5Clog_%7B10%7D+%5Cfrac%7B4%7D%7B2%7D%7D&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='{\log_{10} \frac{4}{2}}' title='{\log_{10} \frac{4}{2}}' class='latex' /> and <img src='http://s3.wordpress.com/latex.php?latex=%7B%5Clog_%7B10%7D+%5Cfrac%7B2%7D%7B1%7D%7D&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='{\log_{10} \frac{2}{1}}' title='{\log_{10} \frac{2}{1}}' class='latex' /> respectively by at most <img src='http://s1.wordpress.com/latex.php?latex=%7B%5Cvarepsilon%7D&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='{\varepsilon}' title='{\varepsilon}' class='latex' />. Since the left-hand side of <a href="#px">(4)</a> is the average of these two terms, it also differs from the Benford law prediction by at most <img src='http://s2.wordpress.com/latex.php?latex=%7B%5Cvarepsilon%7D&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='{\varepsilon}' title='{\varepsilon}' class='latex' />. But the averaging opens up an opportunity for cancelling; for instance, an overestimate of <img src='http://s3.wordpress.com/latex.php?latex=%7B%2B%5Cvarepsilon%7D&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='{+\varepsilon}' title='{+\varepsilon}' class='latex' /> for <img src='http://s1.wordpress.com/latex.php?latex=%7B%7B%5CBbb+P%7D%28+X+%5Chbox%7B+has+first+digit+%7D+2%2C+3+%29%7D&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='{{\Bbb P}( X \hbox{ has first digit } 2, 3 )}' title='{{\Bbb P}( X \hbox{ has first digit } 2, 3 )}' class='latex' /> could cancel an underestimate of <img src='http://s2.wordpress.com/latex.php?latex=%7B-%5Cvarepsilon%7D&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='{-\varepsilon}' title='{-\varepsilon}' class='latex' /> for <img src='http://s3.wordpress.com/latex.php?latex=%7B%7B%5CBbb+P%7D%28+X+%5Chbox%7B+has+first+digit+%7D+1+%29%7D&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='{{\Bbb P}( X \hbox{ has first digit } 1 )}' title='{{\Bbb P}( X \hbox{ has first digit } 1 )}' class='latex' /> to produce a spot-on prediction for <img src='http://s1.wordpress.com/latex.php?latex=%7B%5Ctilde+X%7D&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='{\tilde X}' title='{\tilde X}' class='latex' />. Thus we see that variable multipliers (or variable growth rates) not only preserve Benford&#8217;s law, but in fact <em>stabilise</em> it by averaging out the errors. In fact, if one started with a distribution which did not initially obey Benford&#8217;s law, and then started applying some variable (and independent) growth rates to the various samples in the distribution, then under reasonable assumptions one can show that the resulting distribution will converge to Benford&#8217;s law over time. This helps explain the universality of Benford&#8217;s law for statistics such as populations, for which the independent variable growth law is not so unreasonable (at least, until the population hits some maximum capacity threshold).
</p>
<p>
Note that the independence property is crucial; if for instance population growth always slowed down for some inexplicable reason to a crawl whenever the first digit of the population was <img src='http://s2.wordpress.com/latex.php?latex=%7B6%7D&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='{6}' title='{6}' class='latex' />, then there would be a noticeable deviation from Benford&#8217;s law, particularly in digits <img src='http://s3.wordpress.com/latex.php?latex=%7B6%7D&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='{6}' title='{6}' class='latex' /> and <img src='http://s1.wordpress.com/latex.php?latex=%7B7%7D&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='{7}' title='{7}' class='latex' />, due to this growth bottleneck. But this is not a particularly plausible scenario (being somewhat analogous to <a href="http://en.wikipedia.org/wiki/Maxwell&#37;27s_demon">Maxwell&#8217;s demon</a> in thermodynamics).
</p>
<p>
The above analysis can also be carried over to some extent to the Pareto distribution and Zipf&#8217;s law; if a statistic <img src='http://s2.wordpress.com/latex.php?latex=%7BX%7D&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='{X}' title='{X}' class='latex' /> obeys these laws approximately, then after multiplying by an independent variable <img src='http://s3.wordpress.com/latex.php?latex=%7BY%7D&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='{Y}' title='{Y}' class='latex' />, the product <img src='http://s1.wordpress.com/latex.php?latex=%7B%5Ctilde+X%3DXY%7D&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='{\tilde X=XY}' title='{\tilde X=XY}' class='latex' /> will obey the same laws with equal or higher accuracy, so long as <img src='http://s2.wordpress.com/latex.php?latex=%7BY%7D&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='{Y}' title='{Y}' class='latex' /> is small compared to the number of scales that <img src='http://s3.wordpress.com/latex.php?latex=%7BX%7D&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='{X}' title='{X}' class='latex' /> typically ranges over. (One needs a restriction such as this because the Pareto distribution and Zipf&#8217;s law must break down below the median.) These laws are also stable under other multiplicative processes, for instance if some fraction of the samples in <img src='http://s1.wordpress.com/latex.php?latex=%7BX%7D&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='{X}' title='{X}' class='latex' /> spontaneously split into two smaller pieces, or conversely if two samples in <img src='http://s2.wordpress.com/latex.php?latex=%7BX%7D&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='{X}' title='{X}' class='latex' /> spontaneously merge into one; as before, the key is that the occurrence of these events should be independent of the actual size of the objects being split. If one considers a generalisation of the Pareto or Zipf law in which the exponent <img src='http://s3.wordpress.com/latex.php?latex=%7B%5Calpha%7D&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='{\alpha}' title='{\alpha}' class='latex' /> is not fixed, but varies with <img src='http://s1.wordpress.com/latex.php?latex=%7Bn%7D&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='{n}' title='{n}' class='latex' /> or <img src='http://s2.wordpress.com/latex.php?latex=%7Bk%7D&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='{k}' title='{k}' class='latex' />, then the effect of these sorts of multiplicative changes is to blur and average together the various values of <img src='http://s3.wordpress.com/latex.php?latex=%7B%5Calpha%7D&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='{\alpha}' title='{\alpha}' class='latex' />, thus &#8220;flattening&#8221; the <img src='http://s1.wordpress.com/latex.php?latex=%7B%5Calpha%7D&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='{\alpha}' title='{\alpha}' class='latex' /> curve over time and making the distribution approach Zipf&#8217;s law and/or the Pareto distribution. This helps explain why <img src='http://s2.wordpress.com/latex.php?latex=%7B%5Calpha%7D&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='{\alpha}' title='{\alpha}' class='latex' /> eventually becomes constant; however, I do not have a good explanation as to why <img src='http://s3.wordpress.com/latex.php?latex=%7B%5Calpha%7D&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='{\alpha}' title='{\alpha}' class='latex' /> is often close to <img src='http://s1.wordpress.com/latex.php?latex=%7B1%7D&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='{1}' title='{1}' class='latex' />.
</p>
</p>
<p align="center"><b> &mdash;  2. Compatibility between laws  &mdash; </b></p>
<p>
Another mathematical line of support for Benford&#8217;s law, Zipf&#8217;s law, and the Pareto distribution are that the laws are highly compatible with each other. For instance, Zipf&#8217;s law and the Pareto distribution are formally equivalent: if there are <img src='http://s2.wordpress.com/latex.php?latex=%7BN%7D&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='{N}' title='{N}' class='latex' /> samples of <img src='http://s3.wordpress.com/latex.php?latex=%7BX%7D&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='{X}' title='{X}' class='latex' />, then applying <a href="#pareto">(3)</a> with <img src='http://s1.wordpress.com/latex.php?latex=%7Bx%7D&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='{x}' title='{x}' class='latex' /> equal to the <img src='http://s2.wordpress.com/latex.php?latex=%7Bn%5E%7Bth%7D%7D&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='{n^{th}}' title='{n^{th}}' class='latex' /> largest value <img src='http://s3.wordpress.com/latex.php?latex=%7BX_n%7D&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='{X_n}' title='{X_n}' class='latex' /> of <img src='http://s1.wordpress.com/latex.php?latex=%7BX%7D&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='{X}' title='{X}' class='latex' /> gives </p>
<p align="center"><img src='http://s2.wordpress.com/latex.php?latex=%5Cdisplaystyle++%5Cfrac%7Bn%7D%7BN%7D+%3D+%7B%5CBbb+P%7D%28+X+%5Cgeq+X_n+%29+%3D+c+X_n%5E%7B-1%2F%5Calpha%7D&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='\displaystyle  \frac{n}{N} = {\Bbb P}( X \geq X_n ) = c X_n^{-1/\alpha}' title='\displaystyle  \frac{n}{N} = {\Bbb P}( X \geq X_n ) = c X_n^{-1/\alpha}' class='latex' /></p>
<p> which implies Zipf&#8217;s law <img src='http://s3.wordpress.com/latex.php?latex=%7BX_n+%3D+C+n%5E%7B-%5Calpha%7D%7D&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='{X_n = C n^{-\alpha}}' title='{X_n = C n^{-\alpha}}' class='latex' /> with <img src='http://s1.wordpress.com/latex.php?latex=%7BC+%3A%3D+%28Nc%29%5E%5Calpha%7D&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='{C := (Nc)^\alpha}' title='{C := (Nc)^\alpha}' class='latex' />. Conversely one can deduce the Pareto distribution from Zipf&#8217;s law. These deductions are only formal in nature, because the Pareto distribution can only hold exactly for continuous distributions, whereas Zipf&#8217;s law only makes sense for discrete distributions, but one can generate more rigorous variants of these deductions without much difficulty. </p>
<p>
In some literature, Zipf&#8217;s law is applied primarily near the extreme edge of the distribution (e.g. the top <img src='http://s2.wordpress.com/latex.php?latex=%7B0.1%5C%25%7D&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='{0.1\%}' title='{0.1\%}' class='latex' /> of the sample space), whereas the Pareto distribution in regions closer to the bulk (e.g. between the top <img src='http://s3.wordpress.com/latex.php?latex=%7B0.1%5C%25%7D&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='{0.1\%}' title='{0.1\%}' class='latex' /> and and top <img src='http://s1.wordpress.com/latex.php?latex=%7B50%5C%25%7D&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='{50\%}' title='{50\%}' class='latex' />). But this is mostly a difference of degree rather than of kind, though in some cases (such as with the example of the 2007 country populations data set) the exponent <img src='http://s2.wordpress.com/latex.php?latex=%7B%5Calpha%7D&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='{\alpha}' title='{\alpha}' class='latex' /> for the Pareto distribtion in the bulk can differ slightly from the exponent for Zipf&#8217;s law at the extreme edge.
</p>
<p>
The relationship between Zipf&#8217;s law or the Pareto distribution and Benford&#8217;s law is more subtle. For instance Benford&#8217;s law predicts that the proportion of <img src='http://s3.wordpress.com/latex.php?latex=%7BX%7D&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='{X}' title='{X}' class='latex' /> with initial digit <img src='http://s1.wordpress.com/latex.php?latex=%7B1%7D&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='{1}' title='{1}' class='latex' /> should equal the proportion of <img src='http://s2.wordpress.com/latex.php?latex=%7BX%7D&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='{X}' title='{X}' class='latex' /> with initial digit <img src='http://s3.wordpress.com/latex.php?latex=%7B2%7D&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='{2}' title='{2}' class='latex' /> or <img src='http://s1.wordpress.com/latex.php?latex=%7B3%7D&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='{3}' title='{3}' class='latex' />. But if one formally uses the Pareto distribution <a href="#pareto">(3)</a> to compare those <img src='http://s2.wordpress.com/latex.php?latex=%7BX%7D&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='{X}' title='{X}' class='latex' /> between <img src='http://s3.wordpress.com/latex.php?latex=%7B10%5Em%7D&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='{10^m}' title='{10^m}' class='latex' /> and <img src='http://s1.wordpress.com/latex.php?latex=%7B2+%5Ctimes+10%5Em%7D&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='{2 \times 10^m}' title='{2 \times 10^m}' class='latex' />, and those <img src='http://s2.wordpress.com/latex.php?latex=%7BX%7D&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='{X}' title='{X}' class='latex' /> between <img src='http://s3.wordpress.com/latex.php?latex=%7B2+%5Ctimes+10%5Em%7D&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='{2 \times 10^m}' title='{2 \times 10^m}' class='latex' /> and <img src='http://s1.wordpress.com/latex.php?latex=%7B4+%5Ctimes+10%5Em%7D&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='{4 \times 10^m}' title='{4 \times 10^m}' class='latex' />, it seems that the former is larger by a factor of <img src='http://s2.wordpress.com/latex.php?latex=%7B2%5E%7B1%2F%5Calpha%7D%7D&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='{2^{1/\alpha}}' title='{2^{1/\alpha}}' class='latex' />, which upon summing by <img src='http://s3.wordpress.com/latex.php?latex=%7Bm%7D&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='{m}' title='{m}' class='latex' /> appears inconsistent with Benford&#8217;s law (unless <img src='http://s1.wordpress.com/latex.php?latex=%7B%5Calpha%7D&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='{\alpha}' title='{\alpha}' class='latex' /> is extremely large). A similar inconsistency is revealed if one uses Zipf&#8217;s law instead.
</p>
<p>
However, the fallacy here is that the Pareto distribution (or Zipf&#8217;s law) does not apply on the entire range of <img src='http://s2.wordpress.com/latex.php?latex=%7BX%7D&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='{X}' title='{X}' class='latex' />, but only on the upper tail region when <img src='http://s3.wordpress.com/latex.php?latex=%7BX%7D&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='{X}' title='{X}' class='latex' /> is significantly higher than the median; it is a law for the <em>outliers</em> of <img src='http://s1.wordpress.com/latex.php?latex=%7BX%7D&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='{X}' title='{X}' class='latex' /> only. In contrast, Benford&#8217;s law concerns the behaviour of <em>typical</em> values of <img src='http://s2.wordpress.com/latex.php?latex=%7BX%7D&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='{X}' title='{X}' class='latex' />; the behaviour of the top <img src='http://s3.wordpress.com/latex.php?latex=%7B0.1%5C%25%7D&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='{0.1\%}' title='{0.1\%}' class='latex' /> is of negligible significance to Benford&#8217;s law, though it is of prime importance for Zipf&#8217;s law and the Pareto distribution. Thus the two laws describe different components of the distribution and thus complement each other. Roughly speaking, Benford&#8217;s law asserts that the bulk distribution of <img src='http://s1.wordpress.com/latex.php?latex=%7B%5Clog_%7B10%7D+X%7D&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='{\log_{10} X}' title='{\log_{10} X}' class='latex' /> is locally uniform at unit scales, while the Pareto distribution (or Zipf&#8217;s law) asserts that the tail distribution of <img src='http://s2.wordpress.com/latex.php?latex=%7B%5Clog_%7B10%7D+X%7D&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='{\log_{10} X}' title='{\log_{10} X}' class='latex' /> decays exponentially. Note that Benford&#8217;s law only describes the fine-scale behaviour of the bulk distribution; the coarse-scale distribution can be a variety of distributions (e.g. log-gaussian).
</p></p>
  <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gocomments/terrytao.wordpress.com/2305/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/comments/terrytao.wordpress.com/2305/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/godelicious/terrytao.wordpress.com/2305/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/delicious/terrytao.wordpress.com/2305/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gostumble/terrytao.wordpress.com/2305/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/stumble/terrytao.wordpress.com/2305/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/godigg/terrytao.wordpress.com/2305/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/digg/terrytao.wordpress.com/2305/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/goreddit/terrytao.wordpress.com/2305/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/reddit/terrytao.wordpress.com/2305/" /></a> <img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=terrytao.wordpress.com&blog=817149&post=2305&subd=terrytao&ref=&feed=1" /></div>]]></content:encoded>
			<wfw:commentRss>http://terrytao.wordpress.com/2009/07/03/benfords-law-zipfs-law-and-the-pareto-distribution/feed/</wfw:commentRss>
		<slash:comments>11</slash:comments>
	
		<media:content url="http://1.gravatar.com/avatar/3c795880f3b73784a9b75fbff3772701?s=96&#38;d=identicon" medium="image">
			<media:title type="html">Terry</media:title>
		</media:content>
	</item>
		<item>
		<title>Operator splitting for the KdV equation</title>
		<link>http://terrytao.wordpress.com/2009/06/27/operator-splitting-for-the-kdv-equation/</link>
		<comments>http://terrytao.wordpress.com/2009/06/27/operator-splitting-for-the-kdv-equation/#comments</comments>
		<pubDate>Sun, 28 Jun 2009 05:25:28 +0000</pubDate>
		<dc:creator>Terence Tao</dc:creator>
				<category><![CDATA[math.AP]]></category>
		<category><![CDATA[math.NA]]></category>
		<category><![CDATA[paper]]></category>
		<category><![CDATA[Duhamel's formula]]></category>
		<category><![CDATA[Helge Holden]]></category>
		<category><![CDATA[Kenneth Karlsen]]></category>
		<category><![CDATA[Nils Risebro]]></category>
		<category><![CDATA[operator splitting]]></category>

		<guid isPermaLink="false">http://terrytao.wordpress.com/?p=2357</guid>
		<description><![CDATA[The summer continues to allow some clearing of the backlog of projects accumulated during the academic year: Helge Holden, Kenneth Karlsen, Nils Risebro, and myself have uploaded to the arXiv the paper &#8220;Operator splitting for the KdV equation&#8220;, submitted to Math. Comp..   This paper is concerned with accurate numerical schemes for solving initial value problems [...]<img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=terrytao.wordpress.com&blog=817149&post=2357&subd=terrytao&ref=&feed=1" />]]></description>
			<content:encoded><![CDATA[<div class='snap_preview'><br /><p>The summer continues to allow some clearing of the backlog of projects accumulated during the academic year: <a href="http://www.math.ntnu.no/~holden/">Helge Holden</a>, <a href="http://www.math.uio.no/~kennethk/">Kenneth Karlsen</a>, <a href="http://folk.uio.no/nilshr/">Nils Risebro</a>, and myself have uploaded to the <a href="http://www.arxiv.org">arXiv</a> the paper &#8220;<a href="http://arxiv.org/abs/0906.4902">Operator splitting for the KdV equation</a>&#8220;, submitted to <a href="http://www.ams.org/mcom/">Math. Comp.</a>.   This paper is concerned with accurate numerical schemes for solving initial value problems for the <a href="http://en.wikipedia.org/wiki/Korteweg%E2%80%93de_Vries_equation">Korteweg-de Vries equation</a></p>
<p style="text-align:center;"><img src='http://s3.wordpress.com/latex.php?latex=u_t+%2B+u_%7Bxxx%7D+%3D+u+u_x%3B+%5Cquad+u%280%2Cx%29+%3D+u_0%28x%29%2C&#038;bg=ffffff&#038;fg=545454&#038;s=0' alt='u_t + u_{xxx} = u u_x; \quad u(0,x) = u_0(x),' title='u_t + u_{xxx} = u u_x; \quad u(0,x) = u_0(x),' class='latex' /> (1)</p>
<p>though the analysis here would be valid for a wide range of other semilinear dispersive models as well.  In principle, these equations, which are <a href="http://en.wikipedia.org/wiki/Integrable_system">completely integrable</a>, can be solved exactly by the <a href="http://en.wikipedia.org/wiki/Inverse_scattering_transform">inverse scattering method</a>, but fast and accurate implementations of this method are still not as satisfactory as one would like.  On the other hand, the<a href="http://tosio.math.toronto.edu/wiki/index.php/Airy_equation"> linear Korteweg-de Vries equation</a></p>
<p style="text-align:center;"><img src='http://s1.wordpress.com/latex.php?latex=u_t+%2B+u_%7Bxxx%7D+%3D+0%3B+%5Cquad+u%280%2Cx%29+%3D+u_0%28x%29&#038;bg=ffffff&#038;fg=545454&#038;s=0' alt='u_t + u_{xxx} = 0; \quad u(0,x) = u_0(x)' title='u_t + u_{xxx} = 0; \quad u(0,x) = u_0(x)' class='latex' /> (2)</p>
<p>can be solved exactly (with accurate and fast numerics) via the (fast) Fourier transform, while the (inviscid) <a href="http://en.wikipedia.org/wiki/Burgers%27_equation">Burgers equation</a></p>
<p style="text-align:center;"><img src='http://s2.wordpress.com/latex.php?latex=u_t+%3D+u+u_x%3B+%5Cquad+u%280%2Cx%29+%3D+u_0%28x%29&#038;bg=ffffff&#038;fg=545454&#038;s=0' alt='u_t = u u_x; \quad u(0,x) = u_0(x)' title='u_t = u u_x; \quad u(0,x) = u_0(x)' class='latex' /> (3)</p>
<p>can also be solved exactly (and quickly) by the <a href="http://en.wikipedia.org/wiki/Method_of_characteristics">method of characteristics</a>.  Since the KdV equation is in some sense a combination of the equations (2) and (3), it is then reasonable to hope that some combination of the solution schemes for (2) and (3) can be used to solve (1), at least in some approximate sense.</p>
<p>One way to do this is by the method of <em>operator splitting</em>.  Observe from the formal approximation <img src='http://s3.wordpress.com/latex.php?latex=e%5E%7BA+%5CDelta+t%7D+%5Capprox+1+%2B+A+%5CDelta+t+%2B+O%28+%5CDelta+t%5E2%29&#038;bg=ffffff&#038;fg=545454&#038;s=0' alt='e^{A \Delta t} \approx 1 + A \Delta t + O( \Delta t^2)' title='e^{A \Delta t} \approx 1 + A \Delta t + O( \Delta t^2)' class='latex' /> (where <img src='http://s1.wordpress.com/latex.php?latex=%5CDelta+t&#038;bg=ffffff&#038;fg=545454&#038;s=0' alt='\Delta t' title='\Delta t' class='latex' /> should be thought of as small, and <img src='http://s2.wordpress.com/latex.php?latex=A&#038;bg=ffffff&#038;fg=545454&#038;s=0' alt='A' title='A' class='latex' /> is some matrix or linear operator), that one has</p>
<p style="text-align:center;"><img src='http://s3.wordpress.com/latex.php?latex=e%5E%7B%28A%2BB%29+%5CDelta+t%7D+u_0+%3D+e%5E%7BA+%5CDelta+t%7D+e%5E%7BB+%5CDelta+t%7D+u_0+%2B+O%28+%5CDelta+t%5E2+%29&#038;bg=ffffff&#038;fg=545454&#038;s=0' alt='e^{(A+B) \Delta t} u_0 = e^{A \Delta t} e^{B \Delta t} u_0 + O( \Delta t^2 )' title='e^{(A+B) \Delta t} u_0 = e^{A \Delta t} e^{B \Delta t} u_0 + O( \Delta t^2 )' class='latex' />, (4)</p>
<p>[we do not assume A and B to commute here] and thus we formally have</p>
<p style="text-align:center;"><img src='http://s1.wordpress.com/latex.php?latex=e%5E%7Bn+%28A%2BB%29+%5CDelta+t%7D+u_0+%3D+%28e%5E%7BA+%5CDelta+t%7D+e%5E%7BB+%5CDelta+t%7D%29%5En+u_0+%2B+O%28+%5CDelta+t+%29&#038;bg=ffffff&#038;fg=545454&#038;s=0' alt='e^{n (A+B) \Delta t} u_0 = (e^{A \Delta t} e^{B \Delta t})^n u_0 + O( \Delta t )' title='e^{n (A+B) \Delta t} u_0 = (e^{A \Delta t} e^{B \Delta t})^n u_0 + O( \Delta t )' class='latex' /> (5)</p>
<p>if <img src='http://s2.wordpress.com/latex.php?latex=n+%3D+T%2F%5CDelta+t&#038;bg=ffffff&#038;fg=545454&#038;s=0' alt='n = T/\Delta t' title='n = T/\Delta t' class='latex' /> for some fixed time T (thus <img src='http://s3.wordpress.com/latex.php?latex=n+%3D+O%281%2F%5CDelta+t%29&#038;bg=ffffff&#038;fg=545454&#038;s=0' alt='n = O(1/\Delta t)' title='n = O(1/\Delta t)' class='latex' />).  As a consequence, if one wants to solve the linear ODE</p>
<p style="text-align:center;"><img src='http://s1.wordpress.com/latex.php?latex=u_t+%3D+%28A%2BB%29+u%3B+%5Cquad+u%280%29+%3D+u_0&#038;bg=ffffff&#038;fg=545454&#038;s=0' alt='u_t = (A+B) u; \quad u(0) = u_0' title='u_t = (A+B) u; \quad u(0) = u_0' class='latex' /> (1&#8242;)</p>
<p>for time <img src='http://s2.wordpress.com/latex.php?latex=T+%3D+n+%5CDelta+t&#038;bg=ffffff&#038;fg=545454&#038;s=0' alt='T = n \Delta t' title='T = n \Delta t' class='latex' />, one can achieve an approximate solution (accurate to order <img src='http://s3.wordpress.com/latex.php?latex=%5CDelta+t&#038;bg=ffffff&#038;fg=545454&#038;s=0' alt='\Delta t' title='\Delta t' class='latex' />) by alternating <img src='http://s1.wordpress.com/latex.php?latex=n&#038;bg=ffffff&#038;fg=545454&#038;s=0' alt='n' title='n' class='latex' /> times between evolving the ODE</p>
<p style="text-align:center;"><img src='http://s2.wordpress.com/latex.php?latex=u_t+%3D+A+u&#038;bg=ffffff&#038;fg=545454&#038;s=0' alt='u_t = A u' title='u_t = A u' class='latex' /> (2&#8242;)</p>
<p>for time <img src='http://s3.wordpress.com/latex.php?latex=%5CDelta+t&#038;bg=ffffff&#038;fg=545454&#038;s=0' alt='\Delta t' title='\Delta t' class='latex' />, and evolving the ODE</p>
<p style="text-align:center;"><img src='http://s1.wordpress.com/latex.php?latex=u_t+%3D+B+u&#038;bg=ffffff&#038;fg=545454&#038;s=0' alt='u_t = B u' title='u_t = B u' class='latex' /> (3&#8242;)</p>
<p>for time <img src='http://s2.wordpress.com/latex.php?latex=%5CDelta+t&#038;bg=ffffff&#038;fg=545454&#038;s=0' alt='\Delta t' title='\Delta t' class='latex' />, starting with the initial data <img src='http://s3.wordpress.com/latex.php?latex=u_0&#038;bg=ffffff&#038;fg=545454&#038;s=0' alt='u_0' title='u_0' class='latex' />.</p>
<p>It turns out that this scheme can be formalised, and furthermore generalised to nonlinear settings such as those for the KdV equation (1).  More precisely, we show that if <img src='http://s1.wordpress.com/latex.php?latex=u_0+%5Cin+H%5Es%28%7B%5CBbb+R%7D%29&#038;bg=ffffff&#038;fg=545454&#038;s=0' alt='u_0 \in H^s({\Bbb R})' title='u_0 \in H^s({\Bbb R})' class='latex' /> for some <img src='http://s2.wordpress.com/latex.php?latex=s+%5Cgeq+5&#038;bg=ffffff&#038;fg=545454&#038;s=0' alt='s \geq 5' title='s \geq 5' class='latex' />, then one can solve (1) to accuracy <img src='http://s3.wordpress.com/latex.php?latex=O%28%5CDelta+t%29&#038;bg=ffffff&#038;fg=545454&#038;s=0' alt='O(\Delta t)' title='O(\Delta t)' class='latex' /> in <img src='http://s1.wordpress.com/latex.php?latex=H%5Es&#038;bg=ffffff&#038;fg=545454&#038;s=0' alt='H^s' title='H^s' class='latex' /> norm for any fixed time <img src='http://s2.wordpress.com/latex.php?latex=T+%3D+n+%5CDelta+t&#038;bg=ffffff&#038;fg=545454&#038;s=0' alt='T = n \Delta t' title='T = n \Delta t' class='latex' /> by alternating between evolving (2) and (3) for times <img src='http://s3.wordpress.com/latex.php?latex=%5CDelta+t&#038;bg=ffffff&#038;fg=545454&#038;s=0' alt='\Delta t' title='\Delta t' class='latex' /> (this scheme is known as <em>Godunov splitting</em>).</p>
<p>Actually, one can obtain faster convergence by modifying the scheme, at the cost of requiring higher regularity on the data; the situation is similar to that of <a href="http://en.wikipedia.org/wiki/Numerical_integration">numerical integration (or quadrature)</a>, in which the <a href="http://en.wikipedia.org/wiki/Midpoint_rule">midpoint rule</a> or <a href="http://en.wikipedia.org/wiki/Simpson%27s_rule">Simpson&#8217;s rule</a> provide more accuracy than the Riemann integral if the integrand is smooth.  For instance, one has the variant</p>
<p style="text-align:center;"><img src='http://s1.wordpress.com/latex.php?latex=e%5E%7Bn+%28A%2BB%29+%5CDelta+t%7D+%3D+%28e%5E%7BA+%5CDelta+t%2F2%7D+e%5E%7BB+%5CDelta+t%2F2%7D+e%5E%7BB+%5CDelta+t%2F2%7D+e%5E%7BA+%5CDelta+t%2F2%7D%29%5En+%2B+O%28+%5CDelta+t%5E2+%29&#038;bg=ffffff&#038;fg=545454&#038;s=0' alt='e^{n (A+B) \Delta t} = (e^{A \Delta t/2} e^{B \Delta t/2} e^{B \Delta t/2} e^{A \Delta t/2})^n + O( \Delta t^2 )' title='e^{n (A+B) \Delta t} = (e^{A \Delta t/2} e^{B \Delta t/2} e^{B \Delta t/2} e^{A \Delta t/2})^n + O( \Delta t^2 )' class='latex' /> (6)</p>
<p>of (5), which can be seen by expansion to second order in <img src='http://s2.wordpress.com/latex.php?latex=%5CDelta+t&#038;bg=ffffff&#038;fg=545454&#038;s=0' alt='\Delta t' title='\Delta t' class='latex' /> (or by playing around with the <a href="http://en.wikipedia.org/wiki/Baker-Campbell-Hausdorff_formula">Baker-Campbell-Hausdorff formula</a>).  For KdV, we can rigorously show that the analogous scheme (known as <em>Strang splitting</em>) involving the indicated combination of evolutions of (2) and (3) will also converge to accuracy <img src='http://s3.wordpress.com/latex.php?latex=%5CDelta+t%5E2&#038;bg=ffffff&#038;fg=545454&#038;s=0' alt='\Delta t^2' title='\Delta t^2' class='latex' /> in <img src='http://s1.wordpress.com/latex.php?latex=H%5Es&#038;bg=ffffff&#038;fg=545454&#038;s=0' alt='H^s' title='H^s' class='latex' /> norm, provided that <img src='http://s2.wordpress.com/latex.php?latex=u_0+%5Cin+H%5Es%28%7B%5CBbb+R%7D%29&#038;bg=ffffff&#038;fg=545454&#038;s=0' alt='u_0 \in H^s({\Bbb R})' title='u_0 \in H^s({\Bbb R})' class='latex' /> and <img src='http://s3.wordpress.com/latex.php?latex=s+%5Cgeq+17&#038;bg=ffffff&#038;fg=545454&#038;s=0' alt='s \geq 17' title='s \geq 17' class='latex' />.</p>
<p><span id="more-2357"></span>Our main tools are the energy method (i.e. computations of how <img src='http://s1.wordpress.com/latex.php?latex=H%5Es%28%7B%5CBbb+R%7D%29&#038;bg=ffffff&#038;fg=545454&#038;s=0' alt='H^s({\Bbb R})' title='H^s({\Bbb R})' class='latex' />-norm type energies of the solution evolve in time via differentiation and integration by parts), and the use of two time variables rather than one.  To describe the second method, let us give a PDE-style proof of (4) (assuming that A, B are finite-dimensional matrices and u is vector-valued to avoid the technical complications associated with infinite dimensional vector spaces).  We introduce two time variables <img src='http://s2.wordpress.com/latex.php?latex=t%2C+%5Ctau&#038;bg=ffffff&#038;fg=545454&#038;s=0' alt='t, \tau' title='t, \tau' class='latex' /> and consider the PDE problem</p>
<p style="text-align:center;"><img src='http://s3.wordpress.com/latex.php?latex=%5Cpartial_%5Ctau+u%28t%2C%5Ctau%29+%3D+A+u%28t%2C%5Ctau%29&#038;bg=ffffff&#038;fg=545454&#038;s=0' alt='\partial_\tau u(t,\tau) = A u(t,\tau)' title='\partial_\tau u(t,\tau) = A u(t,\tau)' class='latex' /></p>
<p style="text-align:center;"><img src='http://s1.wordpress.com/latex.php?latex=%5Cpartial_t+u%28t%2C0%29+%3D+B+u%28t%2C0%29&#038;bg=ffffff&#038;fg=545454&#038;s=0' alt='\partial_t u(t,0) = B u(t,0)' title='\partial_t u(t,0) = B u(t,0)' class='latex' /></p>
<p style="text-align:center;"><img src='http://s2.wordpress.com/latex.php?latex=u%280%2C0%29+%3D+u_0&#038;bg=ffffff&#038;fg=545454&#038;s=0' alt='u(0,0) = u_0' title='u(0,0) = u_0' class='latex' />.</p>
<p>Thus, one first solves the equation (3&#8242;) along the t axis with initial data <img src='http://s3.wordpress.com/latex.php?latex=u_0&#038;bg=ffffff&#038;fg=545454&#038;s=0' alt='u_0' title='u_0' class='latex' />, and then solves the equation (2&#8242;) in the <img src='http://s1.wordpress.com/latex.php?latex=%5Ctau&#038;bg=ffffff&#038;fg=545454&#038;s=0' alt='\tau' title='\tau' class='latex' /> direction, thus leading to the explicit solution <img src='http://s2.wordpress.com/latex.php?latex=u%28t%2C%5Ctau%29+%3D+e%5E%7B%5Ctau+A%7D+e%5E%7Bt+B%7D+u_0&#038;bg=ffffff&#038;fg=545454&#038;s=0' alt='u(t,\tau) = e^{\tau A} e^{t B} u_0' title='u(t,\tau) = e^{\tau A} e^{t B} u_0' class='latex' />.  In particular, the left-hand side of (4) is <img src='http://s3.wordpress.com/latex.php?latex=u%28%5CDelta+t%2C+%5CDelta+t%29&#038;bg=ffffff&#038;fg=545454&#038;s=0' alt='u(\Delta t, \Delta t)' title='u(\Delta t, \Delta t)' class='latex' />.</p>
<p>Now we look at the evolution of u along the diagonal axis.  Observe from the chain rule that</p>
<p style="text-align:center;"><img src='http://s1.wordpress.com/latex.php?latex=%5Cfrac%7Bd%7D%7Bdt%7D+u%28t%2Ct%29+%3D+%28A%2BB%29+u%28t%2Ct%29+%2B+F%28t%2Ct%29&#038;bg=ffffff&#038;fg=545454&#038;s=0' alt='\frac{d}{dt} u(t,t) = (A+B) u(t,t) + F(t,t)' title='\frac{d}{dt} u(t,t) = (A+B) u(t,t) + F(t,t)' class='latex' /> (7)</p>
<p>where <img src='http://s2.wordpress.com/latex.php?latex=F%28t%2C%5Ctau%29+%3A%3D%5Cpartial_t+u%28t%2C%5Ctau%29+-+B+u%28t%2C%5Ctau%29&#038;bg=ffffff&#038;fg=545454&#038;s=0' alt='F(t,\tau) :=\partial_t u(t,\tau) - B u(t,\tau)' title='F(t,\tau) :=\partial_t u(t,\tau) - B u(t,\tau)' class='latex' />.  On the other hand, F itself solves an ODE in the <img src='http://s3.wordpress.com/latex.php?latex=%5Ctau&#038;bg=ffffff&#038;fg=545454&#038;s=0' alt='\tau' title='\tau' class='latex' /> direction:</p>
<p style="text-align:center;"><img src='http://s1.wordpress.com/latex.php?latex=%5Cpartial_%5Ctau+F+%3D+A+F+%2B+%5BA%2CB%5D+u&#038;bg=ffffff&#038;fg=545454&#038;s=0' alt='\partial_\tau F = A F + [A,B] u' title='\partial_\tau F = A F + [A,B] u' class='latex' /></p>
<p style="text-align:center;"><img src='http://s2.wordpress.com/latex.php?latex=F%28t%2C0%29+%3D+0.&#038;bg=ffffff&#038;fg=545454&#038;s=0' alt='F(t,0) = 0.' title='F(t,0) = 0.' class='latex' /></p>
<p>We thus have <img src='http://s3.wordpress.com/latex.php?latex=F%28t%2Ct%29+%3D+O%28+t+%29&#038;bg=ffffff&#038;fg=545454&#038;s=0' alt='F(t,t) = O( t )' title='F(t,t) = O( t )' class='latex' /> for small t, and then by <a href="http://en.wikipedia.org/wiki/Duhamel%27s_integral">Duhamel&#8217;s formula</a> and (7) we have <img src='http://s1.wordpress.com/latex.php?latex=u%28t%2Ct%29+%3D+e%5E%7B%28A%2BB%29t%7D+u_0+%2B+O%28+t%5E2+%29&#038;bg=ffffff&#038;fg=545454&#038;s=0' alt='u(t,t) = e^{(A+B)t} u_0 + O( t^2 )' title='u(t,t) = e^{(A+B)t} u_0 + O( t^2 )' class='latex' />, which implies (4) as claimed.  Note that this analysis also recovers the classical formula <img src='http://s2.wordpress.com/latex.php?latex=e%5E%7B%28A%2BB%29t%7D+%3D+e%5E%7BAt%7D+e%5E%7BBt%7D&#038;bg=ffffff&#038;fg=545454&#038;s=0' alt='e^{(A+B)t} = e^{At} e^{Bt}' title='e^{(A+B)t} = e^{At} e^{Bt}' class='latex' /> in the special case that A and B commute.</p>
  <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gocomments/terrytao.wordpress.com/2357/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/comments/terrytao.wordpress.com/2357/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/godelicious/terrytao.wordpress.com/2357/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/delicious/terrytao.wordpress.com/2357/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gostumble/terrytao.wordpress.com/2357/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/stumble/terrytao.wordpress.com/2357/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/godigg/terrytao.wordpress.com/2357/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/digg/terrytao.wordpress.com/2357/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/goreddit/terrytao.wordpress.com/2357/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/reddit/terrytao.wordpress.com/2357/" /></a> <img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=terrytao.wordpress.com&blog=817149&post=2357&subd=terrytao&ref=&feed=1" /></div>]]></content:encoded>
			<wfw:commentRss>http://terrytao.wordpress.com/2009/06/27/operator-splitting-for-the-kdv-equation/feed/</wfw:commentRss>
		<slash:comments>3</slash:comments>
	
		<media:content url="http://1.gravatar.com/avatar/3c795880f3b73784a9b75fbff3772701?s=96&#38;d=identicon" medium="image">
			<media:title type="html">Terry</media:title>
		</media:content>
	</item>
		<item>
		<title>Bulk universality for Wigner hermitian matrices with subexponential decay</title>
		<link>http://terrytao.wordpress.com/2009/06/26/bulk-universality-for-wigner-hermitian-matrices-with-subexponential-decay/</link>
		<comments>http://terrytao.wordpress.com/2009/06/26/bulk-universality-for-wigner-hermitian-matrices-with-subexponential-decay/#comments</comments>
		<pubDate>Sat, 27 Jun 2009 04:03:54 +0000</pubDate>
		<dc:creator>Terence Tao</dc:creator>
				<category><![CDATA[math.PR]]></category>
		<category><![CDATA[math.SP]]></category>
		<category><![CDATA[paper]]></category>
		<category><![CDATA[Benjamin Schlein]]></category>
		<category><![CDATA[Horng-Tzer Yau]]></category>
		<category><![CDATA[Jose Ramirez]]></category>
		<category><![CDATA[Laszlo Erdos]]></category>
		<category><![CDATA[random matrices]]></category>
		<category><![CDATA[randomness]]></category>
		<category><![CDATA[universality]]></category>
		<category><![CDATA[Van Vu]]></category>

		<guid isPermaLink="false">http://terrytao.wordpress.com/?p=2353</guid>
		<description><![CDATA[One further paper in this stream: László Erdős, José Ramírez, Benjamin Schlein, Van Vu, Horng-Tzer Yau, and myself have just uploaded to the arXiv the paper &#8220;Bulk universality for Wigner hermitian matrices with subexponential decay&#8220;, submitted to Mathematical Research Letters.  (Incidentally, this is my first six-author paper I have been involved in, not counting the [...]<img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=terrytao.wordpress.com&blog=817149&post=2353&subd=terrytao&ref=&feed=1" />]]></description>
			<content:encoded><![CDATA[<div class='snap_preview'><br /><p>One further paper in this stream: <a href="http://www.mathematik.uni-muenchen.de/~lerdos/">László Erdős</a>, José Ramírez, <a href="http://www.dpmms.cam.ac.uk/~bds32/">Benjamin Schlein</a>, <a href="http://www.dpmms.cam.ac.uk/~bds32/">Van Vu</a>, <a href="http://as.nyu.edu/object/horngtzeryau.html">Horng-Tzer Yau</a>, and myself have just uploaded to the <a href="http://www.arxiv.org">arXiv</a> the paper &#8220;<a href="http://arxiv.org/abs/0906.4400">Bulk universality for Wigner hermitian matrices with subexponential decay</a>&#8220;, submitted to <a href="http://math.ucsd.edu/~mrl/">Mathematical Research Letters</a>.  (Incidentally, this is my first six-author paper I have been involved in, not counting the <a href="http://en.wordpress.com/tag/polymath1/">polymath</a> projects of course, though I have had a number of five-author papers.)</p>
<p>This short paper (9 pages) combines the machinery from two recent papers on the universality conjecture for the eigenvalue spacings in the bulk for Wigner random matrices (see <a href="http://terrytao.wordpress.com/2009/06/03/random-matrices-universality-of-local-eigenvalue-statistics/">my earlier blog post</a> for more discussion).  On the one hand, the <a href="http://arxiv.org/abs/0905.4176">paper of Erdős-Ramírez-Schlein-Yau</a> established this conjecture under the additional hypothesis that the distribution of the individual entries obeyed some smoothness and exponential decay conditions.  Meanwhile, the <a href="http://arxiv.org/abs/0906.0510">paper of Van Vu and myself</a> (which I discussed in <a href="http://terrytao.wordpress.com/2009/06/03/random-matrices-universality-of-local-eigenvalue-statistics/">my earlier blog post</a>) established the conjecture under a somewhat different set of hypotheses, namely that the distribution of the individual entries obeyed some moment conditions (in particular, the third moment had to vanish), a support condition (the entries had to have real part supported in at least three points), and an exponential decay condition.</p>
<p>After comparing our results, the six of us realised that our methods could in fact be combined rather easily to obtain a stronger result, establishing the universality conjecture assuming only a exponential decay (or more precisely, sub-exponential decay) bound <img src='http://s1.wordpress.com/latex.php?latex=%7B%5CBbb+P%7D%28%7Cx_%7B%5Cell+k%7D%7C+%3E+t+%29+%5Cll+%5Cexp%28+-+t%5Ec+%29&#038;bg=ffffff&#038;fg=545454&#038;s=0' alt='{\Bbb P}(|x_{\ell k}| &gt; t ) \ll \exp( - t^c )' title='{\Bbb P}(|x_{\ell k}| &gt; t ) \ll \exp( - t^c )' class='latex' /> on the coefficients; thus all regularity, moment, and support conditions have been eliminated.  (There is one catch, namely that we can no longer control a single spacing <img src='http://s2.wordpress.com/latex.php?latex=%5Clambda_%7Bi%2B1%7D-%5Clambda_i&#038;bg=ffffff&#038;fg=545454&#038;s=0' alt='\lambda_{i+1}-\lambda_i' title='\lambda_{i+1}-\lambda_i' class='latex' /> for a single fixed i, but must now average over all <img src='http://s3.wordpress.com/latex.php?latex=1+%5Cleq+i+%5Cleq+n&#038;bg=ffffff&#038;fg=545454&#038;s=0' alt='1 \leq i \leq n' title='1 \leq i \leq n' class='latex' /> before recovering the universality.  This is an annoying technical issue but it may be resolvable in the future with further refinements to the method.)</p>
<p>I can describe the main idea behind the unified approach here.  One can arrange the Wigner matrices in a hierarchy, from most structured to least structured:</p>
<ul>
<li>The most structured (or special) ensemble is the <em>Gaussian Unitary Ensemble</em> (GUE), in which the coefficients are gaussian. Here, one has very explicit and tractable formulae for the eigenvalue distributions, gap spacing, etc.</li>
<li>The next most structured ensemble of Wigner matrices are the <em>Gaussian-divisible</em> or <em>Johansson matrices</em>, which are matrices H of the form <img src='http://s1.wordpress.com/latex.php?latex=H+%3D+e%5E%7B-t%2F2%7D+%5Chat+H+%2B+%281-e%5E%7B-t%7D%29%5E%7B1%2F2%7D+V&#038;bg=ffffff&#038;fg=545454&#038;s=0' alt='H = e^{-t/2} \hat H + (1-e^{-t})^{1/2} V' title='H = e^{-t/2} \hat H + (1-e^{-t})^{1/2} V' class='latex' />, where <img src='http://s2.wordpress.com/latex.php?latex=%5Chat+H&#038;bg=ffffff&#038;fg=545454&#038;s=0' alt='\hat H' title='\hat H' class='latex' /> is another Wigner matrix, V is a GUE matrix independent of <img src='http://s3.wordpress.com/latex.php?latex=%5Chat+H&#038;bg=ffffff&#038;fg=545454&#038;s=0' alt='\hat H' title='\hat H' class='latex' />, and <img src='http://s1.wordpress.com/latex.php?latex=0+%3C+t+%3C+1&#038;bg=ffffff&#038;fg=545454&#038;s=0' alt='0 &lt; t &lt; 1' title='0 &lt; t &lt; 1' class='latex' /> is a fixed parameter independent of n.  Here, one still has quite explicit (though not quite as tractable) formulae for the joint eigenvalue distribution and related statistics.  Note that the limiting case t=1 is GUE.</li>
<li>After this, one has the <em><a href="http://en.wikipedia.org/wiki/Ornstein-Uhlenbeck_process">Ornstein-Uhlenbeck</a>-evolved</em><em> matrices</em>, which are also of the form <img src='http://s2.wordpress.com/latex.php?latex=H+%3D+e%5E%7B-t%2F2%7D+%5Chat+H+%2B+%281-e%5E%7B-t%7D%29%5E%7B1%2F2%7D+V&#038;bg=ffffff&#038;fg=545454&#038;s=0' alt='H = e^{-t/2} \hat H + (1-e^{-t})^{1/2} V' title='H = e^{-t/2} \hat H + (1-e^{-t})^{1/2} V' class='latex' />, but now <img src='http://s3.wordpress.com/latex.php?latex=t+%3D+n%5E%7B-1%2B%5Cdelta%7D&#038;bg=ffffff&#038;fg=545454&#038;s=0' alt='t = n^{-1+\delta}' title='t = n^{-1+\delta}' class='latex' /> decays at a power rate with n, rather than being comparable to 1.  Explicit formulae still exist for these matrices, but extracting universality out of this is hard work (and occupies the bulk of the <a href="http://arxiv.org/abs/0905.4176">paper of Erdős-Ramírez-Schlein-Yau</a>).</li>
<li>Finally, one has arbitrary Wigner matrices, which can be viewed as the t=0 limit of the above Ornstein-Uhlenbeck process.</li>
</ul>
<p>The arguments in the <a href="http://arxiv.org/abs/0905.4176">paper of Erdős-Ramírez-Schlein-Yau</a> can be summarised as follows (I assume subexponential decay throughout this discussion):</p>
<ol>
<li>(Structured case) The universality conjecture is true for Ornstein-Uhlenbeck-evolved matrices with <img src='http://s1.wordpress.com/latex.php?latex=t+%3D+n%5E%7B-1%2B%5Cdelta%7D&#038;bg=ffffff&#038;fg=545454&#038;s=0' alt='t = n^{-1+\delta}' title='t = n^{-1+\delta}' class='latex' /> for any <img src='http://s2.wordpress.com/latex.php?latex=0+%3C+%5Cdelta+%5Cleq+1&#038;bg=ffffff&#038;fg=545454&#038;s=0' alt='0 &lt; \delta \leq 1' title='0 &lt; \delta \leq 1' class='latex' />.  (The case <img src='http://s3.wordpress.com/latex.php?latex=1%2F4+%3C+%5Cdelta+%5Cleq+1&#038;bg=ffffff&#038;fg=545454&#038;s=0' alt='1/4 &lt; \delta \leq 1' title='1/4 &lt; \delta \leq 1' class='latex' /> was treated in an earlier paper of <a href="http://arxiv.org/abs/0905.2089">Erdős-Ramírez-Schlein-Yau</a>, while the case where t is comparable to 1 was treated <a href="http://www.ams.org/mathscinet-getitem?mr=1810949">by Johansson</a>.)</li>
<li>(Matching) Every Wigner matrix with suitable smoothness conditions can be &#8220;matched&#8221; with an Ornstein-Uhlenbeck-evolved matrix, in the sense that the eigenvalue statistics for the two matrices are asymptotically identical.  (This is relatively easy due to the fact that <img src='http://s1.wordpress.com/latex.php?latex=%5Cdelta&#038;bg=ffffff&#038;fg=545454&#038;s=0' alt='\delta' title='\delta' class='latex' /> can be taken arbitrarily close to zero.)</li>
<li>Combining 1. and 2. one obtains universality for all Wigner matrices obeying suitable smoothness conditions.</li>
</ol>
<p>The arguments in the <a href="http://terrytao.wordpress.com/2009/06/03/random-matrices-universality-of-local-eigenvalue-statistics/">paper of Van and myself</a> can be summarised as follows:</p>
<ol>
<li>(Structured case) The universality conjecture is true for Johansson matrices, by the <a href="http://www.ams.org/mathscinet-getitem?mr=1810949">paper of Johansson</a>.</li>
<li>(Matching) Every Wigner matrix with some moment and support conditions can be &#8220;matched&#8221; with a Johansson matrix, in the sense that the first four moments of the entries agree, and hence (by the Lindeberg strategy in our paper) have asymptotically identical statistics.</li>
<li>Combining 1. and 2. one obtains universality for all Wigner matrices obtaining suitable moment and support conditions.</li>
</ol>
<p>What we realised is by combining the hard part 1. of the <a href="http://arxiv.org/abs/0905.4176">paper of Erdős-Ramírez-Schlein-Yau</a> with the hard part 2. of the <a href="../2009/06/03/random-matrices-universality-of-local-eigenvalue-statistics/">paper of Van and myself</a>, we can remove all regularity, moment, and support conditions.  Roughly speaking, the unified argument proceeds as follows:</p>
<ol>
<li>(Structured case) By the arguments <a href="http://arxiv.org/abs/0905.4176">of Erdős-Ramírez-Schlein-Yau</a>, the universality conjecture is true for Ornstein-Uhlenbeck-evolved matrices with <img src='http://s2.wordpress.com/latex.php?latex=t+%3D+n%5E%7B-1%2B%5Cdelta%7D&#038;bg=ffffff&#038;fg=545454&#038;s=0' alt='t = n^{-1+\delta}' title='t = n^{-1+\delta}' class='latex' /> for any <img src='http://s3.wordpress.com/latex.php?latex=0+%3C+%5Cdelta+%5Cleq+1&#038;bg=ffffff&#038;fg=545454&#038;s=0' alt='0 &lt; \delta \leq 1' title='0 &lt; \delta \leq 1' class='latex' />.</li>
<li>(Matching) Every Wigner matrix <img src='http://s1.wordpress.com/latex.php?latex=H&#038;bg=ffffff&#038;fg=545454&#038;s=0' alt='H' title='H' class='latex' /> can be &#8220;matched&#8221; with an Ornstein-Uhlenbeck-evolved matrix <img src='http://s2.wordpress.com/latex.php?latex=e%5E%7B-t%2F2%7D+H+%2B+%281-e%5E%7B-t%7D%29%5E%7B1%2F2%7D+V&#038;bg=ffffff&#038;fg=545454&#038;s=0' alt='e^{-t/2} H + (1-e^{-t})^{1/2} V' title='e^{-t/2} H + (1-e^{-t})^{1/2} V' class='latex' /> for <img src='http://s3.wordpress.com/latex.php?latex=t%3D+n%5E%7B-1%2B0.01%7D&#038;bg=ffffff&#038;fg=545454&#038;s=0' alt='t= n^{-1+0.01}' title='t= n^{-1+0.01}' class='latex' /> (say), in the sense that the first four moments of the entries almost agree, which is enough (by the arguments <a href="../2009/06/03/random-matrices-universality-of-local-eigenvalue-statistics/">of Van and myself</a>) to show that these two matrices have asymptotically identical statistics on the average.</li>
<li>Combining 1. and 2. one obtains universality for the averaged statistics for all Wigner matrices.</li>
</ol>
<p>The averaging should be removable, but this would require better convergence results to the semicircular law than are currently known (except with additional hypotheses, such as vanishing third moment).  The subexponential decay should also be relaxed to a condition of finiteness for some fixed moment <img src='http://s1.wordpress.com/latex.php?latex=%7B%5CBbb+E%7D+%7Cx%7C%5EC&#038;bg=ffffff&#038;fg=545454&#038;s=0' alt='{\Bbb E} |x|^C' title='{\Bbb E} |x|^C' class='latex' />, but we did not pursue this direction in order to keep the paper short.</p>
  <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gocomments/terrytao.wordpress.com/2353/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/comments/terrytao.wordpress.com/2353/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/godelicious/terrytao.wordpress.com/2353/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/delicious/terrytao.wordpress.com/2353/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gostumble/terrytao.wordpress.com/2353/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/stumble/terrytao.wordpress.com/2353/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/godigg/terrytao.wordpress.com/2353/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/digg/terrytao.wordpress.com/2353/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/goreddit/terrytao.wordpress.com/2353/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/reddit/terrytao.wordpress.com/2353/" /></a> <img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=terrytao.wordpress.com&blog=817149&post=2353&subd=terrytao&ref=&feed=1" /></div>]]></content:encoded>
			<wfw:commentRss>http://terrytao.wordpress.com/2009/06/26/bulk-universality-for-wigner-hermitian-matrices-with-subexponential-decay/feed/</wfw:commentRss>
		<slash:comments>1</slash:comments>
	
		<media:content url="http://1.gravatar.com/avatar/3c795880f3b73784a9b75fbff3772701?s=96&#38;d=identicon" medium="image">
			<media:title type="html">Terry</media:title>
		</media:content>
	</item>
		<item>
		<title>Sumset and inverse sumset theorems for Shannon entropy</title>
		<link>http://terrytao.wordpress.com/2009/06/25/sumset-and-inverse-sumset-theorems-for-shannon-entropy/</link>
		<comments>http://terrytao.wordpress.com/2009/06/25/sumset-and-inverse-sumset-theorems-for-shannon-entropy/#comments</comments>
		<pubDate>Fri, 26 Jun 2009 01:12:31 +0000</pubDate>
		<dc:creator>Terence Tao</dc:creator>
				<category><![CDATA[math.CO]]></category>
		<category><![CDATA[math.PR]]></category>
		<category><![CDATA[paper]]></category>
		<category><![CDATA[Freiman's theorem]]></category>
		<category><![CDATA[Shannon entropy]]></category>
		<category><![CDATA[sum set estimates]]></category>

		<guid isPermaLink="false">http://terrytao.wordpress.com/?p=2348</guid>
		<description><![CDATA[It turns out to be a favourable week or two for me to finally finish a number of papers that had been at a nearly completed stage for a while.  I have just uploaded to the arXiv my article &#8220;Sumset and inverse sumset theorems for Shannon entropy&#8220;, submitted to Combinatorics, Probability, and Computing.  This paper [...]<img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=terrytao.wordpress.com&blog=817149&post=2348&subd=terrytao&ref=&feed=1" />]]></description>
			<content:encoded><![CDATA[<div class='snap_preview'><br /><p>It turns out to be a favourable week or two for me to finally finish a number of papers that had been at a nearly completed stage for a while.  I have just uploaded to the <a href="http://www.arxiv.org">arXiv</a> my article &#8220;<a href="http://arxiv.org/abs/0906.4387">Sumset and inverse sumset theorems for Shannon entropy</a>&#8220;, submitted to <a href="http://journals.cambridge.org/action/displayMoreInfo?jid=CPC&amp;type=ifc">Combinatorics, Probability, and Computing</a>.  This paper evolved from a &#8220;deleted scene&#8221; in <a href="http://terrytao.wordpress.com/books/additive-combinatorics/">my book with Van Vu</a> entitled &#8220;<a href="http://www.math.ucla.edu/~tao/preprints/Expository/chapter_entropy.dvi">Entropy sumset estimates</a>&#8220;.  In those notes, we developed analogues of the standard Plünnecke-Ruzsa sumset estimates (which relate quantities such as the cardinalities <img src='http://s2.wordpress.com/latex.php?latex=%7CA%2BB%7C%2C+%7CA-B%7C&#038;bg=ffffff&#038;fg=545454&#038;s=0' alt='|A+B|, |A-B|' title='|A+B|, |A-B|' class='latex' /> of the sum and difference sets of two finite sets <img src='http://s3.wordpress.com/latex.php?latex=A%2C+B&#038;bg=ffffff&#038;fg=545454&#038;s=0' alt='A, B' title='A, B' class='latex' /> in an additive group <img src='http://s1.wordpress.com/latex.php?latex=G&#038;bg=ffffff&#038;fg=545454&#038;s=0' alt='G' title='G' class='latex' /> to each other), to the entropy setting, in which the finite sets <img src='http://s2.wordpress.com/latex.php?latex=A+%5Csubset+G&#038;bg=ffffff&#038;fg=545454&#038;s=0' alt='A \subset G' title='A \subset G' class='latex' /> are replaced instead with discrete random variables <img src='http://s3.wordpress.com/latex.php?latex=X&#038;bg=ffffff&#038;fg=545454&#038;s=0' alt='X' title='X' class='latex' /> taking values in that group G, and the (logarithm of the) cardinality |A| is replaced with the <a href="http://en.wikipedia.org/wiki/Entropy_(information_theory)">Shannon entropy</a></p>
<p style="text-align:center;"><img src='http://s1.wordpress.com/latex.php?latex=%7B%5Ctextbf+H%7D%28X%29+%3A%3D+%5Csum_%7Bx+%5Cin+G%7D+%7B%5CBbb+P%7D%28x+%5Cin+X%29+%5Clog+%5Cfrac%7B1%7D%7B%7B%5CBbb+P%7D%28x+%5Cin+X%29%7D.&#038;bg=ffffff&#038;fg=545454&#038;s=0' alt='{\textbf H}(X) := \sum_{x \in G} {\Bbb P}(x \in X) \log \frac{1}{{\Bbb P}(x \in X)}.' title='{\textbf H}(X) := \sum_{x \in G} {\Bbb P}(x \in X) \log \frac{1}{{\Bbb P}(x \in X)}.' class='latex' /></p>
<p>This quantity measures the information content of X; for instance, if <img src='http://s2.wordpress.com/latex.php?latex=%7B%5Ctextbf+H%7D%28X%29+%3D+k+%5Clog+2&#038;bg=ffffff&#038;fg=545454&#038;s=0' alt='{\textbf H}(X) = k \log 2' title='{\textbf H}(X) = k \log 2' class='latex' />, then it will take k bits on the average to store the value of X (thus a string of n independent copies of X will require about nk bits of storage in the asymptotic limit <img src='http://s3.wordpress.com/latex.php?latex=n+%5Cto+%5Cinfty&#038;bg=ffffff&#038;fg=545454&#038;s=0' alt='n \to \infty' title='n \to \infty' class='latex' />).  The relationship between entropy and cardinality is that if X is the uniform distribution on a finite non-empty set A, then <img src='http://s1.wordpress.com/latex.php?latex=%7B%5Ctextbf+H%7D%28X%29+%3D+%5Clog+%7CA%7C&#038;bg=ffffff&#038;fg=545454&#038;s=0' alt='{\textbf H}(X) = \log |A|' title='{\textbf H}(X) = \log |A|' class='latex' />.  If instead X is non-uniformly distributed on A, one has <img src='http://s2.wordpress.com/latex.php?latex=0+%3C+%7B%5Ctextbf+H%7D%28X%29+%3C+%5Clog+%7CA%7C&#038;bg=ffffff&#038;fg=545454&#038;s=0' alt='0 &lt; {\textbf H}(X) &lt; \log |A|' title='0 &lt; {\textbf H}(X) &lt; \log |A|' class='latex' />, thanks to <a href="http://en.wikipedia.org/wiki/Jensen%27s_inequality">Jensen&#8217;s inequality</a>.</p>
<p>It turns out that many estimates on sumsets have entropy analogues, which resemble the &#8220;logarithm&#8221; of the sumset estimates.  For instance, the trivial bounds</p>
<p style="text-align:center;"><img src='http://s3.wordpress.com/latex.php?latex=%7CA%7C%2C+%7CB%7C+%5Cleq+%7CA%2BB%7C+%5Cleq+%7CA%7C+%7CB%7C&#038;bg=ffffff&#038;fg=545454&#038;s=0' alt='|A|, |B| \leq |A+B| \leq |A| |B|' title='|A|, |B| \leq |A+B| \leq |A| |B|' class='latex' /></p>
<p>have the entropy analogue</p>
<p style="text-align:center;"><img src='http://s1.wordpress.com/latex.php?latex=%7B%5Ctextbf+H%7D%28X%29%2C+%7B%5Ctextbf+H%7D%28Y%29+%5Cleq+%7B%5Ctextbf+H%7D%28X%2BY%29+%5Cleq+%7B%5Ctextbf+H%7D%28X%29+%2B+%7B%5Ctextbf+H%7D%28Y%29&#038;bg=ffffff&#038;fg=545454&#038;s=0' alt='{\textbf H}(X), {\textbf H}(Y) \leq {\textbf H}(X+Y) \leq {\textbf H}(X) + {\textbf H}(Y)' title='{\textbf H}(X), {\textbf H}(Y) \leq {\textbf H}(X+Y) \leq {\textbf H}(X) + {\textbf H}(Y)' class='latex' /></p>
<p>whenever X, Y are independent discrete random variables in an additive group; this is not difficult to deduce from standard entropy inequalities.  Slightly more non-trivially, the sum set estimate</p>
<p style="text-align:center;"><img src='http://s2.wordpress.com/latex.php?latex=%7CA%2BB%7C+%5Cleq+%5Cfrac%7B%7CA-B%7C%5E3%7D%7B%7CA%7C+%7CB%7C%7D&#038;bg=ffffff&#038;fg=545454&#038;s=0' alt='|A+B| \leq \frac{|A-B|^3}{|A| |B|}' title='|A+B| \leq \frac{|A-B|^3}{|A| |B|}' class='latex' /></p>
<p>established by Ruzsa, has an entropy analogue</p>
<p style="text-align:center;"><img src='http://s3.wordpress.com/latex.php?latex=%7B%5Ctextbf+H%7D%28X%2BY%29+%5Cleq+3+%7B%5Ctextbf+H%7D%28X-Y%29+-+%7B%5Ctextbf+H%7D%28X%29+-+%7B%5Ctextbf+H%7D%28Y%29&#038;bg=ffffff&#038;fg=545454&#038;s=0' alt='{\textbf H}(X+Y) \leq 3 {\textbf H}(X-Y) - {\textbf H}(X) - {\textbf H}(Y)' title='{\textbf H}(X+Y) \leq 3 {\textbf H}(X-Y) - {\textbf H}(X) - {\textbf H}(Y)' class='latex' />,</p>
<p>and similarly for a number of other standard sumset inequalities in the literature (e.g. the Rusza triangle inequality, the Plünnecke-Rusza inequality, and the Balog-Szemeredi-Gowers theorem, though the entropy analogue of the latter requires a little bit of care to state).  These inequalities can actually be deduced fairly easily from elementary arithmetic identities, together with standard entropy inequalities, most notably the <em>submodularity inequality</em></p>
<p style="text-align:center;"><img src='http://s1.wordpress.com/latex.php?latex=%7B%5Ctextbf+H%7D%28Z%29+%2B+%7B%5Ctextbf+H%7D%28W%29+%5Cleq+%7B%5Ctextbf+H%7D%28X%29+%2B+%7B%5Ctextbf+H%7D%28Y%29&#038;bg=ffffff&#038;fg=545454&#038;s=0' alt='{\textbf H}(Z) + {\textbf H}(W) \leq {\textbf H}(X) + {\textbf H}(Y)' title='{\textbf H}(Z) + {\textbf H}(W) \leq {\textbf H}(X) + {\textbf H}(Y)' class='latex' /></p>
<p>whenever X,Y,Z,W are discrete random variables such that X and Y each determine W separately (thus <img src='http://s2.wordpress.com/latex.php?latex=W+%3D+f%28X%29+%3D+g%28Y%29&#038;bg=ffffff&#038;fg=545454&#038;s=0' alt='W = f(X) = g(Y)' title='W = f(X) = g(Y)' class='latex' /> for some deterministic functions f, g) and X and Y determine Z jointly (thus <img src='http://s3.wordpress.com/latex.php?latex=Z+%3D+h%28X%2CY%29&#038;bg=ffffff&#038;fg=545454&#038;s=0' alt='Z = h(X,Y)' title='Z = h(X,Y)' class='latex' /> for some deterministic function f).  For instance, if X,Y,Z are independent discrete random variables in an additive group G, then <img src='http://s1.wordpress.com/latex.php?latex=%28X-Y%2CY-Z%29&#038;bg=ffffff&#038;fg=545454&#038;s=0' alt='(X-Y,Y-Z)' title='(X-Y,Y-Z)' class='latex' /> and <img src='http://s2.wordpress.com/latex.php?latex=%28X%2CZ%29&#038;bg=ffffff&#038;fg=545454&#038;s=0' alt='(X,Z)' title='(X,Z)' class='latex' /> each determine <img src='http://s3.wordpress.com/latex.php?latex=X-Z&#038;bg=ffffff&#038;fg=545454&#038;s=0' alt='X-Z' title='X-Z' class='latex' /> separately, and determine <img src='http://s1.wordpress.com/latex.php?latex=X%2CY%2CZ&#038;bg=ffffff&#038;fg=545454&#038;s=0' alt='X,Y,Z' title='X,Y,Z' class='latex' /> jointly, leading to the inequality</p>
<p style="text-align:center;"><img src='http://s2.wordpress.com/latex.php?latex=%7B%5Ctextbf+H%7D%28X%2CY%2CZ%29+%2B+%7B%5Ctextbf+H%7D%28X-Z%29+%5Cleq+%7B%5Ctextbf+H%7D%28X-Y%2CY-Z%29+%2B+%7B%5Ctextbf+H%7D%28X%2CZ%29&#038;bg=ffffff&#038;fg=545454&#038;s=0' alt='{\textbf H}(X,Y,Z) + {\textbf H}(X-Z) \leq {\textbf H}(X-Y,Y-Z) + {\textbf H}(X,Z)' title='{\textbf H}(X,Y,Z) + {\textbf H}(X-Z) \leq {\textbf H}(X-Y,Y-Z) + {\textbf H}(X,Z)' class='latex' /></p>
<p>which soon leads to the <em>entropy Rusza triangle inequality</em></p>
<p style="text-align:center;"><img src='http://s3.wordpress.com/latex.php?latex=%7B%5Ctextbf+H%7D%28X-Z%29+%5Cleq+%7B%5Ctextbf+H%7D%28X-Y%29+%2B+%7B%5Ctextbf+H%7D%28Y-Z%29+-+%7B%5Ctextbf+H%7D%28Y%29&#038;bg=ffffff&#038;fg=545454&#038;s=0' alt='{\textbf H}(X-Z) \leq {\textbf H}(X-Y) + {\textbf H}(Y-Z) - {\textbf H}(Y)' title='{\textbf H}(X-Z) \leq {\textbf H}(X-Y) + {\textbf H}(Y-Z) - {\textbf H}(Y)' class='latex' /></p>
<p>which is an analogue of the combinatorial Ruzsa triangle inequality</p>
<p style="text-align:center;"><img src='http://s1.wordpress.com/latex.php?latex=%7CA-C%7C+%5Cleq+%5Cfrac%7B%7CA-B%7C+%7CB-C%7C%7D%7B%7CB%7C%7D.&#038;bg=ffffff&#038;fg=545454&#038;s=0' alt='|A-C| \leq \frac{|A-B| |B-C|}{|B|}.' title='|A-C| \leq \frac{|A-B| |B-C|}{|B|}.' class='latex' /></p>
<p>All of this was already in the unpublished notes with Van, though I include it in this paper in order to place it in the literature.  The main novelty of the paper, though, is to consider the entropy analogue of Freiman&#8217;s theorem, which classifies those sets A for which <img src='http://s2.wordpress.com/latex.php?latex=%7CA%2BA%7C+%3D+O%28%7CA%7C%29&#038;bg=ffffff&#038;fg=545454&#038;s=0' alt='|A+A| = O(|A|)' title='|A+A| = O(|A|)' class='latex' />.  Here, the analogous problem is to classify the random variables <img src='http://s3.wordpress.com/latex.php?latex=X&#038;bg=ffffff&#038;fg=545454&#038;s=0' alt='X' title='X' class='latex' /> such that <img src='http://s1.wordpress.com/latex.php?latex=%7B%5Ctextbf+H%7D%28X_1%2BX_2%29+%3D+%7B%5Ctextbf+H%7D%28X%29+%2B+O%281%29&#038;bg=ffffff&#038;fg=545454&#038;s=0' alt='{\textbf H}(X_1+X_2) = {\textbf H}(X) + O(1)' title='{\textbf H}(X_1+X_2) = {\textbf H}(X) + O(1)' class='latex' />, where <img src='http://s2.wordpress.com/latex.php?latex=X_1%2CX_2&#038;bg=ffffff&#038;fg=545454&#038;s=0' alt='X_1,X_2' title='X_1,X_2' class='latex' /> are independent copies of X.  Let us say that X has <em>small doubling</em> if this is the case.</p>
<p>For instance, the uniform distribution U on a finite subgroup H of G has small doubling (in fact <img src='http://s3.wordpress.com/latex.php?latex=%7B%5Ctextbf+H%7D%28U_1%2BU_2%29%3D%7B%5Ctextbf+H%7D%28U%29+%3D+%5Clog+%7CH%7C&#038;bg=ffffff&#038;fg=545454&#038;s=0' alt='{\textbf H}(U_1+U_2)={\textbf H}(U) = \log |H|' title='{\textbf H}(U_1+U_2)={\textbf H}(U) = \log |H|' class='latex' /> in this case). In a similar spirit, the uniform distribution on a (generalised) arithmetic progression P also has small doubling, as does the uniform distribution on a coset progression H+P.  Also, if X has small doubling, and Y has bounded entropy, then X+Y also has small doubling, even if Y and X are not independent.  The main theorem is that these are the only cases:</p>
<blockquote><p><strong>Theorem 1.</strong> (Informal statement) X has small doubling if and only if <img src='http://s1.wordpress.com/latex.php?latex=X+%3D+U+%2B+Y&#038;bg=ffffff&#038;fg=545454&#038;s=0' alt='X = U + Y' title='X = U + Y' class='latex' /> for some uniform distribution U on a coset progression (of bounded rank), and Y has bounded entropy.</p></blockquote>
<p>For instance, suppose that X was the uniform distribution on a dense subset A of a finite group G.  Then Theorem 1 asserts that X is close in a &#8220;transport metric&#8221; sense to the uniform distribution U on G, in the sense that it is possible to rearrange or transport the probability distribution of X to the probability distribution of U (or vice versa) by shifting each component of the mass of X by an amount Y which has bounded entropy (which basically means that it primarily ranges inside a set of bounded cardinality).  The way one shows this is by randomly translating the mass of X around by a few random shifts to approximately uniformise the distribution, and then deal with the residual fluctuation in the distribution by hand.  Theorem 1 as a whole is established by using the Freiman theorem in the combinatorial setting combined with various elementary convexity and entropy inequality arguments to reduce matters to the above model case when X is supported inside a finite group G and has near-maximal entropy.</p>
<p>I also show a variant of the above statement: if X, Y are independent and <img src='http://s2.wordpress.com/latex.php?latex=%7B%5Ctextbf+H%7D%28X%2BY%29+%3D+%7B%5Ctextbf+H%7D%28X%29%2BO%281%29+%3D+%7B%5Ctextbf+H%7D%28Y%29%2BO%281%29&#038;bg=ffffff&#038;fg=545454&#038;s=0' alt='{\textbf H}(X+Y) = {\textbf H}(X)+O(1) = {\textbf H}(Y)+O(1)' title='{\textbf H}(X+Y) = {\textbf H}(X)+O(1) = {\textbf H}(Y)+O(1)' class='latex' />, then we have <img src='http://s3.wordpress.com/latex.php?latex=X+%5Cequiv+Y%2BZ&#038;bg=ffffff&#038;fg=545454&#038;s=0' alt='X \equiv Y+Z' title='X \equiv Y+Z' class='latex' /> (i.e. X has the same distribution as Y+Z for some Z of bounded entropy (not necessarily independent of X or Y).  Thus if two random variables are additively related to each other, then they can be additively transported to each other by using a bounded amount of entropy.</p>
<p>In the last part of the paper I relate these discrete entropies to their continuous counterparts</p>
<p style="text-align:center;"><img src='http://s1.wordpress.com/latex.php?latex=%7B%5Ctextbf+H%7D_%7B%5CBbb+R%7D%28X%29+%3A%3D+%5Cint_%7B%7B%5CBbb+R%7D%7D+p%28x%29+%5Clog+%5Cfrac%7B1%7D%7Bp%28x%29%7D%5C+dx%2C&#038;bg=ffffff&#038;fg=545454&#038;s=0' alt='{\textbf H}_{\Bbb R}(X) := \int_{{\Bbb R}} p(x) \log \frac{1}{p(x)}\ dx,' title='{\textbf H}_{\Bbb R}(X) := \int_{{\Bbb R}} p(x) \log \frac{1}{p(x)}\ dx,' class='latex' /></p>
<p>where X is now a continuous random variable on the real line with density function <img src='http://s2.wordpress.com/latex.php?latex=p%28x%29%5C+dx&#038;bg=ffffff&#038;fg=545454&#038;s=0' alt='p(x)\ dx' title='p(x)\ dx' class='latex' />.  There are a number of sum set inequalities known in this setting, for instance</p>
<p style="text-align:center;"><img src='http://s3.wordpress.com/latex.php?latex=%7B%5Ctextbf+H%7D_%7B%5CBbb+R%7D%28X_1+%2B+X_2%29+%5Cgeq+%7B%5Ctextbf+H%7D_%7B%5CBbb+R%7D%28X%29+%2B+%5Cfrac%7B1%7D%7B2%7D+%5Clog+2&#038;bg=ffffff&#038;fg=545454&#038;s=0' alt='{\textbf H}_{\Bbb R}(X_1 + X_2) \geq {\textbf H}_{\Bbb R}(X) + \frac{1}{2} \log 2' title='{\textbf H}_{\Bbb R}(X_1 + X_2) \geq {\textbf H}_{\Bbb R}(X) + \frac{1}{2} \log 2' class='latex' />,</p>
<p>for independent copies <img src='http://s1.wordpress.com/latex.php?latex=X_1%2CX_2&#038;bg=ffffff&#038;fg=545454&#038;s=0' alt='X_1,X_2' title='X_1,X_2' class='latex' /> of a finite entropy random variable X, with equality if and only if X is a Gaussian.  Using this inequality and Theorem 1, I show a discrete version, namely that</p>
<p style="text-align:center;"><img src='http://s2.wordpress.com/latex.php?latex=%7B%5Ctextbf+H%7D%28X_1+%2B+X_2%29+%5Cgeq+%7B%5Ctextbf+H%7D%28X%29+%2B+%5Cfrac%7B1%7D%7B2%7D+%5Clog+2+-+%5Cvarepsilon&#038;bg=ffffff&#038;fg=545454&#038;s=0' alt='{\textbf H}(X_1 + X_2) \geq {\textbf H}(X) + \frac{1}{2} \log 2 - \varepsilon' title='{\textbf H}(X_1 + X_2) \geq {\textbf H}(X) + \frac{1}{2} \log 2 - \varepsilon' class='latex' />,</p>
<p>whenever <img src='http://s3.wordpress.com/latex.php?latex=%5Cvarepsilon%3E+0&#038;bg=ffffff&#038;fg=545454&#038;s=0' alt='\varepsilon&gt; 0' title='\varepsilon&gt; 0' class='latex' /> and <img src='http://s1.wordpress.com/latex.php?latex=X_1%2CX_2&#038;bg=ffffff&#038;fg=545454&#038;s=0' alt='X_1,X_2' title='X_1,X_2' class='latex' /> are independent copies of a random variable in <img src='http://s2.wordpress.com/latex.php?latex=%7B%5CBbb+Z%7D&#038;bg=ffffff&#038;fg=545454&#038;s=0' alt='{\Bbb Z}' title='{\Bbb Z}' class='latex' /> (or any other torsion-free abelian group) whose entropy is sufficiently large depending on <img src='http://s3.wordpress.com/latex.php?latex=%5Cvarepsilon&#038;bg=ffffff&#038;fg=545454&#038;s=0' alt='\varepsilon' title='\varepsilon' class='latex' />.  This is somewhat analogous to the classical sumset inequality</p>
<p style="text-align:center;"><img src='http://s1.wordpress.com/latex.php?latex=%7CA%2BA%7C+%5Cgeq+2+%7CA%7C+-+1&#038;bg=ffffff&#038;fg=545454&#038;s=0' alt='|A+A| \geq 2 |A| - 1' title='|A+A| \geq 2 |A| - 1' class='latex' /></p>
<p>though notice that we have a gain of just <img src='http://s2.wordpress.com/latex.php?latex=%5Cfrac%7B1%7D%7B2%7D+%5Clog+2&#038;bg=ffffff&#038;fg=545454&#038;s=0' alt='\frac{1}{2} \log 2' title='\frac{1}{2} \log 2' class='latex' /> rather than <img src='http://s3.wordpress.com/latex.php?latex=%5Clog+2&#038;bg=ffffff&#038;fg=545454&#038;s=0' alt='\log 2' title='\log 2' class='latex' /> here, the point being that there is a Gaussian counterexample in the entropy setting which does not have a combinatorial analogue (except perhaps in the high-dimensional limit).  The main idea is to use Theorem 1 to trap most of X inside a coset progression, at which point one can use Fourier-analytic additive combinatorial tools to show that the distribution <img src='http://s1.wordpress.com/latex.php?latex=X_1%2BX_2&#038;bg=ffffff&#038;fg=545454&#038;s=0' alt='X_1+X_2' title='X_1+X_2' class='latex' /> is &#8220;smooth&#8221; in some non-trivial direction r, which can then be used to approximate the discrete distribution by a continuous one.</p>
<p>I also conjecture more generally that the entropy monotonicity inequalities established <a href="http://www.cims.nyu.edu/~naor/homepage%20files/growth.pdf">by Artstein, Barthe, Ball, and Naor</a> in the continuous case also hold in the above sense in the discrete case, though my method of proof breaks down because I no longer can assume small doubling.</p>
  <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gocomments/terrytao.wordpress.com/2348/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/comments/terrytao.wordpress.com/2348/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/godelicious/terrytao.wordpress.com/2348/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/delicious/terrytao.wordpress.com/2348/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gostumble/terrytao.wordpress.com/2348/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/stumble/terrytao.wordpress.com/2348/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/godigg/terrytao.wordpress.com/2348/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/digg/terrytao.wordpress.com/2348/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/goreddit/terrytao.wordpress.com/2348/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/reddit/terrytao.wordpress.com/2348/" /></a> <img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=terrytao.wordpress.com&blog=817149&post=2348&subd=terrytao&ref=&feed=1" /></div>]]></content:encoded>
			<wfw:commentRss>http://terrytao.wordpress.com/2009/06/25/sumset-and-inverse-sumset-theorems-for-shannon-entropy/feed/</wfw:commentRss>
		<slash:comments>3</slash:comments>
	
		<media:content url="http://1.gravatar.com/avatar/3c795880f3b73784a9b75fbff3772701?s=96&#38;d=identicon" medium="image">
			<media:title type="html">Terry</media:title>
		</media:content>
	</item>
		<item>
		<title>Freiman&#8217;s theorem for solvable groups</title>
		<link>http://terrytao.wordpress.com/2009/06/21/freimans-theorem-for-solvable-groups/</link>
		<comments>http://terrytao.wordpress.com/2009/06/21/freimans-theorem-for-solvable-groups/#comments</comments>
		<pubDate>Sun, 21 Jun 2009 21:32:16 +0000</pubDate>
		<dc:creator>Terence Tao</dc:creator>
				<category><![CDATA[math.CO]]></category>
		<category><![CDATA[math.GR]]></category>
		<category><![CDATA[paper]]></category>
		<category><![CDATA[Freiman's theorem]]></category>
		<category><![CDATA[Milnor-Wolf theorem]]></category>
		<category><![CDATA[nilpotent groups]]></category>
		<category><![CDATA[solvable groups]]></category>

		<guid isPermaLink="false">http://terrytao.wordpress.com/?p=2334</guid>
		<description><![CDATA[I&#8217;m continuing the stream of uploaded papers this week with my paper &#8220;Freiman&#8217;s theorem for solvable groups&#8220;, submitted to Contrib. Disc. Math..  This paper concerns the problem (discussed in this earlier blog post) of determining the correct analogue of Freiman&#8217;s theorem in a general non-abelian group .  Specifically, if  is a finite set that [...]<img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=terrytao.wordpress.com&blog=817149&post=2334&subd=terrytao&ref=&feed=1" />]]></description>
			<content:encoded><![CDATA[<div class='snap_preview'><br /><p>I&#8217;m continuing the stream of uploaded papers this week with my paper &#8220;<a href="http://arxiv.org/abs/0906.3535">Freiman&#8217;s theorem for solvable groups</a>&#8220;, submitted to <a href="http://cdm.ucalgary.ca/index.php/cdm">Contrib. Disc. Math.</a>.  This paper concerns the problem (discussed in <a href="http://terrytao.wordpress.com/2007/03/02/open-question-noncommutative-freiman-theorem/">this earlier blog post</a>) of determining the correct analogue of Freiman&#8217;s theorem in a general non-abelian group <img src='http://s2.wordpress.com/latex.php?latex=G+%3D+%28G%2C%5Ccdot%29&#038;bg=ffffff&#038;fg=545454&#038;s=0' alt='G = (G,\cdot)' title='G = (G,\cdot)' class='latex' />.  Specifically, if <img src='http://s3.wordpress.com/latex.php?latex=A+%5Csubset+G&#038;bg=ffffff&#038;fg=545454&#038;s=0' alt='A \subset G' title='A \subset G' class='latex' /> is a finite set that obeys the doubling condition <img src='http://s1.wordpress.com/latex.php?latex=%7CA+%5Ccdot+A%7C+%5Cleq+K%7CA%7C&#038;bg=ffffff&#038;fg=545454&#038;s=0' alt='|A \cdot A| \leq K|A|' title='|A \cdot A| \leq K|A|' class='latex' /> for some bounded K, what does this tell us about A?  Heuristically, we expect A to behave like a finite subgroup of G (or perhaps a coset of such a subgroup).</p>
<p>When G is the integers (with the additive group operation), Freiman&#8217;s theorem then tells us that A is controlled by a generalised arithmetic progression P, where I say that one set A is controlled by another P if they have comparable size, and the former can be covered by a finite number of translates of the latter.  (One can view generalised arithmetic progressions as an approximate version of a subgroup, in which one only uses the generators of the progression for a finite amount of time before stopping, as opposed to groups which allow words of unbounded length in the generators.) For more general abelian groups, the Freiman theorem <a href="http://www.arxiv.org/abs/math.NT/0505198">of Green and Ruzsa</a> tells us that a set of bounded doubling is controlled by a generalised coset progression <img src='http://s2.wordpress.com/latex.php?latex=P%2BH&#038;bg=ffffff&#038;fg=545454&#038;s=0' alt='P+H' title='P+H' class='latex' />, i.e. the sum of a generalised arithmetic progression P and a finite subgroup H of G.  (Of course, if G is torsion-free, the finite subgroup H must be trivial.)</p>
<p>In this paper we address the case when G is a <a href="http://en.wikipedia.org/wiki/Solvable_group">solvable group</a> of bounded derived length.  The main result is that if a subset of G has small doubing, then it is controlled by an object which I call a &#8220;coset nilprogression&#8221;, which is a certain technical generalisation of a coset progression, in which the generators do not quite commute, but have commutator expressible in terms of &#8220;higher order&#8221; generators.  This is essentially a sharp characterisation of such sets, except for the fact that one would like a more explicit description of these coset nilprogressions.   In the torsion-free case, a more explicit description (analogous to the Mal&#8217;cev basis description of nilpotent groups) has appeared in a very recent <a href="http://arxiv.org/abs/0906.3598">paper of Breulliard and Green</a>; in the case of monomial groups (a class of groups that overlaps to a large extent with solvable groups), and assuming a polynomial growth condition rather than a doubling condition, a related result controlling A by balls in a suitable type of metric has appeared in very recent work of Sanders.  In the nilpotent case there is also a nice recent <a href="http://arxiv.org/abs/0901.1409">argument of Fisher, Peng, and Katz</a> which shows that sets of small doubling remain of small doubling with respect to the Lie algebra operations of addition and Lie bracket, and thus are amenable to the abelian Freiman theorems.</p>
<p>The conclusion of my paper is easiest to state (and easiest to prove) in the model case of the <a href="http://en.wikipedia.org/wiki/Lamplighter_group">lamplighter group</a> <img src='http://s3.wordpress.com/latex.php?latex=G+%3D+%7B%5CBbb+Z%7D+%5Crtimes+%7B%5CBbb+F%7D_2%5E%5Comega&#038;bg=ffffff&#038;fg=545454&#038;s=0' alt='G = {\Bbb Z} \rtimes {\Bbb F}_2^\omega' title='G = {\Bbb Z} \rtimes {\Bbb F}_2^\omega' class='latex' />, where <img src='http://s1.wordpress.com/latex.php?latex=%7B%5CBbb+F%7D_2%5E%5Comega+%3D+%5Clim_%7Bn+%5Cto+%5Cinfty%7D+%7B%5CBbb+F%7D_2%5En&#038;bg=ffffff&#038;fg=545454&#038;s=0' alt='{\Bbb F}_2^\omega = \lim_{n \to \infty} {\Bbb F}_2^n' title='{\Bbb F}_2^\omega = \lim_{n \to \infty} {\Bbb F}_2^n' class='latex' /> is the additive group of doubly infinite sequences in the finite field <img src='http://s2.wordpress.com/latex.php?latex=%7B%5CBbb+F%7D_2&#038;bg=ffffff&#038;fg=545454&#038;s=0' alt='{\Bbb F}_2' title='{\Bbb F}_2' class='latex' /> with only finitely many non-zero entries, and <img src='http://s3.wordpress.com/latex.php?latex=%7B%5CBbb+Z%7D&#038;bg=ffffff&#038;fg=545454&#038;s=0' alt='{\Bbb Z}' title='{\Bbb Z}' class='latex' /> acts on this space by translations.  This is a solvable group of derived length two.  The main result here is</p>
<blockquote><p><strong>Theorem 1.</strong> (Freiman&#8217;s theorem for the lamplighter group) If <img src='http://s1.wordpress.com/latex.php?latex=A+%5Csubset+%7B%5CBbb+Z%7D+%5Cltimes+%7B%5CBbb+F%7D_2%5E%5Comega&#038;bg=ffffff&#038;fg=545454&#038;s=0' alt='A \subset {\Bbb Z} \ltimes {\Bbb F}_2^\omega' title='A \subset {\Bbb Z} \ltimes {\Bbb F}_2^\omega' class='latex' /> has bounded doubling, then A is controlled either by a finite subspace of the &#8220;vertical&#8221; group <img src='http://s2.wordpress.com/latex.php?latex=%5C%7B0%5C%7D+%5Ctimes+%7B%5CBbb+F%7D_2%5E%5Comega&#038;bg=ffffff&#038;fg=545454&#038;s=0' alt='\{0\} \times {\Bbb F}_2^\omega' title='\{0\} \times {\Bbb F}_2^\omega' class='latex' />, or else by a set of the form <img src='http://s3.wordpress.com/latex.php?latex=%5C%7B+%28n%2C%5Cphi%28n%29%29%3A+n+%5Cin+P+%5C%7D&#038;bg=ffffff&#038;fg=545454&#038;s=0' alt='\{ (n,\phi(n)): n \in P \}' title='\{ (n,\phi(n)): n \in P \}' class='latex' />, where <img src='http://s1.wordpress.com/latex.php?latex=P+%5Csubset+%7B%5CBbb+Z%7D&#038;bg=ffffff&#038;fg=545454&#038;s=0' alt='P \subset {\Bbb Z}' title='P \subset {\Bbb Z}' class='latex' /> is a generalised arithmetic progression, and <img src='http://s2.wordpress.com/latex.php?latex=%5Cphi%3A+P+%5Cto+%7B%5CBbb+F%7D_2%5E%7B%5Comega%7D&#038;bg=ffffff&#038;fg=545454&#038;s=0' alt='\phi: P \to {\Bbb F}_2^{\omega}' title='\phi: P \to {\Bbb F}_2^{\omega}' class='latex' /> obeys the Freiman isomorphism property <img src='http://s3.wordpress.com/latex.php?latex=%28n_1%2C%5Cphi%28n_1%29%29+%5Ccdot+%28n_2%2C+%5Cphi%28n_2%29%29+%3D+%28n_3%2C%5Cphi%28n_3%29%29+%5Ccdot+%28n_4%2C%5Cphi%28n_4%29%29&#038;bg=ffffff&#038;fg=545454&#038;s=0' alt='(n_1,\phi(n_1)) \cdot (n_2, \phi(n_2)) = (n_3,\phi(n_3)) \cdot (n_4,\phi(n_4))' title='(n_1,\phi(n_1)) \cdot (n_2, \phi(n_2)) = (n_3,\phi(n_3)) \cdot (n_4,\phi(n_4))' class='latex' /> whenever <img src='http://s1.wordpress.com/latex.php?latex=n_1%2Cn_2%2Cn_3%2Cn_4+%5Cin+P&#038;bg=ffffff&#038;fg=545454&#038;s=0' alt='n_1,n_2,n_3,n_4 \in P' title='n_1,n_2,n_3,n_4 \in P' class='latex' /> and <img src='http://s2.wordpress.com/latex.php?latex=n_1%2Bn_2%3Dn_3%2Bn_4&#038;bg=ffffff&#038;fg=545454&#038;s=0' alt='n_1+n_2=n_3+n_4' title='n_1+n_2=n_3+n_4' class='latex' />.</p></blockquote>
<p>This result, incidentally, recovers an earlier <a href="http://www.ams.org/mathscinet-getitem?mr=1865397">result of Lindenstrauss</a> that the lamplighter group does not contain a <a href="http://en.wikipedia.org/wiki/F%C3%B8lner_sequence">Følner sequence</a> of sets of uniformly bounded doubling.  It is a good exercise to establish the &#8220;exact&#8221; version of this theorem, in which one classifies subgroups of the lamplighter group rather than sets of small doubling; indeed, the proof of this the above theorem follows fairly closely the natural proof of the exact version.</p>
<p>One application of the solvable Freiman theorem is the following quantitative version of a classical result <a href="http://www.ams.org/mathscinet-getitem?mr=232311">of Milnor</a> and <a href="http://www.ams.org/mathscinet-getitem?mr=248688">of Wolf</a>, which asserts that any solvable group of polynomial growth is virtually nilpotent:</p>
<blockquote>
<p style="text-align:left;"><strong>Theorem 2.</strong> (Quantitative Milnor-Wolf theorem) Let G be a solvable group of derived length O(1), let S be a set of generators for G, and suppose one has the polynomial growth condition <img src='http://s3.wordpress.com/latex.php?latex=%7CB_S%28R%29%7C+%5Cleq+R%5Ed&#038;bg=ffffff&#038;fg=545454&#038;s=0' alt='|B_S(R)| \leq R^d' title='|B_S(R)| \leq R^d' class='latex' /> for some d = O(1), where <img src='http://s1.wordpress.com/latex.php?latex=B_S%28R%29&#038;bg=ffffff&#038;fg=545454&#038;s=0' alt='B_S(R)' title='B_S(R)' class='latex' /> is the set of all words generated by S of length at most R.  If R is sufficiently large, then this implies that G is virtually nilpotent; more precisely, G contains a nilpotent subgroup of step O(1) and index <img src='http://s2.wordpress.com/latex.php?latex=O%28R%5E%7BO%281%29%7D%29&#038;bg=ffffff&#038;fg=545454&#038;s=0' alt='O(R^{O(1)})' title='O(R^{O(1)})' class='latex' />.</p>
</blockquote>
<p>The key points here are that one only needs polynomial growth at a single scale R, rather than on many scales, and that the index of the nilpotent subgroup has polynomial size.</p>
<p>The proofs are based on an induction on the derived length.  After some standard manipulations (basically, splitting A by an approximate version of a short exact sequence), the problem boils down to that of understanding the action <img src='http://s3.wordpress.com/latex.php?latex=%5Crho&#038;bg=ffffff&#038;fg=545454&#038;s=0' alt='\rho' title='\rho' class='latex' /> of some finite set A on a set E in an additive group.  If one assumes that E has small doubling and that the action of A leaves E approximately invariant, then one can show that E is a coset progression, and the action of A can be described efficiently using the generators of that progression (after refining the set A a bit).</p>
<p>In the course of the proof we need two new additive combinatorial results which may be of independent interest.  The first is a variant of a well-known <a href="http://www.ams.org/mathscinet-getitem?mr=1002119">theorem of Sárközy</a>, which asserts that if A is a large subset of an arithmetic progression P, then an iterated sumset kA of A for some <img src='http://s1.wordpress.com/latex.php?latex=k%3DO%281%29&#038;bg=ffffff&#038;fg=545454&#038;s=0' alt='k=O(1)' title='k=O(1)' class='latex' /> itself contains a long arithmetic progression. Here, we need the related fact that if A is a large subset of a coset progression, then an iterated subset kA for <img src='http://s2.wordpress.com/latex.php?latex=k%3DO%281%29&#038;bg=ffffff&#038;fg=545454&#038;s=0' alt='k=O(1)' title='k=O(1)' class='latex' /> contains a large coset progression Q, and furthermore this inclusion is &#8220;robust&#8221; in the sense that all elements the elements of Q have a large number of representations as sums of elements of A.  We also need a new (non-commutative) variant of the Balog-Szemerédi(-Gowers) lemma, which asserts that if A has small doubling, then A (or more precisely <img src='http://s3.wordpress.com/latex.php?latex=A+%5Ccdot+A%5E%7B-1%7D&#038;bg=ffffff&#038;fg=545454&#038;s=0' alt='A \cdot A^{-1}' title='A \cdot A^{-1}' class='latex' />) contains a large &#8220;core&#8221; subset D such that almost all of a large iterated subset kD of D still lies inside <img src='http://s1.wordpress.com/latex.php?latex=A+%5Ccdot+A%5E%7B-1%7D&#038;bg=ffffff&#038;fg=545454&#038;s=0' alt='A \cdot A^{-1}' title='A \cdot A^{-1}' class='latex' />).  (This may not look like the usual Balog-Szemerédi lemma, but the proof of the lemma is almost identical to the <a href="http://www.ams.org/mathscinet-getitem?mr=1305895">original proof of Balog and Szemerédi</a>, in particular relying on the <a href="http://en.wikipedia.org/wiki/Szemer%C3%A9di_regularity_lemma">Szemerédi regularity lemma</a>.</p>
  <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gocomments/terrytao.wordpress.com/2334/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/comments/terrytao.wordpress.com/2334/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/godelicious/terrytao.wordpress.com/2334/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/delicious/terrytao.wordpress.com/2334/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gostumble/terrytao.wordpress.com/2334/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/stumble/terrytao.wordpress.com/2334/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/godigg/terrytao.wordpress.com/2334/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/digg/terrytao.wordpress.com/2334/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/goreddit/terrytao.wordpress.com/2334/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/reddit/terrytao.wordpress.com/2334/" /></a> <img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=terrytao.wordpress.com&blog=817149&post=2334&subd=terrytao&ref=&feed=1" /></div>]]></content:encoded>
			<wfw:commentRss>http://terrytao.wordpress.com/2009/06/21/freimans-theorem-for-solvable-groups/feed/</wfw:commentRss>
		<slash:comments>10</slash:comments>
	
		<media:content url="http://1.gravatar.com/avatar/3c795880f3b73784a9b75fbff3772701?s=96&#38;d=identicon" medium="image">
			<media:title type="html">Terry</media:title>
		</media:content>
	</item>
		<item>
		<title>An equivalence between inverse sumset theorems and inverse conjectures for the U^3 norm</title>
		<link>http://terrytao.wordpress.com/2009/06/20/an-equivalence-between-inverse-sumset-theorems-and-inverse-conjectures-for-the-u3-norm/</link>
		<comments>http://terrytao.wordpress.com/2009/06/20/an-equivalence-between-inverse-sumset-theorems-and-inverse-conjectures-for-the-u3-norm/#comments</comments>
		<pubDate>Sat, 20 Jun 2009 17:54:48 +0000</pubDate>
		<dc:creator>Terence Tao</dc:creator>
				<category><![CDATA[math.CO]]></category>
		<category><![CDATA[paper]]></category>
		<category><![CDATA[Ben Green]]></category>
		<category><![CDATA[Freiman's theorem]]></category>
		<category><![CDATA[Gowers uniformity norm]]></category>
		<category><![CDATA[inverse conjecture]]></category>
		<category><![CDATA[inverse theorems]]></category>

		<guid isPermaLink="false">http://terrytao.wordpress.com/?p=2317</guid>
		<description><![CDATA[For a number of reasons, including the start of the summer break for me and my coauthors, a number of papers that we have been working on are being released this week.  For instance, Ben Green and I have just uploaded to the arXiv our paper &#8220;An equivalence between inverse sumset theorems and inverse conjectures [...]<img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=terrytao.wordpress.com&blog=817149&post=2317&subd=terrytao&ref=&feed=1" />]]></description>
			<content:encoded><![CDATA[<div class='snap_preview'><br /><p>For a number of reasons, including the start of the summer break for me and my coauthors, a number of papers that we have been working on are being released this week.  For instance, <a href="http://www.dpmms.cam.ac.uk/~bjg23/">Ben Green</a> and I have just uploaded to the arXiv our paper &#8220;<a href="http://arxiv.org/abs/0906.3100">An equivalence between inverse sumset theorems and inverse conjectures for the <img src='http://s3.wordpress.com/latex.php?latex=U%5E3&#038;bg=ffffff&#038;fg=545454&#038;s=0' alt='U^3' title='U^3' class='latex' /> norm</a>&#8220;, submitted to <a href="http://journals.cambridge.org/action/displayJournal?jid=PSP">Math. Proc. Camb. Phil. Soc.</a>.  The main result of this short paper (which was briefly announced in <a href="http://terrytao.wordpress.com/2008/11/09/a-counterexample-to-a-strong-polynomial-freiman-ruzsa-conjecture/">this earlier post</a>) is a connection between two types of inverse theorems in additive combinatorics, namely the <em>inverse sumset theorems</em> of Freiman type, and <em>inverse theorems for the Gowers uniformity norm</em>, and more specifically, for the <img src='http://s1.wordpress.com/latex.php?latex=U%5E3&#038;bg=ffffff&#038;fg=545454&#038;s=0' alt='U^3' title='U^3' class='latex' /> norm</p>
<p style="text-align:center;"><img src='http://s2.wordpress.com/latex.php?latex=%5C%7Cf%5C%7C_%7BU%5E3%28G%29%7D%5E8+%3A%3D+%7B%5CBbb+E%7D_%7Bx%2Ca%2Cb%2Cc+%5Cin+G%7D+f%28x%29+%5Coverline%7Bf%28x%2Ba%29%7D+%5Coverline%7Bf%28x%2Bb%29%7D+%5Coverline%7Bf%28x%2Bc%29%7D+f%28x%2Ba%2Bb%29+f%28x%2Ba%2Bc%29+f%28x%2Bb%2Bc%29+%5Coverline%7Bf%28x%2Ba%2Bb%2Bc%29%7D&#038;bg=ffffff&#038;fg=545454&#038;s=0' alt='\|f\|_{U^3(G)}^8 := {\Bbb E}_{x,a,b,c \in G} f(x) \overline{f(x+a)} \overline{f(x+b)} \overline{f(x+c)} f(x+a+b) f(x+a+c) f(x+b+c) \overline{f(x+a+b+c)}' title='\|f\|_{U^3(G)}^8 := {\Bbb E}_{x,a,b,c \in G} f(x) \overline{f(x+a)} \overline{f(x+b)} \overline{f(x+c)} f(x+a+b) f(x+a+c) f(x+b+c) \overline{f(x+a+b+c)}' class='latex' /></p>
<p>on finite additive group G, where <img src='http://s3.wordpress.com/latex.php?latex=f%3A+G+%5Cto+%7B%5CBbb+C%7D&#038;bg=ffffff&#038;fg=545454&#038;s=0' alt='f: G \to {\Bbb C}' title='f: G \to {\Bbb C}' class='latex' /> is a complex-valued function.</p>
<p>As usual, the connection is easiest to state in a finite field model such as <img src='http://s1.wordpress.com/latex.php?latex=G+%3D+%7B%5CBbb+F%7D_2%5En&#038;bg=ffffff&#038;fg=545454&#038;s=0' alt='G = {\Bbb F}_2^n' title='G = {\Bbb F}_2^n' class='latex' />.  In this case, we have the following inverse sumset theorem <a href="http://www.ams.org/mathscinet-getitem?mr=1701207">of Ruzsa</a>:</p>
<blockquote><p><strong>Theorem 1.</strong> If <img src='http://s2.wordpress.com/latex.php?latex=A+%5Csubset+%7B%5CBbb+F%7D_2%5En&#038;bg=ffffff&#038;fg=545454&#038;s=0' alt='A \subset {\Bbb F}_2^n' title='A \subset {\Bbb F}_2^n' class='latex' /> is such that <img src='http://s3.wordpress.com/latex.php?latex=%7CA%2BA%7C+%5Cleq+K%7CA%7C&#038;bg=ffffff&#038;fg=545454&#038;s=0' alt='|A+A| \leq K|A|' title='|A+A| \leq K|A|' class='latex' />, then A can be covered by a translate of a subspace of <img src='http://s1.wordpress.com/latex.php?latex=%7B%5CBbb+F%7D_2%5En&#038;bg=ffffff&#038;fg=545454&#038;s=0' alt='{\Bbb F}_2^n' title='{\Bbb F}_2^n' class='latex' /> of cardinality at most <img src='http://s2.wordpress.com/latex.php?latex=K%5E2+2%5E%7BK%5E4%7D+%7CA%7C&#038;bg=ffffff&#038;fg=545454&#038;s=0' alt='K^2 2^{K^4} |A|' title='K^2 2^{K^4} |A|' class='latex' />.</p></blockquote>
<p>The constant <img src='http://s3.wordpress.com/latex.php?latex=K%5E2+2%5E%7BK%5E4%7D&#038;bg=ffffff&#038;fg=545454&#038;s=0' alt='K^2 2^{K^4}' title='K^2 2^{K^4}' class='latex' /> has been improved for large <img src='http://s1.wordpress.com/latex.php?latex=K&#038;bg=ffffff&#038;fg=545454&#038;s=0' alt='K' title='K' class='latex' /> in a sequence of papers, from <img src='http://s2.wordpress.com/latex.php?latex=K+2%5E%7B%5Clfloor+K%5E3+%5Crfloor-1%7D&#038;bg=ffffff&#038;fg=545454&#038;s=0' alt='K 2^{\lfloor K^3 \rfloor-1}' title='K 2^{\lfloor K^3 \rfloor-1}' class='latex' /> <a href="http://www.ams.org/mathscinet-getitem?mr=2057683">by Ruzsa</a>, <img src='http://s3.wordpress.com/latex.php?latex=K%5E2+2%5E%7BK%5E2-2%7D&#038;bg=ffffff&#038;fg=545454&#038;s=0' alt='K^2 2^{K^2-2}' title='K^2 2^{K^2-2}' class='latex' /> <a href="http://www.ams.org/mathscinet-getitem?mr=2201602">by Green-Ruzsa</a>, <img src='http://s1.wordpress.com/latex.php?latex=2%5E%7BO%28K%5E%7B3%2F2%7D+%5Clog%281%2BK%29%7D&#038;bg=ffffff&#038;fg=545454&#038;s=0' alt='2^{O(K^{3/2} \log(1+K)}' title='2^{O(K^{3/2} \log(1+K)}' class='latex' /> <a href="http://www.ams.org/mathscinet-getitem?mr=2396355">by Sanders</a>, <img src='http://s2.wordpress.com/latex.php?latex=2%5E%7B2K%2BO%28%5Csqrt%7BK%7D+%5Clog+K%7D%29&#038;bg=ffffff&#038;fg=545454&#038;s=0' alt='2^{2K+O(\sqrt{K} \log K})' title='2^{2K+O(\sqrt{K} \log K})' class='latex' /> <a href="http://arxiv.org/abs/math.CO/0703668">by Green and myself</a>, and finally <img src='http://s3.wordpress.com/latex.php?latex=2%5E%7B2K%2BO%28%5Clog+K%29%7D&#038;bg=ffffff&#038;fg=545454&#038;s=0' alt='2^{2K+O(\log K)}' title='2^{2K+O(\log K)}' class='latex' /> by Konyagin (private communication) which is sharp except for the precise value of the O() implied constant (as can be seen by considering the example when A consists of about 2K independent elements).  However, it is conjectured that the polynomial loss can be removed entirely if one modifies the conclusion slightly:</p>
<blockquote>
<p style="text-align:left;"><strong>Conjecture 1. (Polynomial Freiman-Ruzsa conjecture for <img src='http://s1.wordpress.com/latex.php?latex=%7B%5CBbb+F%7D_2%5En&#038;bg=ffffff&#038;fg=545454&#038;s=0' alt='{\Bbb F}_2^n' title='{\Bbb F}_2^n' class='latex' />.) </strong>If <img src='http://s2.wordpress.com/latex.php?latex=A+%5Csubset+%7B%5CBbb+F%7D_2%5En&#038;bg=ffffff&#038;fg=545454&#038;s=0' alt='A \subset {\Bbb F}_2^n' title='A \subset {\Bbb F}_2^n' class='latex' /> is such that <img src='http://s3.wordpress.com/latex.php?latex=%7CA%2BA%7C+%5Cleq+K%7CA%7C&#038;bg=ffffff&#038;fg=545454&#038;s=0' alt='|A+A| \leq K|A|' title='|A+A| \leq K|A|' class='latex' />, then A can be covered by <img src='http://s1.wordpress.com/latex.php?latex=O%28K%5E%7BO%281%29%7D%29&#038;bg=ffffff&#038;fg=545454&#038;s=0' alt='O(K^{O(1)})' title='O(K^{O(1)})' class='latex' /> translates of subspaces of <img src='http://s2.wordpress.com/latex.php?latex=%7B%5CBbb+F%7D_2%5En&#038;bg=ffffff&#038;fg=545454&#038;s=0' alt='{\Bbb F}_2^n' title='{\Bbb F}_2^n' class='latex' /> of cardinality at most |A|.</p>
</blockquote>
<p style="text-align:left;">This conjecture was verified for downsets <a href="http://arxiv.org/abs/math.CO/0703668">by Green and myself</a>, but is open in general.   This conjecture has a number of equivalent formulations; see <a href="http://www.ams.org/mathscinet-getitem?mr=2187732">this paper of Green</a> for more discussion.  In <a href="http://terrytao.wordpress.com/2008/11/09/a-counterexample-to-a-strong-polynomial-freiman-ruzsa-conjecture/">this previous post</a> we show that a stronger version of this conjecture fails.</p>
<p style="text-align:left;">Meanwhile, for the Gowers norm, we have the following inverse theorem, <a href="http://www.cs.huji.ac.il/~salex/papers/low_degree.ps">due to Samorodnitsky</a>:</p>
<blockquote>
<p style="text-align:left;"><strong>Theorem 2. </strong> Let <img src='http://s3.wordpress.com/latex.php?latex=f%3A+%7B%5CBbb+F%7D_2%5En+%5Cto+%5B-1%2C1%5D&#038;bg=ffffff&#038;fg=545454&#038;s=0' alt='f: {\Bbb F}_2^n \to [-1,1]' title='f: {\Bbb F}_2^n \to [-1,1]' class='latex' /> be a function whose <img src='http://s1.wordpress.com/latex.php?latex=U%5E3&#038;bg=ffffff&#038;fg=545454&#038;s=0' alt='U^3' title='U^3' class='latex' /> norm is at least 1/K.  Then there exists a quadratic polynomial <img src='http://s2.wordpress.com/latex.php?latex=Q%3A+%7B%5CBbb+F%7D_2%5En+%5Cto+%7B%5CBbb+F%7D_2&#038;bg=ffffff&#038;fg=545454&#038;s=0' alt='Q: {\Bbb F}_2^n \to {\Bbb F}_2' title='Q: {\Bbb F}_2^n \to {\Bbb F}_2' class='latex' /> such that <img src='http://s3.wordpress.com/latex.php?latex=%7C%7B%5CBbb+E%7D_%7Bx+%5Cin+%7B%5CBbb+F%7D_2%5En%7D+f%28x%29+%28-1%29%5E%7BQ%28x%29%7D%7C+%5Cgeq+%5Cexp%28+-+O%28K%29%5E%7BO%281%29%7D+%29&#038;bg=ffffff&#038;fg=545454&#038;s=0' alt='|{\Bbb E}_{x \in {\Bbb F}_2^n} f(x) (-1)^{Q(x)}| \geq \exp( - O(K)^{O(1)} )' title='|{\Bbb E}_{x \in {\Bbb F}_2^n} f(x) (-1)^{Q(x)}| \geq \exp( - O(K)^{O(1)} )' class='latex' />.</p>
</blockquote>
<p style="text-align:left;">Note that the quadratic phases <img src='http://s1.wordpress.com/latex.php?latex=%28-1%29%5E%7BQ%28x%29%7D&#038;bg=ffffff&#038;fg=545454&#038;s=0' alt='(-1)^{Q(x)}' title='(-1)^{Q(x)}' class='latex' /> are the only functions taking values in [-1,1] whose <img src='http://s2.wordpress.com/latex.php?latex=U%5E3&#038;bg=ffffff&#038;fg=545454&#038;s=0' alt='U^3' title='U^3' class='latex' /> norm attains its maximal value of 1.</p>
<p style="text-align:left;">It is conjectured that the exponentially weak correlation here can be strengthened to a polynomial one:</p>
<blockquote>
<p style="text-align:left;"><strong>Conjecture 2. (Polynomial inverse conjecture for the <img src='http://s3.wordpress.com/latex.php?latex=U%5E3%28%7B%5CBbb+F%7D_2%5En%29&#038;bg=ffffff&#038;fg=545454&#038;s=0' alt='U^3({\Bbb F}_2^n)' title='U^3({\Bbb F}_2^n)' class='latex' /> norm).</strong> Let <img src='http://s1.wordpress.com/latex.php?latex=f%3A+%7B%5CBbb+F%7D_2%5En+%5Cto+%5B-1%2C1%5D&#038;bg=ffffff&#038;fg=545454&#038;s=0' alt='f: {\Bbb F}_2^n \to [-1,1]' title='f: {\Bbb F}_2^n \to [-1,1]' class='latex' /> be a function whose <img src='http://s2.wordpress.com/latex.php?latex=U%5E3&#038;bg=ffffff&#038;fg=545454&#038;s=0' alt='U^3' title='U^3' class='latex' /> norm is at least 1/K.  Then there exists a quadratic polynomial <img src='http://s3.wordpress.com/latex.php?latex=Q%3A+%7B%5CBbb+F%7D_2%5En+%5Cto+%7B%5CBbb+F%7D_2&#038;bg=ffffff&#038;fg=545454&#038;s=0' alt='Q: {\Bbb F}_2^n \to {\Bbb F}_2' title='Q: {\Bbb F}_2^n \to {\Bbb F}_2' class='latex' /> such that <img src='http://s1.wordpress.com/latex.php?latex=%7C%7B%5CBbb+E%7D_%7Bx+%5Cin+%7B%5CBbb+F%7D_2%5En%7D+f%28x%29+%28-1%29%5E%7BQ%28x%29%7D%7C+%5Cgeq+K%5E%7B-O%281%29%7D&#038;bg=ffffff&#038;fg=545454&#038;s=0' alt='|{\Bbb E}_{x \in {\Bbb F}_2^n} f(x) (-1)^{Q(x)}| \geq K^{-O(1)}' title='|{\Bbb E}_{x \in {\Bbb F}_2^n} f(x) (-1)^{Q(x)}| \geq K^{-O(1)}' class='latex' />.</p>
</blockquote>
<p style="text-align:left;">The first main result of this paper is</p>
<blockquote>
<p style="text-align:left;"><strong>Theorem 3.</strong> Conjecture 1 and Conjecture 2 are equivalent.</p>
</blockquote>
<p style="text-align:left;">This result was also independently observed by Shachar Lovett (private communication).  We also establish an analogous result for the cyclic group <img src='http://s2.wordpress.com/latex.php?latex=%7B%5CBbb+Z%7D%2FN%7B%5CBbb+Z%7D&#038;bg=ffffff&#038;fg=545454&#038;s=0' alt='{\Bbb Z}/N{\Bbb Z}' title='{\Bbb Z}/N{\Bbb Z}' class='latex' />, in which the notion of polynomial is replaced by that of a subexponential <img src='http://s3.wordpress.com/latex.php?latex=%5Cexp%28K%5E%7Bo%281%29%7D%29&#038;bg=ffffff&#038;fg=545454&#038;s=0' alt='\exp(K^{o(1)})' title='\exp(K^{o(1)})' class='latex' />, and in which the notion of a quadratic polynomial is replaced by a 2-step nilsequence; the precise statement is a bit technical and will not be given here.  We also observe a partial partial analogue of the correpsondence between inverse sumset theorems and Gowers norms in the higher order case, in particular observing that <img src='http://s1.wordpress.com/latex.php?latex=U%5E4&#038;bg=ffffff&#038;fg=545454&#038;s=0' alt='U^4' title='U^4' class='latex' /> inverse theorems imply a certain rigidity result for &#8220;Freiman-quadratic polynomials&#8221; (a quadratic version of Conjecture 3 below).</p>
<p style="text-align:left;">Below the fold, we sketch the proof of Theorem 3.</p>
<p style="text-align:left;"><span id="more-2317"></span></p>
<p style="text-align:left;">The deduction of Conjecture 2 from Conjecture 1 is already implicit in the <a href="http://www.cs.huji.ac.il/~salex/papers/low_degree.ps">work of Samorodnitsky</a>, so we focus here on the converse implication.  It is convenient to put Conjecture 1 in the following equivalent form:</p>
<blockquote>
<p style="text-align:left;"><strong>Conjecture 3. </strong> (Polynomial rigidity of Freiman homomorphisms) Let <img src='http://s2.wordpress.com/latex.php?latex=S+%5Csubset+%7B%5CBbb+F%7D_2%5Em&#038;bg=ffffff&#038;fg=545454&#038;s=0' alt='S \subset {\Bbb F}_2^m' title='S \subset {\Bbb F}_2^m' class='latex' /> have density <img src='http://s3.wordpress.com/latex.php?latex=%5Csigma&#038;bg=ffffff&#038;fg=545454&#038;s=0' alt='\sigma' title='\sigma' class='latex' />, and let <img src='http://s1.wordpress.com/latex.php?latex=%5Cphi%3A+S+%5Cto+%7B%5CBbb+F%7D_2%5EN&#038;bg=ffffff&#038;fg=545454&#038;s=0' alt='\phi: S \to {\Bbb F}_2^N' title='\phi: S \to {\Bbb F}_2^N' class='latex' /> be a <em>Freiman homomorphism</em>, thus <img src='http://s2.wordpress.com/latex.php?latex=%5Cphi%28x%29%2B%5Cphi%28y%29+%3D+%5Cphi%28z%29%2B%5Cphi%28w%29&#038;bg=ffffff&#038;fg=545454&#038;s=0' alt='\phi(x)+\phi(y) = \phi(z)+\phi(w)' title='\phi(x)+\phi(y) = \phi(z)+\phi(w)' class='latex' /> whenever <img src='http://s3.wordpress.com/latex.php?latex=x%2Cy%2Cz%2Cw+%5Cin+S&#038;bg=ffffff&#038;fg=545454&#038;s=0' alt='x,y,z,w \in S' title='x,y,z,w \in S' class='latex' /> are such that <img src='http://s1.wordpress.com/latex.php?latex=x%2By%3Dz%2Bw&#038;bg=ffffff&#038;fg=545454&#038;s=0' alt='x+y=z+w' title='x+y=z+w' class='latex' />.  Then there exists an affine-linear map <img src='http://s2.wordpress.com/latex.php?latex=%5Cpsi%3A+%7B%5CBbb+F%7D_2%5Em+%5Cto+%7B%5CBbb+F%7D_2%5EN&#038;bg=ffffff&#038;fg=545454&#038;s=0' alt='\psi: {\Bbb F}_2^m \to {\Bbb F}_2^N' title='\psi: {\Bbb F}_2^m \to {\Bbb F}_2^N' class='latex' /> which agrees with <img src='http://s3.wordpress.com/latex.php?latex=%5Cphi&#038;bg=ffffff&#038;fg=545454&#038;s=0' alt='\phi' title='\phi' class='latex' /> on at least <img src='http://s1.wordpress.com/latex.php?latex=%5Cgg+%5Csigma%5E%7B-O%281%29%7D+2%5Em&#038;bg=ffffff&#038;fg=545454&#038;s=0' alt='\gg \sigma^{-O(1)} 2^m' title='\gg \sigma^{-O(1)} 2^m' class='latex' /> elements of <img src='http://s2.wordpress.com/latex.php?latex=%7B%5CBbb+F%7D_2%5Em&#038;bg=ffffff&#038;fg=545454&#038;s=0' alt='{\Bbb F}_2^m' title='{\Bbb F}_2^m' class='latex' />.</p>
</blockquote>
<p style="text-align:left;">The implication of Conjecture 1 from Conjecture 3 was observed by Ruzsa; basically, the idea is to use the random projection trick (cf. <a href="http://terrytao.wordpress.com/2009/03/12/the-kakeya-set-and-maximal-conjectures-for-algebraic-varieties-over-finite-fields/">this post</a>) to view the set A in Conjecture 1 as a graph of a function <img src='http://s3.wordpress.com/latex.php?latex=%5Cphi&#038;bg=ffffff&#038;fg=545454&#038;s=0' alt='\phi' title='\phi' class='latex' /> over a base <img src='http://s1.wordpress.com/latex.php?latex=%7B%5CBbb+F%7D_2%5Em&#038;bg=ffffff&#038;fg=545454&#038;s=0' alt='{\Bbb F}_2^m' title='{\Bbb F}_2^m' class='latex' /> of cardinality roughly comparable to |A|.</p>
<p style="text-align:left;">To deduce Conjecture 3 from Conjecture 2, we encode the Freiman homomorphism <img src='http://s2.wordpress.com/latex.php?latex=%5Cphi&#038;bg=ffffff&#038;fg=545454&#038;s=0' alt='\phi' title='\phi' class='latex' /> as a function <img src='http://s3.wordpress.com/latex.php?latex=f%3A+%7B%5CBbb+F%7D_2%5Em+%5Ctimes+%7B%5CBbb+F%7D_2%5EN+%5Cto+%5C%7B-1%2C%2B1%5C%7D&#038;bg=ffffff&#038;fg=545454&#038;s=0' alt='f: {\Bbb F}_2^m \times {\Bbb F}_2^N \to \{-1,+1\}' title='f: {\Bbb F}_2^m \times {\Bbb F}_2^N \to \{-1,+1\}' class='latex' />, defined by the formula</p>
<p style="text-align:center;"><img src='http://s1.wordpress.com/latex.php?latex=f%28x%2Cy%29+%3A%3D+1_S%28x%29+%28-1%29%5E%7B%5Cphi%28x%29+%5Ccdot+y%7D%2C&#038;bg=ffffff&#038;fg=545454&#038;s=0' alt='f(x,y) := 1_S(x) (-1)^{\phi(x) \cdot y},' title='f(x,y) := 1_S(x) (-1)^{\phi(x) \cdot y},' class='latex' /></p>
<p style="text-align:left;">thus f is the vertical Fourier transform of the graph of <img src='http://s2.wordpress.com/latex.php?latex=%5Cphi&#038;bg=ffffff&#038;fg=545454&#038;s=0' alt='\phi' title='\phi' class='latex' />.  A Fourier-analytic computation using the Freiman homomorphism property of <img src='http://s3.wordpress.com/latex.php?latex=%5Cphi&#038;bg=ffffff&#038;fg=545454&#038;s=0' alt='\phi' title='\phi' class='latex' />and some standard applications of the Cauchy-Schwartz inequality shows that <img src='http://s1.wordpress.com/latex.php?latex=f&#038;bg=ffffff&#038;fg=545454&#038;s=0' alt='f' title='f' class='latex' /> has a large <img src='http://s2.wordpress.com/latex.php?latex=U%5E3&#038;bg=ffffff&#038;fg=545454&#038;s=0' alt='U^3' title='U^3' class='latex' /> norm (of size at least <img src='http://s3.wordpress.com/latex.php?latex=%5Csigma&#038;bg=ffffff&#038;fg=545454&#038;s=0' alt='\sigma' title='\sigma' class='latex' />, in fact).  Applying Conjecture 2, we conclude that f must polynomially correlate with some quadratic function <img src='http://s1.wordpress.com/latex.php?latex=%5CPsi%3A+%7B%5CBbb+F%7D_2%5Em+%5Ctimes+%7B%5CBbb+F%7D_2%5EN&#038;bg=ffffff&#038;fg=545454&#038;s=0' alt='\Psi: {\Bbb F}_2^m \times {\Bbb F}_2^N' title='\Psi: {\Bbb F}_2^m \times {\Bbb F}_2^N' class='latex' />.  Separating the x and y coordinates, this implies that for a polynomially dense set of values of x, the linear phase <img src='http://s2.wordpress.com/latex.php?latex=%28-1%29%5E%7B%5Cphi%28x%29+%5Ccdot+y%7D&#038;bg=ffffff&#038;fg=545454&#038;s=0' alt='(-1)^{\phi(x) \cdot y}' title='(-1)^{\phi(x) \cdot y}' class='latex' /> in y has polynomial correlation with a quadratic polynomial <img src='http://s3.wordpress.com/latex.php?latex=%28-1%29%5E%7B%5CPsi%280%2Cy%29+%2B+%5Cpsi%28x%29+%5Ccdot+y%7D&#038;bg=ffffff&#038;fg=545454&#038;s=0' alt='(-1)^{\Psi(0,y) + \psi(x) \cdot y}' title='(-1)^{\Psi(0,y) + \psi(x) \cdot y}' class='latex' />, where <img src='http://s1.wordpress.com/latex.php?latex=%5Cpsi%3A+%7B%5CBbb+F%7D_2%5Em+%5Cto+%7B%5CBbb+F%7D_2%5EN&#038;bg=ffffff&#038;fg=545454&#038;s=0' alt='\psi: {\Bbb F}_2^m \to {\Bbb F}_2^N' title='\psi: {\Bbb F}_2^m \to {\Bbb F}_2^N' class='latex' /> is affine-linear and <img src='http://s2.wordpress.com/latex.php?latex=%5CPsi&#038;bg=ffffff&#038;fg=545454&#038;s=0' alt='\Psi' title='\Psi' class='latex' /> is some quadratic polynomial on <img src='http://s3.wordpress.com/latex.php?latex=%7B%5CBbb+F%7D_2%5EN&#038;bg=ffffff&#038;fg=545454&#038;s=0' alt='{\Bbb F}_2^N' title='{\Bbb F}_2^N' class='latex' /> which is independent of x.  To put it another way, <img src='http://s1.wordpress.com/latex.php?latex=%28-1%29%5E%7B%5CPsi%280%2Cy%29%7D&#038;bg=ffffff&#038;fg=545454&#038;s=0' alt='(-1)^{\Psi(0,y)}' title='(-1)^{\Psi(0,y)}' class='latex' /> has a polynomially large Fourier coefficient at <img src='http://s2.wordpress.com/latex.php?latex=%5Cphi%28x%29-%5Cpsi%28x%29&#038;bg=ffffff&#038;fg=545454&#038;s=0' alt='\phi(x)-\psi(x)' title='\phi(x)-\psi(x)' class='latex' /> for many x.  But Plancherel&#8217;s theorem shows that there can only be polynomially many such large Fourier coefficients, so by the pigeonhole principle, <img src='http://s3.wordpress.com/latex.php?latex=%5Cphi%28x%29-%5Cpsi%28x%29&#038;bg=ffffff&#038;fg=545454&#038;s=0' alt='\phi(x)-\psi(x)' title='\phi(x)-\psi(x)' class='latex' /> is equal to a constant for a polynomially dense set of x, and the claim follows.</p>
  <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gocomments/terrytao.wordpress.com/2317/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/comments/terrytao.wordpress.com/2317/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/godelicious/terrytao.wordpress.com/2317/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/delicious/terrytao.wordpress.com/2317/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gostumble/terrytao.wordpress.com/2317/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/stumble/terrytao.wordpress.com/2317/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/godigg/terrytao.wordpress.com/2317/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/digg/terrytao.wordpress.com/2317/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/goreddit/terrytao.wordpress.com/2317/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/reddit/terrytao.wordpress.com/2317/" /></a> <img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=terrytao.wordpress.com&blog=817149&post=2317&subd=terrytao&ref=&feed=1" /></div>]]></content:encoded>
			<wfw:commentRss>http://terrytao.wordpress.com/2009/06/20/an-equivalence-between-inverse-sumset-theorems-and-inverse-conjectures-for-the-u3-norm/feed/</wfw:commentRss>
		<slash:comments>4</slash:comments>
	
		<media:content url="http://1.gravatar.com/avatar/3c795880f3b73784a9b75fbff3772701?s=96&#38;d=identicon" medium="image">
			<media:title type="html">Terry</media:title>
		</media:content>
	</item>
		<item>
		<title>Global regularity for a logarithmically supercritical hyperdissipative Navier-Stokes equation</title>
		<link>http://terrytao.wordpress.com/2009/06/19/global-regularity-for-a-logarithmically-supercritical-hyperdissipative-navier-stokes-equation/</link>
		<comments>http://terrytao.wordpress.com/2009/06/19/global-regularity-for-a-logarithmically-supercritical-hyperdissipative-navier-stokes-equation/#comments</comments>
		<pubDate>Sat, 20 Jun 2009 05:07:41 +0000</pubDate>
		<dc:creator>Terence Tao</dc:creator>
				<category><![CDATA[math.AP]]></category>
		<category><![CDATA[paper]]></category>
		<category><![CDATA[logarithmically supercritical equations]]></category>
		<category><![CDATA[Navier-Stokes equations]]></category>

		<guid isPermaLink="false">http://terrytao.wordpress.com/?p=2309</guid>
		<description><![CDATA[I&#8217;ve just uploaded to the arXiv my paper &#8220;Global regularity for a logarithmically supercritical hyperdissipative Navier-Stokes equation&#8220;, submitted to Analysis &#38; PDE.  It is a famous problem to establish the existence of global smooth solutions to the three-dimensional Navier-Stokes system of equations




given smooth, compactly supported, divergence-free initial data .
I do not claim to have any [...]<img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=terrytao.wordpress.com&blog=817149&post=2309&subd=terrytao&ref=&feed=1" />]]></description>
			<content:encoded><![CDATA[<div class='snap_preview'><br /><p>I&#8217;ve just uploaded to the <a href="http://www.arxiv.org">arXiv</a> my paper &#8220;<a href="http://arxiv.org/abs/0906.3070">Global regularity for a logarithmically supercritical hyperdissipative Navier-Stokes equation</a>&#8220;, submitted to <a href="http://pjm.math.berkeley.edu/apde/about/cover/cover.html">Analysis &amp; PDE</a>.  It is a <a href="http://en.wikipedia.org/wiki/Navier%E2%80%93Stokes_existence_and_smoothness">famous problem</a> to establish the existence of global smooth solutions to the three-dimensional <a href="http://en.wikipedia.org/wiki/Navier%E2%80%93Stokes_equations">Navier-Stokes</a> system of equations</p>
<p style="text-align:center;"><img src='http://s2.wordpress.com/latex.php?latex=%5Cpartial_t+u+%2B+%28u+%5Ccdot+%5Cnabla%29+u+%3D+%5CDelta+u+-+%5Cnabla+p&#038;bg=ffffff&#038;fg=545454&#038;s=0' alt='\partial_t u + (u \cdot \nabla) u = \Delta u - \nabla p' title='\partial_t u + (u \cdot \nabla) u = \Delta u - \nabla p' class='latex' /><br />
<img src='http://s3.wordpress.com/latex.php?latex=%5Cnabla+%5Ccdot+u+%3D+0&#038;bg=ffffff&#038;fg=545454&#038;s=0' alt='\nabla \cdot u = 0' title='\nabla \cdot u = 0' class='latex' /><br />
<img src='http://s1.wordpress.com/latex.php?latex=u%280%2Cx%29+%3D+u_0%28x%29&#038;bg=ffffff&#038;fg=545454&#038;s=0' alt='u(0,x) = u_0(x)' title='u(0,x) = u_0(x)' class='latex' />
</p>
<p style="text-align:left;">given smooth, compactly supported, divergence-free initial data <img src='http://s2.wordpress.com/latex.php?latex=u_0%3A+%7B%5CBbb+R%7D%5E3+%5Cto+%7B%5CBbb+R%7D%5E3&#038;bg=ffffff&#038;fg=545454&#038;s=0' alt='u_0: {\Bbb R}^3 \to {\Bbb R}^3' title='u_0: {\Bbb R}^3 \to {\Bbb R}^3' class='latex' />.</p>
<p style="text-align:left;">I do <strong>not</strong> claim to have any substantial progress on this problem here.  Instead, the paper makes a small observation about the hyper-dissipative version of the Navier-Stokes equations, namely</p>
<p style="text-align:center;"><img src='http://s3.wordpress.com/latex.php?latex=%5Cpartial_t+u+%2B+%28u+%5Ccdot+%5Cnabla%29+u+%3D+-+%7C%5Cnabla%7C%5E%7B2%5Calpha%7D+u+-+%5Cnabla+p&#038;bg=ffffff&#038;fg=545454&#038;s=0' alt='\partial_t u + (u \cdot \nabla) u = - |\nabla|^{2\alpha} u - \nabla p' title='\partial_t u + (u \cdot \nabla) u = - |\nabla|^{2\alpha} u - \nabla p' class='latex' /><br />
<img src='http://s1.wordpress.com/latex.php?latex=%5Cnabla+%5Ccdot+u+%3D+0&#038;bg=ffffff&#038;fg=545454&#038;s=0' alt='\nabla \cdot u = 0' title='\nabla \cdot u = 0' class='latex' /><br />
<img src='http://s2.wordpress.com/latex.php?latex=u%280%2Cx%29+%3D+u_0%28x%29&#038;bg=ffffff&#038;fg=545454&#038;s=0' alt='u(0,x) = u_0(x)' title='u(0,x) = u_0(x)' class='latex' />
</p>
<p style="text-align:left;">for some <img src='http://s3.wordpress.com/latex.php?latex=%5Calpha+%3E+1&#038;bg=ffffff&#038;fg=545454&#038;s=0' alt='\alpha &gt; 1' title='\alpha &gt; 1' class='latex' />.  It is a folklore result that global regularity for this equation holds for <img src='http://s1.wordpress.com/latex.php?latex=%5Calpha+%5Cgeq+5%2F4&#038;bg=ffffff&#038;fg=545454&#038;s=0' alt='\alpha \geq 5/4' title='\alpha \geq 5/4' class='latex' />; the significance of the exponent <img src='http://s2.wordpress.com/latex.php?latex=5%2F4&#038;bg=ffffff&#038;fg=545454&#038;s=0' alt='5/4' title='5/4' class='latex' /> is that it is energy-critical, in the sense that the scaling which preserves this particular hyper-dissipative Navier-Stokes equation, also preserves the energy.</p>
<p style="text-align:left;">Values of <img src='http://s3.wordpress.com/latex.php?latex=%5Calpha&#038;bg=ffffff&#038;fg=545454&#038;s=0' alt='\alpha' title='\alpha' class='latex' /> below <img src='http://s1.wordpress.com/latex.php?latex=5%2F4&#038;bg=ffffff&#038;fg=545454&#038;s=0' alt='5/4' title='5/4' class='latex' /> (including, unfortunately, the case <img src='http://s2.wordpress.com/latex.php?latex=%5Calpha%3D1&#038;bg=ffffff&#038;fg=545454&#038;s=0' alt='\alpha=1' title='\alpha=1' class='latex' />, which is the original Navier-Stokes equation) are supercritical and thus establishing global regularity beyond the reach of most known methods (see <a href="http://terrytao.wordpress.com/2007/03/18/why-global-regularity-for-navier-stokes-is-hard/">my earlier blog post</a> for more discussion).</p>
<p style="text-align:left;">A few years ago, <a href="http://arxiv.org/abs/math.AP/0606145">I observed</a> (in the case of the spherically symmetric wave equation) that this &#8220;criticality barrier&#8221; had a very small amount of flexibility to it, in that one could push a critical argument to a slightly supercritical one by exploiting spacetime integral estimates a little bit more.  I realised recently that the same principle applied to hyperdissipative Navier-Stokes; here, the relevant spacetime integral estimate is the energy dissipation inequality</p>
<p style="text-align:center;"><img src='http://s3.wordpress.com/latex.php?latex=%5Cint_0%5ET+%5Cint_%7B%7B%5CBbb+R%7D%5Ed%7D+%7C+%7C%5Cnabla%7C%5E%5Calpha+u%28t%2Cx%29%7C%5E2%5C+dx+dt+%5Cleq+%5Cfrac%7B1%7D%7B2%7D+%5Cint_%7B%7B%5CBbb+R%7D%5Ed%7D+%7Cu_0%28x%29%7C%5E2%5C+dx&#038;bg=ffffff&#038;fg=545454&#038;s=0' alt='\int_0^T \int_{{\Bbb R}^d} | |\nabla|^\alpha u(t,x)|^2\ dx dt \leq \frac{1}{2} \int_{{\Bbb R}^d} |u_0(x)|^2\ dx' title='\int_0^T \int_{{\Bbb R}^d} | |\nabla|^\alpha u(t,x)|^2\ dx dt \leq \frac{1}{2} \int_{{\Bbb R}^d} |u_0(x)|^2\ dx' class='latex' /></p>
<p style="text-align:left;">which ensures that the energy dissipation <img src='http://s1.wordpress.com/latex.php?latex=a%28t%29+%3A%3D+%5Cint_%7B%7B%5CBbb+R%7D%5Ed%7D+%7C+%7C%5Cnabla%7C%5E%5Calpha+u%28t%2Cx%29%7C%5E2%5C+dx&#038;bg=ffffff&#038;fg=545454&#038;s=0' alt='a(t) := \int_{{\Bbb R}^d} | |\nabla|^\alpha u(t,x)|^2\ dx' title='a(t) := \int_{{\Bbb R}^d} | |\nabla|^\alpha u(t,x)|^2\ dx' class='latex' /> is locally integrable (and in fact globally integrable) in time.</p>
<p style="text-align:left;">In this paper I push the global regularity results by a fraction of a logarithm from <img src='http://s2.wordpress.com/latex.php?latex=%5Calpha%3D5%2F4&#038;bg=ffffff&#038;fg=545454&#038;s=0' alt='\alpha=5/4' title='\alpha=5/4' class='latex' /> towards <img src='http://s3.wordpress.com/latex.php?latex=%5Calpha%3D1&#038;bg=ffffff&#038;fg=545454&#038;s=0' alt='\alpha=1' title='\alpha=1' class='latex' />.  For instance, the argument shows that the logarithmically supercritical equation</p>
<p style="text-align:center;"><img src='http://s1.wordpress.com/latex.php?latex=%5Cpartial_t+u+%2B+%28u+%5Ccdot+%5Cnabla%29+u+%3D+-+%5Cfrac%7B%7C%5Cnabla%7C%5E%7B5%2F2%7D%7D%7B%5Clog%5E%7B1%2F2%7D%282-%5CDelta%29%7D+u+-+%5Cnabla+p&#038;bg=ffffff&#038;fg=545454&#038;s=0' alt='\partial_t u + (u \cdot \nabla) u = - \frac{|\nabla|^{5/2}}{\log^{1/2}(2-\Delta)} u - \nabla p' title='\partial_t u + (u \cdot \nabla) u = - \frac{|\nabla|^{5/2}}{\log^{1/2}(2-\Delta)} u - \nabla p' class='latex' /> (0)<br />
<img src='http://s2.wordpress.com/latex.php?latex=%5Cnabla+%5Ccdot+u+%3D+0&#038;bg=ffffff&#038;fg=545454&#038;s=0' alt='\nabla \cdot u = 0' title='\nabla \cdot u = 0' class='latex' /><br />
<img src='http://s3.wordpress.com/latex.php?latex=u%280%2Cx%29+%3D+u_0%28x%29&#038;bg=ffffff&#038;fg=545454&#038;s=0' alt='u(0,x) = u_0(x)' title='u(0,x) = u_0(x)' class='latex' />
</p>
<p style="text-align:left;">admits global smooth solutions.</p>
<p style="text-align:left;">The argument is in fact quite simple (the paper is seven pages in length), and relies on known technology; one just applies the energy method and a logarithmically modified Sobolev inequality in the spirit of a well-known inequality <a href="http://www.ams.org/mathscinet-getitem?mr=579997">of Brezis and Wainger</a>.  It looks like it will take quite a bit of effort though to improve the logarithmic factor much further.</p>
<p style="text-align:left;">One way to explain the tiny bit of wiggle room beyond the critical case is as follows.  The standard energy method approach to the critical Navier-Stokes equation relies at one stage on <a href="http://en.wikipedia.org/wiki/Gronwall%27s_inequality">Gronwall&#8217;s inequality</a>, which among other things asserts that if a time-dependent non-negative quantity E(t) obeys the differential inequality</p>
<p style="text-align:center;"><img src='http://s1.wordpress.com/latex.php?latex=%5Cpartial_t+E%28t%29+%5Cleq+a%28t%29+E%28t%29&#038;bg=ffffff&#038;fg=545454&#038;s=0' alt='\partial_t E(t) \leq a(t) E(t)' title='\partial_t E(t) \leq a(t) E(t)' class='latex' /> (1)</p>
<p style="text-align:left;">and <img src='http://s2.wordpress.com/latex.php?latex=a%28t%29&#038;bg=ffffff&#038;fg=545454&#038;s=0' alt='a(t)' title='a(t)' class='latex' /> was locally integrable, then E does not blow up in time; in fact, one has the inequality</p>
<p style="text-align:center;"><img src='http://s3.wordpress.com/latex.php?latex=E%28t%29+%5Cleq+E%280%29+%5Cexp%28+%5Cint_0%5Et+a%28s%29%5C+ds+%29&#038;bg=ffffff&#038;fg=545454&#038;s=0' alt='E(t) \leq E(0) \exp( \int_0^t a(s)\ ds )' title='E(t) \leq E(0) \exp( \int_0^t a(s)\ ds )' class='latex' />.</p>
<p style="text-align:left;">A slight modification of the argument shows that one can replace the linear inequality with a slightly superlinear inequality.  For instance, the differential inequality</p>
<p style="text-align:center;"><img src='http://s1.wordpress.com/latex.php?latex=%5Cpartial_t+E%28t%29+%5Cleq+a%28t%29+E%28t%29+%5Clog+E%28t%29&#038;bg=ffffff&#038;fg=545454&#038;s=0' alt='\partial_t E(t) \leq a(t) E(t) \log E(t)' title='\partial_t E(t) \leq a(t) E(t) \log E(t)' class='latex' /> (2)</p>
<p style="text-align:left;">also does not blow up in time; indeed, a <a href="http://en.wikipedia.org/wiki/Separation_of_variables">separation of variables</a> argument gives the explicit double-exponential bound</p>
<p style="text-align:center;"><img src='http://s2.wordpress.com/latex.php?latex=E%28t%29+%5Cleq+%5Cexp%28%5Cexp%28+%5Cint_0%5Et+a%28s%29%5C+ds+%2B+%5Clog+%5Clog+E%280%29+%29%29&#038;bg=ffffff&#038;fg=545454&#038;s=0' alt='E(t) \leq \exp(\exp( \int_0^t a(s)\ ds + \log \log E(0) ))' title='E(t) \leq \exp(\exp( \int_0^t a(s)\ ds + \log \log E(0) ))' class='latex' /></p>
<p style="text-align:left;">(let&#8217;s take <img src='http://s3.wordpress.com/latex.php?latex=E%280%29+%3E+1&#038;bg=ffffff&#038;fg=545454&#038;s=0' alt='E(0) &gt; 1' title='E(0) &gt; 1' class='latex' /> and all functions smooth, to avoid technicalities).  It is this ability to go beyond Gronwall&#8217;s inequality by a little bit which is really at the heart of the logarithmically supercritical phenomenon.  In the paper, I establish an inequality basically of the shape (2), where <img src='http://s1.wordpress.com/latex.php?latex=E%28t%29&#038;bg=ffffff&#038;fg=545454&#038;s=0' alt='E(t)' title='E(t)' class='latex' /> is a suitably high-regularity Sobolev norm of <img src='http://s2.wordpress.com/latex.php?latex=u%28t%29&#038;bg=ffffff&#038;fg=545454&#038;s=0' alt='u(t)' title='u(t)' class='latex' />, and <img src='http://s3.wordpress.com/latex.php?latex=a%28t%29&#038;bg=ffffff&#038;fg=545454&#038;s=0' alt='a(t)' title='a(t)' class='latex' /> is basically the energy dissipation mentioned earlier.  The point is that the logarithmic loss of <img src='http://s1.wordpress.com/latex.php?latex=%5Clog%281+-+%5CDelta%29%5E%7B1%2F4%7D&#038;bg=ffffff&#038;fg=545454&#038;s=0' alt='\log(1 - \Delta)^{1/4}' title='\log(1 - \Delta)^{1/4}' class='latex' /> in the dissipation can eventually be converted (by a Brezis-Wainger type argument) to a logarithmic loss in the high-regularity energy, as this energy can serve as a proxy for the frequency <img src='http://s2.wordpress.com/latex.php?latex=%7C%5Cxi%7C&#038;bg=ffffff&#038;fg=545454&#038;s=0' alt='|\xi|' title='|\xi|' class='latex' />, which in turn serves as a proxy for the Laplacian <img src='http://s3.wordpress.com/latex.php?latex=-%5CDelta&#038;bg=ffffff&#038;fg=545454&#038;s=0' alt='-\Delta' title='-\Delta' class='latex' />.</p>
<p style="text-align:left;">To put it another way, with a linear exponential growth model, such as <img src='http://s1.wordpress.com/latex.php?latex=%5Cpartial_t+E%28t%29+%3D+C+E%28t%29&#038;bg=ffffff&#038;fg=545454&#038;s=0' alt='\partial_t E(t) = C E(t)' title='\partial_t E(t) = C E(t)' class='latex' />, it takes a constant amount of time for E to double, and so E never becomes infinite in finite time.  With an equation such as <img src='http://s2.wordpress.com/latex.php?latex=%5Cpartial_t+E%28t%29+%3D+C+E%28t%29+%5Clog+E%28t%29&#038;bg=ffffff&#038;fg=545454&#038;s=0' alt='\partial_t E(t) = C E(t) \log E(t)' title='\partial_t E(t) = C E(t) \log E(t)' class='latex' />, the time taken for E to double from (say) <img src='http://s3.wordpress.com/latex.php?latex=2%5En&#038;bg=ffffff&#038;fg=545454&#038;s=0' alt='2^n' title='2^n' class='latex' /> to <img src='http://s1.wordpress.com/latex.php?latex=2%5E%7Bn%2B1%7D&#038;bg=ffffff&#038;fg=545454&#038;s=0' alt='2^{n+1}' title='2^{n+1}' class='latex' /> now shrinks to zero, but only as quickly as the harmonic series <img src='http://s2.wordpress.com/latex.php?latex=1%2Fn&#038;bg=ffffff&#038;fg=545454&#038;s=0' alt='1/n' title='1/n' class='latex' />, so it still takes an infinite amount of time for E to blow up.  But because the divergence of <img src='http://s3.wordpress.com/latex.php?latex=%5Csum_n+1%2Fn&#038;bg=ffffff&#038;fg=545454&#038;s=0' alt='\sum_n 1/n' title='\sum_n 1/n' class='latex' /> is logarithmically slow, the growth of E is now a double exponential rather than a single one.  So there is a little bit of room to exploit between exponential growth and blowup.</p>
<p style="text-align:left;">Interestingly, there is a heuristic argument that suggests that the half-logarithmic loss in (0) can be widened to a full logarithmic loss, which I give below the fold.</p>
<p style="text-align:left;"><span id="more-2309"></span></p>
<p style="text-align:left;">Suppose the solution to (0) (with a full power of the logarithm) blows up at some finite time <img src='http://s1.wordpress.com/latex.php?latex=T_%2A&#038;bg=ffffff&#038;fg=545454&#038;s=0' alt='T_*' title='T_*' class='latex' />.  We make the (somewhat improbable) ansatz that all the energy concentrates to a point (e.g. the origin) at this blowup time, thus for each <img src='http://s2.wordpress.com/latex.php?latex=0+%3C+t+%3C+T_%2A&#038;bg=ffffff&#038;fg=545454&#038;s=0' alt='0 &lt; t &lt; T_*' title='0 &lt; t &lt; T_*' class='latex' /> we assume that u is concentrated in a ball of radius <img src='http://s3.wordpress.com/latex.php?latex=1%2FN%28t%29&#038;bg=ffffff&#038;fg=545454&#038;s=0' alt='1/N(t)' title='1/N(t)' class='latex' />, where <img src='http://s1.wordpress.com/latex.php?latex=N%28t%29&#038;bg=ffffff&#038;fg=545454&#038;s=0' alt='N(t)' title='N(t)' class='latex' /> is a frequency scale that goes to infinity as <img src='http://s2.wordpress.com/latex.php?latex=t+%5Cto+T_%2A&#038;bg=ffffff&#038;fg=545454&#038;s=0' alt='t \to T_*' title='t \to T_*' class='latex' />.  (This type of concentration is, heuristically, the &#8220;worst case&#8221; for any argument involving Sobolev embedding, which the arguments in my paper certainly rely on.)  As the total energy is bounded, and the ball has volume <img src='http://s3.wordpress.com/latex.php?latex=O%28+1%2FN%28t%29%5E3%29&#038;bg=ffffff&#038;fg=545454&#038;s=0' alt='O( 1/N(t)^3)' title='O( 1/N(t)^3)' class='latex' />, this suggests that u(t,x) should have magnitude <img src='http://s1.wordpress.com/latex.php?latex=O%28N%28t%29%5E%7B3%2F2%7D%29&#038;bg=ffffff&#038;fg=545454&#038;s=0' alt='O(N(t)^{3/2})' title='O(N(t)^{3/2})' class='latex' /> at time t.  In particular, from the convection term <img src='http://s2.wordpress.com/latex.php?latex=%28u+%5Ccdot+%5Cnabla%29+u&#038;bg=ffffff&#038;fg=545454&#038;s=0' alt='(u \cdot \nabla) u' title='(u \cdot \nabla) u' class='latex' /> in (0) this suggests that u propagates at speed <img src='http://s3.wordpress.com/latex.php?latex=O%28N%28t%29%5E%7B3%2F2%7D%29&#038;bg=ffffff&#038;fg=545454&#038;s=0' alt='O(N(t)^{3/2})' title='O(N(t)^{3/2})' class='latex' />.  In particular, the radius <img src='http://s1.wordpress.com/latex.php?latex=1%2FN%28t%29&#038;bg=ffffff&#038;fg=545454&#038;s=0' alt='1/N(t)' title='1/N(t)' class='latex' /> should obey the ODE</p>
<p style="text-align:center;"><img src='http://s2.wordpress.com/latex.php?latex=%5Cfrac%7Bd%7D%7Bdt%7D+%5Cfrac%7B1%7D%7BN%28t%29%7D+%3D+O%28+N%28t%29%5E%7B3%2F2%7D+%29+&#038;bg=ffffff&#038;fg=545454&#038;s=0' alt='\frac{d}{dt} \frac{1}{N(t)} = O( N(t)^{3/2} ) ' title='\frac{d}{dt} \frac{1}{N(t)} = O( N(t)^{3/2} ) ' class='latex' /></p>
<p style="text-align:left;">which after solving this ODE suggests that N(t) needs to blow up at the rate <img src='http://s3.wordpress.com/latex.php?latex=%28T_%2A-t%29%5E%7B-2%2F5%7D&#038;bg=ffffff&#038;fg=545454&#038;s=0' alt='(T_*-t)^{-2/5}' title='(T_*-t)^{-2/5}' class='latex' /> or faster: <img src='http://s1.wordpress.com/latex.php?latex=N%28t%29+%5Cgg+%28T_%2A-t%29%5E%7B-2%2F5%7D&#038;bg=ffffff&#038;fg=545454&#038;s=0' alt='N(t) \gg (T_*-t)^{-2/5}' title='N(t) \gg (T_*-t)^{-2/5}' class='latex' />.  On the other hand, the energy identity for (0) (with a full power of the logarithm) implies that</p>
<p style="text-align:center;"><img src='http://s2.wordpress.com/latex.php?latex=%5Cint_0%5E%7BT_%2A%7D+%5Cint_%7B%7B%5CBbb+R%7D%5E3%7D+%5Cfrac%7B%7C%5Cnabla%7C%5E%7B5%2F2%7D%7D%7B%5Clog%282-%5CDelta%29%7D+u+%5Ccdot+u%5C+dx+dt+%3C+%5Cinfty&#038;bg=ffffff&#038;fg=545454&#038;s=0' alt='\int_0^{T_*} \int_{{\Bbb R}^3} \frac{|\nabla|^{5/2}}{\log(2-\Delta)} u \cdot u\ dx dt &lt; \infty' title='\int_0^{T_*} \int_{{\Bbb R}^3} \frac{|\nabla|^{5/2}}{\log(2-\Delta)} u \cdot u\ dx dt &lt; \infty' class='latex' />;</p>
<p style="text-align:left;">heuristically substituting in our ansatz, this suggests</p>
<p style="text-align:center;"><img src='http://s3.wordpress.com/latex.php?latex=%5Cint_0%5E%7BT_%2A%7D+%5Cfrac%7BN%28t%29%5E%7B5%2F2%7D%7D%7B%5Clog%282+%2BN%28t%29%29%7D+dt+%3C+%5Cinfty&#038;bg=ffffff&#038;fg=545454&#038;s=0' alt='\int_0^{T_*} \frac{N(t)^{5/2}}{\log(2 +N(t))} dt &lt; \infty' title='\int_0^{T_*} \frac{N(t)^{5/2}}{\log(2 +N(t))} dt &lt; \infty' class='latex' /></p>
<p style="text-align:left;">but this is incompatible with the blowup rate <img src='http://s1.wordpress.com/latex.php?latex=N%28t%29+%5Cgg+%28T_%2A-t%29%5E%7B-2%2F5%7D&#038;bg=ffffff&#038;fg=545454&#038;s=0' alt='N(t) \gg (T_*-t)^{-2/5}' title='N(t) \gg (T_*-t)^{-2/5}' class='latex' /> because <img src='http://s2.wordpress.com/latex.php?latex=%5Cint_0%5E%5Cinfty+%5Cfrac%7Bds%7D%7Bs+%5Clog+s%7D&#038;bg=ffffff&#038;fg=545454&#038;s=0' alt='\int_0^\infty \frac{ds}{s \log s}' title='\int_0^\infty \frac{ds}{s \log s}' class='latex' /> is (barely) divergent at infinity.  Unfortunately, I do not know how to make this non-rigorous argument precise without taking on some unwanted logarithmic losses, but it may well be feasible to do so; readers are welcome to try, of course :-).</p>
  <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gocomments/terrytao.wordpress.com/2309/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/comments/terrytao.wordpress.com/2309/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/godelicious/terrytao.wordpress.com/2309/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/delicious/terrytao.wordpress.com/2309/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gostumble/terrytao.wordpress.com/2309/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/stumble/terrytao.wordpress.com/2309/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/godigg/terrytao.wordpress.com/2309/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/digg/terrytao.wordpress.com/2309/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/goreddit/terrytao.wordpress.com/2309/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/reddit/terrytao.wordpress.com/2309/" /></a> <img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=terrytao.wordpress.com&blog=817149&post=2309&subd=terrytao&ref=&feed=1" /></div>]]></content:encoded>
			<wfw:commentRss>http://terrytao.wordpress.com/2009/06/19/global-regularity-for-a-logarithmically-supercritical-hyperdissipative-navier-stokes-equation/feed/</wfw:commentRss>
		<slash:comments>17</slash:comments>
	
		<media:content url="http://1.gravatar.com/avatar/3c795880f3b73784a9b75fbff3772701?s=96&#38;d=identicon" medium="image">
			<media:title type="html">Terry</media:title>
		</media:content>
	</item>
		<item>
		<title>Global regularity of wave maps VI.  Abstract theory of minimal-energy blowup solutions</title>
		<link>http://terrytao.wordpress.com/2009/06/17/global-regularity-of-wave-maps-vi-abstract-theory-of-minimal-energy-blowup-solutions/</link>
		<comments>http://terrytao.wordpress.com/2009/06/17/global-regularity-of-wave-maps-vi-abstract-theory-of-minimal-energy-blowup-solutions/#comments</comments>
		<pubDate>Thu, 18 Jun 2009 01:48:55 +0000</pubDate>
		<dc:creator>Terence Tao</dc:creator>
				<category><![CDATA[math.AP]]></category>
		<category><![CDATA[paper]]></category>
		<category><![CDATA[minimal-energy blowup solutions]]></category>
		<category><![CDATA[project heatwave]]></category>
		<category><![CDATA[wave maps]]></category>

		<guid isPermaLink="false">http://terrytao.wordpress.com/?p=2283</guid>
		<description><![CDATA[I&#8217;ve just uploaded to the arXiv my paper &#8220;Global regularity of wave maps VI.  Abstract theory of minimal-energy blowup solutions&#8220;, to be submitted with the rest of the &#8220;heatwave&#8221; project to establish global regularity (and scattering) for energy-critical wave maps into hyperbolic space.  Initially, this paper was intended to cap off the project by showing [...]<img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=terrytao.wordpress.com&blog=817149&post=2283&subd=terrytao&ref=&feed=1" />]]></description>
			<content:encoded><![CDATA[<div class='snap_preview'><br /><p>I&#8217;ve just uploaded to the <a href="http://www.arxiv.org">arXiv</a> my paper &#8220;<a href="http://arxiv.org/abs/0906.2833">Global regularity of wave maps VI.  Abstract theory of minimal-energy blowup solutions</a>&#8220;, to be submitted with the rest of the &#8220;heatwave&#8221; project to establish global regularity (and scattering) for energy-critical wave maps into hyperbolic space.  Initially, this paper was intended to cap off the project by showing that if global regularity failed, then a special <em>minimal energy blowup solution</em> must exist, which enjoys a certain <em>almost periodicity</em> property modulo the symmetries of the equation.  However, the argument was more technical than I anticipated, and so I am splitting the paper into a relatively short high-level paper (this one) that reduces the problem to four smaller propositions, and a much longer technical paper which establishes those propositions, by developing a substantial amount of perturbation theory for wave maps.  I am pretty sure though that this process will not iterate any further, and paper VII will be my final paper in this series (and which I hope to finish by the end of this summer).  It is also worth noting that a number of papers establishing similar results (though with slightly different hypotheses and conclusions) will shortly appear by Sterbenz-Tataru and Krieger-Schlag.</p>
<p>Almost periodic minimal energy blowup solutions have been constructed for a variety of critical equations, such as the nonlinear Schrodinger equation (NLS) and the nonlinear wave equation (NLW).  The formal definition of almost periodicity is that the orbit of the solution <img src='http://s1.wordpress.com/latex.php?latex=u&#038;bg=ffffff&#038;fg=545454&#038;s=0' alt='u' title='u' class='latex' /> stays in a precompact subset of the energy space once one quotients out by the non-compact symmetries of the equation (namely, translation and dilation).   Another (more informal) way of saying this is that for every time <img src='http://s2.wordpress.com/latex.php?latex=t&#038;bg=ffffff&#038;fg=545454&#038;s=0' alt='t' title='t' class='latex' />, there exists a position <img src='http://s3.wordpress.com/latex.php?latex=x%28t%29&#038;bg=ffffff&#038;fg=545454&#038;s=0' alt='x(t)' title='x(t)' class='latex' /> and a frequency <img src='http://s1.wordpress.com/latex.php?latex=N%28t%29&#038;bg=ffffff&#038;fg=545454&#038;s=0' alt='N(t)' title='N(t)' class='latex' /> such that the solution <img src='http://s2.wordpress.com/latex.php?latex=u%28t%29&#038;bg=ffffff&#038;fg=545454&#038;s=0' alt='u(t)' title='u(t)' class='latex' /> is localised in space in the region <img src='http://s3.wordpress.com/latex.php?latex=%5C%7B+x%3A+x+%3D+x%28t%29+%2B+O%28N%28t%29%5E%7B-1%7D%29+%5C%7D&#038;bg=ffffff&#038;fg=545454&#038;s=0' alt='\{ x: x = x(t) + O(N(t)^{-1}) \}' title='\{ x: x = x(t) + O(N(t)^{-1}) \}' class='latex' /> and in frequency in the region <img src='http://s1.wordpress.com/latex.php?latex=%5C%7B+%5Cxi%3A+%7C%5Cxi%7C+%5Csim+N%28t%29+%5C%7D&#038;bg=ffffff&#038;fg=545454&#038;s=0' alt='\{ \xi: |\xi| \sim N(t) \}' title='\{ \xi: |\xi| \sim N(t) \}' class='latex' />, with the solution decaying in energy away from these regions of space and frequency.  Model examples of almost periodic solutions include traveling waves (in which N(t) is fixed, and x(t) moves at constant velocity) and self-similar solutions (in which x(t) is fixed, and N(t) blows up in finite time at some power law rate).</p>
<p>Intuitively, the reason almost periodic minimal energy blowup solutions ought to exist in the absence of global regularity is as follows.  It is known (for any of the equations mentioned above) that global regularity (and scattering) holds at sufficiently small energies.  Thus, if global regularity fails at high energies, there must exist a critical energy <img src='http://s2.wordpress.com/latex.php?latex=E_%7Bcrit%7D&#038;bg=ffffff&#038;fg=545454&#038;s=0' alt='E_{crit}' title='E_{crit}' class='latex' />, below which solutions exist globally (and obey scattering bounds), and above which solutions can blow up.</p>
<p>Now consider a solution <img src='http://s3.wordpress.com/latex.php?latex=u&#038;bg=ffffff&#038;fg=545454&#038;s=0' alt='u' title='u' class='latex' /> at the critical energy which blows up (actually, for technical reasons, we instead consider a sequence of solutions approaching this critical energy which come increasingly close to blowing up, but let&#8217;s ignore this for now).  We claim that this solution must be localised in both space and frequency at every time, thus giving the desired almost periodic minimal energy blowup solution.  Indeed, suppose <img src='http://s1.wordpress.com/latex.php?latex=u%28t%29&#038;bg=ffffff&#038;fg=545454&#038;s=0' alt='u(t)' title='u(t)' class='latex' /> is not localised in frequency at some time t; then one can decompose <img src='http://s2.wordpress.com/latex.php?latex=u%28t%29&#038;bg=ffffff&#038;fg=545454&#038;s=0' alt='u(t)' title='u(t)' class='latex' /> into a high frequency component <img src='http://s3.wordpress.com/latex.php?latex=u_%7Bhi%7D%28t%29&#038;bg=ffffff&#038;fg=545454&#038;s=0' alt='u_{hi}(t)' title='u_{hi}(t)' class='latex' /> and a low frequency component <img src='http://s1.wordpress.com/latex.php?latex=u_%7Blo%7D%28t%29&#038;bg=ffffff&#038;fg=545454&#038;s=0' alt='u_{lo}(t)' title='u_{lo}(t)' class='latex' />, both of which have strictly smaller energy than <img src='http://s2.wordpress.com/latex.php?latex=E_%7Bcrit%7D&#038;bg=ffffff&#038;fg=545454&#038;s=0' alt='E_{crit}' title='E_{crit}' class='latex' />, and which are widely separated from each other in frequency space.  By hypothesis, each of <img src='http://s3.wordpress.com/latex.php?latex=u_%7Bhi%7D&#038;bg=ffffff&#038;fg=545454&#038;s=0' alt='u_{hi}' title='u_{hi}' class='latex' /> and <img src='http://s1.wordpress.com/latex.php?latex=u_%7Blo%7D&#038;bg=ffffff&#038;fg=545454&#038;s=0' alt='u_{lo}' title='u_{lo}' class='latex' /> can then be extended to global solutions, which should remain widely separated in frequency (because the linear analogues of these equations are constant-coefficient and thus preserve frequency support).   Assuming that interactions between very high and very low frequencies are negligible, this implies that the superposition <img src='http://s2.wordpress.com/latex.php?latex=u_%7Bhi%7D%2Bu_%7Blo%7D&#038;bg=ffffff&#038;fg=545454&#038;s=0' alt='u_{hi}+u_{lo}' title='u_{hi}+u_{lo}' class='latex' /> approximately obeys the nonlinear equation; with a suitable perturbation theory, this implies that <img src='http://s3.wordpress.com/latex.php?latex=u_%7Bhi%7D%2Bu_%7Blo%7D&#038;bg=ffffff&#038;fg=545454&#038;s=0' alt='u_{hi}+u_{lo}' title='u_{hi}+u_{lo}' class='latex' /> is close to <img src='http://s1.wordpress.com/latex.php?latex=u&#038;bg=ffffff&#038;fg=545454&#038;s=0' alt='u' title='u' class='latex' />.  But then <img src='http://s2.wordpress.com/latex.php?latex=u&#038;bg=ffffff&#038;fg=545454&#038;s=0' alt='u' title='u' class='latex' /> is not blowing up, a contradiction.   The situation with spatial localisation is similar, but is somewhat more complicated due to the fact that spatial support, in contrast to frequency support, is not preserved by the linear evolution, let alone the nonlinear evolution.</p>
<p>As mentioned before, this type of scheme has been successfully implemented on a number of equations such as NLS and NLW.  However, there are two main obstacles in establishing it for wave maps.  The first is that the wave maps equation is not a scalar equation: the unknown field takes values in a target manifold (specifically, in a hyperbolic space) rather than in a Euclidean space.  As a consequence, it is not obvious how one would perform operations such as &#8220;decompose the solution into low frequency and high frequency components&#8221;, or the inverse operation &#8220;superimpose the low frequency and high frequency components to reconstitute the solution&#8221;.  Another way of viewing the problem is that the various component fields of the solution have to obey a number of important compatibility conditions which can be disrupted by an overly simple-minded approach to decomposition or reconstitution of solutions.</p>
<p>The second problem is that the interaction between very high and very low frequencies for wave maps turns out to not be entirely negligible: the high frequencies do have a negligible impact on the evolution of the low frequencies, but the low frequencies can &#8220;rotate&#8221; the high frequencies by acting as a sort of magnetic field (or more precisely, a connection) for the evolution of those high frequencies.  So the combined evolution of the high and low frequencies is not well approximated by a naive superposition of the separate evolutions of these frequency components.</p>
<p><span id="more-2283"></span>There are a number of ways to resolve the first problem.  One way, which has been pursued in a very recent paper by Sterbenz and Tataru (and also in an earlier paper <a href="http://www.ams.org/mathscinet-getitem?mr=2130618">of Tataru</a>, and <a href="http://arxiv.org/abs/math/0010068">of myself</a> in the case of spherical targets), is to embed the target manifold into Euclidean space and perform various operations (e.g. Littlewood-Paley projections) on the solution in that ambient space, thus creating new fields which lie outside the target.  This does not work directly with the hyperbolic space target because this is not efficiently embeddable into Euclidean space (although it was recently pointed out to me by Jacob Sterbenz that one can proceed &#8211; at least for the narrow question of establishing global regularity rather than scattering &#8211; by passing from hyperbolic space to a compact quotient).  Another approach, introduced by Krieger, is that of <em>dynamic separation</em> &#8211; to isolate a &#8220;dynamic&#8221; scalar field which is unconstrained, controls all the other components of the evolution (as &#8220;static&#8221; functions of the dynamic field), and then manipulate the dynamic field directly.  It is this latter approach which we will pursue in the sequel to this paper; the dynamic field we will use is the tension field <img src='http://s3.wordpress.com/latex.php?latex=%5Cpsi_s&#038;bg=ffffff&#038;fg=545454&#038;s=0' alt='\psi_s' title='\psi_s' class='latex' /> of the harmonic map heat flow.</p>
<p>For the second problem, we will use a variant of the &#8220;frequency truncation method&#8221; of Bourgain, constructing the solution iteratively, in a sequence of time intervals in which the low frequency solution is small in a certain spacetime norm sense.   On each such time interval, the impact of the low frequencies on the high ones is small enough that one can basically ignore the low frequencies, and evolve the high frequencies using the hypothesis that solutions with energy less than <img src='http://s1.wordpress.com/latex.php?latex=E_%7Bcrit%7D&#038;bg=ffffff&#038;fg=545454&#038;s=0' alt='E_{crit}' title='E_{crit}' class='latex' /> have global solutions.  But then one has to appeal to the hypothesis again at every time interval, otherwise the cumulative effect of the low frequencies on the high frequencies will become uncontrollable.  This is not a problem unless the energy of the high frequencies increases significantly after every time interval.  But one can use the energy conservation of the low frequencies and of the entire solution to prevent this from occuring.</p>
<p>All of these things will be detailed in the sequel to the current paper.   What the current paper does is to perform the abstract argument that constructs minimal-energy blowup solutions, assuming four black-box results which will be proven in the sequel:</p>
<ol>
<li>A perturbation theory for wave maps which enjoys a certain <em>fungibility</em> property, which is technical to state but roughly asserts that any large wave map on a long time interval can be subdivided into a controlled number of shorter time intervals in which the evolution behaves like the linear equation;</li>
<li>A means of synthesising solutions from frequency-delocalised data from solutions at strictly lower energies;</li>
<li>A means of synthesising solutions from spatially-dispersed data from solutions at strictly lower energies;</li>
<li>A means of synthesising solutions from spatially-delocalised data from solutions at strictly lower energies.</li>
</ol>
<p>(Here, &#8220;spatially dispersed&#8221; means, roughly, that the energy density does not accumulate at any point, while &#8220;spatially delocalised&#8221; means that there is a location where the energy density accumulates, but a significant amount of energy is also present at a large distance from this accumulation point.  These two scenarios complement the scenario we actually want, which is spatial localisation &#8211; where the energy density accumulates at one point and decays away from that point.)</p>
<p>The abstract component of the argument is in fact quite similar to that used for the energy-critical NLS by Colliander, Keel, Staffilani, Takaoka, and myself.  There are also some more concrete components to the argument in this paper, though, namely the use of the harmonic map heat flow as a kind of nonlinear Littlewood-Paley resolution in order to formally define frequency delocalisation and set up its basic properties, and also a Rellich-type compactness lemma which asserts that solutions which are localised in space and frequency are indeed almost precompact, and which is also proven using the harmonic map heat flow.</p>
  <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gocomments/terrytao.wordpress.com/2283/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/comments/terrytao.wordpress.com/2283/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/godelicious/terrytao.wordpress.com/2283/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/delicious/terrytao.wordpress.com/2283/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gostumble/terrytao.wordpress.com/2283/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/stumble/terrytao.wordpress.com/2283/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/godigg/terrytao.wordpress.com/2283/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/digg/terrytao.wordpress.com/2283/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/goreddit/terrytao.wordpress.com/2283/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/reddit/terrytao.wordpress.com/2283/" /></a> <img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=terrytao.wordpress.com&blog=817149&post=2283&subd=terrytao&ref=&feed=1" /></div>]]></content:encoded>
			<wfw:commentRss>http://terrytao.wordpress.com/2009/06/17/global-regularity-of-wave-maps-vi-abstract-theory-of-minimal-energy-blowup-solutions/feed/</wfw:commentRss>
		<slash:comments>8</slash:comments>
	
		<media:content url="http://1.gravatar.com/avatar/3c795880f3b73784a9b75fbff3772701?s=96&#38;d=identicon" medium="image">
			<media:title type="html">Terry</media:title>
		</media:content>
	</item>
		<item>
		<title>DHJ: Still writing the second paper</title>
		<link>http://terrytao.wordpress.com/2009/06/14/dhj-still-writing-the-second-paper/</link>
		<comments>http://terrytao.wordpress.com/2009/06/14/dhj-still-writing-the-second-paper/#comments</comments>
		<pubDate>Sun, 14 Jun 2009 09:22:32 +0000</pubDate>
		<dc:creator>Terence Tao</dc:creator>
				<category><![CDATA[math.CO]]></category>
		<category><![CDATA[polymath1]]></category>

		<guid isPermaLink="false">http://terrytao.wordpress.com/?p=2292</guid>
		<description><![CDATA[This is a continuation of the previous thread here in the polymath1 project, which is now full.  Ostensibly, the purpose of this thread is to continue writing up the paper containing many of the things achieved during this side of the project, though we have also been spending time on chasing down more results, in [...]<img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=terrytao.wordpress.com&blog=817149&post=2292&subd=terrytao&ref=&feed=1" />]]></description>
			<content:encoded><![CDATA[<div class='snap_preview'><br /><p>This is a continuation of <a href="http://terrytao.wordpress.com/2009/05/22/dhj-writing-the-second-paper/">the previous thread</a> here in the <a href="http://michaelnielsen.org/polymath1/index.php?title=Main_Page">polymath1</a> project, which is now full.  Ostensibly, the purpose of this thread is to <a href="http://michaelnielsen.org/polymath1/index.php?title=Outline_of_second_paper">continue writing up the paper</a> containing many of the things achieved during this side of the project, though we have also been spending time on chasing down more results, in particular using new computer data to narrow down the range of the maximal size of  6D Moser sets (currently we can pin this down to between 353 and 355).   At some point we have to decide what results to put in in full detail in the paper, what results to summarise only (with links to the wiki), and what results to defer to perhaps a subsequent paper, but these decisions can be taken at a leisurely pace.</p>
<p>I guess we&#8217;ve abandoned the numbering system now, but I suppose that if necessary we can use timestamps or URLs to link to previous comments.</p>
  <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gocomments/terrytao.wordpress.com/2292/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/comments/terrytao.wordpress.com/2292/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/godelicious/terrytao.wordpress.com/2292/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/delicious/terrytao.wordpress.com/2292/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gostumble/terrytao.wordpress.com/2292/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/stumble/terrytao.wordpress.com/2292/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/godigg/terrytao.wordpress.com/2292/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/digg/terrytao.wordpress.com/2292/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/goreddit/terrytao.wordpress.com/2292/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/reddit/terrytao.wordpress.com/2292/" /></a> <img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=terrytao.wordpress.com&blog=817149&post=2292&subd=terrytao&ref=&feed=1" /></div>]]></content:encoded>
			<wfw:commentRss>http://terrytao.wordpress.com/2009/06/14/dhj-still-writing-the-second-paper/feed/</wfw:commentRss>
		<slash:comments>104</slash:comments>
	
		<media:content url="http://1.gravatar.com/avatar/3c795880f3b73784a9b75fbff3772701?s=96&#38;d=identicon" medium="image">
			<media:title type="html">Terry</media:title>
		</media:content>
	</item>
	</channel>
</rss>