<?xml version="1.0" encoding="utf-8"?>
<feed xmlns="http://www.w3.org/2005/Atom">
 
 <title>Inductio Ex Machina</title>
 <link href="http://mark.reid.name/iem/atom.xml" rel="self"/>
 <link href="http://mark.reid.name/iem/"/>
 <updated>2009-12-05T11:04:02+11:00</updated>
 <id>http://mark.reid.name/iem/</id>
 <author>
   <name>Mark Reid</name>
   <email>mark@reid.name</email>
 </author>
 
 
 <entry>
   <title>ICML and COLT Highlights</title>
   <link href="http://mark.reid.name/iem/icml-colt-highlights.html"/>
   <updated>2009-07-07T00:00:00+10:00</updated>
   <id>id:/iem/icml-colt-highlights</id>
   <content type="html">&lt;p&gt;Now that I have had a much needed holiday I am feeling refreshed enough to write up some of my notes on the recent &lt;a href='http://www.cs.mcgill.ca/~icml2009/'&gt;ICML&lt;/a&gt; and &lt;a href='http://www.cs.mcgill.ca/~colt2009/'&gt;COLT&lt;/a&gt; conferences held in Montréal this year.&lt;/p&gt;

&lt;p&gt;I did not get to see as much of either conference as I would have liked to as I was nursing an awful cold. Fortunately, I was able to fight it off long enough to finish and give my &lt;a href='/iem/generalised-pinsker-inequalities.html'&gt;ICML and COLT presentations&lt;/a&gt;. The ICML one almost didn&amp;#8217;t happen as I had completely lost my voice the day before.&lt;/p&gt;

&lt;h2 id='icml'&gt;ICML&lt;/h2&gt;

&lt;p&gt;One theory paper I really liked at ICML was &lt;em&gt;&lt;a href='http://conflate.net/icml/paper/2009/89'&gt;PAC-Bayesian Learning of Linear Classifiers&lt;/a&gt;&lt;/em&gt; by Pascal Germain, Alexandre Lacasse, &lt;a href='http://www2.ift.ulaval.ca/~laviolette/'&gt;François Laviolette&lt;/a&gt; and &lt;a href='http://www2.ift.ulaval.ca/~mmarchand/'&gt;Mario Marchand&lt;/a&gt;. They give a general statement of the PAC-Bayes bound in terms of a convex function measuring the divergence between the true and empirical risks and, most importantly, give a remarkably simple proof using only Markov and Jensen&amp;#8217;s inequalities.&lt;/p&gt;

&lt;p&gt;Although I didn&amp;#8217;t really follow the active learning aspects of &lt;em&gt;&lt;a href='http://conflate.net/icml/paper/2009/393'&gt;Learning from Measurements in Exponential Families&lt;/a&gt;&lt;/em&gt; by &lt;a href='http://www.eecs.berkeley.edu/~pliang/'&gt;Percy Liang&lt;/a&gt;, &lt;a href='http://www.cs.berkeley.edu/~klein'&gt;Dan Klein&lt;/a&gt; and &lt;a href='http://www.cs.berkeley.edu/~jordan'&gt;Michael Jordan&lt;/a&gt;, I did think their introduction of &lt;em&gt;measurements&lt;/em&gt; as a way of combining labels and constraints was elegant. Rather than assuming instances &lt;em&gt;X&lt;/em&gt; have observed labels &lt;em&gt;Y&lt;/em&gt;, they assume the labels are hidden and their values are only available implicitly through a set of aggregated measurements &lt;em&gt;M(X,Y)&lt;/em&gt; over the whole data set. Various choices of the function &lt;em&gt;M&lt;/em&gt; result in many existing learning problems.&lt;/p&gt;

&lt;h2 id='colt'&gt;COLT&lt;/h2&gt;

&lt;p&gt;Of the presentations I managed to attend at COLT, there were two that I found thought-provoking.&lt;/p&gt;

&lt;p&gt;&lt;em&gt;&lt;a href='http://ttic.uchicago.edu/~shai/papers/ShalevShamirSridharanSrebro2.pdf'&gt;Learnability and Stability in the General Learning Setting&lt;/a&gt;&lt;/em&gt; by &lt;a href='http://ttic.uchicago.edu/~shai/'&gt;Shai Shalev-Shwartz&lt;/a&gt;, &lt;a href='http://www.cs.huji.ac.il/~ohads03/'&gt;Ohad Shamir&lt;/a&gt;, &lt;a href='http://ttic.uchicago.edu/~nati/'&gt;Nathan Srebro&lt;/a&gt; and &lt;a href='http://ttic.uchicago.edu/~karthik/'&gt;Karthik Sridharan&lt;/a&gt; examined the &amp;#8220;general learning setting&amp;#8221; proposed by Vapnik where, instead of trying to minimising the expected cost of a function that predicts labels for instances &lt;span class='maruku-inline'&gt;&lt;img class='maruku-png' src='/images/latex/8c67bec3874aa3142b957c63ae468fd4.png' alt='equation' style='vertical-align: -0.555555555555556ex;height: 2.33333333333333ex;' /&gt;&lt;/span&gt; we just want to find a point &lt;span class='maruku-inline'&gt;&lt;img class='maruku-png' src='/images/latex/399d4f7a28db2290bec05d8e4d310a7b.png' alt='equation' style='vertical-align: -0.0ex;height: 1.55555555555556ex;' /&gt;&lt;/span&gt; in a hypothesis space &lt;span class='maruku-inline'&gt;&lt;img class='maruku-png' src='/images/latex/cb4f0b7b7462e127797ae6e18b0f6b58.png' alt='equation' style='vertical-align: -0.111111111111111ex;height: 1.66666666666667ex;' /&gt;&lt;/span&gt; that minimises &lt;span class='maruku-inline'&gt;&lt;img class='maruku-png' src='/images/latex/94aa6fd8fb7700ab4650c5f61065ee55.png' alt='equation' style='vertical-align: -0.555555555555556ex;height: 2.33333333333333ex;' /&gt;&lt;/span&gt;. The relationship between hypotheses and instances in the latter case is much more arbitrary than in the usual supervised setting.&lt;/p&gt;

&lt;p&gt;Interestingly, the usual correspondence between uniform convergence of empirical risks to the expected risk and learnability which holds for supervised learning with empirical risk minimisation (ERM) breaks down in the general setting. In particular, it turns out uniform convergence is sufficient for learning with ERM but not necessary. The authors therefore explore generalised notions of ERM and show that learnability and stability are equivalent for these more general versions.&lt;/p&gt;

&lt;p&gt;Finally, I really liked the connections made between online learning and traditional statistical learning theory in &lt;em&gt;&lt;a href='http://arxiv.org/abs/0903.5328'&gt;A Stochastic View of Optimal Regret through Minimax Duality&lt;/a&gt;&lt;/em&gt; by &lt;a href='http://www.eecs.berkeley.edu/~jake/'&gt;Jacob Abernethy&lt;/a&gt;, &lt;a href='http://www.eecs.berkeley.edu/~alekh/'&gt;Alekh Agarwal&lt;/a&gt;, &lt;a href='http://www.stat.berkeley.edu/~bartlett/'&gt;Peter Bartlett&lt;/a&gt; and &lt;a href='http://www-stat.wharton.upenn.edu/~rakhlin/'&gt;Alexander Rakhlin&lt;/a&gt;. In this paper the authors use the von Neumann&amp;#8217;s minimax theorem to relate the minimax regret that is normally used in online convex optimisation (OCO) games to a supremum over a set of distributions of the expectations of a loss. Through a neat geometrical interpretation of this loss they are able to establish upper and lower bounds on the optimal regret for various online learning problems.&lt;/p&gt;

&lt;h2 id='more_highlights'&gt;More Highlights&lt;/h2&gt;

&lt;p&gt;If you are after more highlights, I recommend having a look at &lt;a href='http://hunch.net/?p=813'&gt;John&lt;/a&gt; and &lt;a href='http://nlpers.blogspot.com/2009/06/icmlcoltuai-2009-retrospective.html'&gt;Hal&lt;/a&gt;&amp;#8217;s overviews of the conferences.&lt;/p&gt;</content>
 </entry>
 
 <entry>
   <title>ICML Discussion</title>
   <link href="http://mark.reid.name/iem/icml-discussion.html"/>
   <updated>2009-06-11T00:00:00+10:00</updated>
   <id>id:/iem/icml-discussion</id>
   <content type="html">&lt;p&gt;Another quick note, this time about the &lt;a href='http://conflate.net/icml/'&gt;ICML Discussion&lt;/a&gt; site I have updated for &lt;a href='http://www.cs.mcgill.ca/~icml2009/'&gt;ICML 2009&lt;/a&gt; in Montréal, Canada.&lt;/p&gt;

&lt;p&gt;Thanks to this year&amp;#8217;s organisers, I was given a list of authors and papers for ICML 2009 and was able to add them to the base I set up &lt;a href='/iem/icml-discussion-site.html'&gt;last year&lt;/a&gt; for ICML 2008. I&amp;#8217;ve made separate indices for &lt;a href='http://conflate.net/icml/paper/2009'&gt;this year&lt;/a&gt;&amp;#8217;s and &lt;a href='http://conflate.net/icml/paper/2008'&gt;last year&lt;/a&gt;&amp;#8217;s papers but have bundled all the &lt;a href='http://conflate.net/icml/author'&gt;authors&lt;/a&gt; across the two years in together.&lt;/p&gt;

&lt;p&gt;The statistics I have kept for the site since I started it show that there has been a boost of interest since I&amp;#8217;ve added the new papers. I&amp;#8217;m hoping that will result in a flurry of comments during the conference as people attend talks and posters. Keep in mind that if you are commenting on a paper &lt;a href='http://conflate.net/icml/help#syntax'&gt;you can use LaTeX&lt;/a&gt; to get your point across.&lt;/p&gt;

&lt;p&gt;Unfortunately, there is no notification system in place for the discussion site at the moment so I encourage authors with papers accepted to this year&amp;#8217;s conference to check the &lt;a href='http://conflate.net/icml/'&gt;front page&lt;/a&gt; of the site regularly to see if anyone has asked a question about their paper.&lt;/p&gt;

&lt;p&gt;If there is any questions or suggestions about the site you can leave me some feedback either here or in the &lt;a href='http://conflate.net/icml/feedback'&gt;feedback section&lt;/a&gt; of the site itself.&lt;/p&gt;

&lt;p&gt;Enjoy the conference and the discussion site. See you in Montréal!&lt;/p&gt;</content>
 </entry>
 
 <entry>
   <title>Generalised Pinsker Inequalities and Surrogate Regret Bounds</title>
   <link href="http://mark.reid.name/iem/generalised-pinsker-inequalities.html"/>
   <updated>2009-05-23T00:00:00+10:00</updated>
   <id>id:/iem/generalised-pinsker-inequalities</id>
   <content type="html">&lt;p&gt;Just a quick note to say that &lt;a href='http://axiom.anu.edu.au/~williams/'&gt;Bob Williamson&lt;/a&gt; and I have improved a few of the results we had previously written up in our &lt;a href='/iem/information-divergence-and-risk.html'&gt;technical report&lt;/a&gt; and even managed to get some of them accepted to &lt;a href='http://www.cs.mcgill.ca/~icml2009/'&gt;ICML&lt;/a&gt; and &lt;a href='http://www.cs.mcgill.ca/~colt2009'&gt;COLT&lt;/a&gt; this year.&lt;/p&gt;

&lt;p&gt;The ICML paper is &lt;a href='http://www.cs.mcgill.ca/~icml2009/abstracts.html#400'&gt;Surrogate Regret Bounds for Proper Losses&lt;/a&gt;. In it we extend earlier results on surrogate bounds for 0-1 misclassification loss to cost-weighted misclassification losses and proper losses. We also investigate conditions for convexity of composite proper losses.&lt;/p&gt;

&lt;p&gt;As the abstract puts it:&lt;/p&gt;

&lt;blockquote&gt;
&lt;p&gt;We present tight surrogate regret bounds for the class of proper (i.e., Fisher consistent) losses. The bounds generalise the margin-based bounds due to Bartlett et al. (2006). The proof uses Taylor&amp;#8217;s theorem and leads to new representations for loss and regret and a simple proof of the integral representation of proper losses. We also present a different formulation of a duality result of Bregman divergences which leads to a demonstration of the convexity of composite losses using canonical link functions.&lt;/p&gt;
&lt;/blockquote&gt;

&lt;p&gt;What I particularly like about this paper is that these results and several earlier ones regarding proper losses are arrived at using little more than a Taylor series expansion.&lt;/p&gt;

&lt;p&gt;The COLT paper, &lt;a href='/files/pubs/colt09.pdf'&gt;Generalised Pinsker Inequalities&lt;/a&gt;, is (unsurprisingly) a bit more technical. As the name suggests, we were able to generalise the classical &lt;a href='http://en.wikipedia.org/wiki/Pinsker&amp;apos;s_inequality'&gt;Pinsker inequality&lt;/a&gt;&lt;sup id='fnref:1'&gt;&lt;a href='#fn:1' rel='footnote'&gt;1&lt;/a&gt;&lt;/sup&gt; that provides a lower bound on the KL divergence &lt;span class='maruku-inline'&gt;&lt;img class='maruku-png' src='/images/latex/bc99c9778c6b8c8aacfee95a1ad14ce8.png' alt='equation' style='vertical-align: -0.555555555555556ex;height: 2.33333333333333ex;' /&gt;&lt;/span&gt; between the distributions &lt;span class='maruku-inline'&gt;&lt;img class='maruku-png' src='/images/latex/5b6ea149f8244c11cf88e1ea59035b8f.png' alt='equation' style='vertical-align: -0.0ex;height: 1.55555555555556ex;' /&gt;&lt;/span&gt; and &lt;span class='maruku-inline'&gt;&lt;img class='maruku-png' src='/images/latex/d550ce8a75da6e4ced0323f55cf5608b.png' alt='equation' style='vertical-align: -0.444444444444444ex;height: 2.0ex;' /&gt;&lt;/span&gt; in terms of their variational divergence &lt;span class='maruku-inline'&gt;&lt;img class='maruku-png' src='/images/latex/a31ec140b20af70f10b18370bc4180fb.png' alt='equation' style='vertical-align: -0.555555555555556ex;height: 2.33333333333333ex;' /&gt;&lt;/span&gt;:&lt;/p&gt;
&lt;div class='maruku-equation'&gt;&lt;img class='maruku-png' src='/images/latex/e68e8b068d27c54d17219b77acc3ee58.png' alt='equation' style='height: 2.66666666666667ex;' /&gt;&lt;div class='maruku-eq-tex'&gt;&lt;code style='display: none'&gt;	KL(P,Q) \ge \frac{1}{2} [V(P,Q)]^2 

&lt;/code&gt;&lt;/div&gt;&lt;/div&gt;
&lt;p&gt;Not only were we able to tighten this bound but we provide a template for constructing similar (and tight) bounds for any &lt;a href='http://en.wikipedia.org/wiki/F-divergence'&gt;f-divergence&lt;/a&gt; in terms of variational divergence. Furthermore, we also generalise the bounds to make use of one or more non-symmetric variational divergences.&lt;/p&gt;

&lt;p&gt;Here&amp;#8217;s the abstract:&lt;/p&gt;

&lt;blockquote&gt;
&lt;p&gt;We generalise the classical Pinsker inequality which relates variational divergence to Kullback-Liebler divergence in two ways: we consider arbitrary f -divergences in place of KL divergence, and we assume knowledge of a sequence of values of generalised variational divergences. We then develop a best possible inequality for this doubly generalised situation. Specialising our result to the classical case provides a new and tight explicit bound relating KL to variational divergence (solving a problem posed by Vajda some 40 years ago). The solution relies on exploiting a connection between divergences and the Bayes risk of a learning problem via an integral representation.&lt;/p&gt;
&lt;/blockquote&gt;

&lt;p&gt;Although the proof requires some careful, technical detail, the approach is very simple and geometric. It makes use of the fact that any f-divergence can be written as a weighted integral of variational-like divergences and the fact that f-divergences are related to Bayes risk curves for proper losses.&lt;/p&gt;

&lt;p&gt;In a very loose sense, these papers are duals of each other: the bounds in both share a similar structure &amp;#8211; what can be said about a &amp;#8220;complex&amp;#8221; divergence given information about some &amp;#8220;simple&amp;#8221; divergences vs. what can be said about a &amp;#8220;simple&amp;#8221; loss given information about a &amp;#8220;complex&amp;#8221; one &amp;#8211; and both are driven by the integral representation of losses and divergences which provides an interpretation of &amp;#8220;simple&amp;#8221; and &amp;#8220;complex&amp;#8221;.&lt;/p&gt;

&lt;p&gt;I&amp;#8217;ll be presenting both these papers in Montréal in June so if you are also attending ICML and COLT and are interested in these results please let me know.&lt;/p&gt;
&lt;div class='footnotes'&gt;&lt;hr /&gt;&lt;ol&gt;&lt;li id='fn:1'&gt;
&lt;p&gt;The statement of Pinsker&amp;#8217;s inequality on the Wikipedia page uses a version of variational divergence that differs from ours by a factor of 2.&lt;/p&gt;
&lt;a href='#fnref:1' rev='footnote'&gt;&amp;#8617;&lt;/a&gt;&lt;/li&gt;&lt;/ol&gt;&lt;/div&gt;</content>
 </entry>
 
 <entry>
   <title>Irving “Jack” Good (1916–2009)</title>
   <link href="http://mark.reid.name/iem/irving-john-good.html"/>
   <updated>2009-05-04T00:00:00+10:00</updated>
   <id>id:/iem/irving-john-good</id>
   <content type="html">&lt;p&gt;The &lt;a href='http://languagelog.ldc.upenn.edu/nll/?p=1386'&gt;Language Log&lt;/a&gt; recently noted the passing of the eminent statistician, probability theorist, WWII code-breaker, colleague of Alan Turing and populariser of Go, Irving John &amp;#8220;Jack&amp;#8221; Good who died on April the 5&lt;sup&gt;th&lt;/sup&gt; at the ripe old age of 92.&lt;/p&gt;

&lt;p&gt;As I have &lt;a href='/iem/research-changing-books.html'&gt;mentioned earlier&lt;/a&gt; on this blog, Good&amp;#8217;s 1965 monograph, &amp;#8221;&lt;a href='http://mitpress.mit.edu/catalog/item/default.asp?ttype=2&amp;amp;tid=9821'&gt;The Estimation of Probabilities: An Essay on Modern Bayesian Methods&lt;/a&gt;&amp;#8221; was especially influential for my research. In this slim volume was a beautifully written treatise on how to estimate probabilities from &amp;#8220;effectively small samples&amp;#8221; &amp;#8211; exactly the problem I was facing in my &lt;a href='http://arrow.unsw.edu.au/vital/access/manager/Repository/unsworks:1253'&gt;thesis&lt;/a&gt;. When samples are scarce, evaluating models becomes difficult because many models can share exactly the same misclassification rates.&lt;/p&gt;

&lt;p&gt;It was by chance that I read his discussion of Dirichlet priors for multinomial samples and realised that they were the perfect tool for smoothing estimations of misclassification counts in contingency tables. By learning these priors on one task and transferring them to new, similar tasks I was able to break ties between models in a principled way by using the performance of similar models on earlier tasks.&lt;/p&gt;

&lt;p&gt;The other time I encountered Good&amp;#8217;s work was when reading about population estimation. &lt;a href='http://en.wikipedia.org/wiki/Good-Turing'&gt;Good-Turing estimation&lt;/a&gt; is a clever statistical technique developed by Good and Turing when they worked together as code-breakers at Bletchley Park. When randomly sampling items from some unknown number of classes, the Good-Turing estimator takes into account the probability that the next randomly selected item comes from a previously &lt;em&gt;unseen&lt;/em&gt; class. This is very useful in areas such as linguistics where vocabularies are only incompletely known through some relatively small corpus of text.&lt;/p&gt;

&lt;p&gt;Essentially, their technique estimates then smoothes the &lt;em&gt;frequencies of the frequencies&lt;/em&gt; of the observed classes &amp;#8211; the number of times a class with one item is observed, the number of times a class with two items is observed, and so on. By fitting a linear model to a log-log plot of these smoothed frequencies of frequencies estimates of low frequency classes can be revised upwards, possibly from 0, and an estimate of number of unseen class can be made.&lt;/p&gt;

&lt;p&gt;In general, Good&amp;#8217;s work could be described as that of a practical subjective Bayesian and I think he summed up this position very elegantly in a footnote in his monograph:&lt;/p&gt;

&lt;blockquote&gt;
&lt;p&gt;In our theories, we rightly search for unification, but real life is both complicated and short, and we make no mockery of honest ad hockery.&lt;/p&gt;
&lt;/blockquote&gt;</content>
 </entry>
 
 <entry>
   <title>Priors and the Argument By Design</title>
   <link href="http://mark.reid.name/iem/argument-by-design.html"/>
   <updated>2009-04-21T00:00:00+10:00</updated>
   <id>id:/iem/argument-by-design</id>
   <content type="html">&lt;p&gt;Although I&amp;#8217;m not completely sold on the book as a whole, Ken Binmore&amp;#8217;s &amp;#8221;&lt;a href='http://www.librarything.com/work/book/41239432'&gt;Rational Decisions&lt;/a&gt;&amp;#8221; is a great collection of very sharp, terse insights into many &lt;a href='/iem/the-horatio-principle.html'&gt;paradoxes&lt;/a&gt; and counter-intuitive results regarding probability and decision theory.&lt;/p&gt;

&lt;p&gt;One such digression is his five sentence analysis of the &amp;#8220;Argument By Design&amp;#8221; that is central to the Intelligent Design movement. With minor modification to his notation, it goes like this:&lt;/p&gt;

&lt;blockquote&gt;
&lt;p&gt;Let &lt;em&gt;F&lt;/em&gt; be the event that something appears to have been organized. Let &lt;em&gt;G&lt;/em&gt; be the event that there is an organizer. Everybody agrees that &lt;span class='maruku-inline'&gt;&lt;img class='maruku-png' src='/images/latex/182f0a82ade4a01c479c14632078f50e.png' alt='equation' style='vertical-align: -0.555555555555556ex;height: 2.33333333333333ex;' /&gt;&lt;/span&gt;, but the argument by design needs to deduce that &lt;span class='maruku-inline'&gt;&lt;img class='maruku-png' src='/images/latex/3e88597609271bbfc63c03278a72a7f1.png' alt='equation' style='vertical-align: -0.555555555555556ex;height: 2.33333333333333ex;' /&gt;&lt;/span&gt; if God&amp;#8217;s existence is to be more likely than not. Applying Bayes&amp;#8217; rule, we find that&lt;/p&gt;
&lt;/blockquote&gt;
&lt;div class='maruku-equation'&gt;&lt;img class='maruku-png' src='/images/latex/137860f00021ed0d9af214a0e710c6ea.png' alt='equation' style='height: 5.33333333333333ex;' /&gt;&lt;div class='maruku-eq-tex'&gt;&lt;code style='display: none'&gt;	\displaystyle
	\mathbb{P}(G|F) 
	= \frac{\mathbb{P}(F|G)\mathbb{P}(G)}{\mathbb{P}(F|G)\mathbb{P}(G) + \mathbb{P}(F|\neg G)\mathbb{P}(\neg G)}.

&lt;/code&gt;&lt;/div&gt;&lt;/div&gt;
&lt;blockquote&gt;
&lt;p&gt;If &lt;span class='maruku-inline'&gt;&lt;img class='maruku-png' src='/images/latex/7aff8ef6e1f04ccd27b4638bd6e46dce.png' alt='equation' style='vertical-align: -0.555555555555556ex;height: 2.33333333333333ex;' /&gt;&lt;/span&gt;, we can deduce the required conclusion that &lt;span class='maruku-inline'&gt;&lt;img class='maruku-png' src='/images/latex/3e88597609271bbfc63c03278a72a7f1.png' alt='equation' style='vertical-align: -0.555555555555556ex;height: 2.33333333333333ex;' /&gt;&lt;/span&gt;, but we are otherwise left in doubt.&lt;/p&gt;
&lt;/blockquote&gt;

&lt;p&gt;From this snappy argument he concludes&lt;/p&gt;

&lt;blockquote&gt;
&lt;p&gt;Bayesianism therefore has an explanation of why religious folk are more ready to accept the argument by design than skeptics!&lt;/p&gt;
&lt;/blockquote&gt;

&lt;p&gt;As an atheist with Bayesian tendencies, I was very ready to accept this argument against the argument by design, but upon reflection I think this conclusion is arrived at too quickly.&lt;/p&gt;

&lt;p&gt;Since we are computing probabilities anyway, why stop at checking if one is just larger than the other when we can compute odds?&lt;/p&gt;

&lt;p&gt;Rather than just ask whether &lt;span class='maruku-inline'&gt;&lt;img class='maruku-png' src='/images/latex/3e88597609271bbfc63c03278a72a7f1.png' alt='equation' style='vertical-align: -0.555555555555556ex;height: 2.33333333333333ex;' /&gt;&lt;/span&gt; we can compute the odds &lt;span class='maruku-inline'&gt;&lt;img class='maruku-png' src='/images/latex/bef9bd167062e2fc8183cbdfd092f606.png' alt='equation' style='vertical-align: -0.555555555555556ex;height: 2.33333333333333ex;' /&gt;&lt;/span&gt; and see how far it is from 1. Specifically,&lt;/p&gt;
&lt;div class='maruku-equation'&gt;&lt;img class='maruku-png' src='/images/latex/b3b050f46951f3c2d861891c7af6d3c4.png' alt='equation' style='height: 5.33333333333333ex;' /&gt;&lt;div class='maruku-eq-tex'&gt;&lt;code style='display: none'&gt;\displaystyle\frac{\mathbb{P}(G|F)}{\mathbb{P}(\neg G|F)}
= \frac{\mathbb{P}(F|G)\mathbb{P}(G)}{\mathbb{P}(F|\neg G)\mathbb{P}(\neg G)}
= \frac{\mathbb{P}(F|G)}{\mathbb{P}(F|\neg G)}
  \cdot\frac{\mathbb{P}(G)}{\mathbb{P}(\neg G)}

&lt;/code&gt;&lt;/div&gt;&lt;/div&gt;
&lt;p&gt;since the denominator in Binmore&amp;#8217;s calculation is just &lt;span class='maruku-inline'&gt;&lt;img class='maruku-png' src='/images/latex/8594756b31a2eee8f74e66a78a9f63c5.png' alt='equation' style='vertical-align: -0.555555555555556ex;height: 2.33333333333333ex;' /&gt;&lt;/span&gt; &amp;#8211; the probability that &amp;#8220;something appears to have been organized&amp;#8221; and is assumed to be the same for believer and skeptic alike.&lt;/p&gt;

&lt;p&gt;Now we see that the argument by design increases one&amp;#8217;s belief in God given that organisation is observed (since we have assumed &lt;span class='maruku-inline'&gt;&lt;img class='maruku-png' src='/images/latex/182f0a82ade4a01c479c14632078f50e.png' alt='equation' style='vertical-align: -0.555555555555556ex;height: 2.33333333333333ex;' /&gt;&lt;/span&gt;) but will only &amp;#8220;convert&amp;#8221; a non-believer to a believer if the likelihood ratio for God is larger than the non-believer&amp;#8217;s initial skepticism, as measured by his or her subjective odds against God&amp;#8217;s existence.&lt;sup id='fnref:1'&gt;&lt;a href='#fn:1' rel='footnote'&gt;1&lt;/a&gt;&lt;/sup&gt; That is, if&lt;/p&gt;
&lt;div class='maruku-equation'&gt;&lt;img class='maruku-png' src='/images/latex/4f80609791168f6121903917dff9ad65.png' alt='equation' style='height: 5.33333333333333ex;' /&gt;&lt;div class='maruku-eq-tex'&gt;&lt;code style='display: none'&gt;\displaystyle L = \frac{\mathbb{P}(F|G)}{\mathbb{P}(F|\neg G)} &gt; 
\frac{\mathbb{P}(\neg G)}{\mathbb{P}(G)} = R.

&lt;/code&gt;&lt;/div&gt;&lt;/div&gt;
&lt;p&gt;The strength of the argument by design is a function of the likelihood ratio &lt;span class='maruku-inline'&gt;&lt;img class='maruku-png' src='/images/latex/7d2c0c2260079724d34361ae6d009baf.png' alt='equation' style='vertical-align: -0.0ex;height: 1.55555555555556ex;' /&gt;&lt;/span&gt; of observing organised things with and without assuming a God.&lt;/p&gt;

&lt;p&gt;However, contrary to Binmore&amp;#8217;s conclusion, I believe the real reason ID advocates and skeptics disagree as to the strength of the argument by design is due to differences in this likelihood ratio.&lt;/p&gt;

&lt;p&gt;It is fair to assume, as Binmore does, that the the likelihood ratio &lt;span class='maruku-inline'&gt;&lt;img class='maruku-png' src='/images/latex/6509ebfee3dd861f8b85441d1e13d9f2.png' alt='equation' style='vertical-align: -0.333333333333333ex;height: 1.88888888888889ex;' /&gt;&lt;/span&gt; for an ID advocate and the likelihood ratio &lt;span class='maruku-inline'&gt;&lt;img class='maruku-png' src='/images/latex/e6c82ef2f2612ef96cfb1774912a2033.png' alt='equation' style='vertical-align: -0.333333333333333ex;height: 1.88888888888889ex;' /&gt;&lt;/span&gt; of a skeptic are both greater than 1 since there are many man-made artefacts such as watches that clearly do have an &amp;#8220;organizer&amp;#8221;. However, I would additionally argue that &lt;span class='maruku-inline'&gt;&lt;img class='maruku-png' src='/images/latex/c1342272b8aeb379f41d6725024b3aad.png' alt='equation' style='vertical-align: -0.333333333333333ex;height: 1.88888888888889ex;' /&gt;&lt;/span&gt;. Why? Well, presumably a skeptic who belies in evolution sees the natural world as a whole lot of things that were organised without an organiser whereas the ID advocate does not.&lt;/p&gt;

&lt;p&gt;An ID advocate with a large &lt;span class='maruku-inline'&gt;&lt;img class='maruku-png' src='/images/latex/6509ebfee3dd861f8b85441d1e13d9f2.png' alt='equation' style='vertical-align: -0.333333333333333ex;height: 1.88888888888889ex;' /&gt;&lt;/span&gt; will believe that even if you were very skeptical (small &lt;span class='maruku-inline'&gt;&lt;img class='maruku-png' src='/images/latex/8b04f8acfc22dc85d0a8e8f623ad0f5c.png' alt='equation' style='vertical-align: -0.0ex;height: 1.55555555555556ex;' /&gt;&lt;/span&gt;), the product of &lt;span class='maruku-inline'&gt;&lt;img class='maruku-png' src='/images/latex/6509ebfee3dd861f8b85441d1e13d9f2.png' alt='equation' style='vertical-align: -0.333333333333333ex;height: 1.88888888888889ex;' /&gt;&lt;/span&gt; and &lt;span class='maruku-inline'&gt;&lt;img class='maruku-png' src='/images/latex/8b04f8acfc22dc85d0a8e8f623ad0f5c.png' alt='equation' style='vertical-align: -0.0ex;height: 1.55555555555556ex;' /&gt;&lt;/span&gt; would be much larger than 1 and you would therefore be convinced to believe in God. From the point of view of a skeptic though &lt;span class='maruku-inline'&gt;&lt;img class='maruku-png' src='/images/latex/e6c82ef2f2612ef96cfb1774912a2033.png' alt='equation' style='vertical-align: -0.333333333333333ex;height: 1.88888888888889ex;' /&gt;&lt;/span&gt; would not be larger than &lt;span class='maruku-inline'&gt;&lt;img class='maruku-png' src='/images/latex/8b04f8acfc22dc85d0a8e8f623ad0f5c.png' alt='equation' style='vertical-align: -0.0ex;height: 1.55555555555556ex;' /&gt;&lt;/span&gt; and so the argument by design is not convincing.&lt;/p&gt;
&lt;div class='footnotes'&gt;&lt;hr /&gt;&lt;ol&gt;&lt;li id='fn:1'&gt;
&lt;p&gt;This type of updating is what &lt;a href='http://en.wikipedia.org/wiki/Richard_Jeffrey'&gt;Richard Jeffrey&lt;/a&gt; evocatively called &amp;#8220;probability kinematics&amp;#8221; and is otherwise known as &lt;a href='http://plato.stanford.edu/entries/bayes-theorem/#4'&gt;Jeffrey Conditioning&lt;/a&gt;.&lt;/p&gt;
&lt;a href='#fnref:1' rev='footnote'&gt;&amp;#8617;&lt;/a&gt;&lt;/li&gt;&lt;/ol&gt;&lt;/div&gt;</content>
 </entry>
 
 
</feed>