<?xml version="1.0" encoding="UTF-8"?><rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:atom="http://www.w3.org/2005/Atom"
	>
<channel>
	<title>Comments on: Super Crunchers</title>
	<atom:link href="http://conflate.net/inductio/2008/09/super-crunchers/feed/" rel="self" type="application/rss+xml" />
	<link>http://conflate.net/inductio/2008/09/super-crunchers/</link>
	<description>Thoughts on Machine Learning and Inference</description>
	<pubDate>Tue, 06 Jan 2009 01:28:27 +0000</pubDate>
	<generator>http://wordpress.org/?v=2.6</generator>
		<item>
		<title>By: Bob Carpenter</title>
		<link>http://conflate.net/inductio/2008/09/super-crunchers/#comment-273</link>
		<dc:creator>Bob Carpenter</dc:creator>
		<pubDate>Wed, 17 Dec 2008 19:18:46 +0000</pubDate>
		<guid isPermaLink="false">http://conflate.net/inductio/?p=124#comment-273</guid>
		<description>&lt;p&gt;I also thought about getting this book, so thanks for saving me some time.  I was so turned off by the breathless style of &lt;i&gt;The Numerati&lt;/i&gt; (another pop book about data mining) that I think I'll wait a while before delving into another pop quant book.&lt;/p&gt;

&lt;p&gt;I believe the right question to ask is whether we need domain experts at all, or just need a whole lot of data.&lt;/p&gt;

&lt;p&gt;I think the answer's pretty obvious.  Even the basic structure of a statistical model entails a large degree of design in everything from setting up dependencies to selecting predictors.&lt;/p&gt;

&lt;p&gt;The most accurate natural language systems bring in all kinds of human-generated knowledge sources from labeled data for classifiers or part-of-speech taggers to domain-specific dictionaries to full-blown ontologies.&lt;/p&gt;

&lt;p&gt;I'd cut just about anybody slack for not sorting out all of our redundant terminology. I only just recently realized that so-called max entropy classifiers, logistic regression, and one-layer neural nets with sigmoid/softmax activation were the same thing, and that L1 norms, Laplace priors, double-exponential priors,  and the "lasso" are the same thing.&lt;/p&gt;
</description>
		<content:encoded><![CDATA[<p>I also thought about getting this book, so thanks for saving me some time.  I was so turned off by the breathless style of <i>The Numerati</i> (another pop book about data mining) that I think I&#8217;ll wait a while before delving into another pop quant book.</p>

<p>I believe the right question to ask is whether we need domain experts at all, or just need a whole lot of data.</p>

<p>I think the answer&#8217;s pretty obvious.  Even the basic structure of a statistical model entails a large degree of design in everything from setting up dependencies to selecting predictors.</p>

<p>The most accurate natural language systems bring in all kinds of human-generated knowledge sources from labeled data for classifiers or part-of-speech taggers to domain-specific dictionaries to full-blown ontologies.</p>

<p>I&#8217;d cut just about anybody slack for not sorting out all of our redundant terminology. I only just recently realized that so-called max entropy classifiers, logistic regression, and one-layer neural nets with sigmoid/softmax activation were the same thing, and that L1 norms, Laplace priors, double-exponential priors,  and the &#8220;lasso&#8221; are the same thing.</p>]]></content:encoded>
	</item>
	<item>
		<title>By: Ricardo Niederberger Cabral</title>
		<link>http://conflate.net/inductio/2008/09/super-crunchers/#comment-238</link>
		<dc:creator>Ricardo Niederberger Cabral</dc:creator>
		<pubDate>Tue, 30 Sep 2008 00:21:41 +0000</pubDate>
		<guid isPermaLink="false">http://conflate.net/inductio/?p=124#comment-238</guid>
		<description>&lt;p&gt;Great review!&lt;/p&gt;
</description>
		<content:encoded><![CDATA[<p>Great review!</p>]]></content:encoded>
	</item>
	<item>
		<title>By: ansate</title>
		<link>http://conflate.net/inductio/2008/09/super-crunchers/#comment-237</link>
		<dc:creator>ansate</dc:creator>
		<pubDate>Sat, 27 Sep 2008 14:31:58 +0000</pubDate>
		<guid isPermaLink="false">http://conflate.net/inductio/?p=124#comment-237</guid>
		<description>&lt;p&gt;thanks for the review! I'd been thinking about reading this but hadn't gotten to it. Sounds like he's enthusiastic about the same things I am, but doesn't add enough to the discussion to be worth it to us geeks who used these arguments in our grad school entrance essays.&lt;/p&gt;
</description>
		<content:encoded><![CDATA[<p>thanks for the review! I&#8217;d been thinking about reading this but hadn&#8217;t gotten to it. Sounds like he&#8217;s enthusiastic about the same things I am, but doesn&#8217;t add enough to the discussion to be worth it to us geeks who used these arguments in our grad school entrance essays.</p>]]></content:encoded>
	</item>
</channel>
</rss>
