Monthly Archives: February 2008

A Meta-index of Data Sets

I had to go hunting around for some data to try some new ideas on recently. As handy as Google is, there’s still a fair bit of chaff from which to sort the wheat.

Fortunately, there is a lot of good stuff out there including well-organised indexes of data sets for various purposes. For my future reference (and [...]

Clarity and Mathematics

John Langford has diagnosed a complexity illness that afflicts research in academia. One of its symptoms is what he calls “Math++”: the use of unnecessary and obfuscatory mathematics to improve the author’s chance of publication.

Having recently ploughed through a large number of math-heavy articles during the preparation of a COLT paper I have started to [...]

Staying Organised with CiteULike and BibDesk

I recently started using CiteULike to keep track of papers I read. For those not familiar with it, it deems itself to be “a free online service to organise your academic papers”. In contrast to my offline bibliography organising tool, BibDesk, a service like this has at least three main advantages:

Reduced data entry: If someone [...]

A Cute Convexity Result

Just when I thought I was starting to get my head around the multitudinous uses of convexity in statistics I was thrown by the following definition:

A function f over the interval (a,b) is convex if, for all choices of {x,y,z} satisfying a < x < y < z < b the [...]