I just came across this while crawling the web. These are statistics from Google on the structure of web pages:
http://code.google.com/webstats/index.html