computer chip



home
archive

suggestions
help page
future features



View current page
...more recent posts

Holy cow we are being demolished by the googlebot! It is going crazy and absolutely pegging our bandwidth with bizarre page requests like:

/treehouse/date/2000/systemnews/index.php3/mrwilson/rachael/ mrwilson/arboretum.php3/dave/systemnews/index.php3/

WTF?

Unfortunately, instead of generating a 404 (since that obviously is not a real page) this requests generates the entire year 2000 archive for the /treehouse page (a huge transfer!) The system just sees /treehouse/date/2000/ and can't make anything of the rest of it, so serves the whole year.

Ouch.

I have temporarily disabled entire year archives. We'll see if that helps. Damn. I should have caught this a few days ago, but I wasn't paying close enough attention. Might cost me. Literally.
- jim 8-25-2005 9:14 pm [link] [11 comments]