Msnbot-media and the robots.txt


Andreas2003

Hi there,

I've got a question regarding robots.txt and MSN.

 

Recently I got hits from "bl1sch4091909.phx.gbl".

The user agent is "msnbot-media/1.0 (+http://search.msn.com/msnbot.htm)".

It is also correctly identified as MSN by spiders.txt.

 

I have heard that MSN has split its crawlers and given each a name. One of them is the one mentioned above, which handles all media-related searches.

 

As with Googlebot-Image, I don't want MSN to spider my images.

So, I tried to modify my robots.txt.

 

Now I have added:

User-agent: msnbot-media

Disallow: /

 

But since then I haven't seen MSN back on the site. I guess this is due to the robots.txt, and that I am now blocking MSN completely.
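One way to sanity-check that guess locally is to feed the record to a robots.txt parser and ask which user agents it applies to. The sketch below uses Python's standard urllib.robotparser. It assumes the two lines sit directly under each other in the real file (the blank line between them above being a display artifact), and that the main MSN crawler identifies itself with the token "msnbot", which is an assumption and not something confirmed in this thread. It only shows how one parser reads the record, not how MSN's crawlers actually behave.

```python
from urllib.robotparser import RobotFileParser

# The record from this post, assumed to be contiguous in the real file.
ROBOTS_TXT = """\
User-agent: msnbot-media
Disallow: /
"""


def allowed(user_agent: str, path: str = "/") -> bool:
    """Return True if this parser would let user_agent fetch path."""
    parser = RobotFileParser()
    parser.parse(ROBOTS_TXT.splitlines())
    return parser.can_fetch(user_agent, path)


# User-agent string exactly as it appears in the logs quoted above.
media_allowed = allowed("msnbot-media/1.0 (+http://search.msn.com/msnbot.htm)")

# "msnbot" is an assumed token for the main MSN crawler.
main_allowed = allowed("msnbot")

print("msnbot-media allowed:", media_allowed)
print("msnbot allowed:", main_allowed)
```

If this parser reports the media crawler as blocked and the main crawler as allowed, the record itself is not what keeps MSN away, at least under these assumptions.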

I can't find the correct information on Google, and the help page named in the user-agent string cannot be found on MSN either.

 

Do you have any experience with MSN's new media search and how to exclude its crawler via robots.txt?

 

Thanks in advance,

kind regards

Andreas

Archived

This topic is now archived and is closed to further replies.
