agiftcodotcom Posted June 3, 2004 Share Posted June 3, 2004 When I view my error reports, I see that someone (or something/bot) is getting a 404 when looking for /robots.txt That's because I don't have a /robots.txt What should I have for /robots.txt? Any help is appreciated. Contributions I used : Updated 06-13-04 23:42 ---------------- Vote on My Graphis Poll Link to comment Share on other sites More sharing options...
stevel Posted June 3, 2004 Share Posted June 3, 2004 You don't need one. But if you don't want the 404s, create one such as this one line: User-agent: * On the other hand, it's a good idea to keep robots out of pages such as the shopping cart, etc. Here's what I have: User-agent: * Disallow: /shopping_cart.php Disallow: /advanced_search.php Disallow: /login.php Disallow: /checkout_shipping.php Disallow: /account.php Disallow: /login.php Disallow: /create_account.php Disallow: /password_forgotten.php Steve Contributions: Country-State Selector Login Page a la Amazon Protection of Configuration Updated spiders.txt Embed Links with SID in Description Link to comment Share on other sites More sharing options...
peterr Posted June 4, 2004 Share Posted June 4, 2004 Hi, I've read on many search engine optimisation forums that even though you can specify what files/paths are 'allowed', most bots/spiders don't follow the 'rules' anyway, it won't stop them spidering a path/file after all. :D Peter Link to comment Share on other sites More sharing options...
agiftcodotcom Posted June 4, 2004 Author Share Posted June 4, 2004 So basically... not needed. Cool, thanks guys. Contributions I used : Updated 06-13-04 23:42 ---------------- Vote on My Graphis Poll Link to comment Share on other sites More sharing options...
peterr Posted June 4, 2004 Share Posted June 4, 2004 Eric, As Steve said, just create a file called robots.txt with this line in it User-agent: * Place it in your web root path (ususally called public_html). Then that gets rid of the 404 messages. Peter Link to comment Share on other sites More sharing options...
stevel Posted June 4, 2004 Share Posted June 4, 2004 I've read on many search engine optimisation forums that even though you can specify what files/paths are 'allowed', most bots/spiders don't follow the 'rules' anyway, it won't stop them spidering a path/file after all. That doesn't seem to be the case from what I can tell. Since I added the robots.txt file as above, none of the search engines have gone down the disallowed paths. It certainly doesn't hurt in any case. Steve Contributions: Country-State Selector Login Page a la Amazon Protection of Configuration Updated spiders.txt Embed Links with SID in Description Link to comment Share on other sites More sharing options...
agiftcodotcom Posted June 7, 2004 Author Share Posted June 7, 2004 Added the file, thanks :) Contributions I used : Updated 06-13-04 23:42 ---------------- Vote on My Graphis Poll Link to comment Share on other sites More sharing options...
burt Posted June 7, 2004 Share Posted June 7, 2004 http://www.robotstxt.org is a good resource. Link to comment Share on other sites More sharing options...
agiftcodotcom Posted June 7, 2004 Author Share Posted June 7, 2004 Thanks :D Contributions I used : Updated 06-13-04 23:42 ---------------- Vote on My Graphis Poll Link to comment Share on other sites More sharing options...
Recommended Posts
Archived
This topic is now archived and is closed to further replies.