vb.org Archive

vbplusme 04-23-2009 02:37 AM

robots.txt for 3.8.2 - Any Ideas?
 
Hello and Greetings,

I have just noticed that Google Webmaster Tools is complaining about a LOT of URLs being restricted by my robots.txt file.

Is anyone else having this problem? If not, can I get an example of a robots.txt written for 3.8.2?

I tweaked mine thinking that I was fixing a duplicate content problem but I apparently crossed the line on it :D

Any ideas, suggestions greatly appreciated.

TIA

Dismounted 04-23-2009 05:23 AM

What is it currently?

vbplusme 04-23-2009 07:20 AM

Sorry, should have thought to post it:

Quote:


User-agent: *

#Crawl-Delay: 10

Disallow: /admincp/
Disallow: /ajax.php
Disallow: /announcement.php
Disallow: /archive/
Disallow: /attachment.php
Disallow: /calendar.php
Disallow: /cgi-bin/
Disallow: /chat/
Disallow: /clientscript/
Disallow: /converse.php
Disallow: /cpstyles/
Disallow: /cron.php
Disallow: /customavatars/
Disallow: /customgroupicons/
Disallow: /customprofilepics/
Disallow: /editpost.php
Disallow: /faq.php
Disallow: /forumdisplay.php?daysprune
Disallow: /forumdisplay.php?do
Disallow: /forumdisplay.php?order
Disallow: /forumdisplay.php?page
Disallow: /forumdisplay.php?pp
Disallow: /forumdisplay.php?sort
Disallow: /gallery/
Disallow: /global.php
Disallow: /group_inlinemod.php
Disallow: /groupsubscription.php
Disallow: /images/
Disallow: /includes/
Disallow: /infraction.php
Disallow: /inlinemod.php
Disallow: /joinrequests.php
Disallow: /login.php
Disallow: /member.php
Disallow: /member_inlinemod.php
Disallow: /memberlist.php
Disallow: /misc.php
Disallow: /modcp/
Disallow: /moderation.php
Disallow: /moderator.php
Disallow: /newattachment.php
Disallow: /newreply.php
Disallow: /newthread.php
Disallow: /online.php
Disallow: /payment_gateway.php
Disallow: /payments.php
Disallow: /personal/
Disallow: /printthread.php
Disallow: /profile.php?do
Disallow: /register.php
Disallow: /report.php
Disallow: /search.php
Disallow: /sendmessage.php
Disallow: /showpost.php
Disallow: /showthread.php?goto
Disallow: /showthread.php?mode
Disallow: /showthread.php?p
Disallow: /showthread.php?page
Disallow: /showthread.php?post
Disallow: /showthread.php?pp
Disallow: /signaturepics/
Disallow: /subscription.php

User-Agent: msnbot
Crawl-Delay: 10

User-Agent: Slurp
Crawl-Delay: 10
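
For anyone wanting to check which URLs a list like this actually blocks, here is a rough sketch in Python. It assumes the classic robots.txt behaviour where each Disallow value is matched as a plain prefix of the URL path plus query string. It ignores wildcards, Allow lines, and per-bot groups, so it will not agree with Google's own parser in every detail. The function name `blocking_rule` and the sample URLs are made up for illustration; only the Disallow values come from the file above.

```python
# Simplified robots.txt check: each Disallow value is treated as a plain
# prefix of "path?query". No wildcards, Allow lines, or per-bot groups.

# A subset of the Disallow lines from the posted file, in the same order.
DISALLOW = [
    "/member.php",
    "/printthread.php",
    "/forumdisplay.php?page",
    "/showthread.php?goto",
    "/showthread.php?mode",
    "/showthread.php?p",
    "/showthread.php?page",
    "/showthread.php?post",
    "/showthread.php?pp",
]


def blocking_rule(path, rules=DISALLOW):
    """Return the first Disallow prefix that matches path, or None if allowed."""
    for rule in rules:
        if path.startswith(rule):
            return rule
    return None


# Hypothetical URLs, for illustration only.
samples = [
    "/showthread.php?t=211965",
    "/showthread.php?t=211965&page=2",
    "/showthread.php?page=2&t=211965",
    "/showthread.php?p=1797267",
    "/forumdisplay.php?f=232",
    "/member.php?u=1",
]

for path in samples:
    rule = blocking_rule(path)
    print(f"{path:40} {'blocked by ' + rule if rule else 'allowed'}")
```

Under this prefix assumption, /showthread.php?page=2&t=211965 is already caught by the shorter /showthread.php?p line, which suggests the ?page, ?post and ?pp lines are redundant, while a URL that puts t= first is not caught by any of the showthread lines.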


veenuisthebest 04-23-2009 08:25 AM

Two points in addition to the robots.txt above:

1. We should not list our admincp directory in robots.txt, as that exposes its location to the world. What is the use of the admincp renaming feature then?

2. Also, it's good to give a reference to our sitemap at the end of robots.txt:

Sitemap: http://site.com/sitemap_index.xml.gz

vbplusme 04-23-2009 08:54 AM

Thanks for the comments. I had not thought about the sitemap reference, so thanks for that. I double password protect my admincp folder, though I could easily take it out of the list altogether as the bots cannot access it anyway, so thanks for that comment as well.

vbplusme 04-24-2009 06:36 PM

Does anyone see any problem with the content of this robots.txt, or have any idea how to stop Google complaining about the restrictions? TIA

hambil 04-24-2009 10:33 PM

Quote:

Originally Posted by vbplusme (Post 1797267)
Thanks for the comments. I had not thought about the sitemap reference, so thanks for that. I double password protect my admincp folder, though I could easily take it out of the list altogether as the bots cannot access it anyway, so thanks for that comment as well.

It's not an issue of whether they can access it; it's whether they try. Everything they try and fail at is wasted bandwidth and resources. If you password protect your admincp and modcp directories, there is no reason to leave them out of robots.txt.

I pretty much followed the advice in this article, and have had no complaints from Google: http://www.theadminzone.com/forums/s...ad.php?t=19872

vbplusme 04-25-2009 01:16 AM

As a matter of fact, I did use those guidelines (and the follow-on suggestions in that thread) to construct my robots.txt. I forgot about that, thanks for reminding me.

vbplusme 04-26-2009 08:35 AM

I do have a follow-on question about the robots.txt file that I am currently using. I have the vBulletin blog software installed on this site as well as WordPress. I have no disallows in this robots.txt for any blog files. I would not have thought anything about it, except that I just looked at my sitemap and saw a huge number of URLs for blog stuff that doesn't really exist, like archives from 1983.

Does anyone have a suggestion for a sensible robots.txt entry for both the vBulletin blog and WordPress?

TIA for any ideas.

hambil 04-26-2009 11:12 AM

Sounds like a sitemap issue, not a robots.txt issue. I know that's not an answer per se, but I'd be looking at why your sitemap contains links that don't exist instead.
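
One way to start on that would be to count the sitemap's URLs by their first path segment, to see which part of the site the unexpected entries come from. The sketch below is only an illustration: the sitemap content, the paths in it, and the function name `count_by_first_segment` are all made up, and a real sitemap would be read from disk (and gunzipped first if it is a .gz file).

```python
import xml.etree.ElementTree as ET
from collections import Counter
from urllib.parse import urlparse

# Made-up sitemap content, for illustration only.
SITEMAP_XML = """<urlset xmlns="http://www.sitemaps.org/schemas/sitemap/0.9">
  <url><loc>http://site.com/showthread.php?t=1</loc></url>
  <url><loc>http://site.com/blog.php?b=7</loc></url>
  <url><loc>http://site.com/wordpress/1983/01/</loc></url>
  <url><loc>http://site.com/wordpress/1983/02/</loc></url>
</urlset>"""

# Namespace used by the sitemaps.org protocol.
NS = {"sm": "http://www.sitemaps.org/schemas/sitemap/0.9"}


def count_by_first_segment(xml_text):
    """Count <loc> URLs grouped by the first segment of their path."""
    root = ET.fromstring(xml_text)
    counts = Counter()
    for loc in root.findall("sm:url/sm:loc", NS):
        path = urlparse(loc.text.strip()).path
        # "/wordpress/1983/01/" -> "wordpress"; "/" or "" -> ""
        counts[path.strip("/").split("/")[0]] += 1
    return counts


counts = count_by_first_segment(SITEMAP_XML)
for segment, n in counts.most_common():
    print(f"{n:5d}  /{segment}")
```

If one segment accounts for most of the bogus entries, that points at whichever script or plugin generates URLs for that part of the site, rather than at robots.txt.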

