  #1  
Old 02-10-2010, 11:34 PM
Glenn_E Glenn_E is offline
 
Join Date: Dec 2009
Posts: 12
Thanks given: 0
Thanked 0 times in 0 posts
How do I limit the # of Google and Yahoo crawler bots?

I'm suddenly getting 10-15 Google/Yahoo bots at a time. How do I limit the number of bot connections?
  #2  
Old 02-14-2010, 12:30 AM
Andy Andy is offline
 
Join Date: Sep 2003
Location: San Francisco
Posts: 138
Thanks given: 0
Thanked 0 times in 0 posts
Default

Normally you only see one or two Google spiders at a time. With Yahoo, however, you can see dozens or even hundreds of spiders. Here is an example robots.txt that allows only Google and Bing (MSNBot) to index your forum.

User-agent: Googlebot
Disallow:

User-agent: MSNBot
Disallow:

User-agent: *
Disallow: /
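
One way to sanity-check rules like these before uploading them is Python's standard urllib.robotparser module. This is only a rough sketch: the forum URL is a made-up placeholder, and the parser shows how a crawler that follows the rules would read them, not what any particular bot actually does.

```python
from urllib.robotparser import RobotFileParser

# The rules quoted in the post above, copied verbatim.
ROBOTS_TXT = """\
User-agent: Googlebot
Disallow:

User-agent: MSNBot
Disallow:

User-agent: *
Disallow: /
"""

# Hypothetical forum URL, used only for illustration.
TEST_URL = "http://forum.example.com/index.php"


def build_parser(robots_txt: str) -> RobotFileParser:
    """Parse robots.txt text without fetching anything over the network."""
    parser = RobotFileParser()
    parser.parse(robots_txt.splitlines())
    return parser


parser = build_parser(ROBOTS_TXT)

# Ask the parser whether each crawler token may fetch the test URL.
results = {
    agent: parser.can_fetch(agent, TEST_URL)
    for agent in ("Googlebot", "MSNBot", "Slurp")
}

for agent, allowed in results.items():
    print(f"{agent}: {'allowed' if allowed else 'blocked'}")
```

An empty Disallow: line means "nothing is disallowed", so Googlebot and MSNBot are allowed, while Slurp falls through to the catch-all * entry and is blocked.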
  #3  
Old 02-15-2010, 03:29 AM
gulfcoasttundra gulfcoasttundra is offline
 
Join Date: Sep 2009
Location: Houston
Posts: 17
Thanks given: 0
Thanked 0 times in 0 posts
Default

On the flip side of this issue, how do you make your site more visible to these bots/spiders to improve searchability?

Thanks
Ben
  #4  
Old 02-15-2010, 04:15 AM
final kaoss final kaoss is offline
 
Join Date: Apr 2006
Posts: 1,314
Thanks given: 0
Thanked 0 times in 0 posts
Default

Glenn, go to google.com/webmasters and, once you've configured your site there, click your site and then go to:

*Site configuration
*Settings
*Crawl Rate, where you can set the crawl rate to be faster or slower
  #5  
Old 02-21-2010, 02:41 AM
Glenn_E Glenn_E is offline
 
Join Date: Dec 2009
Posts: 12
Thanks given: 0
Thanked 0 times in 0 posts
Default

Thanks guys... I'm still bringing my site up to speed and am not ready for the swarm of crawlers. Nice to know that my site is hitting the top of the search list for the keywords I use.
  #6  
Old 02-22-2010, 02:22 AM
Videx Videx is offline
 
Join Date: Feb 2007
Posts: 3,085
Thanks given: 0
Thanked 0 times in 0 posts
Default

I wouldn't completely disallow a spider. After all, you want them to index things. I used this instead:
Quote:
User-agent: Slurp
Crawl-delay: 1500
Slurp was the only one I saw with an inordinate number of connections (in the hundreds).
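
For anyone who wants to confirm how a rule like this parses, here is a small sketch using Python's standard urllib.robotparser (its crawl_delay method needs Python 3.6 or later). The URL is a made-up placeholder. Keep in mind that Crawl-delay is not part of the original robots.txt standard: crawlers that honor it generally read the value as seconds, and Googlebot ignores it, which is why Google's crawl rate is set in Webmaster Tools instead.

```python
from urllib.robotparser import RobotFileParser

# The rule quoted in the post above, copied verbatim.
RULES = """\
User-agent: Slurp
Crawl-delay: 1500
"""

# Hypothetical forum URL, used only for illustration.
TEST_URL = "http://forum.example.com/index.php"

parser = RobotFileParser()
parser.parse(RULES.splitlines())

# The delay the parser associates with the Slurp token.
slurp_delay = parser.crawl_delay("Slurp")

# There is no Disallow line, so Slurp may still fetch pages.
slurp_allowed = parser.can_fetch("Slurp", TEST_URL)

print("Slurp crawl delay:", slurp_delay)
print("Slurp allowed:", slurp_allowed)
```

The point of the check is that the spider is throttled but not shut out, which matches the intent of the post above.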
  #7  
Old 02-23-2010, 10:12 AM
toibs toibs is offline
 
Join Date: Nov 2001
Location: Birmingham, UK
Posts: 55
Thanks given: 0
Thanked 0 times in 0 posts
Default

Is there any way to stop Google from indexing one specific forum using robots.txt, rather than just enabling or disabling it altogether?

Thx
Paul
  #8  
Old 02-23-2010, 02:21 PM
Videx Videx is offline
 
Join Date: Feb 2007
Posts: 3,085
Thanks given: 0
Thanked 0 times in 0 posts
Default

Quote:
Originally Posted by toibs
Is there any way to disallow google from indexing one specific forum
I've always understood that the bots all browse the forum with Guest permissions (I hope!), so you can just disallow Guests from seeing that forum.
All times are GMT.

