#1
08-15-2008, 01:12 PM
Chachacha
Join Date: Jul 2006
Posts: 173
Thanks given: 0
Thanked 0 times in 0 posts
robots.txt help

Hello all. I have been doing a bit of research here on vb.org about robots.txt and have added it to my site... but it's been 3 days and nothing has changed. My biggest problem is the Yahoo! Slurp spider. There are constantly multiple instances of it on my Who's Online page, and they keep spidering my newthread.php and other pages that I don't want them wasting their time on. Google is a good boy: it only appears once at any given time in my WOL, and it seems to crawl the pages that I'd like it to crawl.

I've read that the robots.txt will fix this problem... but nothing has changed in 3 days. Isn't it supposed to take effect within 24 hrs?

I chmodded it to 755. Is that correct? If not, what should I chmod it to?

Here's an exact copy of my robots.txt. Tell me if it's correct...

Code:
User-agent: Slurp
Crawl-delay: 60
User-agent: Fasterfox
Disallow: /
User-agent: *
Disallow: /forums/admincp/
Disallow: /forums/clientscript/
Disallow: /forums/cpstyles/
Disallow: /forums/customavatars/
Disallow: /forums/customprofilepics/
Disallow: /forums/images/
Disallow: /forums/modcp/
Disallow: /forums/ajax.php
Disallow: /forums/attachment.php
Disallow: /forums/calendar.php
Disallow: /forums/cron.php
Disallow: /forums/editpost.php
Disallow: /forums/global.php
Disallow: /forums/image.php
Disallow: /forums/inlinemod.php
Disallow: /forums/joinrequests.php
Disallow: /forums/login.php
Disallow: /forums/misc.php
Disallow: /forums/moderator.php
Disallow: /forums/newattachment.php
Disallow: /forums/newreply.php
Disallow: /forums/newthread.php
Disallow: /forums/online.php
Disallow: /forums/poll.php
Disallow: /forums/postings.php
Disallow: /forums/printthread.php
Disallow: /forums/private.php
Disallow: /forums/profile.php
Disallow: /forums/register.php
Disallow: /forums/report.php
Disallow: /forums/reputation.php
Disallow: /forums/search.php
Disallow: /forums/sendmessage.php
Disallow: /forums/showpost.php
Disallow: /forums/subscription.php
Disallow: /forums/threadrate.php
Disallow: /forums/usercp.php
Disallow: /forums/usernote.php
Disallow: /forums/credits.php
Disallow: /forums/arcade.php
Disallow: /forums/vbimagehost.php
#2
08-15-2008, 03:04 PM
Lynne
Join Date: Sep 2004
Location: California/Idaho
Posts: 41,180
Thanks given: 0
Thanked 0 times in 0 posts

The Yahoo! Slurp spider can take several days, if not a couple of weeks, before it starts obeying your robots.txt file. It will eventually do so; you just need to be patient.
#3
08-15-2008, 03:14 PM
Chachacha
Join Date: Jul 2006
Posts: 173
Thanks given: 0
Thanked 0 times in 0 posts

Quote:
Originally Posted by Lynne
The Yahoo! Slurp spider can take several days, if not a couple of weeks, before it starts obeying your robots.txt file. It will eventually do so; you just need to be patient.
Thanks for your reply. One more question. In the part of the robots.txt file that says:

User-agent: Slurp
Crawl-delay: 60

...should I change "Slurp" to "Yahoo! Slurp Spider"?
#4
08-15-2008, 03:52 PM
Lynne
Join Date: Sep 2004
Location: California/Idaho
Posts: 41,180
Thanks given: 0
Thanked 0 times in 0 posts

According to this link, you just use Slurp. http://help.yahoo.com/l/us/yahoo/sea.../slurp-03.html
#5
08-15-2008, 04:15 PM
Chachacha
Join Date: Jul 2006
Posts: 173
Thanks given: 0
Thanked 0 times in 0 posts

Quote:
Originally Posted by Lynne
According to this link, you just use Slurp. http://help.yahoo.com/l/us/yahoo/sea.../slurp-03.html
You've been most helpful. Thank you very much.
#6
08-29-2008, 07:17 PM
Chachacha
Join Date: Jul 2006
Posts: 173
Thanks given: 0
Thanked 0 times in 0 posts

It's been 2 weeks and the Yahoo! Slurp spider is still visiting pages that I've disallowed in my robots.txt.
#7
08-29-2008, 07:46 PM
SEOvB
Join Date: May 2007
Location: Indianapolis
Posts: 2,451
Thanks given: 0
Thanked 0 times in 0 posts

Quote:
Originally Posted by Chachacha
It's been 2 weeks and the Yahoo! Slurp spider is still visiting pages that I've disallowed in my robots.txt.
Make sure the robots.txt is in your web root and not the forum root.

Make sure the paths are correct. For example, if your forums are in the domain.com/forums/ folder, then your posted robots.txt is correct. If your forums are not in the domain.com/forums/ folder, then your robots.txt is incorrect.
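To illustrate the "web root" advice: crawlers request robots.txt from the root of the host, not from the forum folder. The sketch below derives that location from any page URL. `robots_url` is a hypothetical helper name, and domain.com is only the placeholder domain used in this thread.

```python
from urllib.parse import urlsplit, urlunsplit


def robots_url(page_url: str) -> str:
    """Return the robots.txt URL that applies to page_url.

    The file is looked up at the root of the host, so the path,
    query string, and fragment of the page URL are discarded.
    """
    parts = urlsplit(page_url)
    return urlunsplit((parts.scheme, parts.netloc, "/robots.txt", "", ""))


if __name__ == "__main__":
    # A forum page under /forums/ still maps to the root-level file.
    print(robots_url("http://domain.com/forums/newthread.php"))
```

A copy at domain.com/forums/robots.txt would therefore never be requested, which matches the point that only the copy in the web root matters.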
#8
08-29-2008, 07:51 PM
Chachacha
Join Date: Jul 2006
Posts: 173
Thanks given: 0
Thanked 0 times in 0 posts

Quote:
Originally Posted by FRDS
Make sure the robots.txt is in your web root and not the forum root.

Make sure the paths are correct. For example, if your forums are in the domain.com/forums/ folder, then your posted robots.txt is correct. If your forums are not in the domain.com/forums/ folder, then your robots.txt is incorrect.
I have 3 copies of my robots.txt. One is in the top-level folder you see when you open "File Manager" in cPanel, one is in the "public_html" folder, and one is in the "forums" folder. Yes, the path is correct.
#9
08-29-2008, 07:53 PM
SEOvB
Join Date: May 2007
Location: Indianapolis
Posts: 2,451
Thanks given: 0
Thanked 0 times in 0 posts

You don't need any copy other than the one in the public_html folder. To my knowledge the others aren't hurting anything by being in different locations, but they are useless.

If the paths are correct and the Yahoo spider is still going insane, try running your robots.txt through a robots.txt generator or checker (many are available free online) to make sure there are no errors.

After that, I don't have a clue why Yahoo isn't picking it up.
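As a rough local alternative to an online checker, here is a minimal sketch using Python's standard-library `urllib.robotparser`. It is only an approximation: this parser may not behave the way Yahoo's crawler did, and the embedded text is a shortened excerpt of the file posted in this thread. `is_allowed` is a hypothetical helper name.

```python
from urllib.robotparser import RobotFileParser

# Shortened excerpt of the robots.txt posted in this thread.
ROBOTS_TXT = """\
User-agent: Slurp
Crawl-delay: 60
User-agent: Fasterfox
Disallow: /
User-agent: *
Disallow: /forums/admincp/
Disallow: /forums/newthread.php
"""


def is_allowed(robots_txt: str, agent: str, path: str) -> bool:
    """Report whether `agent` may fetch `path` under the given rules."""
    parser = RobotFileParser()
    parser.parse(robots_txt.splitlines())
    return parser.can_fetch(agent, path)


if __name__ == "__main__":
    for agent in ("Slurp", "Googlebot", "Fasterfox"):
        allowed = is_allowed(ROBOTS_TXT, agent, "/forums/newthread.php")
        print(agent, "allowed" if allowed else "blocked")
```

Under this parser, a crawler that matches its own User-agent group uses only that group. The Slurp group here contains a Crawl-delay but no Disallow lines, so the check reports newthread.php as allowed for Slurp while it is blocked for crawlers that fall through to the `*` group. If a real crawler behaves the same way, repeating the Disallow lines inside the Slurp group may be worth trying.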
#10
08-29-2008, 08:16 PM
Chachacha
Join Date: Jul 2006
Posts: 173
Thanks given: 0
Thanked 0 times in 0 posts

Quote:
Originally Posted by FRDS
You don't need any copy other than the one in the public_html folder. To my knowledge the others aren't hurting anything by being in different locations, but they are useless.

If the paths are correct and the Yahoo spider is still going insane, try running your robots.txt through a robots.txt generator or checker (many are available free online) to make sure there are no errors.

After that, I don't have a clue why Yahoo isn't picking it up.
Thank you. I will try that.

EDIT: There were errors. I should have had an empty line between each block of rules. I fixed it. Hope that was the problem.
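The fix described in the EDIT, an empty line between each block, can be sketched in Python. This is not the checker that was actually used; `separate_records` is a hypothetical helper, and it assumes every block starts with one or more User-agent lines.

```python
def separate_records(robots_txt: str) -> str:
    """Insert one blank line before each new User-agent block.

    Consecutive User-agent lines are treated as one block, and
    existing blank lines are collapsed to a single blank line.
    """
    out = []
    prev_was_rule = False  # True once a non-User-agent field was seen
    for raw in robots_txt.splitlines():
        line = raw.strip()
        if not line:
            if out and out[-1] != "":
                out.append("")
            prev_was_rule = False
            continue
        if line.startswith("#"):
            out.append(line)  # comments do not start or end a block
            continue
        field = line.split(":", 1)[0].strip().lower()
        if field == "user-agent" and prev_was_rule:
            out.append("")
        out.append(line)
        prev_was_rule = field != "user-agent"
    while out and out[-1] == "":
        out.pop()
    return "\n".join(out) + "\n"


if __name__ == "__main__":
    posted = (
        "User-agent: Slurp\nCrawl-delay: 60\n"
        "User-agent: Fasterfox\nDisallow: /\n"
        "User-agent: *\nDisallow: /forums/admincp/\n"
    )
    print(separate_records(posted), end="")
```

Run on the first lines of the posted file, this prints the three blocks (Slurp, Fasterfox, `*`) with an empty line between them and leaves the rules themselves unchanged.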

All times are GMT.