Go Back   vb.org Archive > Community Discussions > Modification Requests/Questions (Unpaid)
FAQ Community Calendar Today's Posts Search

Reply
 
Thread Tools Display Modes
  #1  
Old 07-07-2004, 01:57 AM
Limey-YMR Limey-YMR is offline
 
Join Date: Jul 2004
Location: Wake Forest, NC
Posts: 40
Благодарил(а): 0 раз(а)
Поблагодарили: 0 раз(а) в 0 сообщениях
Default Limit Spidering to ONLY the archive

I help run a car club forum, it's something of a membership by association of car colour, therefore we aren't too bothered about having too many new (and potential non-same-car owning members)

I was wondering if it was possible by .htaccess or by hack of configuring our server so that spiders, mainly MSN, google and Inktomi/Yahoo could only "see" the Archive.

We are at 1and1 hosting - 100MB mySQL limit and today our web server was blocked from the database server for exactly two hours (to the minute), I believe this was their own firewall blocking us, possibly due to "over spidering" or our forum was just plain too busy for 1and1's sensitive firewall / IDS rules whatever the block cause, I would like to see if someone knows of an efficient way of herding the spiders
so I can spend time on admin/installing hacks and not have to learn how to code them or .htaccess files

Any help is greatly appreciated.

you know what they say - you can lead a spider to an archive, but you can't make it index.
Reply With Quote
  #2  
Old 07-07-2004, 02:07 AM
AN-net's Avatar
AN-net AN-net is offline
 
Join Date: Dec 2003
Location: AnimationTalk.com
Posts: 2,367
Благодарил(а): 0 раз(а)
Поблагодарили: 0 раз(а) в 0 сообщениях
Default

um your best bet would be to use a robot.txt file and just list what forum files you dont want them to visit
Reply With Quote
  #3  
Old 07-07-2004, 02:37 AM
Limey-YMR Limey-YMR is offline
 
Join Date: Jul 2004
Location: Wake Forest, NC
Posts: 40
Благодарил(а): 0 раз(а)
Поблагодарили: 0 раз(а) в 0 сообщениях
Default

Quote:
Originally Posted by AN-net
um your best bet would be to use a robot.txt file and just list what forum files you dont want them to visit
I thought a robots.txt file was for denying the robot altogether, I don't think it's selective, that would be the .htaccess
if anyone knows the .htaccess syntax it would be cool.

EDIT: my mistake, a bit of RTFM unearthed this handy resource for blocking spiders from specific resources
http://www.chami.com/tips/internet/010198I.html
Reply With Quote
  #4  
Old 07-13-2004, 03:55 AM
Limey-YMR Limey-YMR is offline
 
Join Date: Jul 2004
Location: Wake Forest, NC
Posts: 40
Благодарил(а): 0 раз(а)
Поблагодарили: 0 раз(а) в 0 сообщениях
Default

Robots wouldn't really prevent them from seeing *only* the archive, I want to deny them from seeing index.php, and since the archive is linked from there, they wouldn't spider anything - I want to make the board look like it's just an archive to a spider automagically.

Does anyone know if this is even a moot point? I've noticed that the spiders act differently these days and read the vbulletin three no longer uses session IDs (cookie based ony?)

I've switched the archive off since the spiders were pretty much ignoring it and it was probably causing them to hang around longer - I'm not so bothered about search engine exposure as I am about 8 spiders raping the site at once!
Reply With Quote
Reply


Posting Rules
You may not post new threads
You may not post replies
You may not post attachments
You may not edit your posts

BB code is On
Smilies are On
[IMG] code is On
HTML code is Off

Forum Jump


All times are GMT. The time now is 06:24 PM.


Powered by vBulletin® Version 3.8.12 by vBS
Copyright ©2000 - 2025, vBulletin Solutions Inc.
X vBulletin 3.8.12 by vBS Debug Information
  • Page Generation 0.03821 seconds
  • Memory Usage 2,189KB
  • Queries Executed 13 (?)
More Information
Template Usage:
  • (1)SHOWTHREAD
  • (1)ad_footer_end
  • (1)ad_footer_start
  • (1)ad_header_end
  • (1)ad_header_logo
  • (1)ad_navbar_below
  • (1)ad_showthread_beforeqr
  • (1)ad_showthread_firstpost
  • (1)ad_showthread_firstpost_sig
  • (1)ad_showthread_firstpost_start
  • (1)bbcode_quote
  • (1)footer
  • (1)forumjump
  • (1)forumrules
  • (1)gobutton
  • (1)header
  • (1)headinclude
  • (1)navbar
  • (3)navbar_link
  • (120)option
  • (4)post_thanks_box
  • (4)post_thanks_button
  • (1)post_thanks_javascript
  • (1)post_thanks_navbar_search
  • (4)post_thanks_postbit_info
  • (4)postbit
  • (4)postbit_onlinestatus
  • (4)postbit_wrapper
  • (1)spacer_close
  • (1)spacer_open
  • (1)tagbit_wrapper 

Phrase Groups Available:
  • global
  • inlinemod
  • postbit
  • posting
  • reputationlevel
  • showthread
Included Files:
  • ./showthread.php
  • ./global.php
  • ./includes/init.php
  • ./includes/class_core.php
  • ./includes/config.php
  • ./includes/functions.php
  • ./includes/class_hook.php
  • ./includes/modsystem_functions.php
  • ./includes/functions_bigthree.php
  • ./includes/class_postbit.php
  • ./includes/class_bbcode.php
  • ./includes/functions_reputation.php
  • ./includes/functions_post_thanks.php 

Hooks Called:
  • init_startup
  • init_startup_session_setup_start
  • init_startup_session_setup_complete
  • cache_permissions
  • fetch_threadinfo_query
  • fetch_threadinfo
  • fetch_foruminfo
  • style_fetch
  • cache_templates
  • global_start
  • parse_templates
  • global_setup_complete
  • showthread_start
  • showthread_getinfo
  • forumjump
  • showthread_post_start
  • showthread_query_postids
  • showthread_query
  • bbcode_fetch_tags
  • bbcode_create
  • showthread_postbit_create
  • postbit_factory
  • postbit_display_start
  • post_thanks_function_post_thanks_off_start
  • post_thanks_function_post_thanks_off_end
  • post_thanks_function_fetch_thanks_start
  • post_thanks_function_fetch_thanks_end
  • post_thanks_function_thanked_already_start
  • post_thanks_function_thanked_already_end
  • fetch_musername
  • postbit_imicons
  • bbcode_parse_start
  • bbcode_parse_complete_precache
  • bbcode_parse_complete
  • postbit_display_complete
  • post_thanks_function_can_thank_this_post_start
  • tag_fetchbit_complete
  • forumrules
  • navbits
  • navbits_complete
  • showthread_complete