vb.org Archive

vb.org Archive (https://vborg.vbsupport.ru/index.php)
-   vB3 General Discussions (https://vborg.vbsupport.ru/forumdisplay.php?f=111)
-   -   So why has my site become popular for spiders, webcralwlers and bots now? (https://vborg.vbsupport.ru/showthread.php?t=151492)

Southernphuk 07-06-2007 05:02 AM

So why has my site become popular for spiders, webcralwlers and bots now?
 
Hola,

I've noticed over the last month that the spiders, bots and webcrawlers have really punched it up a notch, where I used to get around 150-200 a day I'm now getting closer to 2000 of them a day now and I can't for the life of me figure out why!

Any clues or ideas for the clueless here? I've got a robot.txt file but that is to keep them out of certain areas that they really have no business bothering with (profiles, pm's, etc) but have shied away from outright cutting them off from the site as I figure that is a good way to have a presence in the search engines which of course everyone wants them to funnel their way.

Anyhow, extremely curious about this and wouldn't mind some thoughts and insight on this.

SCRIPT3R 07-06-2007 05:06 AM

just count yourself lucky and leave it at that.

Southernphuk 07-06-2007 05:16 AM

Hah, believe me, I really don't want to get rid of them as frankly they aren't eating up 'that' much bandwidth (well, the google one ate through a goodly bit last month but still), I'm just curious what triggers something like this.

Regards,
Nathan

SCRIPT3R 07-06-2007 04:10 PM

nobody really knows for sure... luck of the draw.

acertek 07-06-2007 04:14 PM

Yeah I've noticed this lately too. What I've been noticing is inktomi/yahoo seems to have a spider that appears as 100+ users on your forum and crawls it with multiple legs. I think as your forum grows they put largers spiders on it to crawl it faster.

Brandon Sheley 07-06-2007 04:33 PM

back links ;)
Whats your site url ? I'll be able to tell you in a matter on minutes..

mfyvie 07-15-2007 01:33 PM

Quote:

Originally Posted by Southernphuk (Post 1284124)
Any clues or ideas for the clueless here? I've got a robot.txt file but that is to keep them out of certain areas that they really have no business bothering with (profiles, pm's, etc) but have shied away from outright cutting them off from the site as I figure that is a good way to have a presence in the search engines which of course everyone wants them to funnel their way.

Anyhow, extremely curious about this and wouldn't mind some thoughts and insight on this.

Most of the evil ones won't respect robots.txt anyway. You should care about them though - the nastier ones are just stealing your content and bringing you nothing in return. At worst they can bring your site to a standstill if they are poorly written (I've had this happen to me on a few occassions). I block a big list of the bad ones at my webserver and my forum response time increased immediately. I still allow all search engines through unless they are specifically blocked (I think I block around 200).

I also wrote a small mod to remove spiders from the statistics on your forum. This doesn't block anything, but it is useful to get rid of the 100's of sessions started by spiders like Yahoo's slurp.


All times are GMT. The time now is 12:53 AM.

Powered by vBulletin® Version 3.8.12 by vBS
Copyright ©2000 - 2025, vBulletin Solutions Inc.

X vBulletin 3.8.12 by vBS Debug Information
  • Page Generation 0.01704 seconds
  • Memory Usage 1,724KB
  • Queries Executed 10 (?)
More Information
Template Usage:
  • (1)ad_footer_end
  • (1)ad_footer_start
  • (1)ad_header_end
  • (1)ad_header_logo
  • (1)ad_navbar_below
  • (1)bbcode_quote_printable
  • (1)footer
  • (1)gobutton
  • (1)header
  • (1)headinclude
  • (6)option
  • (1)post_thanks_navbar_search
  • (1)printthread
  • (7)printthreadbit
  • (1)spacer_close
  • (1)spacer_open 

Phrase Groups Available:
  • global
  • postbit
  • showthread
Included Files:
  • ./printthread.php
  • ./global.php
  • ./includes/init.php
  • ./includes/class_core.php
  • ./includes/config.php
  • ./includes/functions.php
  • ./includes/class_hook.php
  • ./includes/modsystem_functions.php
  • ./includes/class_bbcode_alt.php
  • ./includes/class_bbcode.php
  • ./includes/functions_bigthree.php 

Hooks Called:
  • init_startup
  • init_startup_session_setup_start
  • init_startup_session_setup_complete
  • cache_permissions
  • fetch_threadinfo_query
  • fetch_threadinfo
  • fetch_foruminfo
  • style_fetch
  • cache_templates
  • global_start
  • parse_templates
  • global_setup_complete
  • printthread_start
  • bbcode_fetch_tags
  • bbcode_create
  • bbcode_parse_start
  • bbcode_parse_complete_precache
  • bbcode_parse_complete
  • printthread_post
  • printthread_complete