vb.org Archive


dacho 07-09-2014 08:30 AM

How can I stop Unknown robot, spider, crawler
 
Hello mate

How can we stop unknown robots, spiders, and crawlers?
They probably skip robot.txt and don't refer to it at all.
In robot.txt I allowed some bots and blocked all others, and they still scan my site without permission.
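
Editor's note: an "allow some bots, block all others" robots.txt can be checked with Python's standard urllib.robotparser, which is roughly the check a well-behaved crawler performs before fetching a page. This is only a sketch: the bot names, the rules, and the example.com URL are placeholders, not the poster's actual file.

```python
from urllib.robotparser import RobotFileParser

# Hypothetical robots.txt: two named bots may crawl everything,
# every other user agent is asked to stay out entirely.
ROBOTS_TXT = """\
User-agent: Googlebot
Disallow:

User-agent: Bingbot
Disallow:

User-agent: *
Disallow: /
"""

def is_allowed(user_agent, url="http://example.com/showthread.php?t=1"):
    """Return True if a crawler honouring ROBOTS_TXT may fetch the URL."""
    parser = RobotFileParser()
    parser.parse(ROBOTS_TXT.splitlines())
    return parser.can_fetch(user_agent, url)

if __name__ == "__main__":
    for agent in ("Googlebot", "Bingbot", "UnknownBot"):
        print(agent, is_allowed(agent))
```

The check runs on the crawler's side, so it only has an effect on crawlers that choose to run it.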

Dave 07-09-2014 08:36 AM

I believe it usually takes a few days, up to a week, before those robots "look" at the robots.txt file and update it on their side.
By the way, you named it robots.txt, right? With the s after robot?

dacho 07-09-2014 02:08 PM

Yes, my friends, the file is named robots.txt and it sits in the site root.
The problem is not that it takes a few days up to a week before those robots "look" at the robots.txt file and update it on their side.

More likely, a lot of bots, spiders and crawlers really do bypass robots.txt and get in without identifying themselves.

The solution probably needs to combine robots.txt and .htaccess.

See the following guide:
http://michael.langley.id.au/blog/posts/28

Max Taxable 07-09-2014 02:14 PM

Robots.txt isn't a law. Bots aren't required to follow it or even read it. Only the legitimate SE bots will. Bad bots simply ignore it. It's basically just a sign, politely saying "please don't crawl these areas of my site." It's a lot like gun laws in the US: criminals simply ignore them.

For the others, you want the "Ban Spiders by User Agent" Mod.
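
Editor's note: the general idea of banning by user agent can be sketched in a few lines of Python. This is not the mod's code. The blocklist entries and function names below are invented for illustration, and the 403 response is an assumption about how a ban might be answered.

```python
# Hypothetical blocklist of user-agent substrings (placeholder names).
BANNED_AGENT_SUBSTRINGS = ["badbot", "evilcrawler"]

def is_banned(user_agent, banned=BANNED_AGENT_SUBSTRINGS):
    """Return True if the User-Agent header contains any banned substring."""
    ua = (user_agent or "").lower()
    return any(fragment.lower() in ua for fragment in banned)

def status_for(user_agent):
    """Pick an HTTP status: 403 for banned agents, 200 for everyone else."""
    return 403 if is_banned(user_agent) else 200

if __name__ == "__main__":
    print(status_for("Mozilla/5.0 (compatible; BadBot/1.0)"))
    print(status_for("Mozilla/5.0 (Windows NT 6.1) Firefox/30.0"))
```

Unlike robots.txt, this check runs on the server, so the bot has no say in it. It only catches bots that send a recognisable User-Agent string; one that imitates a normal browser passes straight through.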

ForceHSS 07-09-2014 02:15 PM

https://vborg.vbsupport.ru/showthread.php?t=268208
This will stop them as soon as you install it.

Lee G 07-09-2014 10:17 PM

This mod by Paul M will help you track them
https://vborg.vbsupport.ru/showthread.php?t=232182

There used to be an updated spider list for vBulletin on the net. No idea whether that's still shared.

You can then track the heavy hitters and block them
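
Editor's note: independent of that mod, finding the "heavy hitters" can be as simple as counting requests per user agent in the web server's access log. The sketch below assumes the log uses the common "combined" format, where the User-Agent is the last quoted field; the sample lines, addresses, and bot name are made up.

```python
import re
from collections import Counter

# In the combined log format the User-Agent is the last double-quoted field.
UA_PATTERN = re.compile(r'"([^"]*)"\s*$')

def top_user_agents(log_lines, n=3):
    """Count requests per User-Agent and return the n most frequent."""
    counts = Counter()
    for line in log_lines:
        match = UA_PATTERN.search(line)
        if match:
            counts[match.group(1)] += 1
    return counts.most_common(n)

# Made-up sample lines using documentation-only IP addresses.
SAMPLE_LOG = [
    '203.0.113.5 - - [09/Jul/2014:08:30:00 +0000] "GET /index.php HTTP/1.1" 200 512 "-" "HypotheticalBot/1.0"',
    '203.0.113.5 - - [09/Jul/2014:08:30:01 +0000] "GET /forumdisplay.php?f=2 HTTP/1.1" 200 734 "-" "HypotheticalBot/1.0"',
    '198.51.100.7 - - [09/Jul/2014:08:30:02 +0000] "GET /index.php HTTP/1.1" 200 512 "-" "Mozilla/5.0 Firefox/30.0"',
    '203.0.113.5 - - [09/Jul/2014:08:30:03 +0000] "GET /showthread.php?t=1 HTTP/1.1" 200 901 "-" "HypotheticalBot/1.0"',
]

if __name__ == "__main__":
    for agent, hits in top_user_agents(SAMPLE_LOG):
        print(hits, agent)
```

The agents at the top of the list are the candidates to look up and, if they turn out to be unwanted, to block.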

Max Taxable 07-09-2014 10:28 PM

Quote:

Originally Posted by Lee G (Post 2506156)
There used to be an updated spider list for vBulletin on the net. No idea whether that's still shared.

It is:

Updated vBulletin spiders list


But that is NOT the block list for the ban spiders mod. There has never been an "official" list for that, as the Mod is blank when first installed.

ForceHSS 07-09-2014 10:57 PM

When was the last time that list was updated, Max?

ozzy47 07-09-2014 11:00 PM

It was updated in January of this year. It had over 200 spiders added.

ForceHSS 07-09-2014 11:03 PM

So the ones from the last few posts have not been added? Can they be added?

