vb.org Archive (https://vborg.vbsupport.ru/index.php)
-   vBulletin 4.x Add-ons (https://vborg.vbsupport.ru/forumdisplay.php?f=245)
-   -   Miscellaneous Hacks - Ban Spiders by User Agent (https://vborg.vbsupport.ru/showthread.php?t=268208)

Max Taxable 03-21-2012 01:26 AM

That dude has a lot of garbage on his computer.

Simon Lloyd 03-21-2012 07:56 AM

Use this tool: http://user-agent-string.info/parse. It will break down the UA into its component parts :)
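For anyone who wants to do a rough breakdown locally, here is a minimal sketch in Python. It assumes the simple "product/version (field; field; ...)" layout used by the MJ12bot string quoted later in this thread; real UA strings vary a lot, and `split_user_agent` is a hypothetical helper name, not part of the mod or of the linked tool.

```python
import re

# Assumed, simplified layout: product/version followed by one parenthesised comment.
UA_PATTERN = re.compile(
    r"\s*(?P<product>[^\s/]+)(?:/(?P<version>\S+))?\s*(?:\((?P<comment>[^)]*)\))?"
)


def split_user_agent(ua):
    """Split a UA string into leading product, version, and comment fields."""
    match = UA_PATTERN.match(ua)
    if match is None:
        # Empty or whitespace-only input: nothing to break down.
        return {"product": None, "version": None, "details": []}
    comment = match.group("comment") or ""
    # The comment fields are separated by semicolons.
    details = [part.strip() for part in comment.split(";") if part.strip()]
    return {
        "product": match.group("product"),
        "version": match.group("version"),
        "details": details,
    }


parts = split_user_agent(
    "Mozilla/5.0 (compatible; MJ12bot/v1.4.2; http://www.majestic12.co.uk/bot.php?+)"
)
print(parts)
```

The bot's own name sits in the comment fields rather than in the leading product token, which is why it helps to look at the whole string.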

BadgerDog 03-21-2012 09:08 AM

Quote:

Originally Posted by Simon Lloyd (Post 2311641)
Use this tool: http://user-agent-string.info/parse. It will break down the UA into its component parts :)

Thank you Simon... :)

Very useful ... :up:

Regards,
Doug

Max Taxable 03-21-2012 02:59 PM

Quote:

Originally Posted by Simon Lloyd (Post 2311641)
Use this tool: http://user-agent-string.info/parse. It will break down the UA into its component parts :)

U da Man Simon.

Simon Lloyd 03-21-2012 04:18 PM

Lol, thanks! I try ;)

baileyjojoms 03-22-2012 01:27 AM

Quote:

Originally Posted by BadgerDog (Post 2311313)
Do you have a link?

I can't seem to find the right page....

Thanks .. :)

Regards,
Doug

Yes, I ensured that the following was in my robots.txt file:

User-agent: Baiduspider
Disallow: /
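
A minimal sketch, assuming Python's standard `urllib.robotparser`, of how these two lines can be sanity-checked before relying on them. The path `/index.php` and the second agent name are illustrative only. Note that robots.txt is advisory: this only confirms what the rules say, not that a crawler will obey them.

```python
from urllib.robotparser import RobotFileParser

# The two rules from the post above, fed to the parser as text.
rules = """User-agent: Baiduspider
Disallow: /
"""

parser = RobotFileParser()
parser.parse(rules.splitlines())

# The path and the second agent name are illustrative assumptions.
baidu_allowed = parser.can_fetch("Baiduspider", "/index.php")
other_allowed = parser.can_fetch("Googlebot", "/index.php")

print("Baiduspider allowed:", baidu_allowed)
print("Other crawler allowed:", other_allowed)
```

With only this one rule group, Baiduspider is denied everything while crawlers that match no group remain allowed.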


Then I sent an email to: spiderhelp@baidu.com

Here is the message and reply I received:


Quote:

Dear,

Thank you for your email.
We have updated our DNS record to make our spider behave the way requested in your robots file.
Should you need further assistance, please do not hesitate to contact us.

Best Regards,
Stephy Wu
Baidu Spider Team

________________________________________
re: Continuous Crawling of my site

To whom it may concern;

I have been trying for a month now to halt all crawling of my site by Baidu. I have added the following code to my robots.txt file:
User-agent: Baiduspider
Disallow: /

This was done 3 weeks ago. However I am being crawled daily.

Baidu is eating up a ton of server resources every day and causing slow load times. I also employed a spider ban modification, and have banned more than 28,000 Baidu spider entries in 3 weeks.

This is ridiculous. I am asking you to immediately halt all crawling of my site by Baidu.

I have not seen hide nor hair of Baidu since this was done, nearly a month ago.

To find the email address I went to their website, translated the page into English, and then searched for Baidu Spider. That took me to a search results page, which led me to this page:
http://www.baidu.com/search/spider.html

I simply translated to English, and found the info I was looking for.

Baidu was the ONLY spider that was causing major issues, using massive amounts of resources. Now I am able to use this add-on for the other spiders.

Hope this helps.

BadgerDog 03-22-2012 10:42 AM

Quote:

Originally Posted by baileyjojoms (Post 2311920)

Hope this helps.

Yes, thank you very much ... :)

Regards,
Doug

Alan_SP 03-28-2012 04:40 PM

I have problems with Majestic's MJ12bot. I tried redirecting bad spiders to their own IP, or to the HTML address given in the mod. AFAIK all spiders other than MJ12bot are gone (new ones turn up here and there, but I remove them).

Do you know why this spider is successful in avoiding this mod?

Simon Lloyd 03-28-2012 05:08 PM

The next time the spider appears, open Who's Online, choose to show the user agent at the bottom, and copy its entire UA string into the list :)

Alan_SP 03-28-2012 06:07 PM

Thanks for the info. I hadn't noticed this before. I'll wait until it shows up again (hopefully never).

I also installed vB Bad Behavior: https://vborg.vbsupport.ru/showthread.php?t=261498

EDIT: I found this info about it:

Mozilla/5.0 (compatible; MJ12bot/v1.4.2; http://www.majestic12.co.uk/bot.php?+)

I now use only this string in spider list settings: MJ12bot.
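
A minimal sketch of why the shorter string may work better. It assumes the mod does a case-insensitive substring match of each list entry against the visitor's UA; I have not checked how the mod actually matches, and `is_banned` is a hypothetical helper, not the mod's code.

```python
from typing import Iterable

# UA string quoted above.
MJ12_UA = "Mozilla/5.0 (compatible; MJ12bot/v1.4.2; http://www.majestic12.co.uk/bot.php?+)"


def is_banned(user_agent: str, banned_fragments: Iterable[str]) -> bool:
    """Assumed rule: ban when any non-empty list entry occurs inside the UA."""
    ua = user_agent.lower()
    return any(
        fragment.strip().lower() in ua
        for fragment in banned_fragments
        if fragment.strip()
    )


short_entry_hits = is_banned(MJ12_UA, ["MJ12bot"])
long_entry_hits = is_banned(MJ12_UA, ["Majestics MJ12bot"])

print("MJ12bot matches:", short_entry_hits)
print("Majestics MJ12bot matches:", long_entry_hits)
```

Under that assumption, "Majestics MJ12bot" never appears in the UA as a contiguous string, so only the shorter entry can match.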

I hope this will stop it; maybe "Majestics MJ12bot" was too much for it to match.


All times are GMT.