vb.org Archive

vb.org Archive (https://vborg.vbsupport.ru/index.php)
-   vBulletin 4.x Add-ons (https://vborg.vbsupport.ru/forumdisplay.php?f=245)
-   -   Miscellaneous Hacks - Ban Spiders by User Agent (https://vborg.vbsupport.ru/showthread.php?t=268208)

fly 02-08-2013 05:13 PM

Quote:

Originally Posted by Simon Lloyd (Post 2402890)
I suspect accounts held on some of their servers are of no use to your forum and are scrapping content or emails..etc

Banning bots, as i've always said is a personal thing :)

Okay, but if I recall correctly this only bans by user agent, not IP block and therefore would be ineffective to ban 'AWS'.

Simon Lloyd 02-08-2013 05:16 PM

Why would it be ineffective banning them? every device that accesses the internet...etc has a UserAgent, you just need to find the useragent and i show you how to do that in the links in the mod description.

Read this: http://www.webmasterworld.com/search...rs/4368965.htm

If you really want to ban ip's then https://vborg.vbsupport.ru/showthread.php?t=268146

fly 02-08-2013 05:22 PM

Quote:

Originally Posted by Simon Lloyd (Post 2402915)
Why would it be ineffective banning them? every device that accesses the internet...etc has a UserAgent, you just need to find the useragent and i show you how to do that in the links in the mod description.

Read this: http://www.webmasterworld.com/search...rs/4368965.htm

If you really want to ban ip's then https://vborg.vbsupport.ru/showthread.php?t=268146

Because since Amazon runs a cloud hosting service, anyone can own an AWS server. Hell, I have one. There is no ONE service and user agent on AWS, so its not possible to ban all AWS servers by user agent.

Simon Lloyd 02-08-2013 05:25 PM

But not ALL AWS users are bad, are you? :), agreed you cannot ban a server but every bot, spider, person or device that comes your way will have a UA that you can ban.

fly 02-08-2013 05:31 PM

Quote:

Originally Posted by Simon Lloyd (Post 2402918)
But not ALL AWS users are bad, are you? :), agreed you cannot ban a server but every bot, spider, person or device that comes your way will have a UA that you can ban.

The request was to ban AWS servers by user agent. That's not possible.

(And technically you don't even have to send a user agent.)

Simon Lloyd 02-08-2013 05:37 PM

The request wasn't specifically to ban the servers by UA :), if you send a malformed or blank UA then you can ban those too ;)

As a side note i noticed that you haven't downloaded the latest version of this mod or marked it installed, have you uninstalled it, if so could i ask why? just helps me develop more robust things in the future.

fly 02-08-2013 08:47 PM

Quote:

Originally Posted by Simon Lloyd (Post 2402922)
The request wasn't specifically to ban the servers by UA :), if you send a malformed or blank UA then you can ban those too ;)

As a side note i noticed that you haven't downloaded the latest version of this mod or marked it installed, have you uninstalled it, if so could i ask why? just helps me develop more robust things in the future.

I started with this mod. However, my server at the time was so resource starved that I needed to block the spiders before it got to PHP/MYSQL. Nothing wrong with it. It worked well. I just couldn't afford the resources.

Max Taxable 02-10-2013 03:09 AM

Quote:

Originally Posted by fly (Post 2402919)
The request was to ban AWS servers by user agent. That's not possible.

(And technically you don't even have to send a user agent.)

Yes it is. Enter it exactly like it appears in the user agent string.

"amazonaws"

fly 02-10-2013 12:28 PM

Quote:

Originally Posted by Max Taxable (Post 2403250)
Yes it is. Enter it exactly like it appears in the user agent string.

"amazonaws"

Yes, but no one is using that UA. Amazon has no reason to crawl *any* site.

Max Taxable 02-11-2013 04:56 PM

Quote:

Originally Posted by fly (Post 2403305)
Yes, but no one is using that UA. Amazon has no reason to crawl *any* site.

Amazon AWS is their hosting they sell. And yes they also crawl the web: http://aws.amazon.com/search-engines/

I have it blocked as well, using this Mod.

Here is how I decide what UAs I block:

1.) Is it beneficial for my site to have it crawling?

2.) Does it behave nicely? Does it obey robots.txt?

3.) If in any way suspicious, it goes in this Mod.

Like the developer says, it's all about personal choice.


All times are GMT. The time now is 05:04 PM.

Powered by vBulletin® Version 3.8.12 by vBS
Copyright ©2000 - 2025, vBulletin Solutions Inc.

X vBulletin 3.8.12 by vBS Debug Information
  • Page Generation 0.01697 seconds
  • Memory Usage 1,744KB
  • Queries Executed 10 (?)
More Information
Template Usage:
  • (1)ad_footer_end
  • (1)ad_footer_start
  • (1)ad_header_end
  • (1)ad_header_logo
  • (1)ad_navbar_below
  • (7)bbcode_quote_printable
  • (1)footer
  • (1)gobutton
  • (1)header
  • (1)headinclude
  • (6)option
  • (1)pagenav
  • (1)pagenav_curpage
  • (4)pagenav_pagelink
  • (2)pagenav_pagelinkrel
  • (1)post_thanks_navbar_search
  • (1)printthread
  • (10)printthreadbit
  • (1)spacer_close
  • (1)spacer_open 

Phrase Groups Available:
  • global
  • postbit
  • showthread
Included Files:
  • ./printthread.php
  • ./global.php
  • ./includes/init.php
  • ./includes/class_core.php
  • ./includes/config.php
  • ./includes/functions.php
  • ./includes/class_hook.php
  • ./includes/modsystem_functions.php
  • ./includes/class_bbcode_alt.php
  • ./includes/class_bbcode.php
  • ./includes/functions_bigthree.php 

Hooks Called:
  • init_startup
  • init_startup_session_setup_start
  • init_startup_session_setup_complete
  • cache_permissions
  • fetch_threadinfo_query
  • fetch_threadinfo
  • fetch_foruminfo
  • style_fetch
  • cache_templates
  • global_start
  • parse_templates
  • global_setup_complete
  • printthread_start
  • pagenav_page
  • pagenav_complete
  • bbcode_fetch_tags
  • bbcode_create
  • bbcode_parse_start
  • bbcode_parse_complete_precache
  • bbcode_parse_complete
  • printthread_post
  • printthread_complete