vb.org Archive

vb.org Archive (https://vborg.vbsupport.ru/index.php)
-   vBulletin 4.x Add-ons (https://vborg.vbsupport.ru/forumdisplay.php?f=245)
-   -   Miscellaneous Hacks - Ban Spiders by User Agent (https://vborg.vbsupport.ru/showthread.php?t=268208)

CAG CheechDogg 11-02-2012 04:16 PM

Quote:

Originally Posted by simonhind (Post 2377834)
a wise choice if that's happening to you, as they will eat up your bandwidth

Yeah Simon, I had to do it, I didn't want to because it did bring in traffic but after carefully thinking about it, it's not going to hurt me or the site if I block it.

Oh the funs of owning a website eh? lol..:eek:

TheSupportForum 11-02-2012 04:27 PM

Quote:

Originally Posted by CAG CheechDogg (Post 2377837)
Yeah Simon, I had to do it, I didn't want to because it did bring in traffic but after carefully thinking about it, it's not going to hurt me or the site if I block it.

Oh the funs of owning a website eh? lol..:eek:

for me i own 2 so i am catching bots to block across 2 domains
i have spam traps on 1 to catch them

CAG CheechDogg 11-02-2012 04:29 PM

Just a bit of FYI, I was getting hit by Russian IPs and they were also trying to register, I tracked it down to "Deepnet" Explorer which I just blocked as well, just thought I would mention that here.

Simon Lloyd 11-02-2012 04:57 PM

As for Facebook, if you've gone that route maybe it would be beneficial to set up an rss feed from your site in facebook :)

CAG CheechDogg 11-02-2012 05:15 PM

I did have the rss feed using "rss graffiti" which will no longer work now that I blocked facebook

Do you know any other way to do this ?

Simon Lloyd 11-02-2012 05:44 PM

Ah!, no it was graffitti that i was using, however, you can get twitter to post to facebook :)

CAG CheechDogg 11-02-2012 05:51 PM

Hmmm...ok I will check to see how twitter will crawl my site lol....I did set up an RSS feed a couple minutes ago using Social RSS: http://www.facebook.com/CAGclan/app_23798139265

I will do some searching for twitter to facebook though, thanks for the suggestion!

CAG CheechDogg 11-02-2012 09:10 PM

Simon, rss graffiti still works with facebook blocked by this mod! muahaha! This is great!

Simon Lloyd 11-02-2012 09:36 PM

:) Glad you're happy!

CAG CheechDogg 11-02-2012 09:49 PM

Quote:

Originally Posted by Simon Lloyd (Post 2377917)
:) Glad you're happy!

Yeah Simon thanks again for a great mod! Now I don't have the facebook critters and my new threads are still getting posted on facebook...muahahha!

CAG CheechDogg 11-04-2012 03:50 PM

New Spider to add you guys

SeznamBot

Seznam Fulltext Blog

In Omnibus 11-04-2012 04:16 PM

Quote:

Originally Posted by CAG CheechDogg (Post 2378329)
New Spider to add you guys


SeznamBot

Seznam Fulltext Blog

I've never seen this bot before so obviously it only hangs out at the coolest sites.

Max Taxable 11-04-2012 05:14 PM

It was on the list posted earlier in the thread. Nasty little bugger.

TheSupportForum 11-04-2012 05:16 PM

there is a new bot i spotted today

TurnitinBot/2.1

Simon Lloyd 11-04-2012 05:34 PM

Hey guys, if you come across new bots...etc can you also post them here https://www.vbulletin.com/forum/showthread.php?t=352664 so Mosh can add them to his spider list for vbulletin too :)

TheSupportForum 11-04-2012 05:39 PM

Quote:

Originally Posted by Simon Lloyd (Post 2378371)
Hey guys, if you coma across new bots...etc can you also post them here https://www.vbulletin.com/forum/showthread.php?t=352664 so Mosh can add them to his spider lits for vbulletin too :)

Thanks, just posted mine

CAG CheechDogg 11-04-2012 06:37 PM

Quote:

Originally Posted by TheSupportForum (Post 2378362)
there is a new bot i spotted today

TurnitinBot/2.1

I had that one show up before when I used Kunena forums, I blocked that sucker about a year ago.

Snowhog 11-04-2012 10:04 PM

Thank you Simon for such a useful MOD. Simple, clean, and effective. Installed and nominated for MOTM.

tambo 11-05-2012 06:50 PM

Excellent mod. Has already helped reduce our guest list.

Many thanks.

CAG CheechDogg 11-06-2012 06:51 AM

The Artabus spider is still getting through even though it's on the list, anything else that can help ?

Simon Lloyd 11-06-2012 07:07 AM

Quote:

Originally Posted by CAG CheechDogg (Post 2378785)
The Artabus spider is still getting through even though it's on the list, anything else that can help ?

I've said this before..........go to WGO click users online (online.php), at the bottom from the dropdown choose to view user agent and check what Artabus has as its UA, it probably doesn't have artabus in the UA.

CAG CheechDogg 11-06-2012 07:16 AM

The following is what shows there Simon:

pool-109-191-73-49.is74.ru
Mozilla/4.0 (compatible; MSIE 5.5; Windows NT 5.0; T312461)

What should I use to block it here?

Simon Lloyd 11-06-2012 07:31 AM

to be safe block the entire string, this way you wont accidentally block legitimate users :)

CAG CheechDogg 11-06-2012 07:34 AM

Simon I feel like a fool asking this...but which one would be the entire string to use?

pool-109-191-73-49.is74.ru

or

Mozilla/4.0 (compatible; MSIE 5.5; Windows NT 5.0; T312461)

I don't want to block legitimate users! lol..

Simon Lloyd 11-06-2012 07:39 AM

the second one, the first is to do with the IP address.

CAG CheechDogg 11-06-2012 07:47 AM

Thanks Simon! lol....I feel like what my Son calls me at times a "Goober"...hahah...:D

Max Taxable 11-06-2012 02:01 PM

VERY few legitimate human users are going to be on MSIE 7 or older.

I block ALL MSIE except 8. Amazing how much that alone cut down on bot registration attempts.

CAG CheechDogg 11-06-2012 11:38 PM

Quote:

Originally Posted by Max Taxable (Post 2378883)
VERY few legitimate human users are going to be on MSIE 7 or older.

I block ALL MSIE except 8. Amazing how much that alone cut down on bot registration attempts.

so what do you use to block all MSIE except for 8 Max, htaccess or some other way?

TheSupportForum 11-06-2012 11:52 PM

Quote:

Originally Posted by CAG CheechDogg (Post 2378985)
so what do you use to block all MSIE except for 8 Max, htaccess or some other way?

heres an example

Chrome/10.*
Firefox/4.*
so i asume MSIE will work as

MSIE/6.*
MSIE/7.*

and so on

Max Taxable 11-07-2012 12:33 AM

Quote:

Originally Posted by CAG CheechDogg (Post 2378985)
so what do you use to block all MSIE except for 8 Max, htaccess or some other way?

I just have it in like this:

MSIE 1
MSIE 2
MSIE 3
MSIE 4
MSIE 5
MSIE 6
MSIE 7

TheSupportForum 11-07-2012 12:44 AM

Quote:

Originally Posted by Max Taxable (Post 2379003)
I just have it in like this:

MSIE 1
MSIE 2
MSIE 3
MSIE 4
MSIE 5
MSIE 6
MSIE 7

wont MSIE 1 block MSIE Beta 10 ?

which means MSIE 8, 9 only visitors

Max Taxable 11-07-2012 01:22 AM

Quote:

Originally Posted by TheSupportForum (Post 2379008)
wont MSIE 1 block MSIE Beta 10 ?

which means MSIE 8, 9 only visitors

Nope!

CAG CheechDogg 11-07-2012 02:28 AM

This is using your mod here right?

Max Taxable 11-07-2012 03:42 AM

This Mod, yeah. It ain't my mod tho.

Simon Lloyd 11-07-2012 07:47 AM

Quote:

Originally Posted by TheSupportForum (Post 2379008)
wont MSIE 1 block MSIE Beta 10 ?

which means MSIE 8, 9 only visitors

Take a quick look at this post https://vborg.vbsupport.ru/showpost....&postcount=381 should help explain how the system works better :)

CAG CheechDogg 11-07-2012 10:33 AM

Quote:

Originally Posted by Simon Lloyd (Post 2379064)
Take a quick look at this post https://vborg.vbsupport.ru/showpost....&postcount=381 should help explain how the system works better :)


Yeep makes way more sense now, how easy it is for "us" to overlook a single post that explains it all. Sorry , I am guilty of doing so. ...

Thanks for tolerating us Lloyd, we appreciate it very much!

Simon Lloyd 11-07-2012 11:50 AM

Its not toleration, i just love helping people :)

vb50kgpoo 11-12-2012 10:17 AM

Hi Simon
Yours is a great product. I made the mistake of uninstalling it in order to use AbyssGuard, which is plagued with problems. I have now reinstalled Ban Spiders By User Agent. One question, are there any ramifications in banning \wbot[\/\-] with your mod? I ask as putting \wbot[\/\-] directly into my htaccess banning mecahism causes issues.
Regards / RSVP

vb50kgpoo 11-12-2012 11:26 AM

Also.......

Does anyone know what these bots are;

Robot ID - Hits - Bandwidth - Last visit - Hits on robots.txt
robot 772 8576221 20121111093454 0
crawl 699 9556953 20121108085243 0
spider 5 114750 20121106065956 0

Bad bots using generic names?

ForceHSS 11-12-2012 12:04 PM

What is there full host name


All times are GMT. The time now is 05:59 PM.

Powered by vBulletin® Version 3.8.12 by vBS
Copyright ©2000 - 2025, vBulletin Solutions Inc.

X vBulletin 3.8.12 by vBS Debug Information
  • Page Generation 0.01820 seconds
  • Memory Usage 1,817KB
  • Queries Executed 10 (?)
More Information
Template Usage:
  • (1)ad_footer_end
  • (1)ad_footer_start
  • (1)ad_header_end
  • (1)ad_header_logo
  • (1)ad_navbar_below
  • (14)bbcode_quote_printable
  • (1)footer
  • (1)gobutton
  • (1)header
  • (1)headinclude
  • (6)option
  • (1)pagenav
  • (1)pagenav_curpage
  • (4)pagenav_pagelink
  • (1)post_thanks_navbar_search
  • (1)printthread
  • (40)printthreadbit
  • (1)spacer_close
  • (1)spacer_open 

Phrase Groups Available:
  • global
  • postbit
  • showthread
Included Files:
  • ./printthread.php
  • ./global.php
  • ./includes/init.php
  • ./includes/class_core.php
  • ./includes/config.php
  • ./includes/functions.php
  • ./includes/class_hook.php
  • ./includes/modsystem_functions.php
  • ./includes/class_bbcode_alt.php
  • ./includes/class_bbcode.php
  • ./includes/functions_bigthree.php 

Hooks Called:
  • init_startup
  • init_startup_session_setup_start
  • init_startup_session_setup_complete
  • cache_permissions
  • fetch_threadinfo_query
  • fetch_threadinfo
  • fetch_foruminfo
  • style_fetch
  • cache_templates
  • global_start
  • parse_templates
  • global_setup_complete
  • printthread_start
  • pagenav_page
  • pagenav_complete
  • bbcode_fetch_tags
  • bbcode_create
  • bbcode_parse_start
  • bbcode_parse_complete_precache
  • bbcode_parse_complete
  • printthread_post
  • printthread_complete