vb.org Archive

vb.org Archive (https://vborg.vbsupport.ru/index.php)
-   vBulletin 4.x Add-ons (https://vborg.vbsupport.ru/forumdisplay.php?f=245)
-   -   Miscellaneous Hacks - Ban Spiders by User Agent (https://vborg.vbsupport.ru/showthread.php?t=268208)

ForceHSS 05-02-2012 03:10 AM

My full list if any need it
Code:

beta.statsit.com
statsit
SiteIntel
Yandex
GomezAgent
FunWebProducts
Nesotebot
DCPbot
Opera
AOL Advertising R&D
DataCha0s
aiHitBot
Apache-HttpClient
Zend_Http_Client
ReverseGet
Baidu
BoardReader
almaden
Anarchie
ASPSeek
attach
autoemailspider
BackWeb
Bandit
BatchFTP
BlackWidow
Bot\mailto:craftbot@yahoo.com
Buddy
bumblebee
CherryPicker
ChinaClaw
CICC
Collector
Copier
Copyscape
Crescent
DIIbot
DISCo
DISCo\Pump
dotbot
Download\Demon
Download\Wonder
Downloader
Drip
DSurf15a
eCatch
EasyDL/2.99
EirGrabber
email
EmailCollector
EmailSiphon
EmailWolf
Express\WebPictures
ExtractorPro
EyeNetIE
FileHound
FlashGet
FrontPage
GetRight
GetSmart
GetWeb!
gigabaz
Go\!Zilla
Go!Zilla
Go-Ahead-Got-It
gotit
Grabber
GrabNet
Grafula
grub-client
HMView
HTTrack
httpdown
.*httrack.*
ia_archiver
Image\Stripper
Image\Sucker
Indy*Library
Indy\Library
InterGET
InternetLinkagent
Internet\Ninja
InternetSeer.com
Iria
JBH*agent
JetCar
JOC\Web\Spider
JustView
larbin
LeechFTP
LexiBot
lftp
Link*Sleuth
likse
//Link
LinkWalker
Mag-Net
Magnet
Mass\Downloader
Memo
Microsoft.URL
MIDown\tool
Mirror
Mister\PiX
Mozilla.*Indy
Mozilla.*NEWT
Mozilla*MSIECrawler
Mozilla/4.0
Mozilla/4.79
MS\FrontPage*
MSFrontPage
MSIECrawler
MSProxy
myinfo.any-request-allowed.com
Navroad
NearSite
NetAnts
NetMechanic
NetSpider
Net\Vampire
NetZIP
NICErsPRO
Ninja
Nutch
Octopus
Offline\Explorer
Offline\Navigator
Openfind
PageGrabber
Papa\Foto
pavuk
pcBrowser
Ping
PingALink
Pockey
psbot
Pump
Python-urllib/2.4
QRVA
RealDownload
Reaper
Recorder
ReGet
Scooter
Seeker
Siphon
sitecheck.internetseer.com
SiteSnagger
SlySearch
SmartDownload
Snake
sogou
Soso
SpaceBison
Spinn3r
sproose
Stripper
start.exe
Sucker
SuperBot
SuperHTTP
Surfbot
Szukacz
SeznamBot
tAkeOut
Teleport\Pro
TurnitinBot/2.1
URLSpiderPro
Vacuum
VoidEYE
vBSEO
Web\Image\Collector
Web\Sucker
WebAuto
[Ww]eb[Bb]andit
webcollage
WebCopier
Web\Downloader
WebEMailExtrac.*
WebFetch
WebGo\IS
WebHook
WebLeacher
WebMiner
WebMirror
WebReaper
WebSauger
Website
Website\eXtractor
Website\Quester
Webster
WebStripper
WebWhacker
WebZIP
Wget
Whacker
Widow
WWWOFFLE
x-Tractor
Xaldon\WebSpider
Xenu
xpymep.exe
Yandex
Yeti
YOUDAOBOT
Zeus.*Webster
Zeus


bigtree 05-02-2012 03:41 AM

Add BoardReader Its a bad one.

ForceHSS 05-02-2012 03:47 AM

added

Nirjonadda 05-02-2012 04:33 PM

# Facebook Spider
# Google Wireless Transcoder Spider

How I Can Ban Permanently From My Web Site?

Simon Lloyd 05-02-2012 06:26 PM

Quote:

Originally Posted by Nirjonadda (Post 2325551)
# Facebook Spider
# Google Wireless Transcoder Spider

How I Can Ban Permanently From My Web Site?

Check this thread https://vborg.vbsupport.ru/showpost....&postcount=318

waldvb 05-16-2012 04:51 PM

Installed, enabled Mod. Still can see Baidu in Who's Online - i.e, IP 180.76.5.165, 180.76.5.168, 180.76.5.66

What's wrong?

ForceHSS 05-16-2012 05:34 PM

Quote:

Originally Posted by waldvb (Post 2329936)
Installed, enabled Mod. Still can see Baidu in Who's Online - i.e, IP 180.76.5.165, 180.76.5.168, 180.76.5.66

What's wrong?

post screenshot of settings

waldvb 05-16-2012 06:08 PM

1 Attachment(s)
Quote:

Originally Posted by ForceHSS (Post 2329956)
post screenshot of settings

Here are settings. With mod I have just 4-5 baidu IP's. Before I had 100 +

ForceHSS 05-16-2012 09:39 PM

spiders own ip make that a yes

Winter Sonata 05-17-2012 01:10 PM

Installed, I hope that can help reducing the server resources usage

Simon Lloyd 05-17-2012 07:38 PM

@Waldvb, if you have Paul M's who's online mod then you will see them in that as they actually call for a thread directly, what happens is they make that request but are instantly redirected to the plac eof your choice :)

@Winter Sonata, thanks for installing, please click "Mark As Installed" at the top left of this thread so you can recieve support if you need it :)

spillage 05-18-2012 12:23 AM

Quote:

Originally Posted by Winter Sonata (Post 2330159)
Installed, I hope that can help reducing the server resources usage


For me, it's stopping all but Google, Yahoo, and Bing (as I've not added those to my list).

This is an excellent mod.

Winter Sonata 05-18-2012 11:36 PM

Thanks Simon Lloyd , now done :) sorry forget at the 1st time :)

Best Regards!

waldvb 07-16-2012 10:47 PM

Can't ban utel.net.ua

Max Taxable 07-16-2012 10:58 PM

Quote:

Originally Posted by waldvb (Post 2348801)
Can't ban utel.net.ua

If that is in the USER AGENT string, sure you can. If it's the host name, this Mod doesn't "see" it. This Mod isn't for banning IPs, ISPs or hostnames.

Simon Lloyd 07-17-2012 06:07 AM

Quote:

Originally Posted by waldvb (Post 2348801)
Can't ban utel.net.ua

Read this post https://vborg.vbsupport.ru/showpost....&postcount=318

KidHTML 07-19-2012 12:08 PM

How do I block these bots because I'm not understanding this mod or this whole topic...

Twitterbot,
Butterfly Topsy Crawler (2),
Embedly

Simon Lloyd 07-19-2012 12:16 PM

If you'd like to mark it installed i'll give you all the help you need :)

KidHTML 07-19-2012 12:19 PM

Sorry, marked as installed and thanks.

Max Taxable 07-19-2012 12:24 PM

Quote:

Originally Posted by KidHTML (Post 2349486)
How do I block these bots because I'm not understanding this mod or this whole topic...

Twitterbot,
Butterfly Topsy Crawler (2),
Embedly

Personally I wouldn't block any of these.

KidHTML 07-19-2012 12:26 PM

Quote:

Originally Posted by Max Taxable (Post 2349495)
Personally I wouldn't block any of these.

I wasn't sure if I should or not...

Max Taxable 07-19-2012 02:16 PM

Quote:

Originally Posted by KidHTML (Post 2349499)
I wasn't sure if I should or not...

I posted a list of the ones I block, earlier in this thread. All are there for very good reason. They aren't helpful to the site, they leech too much resources, or they are typically botnet zombies. Bad actors all.

Simon Lloyd 07-19-2012 03:20 PM

Thanks for the help Max :)

Max Taxable 07-19-2012 07:24 PM

Quote:

Originally Posted by Simon Lloyd (Post 2349541)
Thanks for the help Max :)

Heh. I never explained how to use your Mod....:p

Simon Lloyd 07-19-2012 08:52 PM

Quote:

Originally Posted by Max Taxable (Post 2349630)
Heh. I never explained how to use your Mod....:p

I was under the impression that it was self explanatory, maybe i should add to the first post?

Max Taxable 07-19-2012 09:23 PM

Quote:

Originally Posted by Simon Lloyd (Post 2349655)
I was under the impression that it was self explanatory, maybe i should add to the first post?

I got the same impression...

Willy T 07-19-2012 10:35 PM

I must say..... This was the only thing I found to get rid of those damn Baidu spiders! I averaged 50 - 65 baidu spiders at one time. I often had almost as many guests online as I did users.

Now I am perfectly fine with 1-5 google & bing spiders online. 50+ spiders that simply don't help me? yea... No thanks!

Max Taxable 07-19-2012 11:47 PM

Baidu is evil.

Nichtofen 07-29-2012 07:08 PM

This mod is great! I cannot express how wonderful this mod is and how well it works. Tie that into how responsive and resourceful Simon is and it's a winner. Thanks for all the help and clarification within the comments to Simon, Max, Force, and all!

My forum is young and relatively slow as it were, but I wish to stay ahead of the game on this one and keep my security up as the server load increases. I don't see the spiders that were stopping by for a visit anymore. :)

Thanks again and keep up the good work
Marked Installed and Nominated

vB4.2.0

Simon Lloyd 07-29-2012 10:55 PM

Glad you like it and it works well for you, i think your our first vb4.2.x so it's good to know it works upward :)

I know it's a pain but do familiarise yourself with the info that can be found at the links in the first post, these will help you as your forum gets busier, also take the time to research some of the bots, they're not all bad, some will even help your site grow.

Be mindful about who and which nations you are going to cater for before banning/allowing bots to crawl your site and you'll be golden.

zascok 08-15-2012 03:18 PM

Don't really know what the Yandex has done to be called bad_bot :confused: and ia_archiver ?

Nice mod installed

Simon Lloyd 08-15-2012 05:09 PM

As has been said many times blocking spiders/bots is a personal thing, you need to understand who and which countries you are aiming your site and content at, do you have enoughj resources to allow your content to be scraped by archivers...etc, do you have enough bandwidth to allow bots who add no value or do not cater for your target audience to index your content?

Just be very honest with yourself on what your intentions are with regards to your target audience and user experience :)

zascok 08-15-2012 06:40 PM

here is a one to your collection Ezooms <-- really bad one don't even give a thing about robot.txt.

About the Yandex. Just in case: it's the biggest engine in Russian language. Of course if you don't wanna it index you site that is you personal thing. All I asked is: what has it done? Did anyone ever see that bot is braking the rules ?

Simon Lloyd 08-15-2012 07:10 PM

Is your site in english?, do you cater for Russians? (Yandex = Yahoo) it's done nothing bad that i know of and my interest isn't to keep a list of "bad bots"...etc, i purely built this to save on precious bandwidth that was being decimated by bots, my site is englsh, the UTF is set up as english...etc but yandex crawl my site, i don't need them, they dont bring trafic and i don't speak Russian, so i block them, so thats what i mean, choose what you want to block with regards to who you ar targetting.

In the mod description there's links to maintained lists that you can use, the ones in the product were just your starter :)

rootsxrocks 08-15-2012 07:22 PM

OMG I can Banish Baudi Thank you I am sick of that worthless misbehaving IP switching overloading spider.

zascok 08-15-2012 07:28 PM

OK i see now. Tia. Now wondering is there is a plugin that bans counties by the name :)

rootsxrocks 08-15-2012 07:34 PM

We are a localized site too and have no need for Chinese or Russian search results.

Simon Lloyd 08-15-2012 08:55 PM

Quote:

Originally Posted by rootsxrocks (Post 2357223)
OMG I can Banish Baudi Thank you I am sick of that worthless misbehaving IP switching overloading spider.

Glad i could be of service ;)

ForceHSS 10-29-2012 03:03 AM

xpymep.exe
here is a new one to add to the list

Max Taxable 10-29-2012 03:10 AM

Quote:

Originally Posted by ForceHSS (Post 2376738)
xpymep.exe
here is a new one to add to the list

One you already gave us, in your fantastic list posted earlier in the thread:

https://vborg.vbsupport.ru/showpost....&postcount=321


All times are GMT. The time now is 11:30 PM.

Powered by vBulletin® Version 3.8.12 by vBS
Copyright ©2000 - 2025, vBulletin Solutions Inc.

X vBulletin 3.8.12 by vBS Debug Information
  • Page Generation 0.02595 seconds
  • Memory Usage 1,829KB
  • Queries Executed 10 (?)
More Information
Template Usage:
  • (1)ad_footer_end
  • (1)ad_footer_start
  • (1)ad_header_end
  • (1)ad_header_logo
  • (1)ad_navbar_below
  • (1)bbcode_code_printable
  • (14)bbcode_quote_printable
  • (1)footer
  • (1)gobutton
  • (1)header
  • (1)headinclude
  • (6)option
  • (1)pagenav
  • (1)pagenav_curpage
  • (4)pagenav_pagelink
  • (1)pagenav_pagelinkrel
  • (1)post_thanks_navbar_search
  • (1)printthread
  • (40)printthreadbit
  • (1)spacer_close
  • (1)spacer_open 

Phrase Groups Available:
  • global
  • postbit
  • showthread
Included Files:
  • ./printthread.php
  • ./global.php
  • ./includes/init.php
  • ./includes/class_core.php
  • ./includes/config.php
  • ./includes/functions.php
  • ./includes/class_hook.php
  • ./includes/modsystem_functions.php
  • ./includes/class_bbcode_alt.php
  • ./includes/class_bbcode.php
  • ./includes/functions_bigthree.php 

Hooks Called:
  • init_startup
  • init_startup_session_setup_start
  • init_startup_session_setup_complete
  • cache_permissions
  • fetch_threadinfo_query
  • fetch_threadinfo
  • fetch_foruminfo
  • style_fetch
  • cache_templates
  • global_start
  • parse_templates
  • global_setup_complete
  • printthread_start
  • pagenav_page
  • pagenav_complete
  • bbcode_fetch_tags
  • bbcode_create
  • bbcode_parse_start
  • bbcode_parse_complete_precache
  • bbcode_parse_complete
  • printthread_post
  • printthread_complete