vb.org Archive (https://vborg.vbsupport.ru/index.php)
-   vBulletin 4.x Add-ons (https://vborg.vbsupport.ru/forumdisplay.php?f=245)
-   -   Miscellaneous Hacks - Ban Spiders by User Agent (https://vborg.vbsupport.ru/showthread.php?t=268208)

Simon Lloyd 12-19-2014 09:53 AM

Quote:

Originally Posted by ozzy47 (Post 2528011)
Yes, I know, Simon. I thought about doing it that way also, but the payoff is not worth the trouble. It would bloat the mod and become a resource hog in no time. :)

That's what I'm saying: if all they want to do with their forums is watch bots/spiders (and that's not what the mod is about), then they'll have hours and hours of fun but no members, as the forum would slow down to a snail's pace.

People, just rest assured that the mod is doing its job and giving you back valuable resources. Please don't waste them again by trying to build in fanciful functionality that's really a whim and not a must-have. :D

ozzy47 12-19-2014 09:56 AM

Exactly, which is why I only made it a count of total blocked. It's quick to write, and quick to read and display.

I used the KISS principle: keep it simple, stupid. :)

Simon Lloyd 12-19-2014 09:58 AM

Quote:

Originally Posted by ozzy47 (Post 2528013)
Exactly, which is why I only made it a count of total blocked. It's quick to write, and quick to read and display.

I used the KISS principle: keep it simple, stupid. :)

:mad: Who you calling stupid? Lol!

ozzy47 12-19-2014 10:01 AM

I call myself stupid all the time when I am writing code. :p

Simon Lloyd 12-19-2014 10:25 AM

Quote:

Originally Posted by ozzy47 (Post 2528015)
I call myself stupid all the time when I am writing code. :p

Haha! I hear ya!

Black Snow 12-19-2014 11:35 AM

Quote:

Originally Posted by ozzy47 (Post 2528007)
Say what? This is already recording the blocked bots in four different possible ways: in a log file, in a thread, in an email, and in the bots-blocked count.

Now, writing each bot individually to the DB is not a good idea, as the table would grow too big and be a giant PITA when you query it to get the info. Think about it: in just five days, only blocking Baidu, I have a count of 9,062.

Now let's take that and round it to 10,000 for seven days. Let's say I am going to block the 281 bots I have in my list: 281 x 10,000 x 52 = 146,120,000 entries a year. :eek:

Sure, each bot is not going to hit me 10,000 times a week, but you can see this can really grow fast. ;)

And my site is not that busy. Now imagine a busy site: the entries in the DB table could be 250,000,000 plus a year. Now run a query to retrieve the info from that table, and it would hurt the site every time it is run. :mad:

IMO, the best way to go is how it is now: just increase the count every time a bot is blocked, then get that number and display it. :)

I understand where you are coming from, but that is not quite my intention. I will log, for example, Baidu into the database, and each time it is blocked, its count will go up by one. I just want to be able to query the DB to see a list of all blocked bots and how many times each was blocked. The log, output.txt, is messy IMHO, and I want to be able to query the bots list.
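
A minimal sketch of the per-bot counter idea, for illustration only. It is not the mod's code: the mod is a vBulletin add-on, while this uses Python with SQLite, and the table name `blocked_bots`, the helper names, and "ExampleBot" are all assumptions. It keeps one row per bot and bumps a counter, so the table stays at one row per listed bot instead of one row per blocked hit, although every block still costs a database write.

```python
import sqlite3


def open_store(path=":memory:"):
    """Open the (hypothetical) counter table: one row per bot name."""
    conn = sqlite3.connect(path)
    conn.execute(
        "CREATE TABLE IF NOT EXISTS blocked_bots ("
        " name TEXT PRIMARY KEY,"
        " hits INTEGER NOT NULL DEFAULT 0)"
    )
    return conn


def record_block(conn, bot_name):
    """Create the bot's row if it is missing, then add one to its count."""
    with conn:  # commit both statements together
        conn.execute(
            "INSERT OR IGNORE INTO blocked_bots (name) VALUES (?)", (bot_name,)
        )
        conn.execute(
            "UPDATE blocked_bots SET hits = hits + 1 WHERE name = ?", (bot_name,)
        )


def blocked_report(conn):
    """Return (name, hits) pairs, most-blocked bot first."""
    return conn.execute(
        "SELECT name, hits FROM blocked_bots ORDER BY hits DESC, name"
    ).fetchall()


conn = open_store()
for bot in ["Baidu", "Baidu", "ExampleBot", "Baidu"]:
    record_block(conn, bot)

report = blocked_report(conn)
print(report)  # [('Baidu', 3), ('ExampleBot', 1)]
```

Four blocks produce only two rows here, which is the difference between this and writing a new entry for every hit.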

ozzy47 12-19-2014 11:43 AM

I can see that, but as described, that is still going to be totally unnecessary, and it would still slow things down on a busy site. :)

But it is cool in theory. I started to do just that, but decided against it as I was going along, and found the best thing was to have a running total count. People don't care how many times bot X was blocked, just as long as it was blocked. :)

too_cool_3 12-19-2014 08:52 PM

I have one question regarding Ban Spiders by User Agent. Since many of the User Agents in the lists contain words like 'bot', 'spider' and 'crawl', will enabling the option Ban Spiders in List ban even good spiders from the forum?

e.g.
Google Spider (crawl-66-249-75-154.googlebot.com)
BingBot Spider (msnbot-157-55-39-186.search.msn.com)

ozzy47 12-19-2014 08:53 PM

It will only ban spiders you have listed, not all spiders.
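
A rough sketch of what list-based matching can look like, assuming a case-insensitive substring check of each list entry against the User-Agent header. How the mod actually matches is not shown in this thread, and `is_banned`, `ban_list`, and "ExampleScraperBot" are hypothetical names used only for illustration.

```python
def is_banned(user_agent, ban_list):
    """Return True if any non-empty list entry occurs in the User-Agent."""
    ua = user_agent.lower()
    return any(
        entry.strip().lower() in ua for entry in ban_list if entry.strip()
    )


# Hypothetical ban list: only these entries are checked.
ban_list = ["Baiduspider", "ExampleScraperBot"]

baidu_ua = (
    "Mozilla/5.0 (compatible; Baiduspider/2.0; "
    "+http://www.baidu.com/search/spider.html)"
)
google_ua = (
    "Mozilla/5.0 (compatible; Googlebot/2.1; "
    "+http://www.google.com/bot.html)"
)
bing_ua = (
    "Mozilla/5.0 (compatible; bingbot/2.0; "
    "+http://www.bing.com/bingbot.htm)"
)

print(is_banned(baidu_ua, ban_list))   # True: "Baiduspider" is listed
print(is_banned(google_ua, ban_list))  # False: no listed entry matches
print(is_banned(bing_ua, ban_list))    # False: no listed entry matches

# Under this substring assumption, a generic entry would be too broad:
print(is_banned(google_ua, ["bot"]))   # True
```

In this sketch, Googlebot and bingbot pass because nothing in the list names them. A bare entry like "bot" would catch them, so entries need to be specific.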

Simon Lloyd 12-20-2014 05:39 AM

Quote:

Originally Posted by too_cool_3 (Post 2528080)
I have one question regarding Ban Spiders by User Agent. Since many of the User Agents in the lists contain words like 'bot', 'spider' and 'crawl', will enabling the option Ban Spiders in List ban even good spiders from the forum?

e.g.
Google Spider (crawl-66-249-75-154.googlebot.com)
BingBot Spider (msnbot-157-55-39-186.search.msn.com)

I've explained this many, many times throughout all the Ban Spiders By UserAgent threads. Take a look here for the latest explanation: https://vborg.vbsupport.ru/showpost....&postcount=652


All times are GMT.
