vb.org Archive

vb.org Archive (https://vborg.vbsupport.ru/index.php)
-   Forum and Server Management (https://vborg.vbsupport.ru/forumdisplay.php?f=232)
-   -   Heckling (and Jeckling) magpie-crawler (https://vborg.vbsupport.ru/showthread.php?t=303962)

Digital Jedi 10-29-2013 06:43 AM

Heckling (and Jeckling) magpie-crawler
 
Has anyone noticed or experienced issues with Brandwatch's magpie-crawler? I'll admit, I sometimes don't pay as close attention to my server logs as I should. But I did used to note that magpie-crawler visited my site quite often.

Over a year ago, I had to shut down my site because of persistent database errors. Mostly failure to connect or too many connections. Every time I had sorted it out, it would crash even more often, so I just decided it wasn't worth getting my account suspended all the time and shut the place down until I could sort it out. The database errors stopped all those years, until the last couple of months, where I've been actively working on my websites every day. Suddenly it's crashing again, on websites where I'm the only visitor. So I checked my logs again.

I noticed that magpie-crawler had visited my website today over 235 times. Seems excessive. I did some research online, but I could only find little known, somewhat overly emotional blog entries about the crawler chewing up their bandwidth, but nothing more professionally written. And oddly, no forums posting about it. I was wondering what your experience with them is. While 235 hits is a bit much for a bot, I would have thought a typical shared hosting environment could handle it.

Since I'm not all that concerned with whatever it is Brandwatch does, I went ahead and put in their robot.txt deny line, and went ahead and IP blocked the three IPs I found for them in my logs. I'll be watching my site (and my logs) over the next couple of days to see if that even makes a difference.

final kaoss 10-29-2013 01:53 PM

Can always use try out the "miserable users" mod on the bot's ip addresses :)

https://vborg.vbsupport.ru/showthread.php?t=231106


All times are GMT. The time now is 04:01 PM.

Powered by vBulletin® Version 3.8.12 by vBS
Copyright ©2000 - 2025, vBulletin Solutions Inc.

X vBulletin 3.8.12 by vBS Debug Information
  • Page Generation 0.02061 seconds
  • Memory Usage 1,709KB
  • Queries Executed 10 (?)
More Information
Template Usage:
  • (1)ad_footer_end
  • (1)ad_footer_start
  • (1)ad_header_end
  • (1)ad_header_logo
  • (1)ad_navbar_below
  • (1)footer
  • (1)gobutton
  • (1)header
  • (1)headinclude
  • (6)option
  • (1)post_thanks_navbar_search
  • (1)printthread
  • (2)printthreadbit
  • (1)spacer_close
  • (1)spacer_open 

Phrase Groups Available:
  • global
  • postbit
  • showthread
Included Files:
  • ./printthread.php
  • ./global.php
  • ./includes/init.php
  • ./includes/class_core.php
  • ./includes/config.php
  • ./includes/functions.php
  • ./includes/class_hook.php
  • ./includes/modsystem_functions.php
  • ./includes/class_bbcode_alt.php
  • ./includes/class_bbcode.php
  • ./includes/functions_bigthree.php 

Hooks Called:
  • init_startup
  • init_startup_session_setup_start
  • init_startup_session_setup_complete
  • cache_permissions
  • fetch_threadinfo_query
  • fetch_threadinfo
  • fetch_foruminfo
  • style_fetch
  • cache_templates
  • global_start
  • parse_templates
  • global_setup_complete
  • printthread_start
  • bbcode_fetch_tags
  • bbcode_create
  • bbcode_parse_start
  • bbcode_parse_complete_precache
  • bbcode_parse_complete
  • printthread_post
  • printthread_complete