Spider Watcher
Version: 1.0.0 B10, by mikelbeck
Developer Last Online: Feb 2016

vBulletin Version: 3.5.4
Released: 01-05-2006
Last Update: 08-08-2006
Installs: 194
DB Changes, Uses Plugins, Template Edits, Additional Files, Is in Beta Stage
No support by the author.

Spider Watcher
Author: Mikel Beck (mikel.beck@elite-computing.net)


This hack keeps track of the spiders (search engine robots) that visit your forum. Every time a guest visits a page, the guest's IP address, user agent, and the page they visited are logged to the database.

When somebody views the spider statistics page, this data is "rolled up": the raw data is collated, the spider's name is determined by comparing the user agent to the data contained in the spiders_vbulletin.xml file, and the number of pages and visits is summarized and written back to the database. In addition, any data from non-bots is removed.
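The rollup described above can be sketched roughly as follows. This is a minimal Python illustration, not the hack's actual PHP code. The `SPIDERS` table is a hypothetical stand-in for the signatures in the spider XML file, and `identify_spider` and `roll_up` are made-up names. It assumes matching is a case-insensitive substring test, that any hit matching no signature counts as non-bot traffic, and that "visits" can be approximated by distinct IP addresses. The real hack also reports "unknown" spiders, which this sketch does not attempt to detect.

```python
from collections import defaultdict

# Hypothetical stand-in for the signatures in the spider XML file:
# (substring to look for in the user agent, display name).
SPIDERS = [
    ("googlebot", "Google"),
    ("slurp", "Yahoo! Slurp"),
    ("msnbot", "MSN Search"),
]


def identify_spider(user_agent):
    """Return the spider's display name, or None if no signature matches."""
    ua = user_agent.lower()
    for signature, name in SPIDERS:
        if signature in ua:
            return name
    return None


def roll_up(raw_hits):
    """Collate raw (ip, user_agent, page) rows into per-spider totals."""
    summary = defaultdict(lambda: {"pages": 0, "ips": set()})
    for ip, user_agent, page in raw_hits:
        name = identify_spider(user_agent)
        if name is None:
            continue  # assumed non-bot traffic: dropped, not summarized
        summary[name]["pages"] += 1
        summary[name]["ips"].add(ip)
    return dict(summary)


# Sample rows, invented for illustration only.
raw_hits = [
    ("66.249.66.1", "Mozilla/5.0 (compatible; Googlebot/2.1)", "index"),
    ("66.249.66.1", "Mozilla/5.0 (compatible; Googlebot/2.1)", "showthread"),
    ("10.0.0.5", "Mozilla/4.0 (compatible; MSIE 6.0; Windows NT 5.1)", "index"),
]
for name, stats in roll_up(raw_hits).items():
    print(name, stats["pages"], "pages from", len(stats["ips"]), "IP address(es)")
```

With the sample rows, only the Googlebot hits survive the rollup; the MSIE hit is discarded as non-bot traffic.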

The data is then displayed in an easy-to-read format for your viewing pleasure.

If the user viewing the report has permissions to view IP addresses, these are displayed as well.

A live version of the report from one of my sites can be seen here: http://www.happyhourpub.com/spiders.php

Also see the attached screenshot for an example.


Revision History:
1.0.0 Beta 1 - 01/05/2006
- Initial Release

1.0.0 Beta 2 - 01/06/2006
- Included templates for spiders.php
- Removed text from templates, added them as phrases

1.0.0 Beta 3 - 01/07/2006
- Split up the display of "known" and "unknown" spiders

1.0.0 Beta 4 - 01/25/2006
- Corrected potential SQL injection issue in plug-in
- Reduced the number of SQL queries required to display statistics
- Corrected date/time display issue

1.0.0 Beta 5 - 02/01/2006
- Reduced the number of SQL queries required to display statistics

1.0.0 Beta 6 - 02/08/2006
- No release

1.0.0 Beta 7 - 02/11/2006
- Corrected issue with "unknown" spiders not being displayed properly.
- Added tracking of the type of spider (search spider, link checker, etc.)

1.0.0 Beta 8 - 02/19/2006
- Changed the display of IP addresses to be a pop-up so they're not all displayed on the main page.
- Combined the spiders that have the same name but different user agents.

1.0.0 Beta 9 - 03/10/2006
- Changed the display to group similar spiders together (search spiders, HTTP check spiders, etc.)

1.0.0 Beta 10 - 08/08/2006
- Changed how the rollup functions. Instead of rolling up every time somebody views the spider page, it rolls up once per hour.
- Corrected a few bugs here and there, mostly related to removing entries from the database.

Installation Instructions
1. Upload spiders.php to the root of your forum.
2. Upload spiders_rollup.php to the includes/cron directory.
3. Import the file product-spiderwatcher.xml using the Manage Products module.
4. Add a link to spiders.php on your navbar or footer.
5. Add a cron job with the following information:
Title: Spider Watcher Rollup
Day of the Week: *
Day of the Month: *
Hour: *
Minute: 0 - - -
Log entries: Yes
Filename: ./includes/cron/spiders_rollup.php

Upgrade Instructions
1. Upload (and overwrite) spiders.php to the root of your forum.
2. Upload spiders_rollup.php to the includes/cron directory.
3. Import the file product-spiderwatcher.xml using the Manage Products module. Make sure the "Allow Overwrite" option is set to "Yes".
4. Add a link to spiders.php on your navbar or footer.
5. Add a cron job with the following information:
Title: Spider Watcher Rollup
Day of the Week: *
Day of the Month: *
Hour: *
Minute: 0 - - -
Log entries: Yes
Filename: ./includes/cron/spiders_rollup.php

***UPGRADE NOTE***
When you upgrade from version 1.0.0 Beta 7 to 1.0.0 Beta 8 your existing spider data will be lost!


To make sure you can identify as many spiders as possible, grab the latest spiderlist.xml from this thread and use it to replace the spiders_vbulletin.xml file in your forumhome/includes/xml/ directory: http://www.vbulletin.com/forum/showthread.php?t=76662

  • This modification may not be copied, reproduced or published elsewhere without author's permission.

Comments
#302
05-01-2006, 01:14 AM
The Notorious
Join Date: Jan 2006
Posts: 118

This hack killed my server today. It was working fine earlier today, but it has since messed up my SQL and brought the server down.
I got hundreds of e-mails like this one:

Database error in vBulletin 3.5.4:

Invalid SQL:
INSERT INTO spider_watcher (nice_bot, bot, ip_address, page, type, info, timestamp) VALUES ('Unknown Spider','Mozilla/4.0 Mozilla/4.0 (compatible; MSIE 6.0; Windows NT 5.1)','1198658994','index','','', NOW());

MySQL Error : Duplicate entry '431497' for key 1
Error Number : 1062
Date : Sunday, April 30th 2006 @ 08:17:10 PM

#303
05-01-2006, 01:47 AM
mikelbeck
Join Date: Jul 2005
Location: 4C6F6E672049736C616E642C2
Posts: 238

Quote:
Originally Posted by The Notorious
This hack killed my server today. It was working fine earlier today, but it has since messed up my SQL and brought the server down.
I got hundreds of e-mails like this one:

Database error in vBulletin 3.5.4:

Invalid SQL:
INSERT INTO spider_watcher (nice_bot, bot, ip_address, page, type, info, timestamp) VALUES ('Unknown Spider','Mozilla/4.0 Mozilla/4.0 (compatible; MSIE 6.0; Windows NT 5.1)','1198658994','index','','', NOW());

MySQL Error : Duplicate entry '431497' for key 1
Error Number : 1062
Date : Sunday, April 30th 2006 @ 08:17:10 PM

I don't think this error was caused by this hack. The key it's complaining about is an auto_increment field, and as far as I know there's no way to force it to insert a duplicate key like the one being reported. The SQL statement shown above also doesn't reference the key field it's complaining about.
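To illustrate the point about the omitted key column, here is a small Python sketch. It uses SQLite as a stand-in for MySQL, which is an assumption made only so the example is self-contained, and the `id` column name is hypothetical because the thread never names the key column. The table is also trimmed to three of the columns from the reported statement. When the INSERT lists no key column, the database assigns the next value itself, so the statement on its own never supplies a key.

```python
import sqlite3

conn = sqlite3.connect(":memory:")
conn.execute(
    "CREATE TABLE spider_watcher ("
    " id INTEGER PRIMARY KEY AUTOINCREMENT,"  # hypothetical key column name
    " bot TEXT, ip_address TEXT, page TEXT)"
)

# As in the reported statement, the INSERT lists no key column.
insert = "INSERT INTO spider_watcher (bot, ip_address, page) VALUES (?, ?, ?)"
ids = []
for _ in range(3):
    cur = conn.execute(insert, ("Unknown Spider", "1198658994", "index"))
    ids.append(cur.lastrowid)  # key value chosen by the database

print(ids)
```

Each identical INSERT receives its own, distinct key value.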

#304
05-01-2006, 01:50 AM
mikelbeck
Join Date: Jul 2005
Location: 4C6F6E672049736C616E642C2
Posts: 238

Quote:
Originally Posted by LiveWire
Great hack, but there is a very critical problem, for me at least, when viewing the page.

Page generated in 0.59078 seconds with 50 queries [Server Loads: 0.41 0.22 : 0.10]

That's quite a lot of queries. The more spiders it fetches, the more queries this adds.

True, and I don't see any way to reduce those queries right now. The only thing I can think of is to run a job that will occasionally "roll up" the spider data, so that when you view the spiders page it only displays data that's already been collated. But then the data displayed won't always be up to date. If anybody else has any ideas, I'm open to suggestions.
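One way the "roll up" job floated here could look, sketched in Python with SQLite standing in for MySQL. The `spider_summary` table and the `roll_up` function are hypothetical names; only `spider_watcher`, `nice_bot`, `ip_address` and `page` come from the SQL quoted earlier in the thread. The job collates everything with a single GROUP BY, so the report page would need only one SELECT no matter how many spiders were seen.

```python
import sqlite3

conn = sqlite3.connect(":memory:")
conn.executescript("""
    CREATE TABLE spider_watcher (nice_bot TEXT, ip_address TEXT, page TEXT);
    CREATE TABLE spider_summary (nice_bot TEXT PRIMARY KEY, pages INTEGER, ips INTEGER);
""")
# Sample rows, invented for illustration only.
conn.executemany(
    "INSERT INTO spider_watcher VALUES (?, ?, ?)",
    [
        ("Google", "66.249.66.1", "index"),
        ("Google", "66.249.66.2", "showthread"),
        ("Yahoo! Slurp", "72.30.0.1", "index"),
    ],
)


def roll_up(db):
    """Rebuild the summary table from the raw hits in one aggregate query."""
    with db:  # commit both statements together
        db.execute("DELETE FROM spider_summary")
        db.execute(
            "INSERT INTO spider_summary (nice_bot, pages, ips) "
            "SELECT nice_bot, COUNT(*), COUNT(DISTINCT ip_address) "
            "FROM spider_watcher GROUP BY nice_bot"
        )


roll_up(conn)  # would be run by the scheduled job, not on every page view

# The report page then needs a single query.
rows = conn.execute(
    "SELECT nice_bot, pages, ips FROM spider_summary ORDER BY nice_bot"
).fetchall()
print(rows)
```

The trade-off is the one already noted: the page shows whatever the last run of the job collated, not live data.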

#305
05-01-2006, 01:57 AM
The Notorious
Join Date: Jan 2006
Posts: 118

Quote:
Originally Posted by mikelbeck
I don't think this error was caused by this hack. The key it's complaining about is an auto_increment field, and as far as I know there's no way to force it to insert a duplicate key like the one being reported. The SQL statement shown above also doesn't reference the key field it's complaining about.

Well, as soon as I removed the hack it started working fine...

#306
05-01-2006, 03:16 AM
The Notorious
Join Date: Jan 2006
Posts: 118

I started to get the same e-mails again. Can you tell me exactly what this hack modifies on my board so I can check whether anything is left after the uninstall?
Thanks

#307
05-01-2006, 05:40 PM
mikelbeck
Join Date: Jul 2005
Location: 4C6F6E672049736C616E642C2
Posts: 238

Quote:
Originally Posted by The Notorious
I started to get the same e-mails again. Can you tell me exactly what this hack modifies on my board so I can check whether anything is left after the uninstall?
Thanks

If you used the un-install function, it should have removed everything.

It creates a few plug-ins:
Spider Watcher
Spiders Location (Part 1)
Spiders Location (Part 2)
Spider Watcher Template Group

and it creates a few templates in the "Spider Watcher Templates" group.

#308
05-01-2006, 05:46 PM
hambil
Join Date: Jun 2004
Location: Seattle
Posts: 1,719

Quote:
Originally Posted by mikelbeck
True, and I don't see any way to reduce those queries right now. The only thing I can think of is to run a job that will occasionally "roll up" the spider data, so that when you view the spiders page it only displays data that's already been collated. But then the data displayed won't always be up to date. If anybody else has any ideas, I'm open to suggestions.

It's not a high-traffic page. A lot of users jump on the 'queries bad' bandwagon a little too quickly. Queries make the vB world run.

#309
05-02-2006, 11:43 AM
The Notorious
Join Date: Jan 2006
Posts: 118

Quote:
Originally Posted by mikelbeck
If you used the un-install function, it should have removed everything.

It creates a few plug-ins:
Spider Watcher
Spiders Location (Part 1)
Spiders Location (Part 2)
Spider Watcher Template Group

and it creates a few templates in the "Spider Watcher Templates" group.

Thanks mate, I stopped getting the e-mails and the server is running just fine.

#310
05-03-2006, 11:06 PM
Dr.Viggy
Join Date: Apr 2006
Posts: 172

nice hack, thanks.

*installed

#311
05-04-2006, 01:18 AM
mikelbeck
Join Date: Jul 2005
Location: 4C6F6E672049736C616E642C2
Posts: 238

Quote:
Originally Posted by The Notorious
Thanks mate, I stopped getting the e-mails and the server is running just fine.

With or without this hack installed?