Go Back   vb.org Archive > vBulletin Modifications > Archive > vB.org Archives > vBulletin 3.5 > vBulletin 3.5 Add-ons
FAQ Community Calendar Today's Posts Search

Reply
 
Thread Tools
Spider Watcher Details »»
Spider Watcher
Version: 1.0.0 B10, by mikelbeck mikelbeck is offline
Developer Last Online: Feb 2016 Show Printable Version Email this Page

Version: 3.5.4 Rating:
Released: 01-05-2006 Last Update: 08-08-2006 Installs: 194
DB Changes Uses Plugins Template Edits
Additional Files Is in Beta Stage  
No support by the author.

Spider Watcher
Author: Mikel Beck (mikel.beck@elite-computing.net)


This hack keeps track of the spiders (Search Engine robots) that visit your fourm. Every time a guest visits a page, the guest's IP address, user agent and the page they visited are logged to the database.

When somebody views the spider statistics page, this data is "rolled up", meaning the raw data is collated, the spider's name is determined by comparing the user agent to data contained in the spiders_bulletin.xml file, and the number of pages and visits is summarized and writted back to the database. In addition, and data from non-bots is removed.

The data is then displayed in a easy to read format for your viewing pleasure.

If the user viewing the report has permissions to view IP addresses, these are displayed as well.

A live version of the report from one of my sites can be seen here: http://www.happyhourpub.com/spiders.php

Also see the attached screenshot for an exmaple.


Revision History:
1.0.0 Beta 1 - 01/05/2006
- Initial Release

1.0.0 Beta 2 - 01/06/2006
- Included templates for spiders.php
- Removed text from templates, added them as phrases

1.0.0 Beta 3 - 01/07/2006
- Split up the display of "known" and "unknown" spiders

1.0.0 Beta 4 - 01/25/2006
- Corrected potentional SQL injection issue in plug-in
- Reduced the number of SQL queries required to display statistics
- Corrected date/time display issue

1.0.0 Beta 5 - 02/01/2006
- Reduced the number of SQL queries required to display statistics

1.0.0 Beta 6 - 02/08/2006
- No release

1.0.0 Beta 7 - 02/11/2006
- Corrected issue with "unknown" spiders not being displayed properly.
- Added tracking of the type of spider (searchspider, link checker, etc)

1.0.0 Beta 8 - 02/19/2006
- Change the display of IP addresses to be a pop-up so they're all not displayed on the main page.
- Combined the spiders that have the same name but different user agents.

1.0.0 Beta 9 - 03/10/2006
- Changed the display to group similar spiders together (search spiders, http check spiders, etc)

1.0.0 Beta 10 - 08/08/2006
- Changed how the rollup functions. Instead of rolling up every time somebody views the spider page, it rolls up once per hour.
- Corrected a few bugs here and there, mostly related to removing entries from the database.

Installation Instructions
1. Upload spiders.php to the root of your forum.
2. Upload spiders_rollup.php to the includes/cron directory.
3. Import the file product-spiderwatcher.xml using the Manage Products module.
4. Add a link to spiders.php on your navbar or footer.
5. Add a cron job with the following information:
Title: Spider Watcher Rollup
Day of the Week: *
Day of the Month: *
Hour: *
Minute: 0 - - -
Log entries: Yes
Filename: ./includes/cron/spiders_rollup.php

Upgrade Instructions
1. Upload (and overwrite) spiders to the root of your forum.
2. Upload spiders_rollup.php to the includes/cron directory.
3. Import the file product-spiderwatcher.xml using the Manage Products module. Make sure the "Allow Overwrite" option is set to "Yes".
4. Add a link to spiders.php on your navbar or footer.
5. Add a cron job with the following information:
Title: Spider Watcher Rollup
Day of the Week: *
Day of the Month: *
Hour: *
Minute: 0 - - -
Log entries: Yes
Filename: ./includes/cron/spiders_rollup.php

***UPGRADE NOTE***
When you upgrade from version 1.0.0 Beta 7 to 1.0.0 Beta 8 your existing spider data will be lost!


To make sure that you can decode the maximum amount of spiders, you should grab the latest spiderlist.xml and replace the spiders_vbulletin.xml file in your forumhome/includes/xml/ directory with the one from this thread: http://www.vbulletin.com/forum/showthread.php?t=76662

Supporters / CoAuthors

Show Your Support

  • This modification may not be copied, reproduced or published elsewhere without author's permission.

Comments
  #32  
Old 01-07-2006, 09:11 PM
Lizard King Lizard King is offline
 
Join Date: Jan 2005
Location: Mersin
Posts: 907
Благодарил(а): 0 раз(а)
Поблагодарили: 0 раз(а) в 0 сообщениях
Default

Quote:
Originally Posted by nitro
That would just display the block and images with no info. Guess I will have to try an find the php code in vb files and take pot luck on adding it to the spiders.php
https://vborg.vbsupport.ru/showthread.php?t=104591
Reply With Quote
  #33  
Old 01-07-2006, 10:02 PM
nitro nitro is offline
 
Join Date: Nov 2001
Posts: 302
Благодарил(а): 0 раз(а)
Поблагодарили: 0 раз(а) в 0 сообщениях
Default

Thanks that works great best click install there aswell


Iv noticed this spider product also detects JumpD's RSS article bot as an unknown spider.

I dont mind that as it shows that bots hit counter aswell but how would I add to the spiders xml file this bot which has an ip of 0.0.0.0 and i dont think it has an agent but I maybe wrong so it shows as say Articlebot instead of unknown ?
Reply With Quote
  #34  
Old 01-07-2006, 11:36 PM
Zenith Zenith is offline
 
Join Date: Dec 2001
Posts: 7
Благодарил(а): 0 раз(а)
Поблагодарили: 0 раз(а) в 0 сообщениях
Default

Tidy hack, thanks.

I notice that the last visited date is out though. I only installed a few minutes ago but the few spiders already listed are showing "Yesterday". Time is correct, but it should be "Today".

Clicked install...
Reply With Quote
  #35  
Old 01-08-2006, 12:53 AM
mikelbeck's Avatar
mikelbeck mikelbeck is offline
 
Join Date: Jul 2005
Location: 4C6F6E672049736C616E642C2
Posts: 238
Благодарил(а): 0 раз(а)
Поблагодарили: 0 раз(а) в 0 сообщениях
Default

Version 1.0.0 Beta 3 is available.

1.0.0 Beta 3 - 01/07/2006
- Split up the display of "known" and "unknown" spiders

Upgrade Instructions
1. Upload (and overwrite) spiders.php to the root of your forum.
2. Import the file product-spiderwatcher.xml using the Manage Products module. Make sure the "Allow Overwrite" option is set to "Yes".
3. Add a link to spiders.php on your navbar or footer.
Reply With Quote
  #36  
Old 01-08-2006, 04:14 AM
Zia's Avatar
Zia Zia is offline
 
Join Date: Dec 2005
Location: golpo.net
Posts: 931
Благодарил(а): 0 раз(а)
Поблагодарили: 0 раз(а) в 0 сообщениях
Default

Thankx Man.. for real kool support..


-Unknown Spider
Googlebot-Image/1.0 01-07-2006
01:09 AM 66.249.65.202 errordocs 2

-Unknown Spider
Mozilla/3.01 (compatible 01-07-2006
03:48 AM 69.88.30.146, 202.52.197.55, 202.52.197.56, 210.50.143.21 cron, errordocs 213

-Unknown Spider
Mozilla/4.0 (compatible; Google Desktop) 01-07-2006
11:55 AM 70.236.181.179 errordocs, external 7



umm one spider hit 213 times..its detected as unknown spider..
can u plz mark them?? and most of time it face errordoc..
how could we detect that..right where it face error doc?
we need to know which bot see error doc in where?


and which bot is doing harm to forum...any way to detect from the tracker page?

plz thanks...help us
Reply With Quote
  #37  
Old 01-08-2006, 06:55 PM
bashy bashy is offline
 
Join Date: Nov 2005
Posts: 2,544
Благодарил(а): 0 раз(а)
Поблагодарили: 0 раз(а) в 0 сообщениях
Default

Hi..

What i find intersting is the fact that all the spiders are showing as being online 20 hours ago...I know for a fact that most have been in all day long, they very rarely leave so how can it be saying 20hrs ago or should i say why is it saying 20 when it clearly isnt lol

Please advise?

Bashy

Code:
Spider Name (Known Spiders) Last Visit IP Address(es) Page Hits 
Google AdSense 20 Hours Ago
08:37 PM 66.249.66.102 THIS_SCRIPT, forumdisplay, index, login, member, misc, newreply, newthread, online, pager, poll, private, profile, register, search, showthread 347 
Google 20 Hours Ago
08:48 PM 66.249.66.102 index, showthread 25 
MSNBot 20 Hours Ago
04:46 PM 207.46.98.48 VBGooglemap, calendar, forumdisplay, index, jukebox, member, memberlist, search, showthread, vbradio 44 
Yahoo! Slurp 20 Hours Ago
08:10 PM 68.142.249.14, 68.142.249.160, 68.142.250.17, 68.142.251.166 forumdisplay, member, showthread 4 

Spider Name (Unknown Spiders) Last Visit IP Address(es) Page Hits 
Unknown Spider
 20 Hours Ago
08:00 PM 82.110.105.18 external 76 
Unknown Spider
FeedBurner/1.0 (http://www.FeedBurner.com) 20 Hours Ago
08:16 PM 66.150.96.109 external 30 
Unknown Spider
Jakarta Commons-HttpClient/3.0-rc2 20 Hours Ago
08:16 PM 206.188.0.11, 209.18.119.137, 209.18.119.138 showthread 18 
Unknown Spider
Mozilla/4.0 (compatible; Google Desktop) 20 Hours Ago
03:38 PM 86.142.103.72 external 5 
Unknown Spider
Yahoo-Blogs/v3.9 (compatible; Mozilla 4.0; MSIE 5.5; http://help.yahoo.com/help/us/ysearc...awling-02.html ) 20 Hours Ago
08:24 PM 209.191.83.13 showthread 47
Reply With Quote
  #38  
Old 01-08-2006, 07:59 PM
mikelbeck's Avatar
mikelbeck mikelbeck is offline
 
Join Date: Jul 2005
Location: 4C6F6E672049736C616E642C2
Posts: 238
Благодарил(а): 0 раз(а)
Поблагодарили: 0 раз(а) в 0 сообщениях
Default

That is strange...

The only thing I can think is that the server time zone is set to something different from what yours is.

What the plugin does is write a record into the database, and the timestamp it uses is "NOW()", which is the current time. I think it uses the server's current time, but I'm not positive. So if the server's time is different from your time (if it's set to GMT, for example, and your time zone is GMT-4), then the times will be different.

I'll have a look to see if it can be set to the board's current time instead of using the "NOW()" function.
Reply With Quote
  #39  
Old 01-08-2006, 08:01 PM
bashy bashy is offline
 
Join Date: Nov 2005
Posts: 2,544
Благодарил(а): 0 раз(а)
Поблагодарили: 0 раз(а) в 0 сообщениях
Default

Hi mikelbeck

Cheers for the quick response...
The server is in the UK as i am therefore we are on the same timezone

Bashy

Quote:
Originally Posted by mikelbeck
That is strange...

The only thing I can think is that the server time zone is set to something different from what yours is.

What the plugin does is write a record into the database, and the timestamp it uses is "NOW()", which is the current time. I think it uses the server's current time, but I'm not positive. So if the server's time is different from your time (if it's set to GMT, for example, and your time zone is GMT-4), then the times will be different.

I'll have a look to see if it can be set to the board's current time instead of using the "NOW()" function.
Reply With Quote
  #40  
Old 01-08-2006, 08:09 PM
mikelbeck's Avatar
mikelbeck mikelbeck is offline
 
Join Date: Jul 2005
Location: 4C6F6E672049736C616E642C2
Posts: 238
Благодарил(а): 0 раз(а)
Поблагодарили: 0 раз(а) в 0 сообщениях
Default

Quote:
Originally Posted by bashy
Hi mikelbeck

Cheers for the quick response...
The server is in the UK as i am therefore we are on the same timezone

Bashy
Can you go into your admin control panel, click on "Maintenenace", then "Diagnostics", then select "MySQL Variables" in the "System Information" box and click "Submit". What value does it show for "timezone", and what is your timezone? And also, what is your forum's timezone set to?
Reply With Quote
  #41  
Old 01-08-2006, 08:13 PM
bashy bashy is offline
 
Join Date: Nov 2005
Posts: 2,544
Благодарил(а): 0 раз(а)
Поблагодарили: 0 раз(а) в 0 сообщениях
Default

Hi m8

MySQL Variables
system_time_zone GMT

My time zone is GMT
Forum Timezone is GMT also

I dont suppose this helps you any now does it lol

Bashy
Reply With Quote
Reply


Posting Rules
You may not post new threads
You may not post replies
You may not post attachments
You may not edit your posts

BB code is On
Smilies are On
[IMG] code is On
HTML code is Off

Forum Jump


All times are GMT. The time now is 11:27 PM.


Powered by vBulletin® Version 3.8.12 by vBS
Copyright ©2000 - 2025, vBulletin Solutions Inc.
X vBulletin 3.8.12 by vBS Debug Information
  • Page Generation 0.06273 seconds
  • Memory Usage 2,324KB
  • Queries Executed 26 (?)
More Information
Template Usage:
  • (1)SHOWTHREAD
  • (1)ad_footer_end
  • (1)ad_footer_start
  • (1)ad_header_end
  • (1)ad_header_logo
  • (1)ad_navbar_below
  • (1)ad_showthread_beforeqr
  • (1)bbcode_code
  • (3)bbcode_quote
  • (1)footer
  • (1)forumjump
  • (1)forumrules
  • (1)gobutton
  • (1)header
  • (1)headinclude
  • (1)modsystem_post
  • (1)navbar
  • (6)navbar_link
  • (120)option
  • (1)pagenav
  • (1)pagenav_curpage
  • (4)pagenav_pagelink
  • (1)pagenav_pagelinkrel
  • (11)post_thanks_box
  • (11)post_thanks_button
  • (1)post_thanks_javascript
  • (1)post_thanks_navbar_search
  • (11)post_thanks_postbit_info
  • (10)postbit
  • (11)postbit_onlinestatus
  • (11)postbit_wrapper
  • (1)spacer_close
  • (1)spacer_open
  • (1)tagbit_wrapper 

Phrase Groups Available:
  • global
  • inlinemod
  • postbit
  • posting
  • reputationlevel
  • showthread
Included Files:
  • ./showthread.php
  • ./global.php
  • ./includes/init.php
  • ./includes/class_core.php
  • ./includes/config.php
  • ./includes/functions.php
  • ./includes/class_hook.php
  • ./includes/modsystem_functions.php
  • ./includes/functions_bigthree.php
  • ./includes/class_postbit.php
  • ./includes/class_bbcode.php
  • ./includes/functions_reputation.php
  • ./includes/functions_post_thanks.php 

Hooks Called:
  • init_startup
  • init_startup_session_setup_start
  • init_startup_session_setup_complete
  • cache_permissions
  • fetch_threadinfo_query
  • fetch_threadinfo
  • fetch_foruminfo
  • style_fetch
  • cache_templates
  • global_start
  • parse_templates
  • global_setup_complete
  • showthread_start
  • showthread_getinfo
  • forumjump
  • showthread_post_start
  • showthread_query_postids
  • showthread_query
  • bbcode_fetch_tags
  • bbcode_create
  • showthread_postbit_create
  • postbit_factory
  • postbit_display_start
  • post_thanks_function_post_thanks_off_start
  • post_thanks_function_post_thanks_off_end
  • post_thanks_function_fetch_thanks_start
  • post_thanks_function_fetch_thanks_end
  • post_thanks_function_thanked_already_start
  • post_thanks_function_thanked_already_end
  • fetch_musername
  • postbit_imicons
  • bbcode_parse_start
  • bbcode_parse_complete_precache
  • bbcode_parse_complete
  • postbit_display_complete
  • post_thanks_function_can_thank_this_post_start
  • pagenav_page
  • pagenav_complete
  • tag_fetchbit_complete
  • forumrules
  • navbits
  • navbits_complete
  • showthread_complete