vb.org Archive


mikelbeck 01-05-2006 10:00 PM

Spider Watcher
 
Author: Mikel Beck (mikel.beck@elite-computing.net)


This hack keeps track of the spiders (search engine robots) that visit your forum. Every time a guest visits a page, the guest's IP address, user agent and the page they visited are logged to the database.

When somebody views the spider statistics page, this data is "rolled up": the raw data is collated, the spider's name is determined by comparing the user agent to the data contained in the spiders_vbulletin.xml file, and the number of pages and visits are summarized and written back to the database. In addition, any data from non-bots is removed.
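
The rollup described above can be sketched roughly as follows. This is only an illustration in Python, not the hack's actual PHP code: the XML layout, the sample user agents, and the function names (load_spiders, identify, rollup) are all assumptions made for the example. It also simply drops anything it cannot identify, whereas the hack reports unknown spiders separately.

```python
import xml.etree.ElementTree as ET
from collections import defaultdict

# Hypothetical spider list; the real file's layout may differ.
SPIDER_XML = """
<searchspiders>
  <spider ident="googlebot"><name>Google</name></spider>
  <spider ident="msnbot"><name>MSNBot</name></spider>
  <spider ident="slurp"><name>Yahoo! Slurp</name></spider>
</searchspiders>
"""


def load_spiders(xml_text):
    """Return a list of (lowercased ident substring, display name) pairs."""
    root = ET.fromstring(xml_text.strip())
    return [(s.get("ident").lower(), s.findtext("name")) for s in root.iter("spider")]


def identify(user_agent, spiders):
    """Return the name of the first spider whose ident appears in the user agent."""
    ua = user_agent.lower()
    for ident, name in spiders:
        if ident in ua:
            return name
    return None


def rollup(raw_hits, spiders):
    """Collapse raw (ip, user_agent, page) rows into one summary per spider."""
    summary = defaultdict(lambda: {"hits": 0, "ips": set(), "pages": set()})
    for ip, user_agent, page in raw_hits:
        name = identify(user_agent, spiders)
        if name is None:
            continue  # not in the list: treated as a non-bot here and dropped
        entry = summary[name]
        entry["hits"] += 1
        entry["ips"].add(ip)
        entry["pages"].add(page)
    return dict(summary)


# Made-up raw log rows, as the logging step might have stored them.
raw_hits = [
    ("66.249.66.102", "Mozilla/5.0 (compatible; Googlebot/2.1)", "showthread"),
    ("66.249.66.102", "Mozilla/5.0 (compatible; Googlebot/2.1)", "index"),
    ("207.46.98.48", "msnbot/1.0", "forumdisplay"),
    ("10.0.0.5", "Mozilla/5.0 (Windows NT 5.1) Firefox/1.5", "index"),
]

result = rollup(raw_hits, load_spiders(SPIDER_XML))
for name, entry in sorted(result.items()):
    print(name, entry["hits"], sorted(entry["pages"]))
```

Matching on a substring of the user agent keeps the spider list short, since one ident can cover many versions of the same bot.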

The data is then displayed in an easy-to-read format for your viewing pleasure.

If the user viewing the report has permissions to view IP addresses, these are displayed as well.

A live version of the report from one of my sites can be seen here: http://www.happyhourpub.com/spiders.php

Also see the attached screenshot for an example.


Revision History:
1.0.0 Beta 1 - 01/05/2006
- Initial Release

1.0.0 Beta 2 - 01/06/2006
- Included templates for spiders.php
- Removed text from templates, added them as phrases

1.0.0 Beta 3 - 01/07/2006
- Split up the display of "known" and "unknown" spiders

1.0.0 Beta 4 - 01/25/2006
- Corrected potential SQL injection issue in plug-in
- Reduced the number of SQL queries required to display statistics
- Corrected date/time display issue

1.0.0 Beta 5 - 02/01/2006
- Reduced the number of SQL queries required to display statistics

1.0.0 Beta 6 - 02/08/2006
- No release

1.0.0 Beta 7 - 02/11/2006
- Corrected issue with "unknown" spiders not being displayed properly.
- Added tracking of the type of spider (searchspider, link checker, etc)

1.0.0 Beta 8 - 02/19/2006
- Changed the display of IP addresses to be a pop-up so they're not all displayed on the main page.
- Combined the spiders that have the same name but different user agents.

1.0.0 Beta 9 - 03/10/2006
- Changed the display to group similar spiders together (search spiders, http check spiders, etc)

1.0.0 Beta 10 - 08/08/2006
- Changed how the rollup functions. Instead of rolling up every time somebody views the spider page, it rolls up once per hour.
- Corrected a few bugs here and there, mostly related to removing entries from the database.

Installation Instructions
1. Upload spiders.php to the root of your forum.
2. Upload spiders_rollup.php to the includes/cron directory.
3. Import the file product-spiderwatcher.xml using the Manage Products module.
4. Add a link to spiders.php on your navbar or footer.
5. Add a cron job with the following information:
Title: Spider Watcher Rollup
Day of the Week: *
Day of the Month: *
Hour: *
Minute: 0 - - -
Log entries: Yes
Filename: ./includes/cron/spiders_rollup.php
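
As a rough illustration of what these settings amount to (any day, any hour, at minute 0), the following Python sketch computes when an hourly task would next be due. The function name next_hourly_run is hypothetical, and this is not how vBulletin itself schedules tasks internally; it only shows the "once per hour, on the hour" idea.

```python
from datetime import datetime, timedelta


def next_hourly_run(now, minute=0):
    """Return the next time after `now` whose minute field equals `minute`."""
    candidate = now.replace(minute=minute, second=0, microsecond=0)
    if candidate <= now:
        # This hour's slot has already passed, so move to the next hour.
        candidate += timedelta(hours=1)
    return candidate


print(next_hourly_run(datetime(2006, 8, 8, 14, 30)))
```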

Upgrade Instructions
1. Upload (and overwrite) spiders.php to the root of your forum.
2. Upload spiders_rollup.php to the includes/cron directory.
3. Import the file product-spiderwatcher.xml using the Manage Products module. Make sure the "Allow Overwrite" option is set to "Yes".
4. Add a link to spiders.php on your navbar or footer.
5. Add a cron job with the following information:
Title: Spider Watcher Rollup
Day of the Week: *
Day of the Month: *
Hour: *
Minute: 0 - - -
Log entries: Yes
Filename: ./includes/cron/spiders_rollup.php

***UPGRADE NOTE***
When you upgrade from version 1.0.0 Beta 7 to 1.0.0 Beta 8 your existing spider data will be lost!


To make sure that you can decode the maximum number of spiders, you should grab the latest spiderlist.xml and replace the spiders_vbulletin.xml file in your forumhome/includes/xml/ directory with the one from this thread: http://www.vbulletin.com/forum/showthread.php?t=76662

The Chief 01-06-2006 02:17 AM

nice, I'll check it out later on :)

eViL_sTiGmA 01-06-2006 02:54 AM

How do i see spiders who visited my site????

just go to www.mysite.com/forum/spiders.php

??

swa 01-06-2006 03:14 AM

doesn't work on 3.51

mikelbeck 01-06-2006 03:18 AM

I've just realized that the required templates aren't included in the zip file.

Sorry about that, I'll correct this in the morning and will post an updated archive.

Zia 01-06-2006 03:36 AM

It's nice... really nice... I've been looking for this for a long time. But what about the spider list? Does it detect almost all spiders, or only a few common ones like Yahoo, Google, MSN...
Besides these, there are many spiders out there... I hope it is able to detect all of them...

Thanx

mikelbeck 01-06-2006 03:41 AM

Quote:

Originally Posted by Zia
It's nice... really nice... I've been looking for this for a long time. But what about the spider list? Does it detect almost all spiders, or only a few common ones like Yahoo, Google, MSN...
Besides these, there are many spiders out there... I hope it is able to detect all of them...

Thanx

It will detect any of the spiders listed in this file:

Quote:

To make sure that you can decode the maximum number of spiders, you should grab the latest spiderlist.xml and replace the spiders_vbulletin.xml file in your forumhome/includes/xml/ directory with the one from this thread: http://www.vbulletin.com/forum/showthread.php?t=76662

The way it sits now, it's not working; I'm having an issue with the templates for the spiders.php page. I'll have a functional version in the morning. Sorry for this.

mikelbeck 01-06-2006 09:54 AM

The zip file for this hack has been updated to include the templates for spiders.php. If you have already installed this, you can re-install it; just be sure that "Allow Overwrite" is checked when you upload the XML.

mikelbeck 01-06-2006 12:17 PM

Beta 2 of this modification is available.

- Included templates for spiders.php
- Removed text from templates, added them as phrases

Upgrade Instructions
1. Uninstall the previous version using the Manage Products module.
2. Upload (and overwrite) spiders.php to the root of your forum.
3. Import the file product-spiderwatcher.xml using the Manage Products module.
4. Add a link to spiders.php on your navbar or footer.

To the people who have installed this - can you post a link to your spiders page?

nexialys 01-06-2006 12:21 PM

2 thumbs on the spiders... :)

eViL_sTiGmA 01-06-2006 02:04 PM

Now works, thank you, i love it =D

Snake 01-06-2006 04:05 PM

Whoa just installed this hack and works great! Thanks bro!

mikelbeck 01-06-2006 04:10 PM

Quote:

Originally Posted by Aftermath
Whoa just installed this hack and works great! Thanks bro!

Nice to hear that!

Do you have a link to where you have it installed?

puertoblack2003 01-06-2006 05:10 PM

Quote:

Originally Posted by swa
doesn't work on 3.51


It does work; I've got it installed. :ermm:

silurius 01-07-2006 06:51 AM

I don't know why, but I love this hack!

*Clicks install

Could you briefly outline the justification for watching spiders? Is there a real business or security benefit, or is it just satisfying?

Also, I assume there is no security risk associated with allowing any old viewer to view the Spider Watcher page?

Allan 01-07-2006 07:11 AM

Thanks ;)

Click Install

Hornstar 01-07-2006 08:21 AM

I'm going to install this as i requested this a while back. This is what i've been looking for ^^

nice work. Hope to see more from you :D

bashy 01-07-2006 08:40 AM

Hi

Thanks for this, I have installed,
Working very well thanks :)
Heres the link for the working hack ;)

http://www.bashys-place.com/forums/spiders.php

Regards

Bashy

Lizard King 01-07-2006 11:05 AM

Working pretty good thanks

i have the latest vbulletin_spiders.xml but i still get unknown spiders.
Check the list http://www.ayyas.com/spiders.php

mikelbeck 01-07-2006 12:04 PM

Quote:

Originally Posted by silurius
I don't know why, but I love this hack!

*Clicks install

Could you briefly outline the justification for watching spiders? Is there a real business or security benefit, or is it just satisfying?

Also, I assume there is no security risk associated with allowing any old viewer to view the Spider Watcher page?

I think it's just satisfying to know that your hard work on the SEO for your forum attracts spiders... And with this you can see when their last visit was, and how many pages they've viewed.

As for being a security risk, I don't see how it would be.

Zia 01-07-2006 03:26 PM

Dear mikelbeck

Thanks for this very cool hack... but when are you going to release the final version?

And one more point: if guest access is disabled on the forum, can spiders still crawl it?
If not, is there any way to make spiders crawl the forum?
It changes the DB... I hope it won't create any problems in the SQL DB.

:) Also looking for more modified hacks...

Thanks... waiting for the final version...

If not, can you make it so that spiders can crawl the forum when guest access is disabled?

mikelbeck 01-07-2006 04:02 PM

Quote:

Originally Posted by Zia
Dear mikelbeck

Thanks for this very cool hack... but when are you going to release the final version?

I've got one or two things I'd like to add/change on this. Once that's done, I'll leave it in beta for a little while to make sure there are no issues, and then I'll release the 1.0.0 version.

Quote:

And one more point: if guest access is disabled on the forum, can spiders still crawl it?

No, I'm pretty sure you need to allow guest access to your forums for the spiders to get in. The spiders don't log in with a vBulletin username, so they are guests.

Quote:

If not, is there any way to make spiders crawl the forum?

Not that I'm aware of. Unless you enable the archive and allow guests to view that, I guess.

Quote:

It changes the DB... I hope it won't create any problems in the SQL DB.

It doesn't.

Quote:

:) Also looking for more modified hacks...

What else would you want to see in it?

Quote:

If not, can you make it so that spiders can crawl the forum when guest access is disabled?

That's got nothing to do with this hack; that has to do with how you configure your forum.

bashy 01-07-2006 04:04 PM

Hi

Is there any way to make an unknown spider known?
I have the up-to-date spider xml, so I'm not sure why it would be like this?

Bashy

mikelbeck 01-07-2006 04:06 PM

Quote:

Originally Posted by bashy
Hi

Is there any way to make an unknown spider known?
I have the up-to-date spider xml, so I'm not sure why it would be like this?

Bashy

No, it's only able to decode those that are listed in the updated xml file. You could update your xml file, or ask the people who are maintaining it to add the spiders that you're seeing.

bashy 01-07-2006 04:08 PM

ok no worries thanks anyway, great hack :)

Bashy

Quote:

Originally Posted by mikelbeck
No, it's only able to decode those that are listed in the updated xml file. You could update your xml file, or ask the people who are maintaining it to add the spiders that you're seeing.


LBSources 01-07-2006 04:50 PM

works well .. thanks!

/me clicks install.

nitro 01-07-2006 05:45 PM

Looks like it's working OK in 3.5.0 patched :)

Is there any easy way to add the "what's going on" box, or more importantly the "who's online" box, to the page?

mikelbeck 01-07-2006 06:45 PM

Quote:

Originally Posted by nitro
Looks like it's working OK in 3.5.0 patched :)

Is there any easy way to add the "what's going on" box, or more importantly the "who's online" box, to the page?

Do you mean the box that's displayed at the bottom of forumhome?

I would think that you could just copy that block from the forumhome template and put it into the spiderwatcher_spiders template.

bashy 01-07-2006 06:49 PM

Hi

I am also interested in this, could you please tell me exactly the right code to copy from forumhome and to exactly where to put it please?

Regards

Bashy


Quote:

Originally Posted by mikelbeck
Do you mean the box that's displayed at the bottom of forumhome?

I would think that you could just copy that block from the forumhome template and put it into the spiderwatcher_spiders template.


nitro 01-07-2006 07:12 PM

Quote:

Originally Posted by mikelbeck
Do you mean the box that's displayed at the bottom of forumhome?

I would think that you could just copy that block from the forumhome template and put it into the spiderwatcher_spiders template.

That would just display the block and images with no info. I guess I will have to try to find the PHP code in the vB files and take pot luck on adding it to spiders.php.

Lizard King 01-07-2006 09:11 PM

Quote:

Originally Posted by nitro
That would just display the block and images with no info. I guess I will have to try to find the PHP code in the vB files and take pot luck on adding it to spiders.php.

https://vborg.vbsupport.ru/showthread.php?t=104591

nitro 01-07-2006 10:02 PM

Thanks, that works great. Best click install there as well ;)


I've noticed this spider product also detects JumpD's RSS article bot as an unknown spider.

I don't mind that, as it shows that bot's hit counter as well, but how would I add this bot to the spiders xml file so that it shows as, say, Articlebot instead of unknown? It has an IP of 0.0.0.0 and I don't think it has an agent, but I may be wrong.

Zenith 01-07-2006 11:36 PM

Tidy hack, thanks.

I notice that the last visited date is out though. I only installed a few minutes ago but the few spiders already listed are showing "Yesterday". Time is correct, but it should be "Today".

Clicked install...

mikelbeck 01-08-2006 12:53 AM

Version 1.0.0 Beta 3 is available.

1.0.0 Beta 3 - 01/07/2006
- Split up the display of "known" and "unknown" spiders

Upgrade Instructions
1. Upload (and overwrite) spiders.php to the root of your forum.
2. Import the file product-spiderwatcher.xml using the Manage Products module. Make sure the "Allow Overwrite" option is set to "Yes".
3. Add a link to spiders.php on your navbar or footer.

Zia 01-08-2006 04:14 AM

Thankx Man.. for real kool support..


-Unknown Spider | Googlebot-Image/1.0 | 01-07-2006 01:09 AM | 66.249.65.202 | errordocs | 2

-Unknown Spider | Mozilla/3.01 (compatible;) | 01-07-2006 03:48 AM | 69.88.30.146, 202.52.197.55, 202.52.197.56, 210.50.143.21 | cron, errordocs | 213

-Unknown Spider | Mozilla/4.0 (compatible; Google Desktop) | 01-07-2006 11:55 AM | 70.236.181.179 | errordocs, external | 7



Hmm, one spider hit 213 times, and it's detected as an unknown spider.
Can you please identify these? Most of the time they hit errordocs.
How can we find out exactly where a bot hits an error doc?
We need to know which bot sees an error doc, and where.


And which bots are doing harm to the forum... is there any way to detect that from the tracker page?

Please help us, thanks.

bashy 01-08-2006 06:55 PM

Hi..

What I find interesting is that all the spiders are showing as last online 20 hours ago. I know for a fact that most have been in all day long (they very rarely leave ;)), so why is it saying 20 hours ago when it clearly isn't? lol

Please advise?

Bashy

Code:

Spider Name (Known Spiders) | Last Visit | IP Address(es) | Page | Hits
Google AdSense | 20 Hours Ago 08:37 PM | 66.249.66.102 | THIS_SCRIPT, forumdisplay, index, login, member, misc, newreply, newthread, online, pager, poll, private, profile, register, search, showthread | 347
Google | 20 Hours Ago 08:48 PM | 66.249.66.102 | index, showthread | 25
MSNBot | 20 Hours Ago 04:46 PM | 207.46.98.48 | VBGooglemap, calendar, forumdisplay, index, jukebox, member, memberlist, search, showthread, vbradio | 44
Yahoo! Slurp | 20 Hours Ago 08:10 PM | 68.142.249.14, 68.142.249.160, 68.142.250.17, 68.142.251.166 | forumdisplay, member, showthread | 4

Spider Name (Unknown Spiders) | Last Visit | IP Address(es) | Page | Hits
Unknown Spider | 20 Hours Ago 08:00 PM | 82.110.105.18 | external | 76
Unknown Spider: FeedBurner/1.0 (http://www.FeedBurner.com) | 20 Hours Ago 08:16 PM | 66.150.96.109 | external | 30
Unknown Spider: Jakarta Commons-HttpClient/3.0-rc2 | 20 Hours Ago 08:16 PM | 206.188.0.11, 209.18.119.137, 209.18.119.138 | showthread | 18
Unknown Spider: Mozilla/4.0 (compatible; Google Desktop) | 20 Hours Ago 03:38 PM | 86.142.103.72 | external | 5
Unknown Spider: Yahoo-Blogs/v3.9 (compatible; Mozilla 4.0; MSIE 5.5; http://help.yahoo.com/help/us/ysearc...awling-02.html ) | 20 Hours Ago 08:24 PM | 209.191.83.13 | showthread | 47


mikelbeck 01-08-2006 07:59 PM

That is strange...

The only thing I can think is that the server time zone is set to something different from what yours is.

What the plugin does is write a record into the database, and the timestamp it uses is "NOW()", which is the current time. I think it uses the server's current time, but I'm not positive. So if the server's time is different from your time (if it's set to GMT, for example, and your time zone is GMT-4), then the times will be different.

I'll have a look to see if it can be set to the board's current time instead of using the "NOW()" function.
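
The time zone mismatch suspected above can be illustrated with a small Python sketch. It is not the plugin's code: the helper names (stored_as_naive, apparent_age) and the sample times are made up for the example, and it assumes the stored value is a plain wall-clock time with no zone attached, which is how a "NOW()"-style value is modelled here.

```python
from datetime import datetime, timedelta, timezone


def stored_as_naive(moment_utc, server_offset_hours):
    """Mimic a NOW()-style value: the server's local wall-clock time, zone dropped."""
    server_tz = timezone(timedelta(hours=server_offset_hours))
    return moment_utc.astimezone(server_tz).replace(tzinfo=None)


def apparent_age(stored, now_utc, assumed_offset_hours):
    """Age of the record when the stored value is read as if it were in another zone."""
    assumed_tz = timezone(timedelta(hours=assumed_offset_hours))
    interpreted = stored.replace(tzinfo=assumed_tz)
    return now_utc - interpreted


# A visit that really happened five minutes ago.
visit = datetime(2006, 1, 8, 20, 0, tzinfo=timezone.utc)
now = visit + timedelta(minutes=5)

stored = stored_as_naive(visit, server_offset_hours=0)  # server clock on GMT

print(apparent_age(stored, now, assumed_offset_hours=0))   # zones agree
print(apparent_age(stored, now, assumed_offset_hours=-4))  # viewer on GMT-4
```

When the two offsets agree, the computed age is the real one; when they differ, the age is off by exactly the difference between the zones.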

bashy 01-08-2006 08:01 PM

Hi mikelbeck

Cheers for the quick response...
The server is in the UK as i am therefore we are on the same timezone :)

Bashy

Quote:

Originally Posted by mikelbeck
That is strange...

The only thing I can think is that the server time zone is set to something different from what yours is.

What the plugin does is write a record into the database, and the timestamp it uses is "NOW()", which is the current time. I think it uses the server's current time, but I'm not positive. So if the server's time is different from your time (if it's set to GMT, for example, and your time zone is GMT-4), then the times will be different.

I'll have a look to see if it can be set to the board's current time instead of using the "NOW()" function.


mikelbeck 01-08-2006 08:09 PM

Quote:

Originally Posted by bashy
Hi mikelbeck

Cheers for the quick response...
The server is in the UK as i am therefore we are on the same timezone :)

Bashy

Can you go into your admin control panel, click on "Maintenance", then "Diagnostics", then select "MySQL Variables" in the "System Information" box and click "Submit"? What value does it show for "timezone", and what is your timezone? And also, what is your forum's timezone set to?

bashy 01-08-2006 08:13 PM

Hi m8

MySQL Variables
system_time_zone GMT

My time zone is GMT
Forum Timezone is GMT also

I dont suppose this helps you any now does it lol

Bashy


All times are GMT. The time now is 06:09 PM.
