Nexia's Web Crawlers Bundle
Version: 1.00, by vbenhancer
Developer Last Online: Nov 2012

Category: Miscellaneous Hacks - Version: 3.8.x
Released: 03-04-2012 - Last Update: Never - Installs: 29
DB Changes - Uses Plugins - Additional Files
No support by the author.

A bundle of features for managing spider (web crawler) access: hit tracking, a complete listing for information, a specific usergroup for spiders, etc.

It's basically a merge of some small add-ons I wrote in the past. As I did not want to release a ton of minimal tools that just fit together, I made a real bundle: 4 or 5 tools together, with activation and permission settings where needed.

Spiders List: a little spider tracker for your forum. It does not track each page the engine views, because that is pointless. Instead, it lists the name of each spider that visits your site, the date of its last visit, the number of unique visits and the number of pages viewed. That information is not very important for the indexing of your site, but it helps you see why your site may be busy or not. You can then take action if a crawler keeps visiting and still gives no results on search engines.
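
To make the four tracked values concrete, here is a minimal sketch in Python (not the mod's actual PHP code). The names `SpiderStats`, `record_hit` and `VISIT_TIMEOUT` are hypothetical, and the rule that a pause longer than the timeout starts a new "unique visit" is an assumption, since the post does not say how visits are counted.

```python
from dataclasses import dataclass

# Assumption: a gap longer than this (in seconds) starts a new visit.
VISIT_TIMEOUT = 900


@dataclass
class SpiderStats:
    name: str
    last_visit: int = 0   # Unix timestamp of the most recent hit
    visits: int = 0       # number of separate visits
    page_views: int = 0   # total pages requested


def record_hit(stats, name, now):
    """Update the per-crawler totals for one page hit at time `now`."""
    entry = stats.setdefault(name, SpiderStats(name))
    # First hit ever, or a long pause since the last one: count a new visit.
    if entry.visits == 0 or now - entry.last_visit > VISIT_TIMEOUT:
        entry.visits += 1
    entry.page_views += 1
    entry.last_visit = now
    return entry
```

Only one small record per crawler is kept, which matches the idea of not logging every page the engine views.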

Specific Usergroup for Spiders: I released this add-on on vb.org a long time ago, and its source was copied, but this version is updated and has more flexibility. You simply choose the proper usergroup in the settings, so that when a spider/crawler visits your site, it is treated as having that usergroup's permissions. It's useful if you do not want to fill your robots.txt file with strange access blocks. This lets you give crawlers access to profiles but not to visitor messages, etc.
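
The idea can be sketched roughly as follows, in Python rather than the mod's PHP. Everything here is an assumption for illustration: the sample identifiers, the usergroup IDs, and the case-insensitive substring match on the User-Agent string are not taken from the bundle's source.

```python
from typing import Optional

# Hypothetical sample; a real spiders list would be much longer.
SPIDER_IDENTS = {
    "googlebot": "Google",
    "bingbot": "Bing",
    "mj12bot": "Majestic MJ12bot",
}

# Hypothetical usergroup IDs, chosen only for this example.
GUEST_GROUP_ID = 1
SPIDER_GROUP_ID = 9


def detect_spider(user_agent: str) -> Optional[str]:
    """Return the crawler's display name, or None for ordinary visitors."""
    ua = user_agent.lower()
    for ident, name in SPIDER_IDENTS.items():
        if ident in ua:
            return name
    return None


def usergroup_for(user_agent: str) -> int:
    """Pick the usergroup whose permissions apply to this request."""
    if detect_spider(user_agent) is not None:
        return SPIDER_GROUP_ID
    return GUEST_GROUP_ID
```

Permissions then come from the chosen usergroup instead of from extra rules in robots.txt.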

Also remember to follow the TOS of the search engines you are registered with. Until recently, Google was blocking sites that were ghosting their content.

Display Spiders in WOL: spiders are shown in Who's Online and on any page showing "Currently Active Users" (showthread, forumdisplay, etc.). That way, you see where these beasts are visiting.

... some other tools may join the bundle; I'll see later!

CRON JOB:
To make it easier on the server, a cron job stores the hourly stats about the crawlers. Once the cron job has run once (it's the cron named Hourly #1), the stats appear in the right place.
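
As a rough illustration of what an hourly roll-up could look like (a Python sketch, not the bundle's cron script), the function below groups raw hits into one counter per crawler per hour. The name `hourly_rollup` and the `(timestamp, spider_name)` input format are hypothetical.

```python
from collections import Counter


def hourly_rollup(hits):
    """Count hits per (hour_start, spider_name).

    `hits` is an iterable of (unix_timestamp, spider_name) pairs.
    """
    totals = Counter()
    for timestamp, name in hits:
        # Round the timestamp down to the start of its hour.
        hour_start = timestamp - timestamp % 3600
        totals[(hour_start, name)] += 1
    return dict(totals)
```

Summarising once per hour keeps the per-page work small, which is the point of moving the stats into a cron job.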

...update: May 1st, 10:50: a small change. The Crawlers listing will now update the spiders list in cache if the file has changed, so you can update it when needed.
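
One common way to do this kind of refresh is to compare the file's modification time with the one seen at the last load. The Python sketch below shows that approach under stated assumptions: the class name `SpiderListCache` and the one-identifier-per-line file format are hypothetical, and the mod itself may detect changes differently.

```python
import os


class SpiderListCache:
    """Keep a spiders list in memory and reload it when the file changes."""

    def __init__(self, path):
        self.path = path
        self.mtime = None   # modification time seen at the last load
        self.idents = []

    def get(self):
        mtime = os.path.getmtime(self.path)
        if mtime != self.mtime:
            # File is new or was modified: rebuild the cached list.
            with open(self.path, encoding="utf-8") as fh:
                self.idents = [line.strip() for line in fh if line.strip()]
            self.mtime = mtime
        return self.idents
```

With this pattern, replacing the list file is enough; the next call to `get()` picks up the new entries.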


A note: this engine uses the Spiders List from Dream, with his permission.
http://www.wolfshead-solutions.com/spiders-list

Someone told me that vbSEO Googlemap was doing something similar. Hum... yeah, similar: it tracks 3 web crawlers, and you have to generate new entries by hand if you want to track each crawler.

This engine is compatible with Paul M's Guest tracking. It's not doing exactly the same thing, but you know, you choose. This engine does not track guest activity, just crawler page hits (adding a small query per page, but useful when you REALLY need to know what web crawlers are doing on your site). Really good for site owners showing potential for paid ads!

Also, there will be a newer version with more settings soon: permissions, blocking, etc.

...

As a success story, I must say I had a great visitor this week that I would never have been able to track without this engine: the "Majestics MJ12bot" crawler made 20 times the hits on my site that Google managed in months. I checked their site, and it was obvious they were trying to leech the site, not crawl it.

Download Now

File Type: zip nex_crawlers_bundle.zip (21.9 KB, 166 views)

Screenshots

File Type: jpg nex_crawlers_bundle_info.jpg (11.7 KB, 0 views)
File Type: jpg nex_crawlers_bundle_list.jpg (62.5 KB, 0 views)
File Type: jpg nex_crawlers_bundle_online.jpg (48.8 KB, 0 views)
File Type: jpg nex_crawlers_bundle_settings.jpg (55.3 KB, 0 views)
File Type: jpg nex_crawlers_bundle_wol_ug_markup.jpg (19.4 KB, 0 views)


  • This modification may not be copied, reproduced or published elsewhere without author's permission.
Thanks from:
puertoblack2003

Comments
#2
03-05-2012, 09:02 PM
vbenhancer
Join Date: Dec 2009
Location: Québec City, Canada
Posts: 740
Thanks: 0
Thanked 0 times in 0 posts

this version was tested on more than 250 websites, so it's supposed to work properly...

#3
03-05-2012, 09:18 PM
Mosh
Join Date: Aug 2004
Location: Melbourne, Australia
Posts: 1,968
Thanks: 0
Thanked 0 times in 0 posts

Actually Nexia, the spiders list you are pointing to is my spiders list, not Dream's. And yes, I do give permission.

#4
03-05-2012, 09:20 PM
vbenhancer
Join Date: Dec 2009
Location: Québec City, Canada
Posts: 740
Thanks: 0
Thanked 0 times in 0 posts

oh, i didn't see the change of ownership, sorry.. rofl

and thanks...

... am I the only coder making any use of the spider engine yet?!

looks like it was added in vB 3.6 and nobody ever used it...

#5
03-05-2012, 09:27 PM
Mosh
Join Date: Aug 2004
Location: Melbourne, Australia
Posts: 1,968
Thanks: 0
Thanked 0 times in 0 posts

Quote:
Originally Posted by vbenhancer
oh, i didn't see the change of ownership, sorry.. rofl

and thanks...

Dream stopped his; I wrote mine from scratch.

Quote:
Originally Posted by vbenhancer
... am I the only coder making any use of the spider engine yet?!

looks like it was added in vB 3.6 and nobody ever used it...

Boofo & Paul M have hacks that make use of the spiders list as well.

#6
05-28-2012, 03:43 AM
AFMichael
Join Date: Sep 2004
Location: Florida
Posts: 29
Thanks: 0
Thanked 0 times in 0 posts

Excellent mod! Installed!

#7
05-31-2012, 10:57 PM
matrex722
Join Date: Jan 2007
Posts: 161
Thanks: 0
Thanked 0 times in 0 posts

Excellent mod! Installed!

#8
06-03-2012, 06:26 AM
Fashel.Net
Join Date: May 2012
Posts: 3
Thanks: 0
Thanked 0 times in 0 posts

Thanks for this mod.

If the crawler activity could be shown only to Administrators, it would be fantastic. I hope you add this option to the mod, because I really like it.

Installed

#9
06-04-2012, 08:06 AM
incisor
Join Date: May 2006
Posts: 22
Thanks: 0
Thanked 0 times in 0 posts

very impressed

thank you for the effort!

#10
07-12-2013, 01:22 AM
WebMonster2013
Join Date: May 2013
Posts: 7
Thanks: 0
Thanked 0 times in 0 posts

I can't get this working at all.

As you can see, the bot is viewing a no-permission message.

All times are GMT. The time now is 10:52 AM.

