Member Bots 1.0 - Allow bots to act as members
Version: 1.00, by trilljester
Developer Last Online: Aug 2011

Version: 3.5.3
Released: 02-14-2006 Last Update: Never Installs: 192
Uses Plugins
Code Changes  
No support by the author.

--------- Member Bots 1.0 ---------
Written by Trilljester - http://www.trilliummud.com

This small mod allows you to treat search engine robots (e.g. GoogleBot, Yahoo Slurp) as registered users instead of guests, which is how vBulletin treats them by default.

This is handy if your site allows limited access to guests, but full access to registered users. By default, a search engine robot would only be allowed to index what a guest could see, but with this mod, it would allow them to fully index your site as a registered user.

This mod uses the spiders_vbulletin.xml file for determining if a visitor is a bot or not, so make sure you have that file up to date. The one that comes with VB is pretty sparse, but it works. If you're only interested in GoogleBot, then the default file works great.
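
For anyone curious what that kind of check boils down to, here is a minimal Python sketch of substring matching against a spider list. It is an illustration only, not the plugin's actual code: `SPIDER_IDENTS` and `is_spider` are hypothetical names, and the identifier strings are example values rather than the contents of spiders_vbulletin.xml.

```python
# Hypothetical identifier strings; the real list lives in spiders_vbulletin.xml.
SPIDER_IDENTS = ["googlebot", "slurp", "msnbot"]


def is_spider(user_agent: str, idents=SPIDER_IDENTS) -> bool:
    """Return True if any identifier occurs in the user agent (case-insensitive)."""
    ua = user_agent.lower()
    return any(ident.lower() in ua for ident in idents)
```

If a spider you care about is not matched by a check like this, its identifier is probably missing from the list, which is why keeping the file up to date matters.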

WARNING! This mod requires a very small edit to a vBulletin core file. This means that whenever you upgrade to a new version of vBulletin and overwrite this file, you'll need to redo the edit. If the idea of editing a vBulletin core file bothers you, don't install this mod! I won't be held responsible for your forum being fouled up. This mod is very safe, however, so don't fret.

Install Instructions:

1. Unzip the product-memberbot.xml file on your system.

2. Go to your vBulletin AdminCP and click on Manage Products under the Plugin Manager section.

3. Click on Add/Import Product.

4. Click on Browse on the first line (Upload the XML from your computer) and locate the product-memberbot.xml file. Click on Import to upload it to the system.

5. Click on Plugin Manager, and ensure that the new plugin named Bot Checker, which is found under the init_startup hook, is enabled.

6. Now the fun part: you'll need to edit the core vBulletin file init.php, which is located in the includes directory under your forum home.

For example, if your forum home is located in /home/blah/forum, then init.php is in /home/blah/forum/includes

If you can edit the file on the server without having to download it, more power to you.

In init.php, find this line (should be near the very bottom of the file, it was line 403 in mine):

Code:
if (!empty($db->explain))
Add ABOVE this line:
Code:
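// $is_bot is presumably set by the Bot Checker plugin (init_startup hook)
if ($is_bot == 1) {
        // Give the spider the permissions of usergroup 2 (Registered Users here)
        $vbulletin->userinfo['usergroupid'] = 2;
}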
NOTE: My registered users group is ID #2. Yours may be different; change this to whichever group ID you like.

7. Save init.php and upload back to your server if you had to download the file to edit it.

If you're editing on the server, save it, and that's it!

==========================================================
That's it! Search Engine bots should now "see" your forum as a regular user, not a guest.
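
The effect of the init.php edit can be pictured with this small Python sketch. It only models the decision and is not vBulletin code: `effective_usergroup` is a hypothetical function, group ID 2 is the registered group from the note in step 6, and group ID 1 standing for guests is an assumption about a default install.

```python
GUEST_GROUP_ID = 1       # assumption: the guest group on a default install
REGISTERED_GROUP_ID = 2  # the author's registered users group, per the note in step 6


def effective_usergroup(is_bot: bool, current_group_id: int,
                        bot_group_id: int = REGISTERED_GROUP_ID) -> int:
    """Return the group whose permissions apply to this visitor."""
    # Mirrors the edit: only detected spiders get their group overridden.
    return bot_group_id if is_bot else current_group_id


print(effective_usergroup(True, GUEST_GROUP_ID))   # spider gets the registered group
print(effective_usergroup(False, GUEST_GROUP_ID))  # ordinary guest keeps the guest group
```

Passing a different `bot_group_id` models pointing spiders at a dedicated group with its own permissions instead of the regular registered group.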

Questions, comments, improvements are more than welcome. Please use this thread as a central support site. It makes it easier for me to help out. Also, you may visit my site and fire me off a PM for help, or just send a quick thanks or shout out.

Enjoy!
// trilljester

Show Your Support

  • This modification may not be copied, reproduced or published elsewhere without author's permission.

Comments
#62
03-31-2006, 12:26 PM
amykhar
Join Date: Oct 2001
Location: PA
Posts: 4,438

Just a comment. You may want to consider changing the title of this mod to use the word spiders instead of bots. Most people here consider a bot a script that automatically posts something to the forums, while a search engine agent is a spider. More people will search for spider when they want this mod.

#63
04-01-2006, 03:43 AM
DJ J@M?S
Join Date: Nov 2005
Posts: 9

This mod doesn't work, guys.

#64
04-12-2006, 06:32 PM
Quarterbore
Join Date: Mar 2005
Location: Valley Forge PA
Posts: 538

All of you saying this is not working had me worried so I made some changes to mine...

I made a new group that works like my registered group but it is named "Search Bots" plus I updated my spider list... Next I assigned my bots to be in this new group and saw the bots were in the forums...

I shut their access down to access forums, threads, etc... and the bots all got no permissions errors....

So, I then went back in and updated my settings so they can see forums and threads BUT I made it so they can't see profiles (Can't harvest E-mails that way) and locked them out of forums I didn't want them to see based on the user group...

Works great for me!

Thanks!

#65
04-12-2006, 08:07 PM
gavinzac
Join Date: Jan 2006
Posts: 95

Quote:
Originally Posted by amykhar
Just a comment. You may want to consider changing the title of this mod to use the word spiders instead of bots. Most people here consider a bot a script that automatically posts something to the forums, while a search engine agent is a spider. More people will search for spider when they want this mod.
yeah, pretty much. it seems strange to me that there is an auto-response bot for phpBB & not for vBulletin.

#66
04-15-2006, 03:02 AM
clasione
Join Date: Aug 2004
Location: NY
Posts: 76

This is too risky for my blood..... Showing one page to a spider and one page to a guest is extremely dangerous.... Often search engines will send "secret spiders" to test and see if you are serving different pages.....

#67
04-15-2006, 10:43 PM
denby
Join Date: Nov 2004
Location: Canberra, Australia
Posts: 14

Quote:
Originally Posted by trilljester
Quote:
Thank you for your note. We recognize your concern. Please be advised that
we don't personally review individual sites, nor do we comment on
webmaster techniques or the details of our search technology beyond what
appears on our site.

We've dedicated an entire section of our site to answering the most common
questions from those who maintain and/or promote websites. You'll find all
of our publicly available information posted at
http://www.google.com/webmasters/index.html
This is not a definitive clarification of the issue in any way. Google has basically said they don't discuss individual cases or divulge their indexing or verification processes (apart from saying it's done by spiders, not humans), and that you'll find the answers to all your questions in their webmasters' FAQ. It's left completely to the individual to interpret the TOS and decide what's black hat.
Quote:
Originally Posted by Andreas
Attention
Using this Hack is to be considered cloaking your site, and you risk being banned from search engine indexes - like it recently happened to BMW.
To be fair, the BMW case involves very different issues. BMW.de was using quite blatant and crude cloaking: spiders were redirected to pages chock-full of car-related keywords at a much greater density than on the pages shown to ordinary visitors, in an attempt to boost page rank. Very old, tired strategies that Google can spot at 100 MegaPaces.


I face the usual competing goals of retaining control of my site's privacy and member privileges while ensuring good SE ranking and the delivery of relevant GoogleAds (and hence healthy GoogleAds revenues). One added problem in all this is that if you have restricted forum viewing access to logged-in visitors, installed the mod and had your forum pages indexed by Google (essentially the archive URL versions of them), the Google search results will reveal the content via the "Cached" link for each result. If unlogged-in visitors from the Google results click the ordinary link, they will be confronted by a login screen, but not if they click the "Cached" link. To avoid this hole in your security you need to add a robots no-cache meta tag to your headerinclude template:
Code:
<meta name="robots" content="index,follow,noarchive">
This says to all spiders that visit this page "you may index this page, and any pages linked to it via hyperlinks, but you may not store the content of any of them in the search engine cache".
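
To double-check that the tag actually ends up in your rendered pages, something like the following Python sketch could be run against a saved copy of a page. `RobotsMetaParser` and `robots_directives` are hypothetical names made up for this illustration; only the standard library `html.parser` module is used.

```python
from html.parser import HTMLParser


class RobotsMetaParser(HTMLParser):
    """Collect the directives of every <meta name="robots"> tag in a page."""

    def __init__(self):
        super().__init__()
        self.directives = set()

    def handle_starttag(self, tag, attrs):
        if tag != "meta":
            return
        attr = dict(attrs)
        if (attr.get("name") or "").lower() == "robots":
            content = attr.get("content") or ""
            # Directives are comma-separated, e.g. "index,follow,noarchive".
            self.directives.update(
                part.strip().lower() for part in content.split(",") if part.strip()
            )


def robots_directives(html: str) -> set:
    """Return the set of robots directives found in the given HTML."""
    parser = RobotsMetaParser()
    parser.feed(html)
    parser.close()
    return parser.directives
```

If "noarchive" is not in the returned set for a page a spider can reach, the tag did not make it into that page's template.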

You should also edit/install a robots.txt file in your root or forum folder with the following content:
Code:
User-agent: Googlebot
Disallow:

User-agent: Mediapartners-Google
Disallow:

User-agent: *
Disallow: /
These settings tell the two spiders Googlebot and Mediapartners-Google (the GoogleAds spider) "roam my site at will" but tell all others not to crawl any part of my site. It should be noted that only spiders that play by the rules read the robots.txt file. All the big names obey the robots.txt directives, but there are lots of smaller rogue spiders that ignore them. That is another good reason to edit your spiders_vbulletin.xml file to define which spiders will be accepted into your forum by this mod.
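
These rules can be sanity-checked offline with Python's standard `urllib.robotparser` module. The sketch below is only an illustration: the `can_crawl` helper and the example path are made up, and real spiders may interpret edge cases differently from this parser.

```python
from urllib.robotparser import RobotFileParser

# The rules from this post, one record per spider.
ROBOTS_TXT = """\
User-agent: Googlebot
Disallow:

User-agent: Mediapartners-Google
Disallow:

User-agent: *
Disallow: /
"""


def can_crawl(user_agent: str, path: str, robots_txt: str = ROBOTS_TXT) -> bool:
    """Parse the rules and report whether user_agent may fetch path."""
    parser = RobotFileParser()
    parser.parse(robots_txt.splitlines())
    return parser.can_fetch(user_agent, path)


for agent in ("Googlebot", "Mediapartners-Google", "msnbot"):
    print(agent, can_crawl(agent, "/forum/index.php"))
```

An empty Disallow line allows everything for that spider, while "Disallow: /" under "User-agent: *" shuts out every spider that has no record of its own.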


Nice mod but it's operating in the grey zone and until Google definitively clarifies whether it's OK for a spider to see my forum content while an unlogged-in visitor can not, I will not be installing it. I can't afford the risk of losing my significant monthly GoogleAds income or my healthy SE ranking.

I will write to Google and report here if I get a clearer answer than trilljester received.

#68
04-16-2006, 07:06 AM
MikeF
Join Date: Jan 2006
Posts: 40

Quote:
Nice mod but it's operating in the grey zone and until Google definitively clarifies whether it's OK for a spider to see my forum content while an unlogged-in visitor can not, I will not be installing it.
I don't understand how the point hasn't been clarified by the major search engines. Cloaking is when the spiders see content that a person casually going to the site doesn't see, for the purposes of gaining ranking. If the default usergroup is Unregistered with the ability to read posts restricted, and the bots see much more, it's cloaking...

The bottom line is that it's not OK.

Imagine you are Google or Yahoo trying to get the most relevant search results. The archive is a duplicate content / optimization issue... they may rank you lower or higher for duplicate content depending on duplication, but they're not going to terminate you. Now if you added 100 content-relevant keywords at the end of every archive page to gain better rank, they probably would.

The same goes for this - if there are some general discrepancies between what registered/unregistered people see, it's not a big deal. But turning spiders into forum members... Combine this with the vBPopup mod and someone might as well start cheesyforum.org

#69
04-16-2006, 10:54 PM
AzzidReign
Join Date: Apr 2006
Posts: 497

Hmm... seems like this isn't working for me either. I'm looking at the bots and it's saying that they are getting the "no permissions" error. Any definite fix for this yet?

#70
04-22-2006, 02:00 PM
Raydar
Join Date: Dec 2005
Posts: 87

My guests can see everything that my members can see. So I'm thinking that I don't need this hack. Unless there is some other advantage to using it?

I also have welcome headers installed. From reading parts of this thread, it seems that welcome headers may be what the google bot is reading when crawling my pages. If so, is there a way to make google ignore the welcome header?

Thanks for any advice.

Reg

#71
04-28-2006, 01:50 PM
Lammypie
Join Date: Nov 2005
Posts: 10

Corbynt, I'm not sure mine is working either. I've changed my user agent to "Googlebot/2.X (http://www.googlebot.com/bot.html)" and gone to my site while not logged in as a user. The access I get is controlled by the user group setting, so that part is working; it's just that I don't appear in the Who's Online list as Googlebot, only as a guest. Any suggestions as to what's wrong?
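
A browser is not the only way to run this kind of test. Below is a minimal Python sketch that sends a request with a spider-style User-Agent header using the standard `urllib.request` module. The `build_spider_request` and `fetch_status` helpers are hypothetical names, any URL passed in is up to you, and the default user agent string is the one Googlebot commonly sends. Whether the Who's Online list then shows a spider depends on the forum's spider list, which this sketch does not check.

```python
import urllib.request

GOOGLEBOT_UA = "Mozilla/5.0 (compatible; Googlebot/2.1; +http://www.google.com/bot.html)"


def build_spider_request(url: str, user_agent: str = GOOGLEBOT_UA) -> urllib.request.Request:
    """Build a GET request that identifies itself with the given user agent."""
    return urllib.request.Request(url, headers={"User-Agent": user_agent})


def fetch_status(url: str, user_agent: str = GOOGLEBOT_UA) -> int:
    """Fetch url with the spoofed user agent and return the HTTP status code.

    urlopen raises urllib.error.HTTPError for 4xx/5xx responses, so a
    permission error page that is sent with such a status shows up as an exception.
    """
    request = build_spider_request(url, user_agent)
    with urllib.request.urlopen(request, timeout=10) as response:
        return response.status
```

Calling `fetch_status` once with the default user agent and once with an ordinary browser string, against a page that guests cannot view, should show whether the forum treats the two differently.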

Reg Car, I also need to stop spiders from seeing the welcome headers, as they are ruining my search results.

All suggestions welcome