vb.org Archive

vb.org Archive (https://vborg.vbsupport.ru/index.php)
-   vBulletin 3.5 Add-ons (https://vborg.vbsupport.ru/forumdisplay.php?f=113)
-   -   Member Bots 1.0 - Allow bots to act as members (https://vborg.vbsupport.ru/showthread.php?t=108070)

amykhar 03-31-2006 12:26 PM

Just a comment. You may want to consider changing the title of this mod to use the word spiders instead of bots. Most people here consider a bot a script that automatically posts something to the forums, while a search engine agent is a spider. More people will search for spider when they want this mod.

DJ J@M?S 04-01-2006 03:43 AM

this mod not work guys :(

Quarterbore 04-12-2006 06:32 PM

All of you saying this is not working had me worried so I made some changes to mine...

I made a new group that works like my registered group but it is named
"Search Bots" plus I updated my spider list... Next I assigned my bots to be in this new group and saw the bots were in the forums...

I shut their access down to access forums, threads, etc... and the bots all got no permissions errors....

So, I then went back in and updated my settings so they can see forums and threads BUT I made it so they can't see profiles (Can't harvest E-mails that way) and locked them out of forums I didn't want them to see based on the user group...

Works great for me!

Thanks!

gavinzac 04-12-2006 08:07 PM

Quote:

Originally Posted by amykhar
Just a comment. You may want to consider changing the title of this mod to use the word spiders instead of bots. Most people here consider a bot a script that automatically posts something to the forums, while a search engine agent is a spider. More people will search for spider when they want this mod.

yeah, pretty much. it seems strange to me that there is an auto-response bot for phpBB & not for vBulletin.

clasione 04-15-2006 03:02 AM

This is too risky for my blood..... Showing one page to a spider and one page to a guest is extreamly dangerous.... Often search engines will send "secret spiders" to test and see if you are serving different pages.....

denby 04-15-2006 10:43 PM

Quote:

Originally Posted by trilljester
Quote:

Thank you for your note. We recognize your concern. Please be advised that
we don't personally review individual sites, nor do we comment on
webmaster techniques or the details of our search technology beyond what
appears on our site.

We've dedicated an entire section of our site to answering the most common
questions from those who maintain and/or promote websites. You'll find all
of our publicly available information information posted at
http://www.google.com/webmasters/index.html

This is not a definitive clarification of the issue in any way. Google has basically said they don't discuss individual cases or divilge their indexing or verfication processes (apart from saying it's done by spiders not humans), and that you'll find the answers to all your questions in their webmasters' FAQ. It's left completely to the individual to interpret the TOS and decide what's black hat.
Quote:

Originally Posted by Andreas
Attention
Using this Hack is to be considered cloaking your site, and you risk being banned from search engine indexes - like it recently happened to BMW.

To be fair, the BMW case involves very different issues. BMW.de was using quite blatant and crude cloaking: spiders were redirected to pages chocked full of car related keywords at a much greater density than on the pages shown to ordinary visitors in an attempt to boost page rank. Very old, tired strategies that Google can spot at 100 MegaPaces.


I face the usual competing goals of retaining control of my site's privacy and member priviledges while ensuring good SE ranking and the delivery of relevant GoogleAds (and hence healthy GoogleAds revenues). One added problem in all this is that if you have restricted the forum viewing access to logged-in visitors, installed the mod and had your forum pages indexed by Google (essentially the archive URL versions of them), the Google search results will reveal the content via the "Cached" link for each result. If unlogged-in visitors from the Google results click the ordinary link, they will be confronted by a login screen but not if they click the "Cached" link. To avoid this hole in your security you need to add a robots no-cache meta tag to your headerinclude template:
Code:

<meta name="robots" content="index,follow,noarchive">
This says to all spiders that visit this page "you may index this page, and any pages linked to it via hyperlinks but you may not store the content of any of them in the Googlesearch engine cache".

You should also edit/install a robots.txt file in your root or forum folder with the following content:
Code:

User-agent: Googlebot
Disallow:
User-agent: Mediapartners-Google
Disallow:
User-agent: *
Disallow: /

These settings say to the 2 spiders Googlebot and Mediapartners-Google (the GoogleAds spider) "roam my site at will" but tells all others to not crawl any part of my site. It should be noted that only spiders that play by the rules read the robots.txt file. All the big names obey the robots.txt directives but there are lots of smaller rogue spiders who ignore them. Another good reason to edit your spiders_vbulletin.xml file to define which spiders will be accepted into your forum by this mod.


Nice mod but it's operating in the grey zone and until Google definitively clarifies whether it's OK for a spider to see my forum content while an unlogged-in visitor can not, I will not be installing it. I can't afford the risk of losing my significant monthly GoogleAds income or my healthy SE ranking.

I will write to Google and report here if I get a clearer answer than trilljester received. :)

MikeF 04-16-2006 07:06 AM

Quote:

Nice mod but it's operating in the grey zone and until Google definitively clarifies whether it's OK for a spider to see my forum content while an unlogged-in visitor can not, I will not be installing it.
I don't understand how the point hasn't been clarified by the major search engines. Cloaking is when the spiders see content a person casually going to the site doesn't see for the purposes of gaining ranking. If the default usergroup is Unregistered with ability to read posts restricted, and the bots see much more its cloaking...

The bottom line is that its not OK.

Imagine you are Google or Yahoo trying to get the most relevant search results. The archive is a duplicate content / optimization issue... they may rank you lower or higher for duplicate content depending on duplication but theyre not going to terminate you. Now if you added 100 content relevant keywords at the end of every archive page to gain better rank, they probably would.

The same goes for this - if there are some general discrepencies between what registered/unregistered people see its not a big deal. But turning spiders into forum members... :rolleyes: Combine this with the vBPopup mod and someone might as well start cheesyforum.org

AzzidReign 04-16-2006 10:54 PM

Hmm...seems like this isnt working for me either. I'm lookin at the bots and its saying that they are getting "no permissions" error. Any definite fix to this yet?

Raydar 04-22-2006 02:00 PM

My guests can see everything that my members can see. So I'm thinking that I don't need this hack. Unless there is some other advantage to using it?

I also have welcome headers installed. From reading parts of this thread, it seems that welcome headers may be what the google bot is reading when crawling my pages. If so, is there a way to make google ignore the welcome header?

Thanks for any advice.

Reg

Lammypie 04-28-2006 01:50 PM

Corbynt, I'm not sure mine is working either. I've adjusted my useragent to "Googlebot/2.X (http://www.googlebot.com/bot.html)" and gone to my site (not logged in as a user, the access I get is contolled by the user group setting, so thats working, its just that I don't appear in the whos online list as googlebot, only as guest. ? any suggestions whats wrong.

Reg Car, I also need to stop spiders from seeing the welcome headers, as they are ruining my search results.

All suggestions welcome


All times are GMT. The time now is 10:56 AM.

Powered by vBulletin® Version 3.8.12 by vBS
Copyright ©2000 - 2025, vBulletin Solutions Inc.

X vBulletin 3.8.12 by vBS Debug Information
  • Page Generation 0.01202 seconds
  • Memory Usage 1,753KB
  • Queries Executed 10 (?)
More Information
Template Usage:
  • (1)ad_footer_end
  • (1)ad_footer_start
  • (1)ad_header_end
  • (1)ad_header_logo
  • (1)ad_navbar_below
  • (2)bbcode_code_printable
  • (5)bbcode_quote_printable
  • (1)footer
  • (1)gobutton
  • (1)header
  • (1)headinclude
  • (6)option
  • (1)pagenav
  • (1)pagenav_curpage
  • (4)pagenav_pagelink
  • (1)pagenav_pagelinkrel
  • (1)post_thanks_navbar_search
  • (1)printthread
  • (10)printthreadbit
  • (1)spacer_close
  • (1)spacer_open 

Phrase Groups Available:
  • global
  • postbit
  • showthread
Included Files:
  • ./printthread.php
  • ./global.php
  • ./includes/init.php
  • ./includes/class_core.php
  • ./includes/config.php
  • ./includes/functions.php
  • ./includes/class_hook.php
  • ./includes/modsystem_functions.php
  • ./includes/class_bbcode_alt.php
  • ./includes/class_bbcode.php
  • ./includes/functions_bigthree.php 

Hooks Called:
  • init_startup
  • init_startup_session_setup_start
  • init_startup_session_setup_complete
  • cache_permissions
  • fetch_threadinfo_query
  • fetch_threadinfo
  • fetch_foruminfo
  • style_fetch
  • cache_templates
  • global_start
  • parse_templates
  • global_setup_complete
  • printthread_start
  • pagenav_page
  • pagenav_complete
  • bbcode_fetch_tags
  • bbcode_create
  • bbcode_parse_start
  • bbcode_parse_complete_precache
  • bbcode_parse_complete
  • printthread_post
  • printthread_complete