vb.org Archive

vb.org Archive (https://vborg.vbsupport.ru/index.php)
-   vBulletin 2.x Full Releases (https://vborg.vbsupport.ru/forumdisplay.php?f=4)
-   -   vbArchive - Search Engine Indexer for vBulletin (https://vborg.vbsupport.ru/showthread.php?t=47667)

glenvw 01-16-2003 04:15 PM

I am lost :-)

I edited my online.php and uploaded it:

Here is my archive:

http://www.yes-its-free.com/vbbs/archive/

Please splain what I should be seeing

BTW, if I have not said it before, let me say it a couple more thousand times....AWESOME JOB!

TECK 01-16-2003 04:38 PM

You remember the Unkown Location, when you view active users brosing the forums (online.php)? Is gone now, you see the actual text and link instead. :p
You will not see nothing new in the archive files.

TECK 01-16-2003 04:50 PM

Ok, another idea I came up with... If the crawler is Google, for example, and it's hitting the actual forum pages, not the archive, you could use this trick to remove the sessionhash from the templates:
FILE: functions.php
ADD ABOVE [high]?>[/high] THIS CODE:
PHP Code:

function no_sessionhash()
{
  global 
$session;

  
$agent = array(
    
'crawl',
    
'googlebot',
    
'gulliver',
    
'ia_archiver',
    
'internetseer',
    
'linkalarm',
    
'mercator',
    
'openbot',
    
'pingalink',
    
'psbot',
    
'scooter',
    
'slurp',
    
'slysearch',
    
'zeus',
    
'zyborg',
    
'otheruseragentcrawleryouwant'
  
);

  foreach( 
$agent as $useragent )
  {
    if ( 
stristrgetenv'HTTP_USER_AGENT' ) , $useragent ) )
    {
      
$session['sessionhash'] = '';
    }
  }


FILE: /forum global.php
ADD ABOVE [high]$header='';[/high] THIS CODE:
PHP Code:

no_sessionhash(); 

The list above will cover all major crawlers.
Make sure you enter the actual AGENT name ONLY.
To test it, enter for example:
'mozilla',

It will remove all your session hashes, if you browse with Internet Explorer for example.

wooolF[RM] 01-16-2003 05:27 PM

]wiKKid! :D

TECK 01-17-2003 12:43 PM

To block the crawlers go to certain pages, create a robots.txt file with the content:
Code:

User-agent: *
Disallow: /forum/admin/
Disallow: /forum/images/
Disallow: /forum/mod/
Disallow: /forum/attachment.php
Disallow: /forum/avatar.php
Disallow: /forum/editpost.php
Disallow: /forum/member.php
Disallow: /forum/member2.php
Disallow: /forum/memberlist.php
Disallow: /forum/misc.php
Disallow: /forum/moderator.php
Disallow: /forum/newreply.php
Disallow: /forum/newthread.php
Disallow: /forum/online.php
Disallow: /forum/poll.php
Disallow: /forum/postings.php
Disallow: /forum/printarticle.php
Disallow: /forum/printthread.php
Disallow: /forum/private.php
Disallow: /forum/private2.php
Disallow: /forum/register.php
Disallow: /forum/report.php
Disallow: /forum/search.php
Disallow: /forum/sendtofriend.php
Disallow: /forum/threadrate.php
Disallow: /forum/usercp.php

and upload it onto your main (not /forum, the one above it) directory. Crawlers will look ONLY onto the root folder for the robots.txt file.
Make sure you change the /forum folder to your actual folder name.

For more info on robots.txt file, go here.

codewebs 01-17-2003 06:38 PM

i saw a similar version of that google thing on vbulletin.com but i added yours, and your shure it works right, like it wont mess it up for google since we cant test it? (the user agent thing)

TECK 01-17-2003 06:52 PM

To see if it's working, simply use:
no_sessionhash( 'mozilla' );

There is no more sessionhash displayed after.
Make sure you remove it, orelse all IE (or whatever browser you use) users will not have anymore sessions...

Chris M 01-17-2003 08:35 PM

Nice TECK:)

*installs*

Satan

TECK 01-18-2003 03:27 PM

Btw, the script is wise enough to work with .html extensions or not... so if you upgraded from an old version and Google already indexed your files, it will still point to that page. :)
Let's see:
http://www.teckwizards.com/forumdisplay/f-8.html
http://www.teckwizards.com/showthread/t-577.html

and:
http://www.teckwizards.com/forumdisplay/f-8
http://www.teckwizards.com/showthread/t-577

Same results. :)
Go ahead and upgrade your script.

Floris 01-18-2003 03:35 PM

I don't need to upgrade the version I first installed. it is working as i wanted it to work. thank you for letting me know on vbulletin.com with a pm that there was an upgrade :)


All times are GMT. The time now is 10:45 PM.

Powered by vBulletin® Version 3.8.12 by vBS
Copyright ©2000 - 2025, vBulletin Solutions Inc.

X vBulletin 3.8.12 by vBS Debug Information
  • Page Generation 0.01665 seconds
  • Memory Usage 1,747KB
  • Queries Executed 10 (?)
More Information
Template Usage:
  • (1)ad_footer_end
  • (1)ad_footer_start
  • (1)ad_header_end
  • (1)ad_header_logo
  • (1)ad_navbar_below
  • (1)bbcode_code_printable
  • (2)bbcode_php_printable
  • (1)footer
  • (1)gobutton
  • (1)header
  • (1)headinclude
  • (6)option
  • (1)pagenav
  • (1)pagenav_curpage
  • (4)pagenav_pagelink
  • (2)pagenav_pagelinkrel
  • (1)post_thanks_navbar_search
  • (1)printthread
  • (10)printthreadbit
  • (1)spacer_close
  • (1)spacer_open 

Phrase Groups Available:
  • global
  • postbit
  • showthread
Included Files:
  • ./printthread.php
  • ./global.php
  • ./includes/init.php
  • ./includes/class_core.php
  • ./includes/config.php
  • ./includes/functions.php
  • ./includes/class_hook.php
  • ./includes/modsystem_functions.php
  • ./includes/class_bbcode_alt.php
  • ./includes/class_bbcode.php
  • ./includes/functions_bigthree.php 

Hooks Called:
  • init_startup
  • init_startup_session_setup_start
  • init_startup_session_setup_complete
  • cache_permissions
  • fetch_threadinfo_query
  • fetch_threadinfo
  • fetch_foruminfo
  • style_fetch
  • cache_templates
  • global_start
  • parse_templates
  • global_setup_complete
  • printthread_start
  • pagenav_page
  • pagenav_complete
  • bbcode_fetch_tags
  • bbcode_create
  • bbcode_parse_start
  • bbcode_parse_complete_precache
  • bbcode_parse_complete
  • printthread_post
  • printthread_complete