Go Back   vb.org Archive > vBulletin Modifications > Archive > vB.org Archives > vBulletin 2.x > vBulletin 2.x Full Releases
FAQ Community Calendar Today's Posts Search

Reply
 
Thread Tools
Details »»

Version: , by Overgrow Overgrow is offline
Developer Last Online: Jun 2004 Show Printable Version Email this Page

Version: 2.0.x Rating:
Released: 04-29-2001 Last Update: Never Installs: 53
 
No support by the author.

I am tired of my 200,000 posts not being listed in Google. I was inspired by phpbuilder.com this morning and I wrote:

vbSpiderFriend - the search engine indexer for all of your posts

Purpose: Allow search engine spiders to crawl a linked list of all of your posts.

Project Requirements:

-Friendly URLs (no query strings)
-Good dynamic meta tags
-Never have to touch the script again.. It is Y3K compliant, simply re-submit to the engines to update your listings

Install Requirements:

-vBulletin 1.x or 2.x
-about 10 minutes


1) Download the attached Zip.

2) Open class.mysql.php and put your database login info at the top.

3) Create a new directory called archive under your forum, like /forum/archive

4) Open the included .htaccess and change the Error 404 to your new archive path.

5) Open index.php and change the self-explanatory variables at the top of the file.

6) Upload all 3 files to your archive directory.

7) Submit /forum/archive/index.php to search engines and watch em crawl


DISCLAIMER: I don't use 2.x but I checked the schema and this should work fine.

NOTES: This uses ErrorDocument and query string parsing to get the variables needed. I do not have the time or energy to troubleshoot this if it does not work on your server. Sorry!

Show Your Support

  • This modification may not be copied, reproduced or published elsewhere without author's permission.

Comments
  #312  
Old 12-14-2002, 11:13 PM
SoloX SoloX is offline
 
Join Date: Jun 2002
Posts: 135
Благодарил(а): 0 раз(а)
Поблагодарили: 0 раз(а) в 0 сообщениях
Default

Is there any way to feed the archive URL to something like website downloaders to create a full text archive of the VBB? This way, the search engines wont see the nasty 404s. I tried using some win based site downloaders and all stopped at the first page of the archive index (due to 404's).
Reply With Quote
  #313  
Old 12-18-2002, 03:28 AM
SloppyGoat's Avatar
SloppyGoat SloppyGoat is offline
 
Join Date: Feb 2002
Posts: 339
Благодарил(а): 0 раз(а)
Поблагодарили: 0 раз(а) в 0 сообщениях
Default

I don't think I'll try this one. "http://www.overgrow.com/edge/archive" gives me this....
Quote:
Parse error: parse error, expecting `T_OLD_FUNCTION' or `T_FUNCTION' or `T_VAR' or `'}'' in /home/overgrow/HTML/globals/class.mysql.php on line 1

Fatal error: Cannot instantiate non-existent class: mysql in /home/overgrow/HTML/edge/archive/index.php on line 78
Reply With Quote
  #314  
Old 12-21-2002, 01:18 AM
lalo landa lalo landa is offline
 
Join Date: Nov 2002
Posts: 50
Благодарил(а): 0 раз(а)
Поблагодарили: 0 раз(а) в 0 сообщениях
Default

there 's an option to make work the vbcodes?

REally cool hack Thnkxs!!
Reply With Quote
  #315  
Old 01-24-2003, 11:44 PM
floridaideal floridaideal is offline
 
Join Date: Aug 2002
Posts: 136
Благодарил(а): 0 раз(а)
Поблагодарили: 0 раз(а) в 0 сообщениях
Default

Hi all

Thanks for this great hack, I installed it about 3 months ago but so far its not been spidered.

I had the link added to my index page www.top-forums.com and google has now picked the link up and has listed it as http://www.top-forums.com/forum/archive/index.php which is correct.

Its all working ok as far as I can see so why isnt it spidering the whole archives?

Someone please help.

Thanks

Stu
Reply With Quote
  #316  
Old 01-25-2003, 12:23 AM
Prince Prince is offline
 
Join Date: Oct 2001
Posts: 333
Благодарил(а): 0 раз(а)
Поблагодарили: 0 раз(а) в 0 сообщениях
Default

Quote:
Originally posted by floridaideal
Hi all

Thanks for this great hack, I installed it about 3 months ago but so far its not been spidered.

I had the link added to my index page www.top-forums.com and google has now picked the link up and has listed it as http://www.top-forums.com/forum/archive/index.php which is correct.

Its all working ok as far as I can see so why isnt it spidering the whole archives?

Someone please help.

Thanks

Stu
Google. will not spider mine either, I have had it installed for around 4 months.

Apparently my page rank on my forums is not high enough.
Reply With Quote
  #317  
Old 01-25-2003, 09:14 AM
floridaideal floridaideal is offline
 
Join Date: Aug 2002
Posts: 136
Благодарил(а): 0 раз(а)
Поблагодарили: 0 раз(а) в 0 сообщениях
Default

Can anyone please help Ironman and me ?

Thanks
Reply With Quote
  #318  
Old 04-13-2003, 11:07 PM
ryancooper ryancooper is offline
 
Join Date: Jul 2002
Posts: 433
Благодарил(а): 0 раз(а)
Поблагодарили: 0 раз(а) в 0 сообщениях
Default

I get a "The page cannot be found" error when I click on the forum links? Any idea why??

Thanks!!
Reply With Quote
  #319  
Old 06-25-2003, 12:41 PM
aquariumpro aquariumpro is offline
 
Join Date: Sep 2002
Posts: 3
Благодарил(а): 0 раз(а)
Поблагодарили: 0 раз(а) в 0 сообщениях
Default

Hi:

Since switching hosts ( a much better host!) and upgrading to VBulletin 2.30, vbspiderfriend is still working and we have thousands of links in Google and other SE. Thanks Overgrow!

However, a new development has started, which does not seem to affect the actual function of the script, but is generating hundreds of Kb a day in our server error log that look like:

[Wed Jun 25 09:00:06 2003] [error] [client 211.28.92.157] File does not exist: /home/aquariumpros/websites/www.aquariumpros.com/html/forums/archive/19/2003/01/2/785


These errors are being generated every time a user posts in our forum, and with about 100 posts a day or more, the error log reaches over 20,000 Kb a week!


What's up? and do you think it's possible to fix this?

URL: http://aquariumpros.com/forums/archive/

The script is working fine. It picks up all the new posts and can be easily browsed. Other than those error log entries, it works perfectly! Problem is that that error log bloating is running me ouyt of disk space and it must be emptied every other day.
Reply With Quote
  #320  
Old 07-10-2003, 12:45 PM
KevinM KevinM is offline
 
Join Date: Dec 2002
Posts: 60
Благодарил(а): 0 раз(а)
Поблагодарили: 0 раз(а) в 0 сообщениях
Default

I have installed this hack on my test forum, and it works great on 15 out of 20 of my forums.

Of the ones that don't work, all the forum id's are single digit e.g.
http://www.ukgardeningchat.co.uk/archive/5 where the forum id is 5.

However, if I manually type in 05 at the end of the url instead, it works great. eg.

http://www.ukgardeningchat.co.uk/archive/05

but it does not do this automatically. All the forums from 10 upwards are fine. Any suggestions on how to make it work or force the url to read 05?
Reply With Quote
  #321  
Old 07-27-2003, 02:02 PM
michaelab michaelab is offline
 
Join Date: Jul 2003
Location: Lisbon, Portugal
Posts: 18
Благодарил(а): 0 раз(а)
Поблагодарили: 0 раз(а) в 0 сообщениях
Default

Quote:
05-21-01 at 05:29 PM Overgrow said this in Post #83
Streicher: Sorry I have no idea on your problem-- you are the only one who has reported this and I cannot duplicate it.
Well - I am getting exactly the same problem! To re-cap, the problem is that for forums with an ID < 10 clicking on the forum link just reloads the top level arcihive list, and doesn't go into the forum itself.

If you manually add a 0 to the forum ID in the URL (eg .../archive/06) then it loads find and what's more, all subsequent links off the forum also work fine.

I couldn't find the exact problem but for some reason the code that splits up the URL:

PHP Code:
if (ereg("/archive/[0-9]",getenv('REQUEST_URI'))) {

    
$urlArray=explode("/",getenv('REQUEST_URI'));  //split the URL path
    
$a=0;

    while(
$urlArray[$a] != "archive") {
        
$a++;
    }

    
$forumID=$urlArray[($a+1)];
    
$year=$urlArray[($a+2)];
    
$month=$urlArray[($a+3)];
    
$week=$urlArray[($a+4)];
    
$threadID=$urlArray[($a+5)];

    if (
eregi("[a-z]",$forumID) or $forumID or strstr($privateForums,"|".$forumID."|"))
        
$forumID=1;

...is parsing the forum ID as nothing (empty string) for forum IDs < 10. I couldn't be bothered to work out why that was so I fixed the problem using a workaround:

In the forumList() function I changed:

PHP Code:
    if(!empty($data)) {

        while ( list ( 
$key,$forum ) = each ($data)) {
            echo 
spacer(1)."<a href=\"$baseURL/$forum[forumid]\">$forum[title]</a><br>";
        }
    } 
to...

PHP Code:
    if(!empty($data)) {

        while ( list ( 
$key,$forum ) = each ($data)) {
            
$forumNo $forum[forumid];
            if (
$forumNo 10)
                
$forumNo "0" $forumNo;

            echo 
spacer(1)."<a href=\"$baseURL/$forumNo\">$forum[title]</a><br>";
        }
    } 
That way, the 0 is prepended to the forum ID in the output and it all works fine.

I know this thread is pretty old but hopefully it will help some of you out there!

BTW, what's the status on this Hack working with Google or not re. the 404 issue? I have to use this hack rather than any of the others because I can't seem to get the mod_rewrite or ForceType directives to work using Skuzzy or Teck's hacks.

Michael.
Reply With Quote
Reply


Posting Rules
You may not post new threads
You may not post replies
You may not post attachments
You may not edit your posts

BB code is On
Smilies are On
[IMG] code is On
HTML code is Off

Forum Jump


All times are GMT. The time now is 09:46 AM.


Powered by vBulletin® Version 3.8.12 by vBS
Copyright ©2000 - 2024, vBulletin Solutions Inc.
X vBulletin 3.8.12 by vBS Debug Information
  • Page Generation 0.08629 seconds
  • Memory Usage 2,335KB
  • Queries Executed 27 (?)
More Information
Template Usage:
  • (1)SHOWTHREAD
  • (1)ad_footer_end
  • (1)ad_footer_start
  • (1)ad_header_end
  • (1)ad_header_logo
  • (1)ad_navbar_below
  • (1)ad_showthread_beforeqr
  • (3)bbcode_php
  • (3)bbcode_quote
  • (1)footer
  • (1)forumjump
  • (1)forumrules
  • (1)gobutton
  • (1)header
  • (1)headinclude
  • (1)modsystem_post
  • (1)navbar
  • (6)navbar_link
  • (120)option
  • (1)pagenav
  • (1)pagenav_curpage
  • (4)pagenav_pagelink
  • (1)pagenav_pagelinkrel
  • (11)post_thanks_box
  • (11)post_thanks_button
  • (1)post_thanks_javascript
  • (1)post_thanks_navbar_search
  • (11)post_thanks_postbit_info
  • (10)postbit
  • (11)postbit_onlinestatus
  • (11)postbit_wrapper
  • (1)spacer_close
  • (1)spacer_open
  • (1)tagbit_wrapper 

Phrase Groups Available:
  • global
  • inlinemod
  • postbit
  • posting
  • reputationlevel
  • showthread
Included Files:
  • ./showthread.php
  • ./global.php
  • ./includes/init.php
  • ./includes/class_core.php
  • ./includes/config.php
  • ./includes/functions.php
  • ./includes/class_hook.php
  • ./includes/modsystem_functions.php
  • ./includes/functions_bigthree.php
  • ./includes/class_postbit.php
  • ./includes/class_bbcode.php
  • ./includes/functions_reputation.php
  • ./includes/functions_post_thanks.php 

Hooks Called:
  • init_startup
  • init_startup_session_setup_start
  • init_startup_session_setup_complete
  • cache_permissions
  • fetch_postinfo_query
  • fetch_postinfo
  • fetch_threadinfo_query
  • fetch_threadinfo
  • fetch_foruminfo
  • style_fetch
  • cache_templates
  • global_start
  • parse_templates
  • global_setup_complete
  • showthread_start
  • showthread_getinfo
  • forumjump
  • showthread_post_start
  • showthread_query_postids
  • showthread_query
  • bbcode_fetch_tags
  • bbcode_create
  • showthread_postbit_create
  • postbit_factory
  • postbit_display_start
  • post_thanks_function_post_thanks_off_start
  • post_thanks_function_post_thanks_off_end
  • post_thanks_function_fetch_thanks_start
  • post_thanks_function_fetch_thanks_end
  • post_thanks_function_thanked_already_start
  • post_thanks_function_thanked_already_end
  • fetch_musername
  • postbit_imicons
  • bbcode_parse_start
  • bbcode_parse_complete_precache
  • bbcode_parse_complete
  • postbit_display_complete
  • post_thanks_function_can_thank_this_post_start
  • pagenav_page
  • pagenav_complete
  • tag_fetchbit_complete
  • forumrules
  • navbits
  • navbits_complete
  • showthread_complete