Very nice work! Can't wait to see the results! But I have a couple problems...
My website is running vBulletin in a /forums/ folder (ex: http://www.domain.com/forums/) and I have a normal website in my root (not vBadvanced or some other portal). I added my root website to the extra-urls.txt in hopes that it will be added to the sitemap. I also set my root as the Sitemap URL, expecting the sitemaps (or atleast the sitemap index) to be generated in my root (http://www.domain.com/sitemap_index.xml.gz).
Problem part::
I just ran the scheduled task to see if it would generate a new sitemap index in the root and no dice. It didn't even generate it in the default vBulletin directory (/forums/). The only place that it is stored is http://www.domain.com/forums/vbseo_s...p_index.xml.gz.
And now I just got a few errors in my log complaining that there was no file http://www.domain.com/vbseo_sitemap/...getsitemap.php (no /forums/).
So anyone know what's going on?
The problem was that I had to include "forums/" in the .htaccess instead of just "vbseo_sitemap/..." which I didn't realize until I actually took a look at it
But I have another bit... I only want Google and Yahoo! to use the archive, so I disabled use of showthread, showpost, member profiles, etc. and only let the sitemaps include the archive. The reason is that I think that search engines can focus more on the content than menu/other links and plus, Yahoo! Slurps have been creating some decent server load, so I think that it might be better with no advanced GUI. Is this a good idea or not?
And a new problem:
Now that spiders are crawling my archive, some of them are not taking the archive/index.php/ as a directory. I'm getting error logs like "File not found: /forums/archive/f-86.html." I don't want to have to edit the archiving script to include index.php/ in front of every one of its links. I'd much rather use a .htaccess type of thing. Does anyone know how I could do the following in .htaccess?
Redirect anything trying to get a *.html file in the /archive/ directory TO /archive/index.php/%filename%
(Where %filename% is the *.html file that the client was trying to get)
Anyone?