vb.org Archive

vb.org Archive (https://vborg.vbsupport.ru/index.php)
-   vB3 General Discussions (https://vborg.vbsupport.ru/forumdisplay.php?f=111)
-   -   search engine friendly URLs in a new way? (https://vborg.vbsupport.ru/showthread.php?t=51948)

okrogius 04-21-2003 11:00 PM

search engine friendly URLs in a new way?
 
Ala simular to the "ebay style feedback system" kind-of thread. Now to the point... :)

Search engine friendly URLs are always wanted on forums. Why should your thousand of posts not be indexed in google, afterall? There are hacks here that give a search engine friendly "archive", and some mod_rewrite hacks.

I've yet to find a hack for the actual forums, and not some measly archive, that didn't require mod_rewrite and intensive POSIX knowledge. So, I've been playing around here and there with a modification of my own.

Demo:
http://www.cgshockforums.com/

The modification does require Apache ("lookback" feature of mod_mime in particular). I've only modified the showthread and forumdisplay templates to reflect the "cleaner" urls.

Why? A part of this system is also a customized robots.txt file. Googlebot (or any other cralwer that respects robots.txt) is only allowed to index the /go/ "folder" (as all other files are disallowed). Thus the search engine is only indexing the relevant pages :).

Additionally, the system works nicely by hiding the sessionhash depending on whether it is needed or not. Sessionhash is always hidden from search engine crawlers (defined in an array by HTTP_USER_AGENT in phpinclude template).

Comments? Ideas? Suggestions? :p

filburt1 04-21-2003 11:49 PM

What about my hack?

okrogius 04-22-2003 12:04 AM

Quote:

Today at 07:49 PM filburt1 said this in Post #2
What about my hack?

Yours I believe is relying on mod_rewrite, no?

filburt1 04-22-2003 12:51 AM

No, it's just a one-step modification.

okrogius 04-22-2003 08:26 PM

Then, out of curiosity, how does yours work? Having a 404 php page which deciphers the request urls and makes proper include?

filburt1 04-22-2003 08:35 PM

No, it just gets rid of the sessionhash which is all that stops Googlebot from going on an indexing frenzy.

okrogius 04-22-2003 11:13 PM

Quote:

Today at 04:35 PM filburt1 said this in Post #6
No, it just gets rid of the sessionhash which is all that stops Googlebot from going on an indexing frenzy.
True but what I was referring is having forumX.html opposed to forumdisplay.php?forumid=X and sme for showthread.

filburt1 04-22-2003 11:20 PM

The idea though is that it shouldn't matter. Despite myth, Google will still index (some) dynamic URLs.

colicab-d 04-22-2003 11:30 PM

yeah filburt how did you manage to get WDF to have forumxx.html ?? im quit curious as to that :D

filburt1 04-22-2003 11:56 PM

I just used mod_rewrite. Note though that my flat file rewriting also trashes the sessionhash in the process.

Dean C 04-23-2003 09:54 AM

You should make it public and make all the url's correct too ;)

- miSt

okrogius 04-23-2003 08:02 PM

Quote:

Yesterday at 07:20 PM filburt1 said this in Post #8
The idea though is that it shouldn't matter. Despite myth, Google will still index (some) dynamic URLs.
Google indexes dynamic urls fine, as long as there is no session hash present. THe only issue is that, googlbot is programmed not to index too much dynamic urls, as to not overload your site. Generally I find that getting all pages indexes is more valuable then slightly higher load while the crawler is at work.

Till 10-01-2003 08:56 AM

We use a slightly different approach.

Hop over to:
http://forums.htmlcenter.com/archived

Comments appreciated. :)

Till

mccollin 10-01-2003 04:46 PM

I've tried this hack, and the session hash doesn't appear to be getting dropped by Google. For example, when I look at 64.68.87.55, which Samspade says is Google, the url it is displaying has the session hash in it.

amykhar 10-01-2003 04:53 PM

mccollin, it's not in the true url though. The who's online page is not showing you the actual url. It is showing you what is in the session table.

Amy

mccollin 10-02-2003 11:57 PM

Ah, so you think it is working even though who's online would indicate that it isn't. What about in the log files... they are showing the full session hash as well. How would I then know if it is working?


All times are GMT. The time now is 04:28 AM.

Powered by vBulletin® Version 3.8.12 by vBS
Copyright ©2000 - 2025, vBulletin Solutions Inc.

X vBulletin 3.8.12 by vBS Debug Information
  • Page Generation 0.01011 seconds
  • Memory Usage 1,743KB
  • Queries Executed 10 (?)
More Information
Template Usage:
  • (1)ad_footer_end
  • (1)ad_footer_start
  • (1)ad_header_end
  • (1)ad_header_logo
  • (1)ad_navbar_below
  • (3)bbcode_quote_printable
  • (1)footer
  • (1)gobutton
  • (1)header
  • (1)headinclude
  • (6)option
  • (1)post_thanks_navbar_search
  • (1)printthread
  • (16)printthreadbit
  • (1)spacer_close
  • (1)spacer_open 

Phrase Groups Available:
  • global
  • postbit
  • showthread
Included Files:
  • ./printthread.php
  • ./global.php
  • ./includes/init.php
  • ./includes/class_core.php
  • ./includes/config.php
  • ./includes/functions.php
  • ./includes/class_hook.php
  • ./includes/modsystem_functions.php
  • ./includes/class_bbcode_alt.php
  • ./includes/class_bbcode.php
  • ./includes/functions_bigthree.php 

Hooks Called:
  • init_startup
  • init_startup_session_setup_start
  • init_startup_session_setup_complete
  • cache_permissions
  • fetch_threadinfo_query
  • fetch_threadinfo
  • fetch_foruminfo
  • style_fetch
  • cache_templates
  • global_start
  • parse_templates
  • global_setup_complete
  • printthread_start
  • bbcode_fetch_tags
  • bbcode_create
  • bbcode_parse_start
  • bbcode_parse_complete_precache
  • bbcode_parse_complete
  • printthread_post
  • printthread_complete