I see your point. However, I am running a small experiment: I am creating some language-specific domains (uploading a copy of the forum to each) and showing only one language per domain. That is why I need separate sitemaps. Hopefully this strategy will lead to good indexing.
I use language= links and this robots.txt to make spiders crawl only the specific language (if any of you see a flaw in the sitemap, I would of course be very interested to know).
Quote:
User-agent: Googlebot
Disallow: /_gsdata_/
Disallow: /admincp/
Disallow: /archive/
Disallow: /banner/
Disallow: /cgi-bin/
Disallow: /chat/
Disallow: /chat1/
Disallow: /chat2/
Disallow: /clientscript/
Disallow: /cpstyles/
Disallow: /css/
Disallow: /customavatars/
Disallow: /customprofilepics/
Disallow: /forums/
Disallow: /geek/
Disallow: /groups/
Disallow: /icon/
Disallow: /images/
Disallow: /impex/
Disallow: /includes/
Disallow: /install/
Disallow: /misc/
Disallow: /modcp/
Disallow: /phpbb/
Disallow: /forum_phpbb/
Disallow: /irc/
Disallow: /picture_library/
Disallow: /plesk-stat/
Disallow: /signaturepics/
Disallow: /tem/
Disallow: /test/
Disallow: /tg3/
Disallow: /upload/
Disallow: /vbseo/
Disallow: /vbseo_sitemap/
Disallow: /wiki/
Disallow: /*.php$
Disallow: /*order
Disallow: /*sort
Disallow: /*mode
Disallow: /*goto
Disallow: /*nojs
Disallow: /*s=1
Disallow: /*s=2
Disallow: /*s=3
Disallow: /*s=4
Disallow: /*s=5
Disallow: /*s=6
Disallow: /*s=7
Disallow: /*s=8
Disallow: /*s=9
Disallow: /*s=0
Disallow: /*s=e1
Disallow: /*s=e2
Disallow: /*s=e3
Disallow: /*s=e4
Disallow: /*s=e5
Disallow: /*s=e6
Disallow: /*s=e7
Disallow: /*s=e8
Disallow: /*s=e9
Disallow: /*s=e0
Disallow: /*1$
Disallow: /*2$
Disallow: /*3$
Disallow: /*4$
Disallow: /*5$
Disallow: /*6$
Disallow: /*7$
Disallow: /*8$
Disallow: /*9$
Disallow: /*0$
Disallow: /*language=af
Disallow: /*language=be
Disallow: /*language=bg
Disallow: /*language=ca
Disallow: /*language=cs
Disallow: /*language=cy
Disallow: /*language=da
Disallow: /*language=de
Disallow: /*language=el
Disallow: /*language=en_use_this
Disallow: /*language=es
Disallow: /*language=et
Disallow: /*language=fa
Disallow: /*language=fi
Disallow: /*language=fr
Disallow: /*language=ga
Disallow: /*language=gl
Disallow: /*language=hi
Disallow: /*language=hr
Disallow: /*language=hu
Disallow: /*language=id
Disallow: /*language=is
Disallow: /*language=it
Disallow: /*language=iw
Disallow: /*language=ja
Disallow: /*language=ko
Disallow: /*language=lt
Disallow: /*language=lv
Disallow: /*language=mk
Disallow: /*language=ms
Disallow: /*language=mt
Disallow: /*language=nl
Disallow: /*language=no
Disallow: /*language=pl
Disallow: /*language=pt
Disallow: /*language=ro
Disallow: /*language=ru
Disallow: /*language=sk
Disallow: /*language=se
Disallow: /*language=sl
Disallow: /*language=sq
Disallow: /*language=sr
Disallow: /*language=sv
Disallow: /*language=sw
Disallow: /*language=th
Disallow: /*language=tl
Disallow: /*language=tr
Disallow: /*language=uk
Disallow: /*language=vi
Disallow: /*language=yi
Disallow: /*language=zh-CN
Disallow: /*language=zh-TW
Googlebot should adhere to the above, but I am a bit unsure whether it contains syntax (such as the * and $ wildcards) that other spiders will not adhere to.
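To check what rules like these actually block, here is a minimal Python sketch of Google-style pattern matching, where * matches any run of characters and a trailing $ anchors the end of the URL. It is only an approximation for testing: it handles Disallow rules alone (no Allow lines, no rule precedence), the function names rule_to_regex and is_blocked are made up for this example, and the sample URLs are hypothetical.

```python
import re


def rule_to_regex(rule: str) -> re.Pattern:
    """Translate one Google-style Disallow pattern into a compiled regex."""
    anchored = rule.endswith("$")
    if anchored:
        rule = rule[:-1]
    # Escape each literal piece, then join the pieces with ".*" for every "*".
    body = ".*".join(re.escape(part) for part in rule.split("*"))
    # Without a trailing "$" the rule is a prefix match, so no end anchor.
    return re.compile(body + (r"\Z" if anchored else ""))


def is_blocked(url_path: str, disallow_rules: list[str]) -> bool:
    """Return True if any Disallow rule matches the path plus query string."""
    return any(rule_to_regex(rule).match(url_path) for rule in disallow_rules)


# A few rules taken from the robots.txt above.
rules = ["/archive/", "/*.php$", "/*3$", "/*language=de"]

# Hypothetical URLs, only to illustrate the matching.
for path in [
    "/index.php",
    "/showthread.php?t=12&language=de",
    "/showthread.php?t=12&language=en",
    "/showthread.php?t=123",
]:
    print(path, "blocked" if is_blocked(path, rules) else "allowed")
```

Running the full rule list against a handful of real URLs from each domain is a quick way to spot rules that block more than intended, since unanchored patterns such as /*order or /*mode match anywhere in the URL.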
I don't use the folder structure ("SEO links") because I couldn't find a way to use robots.txt to narrow crawling down to one language with that type of link.
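Since every language domain needs the same list with a different language left out, the language= block can be generated rather than edited by hand. This is just a sketch under my own assumptions: the function name language_disallow_lines is hypothetical, and the code list is a short illustrative subset of the codes in the robots.txt above.

```python
def language_disallow_lines(all_codes: list[str], keep: str) -> list[str]:
    """Build Disallow lines for every language code except the one to keep."""
    if keep not in all_codes:
        raise ValueError(f"unknown language code: {keep!r}")
    return [f"Disallow: /*language={code}" for code in all_codes if code != keep]


# Illustrative subset only; the real list would hold every code the forum uses.
codes = ["da", "de", "en", "es", "fr", "zh-CN", "zh-TW"]

# Block of rules for a domain that should only have German crawled.
print("\n".join(language_disallow_lines(codes, keep="de")))
```

The output can be pasted under the folder rules in each domain's robots.txt, so the kept language is simply absent from the list instead of being renamed.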