Go Back   vb.org Archive > vBulletin 4 Discussion > vB4 General Discussions
FAQ Community Calendar Today's Posts Search

Reply
 
Thread Tools Display Modes
  #1  
Old 03-22-2010, 06:25 PM
David Rose David Rose is offline
 
Join Date: Feb 2010
Posts: 37
Благодарил(а): 0 раз(а)
Поблагодарили: 0 раз(а) в 0 сообщениях
Default robots.txt question

Hello,
I just opened my first Vbulletin site a few weeks ago and I was suprised to see that it wasn't yet indexed by Google.

I added the webmaster tools just now and I already received the next message:
We were unable to crawl your Sitemap because we found a robots.txt file at the root of your site but were unable to download it. Please ensure that it is accessible or remove it completely.
I know basically what the robots.txt file is but I wasn't even thinking on using it at the moment.
Is that something I did? Is there an option I need to disable? How do I take it off?

Thanks !
Reply With Quote
  #2  
Old 03-22-2010, 06:53 PM
Brandon Sheley's Avatar
Brandon Sheley Brandon Sheley is offline
 
Join Date: Mar 2005
Location: Google Kansas
Posts: 4,678
Благодарил(а): 0 раз(а)
Поблагодарили: 0 раз(а) в 0 сообщениях
Default

you should add a robots.txt file
here is mine if you want an example
http://www.general-forums.com/robots.txt
Reply With Quote
  #3  
Old 03-22-2010, 07:01 PM
David Rose David Rose is offline
 
Join Date: Feb 2010
Posts: 37
Благодарил(а): 0 раз(а)
Поблагодарили: 0 раз(а) в 0 сообщениях
Default

Thanks for your reply,
at the moment I'd like the Search Engines to crawl through what ever they want.

In that case, is there a robots.txt file that always exists and I need to take it off?

Or in other words, how do I disable this and allow Google to crawl through anything?

Thanks.
Reply With Quote
  #4  
Old 03-22-2010, 07:27 PM
Brandon Sheley's Avatar
Brandon Sheley Brandon Sheley is offline
 
Join Date: Mar 2005
Location: Google Kansas
Posts: 4,678
Благодарил(а): 0 раз(а)
Поблагодарили: 0 раз(а) в 0 сообщениях
Default

I wouldn't advise letting them "crawl what they want"
you'll do far better if you restrict the bots to crawling the actual content, not pages like the member pages, or faq for example..
Reply With Quote
  #5  
Old 03-22-2010, 07:32 PM
David Rose David Rose is offline
 
Join Date: Feb 2010
Posts: 37
Благодарил(а): 0 раз(а)
Поблагодарили: 0 раз(а) в 0 сообщениях
Default

Yeah I know what you mean but I think that I'll decide in a later stage, the forum is compeltly new and it doesn't matter to me right NOW.

Anyways, what should I do in order to get rid of it for now? I want Google to index the homepage and it won't because of that.

I actually was surprised that it's there, is this file coming automatically with the VBulletin ?
Reply With Quote
  #6  
Old 03-22-2010, 08:23 PM
StarBuG's Avatar
StarBuG StarBuG is offline
 
Join Date: Dec 2001
Location: Germany
Posts: 1,033
Благодарил(а): 0 раз(а)
Поблагодарили: 0 раз(а) в 0 сообщениях
Default

Connect to your forum root via FTP and remove the robots.txt

Or chmod it 644 so google can read it.
The problem isn't the restriction inside, it is that google is unable to access/read it at all.
Reply With Quote
  #7  
Old 03-26-2010, 09:28 AM
David Rose David Rose is offline
 
Join Date: Feb 2010
Posts: 37
Благодарил(а): 0 раз(а)
Поблагодарили: 0 раз(а) в 0 сообщениях
Default

Thank you, well seems like this problem was solved but now I have a new one.
When adding new pages to the Sitemap, I receive Sitemap errors and warnings:

Sitemap is HTML
Your Sitemap appears to be an HTML page. Please use a supported sitemap format instead.


Any idea what's wrong?

Thanks !
Reply With Quote
  #8  
Old 05-18-2010, 04:14 AM
jkcerda jkcerda is offline
 
Join Date: Mar 2008
Posts: 425
Благодарил(а): 0 раз(а)
Поблагодарили: 0 раз(а) в 0 сообщениях
Default

Quote:
Originally Posted by Loco.M View Post
you should add a robots.txt file
here is mine if you want an example
http://www.general-forums.com/robots.txt
ok, let me get this straight, all you do is make the robots.txt file and load it to the root folder?
like http://www.vbulletin.com/forum/showt...obots.txt-file
Reply With Quote
  #9  
Old 05-18-2010, 10:36 AM
your24hourstore your24hourstore is offline
 
Join Date: Feb 2010
Posts: 1,226
Благодарил(а): 0 раз(а)
Поблагодарили: 0 раз(а) в 0 сообщениях
Default

if you want a robots.txtx file that is what you must do :>

they dont put themselves up! LOL

--------------- Added [DATE]1274183085[/DATE] at [TIME]1274183085[/TIME] ---------------

Quote:
Originally Posted by David Rose View Post
Thank you, well seems like this problem was solved but now I have a new one.
When adding new pages to the Sitemap, I receive Sitemap errors and warnings:

Sitemap is HTML
Your Sitemap appears to be an HTML page. Please use a supported sitemap format instead.


Any idea what's wrong?

Thanks !
if you read at all in here you have seen my rants about the sitemaps generated by this software. not only does it create weird errors , sometimes it wont work at all, if you are going to run it, i suggest you run xml sitemap generator paid version, or if you want vbseo.

but vbull has a real problem with its sitemaps, if you use it, you need to set it run create sitemap daily, that way you don't have a messed up sitemap for days or weeks.

instead of running xmlsitemap.php
go to xml sitemap generators google it, and use theirs to generate a xml sitemap, then upload it to your root and point webmaster to that file.

I have never been able to find anyone here or on vbull forums that will address this issue its like your invisible if you report errors in sitemap, but go ahead and start a ticket over at vbull and get ready for them to tell you its your fault.
Reply With Quote
  #10  
Old 05-18-2010, 04:07 PM
jkcerda jkcerda is offline
 
Join Date: Mar 2008
Posts: 425
Благодарил(а): 0 раз(а)
Поблагодарили: 0 раз(а) в 0 сообщениях
Default

Quote:
Originally Posted by harleyparts View Post
if you want a robots.txtx file that is what you must do :>

they dont put themselves up! LOL



if you read at all in here you have seen my rants about the sitemaps generated by this software. not only does it create weird errors , sometimes it wont work at all, if you are going to run it, i suggest you run xml sitemap generator paid version, or if you want vbseo.

but vbull has a real problem with its sitemaps, if you use it, you need to set it run create sitemap daily, that way you don't have a messed up sitemap for days or weeks.

instead of running xmlsitemap.php
go to xml sitemap generators google it, and use theirs to generate a xml sitemap, then upload it to your root and point webmaster to that file.

I have never been able to find anyone here or on vbull forums that will address this issue its like your invisible if you report errors in sitemap, but go ahead and start a ticket over at vbull and get ready for them to tell you its your fault.

thanks HR
now sitemaps
here?
http://www.xml-sitemaps.com/
Reply With Quote
Reply


Posting Rules
You may not post new threads
You may not post replies
You may not post attachments
You may not edit your posts

BB code is On
Smilies are On
[IMG] code is On
HTML code is Off

Forum Jump


All times are GMT. The time now is 08:11 PM.


Powered by vBulletin® Version 3.8.12 by vBS
Copyright ©2000 - 2024, vBulletin Solutions Inc.
X vBulletin 3.8.12 by vBS Debug Information
  • Page Generation 0.05416 seconds
  • Memory Usage 2,249KB
  • Queries Executed 13 (?)
More Information
Template Usage:
  • (1)SHOWTHREAD
  • (1)ad_footer_end
  • (1)ad_footer_start
  • (1)ad_header_end
  • (1)ad_header_logo
  • (1)ad_navbar_below
  • (1)ad_showthread_beforeqr
  • (1)ad_showthread_firstpost
  • (1)ad_showthread_firstpost_sig
  • (1)ad_showthread_firstpost_start
  • (3)bbcode_quote
  • (1)footer
  • (1)forumjump
  • (1)forumrules
  • (1)gobutton
  • (1)header
  • (1)headinclude
  • (1)navbar
  • (3)navbar_link
  • (120)option
  • (10)post_thanks_box
  • (10)post_thanks_button
  • (1)post_thanks_javascript
  • (1)post_thanks_navbar_search
  • (10)post_thanks_postbit_info
  • (10)postbit
  • (10)postbit_onlinestatus
  • (10)postbit_wrapper
  • (1)spacer_close
  • (1)spacer_open
  • (1)tagbit_wrapper 

Phrase Groups Available:
  • global
  • inlinemod
  • postbit
  • posting
  • reputationlevel
  • showthread
Included Files:
  • ./showthread.php
  • ./global.php
  • ./includes/init.php
  • ./includes/class_core.php
  • ./includes/config.php
  • ./includes/functions.php
  • ./includes/class_hook.php
  • ./includes/modsystem_functions.php
  • ./includes/functions_bigthree.php
  • ./includes/class_postbit.php
  • ./includes/class_bbcode.php
  • ./includes/functions_reputation.php
  • ./includes/functions_post_thanks.php 

Hooks Called:
  • init_startup
  • init_startup_session_setup_start
  • init_startup_session_setup_complete
  • cache_permissions
  • fetch_postinfo_query
  • fetch_postinfo
  • fetch_threadinfo_query
  • fetch_threadinfo
  • fetch_foruminfo
  • style_fetch
  • cache_templates
  • global_start
  • parse_templates
  • global_setup_complete
  • showthread_start
  • showthread_getinfo
  • forumjump
  • showthread_post_start
  • showthread_query_postids
  • showthread_query
  • bbcode_fetch_tags
  • bbcode_create
  • showthread_postbit_create
  • postbit_factory
  • postbit_display_start
  • post_thanks_function_post_thanks_off_start
  • post_thanks_function_post_thanks_off_end
  • post_thanks_function_fetch_thanks_start
  • post_thanks_function_fetch_thanks_end
  • post_thanks_function_thanked_already_start
  • post_thanks_function_thanked_already_end
  • fetch_musername
  • postbit_imicons
  • bbcode_parse_start
  • bbcode_parse_complete_precache
  • bbcode_parse_complete
  • postbit_display_complete
  • post_thanks_function_can_thank_this_post_start
  • tag_fetchbit_complete
  • forumrules
  • navbits
  • navbits_complete
  • showthread_complete