Go Back   vb.org Archive > vBulletin 4 Discussion > vB4 General Discussions
FAQ Community Calendar Today's Posts Search

Reply
 
Thread Tools Display Modes
  #1  
Old 01-19-2010, 03:02 AM
alaska_av8r alaska_av8r is offline
 
Join Date: Dec 2009
Posts: 88
Благодарил(а): 0 раз(а)
Поблагодарили: 0 раз(а) в 0 сообщениях
Default crawl-0c.cuil.com

Is anyone else having problems with this search engine crawler. I am assuming thats what it is, I did a google search and came up with cuil.com and that is what it says.

This thing is constantly on my site, and I keep getting mysql database errors from it. That IP addy listed below is registered to crawl-0c.cuil.com.


Database error in vBulletin :

mysql_connect() [<a href='function.mysql-connect'>function.mysql-connect</a>]: Lost connection to MySQL server at 'reading initial communication packet', system error: 111
/home/marin49/public_html/includes/class_core.php on line 312

MySQL Error :
Error Number :
Request Date : Monday, January 18th 2010 @ 11:28:16 AM
Error Date : Monday, January 18th 2010 @ 11:28:16 AM
Script : http://www.boatinghowto.com/external...S2&forumids=38
Referrer :
IP Address : 216.129.119.10
Username :
Classname : vB_Database
MySQL Version :
Reply With Quote
  #2  
Old 01-19-2010, 04:39 AM
BSMedia BSMedia is offline
 
Join Date: Feb 2009
Posts: 454
Благодарил(а): 0 раз(а)
Поблагодарили: 0 раз(а) в 0 сообщениях
Default

Create a robots.txt file and place in it. Or if you have a robots.txt simply append it to the end.

User-Agent: twiceler
Crawl-delay: 30

You can also completely block the crawler, since their search sucks any way, but that's up to you.
Reply With Quote
  #3  
Old 01-19-2010, 09:09 PM
alaska_av8r alaska_av8r is offline
 
Join Date: Dec 2009
Posts: 88
Благодарил(а): 0 раз(а)
Поблагодарили: 0 раз(а) в 0 сообщениях
Default

thanks bsmedia, I may just go ahead and block them. But just curious what does the robots.txt file actually do. I am new to this so excuse me if it is a dumb question...lol

tim
Reply With Quote
  #4  
Old 01-20-2010, 12:17 AM
BSMedia BSMedia is offline
 
Join Date: Feb 2009
Posts: 454
Благодарил(а): 0 раз(а)
Поблагодарили: 0 раз(а) в 0 сообщениях
Default

<a href="http://robotstxt.org" target="_blank">http://robotstxt.org</a>

Robots.txt is a file that reputable search engine spiders follow as a set of instructions of directories to include in their search results, to delay their requests to pages, etc.

I forgot to mention it needs to be placed in your top level directory, so you'll access it from domain.com/robots.txt or subdomain.domain.com/robots.txt
Reply With Quote
  #5  
Old 01-20-2010, 03:28 AM
alaska_av8r alaska_av8r is offline
 
Join Date: Dec 2009
Posts: 88
Благодарил(а): 0 раз(а)
Поблагодарили: 0 раз(а) в 0 сообщениях
Default

Thank you, I will do a google search on that and see if what I need to put in there and learn all the ins and outs.
Reply With Quote
Reply


Posting Rules
You may not post new threads
You may not post replies
You may not post attachments
You may not edit your posts

BB code is On
Smilies are On
[IMG] code is On
HTML code is Off

Forum Jump


All times are GMT. The time now is 12:55 PM.


Powered by vBulletin® Version 3.8.12 by vBS
Copyright ©2000 - 2024, vBulletin Solutions Inc.
X vBulletin 3.8.12 by vBS Debug Information
  • Page Generation 0.06247 seconds
  • Memory Usage 2,192KB
  • Queries Executed 11 (?)
More Information
Template Usage:
  • (1)SHOWTHREAD
  • (1)ad_footer_end
  • (1)ad_footer_start
  • (1)ad_header_end
  • (1)ad_header_logo
  • (1)ad_navbar_below
  • (1)ad_showthread_beforeqr
  • (1)ad_showthread_firstpost
  • (1)ad_showthread_firstpost_sig
  • (1)ad_showthread_firstpost_start
  • (1)footer
  • (1)forumjump
  • (1)forumrules
  • (1)gobutton
  • (1)header
  • (1)headinclude
  • (1)navbar
  • (3)navbar_link
  • (120)option
  • (5)post_thanks_box
  • (5)post_thanks_button
  • (1)post_thanks_javascript
  • (1)post_thanks_navbar_search
  • (5)post_thanks_postbit_info
  • (5)postbit
  • (5)postbit_onlinestatus
  • (5)postbit_wrapper
  • (1)spacer_close
  • (1)spacer_open
  • (1)tagbit_wrapper 

Phrase Groups Available:
  • global
  • inlinemod
  • postbit
  • posting
  • reputationlevel
  • showthread
Included Files:
  • ./showthread.php
  • ./global.php
  • ./includes/init.php
  • ./includes/class_core.php
  • ./includes/config.php
  • ./includes/functions.php
  • ./includes/class_hook.php
  • ./includes/modsystem_functions.php
  • ./includes/functions_bigthree.php
  • ./includes/class_postbit.php
  • ./includes/class_bbcode.php
  • ./includes/functions_reputation.php
  • ./includes/functions_post_thanks.php 

Hooks Called:
  • init_startup
  • init_startup_session_setup_start
  • init_startup_session_setup_complete
  • cache_permissions
  • fetch_threadinfo_query
  • fetch_threadinfo
  • fetch_foruminfo
  • style_fetch
  • cache_templates
  • global_start
  • parse_templates
  • global_setup_complete
  • showthread_start
  • showthread_getinfo
  • forumjump
  • showthread_post_start
  • showthread_query_postids
  • showthread_query
  • bbcode_fetch_tags
  • bbcode_create
  • showthread_postbit_create
  • postbit_factory
  • postbit_display_start
  • post_thanks_function_post_thanks_off_start
  • post_thanks_function_post_thanks_off_end
  • post_thanks_function_fetch_thanks_start
  • post_thanks_function_fetch_thanks_end
  • post_thanks_function_thanked_already_start
  • post_thanks_function_thanked_already_end
  • fetch_musername
  • postbit_imicons
  • bbcode_parse_start
  • bbcode_parse_complete_precache
  • bbcode_parse_complete
  • postbit_display_complete
  • post_thanks_function_can_thank_this_post_start
  • tag_fetchbit_complete
  • forumrules
  • navbits
  • navbits_complete
  • showthread_complete