Go Back   vb.org Archive > Community Central > Community Lounge
FAQ Community Calendar Today's Posts Search

Reply
 
Thread Tools Display Modes
  #1  
Old 12-11-2006, 04:35 PM
Zia's Avatar
Zia Zia is offline
 
Join Date: Dec 2005
Location: golpo.net
Posts: 931
Благодарил(а): 0 раз(а)
Поблагодарили: 0 раз(а) в 0 сообщениях
Default Disallow robots - .htaccess & robots.txt

helo...
as we know we can control bots/spider using robots.txt

we know to disallow bot we use
we use:
Quote:
User-agent: BotName
Disallow: /
but to allow any particular bot for all places can we use bellow???

Quote:
User-Agent: Googlebot
Allow: /
source:http://www.mattcutts.com/blog/new-robotstxt-tool

and using .htaccess we are currently using to disaalow badbot



Quote:
RewriteCond %{HTTP_USER_AGENT} ^mister\ pix [NC,OR]
RewriteCond %{HTTP_USER_AGENT} ^Reaper [OR]
RewriteCond %{HTTP_USER_AGENT} ^Recorder [OR]
RewriteCond %{HTTP_REFERER} ^http://www.iaea.org$
RewriteRule !^http://[^/.]\.golpo.net.* - [F,L]

<Files php.ini>
Order allow,deny
Deny from all
</Files>
here what the mean of NC & FL ?

-- in other palce i read

Quote:
SetEnvIfNoCase User-Agent "^EmailSiphon" bad_bot
SetEnvIfNoCase User-Agent "^EmailWolf" bad_bot
SetEnvIfNoCase User-Agent "^ExtractorPro" bad_bot
SetEnvIfNoCase User-Agent "^CherryPicker" bad_bot
SetEnvIfNoCase User-Agent "^NICErsPRO" bad_bot
SetEnvIfNoCase User-Agent "^Teleport" bad_bot
SetEnvIfNoCase User-Agent "^EmailCollector" bad_bot

<Limit GET POST>
Order Allow,Deny
Allow from all
Deny from env=bad_bot
</Limit>
---
we are really confused...any one can give us some tips...what is what ?
and is perfect?
which .htaccess is good to disallow badbots?
Reply With Quote
  #2  
Old 12-13-2006, 01:12 PM
Zia's Avatar
Zia Zia is offline
 
Join Date: Dec 2005
Location: golpo.net
Posts: 931
Благодарил(а): 0 раз(а)
Поблагодарили: 0 раз(а) в 0 сообщениях
Default

helo..
any one can tell me...

about it..

i also want to BAN some bots too...
Reply With Quote
  #3  
Old 12-13-2006, 01:21 PM
optrex optrex is offline
 
Join Date: Sep 2005
Posts: 344
Благодарил(а): 0 раз(а)
Поблагодарили: 0 раз(а) в 0 сообщениях
Default

There is more than one way to do this. I use

Code:
   SetEnvIfNoCase User-Agent "libwww-perl/" bad_bot
   Order Allow,Deny
   Allow from all
   Deny from env=bad_bot
Reply With Quote
  #4  
Old 12-13-2006, 02:08 PM
Zia's Avatar
Zia Zia is offline
 
Join Date: Dec 2005
Location: golpo.net
Posts: 931
Благодарил(а): 0 раз(а)
Поблагодарили: 0 раз(а) в 0 сообщениях
Default

GOOD TO KNOW..

Also can u tell...how many bot i can ban/disallow using .htaccess ?
any limitations ? or as much as i want..
Reply With Quote
  #5  
Old 12-13-2006, 02:54 PM
Crow's Avatar
Crow Crow is offline
 
Join Date: Jan 2005
Location: Detroit,Mich
Posts: 76
Благодарил(а): 0 раз(а)
Поблагодарили: 0 раз(а) в 0 сообщениях
Default

Can't you just use..

Code:
User-agent: *
Disallow: /
And that work for all bots? and all pages?
Reply With Quote
  #6  
Old 12-13-2006, 02:57 PM
optrex optrex is offline
 
Join Date: Sep 2005
Posts: 344
Благодарил(а): 0 раз(а)
Поблагодарили: 0 раз(а) в 0 сообщениях
Default

Yes but then you would end up blocking yahoo and google etc, NOT a very good idea!

btw, zia, afaik there are no restrictions in htaccess
Reply With Quote
  #7  
Old 12-13-2006, 04:46 PM
Crow's Avatar
Crow Crow is offline
 
Join Date: Jan 2005
Location: Detroit,Mich
Posts: 76
Благодарил(а): 0 раз(а)
Поблагодарили: 0 раз(а) в 0 сообщениях
Default

Quote:
Originally Posted by optrex View Post
Yes but then you would end up blocking yahoo and google etc, NOT a very good idea!

btw, zia, afaik there are no restrictions in htaccess
Sorry I mis-read the post, for some odd reason I thought he wanted to ban all bots.. Must be lack of sleep.:tired:
Reply With Quote
  #8  
Old 12-15-2006, 12:40 AM
SaN-DeeP's Avatar
SaN-DeeP SaN-DeeP is offline
 
Join Date: Jun 2002
Location: Mumbai, India
Posts: 1,195
Благодарил(а): 0 раз(а)
Поблагодарили: 0 раз(а) в 0 сообщениях
Default

As much as you want but banning from .htaccess = increase on server loads, try adding this values to your httpd.conf on apache webserver.

Interesting discussion here.
Reply With Quote
  #9  
Old 12-15-2006, 03:44 PM
Zia's Avatar
Zia Zia is offline
 
Join Date: Dec 2005
Location: golpo.net
Posts: 931
Благодарил(а): 0 раз(а)
Поблагодарили: 0 раз(а) в 0 сообщениях
Default

Quote:
Originally Posted by SaN-DeeP View Post
As much as you want but banning from .htaccess = increase on server loads, try adding this values to your httpd.conf on apache webserver.

Interesting discussion here.
Im infact curious about this Server laod?

here im attaching my portion..yet not i implemeted it..

planning to do...

All the way i want to disallow bad-bots..

any one around here to have a look on it..?

Any comments about the bots ?

httpd.conf --hmm i gez i have to contact with service provider tho on a vps
Attached Files
File Type: txt htaccess.txt (6.9 KB, 27 views)
Reply With Quote
Reply


Posting Rules
You may not post new threads
You may not post replies
You may not post attachments
You may not edit your posts

BB code is On
Smilies are On
[IMG] code is On
HTML code is Off

Forum Jump


All times are GMT. The time now is 03:00 PM.


Powered by vBulletin® Version 3.8.12 by vBS
Copyright ©2000 - 2025, vBulletin Solutions Inc.
X vBulletin 3.8.12 by vBS Debug Information
  • Page Generation 0.04183 seconds
  • Memory Usage 2,267KB
  • Queries Executed 14 (?)
More Information
Template Usage:
  • (1)SHOWTHREAD
  • (1)ad_footer_end
  • (1)ad_footer_start
  • (1)ad_header_end
  • (1)ad_header_logo
  • (1)ad_navbar_below
  • (1)ad_showthread_beforeqr
  • (1)ad_showthread_firstpost
  • (1)ad_showthread_firstpost_sig
  • (1)ad_showthread_firstpost_start
  • (2)bbcode_code
  • (6)bbcode_quote
  • (1)footer
  • (1)forumjump
  • (1)forumrules
  • (1)gobutton
  • (1)header
  • (1)headinclude
  • (1)navbar
  • (3)navbar_link
  • (120)option
  • (9)post_thanks_box
  • (9)post_thanks_button
  • (1)post_thanks_javascript
  • (1)post_thanks_navbar_search
  • (9)post_thanks_postbit_info
  • (9)postbit
  • (1)postbit_attachment
  • (9)postbit_onlinestatus
  • (9)postbit_wrapper
  • (1)spacer_close
  • (1)spacer_open
  • (1)tagbit_wrapper 

Phrase Groups Available:
  • global
  • inlinemod
  • postbit
  • posting
  • reputationlevel
  • showthread
Included Files:
  • ./showthread.php
  • ./global.php
  • ./includes/init.php
  • ./includes/class_core.php
  • ./includes/config.php
  • ./includes/functions.php
  • ./includes/class_hook.php
  • ./includes/modsystem_functions.php
  • ./includes/functions_bigthree.php
  • ./includes/class_postbit.php
  • ./includes/class_bbcode.php
  • ./includes/functions_reputation.php
  • ./includes/functions_post_thanks.php 

Hooks Called:
  • init_startup
  • init_startup_session_setup_start
  • init_startup_session_setup_complete
  • cache_permissions
  • fetch_postinfo_query
  • fetch_postinfo
  • fetch_threadinfo_query
  • fetch_threadinfo
  • fetch_foruminfo
  • style_fetch
  • cache_templates
  • global_start
  • parse_templates
  • global_setup_complete
  • showthread_start
  • showthread_getinfo
  • forumjump
  • showthread_post_start
  • showthread_query_postids
  • showthread_query
  • bbcode_fetch_tags
  • bbcode_create
  • showthread_postbit_create
  • postbit_factory
  • postbit_display_start
  • post_thanks_function_post_thanks_off_start
  • post_thanks_function_post_thanks_off_end
  • post_thanks_function_fetch_thanks_start
  • post_thanks_function_fetch_thanks_end
  • post_thanks_function_thanked_already_start
  • post_thanks_function_thanked_already_end
  • fetch_musername
  • postbit_imicons
  • bbcode_parse_start
  • bbcode_parse_complete_precache
  • bbcode_parse_complete
  • postbit_display_complete
  • post_thanks_function_can_thank_this_post_start
  • postbit_attachment
  • tag_fetchbit_complete
  • forumrules
  • navbits
  • navbits_complete
  • showthread_complete