vb.org Archive

vb.org Archive (https://vborg.vbsupport.ru/index.php)
-   vB4 Programming Discussions (https://vborg.vbsupport.ru/forumdisplay.php?f=252)
-   -   What should be in robots.txt? (https://vborg.vbsupport.ru/showthread.php?t=304164)

Max Taxable 11-06-2013 03:13 PM

Quote:

Originally Posted by Videx (Post 2458680)
If we're taking a poll, I have both those files in mine. It never occurred to me that a hacker would need that information for some nefarious purpose. I mean, once they get into the server they're going to know all that stuff anyway.

Nobody mentioned hackers. Script Kiddies, was the meter.

There is no reason to include admincp and modcp in robots.txt. The bad bots are going there anyway.

Digital Jedi 11-06-2013 06:58 PM

Quote:

Originally Posted by GaryT (Post 2458297)
Ok so for me to open my forum to badbots or "scripy kiddies" I'm at risk, I got that.

Then here is my headache, restrict the bad guys and lose being found by folks looking for the type of content my site offers or, take a chance so I may gain more members?

I'm going to start Googleing and hopefully there is a sweet spot to go with.

Thanks

Particularly, you don't want to include your admin and mod directories if you've changed them for the very reason vBulletin let's you change them in the first place. To make it harder for a hacker to guess where they are.

But as an aside, it's pointless to put them in robots.txt, since, as was mentioned, robots.txt is an honor system thing. Legitimate bots already know about and have no use for your Admin CP (presuming the name hasn't been changed.) Whereas bad bots aren't going to honor it in the first place. Robots.txt is something you use to control how much legitimate bots see and index. Don't even consider it for bad bots. For bad bots, consider other blocking tools such as Simon's modification or .htaccess. In my case, my host let's me ban IP addresses from all my domains via cPanel, which simply automates the editing of .htaccess across all my domains.

For robots.txt, you may want to try a little trial and error. A few years ago I was getting slammed by MSNbot. I didn't know why (probably because Bing was about to startup in a year or two), but it was hogging system resources and was exacerbating pre-existing conditions. So I set a crawl-delay for MSN.

[pre]User-agent: msnbot
Crawl-delay: 3[/pre]

Basically, I'm saying you tweak robots.txt according to your needs. Look at your modifications that have their own unique pages. Do you feel the need to have it index every one of those? Just as an example, if you had iTrader installed, you probably wouldn't need the iTrader ratings page indexed. Or the arcade. Or a page devoted to Facebook login. You probably have default pages that don't need to be indexed, like login.php or private.php. Robot.txt will help to keep bots from wasting their time there.

K4GAP 11-07-2013 04:29 AM

That's some good info, thanks.


All times are GMT. The time now is 02:02 PM.

Powered by vBulletin® Version 3.8.12 by vBS
Copyright ©2000 - 2025, vBulletin Solutions Inc.

X vBulletin 3.8.12 by vBS Debug Information
  • Page Generation 0.01111 seconds
  • Memory Usage 1,725KB
  • Queries Executed 10 (?)
More Information
Template Usage:
  • (1)ad_footer_end
  • (1)ad_footer_start
  • (1)ad_header_end
  • (1)ad_header_logo
  • (1)ad_navbar_below
  • (2)bbcode_quote_printable
  • (1)footer
  • (1)gobutton
  • (1)header
  • (1)headinclude
  • (6)option
  • (1)pagenav
  • (1)pagenav_curpage
  • (1)pagenav_pagelink
  • (1)post_thanks_navbar_search
  • (1)printthread
  • (3)printthreadbit
  • (1)spacer_close
  • (1)spacer_open 

Phrase Groups Available:
  • global
  • postbit
  • showthread
Included Files:
  • ./printthread.php
  • ./global.php
  • ./includes/init.php
  • ./includes/class_core.php
  • ./includes/config.php
  • ./includes/functions.php
  • ./includes/class_hook.php
  • ./includes/modsystem_functions.php
  • ./includes/class_bbcode_alt.php
  • ./includes/class_bbcode.php
  • ./includes/functions_bigthree.php 

Hooks Called:
  • init_startup
  • init_startup_session_setup_start
  • init_startup_session_setup_complete
  • cache_permissions
  • fetch_threadinfo_query
  • fetch_threadinfo
  • fetch_foruminfo
  • style_fetch
  • cache_templates
  • global_start
  • parse_templates
  • global_setup_complete
  • printthread_start
  • pagenav_page
  • pagenav_complete
  • bbcode_fetch_tags
  • bbcode_create
  • bbcode_parse_start
  • bbcode_parse_complete_precache
  • bbcode_parse_complete
  • printthread_post
  • printthread_complete