vb.org Archive

vb.org Archive (https://vborg.vbsupport.ru/index.php)
-   vB3 General Discussions (https://vborg.vbsupport.ru/forumdisplay.php?f=111)
-   -   Googlebot! (https://vborg.vbsupport.ru/showthread.php?t=44659)

filburt1 10-16-2002 12:55 AM

Googlebot!
 
I dumped this in my phpinclude template. Does it look like it will properly e-mail me if Googlebot is encountered, then not e-mail me again for two hours even if it is still there?
PHP Code:

if (strpos($_SERVER['HTTP_USER_AGENT'], 'Googlebot') != false)
{
    
$result mysql_query("SELECT lastvisit + INTERVAL 2 HOUR > NOW() AS withintwohours FROM nonstandard_visitors WHERE useragent = 'Googlebot'");
    
$s mysql_fetch_row($result);
    if (
$s[0] == '1')
    {
        
mail($technicalemail'Googlebot is at the site!''Googlebot is at the site at ' strftime('%X') . '.
            There will be no more e-mails until at least two hours have passed.'
);
        
$result mysql_query("UPDATE nonstandard_visitors SET lastvisit = NOW() WHERE useragent = 'Googlebot'");
    }


The schema for nonstandard_visitors is:
Code:


mysql> describe nonstandard_visitors;
+-----------+---------------+------+-----+---------+----------------+
| Field    | Type          | Null | Key | Default | Extra          |
+-----------+---------------+------+-----+---------+----------------+
| id        | int(11)      |      | PRI | NULL    | auto_increment |
| useragent | varchar(255)  |      |    |        |                |
| lastvisit | timestamp(14) | YES  |    | NULL    |                |
+-----------+---------------+------+-----+---------+----------------+
3 rows in set (0.00 sec)


DrkFusion 10-16-2002 01:08 AM

Sure why not :)

*doesn't know what ^^ is talking about*

Your forums are still not being being listed :-/

-Arunan

filburt1 10-16-2002 01:31 AM

GB crawled them before the modifications were complete, and still hasn't even added stuff to the index.

I'm confident :)

Erwin 10-16-2002 01:40 AM

Google looks like it's restricting pages from one site to only up to 1000 pages. It used to be much higher. You can test that script easily by changing the HTTP bit to the one that looks for an Opera browser, then surf your site with Opera. :) It looks okay from here.

filburt1 10-16-2002 01:43 AM

How does it get to the 1000th? Go to one page, parse everything, then back to the orignal, next link, everything on that page, etc., or recursively until it just buries itself in the first link?

Of course I think 1000 will still be fine for me; I don't have that many forums or threads to account for 1000 indicies :)

Erwin 10-16-2002 02:00 AM

They may have worked out .htaccess aliasing, because with my site, and other sites like eva2000's, Google no longer list all the aliased page - some, but not all. It prefers 3rd-party links like Photopost (with mine) or Gallery links (with eva2000)'s site. eva2000's site used to have over 20,000 pages in, but now down to only 900 odd pages. Same with mine.

To see how many pages your site has in Google, run this search:

domainname site:domainname.com

(replace domainname with your site's domain name)

Erwin 10-16-2002 02:02 AM

Quote:

Originally posted by filburt1
Of course I think 1000 will still be fine for me; I don't have that many forums or threads to account for 1000 indicies :)
But if you have made it so that EVERY page of a thread is indexable, and you have over 350,000 posts... :) That's a lot of potential pages in Google. I remember fastforward saying that he had over 75,000 pages in and they were only the first page of every thread, not every page.

filburt1 10-16-2002 05:15 PM

Well apparently my code doesn't work because GB is there now and I didn't get an e-mail :(

DrkFusion 10-16-2002 06:58 PM

How do you know if GB is there?

-Drk

filburt1 10-16-2002 06:59 PM

He happened to be in WOL :p

NTLDR 10-16-2002 06:59 PM

Quote:

Originally posted by DrkFusion
How do you know if GB is there?
Resolve the IP, it will contain googlebot if its browsing the forums, also if you host provides you with site stats it will probably appear there too.

fastforward 10-17-2002 10:51 PM

Quote:

Originally posted by Erwin
Google looks like it's restricting pages from one site to only up to 1000 pages. It used to be much higher. You can test that script easily by changing the HTTP bit to the one that looks for an Opera browser, then surf your site with Opera. :) It looks okay from here.
Not sure about the 1000 limit, but mine have dropped considerably. I used to have over 150,000 for dbforums.com and now I've just got 60,400.

Erwin 10-18-2002 03:51 AM

Quote:

Originally posted by fastforward

Not sure about the 1000 limit, but mine have dropped considerably. I used to have over 150,000 for dbforums.com and now I've just got 60,400.

You're lucky. I figure it's because your pages are already in. It's harder now to get a lot of pages from the same domain indexed.


All times are GMT. The time now is 03:48 AM.

Powered by vBulletin® Version 3.8.12 by vBS
Copyright ©2000 - 2025, vBulletin Solutions Inc.

X vBulletin 3.8.12 by vBS Debug Information
  • Page Generation 0.01305 seconds
  • Memory Usage 1,747KB
  • Queries Executed 10 (?)
More Information
Template Usage:
  • (1)ad_footer_end
  • (1)ad_footer_start
  • (1)ad_header_end
  • (1)ad_header_logo
  • (1)ad_navbar_below
  • (1)bbcode_code_printable
  • (1)bbcode_php_printable
  • (4)bbcode_quote_printable
  • (1)footer
  • (1)gobutton
  • (1)header
  • (1)headinclude
  • (6)option
  • (1)post_thanks_navbar_search
  • (1)printthread
  • (13)printthreadbit
  • (1)spacer_close
  • (1)spacer_open 

Phrase Groups Available:
  • global
  • postbit
  • showthread
Included Files:
  • ./printthread.php
  • ./global.php
  • ./includes/init.php
  • ./includes/class_core.php
  • ./includes/config.php
  • ./includes/functions.php
  • ./includes/class_hook.php
  • ./includes/modsystem_functions.php
  • ./includes/class_bbcode_alt.php
  • ./includes/class_bbcode.php
  • ./includes/functions_bigthree.php 

Hooks Called:
  • init_startup
  • init_startup_session_setup_start
  • init_startup_session_setup_complete
  • cache_permissions
  • fetch_threadinfo_query
  • fetch_threadinfo
  • fetch_foruminfo
  • style_fetch
  • cache_templates
  • global_start
  • parse_templates
  • global_setup_complete
  • printthread_start
  • bbcode_fetch_tags
  • bbcode_create
  • bbcode_parse_start
  • bbcode_parse_complete_precache
  • bbcode_parse_complete
  • printthread_post
  • printthread_complete