vb.org Archive


Slynderdale 07-05-2003 02:00 PM

Google Tidbit - Read if you're using some sort of archive or search hack.
 
Almost everyone here has probably heard of Google, right?
It has a search feature that lets you restrict a search to a single site using: site:www.yourdomain.com

I recently found another command that lists all of the indexed links to your site. I don't know how many of you know about it, but if you use an archive or search hack, it is a handy way to get a count of all the links Google has indexed.

Here's the command:

allinurl:yourdomain.com

That's all; don't add any search words or anything like that.

Example:

allinurl:vbulletin.org
http://www.google.com/search?q=allin...bulletin%2Eorg

Results 1 - 6 of about 11,700. Search took 0.10 seconds.
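
The search URL for such a query can be put together in a couple of lines. This is only a minimal sketch: the helper name buildAllinurlQuery is hypothetical, and the assumption is that a plain q= parameter on http://www.google.com/search is enough, with the operator and domain URL-encoded together.

```php
<?php
// Hypothetical helper: build the search URL for an allinurl: query.
// Assumes only the q= parameter is needed; urlencode() turns ":" into "%3A".
function buildAllinurlQuery($domain) {
    return 'http://www.google.com/search?q=' . urlencode('allinurl:' . trim($domain));
}

echo buildAllinurlQuery('vbulletin.org'), "\n";
// prints http://www.google.com/search?q=allinurl%3Avbulletin.org
```

Encoding the query keeps the URL valid if the search term contains spaces or other reserved characters.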

Smoothie 07-05-2003 03:15 PM

very interesting

allinurl:macfora.com

Results 1 - 7 of about 18,100. Search took 0.14 seconds.

Dean C 07-05-2003 03:30 PM

Results 1 - 10 of about 10,100. Search took 0.25 seconds

Has some very interesting results...

Slynderdale 07-05-2003 04:51 PM

If anyone's interested, I made a function that connects to Google and runs a quick query to find out how many pages Google has indexed for the given domain. The function then parses the HTML and returns the number of indexed pages.

PHP Code:

// Returns how many URLs Google reports for the query allinurl:$site.
// The count is returned as a string such as "11,700", or as an integer
// when $remove_commas is true. Returns 0 on any failure.
function GetGoogleStats($site, $remove_commas = false) {
  $site = trim($site);
  if (!$site) {
    return 0;
  }
  $url = "http://www.google.com/search?q=allinurl:" . urlencode($site) . "&num=1&hl=en&lr=&ie=UTF-8&oe=UTF-8&safe=off&start=0&sa=N&filter=0";
  // Matches the "Results 1 - 6 of about 11,700." line of the result page.
  $preg_match = "Results <b>([0-9,]+)<\/b> - <b>([0-9,]+)<\/b> of about <b>([0-9,]+)<\/b>.";
  $urlArray = parse_url($url);
  if (empty($urlArray['port'])) $urlArray['port'] = 80;
  if (empty($urlArray['path'])) $urlArray['path'] = "/";
  $fp = @fsockopen($urlArray['host'], $urlArray['port'], $errnum, $errstr, 10);
  if (!$fp) {
    return 0;
  }
  // Build and send the HTTP request by hand.
  $port = ($urlArray['port'] != 80 ? ":" . $urlArray['port'] : "");
  $agent = isset($_SERVER['HTTP_USER_AGENT']) ? $_SERVER['HTTP_USER_AGENT'] : "";
  $dump  = "GET " . $urlArray['path'] . "?" . $urlArray['query'] . " HTTP/1.1\r\n";
  $dump .= "User-Agent: " . $agent . "\r\n";
  $dump .= "Host: " . $urlArray['host'] . $port . "\r\n";
  $dump .= "Connection: close\r\n";
  $dump .= "\r\n";
  fputs($fp, $dump);
  // Read the response line by line and stop at the first match.
  $buffer = "";
  while (!feof($fp)) {
    $buffer .= chop(fgets($fp, 50000));
    if (preg_match("/" . $preg_match . "/i", $buffer, $matches)) {
      $indexes = $matches[3];
      break;
    }
  }
  fclose($fp);
  if (!isset($indexes)) {
    return 0;
  }
  if ($remove_commas) {
    // "11,700" becomes the integer 11700.
    return intval(str_replace(',', '', $indexes));
  }
  return $indexes;
}

Example:
PHP Code:

$site = trim($site);
if (!$site) $site = "google.com";
$stats = GetGoogleStats($site);
if ($stats > 0) echo "Google has indexed $stats pages for " . htmlspecialchars($site);
else echo "There was an error processing the data.";

Output:
Google has indexed 2,000,000 pages for google.com
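
The parsing step can be tried without contacting Google at all. The sketch below runs the same kind of pattern against a sample string. The helper name parseIndexedCount is hypothetical, and the markup in $sample is an assumption modelled on the pattern above, not captured Google output.

```php
<?php
// Hypothetical helper: pull the "of about N" figure out of a results line.
// Returns the count as a string ("11,700"), or as an integer when
// $remove_commas is true. Returns 0 when no results line is found.
function parseIndexedCount($html, $remove_commas = false) {
    $pattern = '/Results <b>([0-9,]+)<\/b> - <b>([0-9,]+)<\/b> of about <b>([0-9,]+)<\/b>/i';
    if (!preg_match($pattern, $html, $matches)) {
        return 0;
    }
    $count = $matches[3]; // third capture group is the total
    return $remove_commas ? intval(str_replace(',', '', $count)) : $count;
}

// Assumed sample markup, for illustration only.
$sample = 'Results <b>1</b> - <b>6</b> of about <b>11,700</b>. Search took <b>0.10</b> seconds.';
echo parseIndexedCount($sample), "\n";       // prints 11,700
echo parseIndexedCount($sample, true), "\n"; // prints 11700
```

Passing true as the second argument is the form to use when the count has to be compared or stored as a number.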

Slynderdale 07-05-2003 05:32 PM

The function I gave you returns the number of indexed URLs that contain the phrase you enter.

For example, if you enter php, it gives you the index count of all the sites with php in the URL, like php.net.

Google has indexed 130,000,000 pages whose URL contains the phrase "php".

