If any one's interested, I made a function that connects to google and does a small quick query to find out how many sites google has indexed for the given domain, then the function parses the html and returns the number of indexed sites.
PHP Code:
function GetGoogleStats($site,$remove_commas=false) {
global $HTTP_SERVER_VARS;
$site = trim($site);
if ($site) {
$url = "http://www.google.com/search?q=allinurl:$site&num=1&hl=en&lr=&ie=UTF-8&oe=UTF-&safe=off&start=0&sa=N&filter=0";
$preg_match = "Results <b>([0-9,]+)<\/b> - <b>([0-9,]+)<\/b> of about <b>([0-9,]+)<\/b>.";
if ($url) {
$urlArray = parse_url($url);
if (!$urlArray[port]) $urlArray[port] = "80";
if (!$urlArray[path]) $urlArray[path] = "/";
if ($fp = @fsockopen($urlArray[host], $urlArray[port], $errnum, $errstr, 10)) {
$port = ($urlArray[port]!="80"?":".$urlArray[port]:"");
$dump = "GET $urlArray[path]?$urlArray[query] HTTP/1.1\r\n";
$dump .= "User-Agent: ".$HTTP_SERVER_VARS['HTTP_USER_AGENT']."\r\n";
$dump .= "Host: $urlArray[host]$port\r\n";
$dump .= "Connection: close\r\n";
$dump .= "\r\n";
if (trim($dump)) {
fputs($fp, $dump);
}
$buffer = "";
while (!feof($fp)) {
$line = chop(fgets($fp,50000));
if (preg_match("/".$preg_match."/i",$buffer, $matches)) {
$indexes = ($matches[3]==0?"0":$matches[3]);
break;
}
$buffer .= $line;
}
fclose($fp);
if (!isset($indexes)) {
if (preg_match("/".$preg_match."/i",$buffer, $matches)) {
$indexes = ($matches[3]==0?"0":$matches[3]);
}
}
if (isset($indexes)) {
if ($remove_commas==true) {
$indexes = str_replace(',','',$indexes);
return intval($indexes);
} else {
return $indexes;
}
} else {
return 0;
}
}
}
}
return 0;
}
Example:
PHP Code:
$site = trim($site);
if (!$site) $site = "google.com";
$stats = GetGoogleStats($site);
if ($stats>0) echo "Google has indexed $stats pages for ".htmlspecialchars($site);
else echo "There was an error proccessing the data.";
Output:
Google has indexed 2,000,000 pages for google.com