OMG! Thirty googlebots just can't get enough of Nexology.
Teck, you are the man, without doubt.
Few things though. Have only seen them hit the Archive once or twice. They seem to be casing the forum due to the sessionhash being removed.
I tried the useragent hack (by inphinity) but that just returned:
Code:
Fatal error: Call to undefined function: no_sessionhash() in /home/httpd/vhosts/hostnexus.com/httpdocs/forum/global.php on line 296
Not sure if I was doing things wrong, but I removed your extra mod to functions.php:
Code:
function no_sessionhash()
{
global $session;
$agent = array(
'crawl',
'googlebot',
'gulliver',
'ia_archiver',
'internetseer',
'linkalarm',
'mercator',
'openbot',
'pingalink',
'psbot',
'scooter',
'slurp',
'slysearch',
'zeus',
'zyborg',
'otheruseragentcrawleryouwant'
);
foreach( $agent as $useragent )
{
if ( stristr( getenv( 'HTTP_USER_AGENT' ) , $useragent ) )
{
$session['sessionhash'] = '';
}
}
}
and replaced that with:
Code:
function useragentcheck( $match_agent, $agent_code )
{
$agent = array(
'googlebot' => 'www.google.com/|||Google',
'gulliver' => 'www.northernlight.com/|||Northern Light',
'ia_archiver' => 'www.archive.org/|||The Internet Archive',
'internetseer' => 'www.internetseer.com/|||Internet Seer',
'linkalarm' => 'linkalarm.com/|||Link Alarm',
'mercator' => 'www.research.compaq.com/SRC/mercator/|||Mercator',
'openbot' => 'www.openfind.com.tw/|||Openbot',
'pingalink' => 'www.pingalink.com/|||PingALink Monitor',
'psbot' => 'www.picsearch.com/bot.html|||PicSearch',
'scooter' => 'www.altavista.com/|||AltaVista',
'slurp' => 'www.inktomi.com/slurp.html|||Inktomi',
'turnitinbot' => 'www.turnitin.com/robot/crawlerinfo.html|||Turnitin',
'slysearch' => 'www.turnitin.com/robot/crawlerinfo.html|||Turnitin',
'zeus' => 'www.waltbren.com/products/zeus_internet_robot.htm|||Zeus Internet Marketing',
'zyborg' => 'www.wisenutbot.com/|||WiseNut',
'teoma' => 'www.teoma.com/|||Teoma/Ask Jeeves',
'spider' => 'Web Spider',
'spyder' => 'Web Spyder',
'crawl' => 'Web Crawler',
'robot' => 'Web Robot'
);
foreach( $agent as $useragent => $agenturl )
{
if ( preg_match ("/^\d+$/", $useragent) )
{
$useragent = $agenturl;
$agenturl = "Search Engine";
}
if ( preg_match ("/". preg_quote ($useragent) ."/i", $match_agent) )
{
$agentinfo = preg_split ("/\|\|\|/", $agenturl);
if (!($agentinfo[1])) {
$agentinfo[0] = "http://www.robotstxt.org/wc/active.html";
$agentinfo[1] = "Web Robot $useragent";
}
switch ($agent_code) {
case 0:
return 1;
break;
case 1:
return $agentinfo[1];
break;
case 2:
return '</a><a href="http://'. $agentinfo[0] .'" alt="'. $agentinfo[1] .'"><i>'. $agentinfo[1] .'</i>';
break;
}
}
}
}
Or can I just tack inphinity's hack onto the end of functions.php after your no_sessionhash() function?