vb.org Archive

vb.org Archive (https://vborg.vbsupport.ru/index.php)
-   vBulletin 2.x Full Releases (https://vborg.vbsupport.ru/forumdisplay.php?f=4)
-   -   vbArchive - Search Engine Indexer for vBulletin (https://vborg.vbsupport.ru/showthread.php?t=47667)

sajjid 05-30-2003 02:33 PM

Currently 2 members, 1 guests and 18 web robots (Google) on the boards.|

is it normal tohave 14 google bots on the boards they been on for few hours now i thought they only stay for few mminutes plus what does this mean

Google | Retrieving Password | 02:30 PM | 64.68.82.50

This is what I saw on the "Who's Online" section. What does it mean retrieving password? Is it taking our passwords?

telc 05-30-2003 03:50 PM

Quote:


This is what I saw on the "Who's Online" section. What does it mean retrieving password? Is it taking our passwords?
no its not taking your passwords its just following this URL:

http://www.spinzone.org/forums/member.php?action=lostpw

That as far as it will get.

infiniterb 05-30-2003 04:41 PM

I've never even seen a google bot on the forums and I've been using the Search Archiver hack for about 2 months.

sajjid 05-30-2003 04:44 PM

Quote:

Today at 06:41 PM infiniterb said this in Post #603
I've never even seen a google bot on the forums and I've been using the Search Archiver hack for about 2 months.

come and visit my forums and you will see loads of googlebots
did you submit your site to all the search engines listed in the first post?

infiniterb 05-30-2003 04:51 PM

Yup, did it initially when I installed the hack. Very weird. Now I'm working on the optimizations and making my archive folder .html instead of just www.clantt.com/archive/ in hopes that will help.

sajjid 05-30-2003 05:03 PM

i was going to edit my preious post to say place a link on my forums maybe the bots will pick it up but i see you already done i may it live link you have more chances one of the bots picking it up.
also did you do the

Quote:

Forums Optimizations
You MUST perform also some the mods listed below if you want your forums optimized properly for search engines indexing.
Steps 1 to 3 are vital, the rest is optional.

1. TO STRIP THE sessionhash FROM TEMPLATES (ONLY FOR CRAWLERS), READ MORE HERE.
2. TO BLOCK CRAWLERS GO TO CERTAIN PAGES, READ MORE HERE.
3. TO LINK EACH FORUM/THREAD DIRECTLY TO ARCHIVE FILES, READ MORE HERE.
4. TO DISPLAY NICE LOCATIONS, THE FIX FOR online.php FILE IS HERE.
5. TO DISPLAY CRAWLER NAME INSTEAD OF GUEST ON FRONTPAGE AND ONLINE PAGE, READ MORE HERE. (mod by Inphinity and xiphoid)
6. TO DISPLAY CRAWLER NAME INSTEAD OF GUEST ON ONLINE PAGE only, READ MORE HERE.
7. IF YOU WANT THE MAIN ARCHIVE FILE TO HAVE A .php EXTENSION, READ MORE HERE.
8. TO CHANGE THE threads/posts per page NUMERIC VALUES, READ MORE HERE.
9. TO DISPLAY THE SMILIES AS image parsed, READ MORE HERE (mod by Logician).

IMPORTANT
Kill crawler918.com! READ MORE HERE.
i wonder how will these bot stay on my forums because they been here allday today Thanks to TECK
TECK i must say you have made one hell of a hack i never had this amny visits from bot before

infiniterb 05-30-2003 05:15 PM

Thanks a lot man. I'm having trouble now with the Friendly Urls...it seems to not be working right.

Also, in the user agent checking, where do you put this code:

## function for user ip address checking
## matches full/part of an ip address
## might be useful for people who dont have a .htaccess file
## or those who want to identify bots who dont supply a valid or a cloaked
## useragent. probably should be called on return of 0 from useragentcheck
## in online.php
##
## i think its unnecessary. also the ip address matching isnt great since php
## cant handle CIDR addresses so either you break the ip address up and match
## values or you use ranges (as below) which will also identify ip outside
## the allocated range
## ie crawler918.com
## http://ws.arin.net/cgi-bin/whois.pl?queryinput=!%20NET-12-148-209-192-1
## 12.148.209.192/26
## /26 is 62 ip addresses identifying 12.148.209. means that you're blocking 254 ip
## address which will exclude non rogue ips.
## ip address have a tendancy to change and would result in a fairly bit list.

function useripaddresscheck( $match_addr, $addr_code )
{

$addr = array(
'12.148.209.' => 'www.nameprotect.com|||crawler918.com',
'12.148.196.' => 'www.nameprotect.com|||crawler918.com',
'12.175.0.' => 'www.nameprotect.com|||crawler918.com',

'63.148.99.' => 'www.cyveillance.com|||cyveillance',
'65.118.41.' => 'www.cyveillance.com|||cyveillance'
);

foreach( $addr as $useraddr => $addrurl )
{
if ( preg_match ("/^\d+$/", $useraddr) )
{
$useraddr = $addrurl;
$addrurl = "Web Robot";
}

if ( preg_match ("/^". preg_quote ($useraddr) ."\d+/i", $match_addr) )
{
$addrinfo = preg_split ("/\|\|\|/", $addrurl);
if (!($addrinfo[1])) {
$addrinfo[0] = "http://www.robotstxt.org/wc/active.html";
$addrinfo[1] = "Web Robot ".$useraddr."*";
}

switch ($addr_code) {
case 0:
return 1;
break;
case 1:
return $addrinfo[1];
break;
case 2:
return '</a><a href="http://'. $addrinfo[0] .'" alt="'. $addrinfo[1] .'"><i>'. $addrinfo[1] .'</i>';
break;
}
}
}

}

## ----------------------------------------------------------------------------- ##

infiniterb 05-30-2003 05:49 PM

I'm also having problems with the Friendly URL's not showing up...seems like some of the code isn't exact in the hack as it is in the templates.

sajjid 05-30-2003 06:04 PM

Quote:

Today at 07:15 PM infiniterb said this in Post #607
Thanks a lot man. I'm having trouble now with the Friendly Urls...it seems to not be working right.

Also, in the user agent checking, where do you put this code:

## function for user ip address checking
## matches full/part of an ip address
## might be useful for people who dont have a .htaccess file
## or those who want to identify bots who dont supply a valid or a cloaked
## useragent. probably should be called on return of 0 from useragentcheck
## in online.php
##
## i think its unnecessary. also the ip address matching isnt great since php
## cant handle CIDR addresses so either you break the ip address up and match
## values or you use ranges (as below) which will also identify ip outside
## the allocated range
## ie crawler918.com
## http://ws.arin.net/cgi-bin/whois.pl?queryinput=!%20NET-12-148-209-192-1
## 12.148.209.192/26
## /26 is 62 ip addresses identifying 12.148.209. means that you're blocking 254 ip
## address which will exclude non rogue ips.
## ip address have a tendancy to change and would result in a fairly bit list.

## ----------------------------------------------------------------------------- ##

i was not sure on the last bit because instructions were not very
clear so i left it out here is screen shot of my forums
http://www.spinzone.org/web_ro1.jpg

infiniterb 05-30-2003 06:24 PM

I got my friendly URLs fixed...forgot the Navbar thing at the end.

Yeah, I guess I'll leave that out since I don't know exactly where to put it.

Mickie D 05-31-2003 08:17 AM

hi can i still get support for my trouble i am having with an earlier post i made ??

post 598 from this thread please

Haddy 05-31-2003 04:34 PM

Mysterious problems with installing on new sever....It only shows it in a text format -> http://www.themodfathers.com/forums/archive

Not sure what Ive done wrong...Tryed to install about 3 times on this new server and nothing seems to work....Going to keep trying but if I could be pointed in the right direction Id appriciate it...

TECK 05-31-2003 06:17 PM

You don't have the forcetype directive installed on your server.

TECK 05-31-2003 06:18 PM

Quote:

05-29-03 at 06:11 PM Mickie D said this in Post #598
i get this error aswell ???

Warning: Unknown modifier 'k' in /home/digitalw/public_html/forums/index.php on line 387

and i only get it when there are robots on my site also .. if that helps ???

anyone else have it or know what it means ????

Uninstall the hack, it will work.
Then double check the steps you did to change the script...

Haddy 05-31-2003 06:41 PM

Quote:

Today at 02:17 PM TECK said this in Post #613
You don't have the forcetype directive installed on your server.
What is this "forcetype directive"? We went from a cpanel server to a raq server if that makes any difference...

TECK 05-31-2003 08:56 PM

You should ask your host about it.

Mickie D 06-01-2003 11:51 AM

well i know your hack works 100% fine :)

its when i added that mod to show on the main page (the one by the other guys)... but i dunno where else to get support on that ??

i also know its only when ask jeeves is on my site, so many people might not get the error until they get visited by ask jeeves ???

i done a search on google and a few other engines for this phrase:

Warning: Unknown modifier 'k' in

found about 5 sites with this problem, one was in dutch so i translated the page and they have the same problem when jeeves is visiting then... i dont get any error until jeeves is on the site

TECK 06-02-2003 02:53 PM

I would personally remove that adress so there are no more problems.
This is an optional hack made by someone else, is not me who wrote the code...

Mickie D 06-04-2003 09:48 AM

i rechecked my coding done the whole lot again to be on the safe side your ahck went flawlessly so did all the mods but again jeeves was visiting and i got the error again :(

i do think that the makers should have a look at there code because people that have this mod installed and never had jeeves on one day will get jeeves on and they will be completely baffled i was lucky that i got the error straight away because jeeves was online

i have PM'd one of the guys with the error and the lines from my index.php ... but i know everyone who makes hacks do it out of the kindness of there heart and also have there own lifes :)

but yeah i have removed jeeves from the code :) thanks

TECK 06-04-2003 05:00 PM

I think we can all live without jeeves, if we want to use that code mod. :)

sajjid 06-05-2003 12:10 AM

when i lookup my site with trafficseeker for link back to my site it shows 964 on msn but when i search only get few also since i installed this hack which is great i am having to delete about 50 members a day because they dont post a thing just browse and every time i look Who's Online i say 2-3 people registering at all times i guess most visitors dont come in to forums just browse archives.
again its a great hack

gmarik 06-06-2003 05:56 AM

Teck, only Results 1 - 10 of about 68,600. Search took 0.27 seconds on google, not quite 1 000 000 ... :)

TECK 06-07-2003 01:21 AM

Google is back strong on my website! Whohooo!

Dev-United 06-09-2003 09:59 PM

What great hack, thanks for all the hard work.

And oh, seems like there is a problem showing your post about linking each post to the archive. I get a timeut every time I go to read https://vborg.vbsupport.ru/showthrea...457#post344457

Sharg 06-13-2003 11:16 PM

To unistall this script, is there anything to revert or delete in the database, or should we just delete the templates ?

Cordially,
Sharg

TECK 06-14-2003 03:25 PM

Run the uninstaller.

andyhodges 06-16-2003 01:22 AM

I had a question about the script and perhaps it was already mentioned but I can't find any thread.

Is there a reason why the archives don't contain a LOGO and the regular forum links like "home, control panel, register, etc" as a navigation? Does that disrupt the spider/search engine when it's caching pages?

-Andy

Sharg 06-18-2003 01:32 PM

Quote:

06-14-03 at 05:25 PM TECK said this in Post #626
Run the uninstaller.

Which script is the uninstaller in ?

Cordially,
Sharg

sajjid 06-19-2003 06:48 PM

here is something i found at webmasterworld.com if you wanna keep away all the nasty and unwanted bots like email snatchers and rest , you wanna remove some of them this my htaccess just dont remove the first and last two lines
use this htaccess just copy and paste.
save as text file
upload to root
rename it to .htaccess


RewriteEngine on
RewriteCond %{HTTP_USER_AGENT} ^BlackWidow [NC,OR]
RewriteCond %{HTTP_USER_AGENT} ^Bot\ mailto:craftbot@yahoo.com [NC,OR]
RewriteCond %{HTTP_USER_AGENT} ^ChinaClaw [NC,OR]
RewriteCond %{HTTP_USER_AGENT} ^DISCo [NC,OR]
RewriteCond %{HTTP_USER_AGENT} ^Download\ Demon [NC,OR]
RewriteCond %{HTTP_USER_AGENT} ^eCatch [NC,OR]
RewriteCond %{HTTP_USER_AGENT} ^EirGrabber [NC,OR]
RewriteCond %{HTTP_USER_AGENT} ^EmailSiphon [NC,OR]
RewriteCond %{HTTP_USER_AGENT} ^Express\ WebPictures [NC,OR]
RewriteCond %{HTTP_USER_AGENT} ^ExtractorPro [NC,OR]
RewriteCond %{HTTP_USER_AGENT} ^EyeNetIE [NC,OR]
RewriteCond %{HTTP_USER_AGENT} ^FlashGet [NC,OR]
RewriteCond %{HTTP_USER_AGENT} ^GetRight [NC,OR]
RewriteCond %{HTTP_USER_AGENT} ^Go!Zilla [NC,OR]
RewriteCond %{HTTP_USER_AGENT} ^Go-Ahead-Got-It [NC,OR]
RewriteCond %{HTTP_USER_AGENT} ^GrabNet [NC,OR]
RewriteCond %{HTTP_USER_AGENT} ^Grafula [NC,OR]
RewriteCond %{HTTP_USER_AGENT} ^HMView [NC,OR]
RewriteCond %{HTTP_USER_AGENT} ^HTTrack [NC,OR]
RewriteCond %{HTTP_USER_AGENT} ^Image\ Stripper [NC,OR]
RewriteCond %{HTTP_USER_AGENT} ^Image\ Sucker [NC,OR]
RewriteCond %{HTTP_USER_AGENT} ^InterGET [NC,OR]
RewriteCond %{HTTP_USER_AGENT} ^Internet\ Ninja [NC,OR]
RewriteCond %{HTTP_USER_AGENT} ^JetCar [NC,OR]
RewriteCond %{HTTP_USER_AGENT} ^JOC\ Web\ Spider [NC,OR]
RewriteCond %{HTTP_USER_AGENT} ^larbin [NC,OR]
RewriteCond %{HTTP_USER_AGENT} ^LeechFTP [NC,OR]
RewriteCond %{HTTP_USER_AGENT} ^Mass\ Downloader [NC,OR]
RewriteCond %{HTTP_USER_AGENT} ^MIDown\ tool [NC,OR]
RewriteCond %{HTTP_USER_AGENT} ^Mister\ PiX [NC,OR]
RewriteCond %{HTTP_USER_AGENT} ^Navroad [NC,OR]
RewriteCond %{HTTP_USER_AGENT} ^NearSite [NC,OR]
RewriteCond %{HTTP_USER_AGENT} ^NetAnts [NC,OR]
RewriteCond %{HTTP_USER_AGENT} ^NetSpider [NC,OR]
RewriteCond %{HTTP_USER_AGENT} ^Net\ Vampire [NC,OR]
RewriteCond %{HTTP_USER_AGENT} ^NetZIP [NC,OR]
RewriteCond %{HTTP_USER_AGENT} ^Octopus [NC,OR]
RewriteCond %{HTTP_USER_AGENT} ^Offline\ Explorer [NC,OR]
RewriteCond %{HTTP_USER_AGENT} ^Offline\ Navigator [NC,OR]
RewriteCond %{HTTP_USER_AGENT} ^PageGrabber [NC,OR]
RewriteCond %{HTTP_USER_AGENT} ^Papa\ Foto [NC,OR]
RewriteCond %{HTTP_USER_AGENT} ^pcBrowser [NC,OR]
RewriteCond %{HTTP_USER_AGENT} ^RealDownload [NC,OR]
RewriteCond %{HTTP_USER_AGENT} ^ReGet [NC,OR]
RewriteCond %{HTTP_USER_AGENT} ^Siphon [NC,OR]
RewriteCond %{HTTP_USER_AGENT} ^SiteSnagger [NC,OR]
RewriteCond %{HTTP_USER_AGENT} ^SmartDownload [NC,OR]
RewriteCond %{HTTP_USER_AGENT} ^SuperBot [NC,OR]
RewriteCond %{HTTP_USER_AGENT} ^SuperHTTP [NC,OR]
RewriteCond %{HTTP_USER_AGENT} ^Surfbot [NC,OR]
RewriteCond %{HTTP_USER_AGENT} ^tAkeOut [NC,OR]
RewriteCond %{HTTP_USER_AGENT} ^Teleport\ Pro [NC,OR]
RewriteCond %{HTTP_USER_AGENT} ^VoidEYE [NC,OR]
RewriteCond %{HTTP_USER_AGENT} ^Web\ Image\ Collector [NC,OR]
RewriteCond %{HTTP_USER_AGENT} ^Web\ Sucker [NC,OR]
RewriteCond %{HTTP_USER_AGENT} ^WebAuto [NC,OR]
RewriteCond %{HTTP_USER_AGENT} ^WebCopier [NC,OR]
RewriteCond %{HTTP_USER_AGENT} ^WebFetch [NC,OR]
RewriteCond %{HTTP_USER_AGENT} ^WebReaper [NC,OR]
RewriteCond %{HTTP_USER_AGENT} ^WebSauger [NC,OR]
RewriteCond %{HTTP_USER_AGENT} ^Website\ eXtractor [NC,OR]
RewriteCond %{HTTP_USER_AGENT} ^WebStripper [NC,OR]
RewriteCond %{HTTP_USER_AGENT} ^WebWhacker [NC,OR]
RewriteCond %{HTTP_USER_AGENT} ^WebZIP [NC,OR]
RewriteCond %{HTTP_USER_AGENT} ^Wget [NC,OR]
RewriteCond %{HTTP_USER_AGENT} ^Widow [NC,OR]
RewriteCond %{HTTP_USER_AGENT} ^Xaldon\ WebSpider [NC,OR]
RewriteCond %{HTTP_USER_AGENT} ^Zeus
RewriteRule ^.* - [F,L]

sajjid 06-19-2003 06:50 PM

here is something i found at webmasterworld.com if you wanna keep away all the nasty and unwanted bots like email snatchers and rest , you wanna remove some of them this my htaccess just dont remove the first and last two lines
use this htaccess just copy and paste.
save as text file
upload to root
rename it to htaccess

for some strange reason i cannot paste it please see the attachment if i copy and paste i get error

S.Shady 06-19-2003 09:44 PM

How can i deleate a folder with a httaccess in it because cuteftp doesnot show httaccess files ?

Also installed and working perfectly :) http://hcom.planethael.com/cb/archive/

sajjid 06-19-2003 09:56 PM

well you have to go in to your cp or just upload the new one old will get replaced
if you want to delete all data then just upload empty htacess file to where you want to delete the htacess file
hack works like a bot charm see it working here
http://www.spinzone.org/

S.Shady 06-19-2003 10:13 PM

sajjid uploading a empty httaccess and deleating the dir did not work :( and im not the site admin so i dont have access to the cpanel atm so i cant perm del it. its not a big deal but it annoys the hell out of me to see extra directorys just sitting on my ftp screen

sajjid 06-19-2003 10:21 PM

if you just upload empty text file and rename it .htaccess once upload that should do the trick another since i installed this hack i got crawlers every day on my forums i have four right now had about 28 google bots the other day

sajjid 06-19-2003 10:23 PM

S.Shady just wondering who did you graphics on youe site they look good

S.Shady 06-20-2003 12:25 AM

The head admin did the main site i just brought it into the vb. GrimMethod as he goes by on our site.

witch reminds me i need to add some things thats missing from the main site

aalspach 06-22-2003 04:30 PM

hi

I went through directions but something confused me. ------

1. I changed htaccess.txt to .htaccess , but the file went away?
2. I didn't have the directory /admin/forum folder) to perform the code change in template.php ?

thanks
alfred

S.Shady 06-22-2003 06:30 PM

the htaccess file should disappear as said in the pm

template.php is in the admin folder

fastlaneweb 06-24-2003 05:58 PM

Just installed, great hack! I can't wait to see the search engine results.

Thanks Again!

_________________
Bradley Green
http://www.laserforum.org

aalspach 06-24-2003 05:59 PM

yep, got it going. thanks!


All times are GMT. The time now is 05:06 PM.

Powered by vBulletin® Version 3.8.12 by vBS
Copyright ©2000 - 2025, vBulletin Solutions Inc.

X vBulletin 3.8.12 by vBS Debug Information
  • Page Generation 0.02081 seconds
  • Memory Usage 1,844KB
  • Queries Executed 10 (?)
More Information
Template Usage:
  • (1)ad_footer_end
  • (1)ad_footer_start
  • (1)ad_header_end
  • (1)ad_header_logo
  • (1)ad_navbar_below
  • (7)bbcode_quote_printable
  • (1)footer
  • (1)gobutton
  • (1)header
  • (1)headinclude
  • (6)option
  • (1)pagenav
  • (1)pagenav_curpage
  • (4)pagenav_pagelink
  • (1)pagenav_pagelinkrel
  • (1)post_thanks_navbar_search
  • (1)printthread
  • (40)printthreadbit
  • (1)spacer_close
  • (1)spacer_open 

Phrase Groups Available:
  • global
  • postbit
  • showthread
Included Files:
  • ./printthread.php
  • ./global.php
  • ./includes/init.php
  • ./includes/class_core.php
  • ./includes/config.php
  • ./includes/functions.php
  • ./includes/class_hook.php
  • ./includes/modsystem_functions.php
  • ./includes/class_bbcode_alt.php
  • ./includes/class_bbcode.php
  • ./includes/functions_bigthree.php 

Hooks Called:
  • init_startup
  • init_startup_session_setup_start
  • init_startup_session_setup_complete
  • cache_permissions
  • fetch_threadinfo_query
  • fetch_threadinfo
  • fetch_foruminfo
  • style_fetch
  • cache_templates
  • global_start
  • parse_templates
  • global_setup_complete
  • printthread_start
  • pagenav_page
  • pagenav_complete
  • bbcode_fetch_tags
  • bbcode_create
  • bbcode_parse_start
  • bbcode_parse_complete_precache
  • bbcode_parse_complete
  • printthread_post
  • printthread_complete