vb.org Archive

vb.org Archive (https://vborg.vbsupport.ru/index.php)
-   vBulletin 2.x Full Releases (https://vborg.vbsupport.ru/forumdisplay.php?f=4)
-   -   vbArchive - Search Engine Indexer for vBulletin (https://vborg.vbsupport.ru/showthread.php?t=47667)

NexDog 03-02-2003 03:33 PM

Teck - why the need to rename to archive.html? I have 20 Inktomi spiders in there now so don't want to interrupt anything.

Might do on vbHome. Mainly doing this for added content and links and not so worried that everything gets listed in Google...

TECK 03-02-2003 04:32 PM

I'm telling you what I did, only.
That doesn't mean you have to do the same like me...

NexDog 03-02-2003 09:21 PM

Hey man, it's your hack so I just want to benefit from your skills. ;)

Woke up this morning to find all bots in the Archive - 30 googlebots doing their thing so me is a happy chappy. :D

Would just like to know the theory behind naming the archive with a .html extension.....

phenom 03-03-2003 09:09 PM

alright, I'm getting a parse error after I add the add_to_functions.php to admin/functions.php

Any ideas?

phenom 03-03-2003 10:25 PM

nevermind, a small little tiny typo

erdem 03-04-2003 04:29 AM

Quote:

Originally posted by NexDog
Hey man, it's your hack so I just want to benefit from your skills. ;)

Woke up this morning to find all bots in the Archive - 30 googlebots doing their thing so me is a happy chappy. :D

Would just like to know the theory behind naming the archive with a .html extension.....

it seems bots arent interested in visiting my site ;)
still waiting for bot rush but only sometimes googlebot10-11-12 visits my board then leave ;) they cant find archive ... they try to "retrieve password" , "register" and similiar things and they go out after visiting main page of archive finally ...

do i have to put every php file as it listed by TECK to robots.txt ... or "removing all those and allowing bots to visit all those" is better ?

thanks

NexDog 03-04-2003 08:53 AM

What's your board's URL? I would be happy to take a look for you. And yea, put all php pages in the robots.txt except index.php.

erdem 03-04-2003 09:48 AM

Quote:

Originally posted by NexDog
What's your board's URL? I would be happy to take a look for you. And yea, put all php pages in the robots.txt except index.php.
http://www.trojanforge.net
vbhome installed as home.php ,
via htaccess top of domain is home.php
board index is at : http://www.trojanforge.net/index.php
archive is at : http://www.trojanforge.net/archive/

and for robots.txt ; as they will get no permission notices they can browse other pages from that pages i think via top and bottom links ... ermm ;)

NexDog 03-04-2003 12:01 PM

Hmmm, don't know why they aren't finding the archive. I would implement the mod that links the on/off gif to the corresponding forum in the archive.

Dean C 03-04-2003 06:15 PM

I've installed this 3 times for clients with absolute ease ...... until today ;)

In the server when i go to rename the archive.txt and then when i rename it to 'archive' it disappears from the file list. And upon a refresh it reappears as the old filename. So i thought i'd try uploading it as just 'archive' and not archive.txt and then in the ftp log thingy it says:

Quote:

550 archive: Not a regular file
I don't know what to do or how to fix it :(

Regards

- miSt

erdem 03-04-2003 06:46 PM

Quote:

Originally posted by NexDog
Hmmm, don't know why they aren't finding the archive. I would implement the mod that links the on/off gif to the corresponding forum in the archive.
allright tnx , i added that also ...
i am thinkin of not editin robots.txt ... i think that wont cause anything only no permission pages ;) maybe bots pisses of and goes away ;)

anyway tnx for tips ... i will wait for next check/rush of bots ... i hope they will ;)

thanks for this great hack ...
greetz

Overgrow 03-06-2003 06:36 PM

Got a lot of new guests from the archive, but not many are registering? Read this:

https://vborg.vbsupport.ru/showthrea...threadid=49721

klunderj 03-08-2003 06:16 PM

I attempted this install. when I go to my archive it points my forums to .../forumdisplay/f-1.html, etc. and I get the "forum not specified" error.

My question... what process creates these .html files? Should my install have created them? or are these created dynamically?

iceman11111 03-09-2003 04:22 PM

Great Hack!

Just installed it!

I was banging my head for about 2 hours trying to get it to work, Then I noticed that I missed the period infront of "htaccess", then bingo, everything is working great!

Thanks
:)

klunderj 03-09-2003 09:49 PM

in step 3 of the installation, when you are to rename the files... are we supposed to rename them without any file extension?? Ie. forumdisplay rather than forumdisplay.txt or do they need the .php extension?

I am having problems getting this install to work.. my forums arent linked to anything... ie f-1.html is not found

-James

iceman11111 03-10-2003 01:54 AM

How long does it usually take before the bots start invading?

JulianD 03-10-2003 02:40 AM

I think I need to install this hack with some modifications ... Thanks teck.

iceman11111 03-10-2003 01:18 PM

Quote:

Originally posted by iceman11111
How long does it usually take before the bots start invading?
No one is crawling on my page :(

How long does it usually take?

limey 03-10-2003 03:17 PM

You need to submit your archive to google et all.

iceman11111 03-11-2003 04:04 PM

Quote:

Originally posted by iceman11111


No one is crawling on my page :(

How long does it usually take?

Bump!!!

:)

mheinemann 03-11-2003 04:15 PM

There is no set time. It took google about a month to crawl mine and another month for them to actually show up on google. Others have had theirs crawled days after they installed this hack. Just be patient.

Overgrow 03-11-2003 05:26 PM

>>You need to submit your archive to google et all.

NO! Don't ever submit anything to Google. It will rank you higher if it finds you on it's own. Make sure you get other sites to link to yours and let the spider come naturally. To have your archive is listed, make sure it is linked FROM YOUR HOME PAGE. Even a small link is fine.

Thanks, Teck.. Dance, Google, Dance! I'm getting listed..

Schorsch 03-11-2003 06:17 PM

Quote:

Originally posted by Overgrow
To have your archive is listed, make sure it is linked FROM YOUR HOME PAGE. Even a small link is fine.
that's good to know. thanks for the tip!

NexDog 03-11-2003 09:13 PM

I have the GoogleDeepCrawler on our forum right now. I'm wondering if I should add forumdisplay.php and showthread.php to the robots.txt as I don't know what Google will think if it spiders the forum and archive, finding the same content. Could drop one or the other. What has everyone else done with successful listings in Google?

NexDog 03-11-2003 09:15 PM

Just to let you know that Google's Freshie and DeepCrawler come in on different IPs:

Overgrow 03-11-2003 09:18 PM

So you're saying DeepCrawlers identify with "crawl#" and FreshCrawlers idenfity with "crawler#"? Thanks, good info.

>>I'm wondering if I should add forumdisplay.php and showthread.php to the robots.txt as I don't know what Google will think if it spiders the forum and archive, finding the same content.

I don't think so.. while they contain some of the same content, they are in different formats.. which should equal different content to the spider.

NexDog 03-11-2003 09:25 PM

I see you had success and have your archive set up like mine. Did google's bots hit your forum and archive? Do you have showthread.php listed in your robots.txt?

NexDog 03-11-2003 09:26 PM

WOW, PR7! What was your page rank before you messed with the archive? Due to our server move and new site, our dropped to 5 from 6 - not happy. :(

NexDog 03-11-2003 09:31 PM

Looks like you're getting slammed:

Currently Active Users (977):450 members, 489 guests, and 38 web robots

I take it you have your own server. :D

Overgrow 03-11-2003 09:35 PM

/me blushes at mention of his 7" PR

Google has just started digesting the archive. I started getting hits in from it a day or two ago so I'm not sure it has affected the PageRank yet. I'm not even sure the archive has been listed fully in Google, we may be in for the short dance right now, to be re-added later.

>>Did google's bots hit your forum and archive? Do you have showthread.php listed in your robots.txt?

Google has always gotten some of the posts since they are linked from the front page, then it finds the forums. It never got into the deepcrawl of the archive though. Funny, as I think I was the first person to release a vB archive hack that lots of people still use successfully... but I never linked mine from my front page so it was never picked up. Only when I saw Teck's was I inspired to get it working and linked.

But oh yea, no, I don't have anything special listed in robots.txt

>>38 web robots, I take it you have your own server.

hehe those robots have been here for weeks. Only in the past few days have I had some really high guest numbers. Usually members / guests is even. Now I have a hundred or more guests on than members thanks to the archive. Hopefully they are getting into the rest of the site as well.

Schorsch 03-11-2003 10:05 PM

wow Overgrow, this must produce a huuuuge portion of traffic :bunny:

iceman11111 03-11-2003 10:18 PM

How do you get your link on othere websites?

Overgrow 03-11-2003 10:19 PM

Well I suppose you could hack into the other websites and put your link up.. or you can email the administrators and ask if they do link exchanges?

NexDog 03-12-2003 03:23 AM

Had this bot on my forum for days:

argon.oxeo.com

Have no idea what it is. Does anyone know and should I deny it access?

NexDog 03-12-2003 10:16 AM

Quote:

Originally posted by Overgrow
So you're saying DeepCrawlers identify with "crawl#" and FreshCrawlers idenfity with "crawler#"? Thanks, good info.

Sorry, just caught this.

Google's Freshbot comes in on IP range 64.68.82 and the deep crawler that goes out in Google dance time comes in on 216.239.46.

Overgrow 03-13-2003 01:39 AM

OK there is a point where the archive can work too well. I was getting hundreds of off-topic hits from Google searchers thanks to the spidering of my off-topic chat areas. The last thing I want is to expand the off-topic areas and get more general chatters in, so I added disallows to my robots.txt to the chatting archives and I made the actual archive threads offlimits in showthread.txt.. One question:

Will Google DE-list my pages once they are disallowed there?


I'm getting great results for things like:

Christina Agulera Naked
and
Internet Security Procedures

glenvw 03-13-2003 01:48 PM

My hat is off to you TECH!

5600 Pages (threads) indexed by Yahoo and Google.

You da man!

Question...I have to delete threads once in a while that are bad content. When someone lands on a deleted thread from the search hack, they get the page with no images, etc..

Is there a re-direct we could use that sends them somewhere else to fix that?

Thanks again!

Logician 03-18-2003 07:42 PM

I don't know if this add-on was already provided in the thread or not, my apologies if it did:

This code keeps updating the original thread's view count even if your archive thread is visited:

Edit "showthread" (hack file! not showthread.php!), find:
PHP Code:

eval( 'dooutput( "' gettemplate'archive' ) . '" );' ); 

Before that add:

PHP Code:

if ($noshutdownfunc) {
  
$DB_site->query("UPDATE thread SET views=views+1 WHERE threadid='$thread[threadid]'");
} else {
  
$shutdownqueries[]="UPDATE LOW_PRIORITY thread SET views=views+1 WHERE threadid='$thread[threadid]'";


Enjoy..

TECK 03-21-2003 10:08 PM

Quote:

03-11-03 at 11:25 PM NexDog said this in Post #467
I see you had success and have your archive set up like mine. Did google's bots hit your forum and archive? Do you have showthread.php listed in your robots.txt?
Hmmm... Google results for vBulletin.org Archive:
Results 1 - 10 of about 10,600. Search took 0.04 seconds

Google results for TeckWizards.com Archive:
Results 1 - 10 of about 727. Search took 0.06 seconds
(I don't allow the robots to visit any .php forum file and all archive files are still indexed. I have my archive set outside the forum folder.)

TECK 03-21-2003 10:11 PM

Quote:

03-13-03 at 03:48 PM glenvw said this in Post #477
My hat is off to you TECH!

5600 Pages (threads) indexed by Yahoo and Google.

You da man!

Question...I have to delete threads once in a while that are bad content. When someone lands on a deleted thread from the search hack, they get the page with no images, etc..

Is there a re-direct we could use that sends them somewhere else to fix that?

Thanks again!

In your .htaccess file, add this line:
[high]ErrorDocument 404 http://www.yoursite.com/[/high]


All times are GMT. The time now is 12:50 AM.

Powered by vBulletin® Version 3.8.12 by vBS
Copyright ©2000 - 2025, vBulletin Solutions Inc.

X vBulletin 3.8.12 by vBS Debug Information
  • Page Generation 0.02015 seconds
  • Memory Usage 1,832KB
  • Queries Executed 10 (?)
More Information
Template Usage:
  • (1)ad_footer_end
  • (1)ad_footer_start
  • (1)ad_header_end
  • (1)ad_header_logo
  • (1)ad_navbar_below
  • (2)bbcode_php_printable
  • (10)bbcode_quote_printable
  • (1)footer
  • (1)gobutton
  • (1)header
  • (1)headinclude
  • (6)option
  • (1)pagenav
  • (1)pagenav_curpage
  • (4)pagenav_pagelink
  • (1)pagenav_pagelinkrel
  • (1)post_thanks_navbar_search
  • (1)printthread
  • (40)printthreadbit
  • (1)spacer_close
  • (1)spacer_open 

Phrase Groups Available:
  • global
  • postbit
  • showthread
Included Files:
  • ./printthread.php
  • ./global.php
  • ./includes/init.php
  • ./includes/class_core.php
  • ./includes/config.php
  • ./includes/functions.php
  • ./includes/class_hook.php
  • ./includes/modsystem_functions.php
  • ./includes/class_bbcode_alt.php
  • ./includes/class_bbcode.php
  • ./includes/functions_bigthree.php 

Hooks Called:
  • init_startup
  • init_startup_session_setup_start
  • init_startup_session_setup_complete
  • cache_permissions
  • fetch_threadinfo_query
  • fetch_threadinfo
  • fetch_foruminfo
  • style_fetch
  • cache_templates
  • global_start
  • parse_templates
  • global_setup_complete
  • printthread_start
  • pagenav_page
  • pagenav_complete
  • bbcode_fetch_tags
  • bbcode_create
  • bbcode_parse_start
  • bbcode_parse_complete_precache
  • bbcode_parse_complete
  • printthread_post
  • printthread_complete