vb.org Archive

vb.org Archive (https://vborg.vbsupport.ru/index.php)
-   vBulletin 2.x Full Releases (https://vborg.vbsupport.ru/forumdisplay.php?f=4)
-   -   vbArchive - Search Engine Indexer for vBulletin (https://vborg.vbsupport.ru/showthread.php?t=47667)

kuska 02-02-2003 04:48 PM

TECK sorry to bother but i got more questions :)

Does it matter that i have a different Search Engine Indexer (Skuzzys) installed with your Archive? I figured why not double my chances of getting indexed since Skuzzy Archive uses different addresses for the forums, thread, posts than yours. Plus Skuzzys does not include your pimp dynamic Meta tags:)
Im just wondering if it matters to search engines... I installed Skuzzys about 3 weeks ago, within the 1st day intkomi started spidering it.... I installed your hack about a 1-2 weeks ago and no robots visit it and im just wondering is it because i have the skuzzys indexer installed also.

Here is what i have:
Skuzzys hack at
http://www.nakazdytemat.com/forum
Yours
http://www.NaKazdyTemat.com/archive


I dont see why there would be a problem between those two but i want your opinion on it TECK... Because if there is a problem im going to pick your hack and remowe SkuZzys...

And is anyone else doing what i am doing?

Thanks.

TECK 02-02-2003 05:35 PM

Quote:

Originally posted by saint_seiya
Sure, sorry thats the default font lol.

If you tell me how, there is no index file to edit, and if i change the color in my main vb settings my forum layout is gonna get screwed up :(

Edit "archive" template, at the bottom, where is marked the copyright.

TECK 02-02-2003 05:39 PM

Quote:

Originally posted by kuska
TECK sorry to bother but i got more questions :)

Does it matter that i have a different Search Engine Indexer (Skuzzys) installed with your Archive? I figured why not double my chances of getting indexed since Skuzzy Archive uses different addresses for the forums, thread, posts than yours. Plus Skuzzys does not include your pimp dynamic Meta tags:)
Im just wondering if it matters to search engines... I installed Skuzzys about 3 weeks ago, within the 1st day intkomi started spidering it.... I installed your hack about a 1-2 weeks ago and no robots visit it and im just wondering is it because i have the skuzzys indexer installed also.

Here is what i have:
Skuzzys hack at
http://www.nakazdytemat.com/forum
Yours
http://www.NaKazdyTemat.com/archive


I dont see why there would be a problem between those two but i want your opinion on it TECK... Because if there is a problem im going to pick your hack and remowe SkuZzys...

And is anyone else doing what i am doing?

Thanks.

There should be no problems using both archives. However, if you don't have them submitted, is useless to have them both, since the contents would be similar...

wooolF[RM] 02-02-2003 06:17 PM

][high]* wooolF[RM] pokes TECK in the eye
[/high]

just went by to say nice hack once again :) keep it up

Mike Gaidin 02-02-2003 06:19 PM

Installed and working great. Thanks TECK. :)

saint_seiya 02-02-2003 06:48 PM

Ok, i changed it:

http://www.vgcity.com/forum/archive/

Now can you add the link :)

Anyway, i was wondering if you could help me set up mod rewrite to rewrite all the urls of www.vgcity.com so they could be easily spidered. I am willing to pay :-/ I just need to get it done :(

kuska 02-02-2003 08:26 PM

I did submit both of them :)
Ill just wait and pull my hair out till spiders come.
Thanks for this great hack.

jjj0923 02-03-2003 08:41 PM

I installed this hack and a fine one it is.

I have mod_rewrite turned on in my apache server because I have a rewrite rule that keeps people from stealing my bandwidth and serving images from my site on their sites...

I installed the hack as instructed and embedded a link to /archive in my home page in the hopes that spiders would pick it up and follow it, I also visited the major search engines and submitted a new url ending with /archive...but JUST TO BE SAFE... I wrote a rule and some conditions that would insure spiders (the one's I know visit my site from my logs) would DEFINATELY spider the new html generated pages and here's how I did it:

Quote:

RewriteEngine on
RewriteCond %{HTTP_USER_AGENT} ^(.*)googlebot(.*)$ [NC,OR]
RewriteCond %{HTTP_USER_AGENT} ^(.*)slurp(.*)$ [NC,OR]
RewriteCond %{HTTP_USER_AGENT} ^(.*)scooter(.*)$ [NC,OR]
RewriteCond %{HTTP_USER_AGENT} ^(.*)webcrawler(.*)$ [NC,OR]
RewriteCond %{HTTP_USER_AGENT} ^(.*)surveybot(.*)$ [NC,OR]
RewriteCond %{HTTP_USER_AGENT} ^(.*)mercator(.*)$ [NC]
RewriteRule ^(.*)\.php(.*)$ http://www.yourdomain.com/upload/archive [R=302]
Of course you would subsitute www.yourdomain.com with your domain name and substitute 'upload' with the directory name that contains 'archive' on your system.

I installed the hack yesterdaty and visits from slurp today are getting redirected to the archive generated pages.

Thanks for a great hack.

:)

kuska 02-04-2003 02:05 AM

Usefull tool:
Link Popularity Check
Popularity script checks 7 major Search Engines for inbound links to your site.
Checks Google, AllTheWeb, AltaVista, AOL, MSN, HotBot, and Lycos.
Comparison to your competitors.
Option to send the report to your Email address.
Fast and accurate.

THIS IS NOT A SPAM POST!!!!! Just thought it might be usefull for this hack :) Im in the 0 to 250 references spot : \ ...vb.org has 2,501 to 10,000 references :)
Here is the link:

http://www.free-webmaster-tools.com/link_popularity.htm

TECK 02-04-2003 02:48 AM

Thanks kuska. :)
Just checked now and it returned 1568 links for our site...

Cool tool. :)

jjj0923 02-04-2003 11:42 AM

Got a question your Teck...

now that "we" have all these 'archive' links showing up on search engines, how do I redirect the non-search engines to my real forums once they click on a link to one of the archive pages that will be displayed in the search engine results. Should I just write a rule and add it to my .htaccess file?

How do you do it?

- jeff

jjj0923 02-04-2003 11:52 AM

How about this as an enhancement?

Add some php code to archive that checks to see if the user_agent is one of the popular browers and if so, generate a link in the displayed /html that includes the threadid of the original post. This way, when a user clicks through from one of the search engines a link is displayed to that exact thread back to your real forums thread.

- jeff

TECK 02-04-2003 02:13 PM

Ok, I corrected a minor bug that occurs when you have a new forum published with no threads on it.
Fix example (for the time it will stay empty):
http://www.teckwizards.com/forumdisplay/f-38.html

FILE: forumdisplay.txt
FIND:
Code:

  if ( $limitlower <= 0 )
  {
    $limitlower = 1;
  }

REPLACE WITH:
Code:

  if ( $limitlower <= 0 )
  {
    $limitlower = 1;
  }
  if ( empty( $totalthreads ) )
  {
    $limitlower = 0;
    $showthreadbits = 'No threads available. Please try a different location.';
  }

That will take care of the problem. I updated the .zip file.

jjj0923 02-04-2003 02:23 PM

inktomi crawlers...

there have been 35 on my site since 8:00 am this morning - that's over 3 hours....

:)

Overgrow 02-04-2003 02:31 PM

>>Should I just write a rule and add it to my .htaccess file

If you write a rule to auto-forward you may be banned by Google for link-cloaking. The best way is to try and make a prominent link, as you suggested. That's what my old hack did and hopefully Teck will update this to do the same.

jjj0923 02-04-2003 02:42 PM

Quote:

Originally posted by Overgrow
>>Should I just write a rule and add it to my .htaccess file

If you write a rule to auto-forward you may be banned by Google for link-cloaking. The best way is to try and make a prominent link, as you suggested. That's what my old hack did and hopefully Teck will update this to do the same.

banned by goggle - i think not.

google could care less if you redirect....for heavens save this entire hack is one big redirect.

:)

jjj0923 02-04-2003 02:44 PM

while this is a great hack and will get your pages on websites, I think a lot of people are going to be very disappointed when they land on a page only to realize that there's no way to:

a) click through to your forum
b) find the original thread in your forum so that they can reply or join.

I showed this to a bunch of very seasoned web developers I work with and they ALL expressed the same opinion. It need to be enhanced, THEY THINK IT'S GREAT MIND YOU, but will leave the clicking-through end user somewhat disappointed. I'm look at the code right now to insert the link via the php (for browsers) instead of writing an external rule.


:)

TLucent 02-04-2003 05:03 PM

How do you lock your board down for registered or logged in users only yet maintain your archives for spidering?

TECK 02-04-2003 05:12 PM

What's the use to do this? This is really bad and unethical.
Scenario:
I search the web and find exactly what I need.
I go to the archive and click on the VIEW THREAD... BOOM, no permission.
I leave the site rightaway, swearing at the guy who had such an idea to publish twisted methods to gain more users on his board.

As long as you have my script installed, with my copyright on it, you are not allowed to use any forcing methods to block guests for the forums, but not to the archive. The script is designed to work the same way like the forums, and it will stay like that.
Feel free to uninstall it and use another script, I really want you to do this.

jjj0923 02-04-2003 05:18 PM

Quote:

Originally posted by TECK
What's the use to do this? This is really bad and unethical.
Scenario:
I search the web and find exactly what I need.
I go to the archive and click on the VIEW THREAD... BOOM, no permission.
I leave the site rightaway, swearing at the guy who had such an idea to publish twisted methods to gain more users on his board.

As long as you have my script installed, with my copyright on it, you are not allowed to use any forcing methods to block guests for the forums, but not to the archive. The script is designed to work the same way like the forums, and it will stay like that.
Feel free to uninstall it and use another script, I really want you to do this.

RIGHT ON!!! - I COULD NOT HAVE SAID IT ANY BETTER MYSELF.

EXPOSURE IS EVERYTHING. I take a look at 80% of the site listed here by people and they only have a few members. I want tens of thousands of members, not a few members. This is a great hack and gets you exposure!!!

TLucent 02-04-2003 05:25 PM

Well Nakkid,

The site I run is a free service for anyone who REGISTERS with no ads and nothing is sold just a community forum that offers potentially usefull information for FREE. There is never nothing to buy and I pay for the hosting and bandwidth out of my pocket. I pay for the software that is ran on the website and I devote endless hours to provide a FREE service to whom ever would like to use it. All I ask is for them to register. I will respect your wishes with the use of your vbArchive code however do not respect your impractical suggestion of "twisted" and "unethical" methods "to gain more users" at the cost of the poor soul who devoted much time and money simply clicking on a link and ended up on my site.

.:TRansLucent.:

jjj0923 02-04-2003 05:34 PM

Hey ".:TRansLucent.:"

You're an idiot....and now you're the first person on my "ignore list" :)

Respectfully submitted....

TLucent 02-04-2003 05:39 PM

yeah and your funny with such useful, constructive contributions..

I'm off this subject my point stand clear.

TECK 02-04-2003 06:18 PM

Quote:

Originally posted by TLucent
Well Nakkid,

The site I run is a free service for anyone who REGISTERS with no ads and nothing is sold just a community forum that offers potentially useful information for FREE. There is never nothing to buy and I pay for the hosting and bandwidth out of my pocket. I pay for the software that is ran on the website and I devote endless hours to provide a FREE service to whom ever would like to use it. All I ask is for them to register. I will respect your wishes with the use of your vbArchive code however do not respect your impractical suggestion of "twisted" and "unethical" methods "to gain more users" at the cost of the poor soul who devoted much time and money simply clicking on a link and ended up on my site.

.:TRansLucent.:

Don't take in a negative way, but most visitors will think like that, the way I posted earlier.
I know what you mean by free service, look at me and my hacks... hell, vbHome eat my life for a week and I did it for free.

I was speaking not directly to you, but to a person who would do this. I guarantee you most people will close the browser and never go back to your site, if you block them... I know it from my own experience with my protected forums.

Since I open the gate to everyone, I had over 200 new members in 2 weeks. Before it used to be like this scenario:
User come and visit the site, he registers and he get accepted to the restricted areas, if he displays the vbHome (lite) copyright onto his website.
Look onto those forums:
http://www.teckwizards.com/forum/for...p?s=&forumid=6

If you click on any thread, you will get a no_permission, until you register... but you still can view the thread titles. The rest of the forums are open to guests.
That is a hack I made, not released at vBulletin...

Anyway, the idea is this: is better to have your site open, trust me, people will register.
Good luck with your projects.

TECK 02-04-2003 06:23 PM

Quote:

Originally posted by Overgrow
>>Should I just write a rule and add it to my .htaccess file

If you write a rule to auto-forward you may be banned by Google for link-cloaking. The best way is to try and make a prominent link, as you suggested. That's what my old hack did and hopefully Teck will update this to do the same.

Overgrow, please post your mod, attached onto a .txt file so I can link it to the first post, with credit, like I did with Logician's mod.

jjj0923 02-04-2003 06:26 PM

good points, Teck.

I welcome everyone and promote the living dayligghts of out my site. I paid for it all; bandwidth, servers, software everything to the tune of over $10,000 in the past year.

In less than a year I have over 1,600 members and ten more join everyday on average. I used to call potential advertisers, now they contact me. Open forums are the best...

:)

Banana 02-04-2003 10:19 PM

Teck, if I have this hack installed and vblite - (how) does Googlebot retrieve 'achive'? Just that I've been crawled twice by Google and it only goes to the vanilla forums - and, yes, I have the Friendly URL link there too :(

TECK 02-04-2003 11:28 PM

There are millions of websites out there with billions of pages. Be patient please.
It can take up to 2 months until you get "really" indexed, because it's done step by step...

Did you read the email from Google I posted in the first post?

Banana 02-05-2003 02:29 PM

Yes thanks TECK I have. Yes we've been listed for months by Google, but none of the last 3 searches has found archives.

It always nice to read your condesending posts though. Makes my heart warm to you so much. I must remember to uninstall all your hacks because you really are intolerable most of the time.

jjj0923 02-05-2003 02:38 PM

Hey Banana (spelled wrong ...oh and so is condescending). Please remove all his hacks.... it will be one less bozo he has to respond so in the future so that he can gratiously support (at his largesse I remind you) us who are truly appreciative of all of his excellent work and contributions.

:)

TLucent 02-05-2003 07:25 PM

"googlebot.com (64.68.82.xxx) - Spider/Robot
04 Feb -- 18:33:47 -- -- Code 301 Moved Permanently = /forum"

I get this nearly everyday, and have never really been indexed. Is this normal?

Thx

saint_seiya 02-05-2003 07:28 PM

This is weird, i pay for lycos insite select ( http://insite.lycos.com ) and it still has not been indexed :( Any ideas why? I added a link to the archive today, from my forum page in case that was it.

As you see it should be 48 hour spider refreshes and i installed this a while ago :) Am I doing something wron Teck? I did your archive this weekend, i will wait this week and then email lycos for support ;)

BTW, my site is www.vgcity.com , archive: http://www.vgcity.com/forum/archive :chinese:

PS.- Teck when you finish your other project can you tell me, i think it was vBHL . Thanks :) :smoke:

jjj0923 02-05-2003 07:32 PM

google - don't think that's normal:

I get this:

Quote:

2/5/2003 4:22:53 PM
Search String: googlebot
Replace String:
Path: D:\logs
File Mask: *.*
Search Subdirectories
crawler10.<googlebot>.com - - [23/Dec/2002:06:22:22 -0500] "GET / HTTP/1.0" 302 0 "-" "Googlebot/2.1 (+http://www.googlebot.com/bot.html)"
crawler10.googlebot.com - - [23/Dec/2002:06:22:22 -0500] "GET / HTTP/1.0" 302 0 "-" "<Googlebot>/2.1 (+http://www.googlebot.com/bot.html)"
crawler10.googlebot.com - - [23/Dec/2002:06:22:22 -0500] "GET / HTTP/1.0" 302 0 "-" "Googlebot/2.1 (+http://www.<googlebot>.com/bot.html)"
crawler10.<googlebot>.com - - [23/Dec/2002:06:22:24 -0500] "GET /robots.txt HTTP/1.0" 404 279 "-" "Googlebot/2.1 (+http://www.googlebot.com/bot.html)"
crawler10.googlebot.com - - [23/Dec/2002:06:22:24 -0500] "GET /robots.txt HTTP/1.0" 404 279 "-" "<Googlebot>/2.1 (+http://www.googlebot.com/bot.html)"
crawler10.googlebot.com - - [23/Dec/2002:06:22:24 -0500] "GET /robots.txt HTTP/1.0" 404 279 "-" "Googlebot/2.1 (+http://www.<googlebot>.com/bot.html)"
crawler10.<googlebot>.com - - [23/Dec/2002:06:22:28 -0500] "GET /upload/index.php HTTP/1.0" 200 122266 "-" "Googlebot/2.1 (+http://www.googlebot.com/bot.html)"



TECK 02-05-2003 07:49 PM

302 is not an error, check w3c related sites for the error number.
The 404 you get it because you don't have a robots.txt file where resides the main files, not the forum ones.

wooolF[RM] 02-06-2003 09:03 AM

]Just got an idea...

Imagine Forum home page :
Users Currently Online: 200 [ 100 users + 100 guests ] <-- just an EXAMPLE

idea is to trace IPs of all users and if they match any of the IPs owned by any of search crawlers like googlebot, altavista etc, show this :

Users Currently Online: 200 [ 100 users + 80 guests + Google + Altavista ] <-- just an EXAMPLE


Maybe looks ugly... maybe add extra queries... instead of dnsing/tracing all IPs u can just look after its ident (like Mozilla for IE).


PS: maybe it's not clever to add it on the forum home, but I would REALLY like to see this feature implemented on Who's Online page :)

I know u can do it, TECK ;)

Overgrow 02-06-2003 05:57 PM

From the Google Webmasters FAQ:

What is cloaking?

The term "cloaking" is used to describe a website that returns altered webpages to search engines crawling the site. In other words, the webserver is programmed to return different content to Google than it returns to regular users, usually in an attempt to distort search engine rankings. This can mislead users about what they'll find when they click on a search result. To preserve the accuracy and quality of our search results, Google may permanently ban from our index any sites or site authors that engage in cloaking to distort their search rankings.

http://www.google.com/webmasters/faq.html


I'm assuming what you mean to do is give the spider a different page than a user gets if they click through from the search results.

That is Link Cloaking and that is grounds for banning, no matter how similar the pages are. Do at your own risk.

Overgrow 02-06-2003 05:59 PM

>>Yes we've been listed for months by Google, but none of the last 3 searches has found archives

Hahahah let's blame Teck for Google's spidering. People using his hack are in Google. If you can't get your archive in there, that is your fault. Even people using my old vBSpiderFriend are doing very well in Google... DevShed was serving me answers with it just last week from the top Google 1-5 result spots.

Floris 02-06-2003 07:21 PM

Tonight I received a google attack :)

49 guests online, 5 members :)

saint_seiya 02-06-2003 07:23 PM

How did you all get that part where it says from where the guest came. I am going to read the readme again :p

Floris 02-06-2003 07:23 PM

damn
IT DOESN"T STOP

4 Members and 53 Guests

The hosts are a vbulletin option > resolve hosts on whois online : yes.


All times are GMT. The time now is 05:24 AM.

Powered by vBulletin® Version 3.8.12 by vBS
Copyright ©2000 - 2025, vBulletin Solutions Inc.

X vBulletin 3.8.12 by vBS Debug Information
  • Page Generation 0.03978 seconds
  • Memory Usage 1,852KB
  • Queries Executed 10 (?)
More Information
Template Usage:
  • (1)ad_footer_end
  • (1)ad_footer_start
  • (1)ad_header_end
  • (1)ad_header_logo
  • (1)ad_navbar_below
  • (2)bbcode_code_printable
  • (8)bbcode_quote_printable
  • (1)footer
  • (1)gobutton
  • (1)header
  • (1)headinclude
  • (6)option
  • (1)pagenav
  • (1)pagenav_curpage
  • (4)pagenav_pagelink
  • (1)pagenav_pagelinkrel
  • (1)post_thanks_navbar_search
  • (1)printthread
  • (40)printthreadbit
  • (1)spacer_close
  • (1)spacer_open 

Phrase Groups Available:
  • global
  • postbit
  • showthread
Included Files:
  • ./printthread.php
  • ./global.php
  • ./includes/init.php
  • ./includes/class_core.php
  • ./includes/config.php
  • ./includes/functions.php
  • ./includes/class_hook.php
  • ./includes/modsystem_functions.php
  • ./includes/class_bbcode_alt.php
  • ./includes/class_bbcode.php
  • ./includes/functions_bigthree.php 

Hooks Called:
  • init_startup
  • init_startup_session_setup_start
  • init_startup_session_setup_complete
  • cache_permissions
  • fetch_threadinfo_query
  • fetch_threadinfo
  • fetch_foruminfo
  • style_fetch
  • cache_templates
  • global_start
  • parse_templates
  • global_setup_complete
  • printthread_start
  • pagenav_page
  • pagenav_complete
  • bbcode_fetch_tags
  • bbcode_create
  • bbcode_parse_start
  • bbcode_parse_complete_precache
  • bbcode_parse_complete
  • printthread_post
  • printthread_complete