vb.org Archive (https://vborg.vbsupport.ru/index.php)
-   vBulletin 2.x Full Releases (https://vborg.vbsupport.ru/forumdisplay.php?f=4)
-   -   vB Easy Archive FINAL - Search Engine Spiderable Hack! Get your posts listed @ google (https://vborg.vbsupport.ru/showthread.php?t=47087)

cono 01-10-2003 04:31 PM

How could I get the header and footer to look like the rest of the board? Is there anything I can do to make it fit right in, or do I have to mimic the template using custom HTML to get close to the board's style?

Thanks, great job on this hack! Probably the best one of the year... so far LOL

NTLDR 01-10-2003 04:38 PM

@cono The quickest way would be to copy the HTML directly from viewing the forums and add it to header.php

@SkuZZy you can add me to the list too if you want (http://www.thesistersthree.com/archive) Works well as a text only version of the site and sub forums seem to work now too :$

cklaszlo 01-10-2003 05:13 PM

Ours has crashed. We rebuilt our server on Red Hat Linux 8.0 and now the script doesn't work. Any suggestions?

http://www.thrillnetwork.com/boards/archive/

All the paths are the same, and the index page pulls up; it just doesn't parse any of the forum pages.

Is this an Apache thing? Permissions? Very odd.

subduck 01-10-2003 06:06 PM

You can add me to the list if you like :)

www.subduck.com/history

nice hack!

Destee 01-10-2003 06:21 PM

Thank you for sharing this hack Xenon and SkuZZy.

The installation went well and appears to be working properly.
http://www.destee.com/forums/archives

I noticed that there are many posts in this thread where folk
talk about Google spidering their archives/history ... yet when
I did a search on "modified/released by SkuZZy" ... I was returned
only 242 results <??>.

Thanks Again ... Clicking Installed.

Destee

NTLDR 01-10-2003 06:24 PM

Quote:

Originally posted by Destee
I noticed that there are many posts in this thread where folk
talk about Google spidering their archives/history ... yet when
I did a search on "modified/released by SkuZZy" ... I was returned
only 242 results

Most of the sites that have been crawled by google probably haven't been added to the index yet.

Destee 01-10-2003 06:37 PM

NTLDR ... Okay, Thanks.

Destee

ladyfyre 01-10-2003 11:26 PM

well granted, my forums have a lot of hacks installed, but despite having checked, double checked, and triple checked the path for global.php, i am still getting a blank screen. Any ideas?

SkuZZy 01-11-2003 07:41 AM

Quote:

Originally posted by Destee
Thank you for sharing this hack Xenon and SkuZZy.

The installation went well and appears to be working properly.
http://www.destee.com/forums/archives

I noticed that there are many posts in this thread where folk
talk about Google spidering their archives/history ... yet when
I did a search on "modified/released by SkuZZy" ... I was returned
only 242 results <??>.

Thanks Again ... Clicking Installed.

Destee

I really must say, I love your archive. Nice job styling it! Added you to the top of the list ;)

Like NTLDR mentioned, it takes google a few weeks to update all pages. No pages should even be in google as of now, and they will be gone within the next few days (what you're seeing is called everflux). At the end of the month, however, there should be a lot ;)

Thanks for installing the hack.

SkuZZy 01-11-2003 07:47 AM

I'd like to address all the people reporting problems. It's obvious these scripts have problems with certain server configurations. Thrillnetwork proves this even further: after upgrading to Red Hat 8.0, their archive no longer works. I really don't know what to say. I am trying to figure out what the problem is, and I've even asked more l33t programmers, and no one seems to know. So the bottom line is, if you can get this hack to work for you (which it seems like most people can), then congrats. If not, please keep posting your errors here. I AM working on fixing it, but for now I'd suggest looking over some of the tips already posted in this thread about things needed in httpd.conf.

Also, if your archive isn't working, try renaming the two files (topic / forum) to "topic.php" and "forum.php", then see if the URL works like that (i.e. http://www.blah.com/forum.php/100.html). If it works after you rename them to .php, that means it IS a .htaccess error. Make sure your server supports the .htaccess you're trying to use. I'm not a pro on this directive, but it's ForceType application/x-httpd-php, so obviously it depends on what httpd.conf allows, and perhaps some hosts have it disabled. I have a feeling this is the problem many are suffering from. I will look into it more.
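For anyone trying the rename test, it may help to see what the script receives for a URL such as forum.php/100.html. The sketch below is not the hack's actual code; it assumes the ID is read from the PATH_INFO server variable (the part of the URL after the script name), and the helper name archive_id_from_path_info is made up for illustration.

```php
<?php
// Hypothetical helper (not part of vB Easy Archive): extract the numeric ID
// from a PATH_INFO value such as "/100.html". Returns null when no ID is found.
function archive_id_from_path_info($pathInfo)
{
    if (preg_match('#^/(\d+)\.html$#', (string) $pathInfo, $matches)) {
        return (int) $matches[1];
    }
    return null;
}

// For a request to forum.php/100.html, Apache passes "/100.html" as PATH_INFO.
$pathInfo = isset($_SERVER['PATH_INFO']) ? $_SERVER['PATH_INFO'] : '';
$id = archive_id_from_path_info($pathInfo);

if ($id === null) {
    echo "No ID found in the URL\n";
} else {
    echo "Requested ID: " . $id . "\n";
}
```

If a page like this prints the ID under the .php name but the extensionless URL does not work, that points at the .htaccess handling rather than at the paths in config.php.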

SkuZZy 01-11-2003 08:03 AM

Ok ladies and gentlemen. I've figured out what the problem is (now wasn't that quick?). If you are one of the people who have installed the hack and can see the front page of your archive, but the links don't work, here is the fix. You are missing a required directive from a module called "mod_mime". To fix this, ask your web host to install mod_mime. It provides the ForceType directive, which the .htaccess file included with this script uses to flatten your URLs and take the .php out of them. Without mod_mime installed, the script basically won't work as described. For more information about mod_mime, visit http://httpd.apache.org/docs/mod/mod_mime.html
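As a rough illustration of the ForceType approach described above, an .htaccess along these lines tells Apache to run the extensionless files as PHP. This is a sketch, not the file shipped with the hack: the file names "forum" and "topic" come from this thread, and the exact layout is an assumption.

```apache
# Hypothetical .htaccess sketch for the archive directory.
# Treat the extensionless scripts "forum" and "topic" as PHP.
<Files forum>
    ForceType application/x-httpd-php
</Files>
<Files topic>
    ForceType application/x-httpd-php
</Files>
```

ForceType belongs to the FileInfo override class, so it only takes effect from .htaccess if the host's httpd.conf permits it for that directory (for example, AllowOverride FileInfo). Having mod_mime loaded is necessary but may not be sufficient on its own.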

SkuZZy

crocket 01-11-2003 08:20 AM

* crocket gets to work, Hopefully this works :D

SkuZZy 01-11-2003 08:27 AM

Quote:

Originally posted by crocket
* crocket gets to work, Hopefully this works :D

If it works, I'm sure you'll let me know :devious:

crocket 01-11-2003 12:20 PM

Looked at my local server, coz that doesn't show sub pages either..

and on that..
LoadModule mime_module modules/mod_mime.so

is in the httpd.conf

Therefore mod_mime is already running :(

Or have I missed something?!

Crocket

cklaszlo 01-11-2003 03:18 PM

any way I can get the previous version? For some reason I think that may work. I can't find my copy. thanks.


***** Never mind, found it in an old backup. The index page works again but no linking. We are checking mod_rewrite.

***** We reverted to the previous version of this script and it works!!! We just had to add mod_rewrite. For some reason the new version doesn't work. I guess if it ain't broke, don't fix it.

We also signed up and added that pop-up code. Are we sure Google won't mind it too much?

***** I was wondering if you or someone else can help us create a similar script for our Roller Coaster and Amusement Park database. It would be great to get all those pages listed too.

Coasters: http://www.thrillnetwork.com/index.p...nc=allcoasters

Amusement parks: http://www.thrillnetwork.com/index.p...&func=allparks

codewebs 01-11-2003 05:08 PM

That shouldn't be that hard :-/ Just use mod_rewrite. I'll try to figure it out, unless someone else does before me.

cklaszlo 01-11-2003 05:10 PM

really. That would be awesome!!!

And FYI as soon as we put the old script back up guess what, inktomisearch started to crawl the archive!!! :)

SkuZZy 01-11-2003 08:38 PM

Quote:

Originally posted by cklaszlo


We also signed up and added that pop-up code. Are we sure Google won't mind it too much?

Google can't read javascript ;)

TECK 01-11-2003 09:07 PM

Yes it can. This is posted on my site:
GOOGLE FACTS
1. Google uses a crawler named Googlebot which crawls the web approximately every thirty days.
2. It is not necessary to submit any page to Google. If you do submit, submit only your most important page to this search engine.
3. Googlebot is a deep crawler and should crawl all of your pages.
4. Google supplies ranking results for placement in Netscape Search, the ODP, Anzwers, Yahoo! and Ilor.
5. Google can crawl pages in ASP, JSP, CFM, PHP, Excel, Microsoft Word, newsgroups, PDF and PostScript files, Power Point and Rich Text formats.
6. Google loves sites with a high number of legitimate, relevant incoming links.
7. Google hates spam.

SkuZZy 01-11-2003 09:26 PM

Quote:

Originally posted by TECK
Yes it can. This is posted on my site:
GOOGLE FACTS
1. Google uses a crawler named Googlebot which crawls the web approximately every thirty days.
2. It is not necessary to submit any page to Google. If you do submit, submit only your most important page to this search engine.
3. Googlebot is a deep crawler and should crawl all of your pages.
4. Google supplies ranking results for placement in Netscape Search, the ODP, Anzwers, Yahoo! and Ilor.
5. Google can crawl pages in ASP, JSP, CFM, PHP, Excel, Microsoft Word, newsgroups, PDF and PostScript files, Power Point and Rich Text formats.
6. Google loves sites with a high number of legitimate, relevant incoming links.
7. Google hates spam.

Not everything google says is true. Perhaps they CAN spider javascript, but they don't.

John 01-11-2003 09:27 PM

Help!

I get nothing but a blank page. And yes, all paths are correct - trust me!

Mod_mime is installed, on Apache, PHP 4.2.3

What do I do now? :(

TECK 01-11-2003 09:29 PM

Quote:

Originally posted by SkuZZy


Not everything google says is true. Perhaps they CAN spider javascript, but they don't.

I doubt they will post something that could affect their solid reputation.

John 01-11-2003 09:32 PM

Also: when I upload .htaccess, it says the transfer was successful, but then I can't view it on the server. I wonder if it's there? :confused:

SkuZZy 01-11-2003 09:46 PM

Quote:

Originally posted by TECK

I doubt they will post something that could affect their solid reputation.

Believe it or not, google isn't perfect. Yes, most of what they say is true, but still limited. They say they can spider PHP, so why don't they spider vbulletin then? Or thousands of other PHP scripts? Because google doesn't like the URLs that some of them use: they are too complex, or google doesn't like the session hash, or many other reasons. Same thing goes with javascript. Google understands a VERY LIMITED amount of javascript. Google would be stupid to spider javascript, as it can cause so many problems and also manipulate googlebot into a loop. This is why websites put text links inside javascript when they don't want them to count as a link in google, because they know google won't touch javascript. It's standard practice.

John 01-11-2003 10:01 PM

*cries*

DrkFusion 01-11-2003 10:15 PM

LMAO

Teck's archive is pretty spiffy. I now have a total of 3 archives running lol... Google's gotta do 1 of them.

John 01-11-2003 10:37 PM

Ok, fixed - I feel like a complete idiot for posting my cries of help. I needed to use

chdir("..");
require("./global.php");

instead of the path to the global file.
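A slightly fuller sketch of that fix, for a config.php that lives in an archive directory placed inside the forum directory (an assumption; other layouts need a different chdir() target). The likely reason it helps, though nobody in the thread confirms it, is that global.php loads further files through paths relative to the working directory, so an absolute require from another directory leaves those lookups pointing at the wrong place. The file name global.php is taken from the config posted elsewhere in this thread.

```php
<?php
// Sketch of a config.php using chdir() instead of an absolute path.
// Assumption: this file sits in a directory directly inside the forum root.

// Step up into the forum root so relative includes resolve from there.
if (!@chdir('..')) {
    die('Could not change into the forum directory.');
}

// Fail with a visible message instead of a blank page if the file is missing.
if (!is_file('./global.php')) {
    die('global.php not found in ' . getcwd());
}

require('./global.php');
```

The two die() messages are only there to replace the blank page with something readable while debugging.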

/me feels like a retard

TECK 01-11-2003 11:46 PM

Quote:

Originally posted by SkuZZy


Believe it or not, google isn't perfect. Yes, most of what they say is true, but still limited. They say they can spider PHP, so why don't they spider vbulletin then? Or thousands of other PHP scripts? Because google doesn't like the URLs that some of them use: they are too complex, or google doesn't like the session hash, or many other reasons. Same thing goes with javascript. Google understands a VERY LIMITED amount of javascript. Google would be stupid to spider javascript, as it can cause so many problems and also manipulate googlebot into a loop. This is why websites put text links inside javascript when they don't want them to count as a link in google, because they know google won't touch javascript. It's standard practice.

Please make 2 files: test.php and test.js
Include some code to display readable text in each file, submit them to Google, then let me know if they are indexed.

Google indexes all files that have text-readable output, and ONLY those, as mentioned in my GOOGLE FACTS a few posts above.

Before I made my own archive system (almost identical to the vB3 one), I researched several search engines in order to optimize my code properly and avoid any cloaking techniques.

SkuZZy 01-12-2003 03:37 AM

Quote:

Originally posted by TECK

Please make 2 files: test.php and test.js
Include some code to display readable text in each file, submit them to Google, then let me know if they are indexed.

Google indexes all files that have text-readable output, and ONLY those, as mentioned in my GOOGLE FACTS a few posts above.

Before I made my own archive system (almost identical to the vB3 one), I researched several search engines in order to optimize my code properly and avoid any cloaking techniques.

I said above that google will spider some javascript. If you make a .js file with plain text or whatever, then maybe google will spider it. But I'm talking about javascript embedded within .html or .php files. For instance, for the popups to work, you put:

Code:

<SCRIPT LANGUAGE="JAVASCRIPT" SRC="http://view.popupsponsor.com/media/lx.js?a_id=10729&creative_type=1"></SCRIPT>
in your <head> tags. Google will NOT read this when it views the html page. It will see the javascript but it will ignore anything in between the <script> tags. In other words, it won't open the url http://view.popupsponsor.com/media/l...reative_type=1, so putting in popup windows won't affect google's spidering, which was the original point. I'm sure google might spider some javascript or .js files, but not the types I use.

TECK 01-12-2003 05:01 AM

Ahaaaa, I got your point now. :D
You are totally right about those URL's. But that's not the case for vBulletin, right? If I understood correctly he was referring to the pm popup...
I might be wrong, but about those URL's listed above, you definitely don't want them, as you said. :)

crocket 01-12-2003 08:57 AM

where can i get the old mod_rewrite version? i wanna try that!

cklaszlo 01-12-2003 01:06 PM

Not sure if this will help you but on our current server only

vB Easy Archive v1.0 - Released on December 1st, 2002 ####

will run correctly.

Here are our current server configs:

PHP Version 4.2.2
Red Hat Linux 8.0
Apache Version Apache 1.3.27
Loaded Modules: mod_php4, mod_setenvif, mod_so, mod_auth, mod_access, mod_rewrite, mod_alias, mod_userdir, mod_actions, mod_imap, mod_asis, mod_cgi, mod_dir, mod_autoindex, mod_include, mod_status, mod_negotiation, mod_mime, mod_log_config, mod_env, http_core

I hope this might help you try to figure out a more universal script. Since we've been very successful with it so far.

Dean C 01-12-2003 02:06 PM

Wow this looks fantastic - how on earth did i ever miss this thread :)

- miSt

glenvw 01-12-2003 04:52 PM

I host with VenturesOnline and they claim the mod is installed on my server..


Here is my config file:

<?php

##############################################################
### vB Easy Archive v2.1 - Released on December 29th, 2002 ###
## Scripts created by Xenon and modified/released by SkuZZy ##
### Copyright info MUST stay on the bottom of all scripts ####
#### Visit http://www.vbulletin.org for updated versions! ####
##############################################################

// Full PATH to your global.php file

require('/home/yesits/public_html/vbbs/global.php');

// Title of your website/forums... used for <title> tags on most pages

$title="Yes-its-free.Com - Your Highway To The World's Free Stuff!";

// Forums URL - Directory your vbulletin forums are in (NO TRAILING SLASH!)

$forumurl="http://www.yes-its-free.com/vbbs";

// Archive URL - The location of these scripts (NO TRAILING SLASH!)

$archiveurl="http://www.yes-its-free.com/easyarchive";

?>

All paths are right and I still get an all-white page when I call:

http://www.yes-its-free.com/easyarchive/

John 01-12-2003 04:56 PM

Quote:

Originally posted by glenvw
I host with VenturesOnline and they claim the mod is installed on my server..


Here is my config file:

<?php

##############################################################
### vB Easy Archive v2.1 - Released on December 29th, 2002 ###
## Scripts created by Xenon and modified/released by SkuZZy ##
### Copyright info MUST stay on the bottom of all scripts ####
#### Visit http://www.vbulletin.org for updated versions! ####
##############################################################

// Full PATH to your global.php file

require('/home/yesits/public_html/vbbs/global.php');

// Title of your website/forums... used for <title> tags on most pages

$title="Yes-its-free.Com - Your Highway To The World's Free Stuff!";

// Forums URL - Directory your vbulletin forums are in (NO TRAILING SLASH!)

$forumurl="http://www.yes-its-free.com/vbbs";

// Archive URL - The location of these scripts (NO TRAILING SLASH!)

$archiveurl="http://www.yes-its-free.com/easyarchive";

?>

All paths are right and I still get an all-white page when I call:

http://www.yes-its-free.com/easyarchive/

I had exactly the same problem - had everything installed properly, mod_mime, etc, kept on getting a blank page. In the end I simply used chdir to navigate to the global file instead, and it worked perfectly!

glenvw 01-12-2003 05:01 PM

Can you help a dummy out and edit my file or paste exactly what I need to do?

John 01-12-2003 05:03 PM

My chdir knowledge is limited, and I really can't be bothered to think right now.

If you put your archive directory INSIDE of your forum directory, I can paste my config file here.

glenvw 01-12-2003 05:05 PM

no problem

I appreciate it

Please do

John 01-12-2003 05:13 PM

I'll have to do it later - my only copy of config.php is on my ftp server, and I can't log in at the moment...

glenvw 01-12-2003 05:17 PM

Thanks John

Shoot fuddie

I am heading out for few days and wanted to get this finished. I will keep watching for your reply or anyone else willing to help.

Thank you


All times are GMT. The time now is 09:34 PM.
