![]() |
Optimized attachments in vB2.X.X
Hi vB Troopers,
***************** * Hack Idea: ***************** This hack is designed to improve 2 things: 1. Prevent duplicate attachments altogether. 2. Speed up the query for duplicates and avoid errors. What does vB do now? --------------------------------- If you have "Allow Duplicate Images" set to "No" in your attachments options and your database has a very large amount of attachments, at worst, the server will timeout or the query to check for duplicate attachments takes an extreme amount of time. The duplicates query also checks only for duplicates from a particular user so actually it is possible to have duplicate attachments in the database. What does vB do differently after the hack? ------------------------ With the hack installed, each attachment receives a hash which is stored in the attachment table. This hash is then used to find duplicate attachments regardless of whether or not the attachment is from the same user or not. So duplicate attachments are not possible. Why vB works faster with this hack? ------------------------------------ Since the hash field and not the actual file content is being compared, then much less server resources are needed for MySql to run the query. Also the hash field is indexed. This in turn speeds up the saving of the attachments when checking for duplicates. Please be aware that this hack alters the database and when carried out incorrectly, could cause data loss. Installation is at your own risk. That said, enjoy!:D Scott This hack was written by pogo. Also note, a system similiar to this will be part of vB3, so no need to ask if it will be added. |
[Release vB 2.2/3.x] Optimierung des Speicherns von Anh?ngen
Sinn des Hacks: Wenn man sehr viele Anh?nge in seiner DB hat und der Server mit einem zu kurzen Timeout versehen ist, passiert es, dass man keine Anh?nge mehr hochladen kann, weil ein 404 Fehler erscheint. Dieser Hack ver?ndert die ?berpr?fung auf doppelte Anh?nge derart, dass sie sehr viel schneller abl?uft. Wie es im original vBulletin ist: Wenn man in den Options keine doppelten Anh?nge erlaubt, wird beim Hochladen eines Anhangs ?berpr?ft, ob man diesen schon einmal hochgeladen hat. Es wird nicht ?berpr?ft, ob dergleiche Anhang schon in der DB, sondern nur, ob der augenblickliche Autor diesen Anhang schon gepostet hat. Wie es mit dem Hack ist: Es wird ?berpr?ft, ob dergleiche Anhang schon in der DB ist. Ist dies der Fall, wird der Anhang nicht erneut in der DB gespeichert, sondern der bestehende Anhang im Beitrag angezeigt. Wer braucht den Hack? Der Hack nimmt vermutlich allen Foren mit vielen Anh?ngen ein wenig (wenn nicht viel) Belastung vom Server. Die Deutsche Version findest du hier. |
hmm, from what i see, it's looking good.
just one thing might cause trouble (very rarely of course but it could.) you are using a hash function to decide if an attachment is already in the db or not. lesson one of hashing is that hashes can be compared faster than the objects, but nevertheless the best hashing function could produce similar hash results with different objects (it's called the birthday problem or something like) md5 hash spreads the hash result very good, but it is NO injective function, and therfore a bug can occure. (as said rarely but possible) to solve the problem, you have to compare the full data after selecting it with the hash comparison. |
Wow a hack from a vb dev - this is a rare and special occasion :)
This hack sounds impressive. Good job pal :)! Regards - miSt |
Quote:
@Xenon. Good point but as you said, very rare that something like you mentioned would happen. Thanks for the input. Scott |
;) welcome scott :)
you know, me, always on the search of possible problems ;) ok, the chance of two matching hashes is 1 to 3.4 * 10^38 but it is there ;) @Mist: they are just developing the german in the new vb *gg* |
Still developing vbulletin hehe ;)
This is such a good idea. I hope its in vb3? ;) - miSt |
Well if the bugs get ironed out from what Xenon pointed out, I'll most definitely install :)
|
you can install it, the chance of this bug occurs is nearly 0
if you want to be absolutley sure use instead of this: PHP Code:
PHP Code:
|
A hack from molinari is always a good one. :)
|
If you add the code Xenon pointed out, wouldn't it run the same speed as it does now?
Quote:
|
No. It is only one additional comparison that should run fast.
|
So, as the hack writer, do you recommend to add it? ;)
|
can i answer?
it's not recommended, but it prevents a bug which can occur (normally one on every million board, but a bug, so in vb3 i think this would be in..) |
Ok, then, just for curiosities sake, what would happen if the bug ever did occur? Would it only happen once in a long while and what would it do or cause?
|
If the hashes of two different attachments are the same the new attachment wouldn't make it into the database because it is assumed that it is the same file like the one that is already in the database.
So the bug would be that the wrong attachment is shown in the post. I don't think that you will encounter this bug. |
Thanks for explaining that. I will go ahead and add the original version then. Thanks! ;) And great hack, by the way. :)
|
This is bugging the heck out of me for some reason. What does this come out to?
1 to 3.4 * 10^38 I tried to do it on the windows Scientific Calculator and I only come up with something like 149.5. |
it is 0.000000000000000000000000000000000000002941 :)
|
The chances that you want to upload a file that has the same hash like a file that is already in the database are 1:340.000.000.000.000.000.000.000.000.000.000.000. 000
That means you have to upload this many files before the "bug" will occur. Theoretically. |
Quote:
By the way, how would you enter that equation on a calculator? |
*gg*
my calculator can handle exponental operations, so i can just type in 3.4 exp 38 :) ok, it's not a MS calculator, but one i bought for scholl long time ago :) |
Guess I'm going to have to get me one of those. Who knows when I'll run into this again? What do you think the equation would be on that one? LOL
|
What are the odds for it happening two times in a row for the same user? ;)
Just kidding. :) Smart hack btw. |
Is this needed, or does it help if allow duplicate images is set to "yes"?
|
I cant run this query via phpmyadmin, im getting a no SQL query error.
[sql]ALTER TABLE attachment ADD hash VARCHAR(32) DEFAULT '0' NOT NULL;[/sql] |
Something is buggy in my cpanel, i got them to run.
|
I noticed, before adding this hack, I had this line:
PHP Code:
PHP Code:
|
Yes. It is this way in the actual vBulletin.
Quote:
|
how would i go about removing this hack
after installing it, my forum has been going very very slow :( how do i remove these queries? ------------------------------------------------------------------------------- ALTER TABLE attachment ADD hash VARCHAR(32) DEFAULT '0' NOT NULL; ------------------------------------------------------------------------------- ------------------------------------------------------------------------------- UPDATE attachment SET hash = md5(filedata); ------------------------------------------------------------------------------- ------------------------------------------------------------------------------- ALTER TABLE attachment ADD INDEX(hash); ------------------------------------------------------------------------------- hope sum1 can help thanks |
lifesourcerec - The @ prevents it from outputting an error if the function fails. It was added in there in vb 2.2.7 i believe (i can remember from when i upgraded)
- miSt |
Quote:
In the table listing menu click attachment. Then in the fields table click delete in the hash row. And last click delete a little below in the index area. |
Quote:
dont get me wrong, great hack, i would just like 2 tottaly remove it now, :( |
ahh yea slowly but surely, the speed is improving. :)
|
Hi all. There is a small (embarrassing) mistake in the functions.php replacement code. :0 Please correct the following lines in functions.php.
PHP Code:
PHP Code:
Scott |
Quote:
I highly doubt that this hack is the cause of the slow down on your site. Scott |
Quote:
all is fine now though. forum is running normally again. :D |
Scott, how about the English version? ;)
|
When i apply the fix to functions.php i get this error:
PHP Code:
|
I think the code should be like this:
PHP Code:
Correct me if i am wrong Molinari... Grtz |
All times are GMT. The time now is 04:47 PM. |
Powered by vBulletin® Version 3.8.12 by vBS
Copyright ©2000 - 2025, vBulletin Solutions Inc.
X vBulletin 3.8.12 by vBS Debug Information | |
---|---|
|
|
![]() |
|
Template Usage:
Phrase Groups Available:
|
Included Files:
Hooks Called:
|