vb.org Archive

vb.org Archive (https://vborg.vbsupport.ru/index.php)
-   vBulletin 3.6 Add-ons (https://vborg.vbsupport.ru/forumdisplay.php?f=194)
-   -   Administrative and Maintenance Tools - Link / URL Status Checker (https://vborg.vbsupport.ru/showthread.php?t=134208)

bairy 12-16-2006 10:00 PM

Link / URL Status Checker
 
What does this do?
For anyone who has seen or got my Image Status Checker hack, this is exactly the same, but works on links ([url=], [url]) instead.

Basically it scans all your posts, extracts all the url tags, and scans each of the images to see if they're still valid.

The rest of this is basically the description of that hack reworded


Why?
I had a look at all the links on my site and was alarmed at how many were now gone. Since the only way you can check the links on your board is to manually read every post and click them, I decided to come up with a better way... and this is it.


How does it work?
The first part: In the AdminCP, under Maintenance and Update Counters... right at the bottom is this hack. It works by looking up every url tag, then requesting the image, and reading the http status code. So code 200 means 'image ok', 404/410 means 'image gone' etc. That then gets stored in a database table. A server has 15 seconds to reply to the request or the status is labelled as "Unknown"
The second part: The browsing element, linkstatuscheck.php (original filename huh!). This allows you to browse all the images found in the last scan using some powerful filtering (statuses to display, search, order by).


Hack features
  • General
  • Fully phrased.
  • Templates are grouped. Who's online handled.
  • Part 1 - Admin
  • Reads the post table, scans all the [url=] and [url] tags on demand and records the actual http status code returned.
  • If it gets stuck during the scan, you can restart the section it's currently doing.
  • If an link appears in more than one post, it's only checked once.
  • Start from, per page and timeout options for scanning.
  • Part 2 - Browser
  • Status codes are put into one of three descriptions for simplicity: Working, Dead, Unknown. Unknown is if the server didn't respond or similar - on the basis that a temporary timeout doesn't necessarily mean the image has gone.
  • In the browser, link urls are force wrapped. Unless people post using all caps, you have a low screen resolution, or the font size is big, the table should never stretch.
  • Filtering allows you to show just the working/dead/unknown images, and there's a search facility for a variety of fields.
  • Convenient link to edit the post (if a dead link is found). This works by can_moderate - edit links only appear for people who own the post, or can moderate the forum it's in.
  • Works by canview - if someone can't view a particular forum (e.g. staff forum) normally, they can't view the images within it.
  • Uses css for common stuff to reduce the size of the outputted pages.


Bad Things
It's far from a perfect hack, there are many things to do. Please be aware that I won't be doing them, but if anyone else wants a crack, feel free!
  • Only supports http://, not https://
  • Can only handle replies like: HTTP 1.x 200 as the first line.
  • Only supports [url] and [url=] tags. If you have HTML turned on in any forums it won't see <a href=> links.
  • Biggie: There's no way to update a single post or link without a full re-scan. That means if someone edits their post to update or remove a dead link, it will not change on the browser until a full re-scan is done. I did play with various update methods but most are flawed in one way or another.
  • No cron job.
  • No session variables. (People without cookies will be logged out a lot).


Footnotes
The code to the Image Status Check hack is very similar, so I pretty much copy pasted and adapted it. It is a little bigger due to handling the [url=] "option", but you may find references to images. Let me know if you do.


It will work on 3.6 and 3.5, though you'll need to remove the "executionorder=" from the .xml file to get it working on 3.5


Installation
Upload linkstatuscheck.php to your vB directory. Install the product, set overwrite to yes.


Customizing
  • By default it's set to only allow moderators, super-moderators and administrators to view the browser. This can be changed with the setting in AdminCP > vB Options.
  • The phrases all start with usc_ if you want to change them.
  • You can add a link to linkstatuschecker.php on the navbar (or anywhere) if you want your members to be able to view it.


Screenies
Exactly the same as the Image Status Checker, only with URL links rather than image links.
What? I'm feeling lazy.

bashy 12-17-2006 06:30 PM

Hi

Great idea thanks, but in the Search for text area, what do i actually input there?

I cracked it now, Ya have to run the checker in the Maintenabce section in the update countser section :)
Then you use the browser part, is that correct?

Veer 12-17-2006 06:31 PM

Thanks, :)
Its Very Useful for Me, Because My Site Have Lots n Lots of Images, Shared by Members.

Question / Suggestion: (for future update)
Is there any chance to check image / link status automatically and if a image link is dead then system auto send a PM to user who has the post with dead link?

Thanks again.

projectego 12-17-2006 06:36 PM

Thanks, this sounds like a very helpful add-on! ;)

bashy 12-17-2006 06:40 PM

looks like all my links are of unknown any ideas?

bairy 12-17-2006 06:49 PM

Quote:

Originally Posted by bashy (Post 1140483)
I cracked it now, Ya have to run the checker in the Maintenabce section in the update countser section :)
Then you use the browser part, is that correct?

Yes, maintainence first, then the browser.


Quote:

Originally Posted by Pehchaan (Post 1140484)
Question / Suggestion: (for future update)
Is there any chance to check image / link status automatically and if a image link is dead then system auto send a PM to user who has the post with dead link?

There probably won't be a future update. It was really just a hack to fill a need, rather than something I plan to develop further.
Besides, auto PMing would add a lot of strain to the server, and people can just look at the linkstatuscheck.php if they want to see which of their posts are in trouble.


Quote:

Originally Posted by bashy (Post 1140491)
looks like all my links are of unknown any ideas?

Well, assuming you know at least one of them actually works, I'd guess the server can't communicate with the remote sites. Perhaps fopen is disabled in php?

bashy 12-17-2006 07:08 PM

allow_url_fopen is on just checked

Quote:

Originally Posted by bairy (Post 1140499)
Well, assuming you know at least one of them actually works, I'd guess the server can't communicate with the remote sites. Perhaps fopen is disabled in php?


bashy 12-17-2006 07:09 PM

out of 3000+ links 3 are dead about 5 or 6 are working and the rest unknown!!

bairy 12-17-2006 07:11 PM

I'm afraid I don't know then, I don't really know php's local/remote file commands all that well.
I can tell you it uses fsockopen to ping the server, then fwrite, fread to communicate.

bashy 12-17-2006 07:14 PM

Well, i just installed ya image one as well and get the same, 99% of unknown status!
What does duplicate / dealt with mean? i get a lot of them
Also in the update counters, the 1st page seems to work well, all the rest give the duplicate / dealt


All times are GMT. The time now is 08:04 AM.

Powered by vBulletin® Version 3.8.12 by vBS
Copyright ©2000 - 2025, vBulletin Solutions Inc.

X vBulletin 3.8.12 by vBS Debug Information
  • Page Generation 0.01136 seconds
  • Memory Usage 1,754KB
  • Queries Executed 10 (?)
More Information
Template Usage:
  • (1)ad_footer_end
  • (1)ad_footer_start
  • (1)ad_header_end
  • (1)ad_header_logo
  • (1)ad_navbar_below
  • (4)bbcode_quote_printable
  • (1)footer
  • (1)gobutton
  • (1)header
  • (1)headinclude
  • (6)option
  • (1)pagenav
  • (1)pagenav_curpage
  • (2)pagenav_pagelink
  • (1)post_thanks_navbar_search
  • (1)printthread
  • (10)printthreadbit
  • (1)spacer_close
  • (1)spacer_open 

Phrase Groups Available:
  • global
  • postbit
  • showthread
Included Files:
  • ./printthread.php
  • ./global.php
  • ./includes/init.php
  • ./includes/class_core.php
  • ./includes/config.php
  • ./includes/functions.php
  • ./includes/class_hook.php
  • ./includes/modsystem_functions.php
  • ./includes/class_bbcode_alt.php
  • ./includes/class_bbcode.php
  • ./includes/functions_bigthree.php 

Hooks Called:
  • init_startup
  • init_startup_session_setup_start
  • init_startup_session_setup_complete
  • cache_permissions
  • fetch_threadinfo_query
  • fetch_threadinfo
  • fetch_foruminfo
  • style_fetch
  • cache_templates
  • global_start
  • parse_templates
  • global_setup_complete
  • printthread_start
  • pagenav_page
  • pagenav_complete
  • bbcode_fetch_tags
  • bbcode_create
  • bbcode_parse_start
  • bbcode_parse_complete_precache
  • bbcode_parse_complete
  • printthread_post
  • printthread_complete