Remove HTML from CNN RSS Feeds


JackThomson120
03-03-2009, 12:44 AM
I have tried editing the files that deal with RSS, for example changing &lt; to < and &gt; to > and then removing any HTML. I have also tried making it output the content as HTML without removing anything.

But there seems to be a function somewhere in there that replaces the tags with blank space.

Does anyone know how I could do this?

Example of CNN RSS:
An American journalist reported by her father as being detained in Iran had been working there without a permit, Iran's Foreign Ministry said Monday. Saberi's father, who lives in North Dakota, told CNN that his daughter called him from jail February 10 and told him she'd bought a bottle of wine for a friend's birthday. "Authorities told her the person who sold her the wine turned her in," Reza Saberi said.&lt;div class="feedflare"&gt;
&lt;a href="http://rss.cnn.com/~ff/rss/cnn_topstories?a=h9RrtQSNRPA:Qt7zAGctjf8:yIl2AUoC8zA"&gt;&lt;img src="http://feeds2.feedburner.com/~ff/rss/cnn_topstories?d=yIl2AUoC8zA" border="0"&gt;&lt;/img&gt;&lt;/a&gt; &lt;a href="http://rss.cnn.com/~ff/rss/cnn_topstories?a=h9RrtQSNRPA:Qt7zAGctjf8:7Q72WNTAKBA"&gt;&lt;img src="http://feeds2.feedburner.com/~ff/rss/cnn_topstories?d=7Q72WNTAKBA" border="0"&gt;&lt;/img&gt;&lt;/a&gt; &lt;a href="http://rss.cnn.com/~ff/rss/cnn_topstories?a=h9RrtQSNRPA:Qt7zAGctjf8:V_sGLiPBpWU"&gt;&lt;img src="http://feeds2.feedburner.com/~ff/rss/cnn_topstories?i=h9RrtQSNRPA:Qt7zAGctjf8:V_sGLiPBpWU" border="0"&gt;&lt;/img&gt;&lt;/a&gt; &lt;a href="http://rss.cnn.com/~ff/rss/cnn_topstories?a=h9RrtQSNRPA:Qt7zAGctjf8:qj6IDK7rITs"&gt;&lt;img src="http://feeds2.feedburner.com/~ff/rss/cnn_topstories?d=qj6IDK7rITs" border="0"&gt;&lt;/img&gt;&lt;/a&gt; &lt;a href="http://rss.cnn.com/~ff/rss/cnn_topstories?a=h9RrtQSNRPA:Qt7zAGctjf8:gIN9vFwOqvQ"&gt;&lt;img src="http://feeds2.feedburner.com/~ff/rss/cnn_topstories?i=h9RrtQSNRPA:Qt7zAGctjf8:gIN9vFwOqvQ" border="0"&gt;&lt;/img&gt;&lt;/a&gt;

&lt;/div&gt;&lt;img src="http://feeds2.feedburner.com/~r/rss/cnn_topstories/~4/h9RrtQSNRPA" height="1" width="1"/&gt;

Thanks in advance :)

--------------- Added 1236050784 at 1236050784 ---------------

I found one way around it: instead of editing vBulletin files, create a PHP file that cleans up the RSS feed before it is imported into vBulletin.

rss.php
<?php
// Fetch the feed named in ?rss=..., strip the escaped FeedBurner markup, and re-serve it.
$url = isset($_GET['rss']) ? $_GET['rss'] : '';
// Only fetch http(s) URLs so the script cannot be used to read local files.
if (!preg_match('#^https?://#i', $url)) {
    exit('Invalid feed URL');
}
$rss = file_get_contents($url);
if ($rss === false) {
    exit('Unable to fetch feed');
}
// The feed carries its HTML entity-escaped, so the patterns match &lt; and &gt;.
$patterns = array(
    '/&lt;a href="(.*?)"&gt;/',
    '/&lt;img src="(.*?)" border="0"&gt;/',
    '/&lt;div class="feedflare"&gt;/',
    '/&lt;\/img&gt;/',
    '/&lt;\/a&gt;/',
    '/&lt;\/div&gt;/',
    '/&lt;img src="(.*?)" height="1" width="1"\/&gt;/',
);
$rss = preg_replace($patterns, '', $rss);

echo $rss;
?>

Then, for the Feed URL, enter one of the following:
rss.php?rss=http://rss.cnn.com/rss/cnn_latest.rss
or
rss.php?rss=http://rss.cnn.com/rss/cnn_topstories.rss
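
The preg_replace patterns in rss.php only match the exact markup CNN happened to use at the time. As a hedged alternative, here is a minimal sketch that lets PHP's DOM extension decode each description and then runs strip_tags() over it. The function name clean_feed_descriptions and the sample feed are made up for illustration; this is not part of vBulletin, and it assumes the feed is well-formed XML and that the DOM extension is enabled.

```php
<?php
// Hypothetical helper: not part of vBulletin or the original rss.php.
function clean_feed_descriptions($xml)
{
    $doc = new DOMDocument();
    // Assumption: the feed is well-formed XML; loadXML() returns false otherwise.
    if (!$doc->loadXML($xml)) {
        return false;
    }
    foreach ($doc->getElementsByTagName('description') as $node) {
        // textContent is already entity-decoded, so &lt;div&gt; arrives here as <div>.
        $plain = trim(strip_tags($node->textContent));
        // Swap the element's children for one text node; DOM re-escapes it on output.
        while ($node->firstChild) {
            $node->removeChild($node->firstChild);
        }
        $node->appendChild($doc->createTextNode($plain));
    }
    return $doc->saveXML();
}

// Made-up sample in the shape of the CNN item quoted above.
$sample = '<rss version="2.0"><channel><item><title>Example</title>'
    . '<description>Reza Saberi said.&lt;div class="feedflare"&gt;'
    . '&lt;a href="http://example.com/a"&gt;&lt;img src="http://example.com/i" border="0"&gt;'
    . '&lt;/img&gt;&lt;/a&gt;&lt;/div&gt;</description></item></channel></rss>';

echo clean_feed_descriptions($sample);
```

One caveat: strip_tags() treats any "<" as the start of a tag, so a description containing a literal less-than sign could lose some text.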

vAsia
05-31-2011, 03:08 AM
I followed the instructions and created rss.php in the /admincp folder. When I tried to open http://mydomain.com/admincp/rss.php?rss=http://rss.cnn.com/rss/cnn_latest.rss, I got this error message:

------------------------------
Forbidden

You don't have permission to access /rss.php on this server.

Additionally, a 404 Not Found error was encountered while trying to use an ErrorDocument to handle the request.
-------------------------------

I then tried chmod 744, 755, and 777 on the rss.php file, but that didn't work.

Any clue?

BirdOPrey5
05-31-2011, 07:17 PM
rss.php should probably be in your forum root directory. Also, for the Feed URL, put in the full URL, like:
http://yourforum.com/forum/rss.php?rss=http://www.cnn.com/rss/
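
Because the feed address is itself a URL passed inside another URL, it may be safer to URL-encode it. A minimal sketch, assuming the rss.php posted earlier (which reads the rss query parameter); the helper name wrapper_feed_url and the yourforum.com address are placeholders, not anything vBulletin provides:

```php
<?php
// Hypothetical helper; the forum address below is a placeholder.
function wrapper_feed_url($scriptUrl, $feedUrl)
{
    // urlencode() keeps the feed's own ":", "/", "?" and "&" from being
    // read as part of the outer URL. PHP decodes $_GET values automatically.
    return $scriptUrl . '?rss=' . urlencode($feedUrl);
}

echo wrapper_feed_url(
    'http://yourforum.com/forum/rss.php',
    'http://rss.cnn.com/rss/cnn_topstories.rss'
);
// http://yourforum.com/forum/rss.php?rss=http%3A%2F%2Frss.cnn.com%2Frss%2Fcnn_topstories.rss
```

Whether vBulletin's RSS feed manager stores the encoded form unchanged is an assumption worth checking on your own install.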

weindians
07-04-2011, 10:33 PM
I'm getting this error when putting that rss.php?rss= in front of the feed URL...


Warning: fsockopen() [function.fsockopen]: php_network_getaddresses: getaddrinfo failed: Name or service not known in [path]/includes/class_rss_poster.php on line 98

Warning: fsockopen() [function.fsockopen]: unable to connect to rss.php?rss=http:80 (php_network_getaddresses: getaddrinfo failed: Name or service not known) in [path]/includes/class_rss_poster.php on line 98

Warning: Unable to fetch RSS Feed in [path]/includes/class_rss_poster.php on line 231
mathrubhumi news Unable to open URL or no data returned
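
The host in the second warning is "rss.php?rss=http", which suggests the Feed URL was entered as a relative path, so there was no domain name to resolve. A minimal sketch of a check for that, using a made-up function name (is_absolute_feed_url is not a vBulletin function) and a placeholder forum address:

```php
<?php
// Hypothetical check; not part of vBulletin.
function is_absolute_feed_url($url)
{
    // Require an explicit http:// or https:// scheme followed by a host name.
    return preg_match('~^https?://[^/?#\s]+~i', $url) === 1;
}

// Relative form, as in the warnings above: no scheme or host at the front.
var_dump(is_absolute_feed_url('rss.php?rss=http://rss.cnn.com/rss/cnn_latest.rss')); // bool(false)

// Full form, with the forum's own domain in front of rss.php.
var_dump(is_absolute_feed_url('http://yourforum.com/forum/rss.php?rss=http://rss.cnn.com/rss/cnn_latest.rss')); // bool(true)
```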