|
With RSS Feed Filters you can extract the article from the HTML page
containing the article.
Different providers require different filters, in fact what My Web
News is doing is extracting the contents between a start and a end tag.
These tags are unique strings inside the source of the HTML of the page
containing the article.
My Web News uses 3 parts of the HTML document, a Title, a SubTitle and
the Article. Another thing is that some providers use relative
addressing for images and some use absolute addressing. Since My Web
News first retrieves the article and then show it locally, relative
addressing will not show the images, unless the image addresses are
transformed to absolute addresses.
Values in the Filters.INI file should be enclosed between quotes.
Everything within the outside quotes is part of the value, even if its
another quot.
The filters are stored in the Filters.INI file and contain two basic
sections.
The [Main] section:
Count=Number - gives the number of filters.
Provider?=URL - gives the main URL of the Provider, where
the question mark is a sequential number, starting at 1.
The [Provider?] section:
Title=YourTitle - gives a Title which will be on the page
before the title extracted from the HTML document.
TitleStart=StartTag - unique string in the HTML document just
before the title string, you can use the HTML document title which comes
after "<title>".
TitleEnd=EndTag - unique string in the HTML document just after
the title string, you can use the HTML document title which ends with "</title>".
SubTitle=YourSubTitle - gives a SubTitle which will be on the
page before the subtitle extracvted from the HTML document.
SubTitleStart-StartTag - unique string in the HTML document just
before the subtitle string, you can use the HTML document meta commands
to find a suited subtitle like "<meta name="Section"
content="".
SubTitleEnd=EndTag - unique string in the HTML document just after
the subtitle string, you can use the HTML document meta commands to find
a suited subtitle which usually ends with a quot.
Article=YourArticle - gives a Title to the article as it appears
in the IFrame window.
ArticleStart=StartTag - unique string in the HTML document just
before the article string, this is different for each provider.
ArticleEnd=EndTag - unique string in the HTML document just after
the article string, this is different for each provider.
ReplaceSource=RelativeImageURL - relative path as used in the "<img
src=" tag.
ReplaceWith=AbsoluteImageURL - absolute path to replace the
relative path.
See the Filters.INI file for examples.
|
[Main]
Count=5
Provider1="http://news.bbc.co.uk"
Provider2="http://www.elsevier.nl"
Provider3="http://www.reuters.com"
Provider4="http://www.nieuws.nl"
Provider5="http://www.cnn.com"
[http://news.bbc.co.uk]
Title="Title: "
TitleStart="<meta name="Headline" content=""
TitleEnd="""
SubTitle="SubTitle: "
SubTitleStart="<meta name="Section" content=""
SubTitleEnd="""
Article="Article:"
ArticleStart="<!-- S BO -->"
ArticleEnd="<!-- E BO -->"
ReplaceSource="src="/nol"
ReplaceWith="src="http://newsimg.bbc.co.uk/nol"
[http://www.elsevier.nl]
Title="Title: "
TitleStart="<title>"
TitleEnd="</title>"
SubTitle="SubTitle: "
SubTitleStart="<p class="pubdatum">"
SubTitleEnd="<br/>"
Article="Artikel:"
ArticleStart="<!-- BEGIN ARTIKEL -->"
ArticleEnd="<p class="
ReplaceSource="src="/artimg"
ReplaceWith="src="http://www.elsevier.nl/artimg"
[http://www.reuters.com]
Title="Title: "
TitleStart="<title>"
TitleEnd="</title>"
SubTitle="SubTitle: "
SubTitleStart="<meta name="description" content=""
SubTitleEnd="">"
Article="Article:"
ArticleStart="<!-- article text begins -->"
ArticleEnd="<!-- article text ends -->"
ReplaceSource=""
ReplaceWith=""
[http://www.nieuws.nl]
Title="Title: "
TitleStart="<title>"
TitleEnd="</title>"
SubTitle="SubTitle: "
SubTitleStart="CONTENT=""
SubTitleEnd="">"
Article="Artikel:"
ArticleStart="<div id="content">"
ArticleEnd="</div>"
ReplaceSource="src="/news"
ReplaceWith="src="http://www.nieuws.nl/news"
[http://www.cnn.com]
Title="Title: "
TitleStart="<meta name="SECTION" content=""
TitleEnd="""
SubTitle="SubTitle: "
SubTitleStart="<title>"
SubTitleEnd="</title>"
Article="Article:"
ArticleStart="<!--===============Paste story between here=================-->"
ArticleEnd="<!--===============Paste story between here=================-->"
ReplaceSource=""
ReplaceWith=""
|
|
Please visit www.samconsult.biz for more information.
|