【文件属性】:
文件名称:DIHtmlParser v7.9.0 for D10.1 Berlin
文件大小:19.64MB
文件格式:RAR
更新时间:2019-09-03 16:05:33
HtmlParser
The HTML pieces are:
CData Sections: CData Sections, found in XML, are used to escape blocks of text containing characters which would otherwise be recognized as markup. A CData section begins with .
Comments: The Comments' contents are returned readily stripped of the comment markers. A comment starts with .
Document Type Definitions: A Document Type Definition defines the syntax of markup constructs. It begins with .
HTML Processing Instructions: HTML Processing Instructions are a mechanism to capture platform-specific idioms. They start with and end with >.
HTML-Tags: HTML-Tags are readily parsed into Name, Attributes and Values. DIHtmlParser recognizes Start Tags, End Tags and Empty Element Tags. Example: .
Scripts: DIHtmlParser returns the contents between the tags as simple text. The surrounding HTML tags are reported separately.
Styles: DIHtmlParser returns the contents between the tags as simple text. The surrounding HTML tags are reported separately.
Text: Text is everything which is not markup. If the NormalizeWhiteSpace option is enabled, DIHtmlParser reduces multiple white space to a single character. Preformatted text wrapped by
and
is never normalized.
Titles: DIHtmlParser returns the contents between the
and tags as simple text. Titles are not normal text because they are parsed differently.
XML Processing Instructions: XML Processing Instructions are similar to the HTML Processing Instructions with a slightly different syntax: They begin with .
The Non-HTML pieces are:
Active Server Pages (ASP): Active Server Page markup is often used to enclose scripting macros. It begins with <% and runs up to %>.
Custom-Tags: Custom Tags are similar to HTML-Tags and to what Delphi's Help calls Transparent Tags. For DIHtmlParser, a Custom-Tags' name must begin with a user-define start character just as #like in <#Name Attribute=“Value” />.
PHP: PHP is a powerful and popular scripting language. Its markup begins with .
Server Side Includes (SSI): SSI, an extension of the Apache Web Server, starts with . It allows to insert include files and other data into HTML documents on the fly.
Parsing Efficiency