HTML parser
===========

Tag parser
----------

The tag parser understands the following tag notations:

1) < ... parameter=value ... >
2) < ... parameter="value" ... >
3) < ... parameter='value' ... >

Special characters
------------------

The indexer understands the following special HTML characters:

1) &lt; &gt; &amp; &nbsp; &quot;
2) All SGML ISO-8859-1 entities: &auml; &uuml; and others.
3) Characters in their numeric code notation: &#234;

Meta tags
---------

The indexer's HTML parser currently understands the following META tags.
Note that "HTTP-EQUIV" may be used instead of "NAME" in all entries.

1) <META NAME="Content-Type" CONTENT="text/html; charset=...">
   This is used to determine the character set if it is not specified
   in the "Content-type" HTTP header.
2) <META NAME="Refresh" CONTENT="...; URL=...">
   The URL value will be inserted in the database.
3)
4)
5) <META NAME="Robots" CONTENT="...">
   with content value ALL, NONE, INDEX, NOINDEX, FOLLOW, NOFOLLOW.

Links
-----

The HTML parser understands the following links:

1)
2)
3)
4)
5)
6) <BASE HREF="...">
   If the BASE HREF value is an illegally formed URL, the URL of the
   current document is used instead to compose relative links.

Comments
--------

1) Text inside <!-- ... --> is considered an HTML comment.
2) You may use special .... comment tags to exclude the text between
   them from indexing. This may be useful to hide things like menus
   from indexing.
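
The three attribute notations listed under "Tag parser" and the entity forms listed under "Special characters" can be illustrated with a small sketch. This is not the indexer's own code: the helper name parse_attributes and the regular expression are assumptions made for illustration, and entity decoding is delegated to Python's standard html.unescape rather than to whatever table the indexer uses internally.

```python
import html
import re

# Hypothetical pattern (an assumption, not taken from the indexer):
# matches parameter=value, parameter="value" and parameter='value'.
ATTR_RE = re.compile(
    r"""([A-Za-z][-\w:]*)\s*=\s*(?:"([^"]*)"|'([^']*)'|([^\s>]+))"""
)


def parse_attributes(tag):
    """Return the attributes of one tag as {lowercased name: decoded value}."""
    attrs = {}
    for name, double_quoted, single_quoted, bare in ATTR_RE.findall(tag):
        # At most one of the three value groups is non-empty per match;
        # all three are empty only when the value is an empty quoted string.
        value = double_quoted or single_quoted or bare
        # Decode named entities (&amp;, &auml;) and numeric ones (&#234;).
        attrs[name.lower()] = html.unescape(value)
    return attrs


if __name__ == "__main__":
    print(parse_attributes('<META NAME=Robots CONTENT="NOINDEX, NOFOLLOW">'))
    print(parse_attributes("<A HREF='page.html?a=1&amp;b=2'>"))
    print(html.unescape("&lt; &auml; &#234;"))
```

Attribute names are lowercased here so that NAME, Name and name compare equal; whether the indexer normalizes case the same way is not stated above and is assumed only for the sketch.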