CMS Detectors
'''CMS Detectors''' help Yioop find the most important content on a web page. <br /><br /> The '''Name''' is required. The Header Regex and Important Content XPath are optional, but the detector has no effect if they are not entered. <br /> The '''Header Regex''' is used to detect the CMS. Sites built with the same CMS tend to have very similar headers, so a carefully crafted regular expression can detect the CMS you are looking for. The regex is checked against the href value of a link tag with rel='stylesheet' or the src value of a script tag with type='text/javascript'. <br /><br /> The '''Important Content XPath''' targets the most important content for summarizing. The first entry selects the important content. Each subsequent entry removes content from within that selection. Separate the entries with three pound signs (###). <br /> '''Example:''' <br /><br /> <table border='1'> <tr><th>Setting</th> <th>Value</th></tr> <tr><td>Name</td><td>Wordpress</td></tr> <tr><td>Header Regex </td><td>wp-(?:content|includes)</td></tr> <tr><td>Important Content XPath</td><td>//div[@id="content"]###<br />//div[@id="comments"]###<br />//div[@id="respond"]</td></tr> </table> <br />
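The following is a minimal Python sketch of how the two settings in the example might be applied to a page. It is not Yioop's implementation: the function names `matches_header_regex` and `important_text`, the sample page, and the use of Python's `xml.etree.ElementTree` are all assumptions made for illustration. ElementTree needs well-formed markup and only supports relative paths, so the sketch assumes each XPath starts with `//` and prepends a `.` to it.

```python
import re
import xml.etree.ElementTree as ET

# Hypothetical, well-formed sample page used only for this illustration.
SAMPLE_PAGE = """<html>
  <head>
    <link rel="stylesheet" href="/wp-content/themes/demo/style.css" />
    <script type="text/javascript" src="/wp-includes/js/jquery.js"></script>
  </head>
  <body>
    <div id="sidebar">Archives</div>
    <div id="content">
      <p>Main article text.</p>
      <div id="comments">First!</div>
      <div id="respond">Leave a reply</div>
    </div>
  </body>
</html>"""


def matches_header_regex(root, header_regex):
    """Return True if any stylesheet href or script src matches the regex."""
    pattern = re.compile(header_regex)
    urls = [el.get("href", "") for el in root.iter("link")
            if el.get("rel") == "stylesheet"]
    urls += [el.get("src", "") for el in root.iter("script")
             if el.get("type") == "text/javascript"]
    return any(pattern.search(url) for url in urls)


def important_text(root, xpath_setting):
    """Select content with the first XPath, then strip the remaining ones."""
    target, *removals = [part.strip() for part in xpath_setting.split("###")]
    # Assumption: every entry starts with "//"; "." makes it relative.
    content = root.find("." + target)
    if content is None:
        return None
    for removal in removals:
        # ElementTree has no parent links, so build a child-to-parent map.
        parents = {child: parent
                   for parent in content.iter() for child in parent}
        for node in content.findall("." + removal):
            parents[node].remove(node)
    # Collapse whitespace in the text that is left.
    return " ".join("".join(content.itertext()).split())


root = ET.fromstring(SAMPLE_PAGE)
setting = '//div[@id="content"]###//div[@id="comments"]###//div[@id="respond"]'
if matches_header_regex(root, r"wp-(?:content|includes)"):
    print(important_text(root, setting))  # prints "Main article text."
```

In this sketch the sidebar is ignored because it lies outside the first XPath, and the comments and reply form are dropped by the two removal entries.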