A B C D E F G H I J K L M N O P R S T U V W X _
A
- addAttribute(String, String) - Method in class org.htmlcleaner.TagNode
-
Deprecated. Use setAttribute instead. Adds the specified attribute to this tag or overrides the existing one.
- addAttributesToTag(TagNode, Map) - Method in class org.htmlcleaner.HtmlCleaner
-
Add attributes from specified map to the specified tag.
- addAttributeTransformation(String) - Method in class org.htmlcleaner.TagTransformation
-
Adds a new attribute transformation in which the destination attribute will not exist (it is simply removed from the list of attributes).
- addAttributeTransformation(String, String) - Method in class org.htmlcleaner.TagTransformation
-
Adds new attribute transformation to this tag transformation.
- addChild(Object) - Method in class org.htmlcleaner.TagNode
- addChildren(List) - Method in class org.htmlcleaner.TagNode
-
Add all elements from specified list to this node.
- addEntity(String, int) - Static method in class org.htmlcleaner.SpecialEntity
-
Add new entity to the set.
- addItemForMoving(BaseToken) - Method in class org.htmlcleaner.TagNode
- addNamespaceDeclaration(String, String) - Method in class org.htmlcleaner.TagNode
-
Adds namespace declaration to the node
- addPossibleHeadCandidate(TagInfo, TagNode, HtmlCleaner.CleanTimeValues) - Method in class org.htmlcleaner.HtmlCleaner
-
Checks if specified tag with specified info is candidate for moving to head section.
- addSavedAsContent() - Method in class org.htmlcleaner.HtmlTokenizer
- addTag(String, int) - Method in class org.htmlcleaner.HtmlCleaner.OpenTags
- addTagInfo(TagInfo) - Method in class org.htmlcleaner.DefaultTagProvider
-
Sets new tag info.
- addText(String) - Method in class org.htmlcleaner.HtmlCleanerForAnt
- addToken(BaseToken) - Method in class org.htmlcleaner.HtmlTokenizer
- addTransformation(TagTransformation) - Method in class org.htmlcleaner.CleanerTransformations
-
Adds specified tag transformation to the collection.
- advancedxmlescape - Variable in class org.htmlcleaner.HtmlCleanerForAnt
- advancedXmlEscape - Variable in class org.htmlcleaner.CleanerProperties
- allowhtmlinsideattributes - Variable in class org.htmlcleaner.HtmlCleanerForAnt
- allowHtmlInsideAttributes - Variable in class org.htmlcleaner.CleanerProperties
- allowmultiwordattributes - Variable in class org.htmlcleaner.HtmlCleanerForAnt
- allowMultiWordAttributes - Variable in class org.htmlcleaner.CleanerProperties
- allowsAnything() - Method in class org.htmlcleaner.TagInfo
- allowsBody() - Method in class org.htmlcleaner.TagInfo
- allowsItem(BaseToken) - Method in class org.htmlcleaner.TagInfo
- allTags - Variable in class org.htmlcleaner.HtmlCleaner.CleanTimeValues
- attName - Variable in class org.htmlcleaner.TagNode.TagNodeAttExistsCondition
- attName - Variable in class org.htmlcleaner.TagNode.TagNodeAttValueCondition
- attributes - Variable in class org.htmlcleaner.TagNode
- attributeTransformations - Variable in class org.htmlcleaner.TagTransformation
- attributeValue() - Method in class org.htmlcleaner.HtmlTokenizer
-
Parses a single tag attribute - it is expected to be in one of the forms: name=value, name="value", name='value', or name.
- attValue - Variable in class org.htmlcleaner.TagNode.TagNodeAttValueCondition
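The four attribute forms named under attributeValue() (name=value, name="value", name='value', and a bare name) can be illustrated with a small standalone sketch. This is not HtmlTokenizer's implementation, which this index does not show; the class AttributeFormsSketch, its parse method, and the choice to map a bare name to an empty string are all assumptions made for illustration.

```java
import java.util.LinkedHashMap;
import java.util.Map;
import java.util.regex.Matcher;
import java.util.regex.Pattern;

// Hypothetical helper, not part of org.htmlcleaner.
final class AttributeFormsSketch {
    // One attribute: a name, optionally followed by =value, where the value is
    // double-quoted (group 2), single-quoted (group 3), or unquoted (group 4).
    private static final Pattern ATTRIBUTE = Pattern.compile(
        "([^\\s=]+)(?:\\s*=\\s*(?:\"([^\"]*)\"|'([^']*)'|([^\\s\"']+)))?");

    // Returns attribute names mapped to values, in order of appearance.
    static Map<String, String> parse(String attributeText) {
        Map<String, String> result = new LinkedHashMap<>();
        Matcher m = ATTRIBUTE.matcher(attributeText);
        while (m.find()) {
            String value;
            if (m.group(2) != null) {
                value = m.group(2);          // name="value"
            } else if (m.group(3) != null) {
                value = m.group(3);          // name='value'
            } else if (m.group(4) != null) {
                value = m.group(4);          // name=value
            } else {
                value = "";                  // bare name (assumed to mean empty value)
            }
            result.put(m.group(1), value);
        }
        return result;
    }

    public static void main(String[] args) {
        System.out.println(parse("id=main class=\"wide box\" title='Home' disabled"));
    }
}
```

A real tokenizer also has to cope with malformed input (unterminated quotes, stray characters), which this sketch deliberately ignores.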
B
- BaseToken - Interface in org.htmlcleaner
-
Base token interface.
- belongsTo - Variable in class org.htmlcleaner.TagInfo
- BODY - Static variable in class org.htmlcleaner.TagInfo
- bodyNode - Variable in class org.htmlcleaner.HtmlCleaner.CleanTimeValues
- BOOL_ATT_EMPTY - Static variable in class org.htmlcleaner.CleanerProperties
- BOOL_ATT_SELF - Static variable in class org.htmlcleaner.CleanerProperties
- BOOL_ATT_TRUE - Static variable in class org.htmlcleaner.CleanerProperties
- booleanAttributeValues - Variable in class org.htmlcleaner.CleanerProperties
- booleanatts - Variable in class org.htmlcleaner.HtmlCleanerForAnt
- BrowserCompactXmlSerializer - Class in org.htmlcleaner
-
Browser compact XML serializer - creates resulting XML by stripping whitespace wherever possible, but preserving a single whitespace character where at least one exists.
- BrowserCompactXmlSerializer(CleanerProperties) - Constructor for class org.htmlcleaner.BrowserCompactXmlSerializer
C
- calculateRootNode(HtmlCleaner.CleanTimeValues) - Method in class org.htmlcleaner.HtmlCleaner
-
Assigns root node to internal variable.
- characters(char[], int, int) - Method in class org.htmlcleaner.ConfigFileTagProvider.ConfigParser
- children - Variable in class org.htmlcleaner.TagNode
- childTags - Variable in class org.htmlcleaner.TagInfo
- clean(File) - Method in class org.htmlcleaner.HtmlCleaner
- clean(File, String) - Method in class org.htmlcleaner.HtmlCleaner
- clean(InputStream) - Method in class org.htmlcleaner.HtmlCleaner
- clean(InputStream, String) - Method in class org.htmlcleaner.HtmlCleaner
- clean(Reader) - Method in class org.htmlcleaner.HtmlCleaner
- clean(Reader, HtmlCleaner.CleanTimeValues) - Method in class org.htmlcleaner.HtmlCleaner
-
Basic version of the cleaning call.
- clean(String) - Method in class org.htmlcleaner.DoctypeToken
- clean(String) - Method in class org.htmlcleaner.HtmlCleaner
- clean(URL) - Method in class org.htmlcleaner.HtmlCleaner
-
Creates instance from the content downloaded from specified URL.
- clean(URL, String) - Method in class org.htmlcleaner.HtmlCleaner
- CleanerProperties - Class in org.htmlcleaner
-
Properties defining cleaner's behaviour
- CleanerProperties() - Constructor for class org.htmlcleaner.CleanerProperties
- CleanerTransformations - Class in org.htmlcleaner
-
Contains transformation collection.
- CleanerTransformations() - Constructor for class org.htmlcleaner.CleanerTransformations
- CleanTimeValues() - Constructor for class org.htmlcleaner.HtmlCleaner.CleanTimeValues
- closeAll(List<BaseToken>, HtmlCleaner.CleanTimeValues) - Method in class org.htmlcleaner.HtmlCleaner
-
Close all unclosed tags if there are any.
- closeSnippet(List, HtmlCleaner.TagPos, Object, HtmlCleaner.CleanTimeValues) - Method in class org.htmlcleaner.HtmlCleaner
- collectNamespacePrefixesOnPath(Set<String>) - Method in class org.htmlcleaner.TagNode
-
Collect all prefixes in namespace declarations up the path to the document root from the specified node
- CommandLine - Class in org.htmlcleaner
-
Command line usage class.
- CommandLine() - Constructor for class org.htmlcleaner.CommandLine
- comment() - Method in class org.htmlcleaner.HtmlTokenizer
- CommentNode - Class in org.htmlcleaner
-
HTML comment token.
- CommentNode(String) - Constructor for class org.htmlcleaner.CommentNode
- commonStr - Variable in class org.htmlcleaner.HtmlTokenizer
- CompactHtmlSerializer - Class in org.htmlcleaner
-
Compact HTML serializer - creates resulting HTML by stripping whitespaces wherever possible.
- CompactHtmlSerializer(CleanerProperties) - Constructor for class org.htmlcleaner.CompactHtmlSerializer
- CompactXmlSerializer - Class in org.htmlcleaner
-
Compact XML serializer - creates resulting XML by stripping whitespaces.
- CompactXmlSerializer(CleanerProperties) - Constructor for class org.htmlcleaner.CompactXmlSerializer
- ConfigFileTagProvider - Class in org.htmlcleaner
-
Default tag provider - reads XML file in specified format and creates tag infos
- ConfigFileTagProvider() - Constructor for class org.htmlcleaner.ConfigFileTagProvider
- ConfigFileTagProvider(File) - Constructor for class org.htmlcleaner.ConfigFileTagProvider
- ConfigFileTagProvider(URL) - Constructor for class org.htmlcleaner.ConfigFileTagProvider
- ConfigFileTagProvider(InputSource) - Constructor for class org.htmlcleaner.ConfigFileTagProvider
- ConfigFileTagProvider.ConfigParser - Class in org.htmlcleaner
-
SAX parser for tag configuration files.
- ConfigParser(Map) - Constructor for class org.htmlcleaner.ConfigFileTagProvider.ConfigParser
- content - Variable in class org.htmlcleaner.CommentNode
- content - Variable in class org.htmlcleaner.ContentNode
- content() - Method in class org.htmlcleaner.HtmlTokenizer
- CONTENT_ALL - Static variable in class org.htmlcleaner.TagInfo
- CONTENT_NONE - Static variable in class org.htmlcleaner.TagInfo
- CONTENT_TEXT - Static variable in class org.htmlcleaner.TagInfo
- ContentNode - Class in org.htmlcleaner
-
HTML text token.
- ContentNode(char[], int) - Constructor for class org.htmlcleaner.ContentNode
- ContentNode(String) - Constructor for class org.htmlcleaner.ContentNode
- contentType - Variable in class org.htmlcleaner.TagInfo
- continueAfterTags - Variable in class org.htmlcleaner.TagInfo
- copyTags - Variable in class org.htmlcleaner.TagInfo
- createDocumentNodes(List, HtmlCleaner.CleanTimeValues) - Method in class org.htmlcleaner.HtmlCleaner
- createDOM(TagNode) - Method in class org.htmlcleaner.DomSerializer
- createElement(TagNode) - Method in class org.htmlcleaner.JDomSerializer
- createElement(TagNode, Document) - Method in class org.htmlcleaner.DomSerializer
- createJDom(TagNode) - Method in class org.htmlcleaner.JDomSerializer
- createSubnodes(Element, List) - Method in class org.htmlcleaner.JDomSerializer
- createSubnodes(Document, Element, List) - Method in class org.htmlcleaner.DomSerializer
- createTagNode(String) - Method in class org.htmlcleaner.HtmlTokenizer
- createTagNode(String, HtmlCleaner.CleanTimeValues) - Method in class org.htmlcleaner.HtmlCleaner
- createTagNode(TagNode) - Method in class org.htmlcleaner.HtmlCleaner
D
- DEFAULT_CHARSET - Static variable in class org.htmlcleaner.HtmlCleaner
- DEFAULT_INDENTATION_STRING - Static variable in class org.htmlcleaner.PrettyHtmlSerializer
- DEFAULT_INDENTATION_STRING - Static variable in class org.htmlcleaner.PrettyXmlSerializer
- DefaultTagProvider - Class in org.htmlcleaner
-
This class is automatically created from ConfigFileTagProvider which reads default XML configuration file with tag descriptions.
- DefaultTagProvider() - Constructor for class org.htmlcleaner.DefaultTagProvider
- defineAllowedChildrenTags(String) - Method in class org.htmlcleaner.TagInfo
- defineCloseBeforeCopyInsideTags(String) - Method in class org.htmlcleaner.TagInfo
- defineCloseBeforeTags(String) - Method in class org.htmlcleaner.TagInfo
- defineCloseInsideCopyAfterTags(String) - Method in class org.htmlcleaner.TagInfo
- defineFatalTags(String) - Method in class org.htmlcleaner.TagInfo
- defineForbiddenTags(String) - Method in class org.htmlcleaner.TagInfo
- defineHigherLevelTags(String) - Method in class org.htmlcleaner.TagInfo
- defineNamespaceDeclarations(TagNode, Element) - Method in class org.htmlcleaner.JDomSerializer
- defineRequiredEnclosingTags(String) - Method in class org.htmlcleaner.TagInfo
- dependencyName - Variable in class org.htmlcleaner.ConfigFileTagProvider.ConfigParser
- deprecated - Variable in class org.htmlcleaner.TagInfo
- dest - Variable in class org.htmlcleaner.HtmlCleanerForAnt
- destTag - Variable in class org.htmlcleaner.TagTransformation
- doctype() - Method in class org.htmlcleaner.HtmlTokenizer
- docType - Variable in class org.htmlcleaner.TagNode
- DoctypeToken - Class in org.htmlcleaner
-
HTML doctype token.
- DoctypeToken(String, String, String, String) - Constructor for class org.htmlcleaner.DoctypeToken
- DomSerializer - Class in org.htmlcleaner
-
DOM serializer - creates xml DOM.
- DomSerializer(CleanerProperties) - Constructor for class org.htmlcleaner.DomSerializer
- DomSerializer(CleanerProperties, boolean) - Constructor for class org.htmlcleaner.DomSerializer
- dontEscape(TagNode) - Method in class org.htmlcleaner.HtmlSerializer
- dontEscape(TagNode) - Method in class org.htmlcleaner.XmlSerializer
E
- endElement(String, String, String) - Method in class org.htmlcleaner.ConfigFileTagProvider.ConfigParser
- EndTagToken - Class in org.htmlcleaner
-
HTML tag end token.
- EndTagToken() - Constructor for class org.htmlcleaner.EndTagToken
- EndTagToken(String) - Constructor for class org.htmlcleaner.EndTagToken
- entities - Static variable in class org.htmlcleaner.SpecialEntity
- escapeText(String) - Method in class org.htmlcleaner.HtmlSerializer
- escapeXml - Variable in class org.htmlcleaner.DomSerializer
- escapeXml - Variable in class org.htmlcleaner.JDomSerializer
- escapeXml(String) - Method in class org.htmlcleaner.XmlSerializer
- escapeXml(String, CleanerProperties, boolean) - Static method in class org.htmlcleaner.Utils
-
Escapes XML string.
- evaluateAgainst(Collection, int, int, boolean, int, int, boolean, Collection) - Method in class org.htmlcleaner.XPather
- evaluateAgainstNode(TagNode) - Method in class org.htmlcleaner.XPather
-
Main public method for this class - a way to execute XPath expression against specified TagNode instance.
- evaluateFunction(Collection, int, int, int, int, boolean) - Method in class org.htmlcleaner.XPather
-
Evaluates specified function.
- evaluateLogic(Collection, Collection, String) - Method in class org.htmlcleaner.XPather
-
Evaluates logic operation on two collections.
- evaluateTemplate(String, Map) - Static method in class org.htmlcleaner.Utils
-
Evaluates string template for specified map of variables.
- evaluateXPath(String) - Method in class org.htmlcleaner.TagNode
-
Evaluates XPath expression on the given node.
- execute() - Method in class org.htmlcleaner.HtmlCleanerForAnt
-
Implementation of Ant task execution.
F
- factory - Variable in class org.htmlcleaner.JDomSerializer
- fatalTag - Variable in class org.htmlcleaner.TagInfo
- filterByCondition(Collection, int, int) - Method in class org.htmlcleaner.XPather
-
Filter nodes satisfying the condition
- findClosingIndex(int, int) - Method in class org.htmlcleaner.XPather
- findElement(TagNode.ITagNodeCondition, boolean) - Method in class org.htmlcleaner.TagNode
-
Finds the first element in the tree that satisfies the specified condition.
- findElementByAttValue(String, String, boolean, boolean) - Method in class org.htmlcleaner.TagNode
- findElementByName(String, boolean) - Method in class org.htmlcleaner.TagNode
- findElementHavingAttribute(String, boolean) - Method in class org.htmlcleaner.TagNode
- findFirstTagPos() - Method in class org.htmlcleaner.HtmlCleaner.OpenTags
- findTag(String) - Method in class org.htmlcleaner.HtmlCleaner.OpenTags
- findTagToPlaceRubbish() - Method in class org.htmlcleaner.HtmlCleaner.OpenTags
- flatten(int, int) - Method in class org.htmlcleaner.XPather
- fullUrl(String, String) - Static method in class org.htmlcleaner.Utils
-
Calculates the full URL for the specified page URL and a link, which may be full, absolute or relative, such as those found in A or IMG tags.
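The three kinds of link mentioned for fullUrl(String, String) can be illustrated with the standard java.net.URI class. This is only a stand-in that shows the expected outcome for each kind; it is not the code behind Utils.fullUrl, and the class FullUrlSketch, the example URLs, and the reading of "absolute" as a path starting at the site root are assumptions.

```java
import java.net.URI;

// Hypothetical helper, not part of org.htmlcleaner.
final class FullUrlSketch {
    // Resolves a link found on a page against that page's URL.
    static String resolve(String pageUrl, String link) {
        return URI.create(pageUrl).resolve(link.trim()).toString();
    }

    public static void main(String[] args) {
        String page = "http://example.com/docs/page.html";
        // Full URL: returned unchanged.
        System.out.println(resolve(page, "http://other.org/a.png"));
        // Absolute path: replaces the page's path.
        System.out.println(resolve(page, "/img/logo.png"));
        // Relative path: resolved against the page's directory.
        System.out.println(resolve(page, "img/logo.png"));
    }
}
```

URI.create throws IllegalArgumentException for strings that are not valid URIs, so links scraped from real pages may need cleaning first.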
G
- generateCode - Variable in class org.htmlcleaner.ConfigFileTagProvider
- getAllElements(boolean) - Method in class org.htmlcleaner.TagNode
- getAllElementsList(boolean) - Method in class org.htmlcleaner.TagNode
- getArgValue(String[], String) - Static method in class org.htmlcleaner.CommandLine
- getAsString(TagNode) - Method in class org.htmlcleaner.Serializer
- getAsString(TagNode, boolean) - Method in class org.htmlcleaner.Serializer
- getAsString(TagNode, String) - Method in class org.htmlcleaner.Serializer
- getAsString(TagNode, String, boolean) - Method in class org.htmlcleaner.Serializer
- getAttributeByName(String) - Method in class org.htmlcleaner.TagNode
- getAttributes() - Method in class org.htmlcleaner.TagNode
- getAttributeTransformations() - Method in class org.htmlcleaner.TagTransformation
- getBelongsTo() - Method in class org.htmlcleaner.TagInfo
- getBooleanAttributeValues() - Method in class org.htmlcleaner.CleanerProperties
- getCharacter() - Method in class org.htmlcleaner.SpecialEntity
- getCharsetFromContent(URL) - Static method in class org.htmlcleaner.Utils
- getCharsetFromContentTypeString(String) - Static method in class org.htmlcleaner.Utils
- getChildIndex(HtmlNode) - Method in class org.htmlcleaner.TagNode
- getChildren() - Method in class org.htmlcleaner.TagNode
- getChildTagList() - Method in class org.htmlcleaner.TagNode
- getChildTags() - Method in class org.htmlcleaner.TagInfo
- getChildTags() - Method in class org.htmlcleaner.TagNode
- getCommentedContent() - Method in class org.htmlcleaner.CommentNode
- getContent() - Method in class org.htmlcleaner.CommentNode
- getContent() - Method in class org.htmlcleaner.ContentNode
- getContent() - Method in class org.htmlcleaner.DoctypeToken
- getContentType() - Method in class org.htmlcleaner.TagInfo
- getContinueAfterTags() - Method in class org.htmlcleaner.TagInfo
- getCopyTags() - Method in class org.htmlcleaner.TagInfo
- getCurrentChar() - Method in class org.htmlcleaner.HtmlTokenizer
- getDecimalNCR() - Method in class org.htmlcleaner.SpecialEntity
- getDestTag() - Method in class org.htmlcleaner.TagTransformation
- getDocType() - Method in class org.htmlcleaner.HtmlTokenizer
- getDocType() - Method in class org.htmlcleaner.TagNode
- getElementList(TagNode.ITagNodeCondition, boolean) - Method in class org.htmlcleaner.TagNode
-
Get all elements in the tree that satisfy specified condition.
- getElementListByAttValue(String, String, boolean, boolean) - Method in class org.htmlcleaner.TagNode
- getElementListByName(String, boolean) - Method in class org.htmlcleaner.TagNode
- getElementListHavingAttribute(String, boolean) - Method in class org.htmlcleaner.TagNode
- getElements(TagNode.ITagNodeCondition, boolean) - Method in class org.htmlcleaner.TagNode
- getElementsByAttValue(String, String, boolean, boolean) - Method in class org.htmlcleaner.TagNode
- getElementsByName(String, boolean) - Method in class org.htmlcleaner.TagNode
- getElementsByName(Collection, int, int, boolean, boolean) - Method in class org.htmlcleaner.XPather
-
For the given source collection and specified name, returns collection of subnodes or attribute values.
- getElementsHavingAttribute(String, boolean) - Method in class org.htmlcleaner.TagNode
- getEntity(String) - Static method in class org.htmlcleaner.SpecialEntity
- getEscapedValue() - Method in class org.htmlcleaner.SpecialEntity
- getFatalTag() - Method in class org.htmlcleaner.TagInfo
- getHexNCR() - Method in class org.htmlcleaner.SpecialEntity
- getHigherTags() - Method in class org.htmlcleaner.TagInfo
- getHyphenReplacementInComment() - Method in class org.htmlcleaner.CleanerProperties
- getIndent(int) - Method in class org.htmlcleaner.PrettyHtmlSerializer
- getIndent(int) - Method in class org.htmlcleaner.PrettyXmlSerializer
- getIndentedText(String, int) - Method in class org.htmlcleaner.PrettyHtmlSerializer
- getIndentedText(String, int) - Method in class org.htmlcleaner.PrettyXmlSerializer
- getInnerHtml(TagNode) - Method in class org.htmlcleaner.HtmlCleaner
-
For the specified node, returns its content as a string.
- getInstance() - Static method in class org.htmlcleaner.DefaultTagProvider
- getIntCode() - Method in class org.htmlcleaner.SpecialEntity
- getItemsToMove() - Method in class org.htmlcleaner.TagNode
- getKey() - Method in class org.htmlcleaner.SpecialEntity
- getLastTagPos() - Method in class org.htmlcleaner.HtmlCleaner.OpenTags
- getMaxEntityLength() - Static method in class org.htmlcleaner.SpecialEntity
- getMustCloseTags() - Method in class org.htmlcleaner.TagInfo
- getName() - Method in class org.htmlcleaner.DoctypeToken
- getName() - Method in class org.htmlcleaner.TagInfo
- getName() - Method in class org.htmlcleaner.TagToken
- getNamespaceDeclarations() - Method in class org.htmlcleaner.TagNode
- getNamespaceURIOnPath(String) - Method in class org.htmlcleaner.TagNode
- getParent() - Method in class org.htmlcleaner.TagNode
- getPart1() - Method in class org.htmlcleaner.DoctypeToken
- getPart2() - Method in class org.htmlcleaner.DoctypeToken
- getPart3() - Method in class org.htmlcleaner.DoctypeToken
- getPart4() - Method in class org.htmlcleaner.DoctypeToken
- getPermittedTags() - Method in class org.htmlcleaner.TagInfo
- getProperties() - Method in class org.htmlcleaner.HtmlCleaner
- getPruneTags() - Method in class org.htmlcleaner.CleanerProperties
- getRequiredParent() - Method in class org.htmlcleaner.TagInfo
- getSingleLineOfChildren(List) - Method in class org.htmlcleaner.PrettyHtmlSerializer
- getSingleLineOfChildren(List) - Method in class org.htmlcleaner.PrettyXmlSerializer
- getSourceTag() - Method in class org.htmlcleaner.TagTransformation
- getTagInfo(String) - Method in class org.htmlcleaner.ConfigFileTagProvider
- getTagInfo(String) - Method in class org.htmlcleaner.DefaultTagProvider
- getTagInfo(String) - Method in interface org.htmlcleaner.ITagInfoProvider
- getTagInfoProvider() - Method in class org.htmlcleaner.CleanerProperties
- getTagInfoProvider() - Method in class org.htmlcleaner.HtmlCleaner
- getText() - Method in class org.htmlcleaner.TagNode
- getTokenList() - Method in class org.htmlcleaner.HtmlTokenizer
- getTransformation(String) - Method in class org.htmlcleaner.CleanerTransformations
- getTransformations() - Method in class org.htmlcleaner.HtmlCleaner
- getXmlAsString(TagNode) - Method in class org.htmlcleaner.XmlSerializer
-
Deprecated. Use getAsString() instead.
- getXmlAsString(TagNode, String) - Method in class org.htmlcleaner.XmlSerializer
-
Deprecated. Use getAsString() instead.
- getXmlName(String) - Static method in class org.htmlcleaner.Utils
- getXmlNSPrefix(String) - Static method in class org.htmlcleaner.Utils
- go() - Method in class org.htmlcleaner.HtmlTokenizer
- go(int) - Method in class org.htmlcleaner.HtmlTokenizer
H
- hasAttribute(String) - Method in class org.htmlcleaner.TagNode
-
Checks existence of specified attribute.
- hasAttributeTransformations() - Method in class org.htmlcleaner.TagTransformation
- hasChildren() - Method in class org.htmlcleaner.TagNode
- hasCopyTags() - Method in class org.htmlcleaner.TagInfo
- hasPermittedTags() - Method in class org.htmlcleaner.TagInfo
- hasTransformationForTag(String) - Method in class org.htmlcleaner.CleanerTransformations
- HEAD - Static variable in class org.htmlcleaner.TagInfo
- HEAD_AND_BODY - Static variable in class org.htmlcleaner.TagInfo
- HeadlessTagNode(TagNode) - Constructor for class org.htmlcleaner.Serializer.HeadlessTagNode
- headNode - Variable in class org.htmlcleaner.HtmlCleaner.CleanTimeValues
- higherTags - Variable in class org.htmlcleaner.TagInfo
- HtmlCleaner - Class in org.htmlcleaner
-
Main HtmlCleaner class.
- HtmlCleaner() - Constructor for class org.htmlcleaner.HtmlCleaner
-
Constructor - creates cleaner instance with default tag info provider and default properties.
- HtmlCleaner(CleanerProperties) - Constructor for class org.htmlcleaner.HtmlCleaner
-
Constructor - creates the instance with default tag info provider and specified properties
- HtmlCleaner(ITagInfoProvider) - Constructor for class org.htmlcleaner.HtmlCleaner
-
Constructor - creates the instance with specified tag info provider and default properties
- HtmlCleaner(ITagInfoProvider, CleanerProperties) - Constructor for class org.htmlcleaner.HtmlCleaner
-
Constructor - creates the instance with specified tag info provider and specified properties
- HtmlCleaner.CleanTimeValues - Class in org.htmlcleaner
- HtmlCleaner.OpenTags - Class in org.htmlcleaner
-
Class that contains information and methods for managing the list of open, but unhandled tags.
- HtmlCleaner.TagPos - Class in org.htmlcleaner
-
Contains information about single open tag
- HtmlCleanerException - Exception in org.htmlcleaner
-
General HtmlCleaner runtime exception.
- HtmlCleanerException() - Constructor for exception org.htmlcleaner.HtmlCleanerException
- HtmlCleanerException(String) - Constructor for exception org.htmlcleaner.HtmlCleanerException
- HtmlCleanerException(String, Throwable) - Constructor for exception org.htmlcleaner.HtmlCleanerException
- HtmlCleanerException(Throwable) - Constructor for exception org.htmlcleaner.HtmlCleanerException
- HtmlCleanerForAnt - Class in org.htmlcleaner
-
Support for ANT.
- HtmlCleanerForAnt() - Constructor for class org.htmlcleaner.HtmlCleanerForAnt
- htmlNode - Variable in class org.htmlcleaner.HtmlCleaner.CleanTimeValues
- HtmlNode - Interface in org.htmlcleaner
-
Marker interface denoting nodes of the document tree
- HtmlSerializer - Class in org.htmlcleaner
-
Abstract HTML serializer - contains common logic for descendants.
- HtmlSerializer(CleanerProperties) - Constructor for class org.htmlcleaner.HtmlSerializer
- HtmlTokenizer - Class in org.htmlcleaner
-
Main HTML tokenizer.
- HtmlTokenizer(Reader, CleanerProperties, CleanerTransformations, ITagInfoProvider) - Constructor for class org.htmlcleaner.HtmlTokenizer
-
Constructor - creates instance of the parser with specified content.
- hyphenreplacement - Variable in class org.htmlcleaner.HtmlCleanerForAnt
- hyphenReplacementInComment - Variable in class org.htmlcleaner.CleanerProperties
I
- identifier() - Method in class org.htmlcleaner.HtmlTokenizer
-
Parses an identifier from the current position.
- ignorePermitted - Variable in class org.htmlcleaner.TagInfo
- ignoreqe - Variable in class org.htmlcleaner.HtmlCleanerForAnt
- ignoreQuestAndExclam - Variable in class org.htmlcleaner.CleanerProperties
- ignoreUntil(char) - Method in class org.htmlcleaner.HtmlTokenizer
- incharset - Variable in class org.htmlcleaner.HtmlCleanerForAnt
- indents - Variable in class org.htmlcleaner.PrettyHtmlSerializer
- indents - Variable in class org.htmlcleaner.PrettyXmlSerializer
- indentString - Variable in class org.htmlcleaner.PrettyHtmlSerializer
- indentString - Variable in class org.htmlcleaner.PrettyXmlSerializer
- info - Variable in class org.htmlcleaner.HtmlCleaner.TagPos
- insertChild(int, HtmlNode) - Method in class org.htmlcleaner.TagNode
-
Inserts specified node at specified position in array of children
- insertChildAfter(HtmlNode, HtmlNode) - Method in class org.htmlcleaner.TagNode
-
Inserts specified node in the list of children after specified child
- insertChildBefore(HtmlNode, HtmlNode) - Method in class org.htmlcleaner.TagNode
-
Inserts specified node in the list of children before specified child
- intCode - Variable in class org.htmlcleaner.SpecialEntity
- isAdvancedXmlEscape() - Method in class org.htmlcleaner.CleanerProperties
- isAllowedInLastOpenTag(BaseToken, HtmlCleaner.CleanTimeValues) - Method in class org.htmlcleaner.HtmlCleaner
- isAllowHtmlInsideAttributes - Variable in class org.htmlcleaner.HtmlTokenizer
- isAllowHtmlInsideAttributes() - Method in class org.htmlcleaner.CleanerProperties
- isAllowMultiWordAttributes - Variable in class org.htmlcleaner.HtmlTokenizer
- isAllowMultiWordAttributes() - Method in class org.htmlcleaner.CleanerProperties
- isAllRead() - Method in class org.htmlcleaner.HtmlTokenizer
-
Checks if end of the content is reached.
- isAtt(String) - Method in class org.htmlcleaner.XPather
-
Checks if token is attribute (starts with @)
- isCaseSensitive - Variable in class org.htmlcleaner.TagNode.TagNodeAttValueCondition
- isChar(char) - Method in class org.htmlcleaner.HtmlTokenizer
-
Checks if character at current runtime position is equal to specified char.
- isChar(int, char) - Method in class org.htmlcleaner.HtmlTokenizer
-
Checks if character at specified position is equal to specified char.
- isCharEquals(char) - Method in class org.htmlcleaner.HtmlTokenizer
- isCharSimple(char) - Method in class org.htmlcleaner.HtmlTokenizer
- isContinueAfter(String) - Method in class org.htmlcleaner.TagInfo
- isCopy(String) - Method in class org.htmlcleaner.TagInfo
- isDeprecated() - Method in class org.htmlcleaner.TagInfo
- isEmpty() - Method in class org.htmlcleaner.HtmlCleaner.OpenTags
- isEmptyString(Object) - Static method in class org.htmlcleaner.Utils
- isEmptyTag() - Method in class org.htmlcleaner.TagInfo
- isFatalTagSatisfied(TagInfo, HtmlCleaner.CleanTimeValues) - Method in class org.htmlcleaner.HtmlCleaner
-
Checks if open fatal tag is missing if there is a fatal tag for the specified tag.
- isFormed - Variable in class org.htmlcleaner.TagNode
- isFormed() - Method in class org.htmlcleaner.TagNode
- isFullUrl(String) - Static method in class org.htmlcleaner.Utils
-
Checks if specified link is full URL.
- isFunctionCall(int, int) - Method in class org.htmlcleaner.XPather
-
Checks if tokens in the specified range represent a valid function call.
- isHeadAndBodyTag() - Method in class org.htmlcleaner.TagInfo
- isHeadTag() - Method in class org.htmlcleaner.TagInfo
- isHexadecimalDigit(char) - Static method in class org.htmlcleaner.Utils
- isHigher(String) - Method in class org.htmlcleaner.TagInfo
- isIdentifier(String) - Method in class org.htmlcleaner.XPather
-
Checks if given string is valid identifier.
- isIdentifierChar() - Method in class org.htmlcleaner.HtmlTokenizer
-
Checks if character at current runtime position can be identifier part.
- isIdentifierHelperChar(char) - Static method in class org.htmlcleaner.Utils
-
Checks if specified character can be part of an XML identifier (tag name or attribute name) and is not a standard identifier character.
- isIdentifierStartChar() - Method in class org.htmlcleaner.HtmlTokenizer
-
Checks if character at current runtime position can be identifier start.
- isIdentifierStartChar(int) - Method in class org.htmlcleaner.HtmlTokenizer
-
Checks if character at specified position can be identifier start.
- isIgnorePermitted() - Method in class org.htmlcleaner.TagInfo
- isIgnoreQuestAndExclam() - Method in class org.htmlcleaner.CleanerProperties
- isMinimizedTagSyntax(TagNode) - Method in class org.htmlcleaner.HtmlSerializer
- isMinimizedTagSyntax(TagNode) - Method in class org.htmlcleaner.XmlSerializer
- isMustCloseTag(TagInfo) - Method in class org.htmlcleaner.TagInfo
- isNamespacesAware - Variable in class org.htmlcleaner.HtmlTokenizer
- isNamespacesAware() - Method in class org.htmlcleaner.CleanerProperties
- isOmitComments - Variable in class org.htmlcleaner.HtmlTokenizer
- isOmitComments() - Method in class org.htmlcleaner.CleanerProperties
- isOmitDeprecatedTags - Variable in class org.htmlcleaner.HtmlTokenizer
- isOmitDeprecatedTags() - Method in class org.htmlcleaner.CleanerProperties
- isOmitDoctypeDeclaration() - Method in class org.htmlcleaner.CleanerProperties
- isOmitHtmlEnvelope() - Method in class org.htmlcleaner.CleanerProperties
- isOmitUnknownTags - Variable in class org.htmlcleaner.HtmlTokenizer
- isOmitUnknownTags() - Method in class org.htmlcleaner.CleanerProperties
- isOmitXmlDeclaration() - Method in class org.htmlcleaner.CleanerProperties
- isPreserveSourceAttributes() - Method in class org.htmlcleaner.TagTransformation
- isRecognizeUnicodeChars() - Method in class org.htmlcleaner.CleanerProperties
- isReservedTag(String) - Method in class org.htmlcleaner.HtmlTokenizer
-
Checks if specified tag name is one of the reserved tags: HTML, HEAD or BODY
- isReservedXmlChar(char) - Static method in class org.htmlcleaner.Utils
- isScriptOrStyle(TagNode) - Method in class org.htmlcleaner.Serializer
- isStartToken(Object) - Method in class org.htmlcleaner.HtmlCleaner
- isToken(String, int) - Method in class org.htmlcleaner.XPather
- isTranslateSpecialEntities() - Method in class org.htmlcleaner.CleanerProperties
- isTransResCharsToNCR() - Method in class org.htmlcleaner.CleanerProperties
- isTransSpecialEntitiesToNCR() - Method in class org.htmlcleaner.CleanerProperties
- isTreatDeprecatedTagsAsContent - Variable in class org.htmlcleaner.HtmlTokenizer
- isTreatDeprecatedTagsAsContent() - Method in class org.htmlcleaner.CleanerProperties
- isTreatUnknownTagsAsContent - Variable in class org.htmlcleaner.HtmlTokenizer
- isTreatUnknownTagsAsContent() - Method in class org.htmlcleaner.CleanerProperties
- isUnique() - Method in class org.htmlcleaner.TagInfo
- isUseCdataForScriptAndStyle() - Method in class org.htmlcleaner.CleanerProperties
- isUseEmptyElementTags() - Method in class org.htmlcleaner.CleanerProperties
- isValid() - Method in class org.htmlcleaner.DoctypeToken
- isValidDouble(String) - Method in class org.htmlcleaner.XPather
- isValidInt(String, int) - Static method in class org.htmlcleaner.Utils
- isValidInteger(String) - Method in class org.htmlcleaner.XPather
- isValidXmlChar() - Method in class org.htmlcleaner.HtmlTokenizer
- isValidXmlChar(char) - Static method in class org.htmlcleaner.Utils
- isValidXmlCharSafe() - Method in class org.htmlcleaner.HtmlTokenizer
- isValidXmlIdentifier(String) - Static method in class org.htmlcleaner.Utils
-
Checks whether specified string can be valid tag name or attribute name in xml.
- isWhitespace() - Method in class org.htmlcleaner.HtmlTokenizer
-
Checks if character at current runtime position is whitespace.
- isWhitespace(int) - Method in class org.htmlcleaner.HtmlTokenizer
-
Checks if character at specified position is whitespace.
- isWhitespaceSafe() - Method in class org.htmlcleaner.HtmlTokenizer
- isWhitespaceString(Object) - Static method in class org.htmlcleaner.Utils
-
Checks whether specified object's string representation is empty string (consisting only of whitespace).
- ITagInfoProvider - Interface in org.htmlcleaner
-
Provides set of TagInfo instances.
- itemsToMove - Variable in class org.htmlcleaner.TagNode
J
- JDomSerializer - Class in org.htmlcleaner
-
JDom serializer - creates xml JDom instance out of the TagNode.
- JDomSerializer(CleanerProperties) - Constructor for class org.htmlcleaner.JDomSerializer
- JDomSerializer(CleanerProperties, boolean) - Constructor for class org.htmlcleaner.JDomSerializer
K
- key - Variable in class org.htmlcleaner.SpecialEntity
L
- last - Variable in class org.htmlcleaner.HtmlCleaner.OpenTags
- list - Variable in class org.htmlcleaner.HtmlCleaner.OpenTags
- ltrim(String) - Static method in class org.htmlcleaner.Utils
-
Trims specified string from left.
M
- main(String[]) - Static method in class org.htmlcleaner.CommandLine
- main(String[]) - Static method in class org.htmlcleaner.ConfigFileTagProvider
-
Generates code for tag provider class from specified configuration XML file.
- makeCopy() - Method in class org.htmlcleaner.TagNode
- makeTagNodeCopy(TagNode, HtmlCleaner.CleanTimeValues) - Method in class org.htmlcleaner.HtmlCleaner
- makeTree(List<BaseToken>) - Method in class org.htmlcleaner.HtmlTokenizer
- makeTree(List<BaseToken>, ListIterator<BaseToken>, HtmlCleaner.CleanTimeValues) - Method in class org.htmlcleaner.HtmlCleaner
- mappings - Variable in class org.htmlcleaner.CleanerTransformations
- maxEntityLength - Static variable in class org.htmlcleaner.SpecialEntity
- mustAddRequiredParent(TagInfo, HtmlCleaner.CleanTimeValues) - Method in class org.htmlcleaner.HtmlCleaner
-
Check if specified tag requires parent tag, but that parent tag is missing in the appropriate context.
- mustCloseTags - Variable in class org.htmlcleaner.TagInfo
N
- name - Variable in class org.htmlcleaner.HtmlCleaner.TagPos
- name - Variable in class org.htmlcleaner.TagInfo
- name - Variable in class org.htmlcleaner.TagNode.TagNodeNameCondition
- name - Variable in class org.htmlcleaner.TagToken
- namespacesaware - Variable in class org.htmlcleaner.HtmlCleanerForAnt
- namespacesAware - Variable in class org.htmlcleaner.CleanerProperties
- nodebyxpath - Variable in class org.htmlcleaner.HtmlCleanerForAnt
- nsDeclarations - Variable in class org.htmlcleaner.TagNode
O
- omitcomments - Variable in class org.htmlcleaner.HtmlCleanerForAnt
- omitComments - Variable in class org.htmlcleaner.CleanerProperties
- omitDeprecatedTags - Variable in class org.htmlcleaner.CleanerProperties
- omitdeprtags - Variable in class org.htmlcleaner.HtmlCleanerForAnt
- omitdoctypedecl - Variable in class org.htmlcleaner.HtmlCleanerForAnt
- omitDoctypeDeclaration - Variable in class org.htmlcleaner.CleanerProperties
- omitenvelope - Variable in class org.htmlcleaner.HtmlCleanerForAnt
- omithtmlenvelope - Variable in class org.htmlcleaner.HtmlCleanerForAnt
- omitHtmlEnvelope - Variable in class org.htmlcleaner.CleanerProperties
- omitunknowntags - Variable in class org.htmlcleaner.HtmlCleanerForAnt
- omitUnknownTags - Variable in class org.htmlcleaner.CleanerProperties
- omitxmldecl - Variable in class org.htmlcleaner.HtmlCleanerForAnt
- omitXmlDeclaration - Variable in class org.htmlcleaner.CleanerProperties
- openPreTags - Variable in class org.htmlcleaner.CompactHtmlSerializer
- OpenTags() - Constructor for class org.htmlcleaner.HtmlCleaner.OpenTags
- org.htmlcleaner - package org.htmlcleaner
- outcharset - Variable in class org.htmlcleaner.HtmlCleanerForAnt
- outputtype - Variable in class org.htmlcleaner.HtmlCleanerForAnt
P
- parent - Variable in class org.htmlcleaner.TagNode
- parse(InputSource) - Method in class org.htmlcleaner.ConfigFileTagProvider.ConfigParser
- parserFactory - Static variable in class org.htmlcleaner.ConfigFileTagProvider
- part1 - Variable in class org.htmlcleaner.DoctypeToken
- part2 - Variable in class org.htmlcleaner.DoctypeToken
- part3 - Variable in class org.htmlcleaner.DoctypeToken
- part4 - Variable in class org.htmlcleaner.DoctypeToken
- permittedTags - Variable in class org.htmlcleaner.TagInfo
- position - Variable in class org.htmlcleaner.HtmlCleaner.TagPos
- preserveSourceAttributes - Variable in class org.htmlcleaner.TagTransformation
- PrettyHtmlSerializer - Class in org.htmlcleaner
-
Pretty HTML serializer - creates resulting HTML with indenting lines.
- PrettyHtmlSerializer(CleanerProperties) - Constructor for class org.htmlcleaner.PrettyHtmlSerializer
- PrettyHtmlSerializer(CleanerProperties, String) - Constructor for class org.htmlcleaner.PrettyHtmlSerializer
- PrettyXmlSerializer - Class in org.htmlcleaner
-
Pretty XML serializer - creates resulting XML with indenting lines.
- PrettyXmlSerializer(CleanerProperties) - Constructor for class org.htmlcleaner.PrettyXmlSerializer
- PrettyXmlSerializer(CleanerProperties, String) - Constructor for class org.htmlcleaner.PrettyXmlSerializer
- properties - Variable in class org.htmlcleaner.HtmlCleaner
- props - Variable in class org.htmlcleaner.DomSerializer
- props - Variable in class org.htmlcleaner.HtmlTokenizer
- props - Variable in class org.htmlcleaner.JDomSerializer
- props - Variable in class org.htmlcleaner.Serializer
- pruneNodeSet - Variable in class org.htmlcleaner.HtmlCleaner.CleanTimeValues
- prunetags - Variable in class org.htmlcleaner.HtmlCleanerForAnt
- pruneTags - Variable in class org.htmlcleaner.CleanerProperties
- pruneTagSet - Variable in class org.htmlcleaner.HtmlCleaner.CleanTimeValues
R
- readIfNeeded(int) - Method in class org.htmlcleaner.HtmlTokenizer
- recognizeUnicodeChars - Variable in class org.htmlcleaner.CleanerProperties
- removeAllChildren() - Method in class org.htmlcleaner.TagNode
-
Removes all children (subelements and text content).
- removeAttribute(String) - Method in class org.htmlcleaner.TagNode
-
Removes specified attribute from this tag.
- removeChild(Object) - Method in class org.htmlcleaner.TagNode
-
Remove specified child element from this node.
- removeFromTree() - Method in class org.htmlcleaner.TagNode
-
Remove this node from the tree.
- removeTag(String) - Method in class org.htmlcleaner.HtmlCleaner.OpenTags
- removeTagInfo(String) - Method in class org.htmlcleaner.DefaultTagProvider
-
Removes tag info with specified name.
- replaceChild(HtmlNode, HtmlNode) - Method in class org.htmlcleaner.TagNode
-
Replaces specified child node with specified replacement node.
- requiredParent - Variable in class org.htmlcleaner.TagInfo
- RESERVED_XML_CHARS - Static variable in class org.htmlcleaner.Utils
- rootNode - Variable in class org.htmlcleaner.HtmlCleaner.CleanTimeValues
- rtrim(String) - Static method in class org.htmlcleaner.Utils
-
Trims specified string from right.
S
- satisfy(TagNode) - Method in interface org.htmlcleaner.TagNode.ITagNodeCondition
- satisfy(TagNode) - Method in class org.htmlcleaner.TagNode.TagAllCondition
- satisfy(TagNode) - Method in class org.htmlcleaner.TagNode.TagNodeAttExistsCondition
- satisfy(TagNode) - Method in class org.htmlcleaner.TagNode.TagNodeAttValueCondition
- satisfy(TagNode) - Method in class org.htmlcleaner.TagNode.TagNodeNameCondition
- save(char) - Method in class org.htmlcleaner.HtmlTokenizer
-
Saves specified character to the temporary buffer.
- saveCurrent() - Method in class org.htmlcleaner.HtmlTokenizer
-
Saves character at current runtime position to the temporary buffer.
- saveCurrent(int) - Method in class org.htmlcleaner.HtmlTokenizer
-
Saves specified number of characters at current runtime position to the temporary buffer.
- saveCurrentSafe() - Method in class org.htmlcleaner.HtmlTokenizer
- saveToLastOpenTag(List, BaseToken, HtmlCleaner.CleanTimeValues) - Method in class org.htmlcleaner.HtmlCleaner
- serialize(Serializer, Writer) - Method in interface org.htmlcleaner.BaseToken
- serialize(Serializer, Writer) - Method in class org.htmlcleaner.CommentNode
- serialize(Serializer, Writer) - Method in class org.htmlcleaner.ContentNode
- serialize(Serializer, Writer) - Method in class org.htmlcleaner.DoctypeToken
- serialize(Serializer, Writer) - Method in class org.htmlcleaner.EndTagToken
- serialize(Serializer, Writer) - Method in class org.htmlcleaner.TagNode
- serialize(TagNode, Writer) - Method in class org.htmlcleaner.BrowserCompactXmlSerializer
- serialize(TagNode, Writer) - Method in class org.htmlcleaner.CompactHtmlSerializer
- serialize(TagNode, Writer) - Method in class org.htmlcleaner.CompactXmlSerializer
- serialize(TagNode, Writer) - Method in class org.htmlcleaner.PrettyHtmlSerializer
- serialize(TagNode, Writer) - Method in class org.htmlcleaner.PrettyXmlSerializer
- serialize(TagNode, Writer) - Method in class org.htmlcleaner.Serializer
- serialize(TagNode, Writer) - Method in class org.htmlcleaner.SimpleHtmlSerializer
- serialize(TagNode, Writer) - Method in class org.htmlcleaner.SimpleXmlSerializer
- serializeEndTag(TagNode, Writer, boolean) - Method in class org.htmlcleaner.HtmlSerializer
- serializeEndTag(TagNode, Writer, boolean) - Method in class org.htmlcleaner.XmlSerializer
- serializeOpenTag(TagNode, Writer, boolean) - Method in class org.htmlcleaner.HtmlSerializer
- serializeOpenTag(TagNode, Writer, boolean) - Method in class org.htmlcleaner.XmlSerializer
- serializePrettyHtml(TagNode, Writer, int, boolean, boolean) - Method in class org.htmlcleaner.PrettyHtmlSerializer
- serializePrettyXml(TagNode, Writer, int) - Method in class org.htmlcleaner.PrettyXmlSerializer
- Serializer - Class in org.htmlcleaner
-
Basic abstract serializer - contains common logic for descendants (methods writeXXX()).
- Serializer(CleanerProperties) - Constructor for class org.htmlcleaner.Serializer
- Serializer.HeadlessTagNode - Class in org.htmlcleaner
-
Used to implement serialization with missing envelope - omitting open and close tags, just serialize children.
- set - Variable in class org.htmlcleaner.HtmlCleaner.OpenTags
- setAdvancedxmlescape(boolean) - Method in class org.htmlcleaner.HtmlCleanerForAnt
- setAdvancedXmlEscape(boolean) - Method in class org.htmlcleaner.CleanerProperties
- setAllowhtmlinsideattributes(boolean) - Method in class org.htmlcleaner.HtmlCleanerForAnt
- setAllowHtmlInsideAttributes(boolean) - Method in class org.htmlcleaner.CleanerProperties
- setAllowmultiwordattributes(boolean) - Method in class org.htmlcleaner.HtmlCleanerForAnt
- setAllowMultiWordAttributes(boolean) - Method in class org.htmlcleaner.CleanerProperties
- setAttribute(String, String) - Method in class org.htmlcleaner.EndTagToken
- setAttribute(String, String) - Method in class org.htmlcleaner.TagNode
-
Adds new attribute or overrides existing one.
- setAttribute(String, String) - Method in class org.htmlcleaner.TagToken
- setAttributes(TagNode, Element) - Method in class org.htmlcleaner.JDomSerializer
- setAttributes(TagNode, Element) - Method in class org.htmlcleaner.DomSerializer
- setBelongsTo(int) - Method in class org.htmlcleaner.TagInfo
- setBooleanAttributeValues(String) - Method in class org.htmlcleaner.CleanerProperties
- setBooleanatts(String) - Method in class org.htmlcleaner.HtmlCleanerForAnt
- setChildren(List) - Method in class org.htmlcleaner.TagNode
- setChildTags(Set) - Method in class org.htmlcleaner.TagInfo
- setContinueAfterTags(Set) - Method in class org.htmlcleaner.TagInfo
- setCopyTags(Set) - Method in class org.htmlcleaner.TagInfo
- setDeprecated(boolean) - Method in class org.htmlcleaner.TagInfo
- setDest(String) - Method in class org.htmlcleaner.HtmlCleanerForAnt
- setDocType(DoctypeToken) - Method in class org.htmlcleaner.TagNode
- setFatalTag(String) - Method in class org.htmlcleaner.TagInfo
- setFormed() - Method in class org.htmlcleaner.TagNode
- setFormed(boolean) - Method in class org.htmlcleaner.TagNode
- setHigherTags(Set) - Method in class org.htmlcleaner.TagInfo
- setHyphenreplacement(String) - Method in class org.htmlcleaner.HtmlCleanerForAnt
- setHyphenReplacementInComment(String) - Method in class org.htmlcleaner.CleanerProperties
- setIgnorePermitted(boolean) - Method in class org.htmlcleaner.TagInfo
- setIgnoreqe(boolean) - Method in class org.htmlcleaner.HtmlCleanerForAnt
- setIgnoreQuestAndExclam(boolean) - Method in class org.htmlcleaner.CleanerProperties
- setIncharset(String) - Method in class org.htmlcleaner.HtmlCleanerForAnt
- setInnerHtml(TagNode, String) - Method in class org.htmlcleaner.HtmlCleaner
-
For the specified tag node, defines its html content.
- setItemsToMove(List<BaseToken>) - Method in class org.htmlcleaner.TagNode
- setMustCloseTags(Set) - Method in class org.htmlcleaner.TagInfo
- setName(String) - Method in class org.htmlcleaner.TagInfo
- setName(String) - Method in class org.htmlcleaner.TagNode
-
Changes name of the tag
- setNamespacesaware(boolean) - Method in class org.htmlcleaner.HtmlCleanerForAnt
- setNamespacesAware(boolean) - Method in class org.htmlcleaner.CleanerProperties
- setNodebyxpath(String) - Method in class org.htmlcleaner.HtmlCleanerForAnt
- setOmitcomments(boolean) - Method in class org.htmlcleaner.HtmlCleanerForAnt
- setOmitComments(boolean) - Method in class org.htmlcleaner.CleanerProperties
- setOmitDeprecatedTags(boolean) - Method in class org.htmlcleaner.CleanerProperties
- setOmitdeprtags(boolean) - Method in class org.htmlcleaner.HtmlCleanerForAnt
- setOmitdoctypedecl(boolean) - Method in class org.htmlcleaner.HtmlCleanerForAnt
- setOmitDoctypeDeclaration(boolean) - Method in class org.htmlcleaner.CleanerProperties
- setOmitenvelope(boolean) - Method in class org.htmlcleaner.HtmlCleanerForAnt
- setOmithtmlenvelope(boolean) - Method in class org.htmlcleaner.HtmlCleanerForAnt
- setOmitHtmlEnvelope(boolean) - Method in class org.htmlcleaner.CleanerProperties
- setOmitunknowntags(boolean) - Method in class org.htmlcleaner.HtmlCleanerForAnt
- setOmitUnknownTags(boolean) - Method in class org.htmlcleaner.CleanerProperties
- setOmitxmldecl(boolean) - Method in class org.htmlcleaner.HtmlCleanerForAnt
- setOmitXmlDeclaration(boolean) - Method in class org.htmlcleaner.CleanerProperties
- setOutcharset(String) - Method in class org.htmlcleaner.HtmlCleanerForAnt
- setOutputtype(String) - Method in class org.htmlcleaner.HtmlCleanerForAnt
- setPermittedTags(Set) - Method in class org.htmlcleaner.TagInfo
- setPrunetags(String) - Method in class org.htmlcleaner.HtmlCleanerForAnt
- setPruneTags(String) - Method in class org.htmlcleaner.CleanerProperties
- setPruneTags(String, HtmlCleaner.CleanTimeValues) - Method in class org.htmlcleaner.HtmlCleaner
- setRecognizeUnicodeChars(boolean) - Method in class org.htmlcleaner.CleanerProperties
- setRequiredParent(String) - Method in class org.htmlcleaner.TagInfo
- setSpecialentities(boolean) - Method in class org.htmlcleaner.HtmlCleanerForAnt
- setSrc(String) - Method in class org.htmlcleaner.HtmlCleanerForAnt
- setTaginfofile(String) - Method in class org.htmlcleaner.HtmlCleanerForAnt
- setText(String) - Method in class org.htmlcleaner.HtmlCleanerForAnt
- setTransform(String) - Method in class org.htmlcleaner.HtmlCleanerForAnt
- setTransformations(CleanerTransformations) - Method in class org.htmlcleaner.HtmlCleaner
-
Sets transformations for this cleaner instance.
- setTranslateSpecialEntities(boolean) - Method in class org.htmlcleaner.CleanerProperties
- setTransrescharstoncr(boolean) - Method in class org.htmlcleaner.HtmlCleanerForAnt
- setTransResCharsToNCR(boolean) - Method in class org.htmlcleaner.CleanerProperties
- setTransspecialentitiestoncr(boolean) - Method in class org.htmlcleaner.HtmlCleanerForAnt
- setTransSpecialEntitiesToNCR(boolean) - Method in class org.htmlcleaner.CleanerProperties
- setTreatDeprecatedTagsAsContent(boolean) - Method in class org.htmlcleaner.CleanerProperties
- setTreatdeprtagsascontent(boolean) - Method in class org.htmlcleaner.HtmlCleanerForAnt
- setTreatunknowntagsascontent(boolean) - Method in class org.htmlcleaner.HtmlCleanerForAnt
- setTreatUnknownTagsAsContent(boolean) - Method in class org.htmlcleaner.CleanerProperties
- setUnicodechars(boolean) - Method in class org.htmlcleaner.HtmlCleanerForAnt
- setUnique(boolean) - Method in class org.htmlcleaner.TagInfo
- setUsecdata(boolean) - Method in class org.htmlcleaner.HtmlCleanerForAnt
- setUseCdataForScriptAndStyle(boolean) - Method in class org.htmlcleaner.CleanerProperties
- setUseemptyelementtags(boolean) - Method in class org.htmlcleaner.HtmlCleanerForAnt
- setUseEmptyElementTags(boolean) - Method in class org.htmlcleaner.CleanerProperties
- SimpleHtmlSerializer - Class in org.htmlcleaner
-
Simple HTML serializer - creates resulting HTML without indenting and/or compacting.
- SimpleHtmlSerializer(CleanerProperties) - Constructor for class org.htmlcleaner.SimpleHtmlSerializer
- SimpleXmlSerializer - Class in org.htmlcleaner
-
Simple XML serializer - creates resulting XML without indenting lines.
- SimpleXmlSerializer(CleanerProperties) - Constructor for class org.htmlcleaner.SimpleXmlSerializer
- singleton(Object) - Method in class org.htmlcleaner.XPather
-
Creates one-element collection for the specified object.
- skipWhitespaces() - Method in class org.htmlcleaner.HtmlTokenizer
-
Skips whitespaces at current position and moves forward until non-whitespace character is found or the end of content is reached.
- someAlreadyOpen(Set) - Method in class org.htmlcleaner.HtmlCleaner.OpenTags
-
Checks if any of tags specified in the set are already open.
- sourceTag - Variable in class org.htmlcleaner.TagTransformation
- specialentities - Variable in class org.htmlcleaner.HtmlCleanerForAnt
- SpecialEntity - Class in org.htmlcleaner
-
This class contains map with special entities used in HTML and their Unicode code points.
- SpecialEntity(String, int) - Constructor for class org.htmlcleaner.SpecialEntity
- src - Variable in class org.htmlcleaner.HtmlCleanerForAnt
- start() - Method in class org.htmlcleaner.HtmlTokenizer
-
Starts parsing HTML.
- startElement(String, String, String, Attributes) - Method in class org.htmlcleaner.ConfigFileTagProvider.ConfigParser
- startsWith(String) - Method in class org.htmlcleaner.HtmlTokenizer
-
Checks if content starts with specified value at the current position.
- startsWithSimple(String) - Method in class org.htmlcleaner.HtmlTokenizer
T
- TagAllCondition() - Constructor for class org.htmlcleaner.TagNode.TagAllCondition
- tagAttributes() - Method in class org.htmlcleaner.HtmlTokenizer
-
Parses list of tag attributes from the current position.
- tagEncountered(String) - Method in class org.htmlcleaner.HtmlCleaner.OpenTags
- tagEnd() - Method in class org.htmlcleaner.HtmlTokenizer
-
Parses end of the tag.
- tagExists(String) - Method in class org.htmlcleaner.HtmlCleaner.OpenTags
- tagInfo - Variable in class org.htmlcleaner.ConfigFileTagProvider.ConfigParser
- TagInfo - Class in org.htmlcleaner
-
Class contains information about single HTML tag.
It also contains rules for tag balancing.
- TagInfo(String, int, int, boolean, boolean, boolean) - Constructor for class org.htmlcleaner.TagInfo
- taginfofile - Variable in class org.htmlcleaner.HtmlCleanerForAnt
- tagInfoMap - Variable in class org.htmlcleaner.ConfigFileTagProvider.ConfigParser
- tagInfoProvider - Variable in class org.htmlcleaner.CleanerProperties
- tagInfoProvider - Variable in class org.htmlcleaner.HtmlCleaner
- tagInfoProvider - Variable in class org.htmlcleaner.HtmlTokenizer
- TagNode - Class in org.htmlcleaner
-
XML node tag - basic node of the cleaned HTML tree.
- TagNode(String) - Constructor for class org.htmlcleaner.TagNode
- TagNode.ITagNodeCondition - Interface in org.htmlcleaner
-
Used as base for different node checkers.
- TagNode.TagAllCondition - Class in org.htmlcleaner
-
All nodes.
- TagNode.TagNodeAttExistsCondition - Class in org.htmlcleaner
-
Checks if node contains specified attribute.
- TagNode.TagNodeAttValueCondition - Class in org.htmlcleaner
-
Checks if node has specified attribute with specified value.
- TagNode.TagNodeNameCondition - Class in org.htmlcleaner
-
Checks if node has specified name.
- TagNodeAttExistsCondition(String) - Constructor for class org.htmlcleaner.TagNode.TagNodeAttExistsCondition
- TagNodeAttValueCondition(String, String, boolean) - Constructor for class org.htmlcleaner.TagNode.TagNodeAttValueCondition
- TagNodeNameCondition(String) - Constructor for class org.htmlcleaner.TagNode.TagNodeNameCondition
- TagNodeVisitor - Interface in org.htmlcleaner
-
Defines action to be performed on TagNodes
- TagPos(int, String) - Constructor for class org.htmlcleaner.HtmlCleaner.TagPos
- tagStart() - Method in class org.htmlcleaner.HtmlTokenizer
-
Parses start of the tag.
- TagToken - Class in org.htmlcleaner
-
HTML tag token - descendants are start (TagNode) and end token (EndTagToken).
- TagToken() - Constructor for class org.htmlcleaner.TagToken
- TagToken(String) - Constructor for class org.htmlcleaner.TagToken
- TagTransformation - Class in org.htmlcleaner
-
Describes how specified tag is transformed to another one, or is ignored during parsing
- TagTransformation(String) - Constructor for class org.htmlcleaner.TagTransformation
-
Creates new tag transformation in which specified tag will be skipped (ignored) during parsing process.
- TagTransformation(String, String) - Constructor for class org.htmlcleaner.TagTransformation
-
Creates new tag transformation from source tag to target tag preserving all source tag attributes.
- TagTransformation(String, String, boolean) - Constructor for class org.htmlcleaner.TagTransformation
-
Creates new tag transformation from source tag to target tag specifying whether source tag attributes are preserved.
- text - Variable in class org.htmlcleaner.HtmlCleanerForAnt
- throwStandardException() - Method in class org.htmlcleaner.XPather
- toBoolean(String) - Static method in class org.htmlcleaner.CommandLine
- tokenArray - Variable in class org.htmlcleaner.XPather
- tokenize(String, String) - Static method in class org.htmlcleaner.Utils
- toString() - Method in class org.htmlcleaner.CommentNode
- toString() - Method in class org.htmlcleaner.ContentNode
- toString() - Method in class org.htmlcleaner.DoctypeToken
- toString() - Method in class org.htmlcleaner.TagToken
- toText(Object) - Method in class org.htmlcleaner.XPather
- transform - Variable in class org.htmlcleaner.HtmlCleanerForAnt
- transformations - Variable in class org.htmlcleaner.HtmlCleaner
- transformations - Variable in class org.htmlcleaner.HtmlTokenizer
- transformAttributes(TagTransformation) - Method in class org.htmlcleaner.TagNode
- translateSpecialEntities - Variable in class org.htmlcleaner.CleanerProperties
- transrescharstoncr - Variable in class org.htmlcleaner.HtmlCleanerForAnt
- transResCharsToNCR - Variable in class org.htmlcleaner.CleanerProperties
- transspecialentitiestoncr - Variable in class org.htmlcleaner.HtmlCleanerForAnt
- transSpecialEntitiesToNCR - Variable in class org.htmlcleaner.CleanerProperties
- traverse(TagNodeVisitor) - Method in class org.htmlcleaner.TagNode
-
Traverses the tree and performs visitor's action on each node.
- traverseInternally(TagNodeVisitor) - Method in class org.htmlcleaner.TagNode
- treatDeprecatedTagsAsContent - Variable in class org.htmlcleaner.CleanerProperties
- treatdeprtagsascontent - Variable in class org.htmlcleaner.HtmlCleanerForAnt
- treatunknowntagsascontent - Variable in class org.htmlcleaner.HtmlCleanerForAnt
- treatUnknownTagsAsContent - Variable in class org.htmlcleaner.CleanerProperties
U
- unicodechars - Variable in class org.htmlcleaner.HtmlCleanerForAnt
- unique - Variable in class org.htmlcleaner.TagInfo
- updateTagTransformations(CleanerTransformations, String, String) - Static method in class org.htmlcleaner.Utils
- usecdata - Variable in class org.htmlcleaner.HtmlCleanerForAnt
- useCdataForScriptAndStyle - Variable in class org.htmlcleaner.CleanerProperties
- useemptyelementtags - Variable in class org.htmlcleaner.HtmlCleanerForAnt
- useEmptyElementTags - Variable in class org.htmlcleaner.CleanerProperties
- Utils - Class in org.htmlcleaner
-
Common utilities.
- Utils() - Constructor for class org.htmlcleaner.Utils
V
- VAR_END - Static variable in class org.htmlcleaner.Utils
- VAR_START - Static variable in class org.htmlcleaner.Utils
- visit(TagNode, HtmlNode) - Method in interface org.htmlcleaner.TagNodeVisitor
-
Action to be performed on single node in the tree
W
- WORKING_BUFFER_SIZE - Static variable in class org.htmlcleaner.HtmlTokenizer
- write(TagNode, Writer, String) - Method in class org.htmlcleaner.Serializer
-
Writes specified node using specified writer.
- write(TagNode, Writer, String, boolean) - Method in class org.htmlcleaner.Serializer
-
Writes specified node using specified writer.
- writeToFile(TagNode, String) - Method in class org.htmlcleaner.Serializer
-
Writes specified TagNode to the file, using system default charset.
- writeToFile(TagNode, String, boolean) - Method in class org.htmlcleaner.Serializer
-
Writes specified TagNode to the file, using system default charset and optionally omits node envelope (skips open and close tags of the node).
- writeToFile(TagNode, String, String) - Method in class org.htmlcleaner.Serializer
-
Writes specified TagNode to the file, using specified charset.
- writeToFile(TagNode, String, String, boolean) - Method in class org.htmlcleaner.Serializer
-
Writes specified TagNode to the file, using specified charset and optionally omits node envelope (skips open and close tags of the node).
- writeToStream(TagNode, OutputStream) - Method in class org.htmlcleaner.Serializer
-
Writes specified TagNode to the output stream, using system default charset.
- writeToStream(TagNode, OutputStream, boolean) - Method in class org.htmlcleaner.Serializer
-
Writes specified TagNode to the output stream, using system default charset and optionally omits node envelope (skips open and close tags of the node).
- writeToStream(TagNode, OutputStream, String) - Method in class org.htmlcleaner.Serializer
-
Writes specified TagNode to the output stream, using specified charset.
- writeToStream(TagNode, OutputStream, String, boolean) - Method in class org.htmlcleaner.Serializer
-
Writes specified TagNode to the output stream, using specified charset and optionally omits node envelope (skips open and close tags of the node).
- writeXml(TagNode, Writer, String) - Method in class org.htmlcleaner.XmlSerializer
-
Deprecated. Use write() instead.
- writeXmlToFile(TagNode, String) - Method in class org.htmlcleaner.XmlSerializer
-
Deprecated. Use writeToFile() instead.
- writeXmlToFile(TagNode, String, String) - Method in class org.htmlcleaner.XmlSerializer
-
Deprecated. Use writeToFile() instead.
- writeXmlToStream(TagNode, OutputStream) - Method in class org.htmlcleaner.XmlSerializer
-
Deprecated. Use writeToStream() instead.
- writeXmlToStream(TagNode, OutputStream, String) - Method in class org.htmlcleaner.XmlSerializer
-
Deprecated. Use writeToStream() instead.
X
- XmlSerializer - Class in org.htmlcleaner
-
Abstract XML serializer - contains common logic for descendants.
- XmlSerializer(CleanerProperties) - Constructor for class org.htmlcleaner.XmlSerializer
- XPather - Class in org.htmlcleaner
-
Utility for searching cleaned document tree with XPath expressions.
- XPather(String) - Constructor for class org.htmlcleaner.XPather
-
Constructor - creates XPather instance with specified XPath expression.
- XPatherException - Exception in org.htmlcleaner
-
Exception that could occur during XPather evaluation.
- XPatherException() - Constructor for exception org.htmlcleaner.XPatherException
- XPatherException(String) - Constructor for exception org.htmlcleaner.XPatherException
- XPatherException(String, Throwable) - Constructor for exception org.htmlcleaner.XPatherException
- XPatherException(Throwable) - Constructor for exception org.htmlcleaner.XPatherException
_
- _asExpected - Variable in class org.htmlcleaner.HtmlTokenizer
- _bodyOpened - Variable in class org.htmlcleaner.HtmlCleaner.CleanTimeValues
- _currentTagToken - Variable in class org.htmlcleaner.HtmlTokenizer
- _docType - Variable in class org.htmlcleaner.HtmlTokenizer
- _headOpened - Variable in class org.htmlcleaner.HtmlCleaner.CleanTimeValues
- _headTags - Variable in class org.htmlcleaner.HtmlCleaner.CleanTimeValues
- _instance - Static variable in class org.htmlcleaner.DefaultTagProvider
- _isScriptContext - Variable in class org.htmlcleaner.HtmlTokenizer
- _len - Variable in class org.htmlcleaner.HtmlTokenizer
- _openTags - Variable in class org.htmlcleaner.HtmlCleaner.CleanTimeValues
- _pos - Variable in class org.htmlcleaner.HtmlTokenizer
- _reader - Variable in class org.htmlcleaner.HtmlTokenizer
- _saved - Variable in class org.htmlcleaner.HtmlTokenizer
- _savedLen - Variable in class org.htmlcleaner.HtmlTokenizer
- _tokenList - Variable in class org.htmlcleaner.HtmlTokenizer
- _working - Variable in class org.htmlcleaner.HtmlTokenizer