public class HTMLDocument extends AbstractIndexableDocument implements ParsableDocument
AbstractIndexableDocument.StoreHandlerSdxObject.ConfigurationNode| Modifier and Type | Field and Description |
|---|---|
protected java.io.File |
tidyConf |
_boost, _currentFieldBoost, _msgHandler, _xmlFieldList, _xmlFieldOutput, _xmlFieldTransformer, attachedDocuments, characterBuffer, contentHandler, currentFieldName, DOC_ATTACHEDOC_ELEMENT_NAME, DOC_FIELD_ELEMENT_NAME, DOC_MSG_ELEMENT_NAME, DOC_NAMESPACE, DOC_ROOT_ELEMENT_NAME, lexicalHandler, nsTable, openSdxDocElems, properties, storeHandler, subDoc, subDocBytes, subDocuments, transformedDoc, withinSdxElement, withinXmlField, xmlConsumeridGenerator, idPrefix, idSuffix, mimeType, storeRepo_configuration, _context, _description, _encoding, _id, _locale, _logger, _manager, _xmlizable_objects, _xmlLang, isToSaxInitializedCLASS_NAME_SUFFIX, DOCTYPE_BINARY, DOCTYPE_GROUP, DOCTYPE_HTML, DOCTYPE_USER, DOCTYPE_XMLDEFAULT_ENCODING| Constructor and Description |
|---|
HTMLDocument()
Creates an HTML document.
|
HTMLDocument(java.lang.String id)
Creates an HTML document given an id.
|
| Modifier and Type | Method and Description |
|---|---|
void |
addAdditionalSystemFields(org.apache.lucene.document.Document doc)
Some additional system fields adding to the Lucene document
|
java.lang.String |
getDocType()
Gets the docType for the document
|
java.lang.String |
getMimeType()
Returns the mimeType field (A String) for this document
|
void |
parse(org.apache.excalibur.xml.sax.SAXParser parser)
Parses a document using the previously supplied consumer.
|
void |
parse(org.apache.excalibur.xml.sax.SAXParser parser,
org.apache.cocoon.xml.XMLConsumer consumer)
Parses a document using a specific consumer.
|
void |
setTidyConfiguration(java.io.File tidyConf) |
void |
setTransformedDocument(byte[] content)
Set's the transformed document for the parent document.
|
void |
setTransformedDocument(java.io.File file)
Set's the transformed document for the parent document.
|
void |
startIndexing(org.apache.excalibur.xml.sax.SAXParser parser,
org.apache.cocoon.xml.XMLConsumer consumer)
Starts the indexing process.
|
addAttachedDocument, characters, comment, endCDATA, endDocument, endDTD, endElement, endEntity, endPrefixMapping, generateId, getAttachedDocuments, getAttachedDocumentsSize, getBoost, getFieldValues, getStoreHandler, getSubDocuments, getSubDocumentsSize, getTransformedDocument, handleDocumentId, ignorableWhitespace, processingInstruction, resetAttachedDocuments, resetFields, setAttachedDocuments, setBoost, setConsumer, setContentHandler, setDocumentLocator, setLexicalHandler, setMessageHandler, setUpdateAttachedDocuments, setUpTransformedDocument, setXMLFieldList, setXMLTransformerHandler, skippedEntity, startCDATA, startDocument, startDTD, startElement, startEntity, startPrefixMapping, updateAttachedDocumentsgetClassNameSuffix, getInputSource, getLength, getPreferredFilename, getRepositoryForStorage, getURL, initToSax, initVolatileObjectsToSax, openStream, save, setContent, setContent, setContent, setContent, setIdGenerator, setIdGenerator, setMimeType, setPreferredFilename, setPreferredFilename, setRepositoryForStorage, setURLconfigure, configureDescription, contextualize, enableLogging, getBaseAttributes, getConfiguration, getContext, getDescription, getEncoding, getId, getLocale, getLog, getServiceManager, getXmlLang, service, setDescription, setEncoding, setId, setLocale, setUpSdxObject, setUpSdxObject, setXmlLang, toSAX, verifyConfigurationResourcesclone, equals, finalize, getClass, hashCode, notify, notifyAll, toString, wait, wait, waitgetLength, getPreferredFilename, getRepositoryForStorage, getURL, openStream, save, setContent, setContent, setContent, setContent, setId, setIdGenerator, setIdGenerator, setMimeType, setPreferredFilename, setRepositoryForStorage, setURLgetIdgetDescription, setDescriptiongetEncoding, setEncodinggetLocale, getXmlLang, setLocale, setXmlLangpublic HTMLDocument(java.lang.String id)
throws SDXException
id - The document's id.
If logging is desired the super.getLog() should be set after creation.SDXExceptionAbstractSdxObject.enableLogging(org.apache.avalon.framework.logger.Logger)public HTMLDocument()
public void startIndexing(org.apache.excalibur.xml.sax.SAXParser parser,
org.apache.cocoon.xml.XMLConsumer consumer)
throws SDXException
startIndexing in interface IndexableDocumentparser - The parser to useconsumer - The consumer for the events generated by the indexing processSDXExceptionpublic void parse(org.apache.excalibur.xml.sax.SAXParser parser)
throws SDXException
parse in interface ParsableDocumentparser - The parser to use.SDXExceptionpublic void parse(org.apache.excalibur.xml.sax.SAXParser parser,
org.apache.cocoon.xml.XMLConsumer consumer)
throws SDXException
parse in interface ParsableDocumentparser - The parser to useconsumer - The consumer of the events generated by the parseSDXExceptionpublic java.lang.String getDocType()
getDocType in interface Documentpublic void setTransformedDocument(byte[] content)
throws SDXException
setTransformedDocument in interface IndexableDocumentcontent - The byte array of dataSDXExceptionpublic void setTransformedDocument(java.io.File file)
throws SDXException
setTransformedDocument in interface IndexableDocumentfile - The transformed document fileSDXExceptionpublic java.lang.String getMimeType()
getMimeType in interface DocumentgetMimeType in class AbstractDocumentpublic void setTidyConfiguration(java.io.File tidyConf)
public void addAdditionalSystemFields(org.apache.lucene.document.Document doc)
addAdditionalSystemFields in interface IndexableDocumentCopyright © 2000-2010 Ministere de la culture et de la communication / AJLSM. All Rights Reserved.