com.mkdoc.jspider
Class XhtmlTripleWriterPlugin

java.lang.Object
  |
  +--com.mkdoc.jspider.AbstractMKDocPlugin
        |
        +--com.mkdoc.jspider.AbstractRdfPlugin
              |
              +--com.mkdoc.jspider.AbstractFileStoreManagerPlugin
                    |
                    +--com.mkdoc.jspider.XhtmlTripleWriterPlugin
All Implemented Interfaces:
net.javacoding.jspider.api.event.EventSink, net.javacoding.jspider.spi.Plugin

public class XhtmlTripleWriterPlugin
extends AbstractFileStoreManagerPlugin
implements net.javacoding.jspider.spi.Plugin

A generic HTML metadata triple writer plugin.

Creates a set of N-Triple documents that mirrors the HTML URL structure of a Web site. Current version does not escape high range characters per the N-Triple specification.

manager
The fully qualified class name of a com.mkdoc.store.FileStoreManager implementation, required. Should normally be com.mkdoc.store.TripleFileStoreManager.
output.absolute
Whether the output directory configuration specifies an absolute directory path. If not set, a relative path is assumed.
output.folder
The base file output directory, which may be relative to the JSpider installation directory. If not set, the default output directory beneath the JSpider installation directory is assumed.
profile
The fully qualified class name of a com.mkdoc.schema.ApplicationProfile implementation to override the default Dublin Core profile.

Version:
0.7 2005-07-12
Author:
Philip Shaw
To do:
Default to TripleFileStoreManager if the manager is not configured in the application context.

Field Summary
 
Fields inherited from class com.mkdoc.jspider.AbstractFileStoreManagerPlugin
manager
 
Fields inherited from class com.mkdoc.jspider.AbstractRdfPlugin
profile
 
Fields inherited from class com.mkdoc.jspider.AbstractMKDocPlugin
log, VENDOR
 
Constructor Summary
XhtmlTripleWriterPlugin(net.javacoding.jspider.core.util.config.PropertySet propertySet)
          Obtain the output configuration.
 
Method Summary
protected  RdfContentHandler getContentHandler(FileStoreManager manager, java.net.URL url)
          Get a pre-configured content handler.
 java.lang.String getDescription()
          Get the plugin description.
 java.lang.String getName()
          Get the plugin name.
 java.lang.String getVersion()
          Get the plugin version.
 void parse(java.io.InputStream inStream, java.net.URL url)
          Parse an input stream.
 
Methods inherited from class com.mkdoc.jspider.AbstractFileStoreManagerPlugin
shutdown
 
Methods inherited from class com.mkdoc.jspider.AbstractRdfPlugin
configureHandler, getInputSource, getXmlReader, loadApplicationProfile, notify, setXhtmlTidy, setXmlReader
 
Methods inherited from class com.mkdoc.jspider.AbstractMKDocPlugin
getApplicationContext, getVendor, initialize, setLog
 
Methods inherited from class java.lang.Object
clone, equals, finalize, getClass, hashCode, notify, notifyAll, toString, wait, wait, wait
 
Methods inherited from interface net.javacoding.jspider.spi.Plugin
getVendor
 
Methods inherited from interface net.javacoding.jspider.api.event.EventSink
initialize, notify, shutdown
 

Constructor Detail

XhtmlTripleWriterPlugin

public XhtmlTripleWriterPlugin(net.javacoding.jspider.core.util.config.PropertySet propertySet)
Obtain the output configuration.

Parameters:
propertySet - The configuration property set for this plugin.
Since:
Version 0.1 2005-03-01
Method Detail

parse

public final void parse(java.io.InputStream inStream,
                        java.net.URL url)
Parse an input stream.

Specified by:
parse in class AbstractRdfPlugin
Parameters:
inStream - The resource input stream.
url - The resource URL.
Since:
Version 0.3 2005-04-27

getContentHandler

protected final RdfContentHandler getContentHandler(FileStoreManager manager,
                                                    java.net.URL url)
Get a pre-configured content handler.

Specified by:
getContentHandler in class AbstractFileStoreManagerPlugin
Parameters:
manager - A manager interface to write the triples to.
url - An initial URI subject.
Returns:
An RdfContentHandler instance with custom Schema if configured.
Since:
Version 0.1 2005-03-01

getDescription

public final java.lang.String getDescription()
Get the plugin description.

Specified by:
getDescription in interface net.javacoding.jspider.spi.Plugin
Returns:
A short description of this plugin.
Since:
Version 0.1 2005-02-28

getName

public final java.lang.String getName()
Get the plugin name.

Specified by:
getName in interface net.javacoding.jspider.spi.Plugin
Returns:
The name of this plugin.
Since:
Version 0.1 2005-02-28

getVersion

public final java.lang.String getVersion()
Get the plugin version.

Specified by:
getVersion in interface net.javacoding.jspider.spi.Plugin
Returns:
The version information for this plugin.
Since:
Version 0.1 2005-02-28