org.galagosearch.core.parse
Class AnchorTextDocumentCreator

java.lang.Object
  extended by org.galagosearch.tupleflow.StandardStep<org.galagosearch.core.types.IdentifiedLink,Document>
      extended by org.galagosearch.core.parse.AnchorTextDocumentCreator
All Implemented Interfaces:
org.galagosearch.tupleflow.Processor<org.galagosearch.core.types.IdentifiedLink>, org.galagosearch.tupleflow.Source<Document>, org.galagosearch.tupleflow.Step

@InputClass(className="org.galagosearch.core.types.IdentifiedLink")
@OutputClass(className="org.galagosearch.core.parse.Document")
public class AnchorTextDocumentCreator
extends org.galagosearch.tupleflow.StandardStep<org.galagosearch.core.types.IdentifiedLink,Document>

From an IdentifiedLink object, this class constructs a document containing only anchor text.

Author:
trevor

Field Summary
 
Fields inherited from class org.galagosearch.tupleflow.StandardStep
processor
 
Constructor Summary
AnchorTextDocumentCreator()
           
 
Method Summary
 void close()
           
 void flush()
           
 java.lang.Class<org.galagosearch.core.types.IdentifiedLink> getInputClass()
           
 java.lang.Class<Document> getOutputClass()
           
 void process(org.galagosearch.core.types.IdentifiedLink link)
          This method takes the text from a link object, tokenizes it, then adds it to a document object.
 
Methods inherited from class org.galagosearch.tupleflow.StandardStep
setProcessor
 
Methods inherited from class java.lang.Object
clone, equals, finalize, getClass, hashCode, notify, notifyAll, toString, wait, wait, wait
 

Constructor Detail

AnchorTextDocumentCreator

public AnchorTextDocumentCreator()
Method Detail

process

public void process(org.galagosearch.core.types.IdentifiedLink link)
             throws java.io.IOException
This method takes the text from a link object, tokenizes it, then adds it to a document object.

Specified by:
process in interface org.galagosearch.tupleflow.Processor<org.galagosearch.core.types.IdentifiedLink>
Specified by:
process in class org.galagosearch.tupleflow.StandardStep<org.galagosearch.core.types.IdentifiedLink,Document>
Throws:
java.io.IOException

flush

public void flush()
           throws java.io.IOException
Throws:
java.io.IOException

close

public void close()
           throws java.io.IOException
Specified by:
close in interface org.galagosearch.tupleflow.Processor<org.galagosearch.core.types.IdentifiedLink>
Overrides:
close in class org.galagosearch.tupleflow.StandardStep<org.galagosearch.core.types.IdentifiedLink,Document>
Throws:
java.io.IOException

getInputClass

public java.lang.Class<org.galagosearch.core.types.IdentifiedLink> getInputClass()

getOutputClass

public java.lang.Class<Document> getOutputClass()


Copyright © 2009. All Rights Reserved.