public class MSWordDocReader extends java.lang.Object implements FormatReader<DocData>
Constructor and Description |
---|
MSWordDocReader()
Create a new, uninitialized MSWordDocReader.
|
MSWordDocReader(java.io.File file)
Create a new MSWordDocReader that will read the specified MSWord file or
directory containing MSWord files.
|
MSWordDocReader(java.lang.String filePath)
Create a new MSWordDocReader that will read the specified MSWord file or
directory containing MSWord files.
|
Modifier and Type | Method and Description |
---|---|
boolean |
canRead()
Return whether this reader is initialized and can read data
|
void |
close()
Close the reader.
|
java.lang.String |
getFilters()
Get the filters for this reader.
|
java.lang.String |
getSource()
Get the source for this reader
|
boolean |
hasMoreData()
Determine whether the reader has more data
|
java.util.List<DocData> |
readData()
Read the MSWord file, or directory of MSWord files, to extract the
content.
|
void |
setFilters(java.lang.String filters)
Set filters for this reader.
|
void |
setSource(java.lang.String source)
Set the source for this reader.
|
public MSWordDocReader()
setSource()
method.public MSWordDocReader(java.io.File file)
file
- the MSWord file to read, or the directory containing MSWord
files.public MSWordDocReader(java.lang.String filePath)
filePath
- the path to the MSWord file to read, or the directory
containing MSWord files.public boolean canRead()
canRead
in interface FormatReader<DocData>
public void close()
close
in interface FormatReader<DocData>
public java.lang.String getFilters()
getFilters
in interface FormatReader<DocData>
public java.lang.String getSource()
getSource
in interface FormatReader<DocData>
public boolean hasMoreData()
hasMoreData
in interface FormatReader<DocData>
public java.util.List<DocData> readData()
readData
in interface FormatReader<DocData>
public void setFilters(java.lang.String filters)
setFilters
in interface FormatReader<DocData>
filters
- the filters for this readerpublic void setSource(java.lang.String source)
setSource
in interface FormatReader<DocData>
source
- the MSWord source file or directory