public class MSWordDocReader extends java.lang.Object implements FormatReader<DocData>
| Constructor | Description |
|---|---|
| MSWordDocReader() | Create a new, uninitialized MSWordDocReader. |
| MSWordDocReader(java.io.File file) | Create a new MSWordDocReader that will read the specified MSWord file or directory containing MSWord files. |
| MSWordDocReader(java.lang.String filePath) | Create a new MSWordDocReader that will read the specified MSWord file or directory containing MSWord files. |

| Modifier and Type | Method | Description |
|---|---|---|
| boolean | canRead() | Return whether this reader is initialized and can read data. |
| void | close() | Close the reader. |
| java.lang.String | getFilters() | Get the filters for this reader. |
| java.lang.String | getSource() | Get the source for this reader. |
| boolean | hasMoreData() | Determine whether the reader has more data. |
| java.util.List<DocData> | readData() | Read the MSWord file, or directory of MSWord files, to extract the content. |
| void | setFilters(java.lang.String filters) | Set filters for this reader. |
| void | setSource(java.lang.String source) | Set the source for this reader. |

public MSWordDocReader()
Create a new, uninitialized MSWordDocReader. The source must be set with the setSource() method.

public MSWordDocReader(java.io.File file)
Parameters: file - the MSWord file to read, or the directory containing MSWord files.

public MSWordDocReader(java.lang.String filePath)
Parameters: filePath - the path to the MSWord file to read, or the directory containing MSWord files.

public boolean canRead()
Specified by: canRead in interface FormatReader<DocData>

public void close()
Specified by: close in interface FormatReader<DocData>

public java.lang.String getFilters()
Specified by: getFilters in interface FormatReader<DocData>

public java.lang.String getSource()
Specified by: getSource in interface FormatReader<DocData>

public boolean hasMoreData()
Specified by: hasMoreData in interface FormatReader<DocData>

public java.util.List<DocData> readData()
Specified by: readData in interface FormatReader<DocData>

public void setFilters(java.lang.String filters)
Specified by: setFilters in interface FormatReader<DocData>
Parameters: filters - the filters for this reader

public void setSource(java.lang.String source)
Specified by: setSource in interface FormatReader<DocData>
Parameters: source - the MSWord source file or directory
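
The methods above suggest a read loop of the form: check canRead(), call readData() while hasMoreData() is true, then close(). The following is a minimal sketch of that pattern, not the library's own code. The DocData and FormatReader types below are hypothetical stand-ins that mirror only the signatures listed on this page, StubDocReader is an invented in-memory reader so the example runs without the real MSWordDocReader, and the path "docs/reports" is made up. Whether readData() returns everything in one call or in batches is not stated here, so the loop assumes it may be called repeatedly.

```java
import java.util.ArrayList;
import java.util.Arrays;
import java.util.List;

// Hypothetical stand-in for the library's DocData type (its real fields are not shown on this page).
class DocData {
    final String text;
    DocData(String text) { this.text = text; }
}

// Stand-in interface mirroring only the methods listed in the method summary.
interface FormatReader<T> {
    boolean canRead();
    void close();
    String getFilters();
    String getSource();
    boolean hasMoreData();
    List<T> readData();
    void setFilters(String filters);
    void setSource(String source);
}

// Invented in-memory reader; it plays the role of MSWordDocReader in this sketch.
class StubDocReader implements FormatReader<DocData> {
    private String source;
    private String filters;
    private boolean consumed = false;

    public boolean canRead() { return source != null; }
    public void close() { consumed = true; }
    public String getFilters() { return filters; }
    public String getSource() { return source; }
    public boolean hasMoreData() { return canRead() && !consumed; }
    public List<DocData> readData() {
        consumed = true; // this stub hands back everything in a single batch
        return Arrays.asList(new DocData("first document"), new DocData("second document"));
    }
    public void setFilters(String filters) { this.filters = filters; }
    public void setSource(String source) { this.source = source; consumed = false; }
}

class ReaderLoopSketch {
    // Drain a reader: skip it if uninitialized, loop on hasMoreData(), always close().
    static List<DocData> readAll(FormatReader<DocData> reader) {
        List<DocData> docs = new ArrayList<>();
        if (!reader.canRead()) {
            return docs; // no source set, so there is nothing to read
        }
        try {
            while (reader.hasMoreData()) {
                docs.addAll(reader.readData());
            }
        } finally {
            reader.close();
        }
        return docs;
    }

    public static void main(String[] args) {
        StubDocReader reader = new StubDocReader(); // analogous to the no-argument constructor
        reader.setSource("docs/reports");           // hypothetical path
        for (DocData doc : readAll(reader)) {
            System.out.println(doc.text);
        }
    }
}
```

With the real class, the stub would presumably be replaced by new MSWordDocReader(filePath) or by the no-argument constructor followed by setSource(); the try/finally keeps close() on the path even if readData() throws.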