public class CharsetMatch extends Object implements Comparable
Instances of this class are created only by CharsetDetectors.
Note: this class has a natural ordering that is inconsistent with equals. The natural ordering is based on the match confidence value.
Modifier and Type | Field and Description |
---|---|
static int |
BOM
Bit flag indicating the match is based on the presence of a BOM.
|
static int |
DECLARED_ENCODING
Bit flag indicating he match is based on the declared encoding.
|
static int |
ENCODING_SCHEME
Bit flag indicating the match is based on the the encoding scheme.
|
static int |
LANG_STATISTICS
Bit flag indicating the match is based on language statistics.
|
Modifier and Type | Method and Description |
---|---|
int |
compareTo(Object o)
Compare to other CharsetMatch objects.
|
int |
getConfidence()
Get an indication of the confidence in the charset detected.
|
String |
getLanguage()
Get the ISO code for the language of the detected charset.
|
int |
getMatchType()
Return flags indicating what it was about the input data
that caused this charset to be considered as a possible match.
|
String |
getName()
Get the name of the detected charset.
|
Reader |
getReader()
Create a java.io.Reader for reading the Unicode character data corresponding
to the original byte data supplied to the Charset detect operation.
|
String |
getString()
Create a Java String from Unicode character data corresponding
to the original byte data supplied to the Charset detect operation.
|
String |
getString(int maxLength)
Create a Java String from Unicode character data corresponding
to the original byte data supplied to the Charset detect operation.
|
public static final int ENCODING_SCHEME
getMatchType()
,
Constant Field Valuespublic static final int BOM
getMatchType()
,
Constant Field Valuespublic static final int DECLARED_ENCODING
getMatchType()
,
Constant Field Valuespublic static final int LANG_STATISTICS
getMatchType()
,
Constant Field Valuespublic Reader getReader()
CAUTION: if the source of the byte data was an InputStream, a Reader can be created for only one matching char set using this method. If more than one charset needs to be tried, the caller will need to reset the InputStream and create InputStreamReaders itself, based on the charset name.
public String getString() throws IOException
IOException
public String getString(int maxLength) throws IOException
maxLength
- The maximium length of the String to be created when the
source of the data is an input stream, or -1 for
unlimited length.IOException
public int getConfidence()
public int getMatchType()
Note: currently, this method always returns zero.
public String getName()
Charset
,
InputStreamReader
public String getLanguage()
null
if the language cannot be determined.public int compareTo(Object o)
compareTo
in interface Comparable
o
- the CharsetMatch object to compare against.ClassCastException
- if the argument is not a CharsetMatch.Copyright © 2008-2014 Logical Objects. All Rights Reserved.