xstream:Invalid byte 2 of 2-byte UTF-8 sequence
JavaXML
blog遷移至 :http://www.micmiu.com
在用XStream將xml
還原序列化 為Javabean時報錯,資訊如下:
com.thoughtworks.xstream.io.StreamException: : Invalid byte 2 of 2-byte UTF-8 sequence. at com.thoughtworks.xstream.io.xml.DomDriver.createReader(DomDriver.java:88)at com.thoughtworks.xstream.io.xml.DomDriver.createReader(DomDriver.java:70)at com.thoughtworks.xstream.XStream.fromXML(XStream.java:891)at michael.xstream.XtreamTestMain.main(XtreamTestMain.java:71)Caused by: com.sun.org.apache.xerces.internal.impl.io.MalformedByteSequenceException: Invalid byte 2 of 2-byte UTF-8 sequence.at com.sun.org.apache.xerces.internal.impl.io.UTF8Reader.invalidByte(UTF8Reader.java:684)at com.sun.org.apache.xerces.internal.impl.io.UTF8Reader.read(UTF8Reader.java:369)at com.sun.org.apache.xerces.internal.impl.XMLEntityScanner.load(XMLEntityScanner.java:1742)at com.sun.org.apache.xerces.internal.impl.XMLEntityScanner.peekChar(XMLEntityScanner.java:487)at com.sun.org.apache.xerces.internal.impl.XMLDocumentFragmentScannerImpl$FragmentContentDriver.next(XMLDocumentFragmentScannerImpl.java:2687)at com.sun.org.apache.xerces.internal.impl.XMLDocumentScannerImpl.next(XMLDocumentScannerImpl.java:648)at com.sun.org.apache.xerces.internal.impl.XMLDocumentFragmentScannerImpl.scanDocument(XMLDocumentFragmentScannerImpl.java:511)at com.sun.org.apache.xerces.internal.parsers.XML11Configuration.parse(XML11Configuration.java:808)at com.sun.org.apache.xerces.internal.parsers.XML11Configuration.parse(XML11Configuration.java:737)at com.sun.org.apache.xerces.internal.parsers.XMLParser.parse(XMLParser.java:119)at com.sun.org.apache.xerces.internal.parsers.DOMParser.parse(DOMParser.java:235)at com.sun.org.apache.xerces.internal.jaxp.DocumentBuilderImpl.parse(DocumentBuilderImpl.java:284)at com.thoughtworks.xstream.io.xml.DomDriver.createReader(DomDriver.java:79)... 3 more
產生的原因:
簡單的說就是XML檔案的編碼和解析XML時用的編碼不一致產生的問題。
由於檔案會以系統的預設編碼對檔案進行儲存,在中文版的window下Java的預設的編碼為GBK,所以預設產生的xml檔案是以GBK格式來儲存的,所以我們使用GBK、GB2312編碼來產生xml檔案能正確的被解析,而以UTF-8格式產生的檔案不能被xml解析器所解析的原因,其實和之前文章碰到的問題類似:http://sjsky.iteye.com/blog/1053931
解決辦法 :
就是為DOM解析器指定好編碼utf-8,代碼如下
1.XStream xStream = new XStream(new DomDriver("utf-8"));
有關XStream序列化JAVA對象為XML以及還原序列化的使用說明可參見 : http://sjsky.iteye.com/blog/784434