在真實(shí)的工程開發(fā)中,一個(gè)富文本編輯器优幸,不是僅僅可以編輯顯示,還需要處理富文本的轉(zhuǎn)換與解析褪猛,方便傳輸與存儲(chǔ)网杆。一般來說,HTML文本是比較理想的網(wǎng)絡(luò)傳輸格式。如何將安卓的span式富文本和HTML文本之間進(jìn)行互轉(zhuǎn)碳却,是本篇介紹的重點(diǎn)队秩。
首先,介紹富文本轉(zhuǎn)成HTML昼浦。我們需要做的馍资,是遍歷文本中的span對(duì)象,并用合適的html標(biāo)簽來修飾span對(duì)象對(duì)應(yīng)的文本关噪。示例代碼如下:
public static String convertSpannedToRichText(Spanned spanned) {
List<CharacterStyle> spanList =
Arrays.asList(spanned.getSpans(0, spanned.length(), CharacterStyle.class));
SpannableStringBuilder stringBuilder = new SpannableStringBuilder(spanned);
for (CharacterStyle characterStyle : spanList) {
int start = stringBuilder.getSpanStart(characterStyle);
int end = stringBuilder.getSpanEnd(characterStyle);
if (start >= 0) {
String htmlStyle = handleCharacterStyle(characterStyle,
stringBuilder.subSequence(start, end).toString());
if (htmlStyle != null) {
stringBuilder.replace(start, end, htmlStyle);
}
}
}
return stringBuilder.toString();
}
private static String handleCharacterStyle(CharacterStyle characterStyle, String text) {
if (characterStyle instanceof BoldSpan) {
return String.format("<b>%s</b>", text);
} else if (characterStyle instanceof UrlSpan) {
UrlSpan span = (UrlSpan) characterStyle;
return String.format("<a href=\"%s\">%s</a>", span.getValue(), text);
} else if (characterStyle instanceof EmojiSpan) {
EmojiSpan span = (EmojiSpan) characterStyle;
return String.format("<img src=\"%s\" alt=\"[%s]\" class=\"yiqiFace\"/>",
span.getUrl(), span.getName());
} else if (characterStyle instanceof FakeImageSpan) {
FakeImageSpan span = (FakeImageSpan) characterStyle;
return String.format("<img src=\"%s\" />", span.getValue());
} if (characterStyle instanceof ImageSpan) {
ImageSpan span = (ImageSpan) characterStyle;
return String.format("<img src=\"%s\" />", TextUtils.isEmpty(span.getUrl()) ?
span.getFilePath() : span.getUrl());
}
return null;
}
注:上述代碼在處理一些復(fù)雜的span嵌套情況時(shí)鸟蟹,可能會(huì)有問題。如果要考慮到span嵌套的情況使兔,可能需要全新的思路和寫法建钥。后續(xù)如果有改進(jìn)會(huì)再更新。
下面介紹如何將HTML轉(zhuǎn)化為安卓富文本虐沥。這里使用了一個(gè)開源庫:TagSoup锦针,來處理Html內(nèi)容的解析。TagSoup是一個(gè)解析HTML的java開源庫置蜀,一般用作HTML的正則化奈搜。TagSoup的介紹可以參考網(wǎng)上其它文章,這里不再贅述盯荤。而我們要做的馋吗,是實(shí)現(xiàn)自己的SAX內(nèi)容處理器,實(shí)現(xiàn)org.xml.sax.ContentHandler接口秋秤,并set到TagSoupParser中宏粤。具體代碼都在:RichTextConvertor這個(gè)類中。下面節(jié)選幾個(gè)比較重要的方法:
@Override
public void startElement(String uri, String localName, String qName, Attributes atts) throws SAXException {
handleStartTag(localName, atts);
}
@Override
public void endElement(String uri, String localName, String qName) throws SAXException {
handleEndTag(localName);
}
@Override
public void characters(char[] ch, int start, int length) throws SAXException {
StringBuilder sb = new StringBuilder();
for (int i = 0; i < length; i++) {
char c = ch[i + start];
sb.append(c);
}
mResult.append(sb);
}
// ****************************************** Handle Tags *******************************************
private void handleStartTag(String tag, Attributes attributes) {
if (tag.equalsIgnoreCase("a")) {
startAHref(attributes);
} else if (tag.equalsIgnoreCase("img")) {
startImg(attributes);
} else if (tag.equalsIgnoreCase("b") || tag.equalsIgnoreCase("strong")) {
start(new Bold());
}
}
private void handleEndTag(String tag) {
if (tag.equalsIgnoreCase("a")) {
endAHref();
} else if (tag.equalsIgnoreCase("b")|| tag.equalsIgnoreCase("strong")) {
end(Bold.class, new BoldSpan());
}
}
private void startAHref(Attributes attributes) {
String href = attributes.getValue("", "href");
int len = mResult.length();
mResult.setSpan(new Href(href), len, len, Spanned.SPAN_MARK_MARK);
}
private void start(Object mark) {
int len = mResult.length();
mResult.setSpan(mark, len, len, Spanned.SPAN_MARK_MARK);
}
private void endAHref() {
int len = mResult.length();
Object obj = getLast(Href.class);
int where = mResult.getSpanStart(obj);
mResult.removeSpan(obj);
if (where != len) {
Href h = (Href) obj;
if (h.mHref != null) {
mResult.setSpan(new UrlSpan(h.mHref),
where, len, Spanned.SPAN_EXCLUSIVE_EXCLUSIVE);
}
}
}
private void end(Class<? extends Object> kind, Object repl) {
int len = mResult.length();
Object obj = getLast(kind);
int where = mResult.getSpanStart(obj);
mResult.removeSpan(obj);
if (where != len) {
// Note: use SPAN_EXCLUSIVE_EXCLUSIVE, the TemporarySpan will be replaced by a SPAN_EXCLUSIVE_INCLUSIVE span
mResult.setSpan(new TemporarySpan(repl), where, len, Spanned.SPAN_EXCLUSIVE_EXCLUSIVE);
}
}
private Object getLast(Class<? extends Object> kind) {
/*
* This knows that the last returned object from getSpans()
* will be the most recently added.
*/
Object[] objs = mResult.getSpans(0, mResult.length(), kind);
return objs.length == 0 ? null : objs[objs.length - 1];
}
private void startImg(Attributes attributes) {
int len = mResult.length();
String alt = attributes.getValue("", "alt");
String src = attributes.getValue("", "src");
String classString = attributes.getValue("", "class");
// Unicode Character 'OBJECT REPLACEMENT CHARACTER' (U+FFFC)
// see http://www.fileformat.info/info/unicode/char/fffc/index.htm
mResult.append("\uFFFC");
FakeImageSpan imageSpan = new FakeImageSpan(src);
mResult.setSpan(imageSpan, len, len + 1, Spanned.SPAN_EXCLUSIVE_EXCLUSIVE);
}
private static class Bold {
}
這里需要分三種情況進(jìn)行討論:
- 對(duì)于img這種標(biāo)簽灼卢,我們?nèi)〕銎渲械膶傩陨馨ィ涂梢赃M(jìn)行替換(有的讀者可能發(fā)現(xiàn),在img標(biāo)簽的處理上鞋真,我使用了一個(gè)叫做FakeImageSpan的類崇堰。這時(shí)因?yàn)樵趥鬏敗⒋鎯?chǔ)過程中涩咖,圖片都是使用了url遠(yuǎn)程路徑或者本地路徑海诲,需要先進(jìn)行下載或加載,才可以使用真正的ImageSpan進(jìn)行替換)檩互;
- 對(duì)于href這種標(biāo)簽特幔,需要先讀取內(nèi)容,直到出現(xiàn)截止標(biāo)簽闸昨,再用真正的URL內(nèi)容替換掉占位span蚯斯;
- 對(duì)于b和strong這樣的標(biāo)簽薄风,先用占位span占位后,在替換成TemporarySpan拍嵌,并采用SPAN_EXCLUSIVE_EXCLUSIVE格式村刨。在后面處理完所有輸入后,統(tǒng)一替換成相應(yīng)得span撰茎,并采取SPAN_EXCLUSIVE_INCLUSIVE格式,這樣后續(xù)的輸入都會(huì)采用相同的格式打洼。具體的處理請(qǐng)參考RichTextConvertor龄糊。