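I want to use Flying Saucer to convert an HTML page containing Arabic characters to a PDF file, but the generated PDF does not contain the combined character forms, and the text is printed backwards.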
HTML:
    <?xml version="1.0" encoding="UTF-8"?>
    <!DOCTYPE html PUBLIC "-//W3C//DTD XHTML 1.0 Strict//EN" "http://www.w3.org/TR/xhtml1/DTD/xhtml1-strict.dtd">
    <html xmlns="http://www.w3.org/1999/xhtml">
      <head>
        <meta http-equiv="Content-Type" content="text/html; charset=utf-8" />
      </head>
      <body style="font-size:15px;font-family: Arial Unicode MS;">
        <center style="font-size: 18px; font-family: Arial Unicode MS;">
          <b>
            <i style="font-family: Arial Unicode MS;">
              جميع الحقوق<br />
            </i>
          </b>
        </center>
      </body>
    </html>
Java excerpt:
    String inputFile = "c:\\html.html";
    String url = new File(inputFile).toURI().toURL().toString();
    String outputFile = "c:\\html.pdf";
    OutputStream os = new FileOutputStream(outputFile);

    ITextRenderer renderer = new ITextRenderer();
    renderer.getFontResolver().addFont("c://ARIALUNI.TTF", BaseFont.IDENTITY_H, BaseFont.EMBEDDED);
    renderer.setDocument(url);
    renderer.layout();
    renderer.createPDF(os);
    os.close();
Actual PDF result:
Expected PDF result:
What should I do to get the correct result?