What is this (cid:51) in the output of pdf2txt?

≯℡__Kan透↙ 提交于 2019-12-01 16:09:45

to understand how to interpret the cid you need to know a pair of things:

  1. The Registry-Ordering-Supplement (ROS) information for the font in question. It's usually something like 'Adobe-Japan1-5' and is an informational property stored in the font. The ROS determines how the CIDs are to be interpreted.

  2. Armed with the ROS info, select a compatible CMap and decode through that.You can find CMap files for the Adobe-defined ROSes at http://sourceforge.net/projects/cmap.adobe/files/

More information on CID and CMaps direct from the inventors is available at http://www.adobe.com/content/dam/Adobe/en/devnet/font/pdfs/5014.CIDFont_Spec.pdf

check decode CID font codes to equivalent ASCII characters for more information

易学教程内所有资源均来自网络或用户发布的内容,如有违反法律规定的内容欢迎反馈
该文章没有解决你所遇到的问题?点击提问,说说你的问题,让更多的人一起探讨吧!