epub

Python — Parsing files (docx, pdf and odt) and converting the content into my data model

假如想象 提交于 2019-12-08 03:51:27
问题 I'm writing an import/export tool for importing docx, pdf, and odt files; in which a book has been written. We already have a tool for the .epub format, and we'd like to extend the functionality beyond that, so users of the site can have more flexibility. So far I've looked at PDFMiner and also found out that docx is just based on the openxml format, so the word/document.xml is essentially the file containing the whole thing, and I can parse it with lxml. The question I have is: I'm hoping to

Taking long time to display epub files in device

二次信任 提交于 2019-12-07 22:45:31
We are displaying an epub file on the screen through our application. The file is saved in SDCard and the following logic we are using for getting the file data from SDCard and displaying in Screen. But its taking long time to load the content in screen. Any issues with my code? please help me friends. File rootDir = Environment.getExternalStorageDirectory(); EpubReader epubReader = new EpubReader(); try { book = epubReader.readEpub(new FileInputStream("/sdcard/forbook.epub")); Toast.makeText(getApplicationContext(), "Book : " + book, Toast.LENGTH_LONG).show(); } catch (FileNotFoundException e

How to highlight text permanently in WebView in Android?

怎甘沉沦 提交于 2019-12-07 15:18:29
问题 I am developing an ePUB reader for Android phones. I can get text copied to the ClipBoard using WebView & ClipBoardManager. But what I really want to do is highlight the selection permanently. I saw some articles. They suggested to get WebView in selection mode & to use JavaScript. How can I get WebView in selection mode? If I go with JavaScript and get the text modified, can I save those changes applied by the JavaScript permanently in the corresponding ePUB file? If yes then how? 来源: https:

Getting XHTML from DRM EPUB

旧街凉风 提交于 2019-12-07 11:37:27
I'm currently using the Adobe RMSDK to read DRM EPUBs. It renders an EPUB page as an image, and sends it back to be displayed on the device. I would really like to have access to the EPUBs XHTML, to display it in a webview. I have been able to unzip the EPUBs and browse the different chapters, but the content is still encrypted. Is there anyone that have any idea how I can accomplish this? I'm using Xcode 4.2 , Objective C/C++ Regards, EZFrag EDIT 1: I already combed through the whole RMSDK, but was unable to find anything that could help me. I also couldn't find any online resource explaining

Multiline questionaire editable

你说的曾经没有我的故事 提交于 2019-12-07 08:45:31
I want to create a multiline field for a questionaire-like form similar to the ones you see in interactive PDF documents. The input field must span the whole width and wrap at the end of the line or at least after a fixed number of characters. The caption does not necessarily have to be in the same line: Please introduce yourself: _____________________________ ________________________________________________________ There are several things I have tried to achieve this: <p contentEditable="true"> or <textarea> did not work for me, because border-bottom: only works for the last line and text

eBook reading error

大城市里の小女人 提交于 2019-12-07 04:08:26
I have tried to read a epub file for which im using http://www.siegmann.nl/epublib/android . Im trying the same sample which is given in the link, but im getting java.lang.NoClassDefFoundError . I tried all the way i can. Even by changing by adt,eclipse but i cant able to solve this issue. Please look into it and provide me some suggestions. 04-20 15:08:47.735: E/AndroidRuntime(4329): FATAL EXCEPTION: main 04-20 15:08:47.735: E/AndroidRuntime(4329): java.lang.ExceptionInInitializerError 04-20 15:08:47.735: E/AndroidRuntime(4329): at com.sample.pubreader.EPubReaderActivity.onCreate

Can't Unzip EPub File

喜夏-厌秋 提交于 2019-12-06 11:37:38
IMO, I thought that epub is a kind of zip . Thus, I've tried to unzip in a way . public class Main { public static void main(String argv[ ]) { final int BUFFER = 2048; try { BufferedOutputStream dest = null; FileInputStream fis = new FileInputStream("/Users/yelinaung/Documents/unzip/epub/doyle.epub"); ZipInputStream zis = new ZipInputStream(new BufferedInputStream(fis)); ZipEntry entry; while ((entry = zis.getNextEntry()) != null) { System.out.println("Extracting: " + entry); int count; byte data[] = new byte[BUFFER]; // write the files to the disk FileOutputStream fos = new FileOutputStream("

Regex: Find groups of lowercase letters between HTML tag

£可爱£侵袭症+ 提交于 2019-12-06 07:04:56
问题 I'm attempting to develop a regular expression that can be run in Sigil, the ePub 2 editor. Small-caps are a well-known problem within the current ePub reader ecosystem. Many readers, such as Adobe Digital Editions, do not support "font-variant: small-caps". After trying several different workarounds, I've settled on creating fake small caps by transforming the text to uppercase and setting the previously lowercase letters to "font-size: 0.75em". This process is extremely tedious, especially

epub 3, how to prevent pages from running in background ? (iBooks / Readium)

China☆狼群 提交于 2019-12-06 06:16:08
I have created and validated an ePub 3 which I am currently testing. I am using a fixed layout.. <meta property="rendition:layout">pre-paginated</meta> <meta property="rendition:orientation">portrait</meta> <meta property="rendition:spread">none< /meta> It works nicely, however multiple pages seems to load and run at once when testing in different readers. I'm using jQuery's document ready to kick things off and play audio, run animations, etc. When testing in iBooks I open the book to page 1 which loads and begins to run, but then I start hearing audio from page 2.. so the document ready

Python — Parsing files (docx, pdf and odt) and converting the content into my data model

旧巷老猫 提交于 2019-12-06 04:31:56
I'm writing an import/export tool for importing docx, pdf, and odt files; in which a book has been written. We already have a tool for the .epub format, and we'd like to extend the functionality beyond that, so users of the site can have more flexibility. So far I've looked at PDFMiner and also found out that docx is just based on the openxml format, so the word/document.xml is essentially the file containing the whole thing, and I can parse it with lxml. The question I have is: I'm hoping to parse the contents of these files, and from that content, extract things like chapter names, images