Last Build Date: Thu, 23 Feb 2017 09:58:31 GMT
Mon, 20 Feb 2017 15:20:41 GMTWhy is there no software that goes through a directory, converts the PDF, EPUB, oet other format to text. then agressively searches the text for ISBN number, title etc. Corrects the metadata of the ebook. Also extracts the IMG for tesseract OCR to check if the title can be deduced. Library of Congress entries are also good sources. parsing PDF's can also be done with python modules for eve nmore effective automatic library cleaning.
Sat, 11 Feb 2017 11:38:09 GMTUser annotations - bookmarks, highlights, notes, etc... to the best of my knowledge do not stay with the EPUB book but are proprietary to each reader. So if I change readers and open a book that I marked up in another reader my annotations will be lost. This is not so much of a concern for the immediate future as I don't intend to change readers often, but say in 5 or 10 years I go to re-read a book and I'm using a totally new system/platform/reader - to loose all my annotations from the last time I read it is unacceptable. * Does the EPUB format support storing user annotations in the book.epub file in a non-destructive manner? * If it does, are there any readers out there that store all of the user annotations in the book.epub file?