Archive for August 6th, 2012

Calibre is not only an E-book Library Management Application

calibre is a free and open source e-book library management application developed by users of e-books for users of e-books. It has a cornucopia of features divided into the following main categories:

  • Library Management
  • calibre manages your e-book collection for you. It is designed around the concept of the logical book, i.e., a single entry in your library that may correspond to actual e-book files in several formats.
    calibre can sort the books in your library by: Title, Author, Date added, Date published, Size, Rating, Series, etc.
    In addition, it supports extra searchable metadata:

    • Tags: A flexible system for categorizing your collection however you like
    • Comments: A long form entry that you can use for book description, notes, reviews, etc.

    You can easily search your book collection for a particular book. calibre supports searching any and all of the fields mentioned above. You can construct advanced search queries by clicking the helpful “Advanced search” button to the left of the search bar.
    You can export arbitrary subsets of your collection to your hard disk arranged in a fully customizable folder structure.
    Finally, calibre will even go out onto the internet to find book metadata based on existing title/author or ISBN information. It can download various types of metadata and covers for your books, automatically. The metadata system is written using plugins so that different types of metadata sources can be supported in the future.

  • E-book conversion
  • calibre can convert from a huge number of formats to a huge number of formats. It supports all the major e-book formats. The full list of formats can be found here .
    The conversion engine has lots of powerful features. It can rescale all font sizes, ensuring the output e-book is readable no matter what font sizes the input document uses. It can automatically detect/create book structure, like chapters and Table of Contents. It can insert the book metadata into a “Book Jacket” at the start of the book.

  • Syncing to e-book reader devices
  • calibre has a modular device driver design that makes adding support for different e-reader devices easy. At the moment, it has support for a large number of devices, the complete list of which is here . Syncing supports updating metadata on the device from metadata in the library and creation of collections on the device based on the tags defined in the library. If a book has more than one format available, calibre automatically chooses the best format when uploading to the device. If none of the formats is suitable, calibre will automatically convert the e-book to a format suitable for the device before sending it.

  • Downloading news from the web and converting it into e-book form
  • calibre can automatically fetch news from websites or RSS feeds, format the news into a ebook and upload to a connected device. The ebooks include the full versions of the articles, not just the summaries. Examples of supported news sites include:

    • The New York Times
    • The Wall Street Journal
    • The Economist
    • Time
    • Newsweek
    • The Guardian
    • ESPN
    • and many, many more…

    calibre has over three hundred news sources and the news system is plugin based, allowing users to easily create and contribute new sources to calibre. As a result the collection of news sources keeps on growing!
    If you are interested in adding support for a news site, read the User Manual . Once you have successfully created a new recipe, you can share it with other users by posting it in the calibre forum or sending it to the calibre developers for inclusion in calibre.

  • Comprehensive e-book viewer
  • calibre has a built-in ebook viewer that can display all the major ebook formats. It has full support for Table of Contents, bookmarks, CSS, a reference mode, printing, searching, copying, customizing the rendering via a user style sheet, embedded fonts, etc.

  • Content server for online access to your book collection
  • calibre has a built-in web server that allows you to access your ebook collection using a simple browser from any computer anywhere in the world. It can also email your books and downloaded news to you automatically. It has support for mobile devices, so you can browse your collection and download books from your smartphone, Kindle, etc.

Be Sociable, Share!

PdfMasher is Another Free Software to Convert PDF to MOBI or EPUB

Calibre is not only a free and open source e-book library management application, but also a power E-book converter, calibre can convert from a huge number of formats to a huge number of formats. It supports all the major e-book formats. The full list of formats are as follow,
Input Formats: CBZ, CBR, CBC, CHM, DJVU, EPUB, FB2, HTML, HTMLZ, LIT, LRF, MOBI, ODT, PDF, PRC, PDB, PML, RB, RTF, SNB, TCR, TXT, TXTZ
Output Formats: AZW3, EPUB, FB2, OEB, LIT, LRF, MOBI, HTMLZ, PDB, PML, RB, PDF, RTF, SNB, TCR, TXT, TXTZ

The conversion engine has lots of powerful features. It can rescale all font sizes, ensuring the output e-book is readable no matter what font sizes the input document uses. It can automatically detect/create book structure, like chapters and Table of Contents. It can insert the book metadata into a “Book Jacket” at the start of the book.

but today I will introduce another free and open source software that can easily convert PDF to MOBI or EPUB, it is PdfMasher.

PdfMasher is a tool to convert PDF articles (newspaper, academic) to MOBI or EPUB documents. Most ebook readers support PDF files natively, but it’s often a real pain to read those documents because we don’t have font size control over the document like we have with native ebooks. In many cases, we have to use the zooming feature and it’s just a pain. Another drawback of PDFs on ebook readers is that annotations are not supported.

There are already tools to convert PDFs to ebooks like Calibre, but what they do is that they try to guess the role of each piece of text in the PDF (and that’s if you’re lucky). I think that in all but the simplest cases, it’s a mistake to think that anything short of an AI can do that kind of guessing.

Enter PdfMasher. PdfMasher asks the user about the role of each piece of text, and does it in an efficient manner. Your PDF has a header on each page and you don’t want them to litter your text? Sort text elements by Y-position (thus grouping them all together), shift select the elements and flag them as ignored. They will not appear on your final HTML. Your PDF has footnotes on many pages? Sort your elements by text content (thus grouping all elements with the text starting with a number together) and flag them as footnotes. They will be moved to the end of the document, and PdfMasher will try to create hyperlinks to footnote references.

Be Sociable, Share!