E-book conversion with Calibre: Difference between revisions

The educational technology and digital learning wiki
Jump to navigation Jump to search
Line 7: Line 7:
== Features ==
== Features ==


List of general features: Library Management - Format conversion (all major ebook formats can be converted from) - Syncing to ebook reader devices - Fetching news from the web and converting it into ebook  form - Viewing many different ebook formats - Giving you access to your book collection over the internet using just a browser.
List of general features according to the calibre [http://calibre-ebook.com/about about page], retrieved 19:08, 29 November 2010 (CET)
: Converter: can create chapters out of single text using XPath expressions, add title image, title, author, data, etc.
 
: Source formats: LIT, MOBI, EPUB, HTML, PRC, RTF, TXT, PDF. Some convert better than others, e.g. PDF pictures don't translate.
'''(1) Library Management'''
: Output formats: EPUB, LRF, MOBI
 
: Easy to install under Ubuntu (tested with 8.04 Hardy Heron and 9.04 Jaunty). You can do it with one command line.
Calibre manages your e-book collection for you. It is designed around the concept of the logical book, i.e., a single entry in your library that may correspond to actual e-book files in several formats. Calibre can sort the books in your library by: Title, Author, Date added, Date published, Size, Rating, Series, etc.
 
You can search your book collection for a particular book either with simple or advanced search.
 
You can export arbitrary subsets of your collection to your hard disk arranged in a fully customizable folder structure.
 
Calibre will retrive book metadata from the Internet based on existing title/author or ISBN information. It can download various types of metadata and covers for your books, automatically. The metadata system is written using plugins so that different types of metadata sources can be supported in the future.
 
'''(2) E-book conversion'''
 
Calibre can convert from a huge number of formats to a huge number of formats. It supports all the major e-book formats.
 
The conversion engine has lots of powerful features. It can rescale all font sizes, ensuring the output e-book is readable no matter what font sizes the input document uses. It can automatically detect/create book structure, like chapters and Table of Contents. It can insert the book metadata into a "Book Jacket" at the start of the book. Itcan create chapters out of single text using XPath expressions, add title image, title, author, data, etc.
 
* Source formats (as of version 0.7.27 (nov 2010): CBZ, CBR, CBC, CHM, EPUB, FB2, HTML, LIT, LRF, MOBI, ODT, PDF, PRC. Some convert better than others, e.g. PDF pictures and tables don't translate well and some can be lost. In order of decreasing preference: LIT, MOBI, EPUB, HTML, PRC, RTF, PDB, TXT, PDF
* Output formats: EPUB, FB2, OEB, LIT, LRF, MOBI, PDB, PML, RB, PDF, SNB, TCR, TXT
 
'''(3) Syncing to e-book reader devices'''
 
calibre has a modular device driver design that makes adding support for different e-reader devices easy. It has support for a [http://calibre-ebook.com/user_manual/faq.html#device-integration large number of devices]. Syncing supports updating metadata on the device from metadata in the library and creation of collections on the device based on the tags defined in the library. If a book has more than one format available, calibre automatically chooses the best format when uploading to the device. If none of the formats is suitable, calibre will automatically convert the e-book to a format suitable for the device before sending it.
 
'''(4) Downloading news from the web and converting it into e-book form'''
 
Calibre can automatically fetch news from websites or RSS feeds, format the news into a ebook and upload to a connected device. The ebooks include the full versions of the articles, not just the summaries. Examples of supported news sites include: ''The New York Times, The Wall Street Journal, The Economist, Time and many more''.
 
'''(5) Comprehensive e-book viewer'''
 
calibre has a built-in ebook viewer that can display all the major ebook formats. It has full support for Table of Contents, bookmarks, CSS, a reference mode, printing, searching, copying, customizing the rendering via a user style sheet, embedded fonts, etc.
 
'''(6) Content server for online access to your book collection'''
 
Calibre has a built-in web server that allows you to access your ebook collection using a simple browser from any computer anywhere in the world. It can also email your books and downloaded news to you automatically. It has support for mobile devices, so you can browse your collection and download books from your smartphone, Kindle, etc.
 
'''Supported system and installation'''
 
* Windows
* MacOSX
* Linux: Calibre is easy install under Ubuntu (tested with 8.04, 9.04 and 10.04). You can do it [http://calibre-ebook.com/download_linux with one command line]. An older version is available through Synaptic.


== Preparing input ==
== Preparing input ==


MS word documents can be made into an e-book, but exporting these as HTML using the '''Web page, filtered''' option in Word 2007. In addition, we strongly suggest to install the  [http://www.mobileread.com/forums/showthread.php?t=28313 BookCreator tool], i.e. a .dot Word template with VBScript code that will assist creating a layout that will make conversion much easier.
MS word documents can be made into an e-book, but exporting these as HTML using the '''Web page, filtered''' option in Word 2007. In addition, we strongly suggest to install the  [http://www.mobileread.com/forums/showthread.php?t=28313 BookCreator tool], i.e. a .dot Word template with VBScript code that will assist creating a layout that will make conversion much easier.
Line 41: Line 80:


A tested example file is [http://tecfa.unige.ch/guides/vrml/vrmlman/epub.html here], i.e. an index of an old VRML manual I wrote over 10 years ago with Latex.
A tested example file is [http://tecfa.unige.ch/guides/vrml/vrmlman/epub.html here], i.e. an index of an old VRML manual I wrote over 10 years ago with Latex.
== Post processing ==
Books made with Calibre may need some post-processing. A good program is the free [http://code.google.com/p/sigil/ Sigil] which is available for Win 32/64bit, Mac(Intel) and Linux 32/64bit.


== Links ==
== Links ==

Revision as of 20:08, 29 November 2010

Draft

Introduction

Calibre is a one stop solution to all your e-book needs. It is free, open source and cross-platform in design and works well on Linux, OS X and Windows.

Features

List of general features according to the calibre about page, retrieved 19:08, 29 November 2010 (CET)

(1) Library Management

Calibre manages your e-book collection for you. It is designed around the concept of the logical book, i.e., a single entry in your library that may correspond to actual e-book files in several formats. Calibre can sort the books in your library by: Title, Author, Date added, Date published, Size, Rating, Series, etc.

You can search your book collection for a particular book either with simple or advanced search.

You can export arbitrary subsets of your collection to your hard disk arranged in a fully customizable folder structure.

Calibre will retrive book metadata from the Internet based on existing title/author or ISBN information. It can download various types of metadata and covers for your books, automatically. The metadata system is written using plugins so that different types of metadata sources can be supported in the future.

(2) E-book conversion

Calibre can convert from a huge number of formats to a huge number of formats. It supports all the major e-book formats.

The conversion engine has lots of powerful features. It can rescale all font sizes, ensuring the output e-book is readable no matter what font sizes the input document uses. It can automatically detect/create book structure, like chapters and Table of Contents. It can insert the book metadata into a "Book Jacket" at the start of the book. Itcan create chapters out of single text using XPath expressions, add title image, title, author, data, etc.

  • Source formats (as of version 0.7.27 (nov 2010): CBZ, CBR, CBC, CHM, EPUB, FB2, HTML, LIT, LRF, MOBI, ODT, PDF, PRC. Some convert better than others, e.g. PDF pictures and tables don't translate well and some can be lost. In order of decreasing preference: LIT, MOBI, EPUB, HTML, PRC, RTF, PDB, TXT, PDF
  • Output formats: EPUB, FB2, OEB, LIT, LRF, MOBI, PDB, PML, RB, PDF, SNB, TCR, TXT

(3) Syncing to e-book reader devices

calibre has a modular device driver design that makes adding support for different e-reader devices easy. It has support for a large number of devices. Syncing supports updating metadata on the device from metadata in the library and creation of collections on the device based on the tags defined in the library. If a book has more than one format available, calibre automatically chooses the best format when uploading to the device. If none of the formats is suitable, calibre will automatically convert the e-book to a format suitable for the device before sending it.

(4) Downloading news from the web and converting it into e-book form

Calibre can automatically fetch news from websites or RSS feeds, format the news into a ebook and upload to a connected device. The ebooks include the full versions of the articles, not just the summaries. Examples of supported news sites include: The New York Times, The Wall Street Journal, The Economist, Time and many more.

(5) Comprehensive e-book viewer

calibre has a built-in ebook viewer that can display all the major ebook formats. It has full support for Table of Contents, bookmarks, CSS, a reference mode, printing, searching, copying, customizing the rendering via a user style sheet, embedded fonts, etc.

(6) Content server for online access to your book collection

Calibre has a built-in web server that allows you to access your ebook collection using a simple browser from any computer anywhere in the world. It can also email your books and downloaded news to you automatically. It has support for mobile devices, so you can browse your collection and download books from your smartphone, Kindle, etc.

Supported system and installation

  • Windows
  • MacOSX
  • Linux: Calibre is easy install under Ubuntu (tested with 8.04, 9.04 and 10.04). You can do it with one command line. An older version is available through Synaptic.

Preparing input

MS word documents can be made into an e-book, but exporting these as HTML using the Web page, filtered option in Word 2007. In addition, we strongly suggest to install the BookCreator tool, i.e. a .dot Word template with VBScript code that will assist creating a layout that will make conversion much easier.

You also can use HTML cleanup programs:

Creating books from multiple files

In order to convert a collection of HTML files in a specific oder, you have to create a table of contents file. That is, another HTML file that contains links to all the other files in the desired order. Such a file looks like:

<html>
   <body>
     <h1>Table of Contents</h1>
     <p style="text-indent:0pt">
        <a href="file1.html">First File</a><br/>
        <a href="file2.html">Second File</a><br/>
        .
        .
        .
     </p>
   </body>
</html>

Then just add this HTML file to the GUI and use the convert button to create your ebook.

A tested example file is here, i.e. an index of an old VRML manual I wrote over 10 years ago with Latex.

Post processing

Books made with Calibre may need some post-processing. A good program is the free Sigil which is available for Win 32/64bit, Mac(Intel) and Linux 32/64bit.

Links

List of Calibre tutorials

  • Absolute e-book beginners also should read Beginner’s Guide to Calibre by John Schember, jan 2010. This tutorial explains how to read books with calibre and how to download stuff and upload to your device.

General ePub tutorials

  • See the ePub article for more

Manual