An eBook (also: e-book, ebook) is an electronic (or digital) version of a book. The term is used ambiguously both to refer to either an individual work in a digital format, or a hardware device used to read books in digital format. Some users deprecate the second meaning in favor of the more precise "eBook device." However the term interplay works out coloquially in the long run, eBooks are an emerging and rapidly changing technology, and since at least 2004 have included the newer experimental eMagazines, pioneered in part by Baen's Books in their release of the first Grantville Gazette eMagazine.
The term e-text is often used synonymously with the term eBook, and is also used for the more limited case of data in ASCII text format, while the more general [e-book] can be in a specialized (and, at times, proprietary) file format. An exception to this rule is the academic e-text, which commonly includes components such as facsimile images, apparatus criticus, and scholarly commentary on the work from one or more editors specially qualified to edit the author or work in question.
An eBook is commonly bundled by a publisher for distribution (as an eBook, an ezine, or an internet newspaper), whereas e-text is distributed in ASCII (or plain text), or in the case of academic works, in the form of discrete media such as compact discs. Metadata relating to the text are sometimes included with etext (though it appears more frequently with eBooks). Metadata commonly include details about author, title, publisher, and copyright date; less common are details regarding language, genre, relevant copyright conventions, etc.
Formats
The eBook community has many options when it comes to choosing a format for production. While the average end-user might arguably simply want to read books, every format has its exponents and champions, and debates over "which format is best" can become intense. The myriad of eBook formats is sometimes referred to as the "Tower of eBabel". For the average end user to read a book, every format has its advantages and disadvantages. Formats available include, but are by no means limited to:
Image files
An eBook can be distributed as a sequence of images, one for each page. In this way, any image format can be used as an eBook format. This method of distribution produces files much larger than all others, and also has the disadvantage that the user cannot select text, nor can the eBook be read by a screen reader. Distribution as images is most suitable for comic books, books about art, or other very visual works.
Rich Text Format
Published as an .rtf
A standard formalized by Microsoft Corporation for specifying formatting of documents. RTF files are actually ASCII files with special commands to indicate formatting information, such as fonts and margins.
Hyper Text Markup Language
Commonly known as HTML
HTML is the markup language used for most web pages. eBooks using HTML can be read using a standard browser (e.g., Mozilla, Firefox, or Microsoft Internet Explorer), with no need for special equipment.
Open Electronic Book Package Format
Commonly known as OPF FlipBook
OPF is an xml-based eBook format created by E-Book Systems. eBooks created in this format are also known as FlipBooks as the viewing software presents the book in a 3D flipping format. There is an on-going project to make the OPF readable using a standard browser (e.g., Mozilla, Firefox, or Microsoft Internet Explorer), with no need for special equipment. Currently it requires the viewing client for the full Flipping experience.
FictionBook
FictionBook is a popular XML-based eBook format, supported by free readers such as Haali Reader and FBReader.
TEI Lite is the most popular of the TEI-based (and thus XML-based or SGML-based) electronic text formats.
Plucker
Plucker is a free eBook reader application with its own associated file format and software to automatically generate plucker files from HTML files, web sites or RSS feeds. The format is a compressed HTML archive, somewhat like Microsoft's CHM.
CHM Format
Also known as Microsoft Compressed HTML Help
CHM format is a proprietary format based on HTML. Multiple pages and embedded graphics are distributed along with proprietary metadata as a single compressed file. In contrast, in HTML, a site consists of multiple html files and associated image files in standardized formats.
http://haali.cs.msu.ru/pocketpc/FictionBook_description.html
TEX
The TeX format is a popular academic format. TeX is a typesetting system written by Donald Knuth, especially for technical writing applications in the scientific communities of mathematics and computer science. TeX can typeset complex mathematical formulas, but is now also being used for many other typesetting tasks especially in template packages. Few people write in plain TeX any more. LaTeX is a TeX document preparation system. LaTeX offers programmable features and facilities for automating aspects of typesetting and desktop publishing, especially numbering and cross-referencing, tables and figures, page layout, bibliographies. LaTeX was originally written in 1984 by Leslie Lamport and has become a leading method for using TeX (current version: 2ε)
Portable Document Format
Published as a .pdf
A file format created by Adobe Systems, initially to provide a standard form for storing and editing printed publishable documents. Because documents in .pdf format can easily be seen and printed by users on a variety of computer and platform types, they are very common on the World Wide Web. But since they are designed to reproduce page images, and the text cannot be re-flowed to fit the screen width, PDF files designed for printing on standard paper sizes are hard to view on screens with limited size or resolution.
PDF files are created mainly using Adobe Acrobat, but Acrobat Capture and other Adobe products also support their creation, as do third-party products such as PDFCreator, OpenOffice.org, and FOP. Acrobat Reader (now simply called Adobe Reader) is Adobe's product used to view PDF files. PDF files typically contain product manuals, brochures, magazine articles, or flyers as they can embed fonts, images, and other documents. A PDF file contains one or more page images, each of which you can zoom in on or out from. The PDF format can include interactive elements such as buttons for forms entry and for triggering sound and Quicktime or AVI movies. Acrobat PDF files are optimized for the Web by rendering text before graphic images and hypertext links. Adobe's PDF-like eBook format is incorporated into their reader.
PostScript
Published as an .ps
PostScript is a page description language used primarily in the electronic and desktop publishing areas for describing the contents of a printed page in a higher level than the actual output bitmap.
DjVu
Published as .djvu
DjVu is a file format that has been long in obscurity, but that is starting to surface now that free tools to manipulate the files are available.
DjVu is a format that particularly excels in storing scanned images. There are even advanced compressors especially specializing in low-color images, such as text documents. Individual files may contain single pages, or they can be collections of multiple pages.
The images are divided in separate layers (such as multi-color, low-resolution, lossily-compressed background layer, and few-colors, high-resolution, tightly-compressed foreground layer), each compressed in best applicable method. The files are also designed to decompress very fast, even faster than vector-based formats.
The advantage of DjVu is that it is possible to take a high-resolution scan (300-400 DPI), good enough for both on-screen and printing, and store it very efficiently. Several dozens of 300 DPI black-and-white scans can be stored in less than a megabyte.
Microsoft
Published as an .lit
The MS reader uses patented ClearType® display technology. Navigation works with a keyboard, mouse, stylus, or through electronic bookmarks. The Catalogue Library records reader books in a personalized "home page". A user can add annotations and notes to any page, create large-print eBooks with a single command, or create free-form drawings in the reader pages. A built-in dictionary allows the user to look up words.
eGuide
eGuides were first introduced on eBay by a seller with the User name "ColidgeGraduit". eGuides are usually in plain Microsoft Word or Notepad files, and are usually short guides in a step-by-step format. The eGuide simple format is easily prone to plagiarism, since almost anybody can just save the file to their computer then resend it to someone else claiming it as their own. Security measures are being introduced in order to secure simple Microsoft Word documents from being edited by the recipients.
eReader (formerly Palm Digital Media)
Published as a .pdb
eReader is a program for viewing Palm Digital Media electronic books. Versions are available for PalmOS, PocketPC, Symbian OS, Windows, and Macintosh. The reader shows text one page at a time as paper books do. eReader supports embedded hyperlinks and images. Most eReader formatted books are encrypted, with the key being the purchaser's full name and credit card number.
Mobipocket
Published as a .prc
The Mobipocket Reader has a home page library. Readers can add blank pages in any part of a book and add free-hand drawings. Annotations — highlights, bookmarks, notes, and drawings — can be applied, organized, and recalled from a single location. Mobipocket Reader has electronic bookmarks, appearing in the page margins. Dictionaries allow users to look up definitions through a built-in lookup function.
The reader has a full screen mode for reading and has Microsoft ClearType® support. On Palm OS, readers can use sub-pixel rendering with the MobiType® font. Mobipocket Reader runs on many PDA types (including Palm OS, Pocket PC and Windows CE, Tablet PC, Casio BE-300, Psion, Symbian OS Smartphones, Franklin eBookMan) and on Windows 2000/XP. Mobipocket products do not allow reading on Linux, Macintosh or other operating systems. These systems may be used only as a conduit to a PDA.
The Mobipocket eBook format based on the Open eBook standard using XHTML can include JavaScript and frames. It also supports native SQL queries to be used with embedded databases.
The Mobipocket encryption system is not a password based system. Its DRM relies on the PDA hardware serial number.
A free eBook of the German Wikipedia has been published in Mobipocket format, see [1].
ExeBook
Published as an .exe
ExeBook is a compiler that produces an eBook file that, when executed, produces a simulated book onscreen, complete with page texture. The etext is encrypted as graphic images so that automatic text copying is very difficult. The fear of exe files picking up viruses, however, is hampering its acceptance.
DesktopAuthor
DesktopAuthor is an electronic publishing suite that allows creation of digital web books with virtual turning pages. Digital web books of any epublication type can be written in this format, including ebrochures, eBooks, digital photo albums, ecards, digital diaries, online resumes, quizzes, exams, tests, forms and surveys. DesktopAuthor packages the eBook into a ".dnl" or ".exe" book. Each can be a single, plain stand-alone executable file which does not require any other programs to view it. DNL files can be viewed inside a web browser or stand-alone via the DNL Reader.
Newton eBook
Published as an ."pkg" and more commonly known as a Newton book; a single Newton package file can contain multiple books
All systems running the Newton operating system (the most common ones include the Newton MessagePads, eMates, Siemens Secretary Stations, Motorola Marcos, Digital Ocean Seahorses and Tarpons) have built-in support for viewing Newton books. The Newton package format was released to the public by Newton, Inc. prior to that company's absorption into Apple Computer. The format is thus arguably open and various people have written readers for it (writing a Newton book converter has even been assigned as a university-level class project).
Newton books have no support for DRM or encryption. They do support internal links, potentially multiple tables of contents and indexes, embedded grayscale images, and even some scripting capability (for example, it's possible to make a book in which the reader can influence the outcome).
http://tools.unna.org/wikiwikinewt/index.php/MakeNewtonEbooksIndex
Comparison with printed books
Ebook
Advantages
- Text can be searched, except when represented in the form of images.
- Take up little space.
- Hundreds (or thousands) may be carried together on one device.
- Approximately 500 average eBooks can be stored on one CD (equivalent to several shelves' worth of print books)
- Because they take up little space, eBooks can be offered indefinitely, with no 'out of print' date, allowing authors to continue to earn royalties indefinitely (copyright law permiting), and allowing readers to find older works by favorite authors.
- Ebooks may be read in low light or even total darkness, with a back-lit device.
- Type size and type face may be adjusted. However, enlarging e.g. a PDF document magnifies the text but preserves the original layout and spacing; a practical limit on zooming follows from the requirement to keep a text column within the width of the screen (otherwise horizontal scrolling would be needed during and after reading each line, which would be very cumbersome). However, tagged PDFs can be reflowed in Acrobat 6 and 7, eliminating the horizontal-scrolling problem in zoomed PDFs. For more on zooming in, see Electronic maps.
- Can be used with text-to-speech software.
- Readily reformatted for independent platforms.
- Instantly copied
- When a backup is kept in a remote place, cannot be lost by fire, etc.
- Once distributed, elimination is hard to impossible.
- Distributed at low cost.
- Distributed instantly, allowing readers to begin reading at once, without the need to visit a bookstore
- Simultaneously share book (if networked).
- Errors may be easily corrected with downloadable lists of errata or simply with corrected text. (This can also be an advantage for printed books, in different circumstances.)
- At the moment, eBooks are commonly published by independent publishing houses, which can mean greater editorial and authorial freedom and more room for experimentation.
- An inexpensive format for works that require color.
- An excellent choice of format for works that benefit from search and cross-reference capabilities, such as dictionaries, reference works, certain kinds of textbooks.
Disadvantages
From the user's point of view:
- Can be incompatible with new or replacement hardware or software
- Require care in handling and storage of the files, to avoid damage or loss
- Reading can be hard on (or even harmful to) the eyes
- Lacks the quality of a print book as an item
- Limited battery life on portable devices
- Can Phone Home to track readers and reading habits
- Can restrict how many times a document is read
- Can restrict printing
From the publisher/author's point of view:
- Can in some cases be hacked, or disseminated without approval from the author or publisher (some formats are more susceptible to this than others)
- Not normally a good format choice for works that have extensive and/or large illustrations, such as works in art history, photography, large maps, etc.
Print book
Advantages
- Less eye strain over extended reading time
- If small, very portable.
- Usable in adverse environmental conditions.
- Robust and durable.
- Readable when severely damaged.
- Requires no power source, and no alternative reading device like a PC or a palmtop.
- Errors are "forever"; this unchangeability sometimes adds to its value.
- Has more value as "collector's items," e.g., first editions
- At the moment, print books are primarily published by established houses including numerous international conglomerates, which can result in greater funds available for promotion of a title.
Disadvantages
- From the user's point of view: Can be priced in a way that inhibits availability
- From the user's and author's point of view: Can be put out of print and made unavailable to readers
- From the author's point of view: Can be difficult to get a publisher to amend errata
- Can be an awkward format for reference works or works that have many internal cross-references.
- An expensive format for works that require color, since color printing commonly requires several passes of paper through the press (typically three to five passes).
Marching to Another Drum
This article's tone or style may not reflect the encyclopedic tone used on Wikipedia. |
Perhaps the most illustrative of how rapidly evolving and hotly contested the eBook format and digital data war is the evolving position held by Speculative fiction publisher Baens Books. Baen's began by surveying the field of eBook eFormats available in 1999, and decided to ePublish using cryptographically keyed downloads in several of the more common and useful formats, including the ubiquitous Rich Text File output compatible with Microsoft WORD and Microsoft Windows utility program Write.exe (WordPad).
Subsequently, in early 2004 they experimented with an eMagazine under the urging of writer-editor Eric Flint, which achieved great success— to the point the first two unserialized magazines have been released in regular print with a third in production (See: Grantville Gazettes and become best sellers in print form). Within the year, Flint talked publisher Jim Baen into trying another experiment, which involves offering free copies of already published works in print forming the basis of what Baen's calls the Baen Free Library.
Since the experiment was again financially successful resulting in more printed sales, Baens has since radically extended the concept and scope of the experment such that most of it's early titles in a given Book series are offered free, and somewhat aged newer titles are periodically offered free as well for a limited time. All Baen titles are in the process of being back-converted to the eBook formats, or for new works, being pre-released as subscription-only serialized eBooks similar to the Baen eMagazine experment— in three installments, roughly one third of each work in production.
Baen Links
- Free Library — SF/Alternate History Some eBooks by Baen's offered in HTML (browser) format; these can be printed, though not resized.
Unserialized purchased formats occur during the final month preceding the hardcover release of a work, and formats include Palm Pilot, Rocketbook, RTF and MS Reader versions.
- Baen eBooks for Purchase — This is the rapidly expanding eBooks marketplace. Most titles may be viewed in part for free.
- Baen Books Homepage — Science-Fiction, Fantasy, Alternate History and all subgenres of Speculative Fiction. Clicking on the underlined catalog name will open a preview eBook for all works on Baen.com; contrast with the above restricted content methods.
- Schedule of Baen Books Releases
More on Wikipedia
For additional information, possibly more up-to-date, see wikipedia article: Baen Books#Electronic publishing strategy in the main article Baen Books.
eBooks As An Industry
But, while Baen uses their Free Library to boost their print titles, it cannot be denied that eBooks as its own industry is a growing concern, growth in the double digits yearly, according to the quarterly reports put out by IDPF.
In fact, it may be noted that many of the new eBook publishers are former print publishers, and most established e-publishers offer print versions of many of their titles. The line between the two is fast blurring.
eBooks have their own bestseller lists, including those compiled by IDPF and Fictionwise. They even have two yearly awards for excellence in eBooks. The longest-standing and most inclusive of these is the EPPIE award, given by EPIC since 2000. The other is the Dream Realm Award, first awarded to Speculative Fiction eBooks in 2002.
eBook Industry Links
See also
References
- Doctorow, Cory (February 12, 2004). Ebooks: Neither E, Nor Books, O'Reilly Emerging Technologies Conference
- James, Bradley (November 20, 2002). The Electronic Book: Looking Beyond the Physical Codex, SciNet
- Lynch, Clifford (May 28, 2001). The Battle to Define the Future of the Book in the Digital World, First Monday - Peer reviewed journal on the Internet
- Menta, Robert (December 26, 2000). Read an e-book to your child, go to jail?, MP3 Newswire
- Flint's description of the inspiration for the Library in an USENET post
- Building the Baen Free Library by Eric Flint - a 2002 essay, republished from Library's news at the link above
- Annotated Baen Free eBook Listing - Baen Free Library and Baen CD-ROM Library
External links
This page or section may contain link spam masquerading as content. |
General
- Project Gutenberg
- The Open eBook Forum(IDPF/formerly OeBF)
- The Electronically Published Internet Connection
- The On-Line Books Page, at Penn University
- The Internet Public Library Online Texts Collection
- Facsimile Books & other digitally enhanced Works from: The University of Georgia Libraries (DjVu format)
- Scholarly Electronic Publishing Bibliography
- TeleRead E-Book News
- The Institute for the Future of the Book
Software
- The Openberg project (open-source implementation of the OeBF's recommendations)
- eTextReader An MS-Windows based program, convenient for reading books from project gutenberg; also reads rtf and some other formats
- eBook Edit Pro An MS-Windows based program for compiling HTML (Web) Pages into a self contained .exe file/book
- Newton Book Reader Extension A Firefox extension for viewing Newton books.
- Free eBook Search A Firefox extension for finding free eBooks.
Devices
- Cybook (Bookeen)
- Librie (Sony) (based on eInk) (in Japanese)
- Sony Reader (based on eInk, comes out in 2006)
- eBookwise-1150 (based on former Gemstar technology)
Formats
- Mobipocket's Reader
- E-Book Systems's FlipViewer
- Microsoft's Reader
- Palm Digital Media's Palm Reader
- ExeBook
- DesktopAuthor
- OpenReader consortium
- Plucker eBook converter and Reader
- Newton book information