Document standard: Difference between revisions
m (using an external editor) |
|||
Line 5: | Line 5: | ||
Document standards can be defined in terms of: | Document standards can be defined in terms of: | ||
* Document file formats, i.e. text or binary formats for storing documents an storage media. | * Document file formats, i.e. text or binary formats for storing documents an storage media. | ||
* Content structure, usually an XML application | * Content structure ([[structured authoring]], usually an [[XML]] application | ||
== Document file formats == | == Document file formats == | ||
Line 15: | Line 15: | ||
=== Markup languages that separate content from style === | === Markup languages that separate content from style === | ||
These are somewhat human readable | These so-called [[structured authoring]] formats are somewhat human readable. | ||
* [[DocBook]] (XML or SGML) was originally | * [[DocBook]] (XML or SGML) was originally intended for authoring technical documents but can be used for almost any kind of document. | ||
* [[DITA]] | * [[DITA]] | ||
* [[XHTML]] strict | * [[XHTML]] strict | ||
Line 24: | Line 24: | ||
=== Specialized markup languages for education === | === Specialized markup languages for education === | ||
We don't consider [[educational modeling language]]s like [[IMS Learning Design]], [[IMS Simple Sequencing]] etc. and that are | We don't consider [[educational modeling language]]s like [[IMS Learning Design]], [[IMS Simple Sequencing]] etc. and that are based on the [[IMS Content Packaging]] "logic" to be document standards, because contents themselves are not in any way marked up. They are just assembled in menu structures. | ||
One also could extend/specialize DITA and DocBook with educational markup. We also think that XHTML 2.0 may have potential for that kind of things. | One also could extend/specialize DITA and DocBook with educational markup. We also think that XHTML 2.0 may have potential for that kind of things. | ||
Line 37: | Line 37: | ||
* OASIS Open Document Format for Office Applications (ODF): [http://xml.openoffice.org/ Open Office XML Project]. This standard is used by [http://www.openoffice.org/ Open Office] and [http://symphony.lotus.com/software/lotus/symphony Lotus Symphony] | * OASIS Open Document Format for Office Applications (ODF): [http://xml.openoffice.org/ Open Office XML Project]. This standard is used by [http://www.openoffice.org/ Open Office] and [http://symphony.lotus.com/software/lotus/symphony Lotus Symphony] | ||
* Microsoft XML | * Microsoft XML Reference Schemas (including WordML, SpreadsheetML, etc.): [http://www.microsoft.com/office/xml/default.mspx Office 2003 XML Reference Schemas] | ||
* [http://labs.adobe.com/wiki/index.php/Mars Adobe MARS]. code name for technology being developed by Adobe that provides an Extensible Markup Language (XML)-based representation of Portable Document Format (PDF) documents. (There is win/mac PDF plugin for this right now, nov 2007). | * [http://labs.adobe.com/wiki/index.php/Mars Adobe MARS]. code name for technology being developed by Adobe that provides an Extensible Markup Language (XML)-based representation of Portable Document Format (PDF) documents. (There is win/mac PDF plugin for this right now, nov 2007). | ||
* Microsoft RTF, an older standard that is being phased out | * Microsoft RTF, an older standard that is being phased out | ||
* [[Adobe | * [[Adobe FrameMaker]] *.MIF | ||
=== Binary file formats === | === Binary file formats === | ||
Line 47: | Line 47: | ||
* Microsoft *.doc | * Microsoft *.doc | ||
* | * FrameMaker *.fm | ||
=== Specialized markup formats for the Internet === | === Specialized markup formats for the Internet === | ||
Line 68: | Line 68: | ||
* All sorts of computers with a monitor | * All sorts of computers with a monitor | ||
* Mobile devices | * Mobile devices | ||
* | * Refreshable electronic paper | ||
Revision as of 10:01, 25 September 2008
Definition
Document standards can be defined in terms of:
- Document file formats, i.e. text or binary formats for storing documents an storage media.
- Content structure (structured authoring, usually an XML application
Document file formats
There are a lot of these, see Wikipedia's document file format article or the full List of file formats
See also: e-book formats
Markup languages that separate content from style
These so-called structured authoring formats are somewhat human readable.
- DocBook (XML or SGML) was originally intended for authoring technical documents but can be used for almost any kind of document.
- DITA
- XHTML strict
- TEX and related languages like Latex
Specialized markup languages for education
We don't consider educational modeling languages like IMS Learning Design, IMS Simple Sequencing etc. and that are based on the IMS Content Packaging "logic" to be document standards, because contents themselves are not in any way marked up. They are just assembled in menu structures.
One also could extend/specialize DITA and DocBook with educational markup. We also think that XHTML 2.0 may have potential for that kind of things.
The following is some sorts of document standard:
- eLML - eLesson Markup Language. In Daniel K. Schneider's opinion, about the only stable educational content markup language there is. See also Stephen Downes comment. eLML is focussed on lessons, which is of course a limitation for general purpose content markup.
Messy markup languages
These markup content, style and other things together and are not really human readable.
- OASIS Open Document Format for Office Applications (ODF): Open Office XML Project. This standard is used by Open Office and Lotus Symphony
- Microsoft XML Reference Schemas (including WordML, SpreadsheetML, etc.): Office 2003 XML Reference Schemas
- Adobe MARS. code name for technology being developed by Adobe that provides an Extensible Markup Language (XML)-based representation of Portable Document Format (PDF) documents. (There is win/mac PDF plugin for this right now, nov 2007).
- Microsoft RTF, an older standard that is being phased out
- Adobe FrameMaker *.MIF
Binary file formats
Not human readable
- Microsoft *.doc
- FrameMaker *.fm
Specialized markup formats for the Internet
These are usually combined within other formats, e.g. XHTML. Most important formats are:
- Display of mathematical formula: MathML
- Vector Graphics: SVG (or the defunct Microsoft WML format)
- Multimedia sequencing: SMIL
- Linking: XLink (this is not supported in XHTML 2)
- Forms: XForms
- Voice markup: VoiceML
For now, integration of various vocabularies into main-stream web pages is not obvious. Only standards-aware browsers like Firefox can handle for instance XHTML + SVG + MathML. In addition, editing is not always easy since there are no official combined DTD's available, although a general standard for this (Compound Document by Reference Framework 1.0) is almost ready.
Hardware
- Paper
- All sorts of computers with a monitor
- Mobile devices
- Refreshable electronic paper
Links
- Wikipedia's Comparison of document markup languages
- http://www.answers.com/topic/document-file-format
- http://en.wikipedia.org/wiki/List_of_file_formats
- http://en.wikipedia.org/wiki/List_of_document_markup_languages
- http://en.wikipedia.org/wiki/OpenDocument
- http://en.wikipedia.org/wiki/Standard_office_document_formats_debate
- http://en.wikipedia.org/wiki/WordprocessingML