<?xml version="1.0" encoding="UTF-8"?><rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:atom="http://www.w3.org/2005/Atom"
	xmlns:sy="http://purl.org/rss/1.0/modules/syndication/"
	xmlns:creativeCommons="http://backend.userland.com/creativeCommonsRssModule"
	>
<channel>
	<title>Comments on: Documents for the Long Term</title>
	<atom:link href="http://www.robweir.com/blog/2007/06/documents-for-long-term.html/feed" rel="self" type="application/rss+xml" />
	<link>http://www.robweir.com/blog/2007/06/documents-for-long-term.html</link>
	<description>Thinking the unthinkable, pondering the imponderable, effing the ineffable and scruting the inscrutable</description>
	<lastBuildDate>Wed, 10 Mar 2010 01:51:22 +0000</lastBuildDate>
	<generator>http://wordpress.org/?v=2.9.2</generator>
	<sy:updatePeriod>hourly</sy:updatePeriod>
	<sy:updateFrequency>1</sy:updateFrequency>
		<item>
		<title>By: BobFolkerts</title>
		<link>http://www.robweir.com/blog/2007/06/documents-for-long-term.html#comment-782</link>
		<dc:creator>BobFolkerts</dc:creator>
		<pubDate>Mon, 11 Jun 2007 16:48:00 +0000</pubDate>
		<guid isPermaLink="false">http://www.robweir.com/blog/2007/06/documents-for-the-long-term.html#comment-782</guid>
		<description>@wraith&lt;br/&gt;&lt;br/&gt;My first reaction was much like yours, that this was a somewhat confused apples vs. orange comparison.  But upon reflection, there is an important kernel of truth in Rob&#039;s argument.  There is no denying that the design of the OOXML is based upon the older binary model.  Microsoft&#039;s fundamental claim for using OOXML is that is offers better integration with existing documents.  This means that they have to design XML to imitate/integrate with/replicate the data structures in the older binary specifications.  So Microsoft has to defend design decision make for a different media (binary) in their XML.  &lt;br/&gt;&lt;br/&gt;Even though they use XML, there are some pretty serious design mismatches.  The most glaring one to me is using bitmasks in XML.  The O&#039;Reilly XSLT cookbook shows how you can handle bitmasks, but is is a kludge and you have to make assumtions about mapping bit arrays to numbers.  Endian-ness (big vs. little) means that the XSLT will not be portable between computer architectures.  If you can show me a better XSLT transform, I would be happy to retract this criticism.  I do not want to hear about some non-XSLT tool that runs on .Net 3.0.&lt;br/&gt;&lt;br/&gt;I also find it curious that you claim that including dedicated markup languages in the OOXML standard is somehow equivalent with using existing standards.  Especially since your first example (Dublin Core) is not defined by the OOXML format spec, although it does appear that the standard does define what subset of the Dublin Core is used.  Perhaps &#039;dublin core&#039; refers to the subset of &#039;Dublin Core&#039; that OOXML supports.  &lt;br/&gt;&lt;br/&gt;The reason that I want existing standards to be used is that I am lazy.  Somebody else has already implemented tools to manipulate standard XML, so I can simply used debugged tools that I can often find with Google in a few minutes.  So to me, the fact that OOXML defines some new dedicated markup languages is a pain the the backside, not a feature - I may have to write more code, especially if I&#039;m not coding in .Net.</description>
		<content:encoded><![CDATA[<p>@wraith</p>
<p>My first reaction was much like yours, that this was a somewhat confused apples vs. orange comparison.  But upon reflection, there is an important kernel of truth in Rob&#8217;s argument.  There is no denying that the design of the OOXML is based upon the older binary model.  Microsoft&#8217;s fundamental claim for using OOXML is that is offers better integration with existing documents.  This means that they have to design XML to imitate/integrate with/replicate the data structures in the older binary specifications.  So Microsoft has to defend design decision make for a different media (binary) in their XML.  </p>
<p>Even though they use XML, there are some pretty serious design mismatches.  The most glaring one to me is using bitmasks in XML.  The O&#8217;Reilly XSLT cookbook shows how you can handle bitmasks, but is is a kludge and you have to make assumtions about mapping bit arrays to numbers.  Endian-ness (big vs. little) means that the XSLT will not be portable between computer architectures.  If you can show me a better XSLT transform, I would be happy to retract this criticism.  I do not want to hear about some non-XSLT tool that runs on .Net 3.0.</p>
<p>I also find it curious that you claim that including dedicated markup languages in the OOXML standard is somehow equivalent with using existing standards.  Especially since your first example (Dublin Core) is not defined by the OOXML format spec, although it does appear that the standard does define what subset of the Dublin Core is used.  Perhaps &#8216;dublin core&#8217; refers to the subset of &#8216;Dublin Core&#8217; that OOXML supports.  </p>
<p>The reason that I want existing standards to be used is that I am lazy.  Somebody else has already implemented tools to manipulate standard XML, so I can simply used debugged tools that I can often find with Google in a few minutes.  So to me, the fact that OOXML defines some new dedicated markup languages is a pain the the backside, not a feature &#8211; I may have to write more code, especially if I&#8217;m not coding in .Net.</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Rob</title>
		<link>http://www.robweir.com/blog/2007/06/documents-for-long-term.html#comment-780</link>
		<dc:creator>Rob</dc:creator>
		<pubDate>Thu, 07 Jun 2007 19:17:00 +0000</pubDate>
		<guid isPermaLink="false">http://www.robweir.com/blog/2007/06/documents-for-the-long-term.html#comment-780</guid>
		<description>@Richard,&lt;br/&gt;&lt;br/&gt;If we were all creating plain text documents, then that would be all we would need to worry about for preservation.  But if we create documents with graphics, even movies, then these may need to be preserved as well.&lt;br/&gt;&lt;br/&gt;One approach would be to have something like ODF/A, an archival profile of ODF that restricted what type of embedded media could be used.  For example, it may only allow open formats like PNG images, Ogg Vorbis for audio, etc.&lt;br/&gt;&lt;br/&gt;Also, having a structured document, with clearly defined headers, footnotes, etc., defined by specific markup rather than merely placed conventionally in expected locations, is a necessity for the tools that allow screen readers and similar tools to read documents for the blind.&lt;br/&gt;&lt;br/&gt;So one way or another, documents via markup rather than plain text is here to stay.&lt;br/&gt;&lt;br/&gt;&lt;br/&gt;@Wraith,&lt;br/&gt;&lt;br/&gt;These diagrams are illustrating the critical dependencies, not the traditional layered architectural diagram.  Maybe directed arrows would work better than pyramids?  But I think the approach I used better illustrates the idea that the technology at the base is the most stable, etc.&lt;br/&gt;&lt;br/&gt;Although OOXML uses XML, Unicode, etc., this is certainly not a dependency for them.  In particular, their use of bitmasks that map directly into Windows platform API&#039;s, undocumented binary inclusions, even provisions for Unicode characters not permitted in XML, these all show that the XML is just a trivial wrapper layer, not a dependency or a constraint. &lt;br/&gt;&lt;br/&gt;When you ask &quot;Why did OOXML do something in a particular way?&quot;, the answer is rarely, &quot;Because it is good XML style to do so&quot;.  The answer more typically is, &quot;Because that is how Word does it&quot;.  That is  why I say that Word and Windows are the primary dependencies for OOXML.</description>
		<content:encoded><![CDATA[<p>@Richard,</p>
<p>If we were all creating plain text documents, then that would be all we would need to worry about for preservation.  But if we create documents with graphics, even movies, then these may need to be preserved as well.</p>
<p>One approach would be to have something like ODF/A, an archival profile of ODF that restricted what type of embedded media could be used.  For example, it may only allow open formats like PNG images, Ogg Vorbis for audio, etc.</p>
<p>Also, having a structured document, with clearly defined headers, footnotes, etc., defined by specific markup rather than merely placed conventionally in expected locations, is a necessity for the tools that allow screen readers and similar tools to read documents for the blind.</p>
<p>So one way or another, documents via markup rather than plain text is here to stay.</p>
<p>@Wraith,</p>
<p>These diagrams are illustrating the critical dependencies, not the traditional layered architectural diagram.  Maybe directed arrows would work better than pyramids?  But I think the approach I used better illustrates the idea that the technology at the base is the most stable, etc.</p>
<p>Although OOXML uses XML, Unicode, etc., this is certainly not a dependency for them.  In particular, their use of bitmasks that map directly into Windows platform API&#8217;s, undocumented binary inclusions, even provisions for Unicode characters not permitted in XML, these all show that the XML is just a trivial wrapper layer, not a dependency or a constraint. </p>
<p>When you ask &#8220;Why did OOXML do something in a particular way?&#8221;, the answer is rarely, &#8220;Because it is good XML style to do so&#8221;.  The answer more typically is, &#8220;Because that is how Word does it&#8221;.  That is  why I say that Word and Windows are the primary dependencies for OOXML.</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Anonymous</title>
		<link>http://www.robweir.com/blog/2007/06/documents-for-long-term.html#comment-779</link>
		<dc:creator>Anonymous</dc:creator>
		<pubDate>Thu, 07 Jun 2007 00:50:00 +0000</pubDate>
		<guid isPermaLink="false">http://www.robweir.com/blog/2007/06/documents-for-the-long-term.html#comment-779</guid>
		<description>If plain text was all we needed to worry about, then I don&#039;t think we would be having this discussion.  Documents are becoming more and more dynamic.  If the understanding of my document depended on an animation, and a vendors forced &quot;upgrade&quot; destroyed that animation, then the original message of my document has been lost forever.  &lt;br/&gt;&lt;br/&gt;Richard Chapman</description>
		<content:encoded><![CDATA[<p>If plain text was all we needed to worry about, then I don&#8217;t think we would be having this discussion.  Documents are becoming more and more dynamic.  If the understanding of my document depended on an animation, and a vendors forced &#8220;upgrade&#8221; destroyed that animation, then the original message of my document has been lost forever.  </p>
<p>Richard Chapman</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: The Wraith</title>
		<link>http://www.robweir.com/blog/2007/06/documents-for-long-term.html#comment-778</link>
		<dc:creator>The Wraith</dc:creator>
		<pubDate>Wed, 06 Jun 2007 22:36:00 +0000</pubDate>
		<guid isPermaLink="false">http://www.robweir.com/blog/2007/06/documents-for-the-long-term.html#comment-778</guid>
		<description>You compare very strangely different pyramids. &lt;br/&gt;Actually you can construct the OOXML pyramid the same as the ODF examply you place there. &lt;br/&gt;&lt;br/&gt;OOXML is also using unicode, XML and RelaxNG and using several dedicated markup languages like dublin core and others like DrawingML that are also open due to their inclusion in the OOXML format spec.</description>
		<content:encoded><![CDATA[<p>You compare very strangely different pyramids. <br />Actually you can construct the OOXML pyramid the same as the ODF examply you place there. </p>
<p>OOXML is also using unicode, XML and RelaxNG and using several dedicated markup languages like dublin core and others like DrawingML that are also open due to their inclusion in the OOXML format spec.</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Anonymous</title>
		<link>http://www.robweir.com/blog/2007/06/documents-for-long-term.html#comment-777</link>
		<dc:creator>Anonymous</dc:creator>
		<pubDate>Tue, 05 Jun 2007 20:36:00 +0000</pubDate>
		<guid isPermaLink="false">http://www.robweir.com/blog/2007/06/documents-for-the-long-term.html#comment-777</guid>
		<description>The standard term for the concept of having different layers, with different velocity is, &lt;a HREF=&quot;http://en.wikipedia.org/wiki/Shearing_layers&quot; REL=&quot;nofollow&quot; rel=&quot;nofollow&quot;&gt;&lt;br/&gt;shearing layers&lt;/a&gt;...How Buildings Learn is the classic book on the subject.&lt;br/&gt;&lt;br/&gt;Document persistence is actually a serious issue in architecture, as CAD models aren&#039;t shown to have the longevity yet -and the Airbus 380 disaster shows it applies to engineering systems too. It is claimed that Catia versions were the cause of incompatiblity between  french and german parts of the airplane design.&lt;br/&gt;&lt;br/&gt;Compared to that, doc files arent as serious. IF you want long-lived printable content, PDF is actually very good. XML formats make more sense if you want to have the option of repurposing the content over many years. So does plain text, funnily enough. &lt;br/&gt;-steve</description>
		<content:encoded><![CDATA[<p>The standard term for the concept of having different layers, with different velocity is, <a HREF="http://en.wikipedia.org/wiki/Shearing_layers" REL="nofollow" rel="nofollow"><br />shearing layers</a>&#8230;How Buildings Learn is the classic book on the subject.</p>
<p>Document persistence is actually a serious issue in architecture, as CAD models aren&#8217;t shown to have the longevity yet -and the Airbus 380 disaster shows it applies to engineering systems too. It is claimed that Catia versions were the cause of incompatiblity between  french and german parts of the airplane design.</p>
<p>Compared to that, doc files arent as serious. IF you want long-lived printable content, PDF is actually very good. XML formats make more sense if you want to have the option of repurposing the content over many years. So does plain text, funnily enough. <br />-steve</p>
]]></content:encoded>
	</item>
</channel>
</rss>

<!-- Dynamic Page Served (once) in 0.439 seconds -->
