<?xml version="1.0" encoding="UTF-8"?><rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:atom="http://www.w3.org/2005/Atom"
	xmlns:sy="http://purl.org/rss/1.0/modules/syndication/"
	xmlns:creativeCommons="http://backend.userland.com/creativeCommonsRssModule"
	>
<channel>
	<title>Comments on: The Celerity of Verbosity</title>
	<atom:link href="http://www.robweir.com/blog/2006/10/celerity-of-verbosity.html/feed" rel="self" type="application/rss+xml" />
	<link>http://www.robweir.com/blog/2006/10/celerity-of-verbosity.html?utm_source=rss&#038;utm_medium=rss&#038;utm_campaign=celerity-of-verbosity</link>
	<description>Thinking the unthinkable, pondering the imponderable, effing the ineffable and scruting the inscrutable</description>
	<lastBuildDate>Tue, 07 Feb 2012 11:20:47 +0000</lastBuildDate>
	<sy:updatePeriod>hourly</sy:updatePeriod>
	<sy:updateFrequency>1</sy:updateFrequency>
	<generator>http://wordpress.org/?v=3.3.1</generator>
	<item>
		<title>By: Anonymous</title>
		<link>http://www.robweir.com/blog/2006/10/celerity-of-verbosity.html#comment-2399</link>
		<dc:creator>Anonymous</dc:creator>
		<pubDate>Sun, 16 Aug 2009 02:01:39 +0000</pubDate>
		<guid isPermaLink="false">http://www.robweir.com/blog/2006/10/the-celerity-of-verbosity.html#comment-2399</guid>
		<description>Also state of the art XML parsing technology may also help&lt;br /&gt;&lt;br /&gt;http://vtd-xml.sf.net</description>
		<content:encoded><![CDATA[<p>Also state of the art XML parsing technology may also help</p>
<p><a href="http://vtd-xml.sf.net" rel="nofollow">http://vtd-xml.sf.net</a></p>
]]></content:encoded>
	</item>
	<item>
		<title>By: hAl</title>
		<link>http://www.robweir.com/blog/2006/10/celerity-of-verbosity.html#comment-143</link>
		<dc:creator>hAl</dc:creator>
		<pubDate>Sun, 22 Oct 2006 09:00:00 +0000</pubDate>
		<guid isPermaLink="false">http://www.robweir.com/blog/2006/10/the-celerity-of-verbosity.html#comment-143</guid>
		<description>As the files are all zipped it is also relavant to check the unzip speed as even the most basic of changes probaly needs unzipping. &lt;br/&gt;&lt;br/&gt;A big problem is determining speed will be whether the application unzip all of the files. If you leave fileparts unzipped untill later it might be easier to get a fast early reading performance.&lt;br/&gt;&lt;br/&gt;Comparing speed will be a very difficult thing no matter what. Certain optimisations like the relationship files in OOXML might very very good for performance in certain tasks but might be slower on regular file editting within the office application. Same for many more differences. I think it is very likely that both parties will be able to show a measuring methode that show their format to their advantage.</description>
		<content:encoded><![CDATA[<p>As the files are all zipped it is also relavant to check the unzip speed as even the most basic of changes probaly needs unzipping. </p>
<p>A big problem is determining speed will be whether the application unzip all of the files. If you leave fileparts unzipped untill later it might be easier to get a fast early reading performance.</p>
<p>Comparing speed will be a very difficult thing no matter what. Certain optimisations like the relationship files in OOXML might very very good for performance in certain tasks but might be slower on regular file editting within the office application. Same for many more differences. I think it is very likely that both parties will be able to show a measuring methode that show their format to their advantage.</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Ben Langhinrichs</title>
		<link>http://www.robweir.com/blog/2006/10/celerity-of-verbosity.html#comment-142</link>
		<dc:creator>Ben Langhinrichs</dc:creator>
		<pubDate>Sat, 21 Oct 2006 16:48:00 +0000</pubDate>
		<guid isPermaLink="false">http://www.robweir.com/blog/2006/10/the-celerity-of-verbosity.html#comment-142</guid>
		<description>I have to confirm that I have heard this claim about performance from a couple of different Microsoft sources verbally or in private e-mails.  Whether it is a &quot;whisper campaign&quot; or one of these memes that spread without much oversight, I don&#039;t know.  I know neither one specified &quot;very large spreadsheets&quot;, which makes sense because my area (which both people know) is in word processing docs.  In any case, I think was an excellent analysis.</description>
		<content:encoded><![CDATA[<p>I have to confirm that I have heard this claim about performance from a couple of different Microsoft sources verbally or in private e-mails.  Whether it is a &#8220;whisper campaign&#8221; or one of these memes that spread without much oversight, I don&#8217;t know.  I know neither one specified &#8220;very large spreadsheets&#8221;, which makes sense because my area (which both people know) is in word processing docs.  In any case, I think was an excellent analysis.</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Rob</title>
		<link>http://www.robweir.com/blog/2006/10/celerity-of-verbosity.html#comment-140</link>
		<dc:creator>Rob</dc:creator>
		<pubDate>Sat, 21 Oct 2006 15:14:00 +0000</pubDate>
		<guid isPermaLink="false">http://www.robweir.com/blog/2006/10/the-celerity-of-verbosity.html#comment-140</guid>
		<description>Hi anonymous,&lt;br/&gt;&lt;br/&gt;I&#039;m not a lawyer, so I err on the side of caution.  But even aside from the legal issues, I think it is important to look at the pure parse time for a document.  A document might be created and saved once in a large desktop editor like Office.  But then it will be read many times by other other, smaller tools, search engines, RSS feed generators, etc.  The performance of these smaller tools will be more dominated by parse performance.</description>
		<content:encoded><![CDATA[<p>Hi anonymous,</p>
<p>I&#8217;m not a lawyer, so I err on the side of caution.  But even aside from the legal issues, I think it is important to look at the pure parse time for a document.  A document might be created and saved once in a large desktop editor like Office.  But then it will be read many times by other other, smaller tools, search engines, RSS feed generators, etc.  The performance of these smaller tools will be more dominated by parse performance.</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Anonymous</title>
		<link>http://www.robweir.com/blog/2006/10/celerity-of-verbosity.html#comment-139</link>
		<dc:creator>Anonymous</dc:creator>
		<pubDate>Fri, 20 Oct 2006 12:33:00 +0000</pubDate>
		<guid isPermaLink="false">http://www.robweir.com/blog/2006/10/the-celerity-of-verbosity.html#comment-139</guid>
		<description>That license might not be a real hinderence for independant journalists and consumerorganisations.&lt;br/&gt;&lt;br/&gt;However it does seem better to wait for a final release of MS Office 2007.</description>
		<content:encoded><![CDATA[<p>That license might not be a real hinderence for independant journalists and consumerorganisations.</p>
<p>However it does seem better to wait for a final release of MS Office 2007.</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Rob</title>
		<link>http://www.robweir.com/blog/2006/10/celerity-of-verbosity.html#comment-137</link>
		<dc:creator>Rob</dc:creator>
		<pubDate>Fri, 20 Oct 2006 10:02:00 +0000</pubDate>
		<guid isPermaLink="false">http://www.robweir.com/blog/2006/10/the-celerity-of-verbosity.html#comment-137</guid>
		<description>The relative lack of head-to-head performance tests of OpenOffice compared to the Office 2007 beta is simple to understand.  Microsoft won&#039;t let anyone talk about performance of their beta products.  Take a look at this line from their&lt;br/&gt;End User Licence Agreement (EULA): &quot;7. SCOPE OF LICENSE. ...You may not disclose the results of any benchmark tests of the software to any third party without Microsoft’s prior written approval&quot;.&lt;br/&gt;&lt;br/&gt;So, evidently, they can go around commenting on OpenOffice performance, but no one else can talk about specific performance scenarios and numbers around Office performance.  This is one reason why  you&#039;ll see me posting only numbers from application-neutral performance measurements.</description>
		<content:encoded><![CDATA[<p>The relative lack of head-to-head performance tests of OpenOffice compared to the Office 2007 beta is simple to understand.  Microsoft won&#8217;t let anyone talk about performance of their beta products.  Take a look at this line from their<br />End User Licence Agreement (EULA): &#8220;7. SCOPE OF LICENSE. &#8230;You may not disclose the results of any benchmark tests of the software to any third party without Microsoft’s prior written approval&#8221;.</p>
<p>So, evidently, they can go around commenting on OpenOffice performance, but no one else can talk about specific performance scenarios and numbers around Office performance.  This is one reason why  you&#8217;ll see me posting only numbers from application-neutral performance measurements.</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Rob</title>
		<link>http://www.robweir.com/blog/2006/10/celerity-of-verbosity.html#comment-134</link>
		<dc:creator>Rob</dc:creator>
		<pubDate>Thu, 19 Oct 2006 22:52:00 +0000</pubDate>
		<guid isPermaLink="false">http://www.robweir.com/blog/2006/10/the-celerity-of-verbosity.html#comment-134</guid>
		<description>I think your best bet is to take the analysis for what it is, a demonstration that the parsing of word processor XML files of medium size are robust under tag size inflations of 32x.  The numbers aren&#039;t going to change based on what you think my motivation is.&lt;br/&gt;&lt;br/&gt;Also, keep in mind that not every bit of FUD from Microsoft comes on a web page with embedded metadata that says &quot;FUD&quot;.  Where it is on the web and I know about it, I link to it, as any regular reader of this blog knows.  However, there are other methods for spreading FUD, via personal visits, at conferences, via surrogates or in phone calls.  &lt;br/&gt;&lt;br/&gt;For well over a year now the whisper message from Microsoft has been generically that &quot;ODF is slow&quot;.  I&#039;ve heard it relayed by customers and I&#039;ve heard it told from conference attendees.  What I&#039;m showing is that this categorical statement is simplistic and, in the case of word processor files, clearly incorrect.&lt;br/&gt;&lt;br/&gt;I&#039;ll deal with spreadsheet files, including the monster ones another time.  Python code doesn&#039;t exactly write itself.  I&#039;ll even use your document as an example if you think it is typical of the hard-core Excel user.  You can get my address under the &quot;Who Am I&quot; link on the main page.</description>
		<content:encoded><![CDATA[<p>I think your best bet is to take the analysis for what it is, a demonstration that the parsing of word processor XML files of medium size are robust under tag size inflations of 32x.  The numbers aren&#8217;t going to change based on what you think my motivation is.</p>
<p>Also, keep in mind that not every bit of FUD from Microsoft comes on a web page with embedded metadata that says &#8220;FUD&#8221;.  Where it is on the web and I know about it, I link to it, as any regular reader of this blog knows.  However, there are other methods for spreading FUD, via personal visits, at conferences, via surrogates or in phone calls.  </p>
<p>For well over a year now the whisper message from Microsoft has been generically that &#8220;ODF is slow&#8221;.  I&#8217;ve heard it relayed by customers and I&#8217;ve heard it told from conference attendees.  What I&#8217;m showing is that this categorical statement is simplistic and, in the case of word processor files, clearly incorrect.</p>
<p>I&#8217;ll deal with spreadsheet files, including the monster ones another time.  Python code doesn&#8217;t exactly write itself.  I&#8217;ll even use your document as an example if you think it is typical of the hard-core Excel user.  You can get my address under the &#8220;Who Am I&#8221; link on the main page.</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: davidacoder</title>
		<link>http://www.robweir.com/blog/2006/10/celerity-of-verbosity.html#comment-133</link>
		<dc:creator>davidacoder</dc:creator>
		<pubDate>Thu, 19 Oct 2006 22:16:00 +0000</pubDate>
		<guid isPermaLink="false">http://www.robweir.com/blog/2006/10/the-celerity-of-verbosity.html#comment-133</guid>
		<description>&quot;I&#039;ve been hearing some rumblings from the north-west that Ecma Office Open XML (OOXML) format has better performance characteristics than OpenDocument Format (ODF), specifically because OOXML uses shorter tag names.&quot;&lt;br/&gt;&lt;br/&gt;But you were refering to claims from MS about perf, right? I am sorry, I was under the impression that you meant Brian&#039;s post, but apparently you weren&#039;t.&lt;br/&gt;&lt;br/&gt;Can you let us know then what you were refering to? Ideally with a link? Because I just can&#039;t find a claim from MS side anywhere that their file format loads faster for a 60 page word document than ODF. But if you wanted to bust a myth, I assume you do believe they claimed that. Or am I missing something completly here?</description>
		<content:encoded><![CDATA[<p>&#8220;I&#8217;ve been hearing some rumblings from the north-west that Ecma Office Open XML (OOXML) format has better performance characteristics than OpenDocument Format (ODF), specifically because OOXML uses shorter tag names.&#8221;</p>
<p>But you were refering to claims from MS about perf, right? I am sorry, I was under the impression that you meant Brian&#8217;s post, but apparently you weren&#8217;t.</p>
<p>Can you let us know then what you were refering to? Ideally with a link? Because I just can&#8217;t find a claim from MS side anywhere that their file format loads faster for a 60 page word document than ODF. But if you wanted to bust a myth, I assume you do believe they claimed that. Or am I missing something completly here?</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Rob</title>
		<link>http://www.robweir.com/blog/2006/10/celerity-of-verbosity.html#comment-132</link>
		<dc:creator>Rob</dc:creator>
		<pubDate>Thu, 19 Oct 2006 22:06:00 +0000</pubDate>
		<guid isPermaLink="false">http://www.robweir.com/blog/2006/10/the-celerity-of-verbosity.html#comment-132</guid>
		<description>Chill a little, Davidacoder.  My original post was not in response to Brian&#039;s post at all.  In fact, I was not aware of it until you mentioned it. &lt;br/&gt;&lt;br/&gt;In any case, I&#039;m discussing word processing documents today, not spreadsheets, and no amount of bellyaching from you really changes that fact, does it?  You can make all sorts of assumptions about what you think I should be talking about and who I should be responding to, but I assure you that such speculations are a waste of your time, my readers&#039; time and will not be entertained further.</description>
		<content:encoded><![CDATA[<p>Chill a little, Davidacoder.  My original post was not in response to Brian&#8217;s post at all.  In fact, I was not aware of it until you mentioned it. </p>
<p>In any case, I&#8217;m discussing word processing documents today, not spreadsheets, and no amount of bellyaching from you really changes that fact, does it?  You can make all sorts of assumptions about what you think I should be talking about and who I should be responding to, but I assure you that such speculations are a waste of your time, my readers&#8217; time and will not be entertained further.</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: davidacoder</title>
		<link>http://www.robweir.com/blog/2006/10/celerity-of-verbosity.html#comment-131</link>
		<dc:creator>davidacoder</dc:creator>
		<pubDate>Thu, 19 Oct 2006 21:29:00 +0000</pubDate>
		<guid isPermaLink="false">http://www.robweir.com/blog/2006/10/the-celerity-of-verbosity.html#comment-131</guid>
		<description>&quot;Are you talking about that 130MB monster spreadsheeet?&quot;&lt;br/&gt;&lt;br/&gt;Yes. Have you actually read Brian&#039;s post?!? He very clearly states that.&lt;br/&gt;&lt;br/&gt;&quot;The fact that Brian claims to have seen even larger documents is not relevant. It would be futile to compete with Brian on who has seen the largest documents&quot;&lt;br/&gt;&lt;br/&gt;That is not what I said. Brian claims on his blog that this size of Excel files is not a &quot;fringe case&quot;. From my own experience (economic research) I can only confirm that. We regularily work with Excel files in that size range. This is not about &quot;who has seen the biggest&quot;, but about &quot;is this a very rare case, or something quite common&quot;. At least in the area I am working in it is common and often used. Brian seems to conclude the same from his data.&lt;br/&gt;&lt;br/&gt;&quot;So picking a 60 page document is certainly a fair example of a medium/medium-large sized document&quot;&lt;br/&gt;&lt;br/&gt;But why on earth have you picked a word document at all?? I assume you are responding to claims from MS that their short tag names are faster. Well, the most detailed claim of that is in Brian&#039;s post to which I linked. And he is ONLY talking about spreadsheets in that one. Now, spreadsheets are not split up into smaller files and files of the size Brian mentions are commonly used. From what you write it seems to be clear that you believe this is a rare case and that therefore a perf hit for those scenarios would be ok for ODF. Fine, but I am sure glad that MS creatd its own file format in that case, because in my daily work I care a great deal about performance of the tools I use. And I would NOT accept a new version of Excel that had significantly worse load times than older version.&lt;br/&gt;&lt;br/&gt;My basic point is: MS claimed that long tag names are not good for perf with large spreadsheet files. And you try to bust that &quot;myth&quot; by showing average parsing time for a word document with 60 pages. That just does not make any sense at all.</description>
		<content:encoded><![CDATA[<p>&#8220;Are you talking about that 130MB monster spreadsheeet?&#8221;</p>
<p>Yes. Have you actually read Brian&#8217;s post?!? He very clearly states that.</p>
<p>&#8220;The fact that Brian claims to have seen even larger documents is not relevant. It would be futile to compete with Brian on who has seen the largest documents&#8221;</p>
<p>That is not what I said. Brian claims on his blog that this size of Excel files is not a &#8220;fringe case&#8221;. From my own experience (economic research) I can only confirm that. We regularily work with Excel files in that size range. This is not about &#8220;who has seen the biggest&#8221;, but about &#8220;is this a very rare case, or something quite common&#8221;. At least in the area I am working in it is common and often used. Brian seems to conclude the same from his data.</p>
<p>&#8220;So picking a 60 page document is certainly a fair example of a medium/medium-large sized document&#8221;</p>
<p>But why on earth have you picked a word document at all?? I assume you are responding to claims from MS that their short tag names are faster. Well, the most detailed claim of that is in Brian&#8217;s post to which I linked. And he is ONLY talking about spreadsheets in that one. Now, spreadsheets are not split up into smaller files and files of the size Brian mentions are commonly used. From what you write it seems to be clear that you believe this is a rare case and that therefore a perf hit for those scenarios would be ok for ODF. Fine, but I am sure glad that MS creatd its own file format in that case, because in my daily work I care a great deal about performance of the tools I use. And I would NOT accept a new version of Excel that had significantly worse load times than older version.</p>
<p>My basic point is: MS claimed that long tag names are not good for perf with large spreadsheet files. And you try to bust that &#8220;myth&#8221; by showing average parsing time for a word document with 60 pages. That just does not make any sense at all.</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: hAl</title>
		<link>http://www.robweir.com/blog/2006/10/celerity-of-verbosity.html#comment-130</link>
		<dc:creator>hAl</dc:creator>
		<pubDate>Thu, 19 Oct 2006 19:52:00 +0000</pubDate>
		<guid isPermaLink="false">http://www.robweir.com/blog/2006/10/the-celerity-of-verbosity.html#comment-130</guid>
		<description>[quote]But I&#039;d note that large documents tend to be broken into smaller documents, for technological as well as organizatioal reasons[/quote]&lt;br/&gt;&lt;br/&gt;For us one of the only reason to go over to MS Office 2007 is the ability to do large scale Excel spreadsheet. The size of current  sheets is really poor which I do not like in Office 2003 and lower.&lt;br/&gt;&lt;br/&gt;Large scale spreadsheet require fast performing applications and MS Office is pretty good at it. It might not all be to do with the file optimisation but I think other things might also be important. Like the weird ISO date format ODF uses which is not very good if they actually us it in spreadsheets. &lt;br/&gt;&lt;br/&gt;Clearly at the moment Excel is definitly the better option for large spreadsheets. It is fast and has better funtionality. OOo is not on the same level. &lt;br/&gt;However for quite a few companies large scale spreadsheets are less important and so I do not think it should be an important issue. However I do not think it is wise to discuss performance untill OOo manages to squeezes out a lot more performance.&lt;br/&gt;&lt;br/&gt;However as I already indicated the long tagging in a standard format is of no real use. Descriptive naming is not needed so it is not really usefull.</description>
		<content:encoded><![CDATA[<p>[quote]But I&#8217;d note that large documents tend to be broken into smaller documents, for technological as well as organizatioal reasons[/quote]</p>
<p>For us one of the only reason to go over to MS Office 2007 is the ability to do large scale Excel spreadsheet. The size of current  sheets is really poor which I do not like in Office 2003 and lower.</p>
<p>Large scale spreadsheet require fast performing applications and MS Office is pretty good at it. It might not all be to do with the file optimisation but I think other things might also be important. Like the weird ISO date format ODF uses which is not very good if they actually us it in spreadsheets. </p>
<p>Clearly at the moment Excel is definitly the better option for large spreadsheets. It is fast and has better funtionality. OOo is not on the same level. <br />However for quite a few companies large scale spreadsheets are less important and so I do not think it should be an important issue. However I do not think it is wise to discuss performance untill OOo manages to squeezes out a lot more performance.</p>
<p>However as I already indicated the long tagging in a standard format is of no real use. Descriptive naming is not needed so it is not really usefull.</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Rob</title>
		<link>http://www.robweir.com/blog/2006/10/celerity-of-verbosity.html#comment-129</link>
		<dc:creator>Rob</dc:creator>
		<pubDate>Thu, 19 Oct 2006 19:35:00 +0000</pubDate>
		<guid isPermaLink="false">http://www.robweir.com/blog/2006/10/the-celerity-of-verbosity.html#comment-129</guid>
		<description>davidacoder,&lt;br/&gt;&lt;br/&gt;Are you talking about that 130MB monster spreadsheeet?  I&#039;m not sure that proves much of anything.  The fact that Brian claims to have seen even larger documents is not relevant.  It would be futile to compete with Brian on who has seen the largest documents.  His experience with OOXML and its 6,000+ specification would win hands down.  But I&#039;d note that large documents tend to be broken into smaller documents, for technological as well as organizatioal reasons.  Even the OOXML spec ended up being split into 5 parts.&lt;br/&gt;&lt;br/&gt;However, I think the important question is what is the size range of typical documents.  We need to optimize for those.  You want the monster documents not to suffer too much of course, but you want to optimize for the common case.&lt;br/&gt;&lt;br/&gt;For Word documents, in one group I looked at (Ecma T45) the most common sized document was 2 pages.  The mean length was 34 pages, the median length 8 pages.  So picking a 60 page document is certainly a fair example of a medium/medium-large sized document.</description>
		<content:encoded><![CDATA[<p>davidacoder,</p>
<p>Are you talking about that 130MB monster spreadsheeet?  I&#8217;m not sure that proves much of anything.  The fact that Brian claims to have seen even larger documents is not relevant.  It would be futile to compete with Brian on who has seen the largest documents.  His experience with OOXML and its 6,000+ specification would win hands down.  But I&#8217;d note that large documents tend to be broken into smaller documents, for technological as well as organizatioal reasons.  Even the OOXML spec ended up being split into 5 parts.</p>
<p>However, I think the important question is what is the size range of typical documents.  We need to optimize for those.  You want the monster documents not to suffer too much of course, but you want to optimize for the common case.</p>
<p>For Word documents, in one group I looked at (Ecma T45) the most common sized document was 2 pages.  The mean length was 34 pages, the median length 8 pages.  So picking a 60 page document is certainly a fair example of a medium/medium-large sized document.</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Rob</title>
		<link>http://www.robweir.com/blog/2006/10/celerity-of-verbosity.html#comment-128</link>
		<dc:creator>Rob</dc:creator>
		<pubDate>Thu, 19 Oct 2006 19:10:00 +0000</pubDate>
		<guid isPermaLink="false">http://www.robweir.com/blog/2006/10/the-celerity-of-verbosity.html#comment-128</guid>
		<description>orchmid,&lt;br/&gt;&lt;br/&gt;I didn&#039;t intentionally republish or change to short descriptions.  Must be Thursday, as you say.&lt;br/&gt;&lt;br/&gt;The trick in my mind is to design a test that removes the application performance from the equation.  Why?   Personally, I&#039;m more interested in the XML aspects.  Also, Microsoft&#039;s EULA forbids anyone from releasing benchmark numbers on the Office 12 beta without Microsoft&#039;s permission.&lt;br/&gt;&lt;br/&gt;As you say, the packaging of the document in the Zip is an important part of the performance, something not measured in this test.  However, I have done tests of that before, and did discuss them at the OpenOffice.org conference in Lyon a few weeks ago.  The way OOXML packages the XML, into more and smaller XML documents than ODF, actually causes the same document to   require more time to parse in OOXML format compared to ODF format.   I&#039;ll write up those results in blog form soon.  In any case, by looking at just the XML document without the  zip archive, I&#039;m better able to target just the impact of the tag name length, which was my object.&lt;br/&gt;&lt;br/&gt;Btw, you might want to give OpenOffice.org 2.04 a try.  They are claiming significant performance improvements.</description>
		<content:encoded><![CDATA[<p>orchmid,</p>
<p>I didn&#8217;t intentionally republish or change to short descriptions.  Must be Thursday, as you say.</p>
<p>The trick in my mind is to design a test that removes the application performance from the equation.  Why?   Personally, I&#8217;m more interested in the XML aspects.  Also, Microsoft&#8217;s EULA forbids anyone from releasing benchmark numbers on the Office 12 beta without Microsoft&#8217;s permission.</p>
<p>As you say, the packaging of the document in the Zip is an important part of the performance, something not measured in this test.  However, I have done tests of that before, and did discuss them at the OpenOffice.org conference in Lyon a few weeks ago.  The way OOXML packages the XML, into more and smaller XML documents than ODF, actually causes the same document to   require more time to parse in OOXML format compared to ODF format.   I&#8217;ll write up those results in blog form soon.  In any case, by looking at just the XML document without the  zip archive, I&#8217;m better able to target just the impact of the tag name length, which was my object.</p>
<p>Btw, you might want to give OpenOffice.org 2.04 a try.  They are claiming significant performance improvements.</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: orcmid</title>
		<link>http://www.robweir.com/blog/2006/10/celerity-of-verbosity.html#comment-127</link>
		<dc:creator>orcmid</dc:creator>
		<pubDate>Thu, 19 Oct 2006 17:52:00 +0000</pubDate>
		<guid isPermaLink="false">http://www.robweir.com/blog/2006/10/the-celerity-of-verbosity.html#comment-127</guid>
		<description>Rob, I think you have to look at the impact on working with the compressed package, whether .docx or .odt.&lt;br/&gt;&lt;br/&gt;I compared the sizes of the PDFs and the .docx (downloaded as .zip) for the TC45 Final Draft documents.  In only one case is the .docx larger than the PDF, and the shrinkage cases are quite startling.&lt;br/&gt;&lt;br/&gt;My anecdotal experience (Using OO.o 2.0.2 (the just-appearing release may be better) and Excel 2003 (! with the Microsoft beta plug-in for OOX formats) is odd.  Here&#039;s a spreadsheet that I use every day and an odd experience with it:&lt;br/&gt;&lt;br/&gt;Excel 2003 binary format, .xls: 144,896 bytes (all numbers as the files were on 2006-10-16 pm)&lt;br/&gt;&lt;br/&gt;Excel 2007 beta-OOX format, .xlsx: 64,208 bytes (this is the one I work from just for the experience of it)&lt;br/&gt;&lt;br/&gt;OO.o 2.02 Calc ODF format, .ods: 54,041 bytes&lt;br/&gt;&lt;br/&gt;The weird thing is that the .xls goes into OO.o pretty quickly, and OO.o saves it as .ods nicely too.  But opening the .ods file is painful. More painful than opening the .xlsx in Excel 2003, where it has to go through a not-so-optimized conversion.  And way more painful than opening the .xls file in OO.o 2.0.2.&lt;br/&gt;&lt;br/&gt;I suspect there are variances all over the map here.  Also, there&#039;s no requirement that long QName prefixes be used in ODF (office: could be changed to o: for example), and no requirement that short ones be used in OOX. Namings of the namespace elements are fixed, but we can play with the other bits if it matters.  &lt;br/&gt;&lt;br/&gt;We will have to watch how implementations are tuned over time to compete with each other for snappy user experience.</description>
		<content:encoded><![CDATA[<p>Rob, I think you have to look at the impact on working with the compressed package, whether .docx or .odt.</p>
<p>I compared the sizes of the PDFs and the .docx (downloaded as .zip) for the TC45 Final Draft documents.  In only one case is the .docx larger than the PDF, and the shrinkage cases are quite startling.</p>
<p>My anecdotal experience (Using OO.o 2.0.2 (the just-appearing release may be better) and Excel 2003 (! with the Microsoft beta plug-in for OOX formats) is odd.  Here&#8217;s a spreadsheet that I use every day and an odd experience with it:</p>
<p>Excel 2003 binary format, .xls: 144,896 bytes (all numbers as the files were on 2006-10-16 pm)</p>
<p>Excel 2007 beta-OOX format, .xlsx: 64,208 bytes (this is the one I work from just for the experience of it)</p>
<p>OO.o 2.02 Calc ODF format, .ods: 54,041 bytes</p>
<p>The weird thing is that the .xls goes into OO.o pretty quickly, and OO.o saves it as .ods nicely too.  But opening the .ods file is painful. More painful than opening the .xlsx in Excel 2003, where it has to go through a not-so-optimized conversion.  And way more painful than opening the .xls file in OO.o 2.0.2.</p>
<p>I suspect there are variances all over the map here.  Also, there&#8217;s no requirement that long QName prefixes be used in ODF (office: could be changed to o: for example), and no requirement that short ones be used in OOX. Namings of the namespace elements are fixed, but we can play with the other bits if it matters.  </p>
<p>We will have to watch how implementations are tuned over time to compete with each other for snappy user experience.</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: davidacoder</title>
		<link>http://www.robweir.com/blog/2006/10/celerity-of-verbosity.html#comment-126</link>
		<dc:creator>davidacoder</dc:creator>
		<pubDate>Thu, 19 Oct 2006 16:27:00 +0000</pubDate>
		<guid isPermaLink="false">http://www.robweir.com/blog/2006/10/the-celerity-of-verbosity.html#comment-126</guid>
		<description>Oh common, nothing busted.&lt;br/&gt;&lt;br/&gt;First, it would be fair to link Brian&#039;s post where he discussed the issue of tag length in depth. &lt;a HREF=&quot;http://blogs.msdn.com/brian_jones/archive/2006/05/16/599096.aspx&quot; REL=&quot;nofollow&quot; rel=&quot;nofollow&quot;&gt;Here it is&lt;/a&gt;.&lt;br/&gt;&lt;br/&gt;Now, if you read Brian&#039;s post, he in particular talks about large Excel files, his example has more than 7 million tags. He also says that they see usage of such (and larger) files a lot with their user base. So that is the scenario for which he claims short tag names are important.&lt;br/&gt;&lt;br/&gt;And you compare it to a 60 page (large?!?) text document with 51000 tags. Give me a break. That just shows nothing.</description>
		<content:encoded><![CDATA[<p>Oh common, nothing busted.</p>
<p>First, it would be fair to link Brian&#8217;s post where he discussed the issue of tag length in depth. <a HREF="http://blogs.msdn.com/brian_jones/archive/2006/05/16/599096.aspx" REL="nofollow" rel="nofollow">Here it is</a>.</p>
<p>Now, if you read Brian&#8217;s post, he in particular talks about large Excel files, his example has more than 7 million tags. He also says that they see usage of such (and larger) files a lot with their user base. So that is the scenario for which he claims short tag names are important.</p>
<p>And you compare it to a 60 page (large?!?) text document with 51000 tags. Give me a break. That just shows nothing.</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Anonymous</title>
		<link>http://www.robweir.com/blog/2006/10/celerity-of-verbosity.html#comment-125</link>
		<dc:creator>Anonymous</dc:creator>
		<pubDate>Thu, 19 Oct 2006 14:42:00 +0000</pubDate>
		<guid isPermaLink="false">http://www.robweir.com/blog/2006/10/the-celerity-of-verbosity.html#comment-125</guid>
		<description>Interesting. If we turn the table... What could be the reasoning behind using short and obscure tag names in OOXML?</description>
		<content:encoded><![CDATA[<p>Interesting. If we turn the table&#8230; What could be the reasoning behind using short and obscure tag names in OOXML?</p>
]]></content:encoded>
	</item>
</channel>
</rss>

<!-- Dynamic Page Served (once) in 0.565 seconds -->

