<?xml version="1.0" encoding="UTF-8"?><rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:atom="http://www.w3.org/2005/Atom"
	xmlns:sy="http://purl.org/rss/1.0/modules/syndication/"
	xmlns:creativeCommons="http://backend.userland.com/creativeCommonsRssModule"
	>
<channel>
	<title>Comments on: Document Migrations</title>
	<atom:link href="http://www.robweir.com/blog/2007/03/document-migrations.html/feed" rel="self" type="application/rss+xml" />
	<link>http://www.robweir.com/blog/2007/03/document-migrations.html?utm_source=rss&#038;utm_medium=rss&#038;utm_campaign=document-migrations</link>
	<description>Thinking the unthinkable, pondering the imponderable, effing the ineffable and scruting the inscrutable</description>
	<lastBuildDate>Tue, 07 Feb 2012 11:20:47 +0000</lastBuildDate>
	<sy:updatePeriod>hourly</sy:updatePeriod>
	<sy:updateFrequency>1</sy:updateFrequency>
	<generator>http://wordpress.org/?v=3.3.1</generator>
	<item>
		<title>By: Rob</title>
		<link>http://www.robweir.com/blog/2007/03/document-migrations.html#comment-577</link>
		<dc:creator>Rob</dc:creator>
		<pubDate>Mon, 12 Mar 2007 12:45:00 +0000</pubDate>
		<guid isPermaLink="false">http://www.robweir.com/blog/2007/03/document-migrations.html#comment-577</guid>
		<description>The image formats we have today were developed over a 20-year period.  They developed to meet different needs, first scanners (TIFF), then for use in web graphics (GIF) then photographs (JPEG) and then to avoid a GIF patent (PNG).  &lt;br/&gt;&lt;br/&gt;I should also note that the JTC1 prohibition against contradictory standards did not exist 10-15 years ago when these conflicting image standards came about.  The contradiction clause in the JTC1 Directives is a relatively recent addition.  So one can assume that it was added to curb the abuses of the past.  So presenting examples of contradictory standards before then does not really make a strong argument. &lt;br/&gt;&lt;br/&gt;So back to OOXML.  It was submitted to ISO only 3-months after ISO&#039;s publication of ODF.  This is hardly the timescale where one can argue that ODF was superseded by newer technology. Also it is clear that OOXML is attempting to serve the same market niche as ODF -- Personal Productivity Application.   No one can argue that Microsoft Office and Open Office are in different markets.</description>
		<content:encoded><![CDATA[<p>The image formats we have today were developed over a 20-year period.  They developed to meet different needs, first scanners (TIFF), then for use in web graphics (GIF) then photographs (JPEG) and then to avoid a GIF patent (PNG).  </p>
<p>I should also note that the JTC1 prohibition against contradictory standards did not exist 10-15 years ago when these conflicting image standards came about.  The contradiction clause in the JTC1 Directives is a relatively recent addition.  So one can assume that it was added to curb the abuses of the past.  So presenting examples of contradictory standards before then does not really make a strong argument. </p>
<p>So back to OOXML.  It was submitted to ISO only 3-months after ISO&#8217;s publication of ODF.  This is hardly the timescale where one can argue that ODF was superseded by newer technology. Also it is clear that OOXML is attempting to serve the same market niche as ODF &#8212; Personal Productivity Application.   No one can argue that Microsoft Office and Open Office are in different markets.</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: nksingh</title>
		<link>http://www.robweir.com/blog/2007/03/document-migrations.html#comment-576</link>
		<dc:creator>nksingh</dc:creator>
		<pubDate>Mon, 12 Mar 2007 05:02:00 +0000</pubDate>
		<guid isPermaLink="false">http://www.robweir.com/blog/2007/03/document-migrations.html#comment-576</guid>
		<description>Rob, doesn&#039;t your above statement about the similarities of the different image formats invalidate much of your reasoning about why OOXML cannot exist as a separate standard format?&lt;br/&gt;&lt;br/&gt;If two different image formats which represent almost exactly identical subject matter can exist as non-contradictory standards, why cannot two different office document formats that encompass significantly different object models do the same?</description>
		<content:encoded><![CDATA[<p>Rob, doesn&#8217;t your above statement about the similarities of the different image formats invalidate much of your reasoning about why OOXML cannot exist as a separate standard format?</p>
<p>If two different image formats which represent almost exactly identical subject matter can exist as non-contradictory standards, why cannot two different office document formats that encompass significantly different object models do the same?</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Rob</title>
		<link>http://www.robweir.com/blog/2007/03/document-migrations.html#comment-574</link>
		<dc:creator>Rob</dc:creator>
		<pubDate>Fri, 09 Mar 2007 21:03:00 +0000</pubDate>
		<guid isPermaLink="false">http://www.robweir.com/blog/2007/03/document-migrations.html#comment-574</guid>
		<description>Some good comments -- thanks.&lt;br/&gt;&lt;br/&gt;Any conversion requires some human verification to see that it worked correctly. I&#039;ll suggest a criterion for all:  when translation technology is so perfect that you would be willing to run your resume  through it, and send out the resulting document, as-is, without looking at it first, then translation software is good enough to run without a manual check, and even then only for documents as simple as a resume.&lt;br/&gt;&lt;br/&gt;Compare this to image conversions, where it is expected to be 100% flawless.  This is because image formats (TIFF/GIF/JPG/PNG) represent nearly the same thing.  They are really just different encodings of a rectangular grid of pixels, with different colors, along with some metadata specifying transparency, color space, compression, etc.  But the core of it is just a grid of pixels, a bitmap.  The abstract model expressed by a document format is several thousand times more complex than this.</description>
		<content:encoded><![CDATA[<p>Some good comments &#8212; thanks.</p>
<p>Any conversion requires some human verification to see that it worked correctly. I&#8217;ll suggest a criterion for all:  when translation technology is so perfect that you would be willing to run your resume  through it, and send out the resulting document, as-is, without looking at it first, then translation software is good enough to run without a manual check, and even then only for documents as simple as a resume.</p>
<p>Compare this to image conversions, where it is expected to be 100% flawless.  This is because image formats (TIFF/GIF/JPG/PNG) represent nearly the same thing.  They are really just different encodings of a rectangular grid of pixels, with different colors, along with some metadata specifying transparency, color space, compression, etc.  But the core of it is just a grid of pixels, a bitmap.  The abstract model expressed by a document format is several thousand times more complex than this.</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: PolR</title>
		<link>http://www.robweir.com/blog/2007/03/document-migrations.html#comment-573</link>
		<dc:creator>PolR</dc:creator>
		<pubDate>Wed, 07 Mar 2007 21:11:00 +0000</pubDate>
		<guid isPermaLink="false">http://www.robweir.com/blog/2007/03/document-migrations.html#comment-573</guid>
		<description>I forgot to mention another good conversion strategy: to implement an internal service bureau. This would be a team of expert that can help users with difficult conversions. The service bureau may also operate utilities like:&lt;br/&gt;&lt;br/&gt;- A server that convert documents received by mail into PDF and mail back the result to the user.&lt;br/&gt;&lt;br/&gt;- A Terminal server running Microsoft Office for users that have a one-time or short term need to edit OOXML documents sent by  third parties.&lt;br/&gt;&lt;br/&gt;During initial implementation, the service bureau will act as a task force to convert large repositories where there is a business reason to convert.&lt;br/&gt;&lt;br/&gt;Afterwards, conversion will be an on-going activity because outside correspondents might not migrate to ODF right away and insist on OOXML. This situation will persist until the market perception that document standards must come from Microsoft wears out.&lt;br/&gt;&lt;br/&gt;The service bureau staff requirement will be reduced past the initial implementation, but some service will need to be maintained. Once ODF becomes dominant, the service bureau can be totally discontinued.&lt;br/&gt;&lt;br/&gt;&lt;br/&gt;A note for those that believe too much in automated tools. They are good but only to a point. Software has bugs. Converted documents need to be checked for fidelity. Inventories need to be validated for accuracy. This is a project cost in the cycle of inventory, convert and validate. This cost is a function of volume. Any strategy that reduces the volume reduces costs. Also documents have retention limits past which they can be destroyed. In many cases if the retention limit is shorter than the viability of keeping the ability to read and convert on-demand legacy stuff, there is no immediate business needs to convert.</description>
		<content:encoded><![CDATA[<p>I forgot to mention another good conversion strategy: to implement an internal service bureau. This would be a team of expert that can help users with difficult conversions. The service bureau may also operate utilities like:</p>
<p>- A server that convert documents received by mail into PDF and mail back the result to the user.</p>
<p>- A Terminal server running Microsoft Office for users that have a one-time or short term need to edit OOXML documents sent by  third parties.</p>
<p>During initial implementation, the service bureau will act as a task force to convert large repositories where there is a business reason to convert.</p>
<p>Afterwards, conversion will be an on-going activity because outside correspondents might not migrate to ODF right away and insist on OOXML. This situation will persist until the market perception that document standards must come from Microsoft wears out.</p>
<p>The service bureau staff requirement will be reduced past the initial implementation, but some service will need to be maintained. Once ODF becomes dominant, the service bureau can be totally discontinued.</p>
<p>A note for those that believe too much in automated tools. They are good but only to a point. Software has bugs. Converted documents need to be checked for fidelity. Inventories need to be validated for accuracy. This is a project cost in the cycle of inventory, convert and validate. This cost is a function of volume. Any strategy that reduces the volume reduces costs. Also documents have retention limits past which they can be destroyed. In many cases if the retention limit is shorter than the viability of keeping the ability to read and convert on-demand legacy stuff, there is no immediate business needs to convert.</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Anonymous</title>
		<link>http://www.robweir.com/blog/2007/03/document-migrations.html#comment-572</link>
		<dc:creator>Anonymous</dc:creator>
		<pubDate>Wed, 07 Mar 2007 19:54:00 +0000</pubDate>
		<guid isPermaLink="false">http://www.robweir.com/blog/2007/03/document-migrations.html#comment-572</guid>
		<description>I tested with 1 GB of office docs and xls files and the MS mass ooxml converter. &lt;br/&gt;It seems to work pretty good.&lt;br/&gt;&lt;br/&gt;I did notice one or two slight look changes in the spreadsheets.&lt;br/&gt;&lt;br/&gt;We might consider it as a valid method for conversion in a year or more. Allthough we do not expect the old MS Office formats to cause any problems for 5-10 years at least. We did actually found it reduced storage capacity with almost 27%. That would save us 100k-200k in expenses a year (storage,+ backup + servers) not including possible netwerk/email server savings. We should test more as the savings could be due to the nature of the converted documents in the tested set.&lt;br/&gt;&lt;br/&gt;Allthough we consider ODF a fine alternative we have not considered it as an option (yet) as we consider the prime choice is between Office products and not between Office formats.&lt;br/&gt;&lt;br/&gt;I don&#039;t think many people look at conversion as a space saver but it might wel be worth checking out if you have millions of documents.</description>
		<content:encoded><![CDATA[<p>I tested with 1 GB of office docs and xls files and the MS mass ooxml converter. <br />It seems to work pretty good.</p>
<p>I did notice one or two slight look changes in the spreadsheets.</p>
<p>We might consider it as a valid method for conversion in a year or more. Allthough we do not expect the old MS Office formats to cause any problems for 5-10 years at least. We did actually found it reduced storage capacity with almost 27%. That would save us 100k-200k in expenses a year (storage,+ backup + servers) not including possible netwerk/email server savings. We should test more as the savings could be due to the nature of the converted documents in the tested set.</p>
<p>Allthough we consider ODF a fine alternative we have not considered it as an option (yet) as we consider the prime choice is between Office products and not between Office formats.</p>
<p>I don&#8217;t think many people look at conversion as a space saver but it might wel be worth checking out if you have millions of documents.</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Queen Elizabeth</title>
		<link>http://www.robweir.com/blog/2007/03/document-migrations.html#comment-571</link>
		<dc:creator>Queen Elizabeth</dc:creator>
		<pubDate>Wed, 07 Mar 2007 17:13:00 +0000</pubDate>
		<guid isPermaLink="false">http://www.robweir.com/blog/2007/03/document-migrations.html#comment-571</guid>
		<description>But why not mass-migrate the files?&lt;br/&gt;&lt;br/&gt;The difficulty of doing so is not really a good argument: not only does MS made import/export converters for older versions of Office available, but it provides tools* that make inventorying and bulk conversion relatively trouble-free. Seriously, what could they have done to make it any easier?&lt;br/&gt;&lt;br/&gt;By not upgrading now, one is simply passing on costs to the future. It&#039;s akin to deferring maintenance--generally not a wise idea.&lt;br/&gt;&lt;br/&gt;Admittedly, there may be some intricate documents that break upon conversion. But that&#039;s why, before converting, you archive all your documents. Then, if something goes amiss, you always have the option of restoring the original file.&lt;br/&gt;&lt;br/&gt;* See: technet2.microsoft.com/Office/en-us/ library/d0373697-31f5-4fc5-8dd1-1b9d7f35842f1033.mspx</description>
		<content:encoded><![CDATA[<p>But why not mass-migrate the files?</p>
<p>The difficulty of doing so is not really a good argument: not only does MS made import/export converters for older versions of Office available, but it provides tools* that make inventorying and bulk conversion relatively trouble-free. Seriously, what could they have done to make it any easier?</p>
<p>By not upgrading now, one is simply passing on costs to the future. It&#8217;s akin to deferring maintenance&#8211;generally not a wise idea.</p>
<p>Admittedly, there may be some intricate documents that break upon conversion. But that&#8217;s why, before converting, you archive all your documents. Then, if something goes amiss, you always have the option of restoring the original file.</p>
<p>* See: technet2.microsoft.com/Office/en-us/ library/d0373697-31f5-4fc5-8dd1-1b9d7f35842f1033.mspx</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: PolR</title>
		<link>http://www.robweir.com/blog/2007/03/document-migrations.html#comment-570</link>
		<dc:creator>PolR</dc:creator>
		<pubDate>Wed, 07 Mar 2007 14:10:00 +0000</pubDate>
		<guid isPermaLink="false">http://www.robweir.com/blog/2007/03/document-migrations.html#comment-570</guid>
		<description>I forgot to mention.&lt;br/&gt;&lt;br/&gt;Conversion costs apply regardless you go ODF or OOXML. This is important because the Microsoft compatibility talk implies that ODF requires costly conversions while OOXML does not. This perception is wrong.&lt;br/&gt;&lt;br/&gt;We don&#039;t need to convert most legacy documents. Among those that need conversion, most are kept for archival purpose and can be converted to PDF because they will never be edited. What is left?&lt;br/&gt;&lt;br/&gt;The conversion to ODF is required for stuff like templates and application front-ends. Should one choose OOXML, these same documents and applications also need to be converted. You don&#039;t want to keep generating legacy stuff forever.&lt;br/&gt;&lt;br/&gt;Whatever format one choose, there is a need to inventory, convert and test. The difference in costs will occur only if OOXML somehow makes faster/better automatic conversion than ODF. How does that compare to licensing or hardware upgrades for Office 2007? Compatibility issues are overblown.</description>
		<content:encoded><![CDATA[<p>I forgot to mention.</p>
<p>Conversion costs apply regardless you go ODF or OOXML. This is important because the Microsoft compatibility talk implies that ODF requires costly conversions while OOXML does not. This perception is wrong.</p>
<p>We don&#8217;t need to convert most legacy documents. Among those that need conversion, most are kept for archival purpose and can be converted to PDF because they will never be edited. What is left?</p>
<p>The conversion to ODF is required for stuff like templates and application front-ends. Should one choose OOXML, these same documents and applications also need to be converted. You don&#8217;t want to keep generating legacy stuff forever.</p>
<p>Whatever format one choose, there is a need to inventory, convert and test. The difference in costs will occur only if OOXML somehow makes faster/better automatic conversion than ODF. How does that compare to licensing or hardware upgrades for Office 2007? Compatibility issues are overblown.</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: zridling</title>
		<link>http://www.robweir.com/blog/2007/03/document-migrations.html#comment-569</link>
		<dc:creator>zridling</dc:creator>
		<pubDate>Wed, 07 Mar 2007 04:53:00 +0000</pubDate>
		<guid isPermaLink="false">http://www.robweir.com/blog/2007/03/document-migrations.html#comment-569</guid>
		<description>An excellent guide to study and share with others and organizations. Thanks for taking the time to outline it in such a clear manner, Rob.&lt;br/&gt;&lt;br/&gt;For me, I will convert my personal writings (short stories, theses, dissertation) to ODF, but since StarOffice/OpenOffice reads my legacy .doc files with such accuracy, your guidelines will save me a year of conversion time.</description>
		<content:encoded><![CDATA[<p>An excellent guide to study and share with others and organizations. Thanks for taking the time to outline it in such a clear manner, Rob.</p>
<p>For me, I will convert my personal writings (short stories, theses, dissertation) to ODF, but since StarOffice/OpenOffice reads my legacy .doc files with such accuracy, your guidelines will save me a year of conversion time.</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: PolR</title>
		<link>http://www.robweir.com/blog/2007/03/document-migrations.html#comment-568</link>
		<dc:creator>PolR</dc:creator>
		<pubDate>Wed, 07 Mar 2007 03:53:00 +0000</pubDate>
		<guid isPermaLink="false">http://www.robweir.com/blog/2007/03/document-migrations.html#comment-568</guid>
		<description>I don&#039;t quite agree with this method.&lt;br/&gt;&lt;br/&gt;No big conversion orgy is required. Over 95% of the documents need only to be read. This does not call for a massive conversion. What needs conversion is two things:&lt;br/&gt;&lt;br/&gt;- The templates, especially those with plenty of macros. These are the ones that need to be used on a daily basis to produce new documents and feed corporate workflows. You want the new documents to be in the new format.&lt;br/&gt;&lt;br/&gt;- The applications, those that use Excel as a reporting or data entry tool, the systems indexing official versions of documents. You want to adapt them to use the new format.&lt;br/&gt;&lt;br/&gt;Almost everything else can be left out of scope for the migration project. Most documents are kept only for archival value. Users can be trained to convert the document they need to have handy in the new format. Selecting these document is a business decision that is best left to their owner. The conversion itself can be performed on an as needed basis. No need to gather a big inventory and anguish about getting the classification right. Users will convert only what they need. After a while most documents will become obsolete and there will be no point converting them anymore.&lt;br/&gt;&lt;br/&gt;If there are difficult cases that can&#039;t be read with the new office suite, a server can be configured to automatically convert anything sent to it into PDF. This server can be left available for years until there is very few legacy documents still relevant.&lt;br/&gt;&lt;br/&gt;Stuff that needs to be retained on a very long duration for legal reasons can be batch processed into PDF upfront if keeping them in legacy format is not good enough. Users can point out where repositories of such documents are stored. If there is a legal reason to keep them, they should be indexed and kept in a well defined place anyway. If they are not, then you have grounds for a project having nothing to do with the choice of the next file format or office suite.&lt;br/&gt;&lt;br/&gt;For read only compatibility, PDF is more than good enough. I find the later point ironic since Microsoft advertise read and convert compatibility as the main justification of OOXML without noting that PDF already does about everything that needs to be done. We don&#039;t want to retroactively edit our archives.&lt;br/&gt;&lt;br/&gt;If there is anything left that still can&#039;t be handled, then there is an administrative procedure called derogation. You give some individual that have unusual requirements some unusual rights to deviate from the corporate standard and use a second Office suite on top of the main one. He knows he is an exception and will jump through the required hoops to interface with the rest of the corporation. After a while the annoyance will make him find ways to bring himself back into the corporate standard.&lt;br/&gt;&lt;br/&gt;In my opinion, massive legacy document conversion is a red herring. &lt;br/&gt;&lt;br/&gt;A more serious issue is what you do with correspondents and business partners that insist on using a different file format. The perception that Microsoft defines the standard due to market share works like a self-fulfilling prophecy. There are people that will be reluctant to adopt ODF by fear of being unable to exchange with the masses that will use OOXML. But if everybody thinks the same, there is no real reason to stick with Microsoft, just some irrational group behavior.</description>
		<content:encoded><![CDATA[<p>I don&#8217;t quite agree with this method.</p>
<p>No big conversion orgy is required. Over 95% of the documents need only to be read. This does not call for a massive conversion. What needs conversion is two things:</p>
<p>- The templates, especially those with plenty of macros. These are the ones that need to be used on a daily basis to produce new documents and feed corporate workflows. You want the new documents to be in the new format.</p>
<p>- The applications, those that use Excel as a reporting or data entry tool, the systems indexing official versions of documents. You want to adapt them to use the new format.</p>
<p>Almost everything else can be left out of scope for the migration project. Most documents are kept only for archival value. Users can be trained to convert the document they need to have handy in the new format. Selecting these document is a business decision that is best left to their owner. The conversion itself can be performed on an as needed basis. No need to gather a big inventory and anguish about getting the classification right. Users will convert only what they need. After a while most documents will become obsolete and there will be no point converting them anymore.</p>
<p>If there are difficult cases that can&#8217;t be read with the new office suite, a server can be configured to automatically convert anything sent to it into PDF. This server can be left available for years until there is very few legacy documents still relevant.</p>
<p>Stuff that needs to be retained on a very long duration for legal reasons can be batch processed into PDF upfront if keeping them in legacy format is not good enough. Users can point out where repositories of such documents are stored. If there is a legal reason to keep them, they should be indexed and kept in a well defined place anyway. If they are not, then you have grounds for a project having nothing to do with the choice of the next file format or office suite.</p>
<p>For read only compatibility, PDF is more than good enough. I find the later point ironic since Microsoft advertise read and convert compatibility as the main justification of OOXML without noting that PDF already does about everything that needs to be done. We don&#8217;t want to retroactively edit our archives.</p>
<p>If there is anything left that still can&#8217;t be handled, then there is an administrative procedure called derogation. You give some individual that have unusual requirements some unusual rights to deviate from the corporate standard and use a second Office suite on top of the main one. He knows he is an exception and will jump through the required hoops to interface with the rest of the corporation. After a while the annoyance will make him find ways to bring himself back into the corporate standard.</p>
<p>In my opinion, massive legacy document conversion is a red herring. </p>
<p>A more serious issue is what you do with correspondents and business partners that insist on using a different file format. The perception that Microsoft defines the standard due to market share works like a self-fulfilling prophecy. There are people that will be reluctant to adopt ODF by fear of being unable to exchange with the masses that will use OOXML. But if everybody thinks the same, there is no real reason to stick with Microsoft, just some irrational group behavior.</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Steven G. Johnson</title>
		<link>http://www.robweir.com/blog/2007/03/document-migrations.html#comment-567</link>
		<dc:creator>Steven G. Johnson</dc:creator>
		<pubDate>Wed, 07 Mar 2007 03:47:00 +0000</pubDate>
		<guid isPermaLink="false">http://www.robweir.com/blog/2007/03/document-migrations.html#comment-567</guid>
		<description>Not converting documents does have one downside: the longer you wait to convert, the harder it might become, because the program that created the original document will at some point cease to work on readily available hardware, and new programs may be unable to read the old format.&lt;br/&gt;&lt;br/&gt;On the other hand, I guess it&#039;s a balance of costs: certain high cost to do all conversions now, versus a low probability of a high cost to do a small number of conversions much later.</description>
		<content:encoded><![CDATA[<p>Not converting documents does have one downside: the longer you wait to convert, the harder it might become, because the program that created the original document will at some point cease to work on readily available hardware, and new programs may be unable to read the old format.</p>
<p>On the other hand, I guess it&#8217;s a balance of costs: certain high cost to do all conversions now, versus a low probability of a high cost to do a small number of conversions much later.</p>
]]></content:encoded>
	</item>
</channel>
</rss>

<!-- Dynamic Page Served (once) in 1.138 seconds -->

