≡ Menu

An ODF/OOXML File Format Timeline

I suppose the downside of a blog post containing only a picture is that there is nothing for anyone to quote. So here are a few themes that struck me while putting this chart together:

  1. Microsoft once made file format information on the binary formats readily available, in fact encouraged programmers to use the binary formats. But then around 1999 they reversed course, and eliminated such documentation. At the time, working at Lotus, I had no idea what motivated this change. It was only years later, when Microsoft internal memos were released in cases like Comes v. Microsoft, that the full picture emerged. The file format was viewed by Microsoft as a strategic tool, used to support the overall Microsoft platform, not the user. The format was designed to preserve their vendor lock-in. The availability of the file format documentation to competitors was limited, as a matter of corporate policy.So this reminds us that just because something is documented and available today does not prevent Microsoft from changing their mind at a later point and removing the documentation, failing to update it with new releases, or making it available only under a more restrictive license. Since Ecma owns the OOXML specification, as well as the future maintenance of it, any belief in the long-term openness of this format depends on your trust of Microsoft’s future behavior in this area.
  2. Like any durable goods monopoly (and few things are as durable as software) Microsoft’s largest competitor is their own install base. Microsoft has made many attempts at moving beyond the binary formats in the past, with Office 2000, Office XP and Office 2003. But in each case it failed. These were all false starts and abandoned attempts. So we should look for signs that OOXML is actually Microsoft’s real direction and not another false start or dead end. My guess is that OOXML is merely a transitional format, much like Windows ME was in the OS space, a temporary hybrid used to ease the transition from 16-bit to the 32-bit platform that would eventually come (Windows 2000). Microsoft doesn’t want to support all of the quirks of their legacy formats forever. That just leads to bloated, fragile code, more expensive development and support costs. They would rather have clean, structured markup, like ODF. But the question is, how do you get there? The answer is straightforward: First, eliminate the competition. Second, move users in small steps, promising the comfort of continuity and safety. Third, once you have eliminated competition and have the users on the OOXML format that no one but Microsoft fully understands, then you may have your will of them. For example, introduce a new format that drops support for legacy formats and force everyone to upgrade. They are pretty much doing this already on the Mac by dropping support for VBA in the next version of the Mac Office.Even a cursory look at OOXML shows that it was not designed for long-term use, even by Microsoft. So the question I have is, what is the real format that they are going toward?
  3. Microsoft, after pretty much ignoring document standards for over a decade, suddenly got religion in late 2005 and rushed whatever they had on hand into Ecma. Remember, just months earlier they had recommended the Office 2003 Reference Schemas to Massachusetts for official use. I’m certainly glad Massachusetts did not fall for that by putting their resources on another dead format in the Microsoft format graveyard. OOXML was not designed to be a standard. It is just a proprietary specification that Microsoft has dumped, at the last minute, into ISO’s lap, in an attempt to translate their market domination into a standards imprimatur in order to further cement their market domination. It is a win-win situation for them. Either they have a effective monopoly in office applications and an ISO standard, or they have an effective monopoly in office applications. Nice situation for them either way.

Creative Commons License
This work, unless otherwise expressly stated, is licensed under a Creative Commons Attribution-Noncommercial-No Derivative Works 3.0 United States License.

{ 28 comments… add one }

  • Yoon Kit 2007/06/25, 00:04

    > Customers and developers who move to these formats
    > will find their investment quickly negated when
    > Microsoft abandons the format in the next release.

    So are you saying that Microsoft has had a “sterling” track record in abandoning any openly defined file format?

    3 times out of 3 they have changed. Any strong reason to think that they will stick to MSOOXML this time? Fourth time lucky? Because its bigger?

    yk.

  • steve_l 2007/06/25, 06:24

    I seem to recall that OLE2.0 came with the OLEFS, binary format of MSword and the like fully undocumented, in about 1992/93; the first place it got described was some joint ms/hp proposal for an alternative image format to JPEG. I dont remember the date of the latter.

  • marc 2007/06/25, 07:08

    Pictures are worth more than 1000 words. Very representative ( and embarrasing ) timeline. Thank you.

  • Anonymous 2007/06/26, 02:22

    “Any strong reason to think that they will stick to MSOOXML this time? Fourth time lucky? Because its bigger? “

    MS have already abandoned EOOXML.

    Read Open Malaysia

    Is VML in or out now, or was that a typo?

    And the first comment by Stephane Rodriguez is very interesting.

    Winter

  • putt1ck 2007/06/26, 07:49

    @yoon kit

    No, this time they will stick. It’s just that MSOOXML is not the same as EOOXML and will probably go through several versions as a result of service packs and product releases to ease customers into “keeping up to date”.

  • Charles Robinson 2007/06/27, 12:25

    Microsoft has already sown the seeds of the VBA replacement, called Visual Studio for Applications (VSA). It was released in 2005 with the .Net framework 2.0. Look for a deprecation of VBA with the next release of Office (14).

  • Chris Ward 2007/06/27, 16:57

    So, we have a dominant vendor of standalone office productivity software (Microsoft Office).

    We have a secondary vendor (Lotus with SmartSuite). I think Lotus (a division of IBM, now) would be perfectly happy never to sell another copy of SmartSuite; in fact I think if someone like Lenovo were to approach IBM, there’s a good chance IBM would sell off that business and we’d have Lenovo SmartSuite, growing in China.

    Lotus are doing Notes now. Teamrooms, Replication, Professional Collaboration.

    We have another vendor, Sun Microsystems, with StarOffice. I don’t think Sun make much of their revenue from StarOffice; it’s really so that they can get office productivity software going on SparcStations under Solaris, to give them (and their clients) some choices. Sun mainly sell ‘engineering services’ nowadays; warranties for Java.

    And OpenOffice. Anyone can download that, any time they want, from http://www.openoffice.org/ ; source code and all; and do whatever they like with it. No charge.

    It feels like when you’re in a plane, taxiing for take-off. On the runway, you need the wheels down. They are like Microsoft Office and Lotus SmartSuite. Faster, faster, faster, ‘Rotate’, tip the flaps, you’re up in the air and flying. Climb to cruising altitude, point to where you want to go. That’s what planes are for. The bit on the ground was just how you get started.

    Then you don’t need the wheels. They only get in the way.

    OpenOffice.org and ISO26300 from now on, please.

  • Anonymous 2007/06/27, 19:13

    ODF applications (open office) are bundled with big blue’s new Eclipse-based Lotus Notes 8 client, due this summer. The corporate desktop is getting interesting again!

  • Anonymous 2007/06/28, 06:33

    Charles said “Microsoft has already sown the seeds of the VBA replacement, called Visual Studio for Applications (VSA). It was released in 2005 with the .Net framework 2.0. Look for a deprecation of VBA with the next release of Office (14).”

    No, Microsoft cannot infuriates their install base.

    There is way too much money in existing VBA stuff embedded in Word/Excel/Powerpoint documents right now that if there is one thing you can bet, it’s that it’s here for a looooonng time. And that no half-assed technology such as VSTO is going to replace it anytime soon.
    Sure, two or three people at Microsoft want to do that, but for instance VSTO assemblies live outside documents, in other words this stuff is not allowed by the suits in the IT department. Remember Charles, power users in the enterprise are a minority.

    -Stephane Rodriguez

  • Bruce 2007/06/28, 12:09

    Re: “No, Microsoft cannot infuriates their install base”: MS has already announced they’re eliminating VBA from Office 2007.

  • Anonymous 2007/06/28, 23:25

    “MS has already announced they’re eliminating VBA from Office 2007″

    Sorry Bruce, I think you got it wrong.

    MS has made no such announcement. And by the way VBA is of course part of Office 2007.

    What they have announced though is that the MAC version of Office 2007, marketed as Mac Office 2008, will not provide support for VBA.

    There is a huge difference. The Windows install base is not impacted. And as I said above, I don’t see how it can possibly be taken out given the stakes.

    -Stephane Rodriguez

  • Anonymous 2007/06/29, 01:24

    Speaking of MS so-called interoperability, have a good laugh reading this :

    http://blogs.msdn.com/scaravajal/archive/2007/06/28/excel-2007-add-in-for-syncing-with-sharepoint.aspx

    Apparently, even interoperability with their own proprietary software is impossible.

    -Stephane Rodriguez

  • Wesley Parish 2007/06/29, 09:18

    I think Lotus (a division of IBM, now) would be perfectly happy never to sell another copy of SmartSuite;

    Well, if IBM decides that the SmartSuite source tree is much too good to simply dump, and they were willing to put it on Sourceforge under the Common Public License or some other OSI-approved license, I’m sure there would be quite a few people – including myself – who would be willing to take IBM up on the offer.

    Among other things, it would open up the SmartSuite file formats so they could be supported more fully by other office suites – a complaint that I’ve read every now and then.

    And SmartSuite would be adapted to use ODF – yet another contender that would make a hash of the Microsoft contention that ODF is OpenOffice.org under another name.

    It would even support a contention I’ve made repeatedly to Microsoft and others, that once such-and-such a company has gotten rid of such-and-such a software product, it should turn it over to its fan-base; instead of competing with its installed base the way Microsoft currently does, it would use it to debug its previous product/s and then to move in new directions.

    So how about it, IBM?

  • Ed 2007/06/29, 19:01

    Wesley,

    It seems like I might have commented on your thought somewhere else before, but I can’t find it.

    At this point in their lifecycle, posting the SmartSuite products as open source is not really possible. IBM has considered it in the past. The base problem is that there is too much licensed technology/software that is part of SmartSuite… and in some cases, the companies that wrote them are gone or so diluted that getting permission / agreement to open source that stuff is legally or otherwise difficult.

    I understand the desire and wish that it could be different. This is Rob’s blog and he worked on the suite, so he might have other data.

    –Ed Brill

  • Bruce 2007/06/30, 08:04

    Oh, yes, I was wrong; I meant Office 2008 (Mac).

  • Rob 2007/06/30, 08:48

    Echoing what Ed said, making a commercial product, especially an old one, open source is a huge undertaking from the IP perspective. With a new code base, it is easier. For example, I worked on Lotus XSL a few years ago, and donating that to Apache to make Xalan was simple because we could easily demonstrate that it was 100% original code. But with something like SmartSuite, it has a lot of 3rd party code for which we would need to secure permission, some from companies that are no longer around.

    Similarly, when a TV show is released in syndication or on DVD, they need to renegotiate the music rights with the composers and artists for any music used in the show. In some cases, like WKRP in Cincinnati, the musical changes required in the DVD versions, were extensive.

  • dario 2007/06/30, 13:38

    >No, this time they will stick.
    >It’s just that MSOOXML is not
    >the same as EOOXML and will probably
    > go through several versions as a
    >result of service packs and product
    >releases to ease customers into
    >”keeping up to date”.

    It seems that MS is prepared to do that.

    Do a search of the term “extLst” in OOXML Part 4 Markup Reference:

    651 occurrences.

    Quoting some of them:


    “3.2.10 extLst (Future Feature Data Storage Area) This element defines flexible storage extensions for implementing applications”
    [end of subclause, no more information given!!]“


    “3.2.7 ext (Extension) Each ext element contains extensions to the standard SpreadsheetML feature set.
    Parent Elements: extLst (§3.2.10)
    Child Elements: Any element from any namespace
    Subclause: n/a
    Attributes: uri (URI): A token to identify version and application information for this particular extension. The possible values for this attribute are defined by the XML Schema token datatype.
    [end of subclause, no more information given!!!!]“


    “5.1.2.1.14 ext (Extension)
    This element [of type CT_OfficeArtExtension] specifies an extension that is used for future extensions to the current version of DrawingML. This allows for the specifying of currently unknown elements in the future that will be used for later versions of generating applications.

    Attributes: uri (Uniform
    Resource Identifier): Specifies the URI, or uniform resource identifier that represents the data stored under
    this tag. The URI is used to identify the correct ‘server’ that can process the contents of this tag. The possible values for this attribute are defined by the XML Schema token datatype. [end of subclause, what 'server'????!!!]“

    Scaring …

  • Anonymous 2007/06/30, 13:44


    “Unfortunately, you cannot save the workbook in the new Office Open XML Formats. Instead, to retain the functionality, you need to save the workbook in the Excel 97-2003 (Biff8) file format.”

    mmmm, where is the 100% compatibility

    ( makes me remember this post )

  • Wesley Parish 2007/07/01, 07:21

    Point taken, Ed and Rob.

    I’ve heard also, from another IBMer whose name I don’t recall, make a similar point about IBM OS/2 2.x and later, that to release it as Open Source, would leave huge chunks missing.

    Which is what happened with Netscape and Mozilla, and which was resolved quite quickly in the case of encryption, if I remember correctly.

    As far as missing rights holders go, I’ve had a bit of experience with that myself. I know a bit of how frustrating it can be, when the person concerned seems to have vanished from the face of the earth. Which is partly why I think such cases should be declared abandoned and in the public domain – rather like an abandoned and derelict vessel in busy shipping lanes can earn up to near its full value in salvage fees and maritime lien.

  • Karl O. Pinc 2007/07/01, 13:04

    “then you may have your will of them”

    Awkward phrase. The usual phrase is “then you may have your way with them”, connoting sexual rapaciousness. Other choices might be “then you may bend them to your will”, or “then you may exercise your will over them”.

  • Rob 2007/07/01, 17:41

    It is certainly an archaic construct, though it rings true in my ear. A notable use, in the first person, and one that lingers in my ear is Paul Scofield’s Thomas More in the movie “A Man for All Seasons”, after the play by Robert Bolt, where More, on trial for treason, finds that his case is lost, by perjured testimony against him. He says to the court, “I am a dead man. You have your will of me.”

  • r3m0t 2007/07/05, 16:17

    Where’s StarOffice in this? People were using StarOffice before OO 1.0.

    I bet it had a different file format and you’re embarrased to admit it :)

  • Rob 2007/07/05, 17:41

    StarOffice before 1999? Sorry, I just don’t know any details about their work before OpenOffice 1.0.

    Since XML became a W3C standard in 1998, any StarOffice version before then could not have had an XML format, right?

  • Anonymous 2007/07/05, 21:31

    So, from what Wesley says (correct me if I’m wrong), IBM has not released the specifications of the binary file formats for SmartSuite or Notes.

    From your experience in working at IBM on SmartSuite, how did the people perceive the binary file formats during the Lotus/IBM years?

  • Rob 2007/07/06, 16:21

    It is hard to make generalities about the Lotus binary file formats. Remember, Freelance Graphics and WordPro were both acquisitions and each came with their own binary formats, with different design principles. The 1-2-3 format was a record-based format, a repeated set of opcode types, lengths and records. Freelance Graphics was a serialization of C-language structures. These were not the internal runtime data structures, but special data structures which were used only for writing the file format. The runtime structures, which did evolve from release to release, would be carefully copied into these persistence structures for saving. And WordPro used Bento structured storage, which was a compound container format designed by Apple.

    The 1-2-3 file format was published in book form and the Freelance format was available on request. I don’t believe that the WordPro format was ever documented.

  • Anonymous 2007/09/22, 18:53

    sorry, but Microsoft has a long history of doing this. Internet Explorer for Mac was discontinued. Why because of Apples safari program.
    Windows media player was discontinued. why, because of apples Quicktime.
    Virtual PC a program by Connectix, was bought out by Microsoft. is being discontinued, why, because of mac’s Bootcamp VM software.
    As Microsoft realizes that they can’t control the market, they eliminate the support, the competition, and the software.
    As they continue to grow in the Open Linux world, they will eventually develope software that will be licensed to them, so they can eliminate that too. The two words ( user friendly & Free ) are hated by Microsoft.

  • Glen Turner 2012/06/16, 01:56

    Previous events to mention would be:
    - TeX and LaTeX anf troff: Knuth, Lamport and Kernighan were very aware their systems were document interchange formats
    - SGML in 1986
    - HTML in 1991
    - The variously incompatible .DOC versions. This had a major effect within Microsoft as they realised that .DOC wasn’t used as an intermediary between keyboard and paper, but was an interchange format.
    - Boeing’s dissatisfaction with .DOC and continued use of FrameMaker, which set off the end times within Microsoft of .DOC as a future format.

  • oversky 2012/08/20, 22:54

    You may be interested in this article.
    http://bit.ly/TQO2Vh
    New file format options in the new Office

Leave a Comment

Next post:

Previous post: