• Skip to primary navigation
  • Skip to main content
  • Skip to primary sidebar

An Antic Disposition

  • Home
  • About
  • Archives
  • Writings
  • Links
You are here: Home / Archives for ODF

ODF

More Matter with Less Art

2007/01/31 By Rob 29 Comments

I wish to discuss a recent blog post, a vigorous defense of Microsoft’s Office Open XML and XAML from Novell’s Miguel de Icaza. His post is so wrong, on so many levels, that I am somewhat at a loss for words. Miguel is not stupid, and I find it hard to believe that he is a Microsoft shill, so I must assume that he was imperfectly informed on this issue. “Everyone is entitled to their own opinions but they are not entitled to their own facts,” as Pat Moynihan was fond of saying. I’ll try hard not to make this personal, but there are so many errors in his post that he may very well feel the sting of correction in my words, and for that I apologize in advance.

I suggest you read through Miguel’s post in its entirely, and then return here for my response.

After an attack against lawyers, we come to some technical comments:

Unlike the XML Schema vs Relax NG discussion where the advantages of one system over the other are very clear, the quality differences between the OOXML and ODF markup are hard to articulate.

The high-level comparisons so far have focused on tiny details (encoding, model used for the XML). There is nothing fundamentally better or worse in those standards like there is between XML Schema and Relax NG.

ODF grew out of OpenOffice.org and is influenced by its internal design. OOXML grew out of Microsoft Office and it is influenced by its internal design. No real surprises there.

Maybe I can be of some assistance here, helping to articulate the difference in quality between ODF and OOXML. ODF, starting from its roots in OpenOffice.org specification, spent a further 2 1/2 years being improved and reviewed in OASIS, then further work preparing for submission to ISO, then a further year in ISO, receiving more comments and corrections, before it was published as an ISO standard. So this is a combined 4 years in technical committees being refined by standards bodies. During this time ODF has been implemented in dozens of applications, including full suites like OpenOffice.org, KOffice and Lotus Workplace, as well as individual applications like AbiWord, Gnumeric and Google Docs and Spreadsheets.

In comparison, OOXML went from a proprietary Microsoft specification to an Ecma standard in record time. If you make something 8 times lengthier than ODF, and do it 4 times faster than ODF, then you are going to have a quality problem. The list of problems on GrokLaw is one list of known problems in OOXML. Note that particular list was generated in only 3 or 4 days by volunteers. I recently did a sampled survey of OOXML specification quality and predicted that it contains thousands of errors.

And where are the OOXML implementations? OOXML was approved by Ecma and submitted to ISO without a single available implementation. Certainly, Office 2007 later shipped with support, but is that it? A single implementation? Until you have at least two independent implementations of a standard you will have a very imperfect understanding of the standard’s quality.

So the question to ask is this: Why should JTC1 NB volunteers deal with the mess that Microsoft dropped on their lap by their overhasty review of OOXML in Ecma? Why should they spend the next 6 months reviewing this specification when even a cursory review shows it is defective in so many ways? And considering the observed low level of quality, why should it be reviewed and approved via a Fast Track process, and all in one big chunk of 6,000 pages? Isn’t this the last thing you want to do, following up a rushed review in Ecma by a rushed review in ISO? Instead this should go back to Ecma to let them do a proper review, one they can be proud of.

Miguel correctly points out that OOXML derives from Microsoft Office’s formats, and ODF derives from OpenOffice.org’s formats. But then he leaps to an assertion that they both reflect their parent application’s internals. This is not true. Only a poorly-designed file format reflects the internals of the application. Maybe that is how we did it back in the 1980’s, but best-practices for portable file formats have been known for years now. That is why we have data formats like XML, so the format can be independent of the application internals. ODF was designed, even in the OpenOffice days, from the ground up to be an application- and platform-neutral document format. While it was further developed in OASIS, it continued to take on such good qualities as reuse of existing relevant W3C standards such as XForms and MathML and SVG. So certainly, the platform-independence and open nature of OpenOffice.org rubbed off on ODF, but isn’t that an extremely good thing?

OOXML, on the other hand, matches to an inane degree the internals of a single vendor’s legacy application, with no concessions to platform-neutrality. For example, OOXML encodes data in non-XML formats such as binary blobs, bitmasks and other encodings that defy XML schema validation or processing by XML tools. As I’ve said before, this is not a specification, this is a DNA sequence.

Does that help articulate the difference?

Miguel then takes on the size question:

A common objection to OOXML is that the specification is “too big”, that 6,000 pages is a bit too much for a specification and that this would prevent third parties from implementing support for the standard.

Considering that for years we, the open source community, have been trying to extract as much information about protocols and file formats from Microsoft, this is actually a good thing.

This is good thing, I agree, that Microsoft has produced this specification. I’d like even more for them to make the specification for the Office binary formats public, since that is the format that the billions of legacy documents are actually in. I hope you’ll join with me in calling for Microsoft to release the specification for these formats under their Open Specification Promise, so that users will truly be able to choose which format they want to remain in or move to.

However, merely because it is useful from a disclosure perspective, does not necessarily mean it will make a good standard. Simply because it is better than nothing does not mean it is sufficient for an ISO standard. There is an important difference between a descriptive specification and a prescriptive standard. Writing down file formats is a small virtue, and one that other companies have done for years. Do they all deserve to be ISO standards?

For example, many years ago, when I was working on Gnumeric, one of the issues that we ran into was that the actual descriptions for functions and formulas in Excel was not entirely accurate from the public books you could buy.

OOXML devotes 324 pages of the standard to document the formulas and functions.

….

Depending on how you count, ODF has 4 to 10 pages devoted to it. There is no way you could build a spreadsheet software based on this specification.

This is a rather bold misstatement, considering that implementations such as OpenOffice.org, KSpread, Gnumeric, Google Spreadsheets, Lotus Workplace, etc., already in fact exist. Go back even earlier, we had 1-2-3, Quattro Pro and OpenOffice all supporting Excel’s formulas even though there was no formal specification for it. Sure having a good specification helps, but the extreme rhetoric that says that this is unimplementable is patently absurd. Just look around.

Some folks have been using a Wiki to keep track of the issues with OOXML. The motivation for tracking these issues seems to be politically inclined, but it manages to pack some important technical issues.

Hmm… The open source community helps test a purported open standard, reports the defects it finds, and this is called “politically inclined”? Isn’t this what open source is all about, “given sufficient eyeballs, all bugs are shallow”? Shouldn’t open standards be subject to scrutiny? As I said in my blog, I am so impressed by the quality and productivity of this type of wiki-enabled public review that I am going to investigate how we can do this to solicit public comments on ODF 1.2. This isn’t for political reasons. This is because it works.

Some of the objections over OOXML are based around the fact that it does not use existing ISO standards for some of the bits in it. They list 7 ISO standards that OOXML does not use: 8601 dates and times; 639 names and languages; 8632 computer graphics and metafiles; 10118-3 cryptography as well as a handful of W3C standards.

By comparison, ODF only references three ISO standards: Relax NG (OOXML also references this one), 639 (language codes) and 3166 (country codes).

Not only it is demanded that OOXML abide by more standards than ISO’s own ODF does, but also that the format used for metafiles from 1999 be used. It seems like it would prevent some nice features developed in the last 8 years for no other reason than “there was a standard for it”.

Miguel has inexplicably ommitted all of the W3C standards that ODF uses, such as XForms, MathML, SVG, XLink, SMIL, XSLT, CSS2 as well as IETF standards such as RFC 2045, RFC 2048, RFC 2616, RFC 2898, RFC 3066, RFC 3987. To imply that OOXML follows more standards that ODF is a foolish statement, unsupported by facts.

On the WMF, Miguel has it all wrong. What is a Windows Metafile? It is simply a recording of the graphical function calls made by Windows as it renders a drawing. It maps 1-to-1 into Windows API calls. It maps so closely to Windows that when the WMF format was found to be vulnerable to a security flaw, even the Wine Windows compatibility layer for Linux was susceptible to the same security hole. WMF (and VML, another legacy format in OOXML with a history of security problems) are flawed formats. One security vendor said: “Turns out this is not really a bug, it’s just bad design. Design from another era.” and “The WMF vulnerability probably affects more computers than any other security vulnerability, ever.”

Although Miguel is pleased to note that the proposed cross-platform ISO standard, Computer Graphics Metafile (CGM) dates to 1999, he fails to mention that WMF is even older, dating back to Windows 3.0 (1990).

So which one should be prefered in an ISO standard? The Windows Metafile format which is not documented in an open standard, is tied to the graphical layer of a single vendor, and has design flaws with serious security implications? Is this what we really want? Or do we want an open standard, one designed to be platform neutral, that has been in use for eight years, that has had a community continuing development and promotion of it such as CGM Open and WebCGM? Where is the WMF community? A Google search for WMF comes up with security problems; a search of CGM comes up with communities, initiatives and test suites.

There is an important-sounding “Ecma 376 relies on undisclosed information” section, but it is a weak case: The case is that Windows Metafiles are not specified.

It is weak because the complaint is that Windows Metafiles are not specified. It is certainly not in the standard, but the information is publicly available and is hardly “undisclosed information”. I would vote to add the information to the standard.

Did you really read the Groklaw issues list? WMF is not the only, or even the most troublesome of the undisclosed information in OOXML. Start here, then go back and read the Groklaw list of issues, and let me know if it makes more sense then. I am not that good at explaining these things, so please ask questions and I will try harder.

I have obviously not read the entire specification, and am biased towards what I have seen in the spreadsheet angle. But considering that it is impossible to implement a spreadsheet program based on ODF, am convinced that the analysis done by those opposing OOXML is incredibly shallow, the burden is on them to prove that ODF is “enough” to implement from scratch alternative applications.

There is that claim, that it is impossible to implement an ODF spreadsheet. Miguel, surely you aware of OpenOffice, KSpread, Lotus Workplace, Gnumeric, Google Docs? How can you persist in such obvious error? How could you actually write the above when you know, I know, and everyone reading it knows that it is patently false? Please tell me it was a just a typographical error.

Here’s a challenge: Give me a list of four spreadsheet applications from four different vendors that today are as interoperable with OOXML as the four leading ODF spreadsheets are with ODF.

There is a good case to be made for OOXML to be further fine-tuned before it becomes an ISO standard. But considering that Office 2007 has shipped, I doubt that any significant changes to the file format would be implemented in the short or medium term.

The best possible outcome in delaying the stamp of approval for OOXML would be to get further clarifications on the standard. Delaying it on the grounds of technical limitations is not going to help much.

This is quite a revealing statement. Why should the shipment of Office 2007 factor in the appropriateness and the quality of a proposed International Standard? Should standards of quality be relaxed for Microsoft’s convenience? Do technical limitations not matter because Microsoft has sales targets to meet? Is this what ISO is for? If so, I suggest their hard-working volunteers be given Microsoft salaries and stock options, since clearly they would be working only for Microsoft’s benefit at this point.

Miguel has a good point at the end:

To make ODF successful, we need to make OpenOffice.org a better product, and we need to keep improving it. It is very easy to nitpick a standard, specially one that is as big as OOXML. But it is a lot harder to actually improve OpenOffice.org.

If everyone complaining about OOXML was actually hacking on improving OpenOffice.org to make it a technically superior product in every sense we would not have to resort, as a community, to play a political case on weak grounds.

OpenOffice.org is one, but not the only application of ODF. It is the most prominent one in the traditional heavy-weight office suite model, but I’m not certain that this is the only way forward. We need good implementations, several of them, since one size does not fit all.

In any case I’d say in return that if Microsoft and Microsoft boosters spent some of their time investigating exactly how easy it would be to encode Office’s legacy features on top of the extensible ODF specification, and worked together with the ODF community to address their common concerns, then we could easily have a single interoperable format that we all could use. The resulting standard of OOXML on top of ODF would be smaller, simpler, higher quality and more interoperable than the mess that we’ll end up with by having OOXML as a standard, in addition to ODF.


Change Log:

2/1/2007 — fixed spelling errors reported by a reader via email
2/2/2007 — another spelling error

Filed Under: ODF, OOXML

A Review of the Wikipedia Article on ODF

2007/01/27 By Rob 5 Comments

As I had done last week with the Wikipedia article on Office Open XML (OOXML), I have taken a read through the article on OpenDocument Format (ODF). My aim was to do some fact checking and make some suggestions on some additional references that might be included. In some case I’ve made additional usage or phrasing suggestions, but I have not endeavored to do a full edit of the article.

In accordance with Wikipedia’s Conflict of Interest guidelines, I will put a link to this blog entry on the ODF article’s Talk page. These points are for the consideration of the volunteers editing the article, to consider and do what they want with them. I’ll probably repeat this review on a quarterly basis.

Since the article is changing at a rather rapid rate, you should note that I looked at the revision of 27 January at 16:19 which you can retrieve here.

  1. Opening paragraph. “…is a document file format used for exchanging electronic documents”. I’d say instead, “…for describing electronic documents”. Documents are exchanged via protocols like SMTP, WebDAV or HTTP, etc. ODF is only describing the documents.
  2. Strictly speaking, ODF was developed by a technical committee (TC) working within the OASIS consortium. The point is OASIS as a whole approved ODF, but it was developed within a TC.
  3. Last sentence of first paragraph is awkward. I’d keep the details and dates in the Standardization section and just state the current status here: “OpenDocument is an OASIS Standard as well as an International Standard published as ISO/IEC 26300:2006”
  4. The next sentence is weak. I’d rephrase as something like “ODF meets the common definitions of an [Open Standard], meaning the specification is freely available and may be implemented freely”. Since Wikipedia already has nice article on open standards, why not just link to that?
  5. The claim that ODF was “intended” to avoid vendor lock-in should be substantiated. That indeed may be one of its effects. But the charter of the TC did not mention that as an explicit goal. I think this is just loose language. Whenever you see a passive sentence, ask yourself, “Who or what did this”? Who intended ODF to be such and such? If you can provide a reference for that question, then you have something.
  6. Next sentence is awkward. How about, “OpenDocument is the first widely adopted International Standard for editable office documents.” ?
  7. Under Specifications, in addition to the listed compression advantage of using the approach with the ZIP archive, it also has the benefit of separating the content, styles , metadata and application settings into four separate XML files. This is a good example of the architectural principle of [Separation of Concerns].
  8. I suggest we add here: “An important goal during the development of ODF was to reuse existing relevant standards where possible. Such standards used in ODF include [MathML], [Synchronized Multimedia Integration Language|SMIL], [SVG], and [XForms].” If needed a link to the ODF TC’s charter would server as an authoritative reference for the goal to reuse existing standards.
  9. The Standardization section seems to be split off into a linked article which is a bit outdated. Is this necessary? This might make more sense to have this information brought back into the main article. Just my opinion.
  10. First sentence is not quite correct. ODF was developed by a technical committee (TC) working within the OASIS consortium.
  11. “OASIS Standard” should be capitalized as a proper noun.
  12. This section gets a bit weighted down with jargon. Does the average reader, even a technical reader, understand was a “DIS” is, or a “default ballot”? We should either explain the significance of these terms, or summarize. I don’t think this needs to contain a day-by-day retelling of how a specification made its way through ISO.
  13. OpenDocument Format 1.1 was approved as an Committee Specification in October. The ballot for approval as an OASIS Standard is occurring right now. (Would the average reader understand this distinction? Specifications are approved first by the ODF TC as Committee Specifications, then major versions are put forward for a vote by the entire OASIS membership as an OASIS Standard, and even more significant editions are then put forward for approval by ISO as an International Standard.)
  14. On the ODF 1.2 work, the parenthentical remark on spreadsheet formulas seem out of place and redundent since there is a separate Criticism header that covers this. The obvious presumption is that anything added to ODF 1.2 is added because it is not already there. Do we believe that any reader would think otherwise?
  15. Overall the 1.2 statement looks like it needs a rewrite. I’d suggest a simple statement like, “OpenDocument Format is currently being drafted by the ODF TC. It is planned to contain additional accessibility features, metadata enhancements, spreadsheet formula definition (based on [OpenFormula] and any errata submitted by the public.” (Discussion of various schedule predictions seems outdated since December has already come and gone. )
  16. Section on Application support — “Since there are a number of independent implementations of the ODF standard..”. This might be better in an “Interoperability” sub-section. If you make such a sub-section, the Fellowships test suite, mentioned earlier in the article, could be moved there as well.
  17. “Although Microsoft Office does not support OpenDocument…” should be, “Although Microsoft Office does not support OpenDocument natively…”
  18. Again, never trust engineers to come up with a good prediction of schedules. December has come and gone and no Add-in is complete.
  19. There should also be mention of Corel’s stated plans to add ODF support to WordPerfect Office. The press release you can reference is here.
  20. There is mention here of a “MS Open XML translator”. This was Microsoft’s name for their intiative. But the web page linked to here consistently refers to itself as the “ODF Add-in for Microsoft Word”. This is confusing. Maybe start with a mention of the Microsoft announcement from July 2006 (this press release) then say that one such project supported by Microsoft is the ODF Add-in for Word, etc.
  21. The ODMA mention is unrelated to ODF. It probably should be removed entirely.
  22. Under the Accessibility sub-section, might want to mention that a group at the University of Illinois has written an OpenDocument Format Accessibility Evaluator to scan uploaded ODF documents for how well they follow best practices for accessibility. A link to the tool is project is here.
  23. Under Promotion section, we should link to the ODF Adoption TC’s web page here and mention that they also manage the web site http://OpenDocument.xml.org
  24. The promotion activities of OpenOffice.org should be included in the bullet list that follows, right? Not clear why it is not.
  25. “…as well as other companies who may or may not be working inside…” is weird. Was someone attempting to say something here. The fact that the ODF Alliance is stated has having “more than 280 members” should make it obvious that not all are members of the OASIS ODF TC. Is anything added by having this statement?
  26. ODF Alliance has 362 organizational members according to their latest newsletter here .
  27. In Adoption section, there is repetition of information that was already covered in the Application support section, such as the Microsoft-funded translator work.
  28. The Adoption section is incomplete, missing adoptions in Brazil, Argentina, Extremadura Spain, and India. The ODF Alliance newsletters have the details on these and others. This whitepaper is a good summary.
  29. In Criticism section, the statements, “Some mathematicians do not think that the choice of the MathML W3C standard for use in OpenDocument is a good choice” and “monstrosity written purely by web designers” lack an authoritative citation. All that is given is a link to an unnamed commenter on a GrokLaw article, whose credentials as a mathematician or a spokesman for mathematicians are not obvious. Consider that one of the authors of the MathML 2.0 standard, and co-chair of the W3C’s Math Working Group, is Patrick Ion, editor of the American Mathematical Society’s Mathematical Reviews. So the credibility of MathML should not so easily be set aside by a single anonymous, unsubstantiated comment. I’d also note that the Wikipedia artcle for MathML does not note such criticism.
  30. “The OpenDocument ISO specification does not contain a defined formula language” is more precise as “The OpenDocument ISO specification does not define a standard spreadsheet formula language.”
  31. “This means that ISO conforming files do not have to be compatible.” This is a weak argument. Even if the spreadsheet language were defined, ISO conforming documents are not required to be compatible. For example, two implementations may implement different subsets of features. And even without a formula standard, implementations can still be compatible. For example, 1-2-3 , Quattro Pro and OpenOffice have been able to read Excel formulas for years, even though Microsoft had not specified this. Maybe what is meant here is “This means that spreadsheet implementations currently rely on application-level interoperability testing rather than referencing a normative specification of formula syntax and semantics.”
  32. The criticism of the ability to embed Java applets is new to me. No reference is given for this criticism. The section number establishes the existence of the feature, but does not establish grounds for criticizing it. Is this original research? If so, it does not belong on Wikipedia.

Change Log
1/28/07 — corrected link to ODF’s Talk page

Filed Under: ODF, Wikipedia

Document Format Punditry

2007/01/23 By Rob 15 Comments

Rick Jelliffe, Mr. Schematron, who blogs for O’Reilly, recently announced that he had been contacted by Microsoft to see if he would be interested in a contract to edit the Office Open XML (OOXML) and Open Document Format (ODF) Wikipedia pages. As Rick says,

So I was a little surprised to receive email a couple of days ago from Microsoft saying they wanted to contract someone independent but friendly (me) for a couple of days to provide more balance on Wikipedia concerning ODF/OOXML. I am hardly the poster boy of Microsoft partisanship! Apparently they are frustrated at the amount of spin from some ODF stakeholders on Wikipedia and blogs.

I think I’ll accept it: FUD enrages me and MS certainly are not hiring me to add any pro-MS FUD, just to correct any errors I see. If anyone sees any examples of incorrect statements on Wikipedia or other similar forums in the next few weeks, please let me know: whether anti-OOXML or anti-ODF. In fact, I already had added some material to Wikipedia several months ago, so it is not something new, so I’ll spend a couple of days mythbusting and adding more information.

This immediately brought on an avalanche of commentary, on his blog, and elsewhere. As someone who also blogs on ODF/OOXML topics, I’d like to say a few words on the subject of document format punditry.

Few of my readers know me personally. They only know me via my words. Their acceptance or non-acceptance of this blog and what I say is largely determined by their perception of these two dimensions:

  • Authority — Am I an expert? Am I writing about things that I have direct knowledge of, or through education, training or direct experience would be expected to have worthwhile insights on?
  • Orientation — Do I have a bias on the subject being discussed. I’m not using the word “bias” in a pejorative sense, but to describe how far one’s views vary from a neutral, journalistic point of view, to a view that is overtly partisan on a particular issue. Bias is expected in opinion pieces, but not in Wikipedia articles.

My blog clearly comes with an expert, pro-ODF orientation. Additionally, I try to keep it light and humorous so even if a reader disagrees with me on one issue, at least they will be amused.

Looking at the range of people writing on these issues, I see the landscape something like this:

  • We have a number of highly informed experts in ODF and OOXML who aren’t really talking to each other.
  • We have the press, trying to be neutral, but having difficulty figuring out the significance of the technical issues since they are rather esoteric.
  • The General Public, who won’t even hear about the issues until the press figures it out.
  • And then we have various degrees of extremists of all varieties, not easily classifiable. Their writings are backed by ideological more than technical arguments. There are important ideological issues at stake in this debate, so these are voices are important.

What we seem to be lacking is the expert, neutral technical commentary. This is not too surprising. Many of the experts took sides a long time ago, or decided to sit this one out. That is understandable. But without this center of expert commentary, the press will continue to report the biases of whatever side they happen talk to first.

Where does Rick fit it into this chart? His expertise is undeniable. But if he takes Microsoft’s money he risks losing his reputation for neutrality. That is his choice and I am in no position to fault someone for that. He joins a crowded field of opinionated people already writing on this issue from one angle or another. He’ll likely be one of the better pro-OOXML writers out there. Nothing wrong with that. As Charles McCabe famously said, “Any clod can have the facts, having opinions is an art.”

But I do suggest that Microsoft’s money would have been better spent, and Rick’s skills better used, if they had engaged Rick earlier to help review and improve the OOXML specification. Trying to fix perceptions of the standard after the fact will be a lot harder, and more expensive, than creating a good standard in the first place.

And I will lament the fact that we continue to lack neutral experts who can digest the massive amounts of technical information out there and present it in a way that the press can reference and the public can understand. I think Rick would have served this role admirably. Instead we risk having one less voice in the middle.

Looking at this potential deal with Rick, and Microsoft’s earlier deal with Novell, I wonder if someone at Microsoft thinks that neutrality is dangerous and that their purposes are better served by eliminating it?

Filed Under: Blogging/Social, ODF, OOXML

Opportunity Knocks

2007/01/21 By Rob Leave a Comment

There are a number of us commenting on issues related to open standards, in particular ODF. My blog roll has a list of several who regularly cover these topics. I’ve recently added a new link to that list, and I’d like to highlight it for you today.

Walt Hucks and Opportunity Knocks blog has been putting out some nicely researched commentary on the file format debate. His most recent post, “Whose Finances Are On the Line?”, looks at what Microsoft is risking if OOXML fails to gain acceptance.

Walt looks at the business angle in “What’s Wrong With Choice?”, delving into Microsoft’s financials and explaining how that is determining Microsoft’s behavior around OOXML:

Let’s be honest here. According to your latest Form 10-Q, Office is 90% of the revenue of Microsoft Business Division, which is in turn one of the three profitable segments in the company. Both of the other two segments related directly to the Windows operating systems (“Client” & “Server”). MBD is able to charge a pretty high price for its products. If there was a fully-level playing field—a standardized file format for the industry that almost anyone could implement—that would directly threaten Office & MBD. Losing dominance with Office would in turn threaten the Client segment, because users would be free to utilize whatever operating system(s) met their needs without being risking being unable to share office documents with others.

So, I’d like to officially welcome Walt to the Fraternity of Geeks who Blog about File Formats on the Weekend (FGBFFW), and recommend him to everyone else who will read his blog on Monday.

Filed Under: ODF, OOXML

Amusing but Confusing

2007/01/20 By Rob 12 Comments

I’ve always been annoyed by Microsoft’s choice for a name in their “Office Open XML”. It isn’t the wishful use of the work “open” that bothered me. It was that the name just doesn’t roll off the tounge easily. It always seems to get stuck someplace and comes out wrong. You need to think harder to say “Office Open XML” and have it come out right.

“Open” is an adjective, and in English adjectives are usually placed before nouns, not in the middle of a noun phrase. We say, a “black guard dog”, not a “guard black dog”. When you fight language, language usually ends up winning. So it is not surprising that what comes out is “Open Office XML” by mistake.

I’m obviously not the only one with this problem. A quick Google for “Microsoft Open Office XML”, or “Ecma Open Office XML”, phrases that should get zero hits, reveals instead an embarrassment of riches. Everyone gets this wrong.

ZDNet’s David Berlind:

Yesterday, when Novell announced that one of the first fruits to be born out of its newly minted legal relationship with Microsoft would be a plug-in to OpenOffice.org that would allow the open source based office suite to open or save documents in Microsoft’s Open Office XML (OO-XML) file format, I had a tough time parsing through the text of the company’s press release.

Redmonks’s Stephen O’Grady with an article titled “Microsoft Open Office XML Formats / Open Document Format Follow Up”.

CRN: Reseller Channel News with a headline, “Ecma says Yeah to Microsoft Open Office XML“.

Computer Business Review:

Corel Corp, developer of the WordPerfect suite, announced last week that it will support both ODF and Microsoft’s Open Office XML format.

XMLMind, a tool designed to work with OOXML gets it wrong:

Thanks to new XMLmind FO Converter v4, it is now possible to convert XML documents to Open Office XML (.docx) the native format of MS-Word 2007.

BusinessWeek proof-readers missed this error:

…Microsoft is working hard to defeat it and promote its own XML-based file format–called Microsoft Open Office XML. This will be the default file format in Office 2007, due out late this year.

Even Microsoft Press Releases make this error:

‘Through the XXX Alliance, we are working closely with Microsoft to increase data access across our instrument systems and data analysis software tools using Ecma Open Office XML,’ said XXX, president of XXX.

Even Microsoft’s blog profile for a member of their own Corporate Standards Team, an OOXML expert, gets it wrong:

Dave is a member of Microsoft’s Corporate Standards policy team. He is involved with all of Microsoft’s global standards around server & tools which includes everything from XML to WS-*, from W3C to Oasis and ISO, all Office standards including Open Office XML, and all vertical industry standards from the enterprise markets to Microsoft Dynamics products

This guy works on Office Open XML and he doesn’t even get it right!?

Microsoft’s own OOXML overview page on the file formats can’t get it right:

By installing a simple update, users of Microsoft Office 2000, Microsoft Office XP, and Office 2003 Editions can open, edit, and save documents in one of the Ecma Open Office XML File Formats.

Ditto for Microsoft’s FAQ page on the file formats:

The Ecma Open Office XML Formats will offer some key improvements over the binary file formats in use today within Word, Excel, and PowerPoint. Because these new file formats are compressed, the resulting document sizes will be much smaller, somewhere between 50 and 75 percent smaller in some cases.


A recent article by Microsoft’s Platform Strategy Manager in Australia got it wrong in the title: Streamlining your documents with Open Office XML.

And to top it all off, Bill Gates himself gets it wrong, then corrects himself, as seen in Molly Holzschlag’s transcript from a recent blogger outreach event she attended at Microsoft headquarters in Redmond:

But every year for 13, 14 years now we’ve not just followed and implemented standards, we’ve contributed. This WS stuff, . . . we contributed more Web standards than anyone! We have our smartest people who go and work on that stuff . . . we just did the OpenOffice . . . our office XML formats we contributed to them . . . we’ve got XML at the core of all our products.

(Thanks to Yoon Kit from Open Malaysia, who has also been taking a closer look at the names used inside OOXML, for pointing out that quote.)

I’m not meaning to embarrass anyone with the above quotes. Those who have heard me speak on Office Open XML know that I struggle to get that name out every time, and do not always succeed. Like I said before, if you fight language, you will lose.

So the Ecma standard clearly has a name which causes confusion with the name of an existing application, “Open Office”, which happens to also be the most prominent implementation of OpenDocument Format, the ISO standard for office documents. OpenOffice.org is a registered trademark (check the Tess database for the actual registration) and has been used in the trade since 2001 for describing a application used for database management, spreadsheet, word processor and presentation graphics.

I am not a lawyer, but from reading a BitLaw writeup on trademark infringement, it appears that the thing to prove is “likelihood of confusion”, and the factors the courts would look at include evidence of actual confusion by consumers and similarity of the marketing channels for the two products.

In any case, to have an ISO standard that, by its aberrant use of the English language, almost compels users to transform it into “Open Office XML” will only confuse users. This is not just my prediction. It is my observation, backed up by many specific examples of how this confusion is happening even now. I invite you to comment on other examples you may know of.

Early last year, another Microsoft/Ecma was submitted to JTC1 for approval under Fast Track rules. It was Microsoft’s C++/CLI specification. During the 30-day contradiction review period national bodies raised objections based on the confusing name Microsoft picked for their standard, and the practical problems this caused. GrokLaw had good coverage of this.

A summary of the UK’s contradiction argument is:

In response to document ISO/IEC JTC1 N8037, the UK objects to Fast Track Ballot ECMA-372 1st Edition C++/CLI Language Specification, on the grounds that there is a contradiction with an existing JTC1 standard. ISO/IEC 14882:2003 is the standard for the C++ programming language. Adopting a second standard under the proposed name of C++/CLI will cause unnecessary and harmful confusion in the marketplace.

We consider that C++/CLI is a new language with idioms and usage distinct from C++. Confusion between C++ and C++/CLI is already occurring and is damaging to both vendors and consumers.

A new language needs a new name. We therefore request that Ecma withdraw this document from fast-track voting and if they must re-submit it, do so under a name which will not conflict with Standard C++.

Germany had similar objections:

We propose that the document is input into SC22 as a regular New Work Item Proposal and assigned to WG21 for further processing.

On a technical level, there are some rather different approaches between C++ and C++/CLI which can easily cause considerable confusion when both languages are considered to be “C++” or add unnecessary overhead when trying to write C++ code usable with C++ and C++/CLI.

I suggest a similar objection should be raised with regards to Ecma Office Open XML. It’s name causes confusion with an existing registered trademark. Ecma should rename their standard to something less likely to cause confusion.

Any suggestions for a new name?


Updated on 25 June 2007 to add some additional recent examples of this continuing confusion.

Filed Under: ODF, OOXML

  • « Go to Previous Page
  • Page 1
  • Interim pages omitted …
  • Page 18
  • Page 19
  • Page 20
  • Page 21
  • Page 22
  • Interim pages omitted …
  • Page 25
  • Go to Next Page »

Primary Sidebar

Copyright © 2006-2026 Rob Weir · Site Policies