• Skip to primary navigation
  • Skip to main content
  • Skip to primary sidebar

An Antic Disposition

  • Home
  • About
  • Archives
  • Writings
  • Links
You are here: Home / Archives for 2007

Archives for 2007

A Review of the Wikipedia Article on ODF

2007/01/27 By Rob 5 Comments

As I had done last week with the Wikipedia article on Office Open XML (OOXML), I have taken a read through the article on OpenDocument Format (ODF). My aim was to do some fact checking and make some suggestions on some additional references that might be included. In some case I’ve made additional usage or phrasing suggestions, but I have not endeavored to do a full edit of the article.

In accordance with Wikipedia’s Conflict of Interest guidelines, I will put a link to this blog entry on the ODF article’s Talk page. These points are for the consideration of the volunteers editing the article, to consider and do what they want with them. I’ll probably repeat this review on a quarterly basis.

Since the article is changing at a rather rapid rate, you should note that I looked at the revision of 27 January at 16:19 which you can retrieve here.

  1. Opening paragraph. “…is a document file format used for exchanging electronic documents”. I’d say instead, “…for describing electronic documents”. Documents are exchanged via protocols like SMTP, WebDAV or HTTP, etc. ODF is only describing the documents.
  2. Strictly speaking, ODF was developed by a technical committee (TC) working within the OASIS consortium. The point is OASIS as a whole approved ODF, but it was developed within a TC.
  3. Last sentence of first paragraph is awkward. I’d keep the details and dates in the Standardization section and just state the current status here: “OpenDocument is an OASIS Standard as well as an International Standard published as ISO/IEC 26300:2006”
  4. The next sentence is weak. I’d rephrase as something like “ODF meets the common definitions of an [Open Standard], meaning the specification is freely available and may be implemented freely”. Since Wikipedia already has nice article on open standards, why not just link to that?
  5. The claim that ODF was “intended” to avoid vendor lock-in should be substantiated. That indeed may be one of its effects. But the charter of the TC did not mention that as an explicit goal. I think this is just loose language. Whenever you see a passive sentence, ask yourself, “Who or what did this”? Who intended ODF to be such and such? If you can provide a reference for that question, then you have something.
  6. Next sentence is awkward. How about, “OpenDocument is the first widely adopted International Standard for editable office documents.” ?
  7. Under Specifications, in addition to the listed compression advantage of using the approach with the ZIP archive, it also has the benefit of separating the content, styles , metadata and application settings into four separate XML files. This is a good example of the architectural principle of [Separation of Concerns].
  8. I suggest we add here: “An important goal during the development of ODF was to reuse existing relevant standards where possible. Such standards used in ODF include [MathML], [Synchronized Multimedia Integration Language|SMIL], [SVG], and [XForms].” If needed a link to the ODF TC’s charter would server as an authoritative reference for the goal to reuse existing standards.
  9. The Standardization section seems to be split off into a linked article which is a bit outdated. Is this necessary? This might make more sense to have this information brought back into the main article. Just my opinion.
  10. First sentence is not quite correct. ODF was developed by a technical committee (TC) working within the OASIS consortium.
  11. “OASIS Standard” should be capitalized as a proper noun.
  12. This section gets a bit weighted down with jargon. Does the average reader, even a technical reader, understand was a “DIS” is, or a “default ballot”? We should either explain the significance of these terms, or summarize. I don’t think this needs to contain a day-by-day retelling of how a specification made its way through ISO.
  13. OpenDocument Format 1.1 was approved as an Committee Specification in October. The ballot for approval as an OASIS Standard is occurring right now. (Would the average reader understand this distinction? Specifications are approved first by the ODF TC as Committee Specifications, then major versions are put forward for a vote by the entire OASIS membership as an OASIS Standard, and even more significant editions are then put forward for approval by ISO as an International Standard.)
  14. On the ODF 1.2 work, the parenthentical remark on spreadsheet formulas seem out of place and redundent since there is a separate Criticism header that covers this. The obvious presumption is that anything added to ODF 1.2 is added because it is not already there. Do we believe that any reader would think otherwise?
  15. Overall the 1.2 statement looks like it needs a rewrite. I’d suggest a simple statement like, “OpenDocument Format is currently being drafted by the ODF TC. It is planned to contain additional accessibility features, metadata enhancements, spreadsheet formula definition (based on [OpenFormula] and any errata submitted by the public.” (Discussion of various schedule predictions seems outdated since December has already come and gone. )
  16. Section on Application support — “Since there are a number of independent implementations of the ODF standard..”. This might be better in an “Interoperability” sub-section. If you make such a sub-section, the Fellowships test suite, mentioned earlier in the article, could be moved there as well.
  17. “Although Microsoft Office does not support OpenDocument…” should be, “Although Microsoft Office does not support OpenDocument natively…”
  18. Again, never trust engineers to come up with a good prediction of schedules. December has come and gone and no Add-in is complete.
  19. There should also be mention of Corel’s stated plans to add ODF support to WordPerfect Office. The press release you can reference is here.
  20. There is mention here of a “MS Open XML translator”. This was Microsoft’s name for their intiative. But the web page linked to here consistently refers to itself as the “ODF Add-in for Microsoft Word”. This is confusing. Maybe start with a mention of the Microsoft announcement from July 2006 (this press release) then say that one such project supported by Microsoft is the ODF Add-in for Word, etc.
  21. The ODMA mention is unrelated to ODF. It probably should be removed entirely.
  22. Under the Accessibility sub-section, might want to mention that a group at the University of Illinois has written an OpenDocument Format Accessibility Evaluator to scan uploaded ODF documents for how well they follow best practices for accessibility. A link to the tool is project is here.
  23. Under Promotion section, we should link to the ODF Adoption TC’s web page here and mention that they also manage the web site http://OpenDocument.xml.org
  24. The promotion activities of OpenOffice.org should be included in the bullet list that follows, right? Not clear why it is not.
  25. “…as well as other companies who may or may not be working inside…” is weird. Was someone attempting to say something here. The fact that the ODF Alliance is stated has having “more than 280 members” should make it obvious that not all are members of the OASIS ODF TC. Is anything added by having this statement?
  26. ODF Alliance has 362 organizational members according to their latest newsletter here .
  27. In Adoption section, there is repetition of information that was already covered in the Application support section, such as the Microsoft-funded translator work.
  28. The Adoption section is incomplete, missing adoptions in Brazil, Argentina, Extremadura Spain, and India. The ODF Alliance newsletters have the details on these and others. This whitepaper is a good summary.
  29. In Criticism section, the statements, “Some mathematicians do not think that the choice of the MathML W3C standard for use in OpenDocument is a good choice” and “monstrosity written purely by web designers” lack an authoritative citation. All that is given is a link to an unnamed commenter on a GrokLaw article, whose credentials as a mathematician or a spokesman for mathematicians are not obvious. Consider that one of the authors of the MathML 2.0 standard, and co-chair of the W3C’s Math Working Group, is Patrick Ion, editor of the American Mathematical Society’s Mathematical Reviews. So the credibility of MathML should not so easily be set aside by a single anonymous, unsubstantiated comment. I’d also note that the Wikipedia artcle for MathML does not note such criticism.
  30. “The OpenDocument ISO specification does not contain a defined formula language” is more precise as “The OpenDocument ISO specification does not define a standard spreadsheet formula language.”
  31. “This means that ISO conforming files do not have to be compatible.” This is a weak argument. Even if the spreadsheet language were defined, ISO conforming documents are not required to be compatible. For example, two implementations may implement different subsets of features. And even without a formula standard, implementations can still be compatible. For example, 1-2-3 , Quattro Pro and OpenOffice have been able to read Excel formulas for years, even though Microsoft had not specified this. Maybe what is meant here is “This means that spreadsheet implementations currently rely on application-level interoperability testing rather than referencing a normative specification of formula syntax and semantics.”
  32. The criticism of the ability to embed Java applets is new to me. No reference is given for this criticism. The section number establishes the existence of the feature, but does not establish grounds for criticizing it. Is this original research? If so, it does not belong on Wikipedia.

Change Log
1/28/07 — corrected link to ODF’s Talk page

Filed Under: ODF, Wikipedia

Crocodile Tears

2007/01/25 By Rob 21 Comments

By now everyone on the planet with an internet connection knows about Rick Jelliffe, the blogger Microsoft offered to pay to make the Office Open XML Wikipedia page “more objective”. (It is unclear what criteria Microsoft uses to determine which bloggers are given free laptops and which ones get offered cash.) In any case, I suspect that objectivity, like love, is something that is better free than purchased.

Microsoft’s Doug Mahugh disclosed a portion of the proposal he sent to Rick, in a comment on Slashdot:

Wikipedia has an entry on Open XML that has a lot of slanted language, and we’d like for them to make it more objective but we feel that it would be best if a non-Microsoft person were the source of any corrections… Would you have any interest or availability to do some of this kind of work? Your reputation as a leading voice in the XML community would carry a lot of credibility, so your name came up in a discussion of the Wikipedia situation today.

The national coverage of what was eventually called “Wiki-gate” brought the inevitable reaction from Microsoft — IBM made us do it:

[Microsoft] Spokeswoman Catherine Brooker said she believed the articles were heavily written by people at IBM, which is a big supporter of the open-source standard — in USA Today.

So the question in my mind is this: How bad was the OOXML Wikipedia page before all the fuss started? All this Wiki-gate news hit on the 23rd, with Rick’s blog post. So let’s go back to the Wikipedia page previous to that, which would be the version of 18 January. Take a read. You can also take a look at the Talk page where the prior version was last edited on 21 September. You can read it here.

Is this something that one would say has “a lot of slanted language” and was “heavily written by people at IBM Corp”? Is this something that warranted extraordinary means to address? I’d be interested in what parts they believed were “heavily written”. What does “heavily written” even mean? This is quite an allegation.

How’s this for heavy: I’ll make you an offer you can’t refuse. I’ll do a review and fact checking of the 18 January version of the OOXML Wikipedia entry, and I’ll link to the review from the Talk page, so others can consider it and make the changes if they agree. I won’t charge you a cent. If you find this at all useful, you can donate a few dollars to the Free Software Foundation. How’s that, Doug?

  1. The first paragraph should say simply “Office Open XML”. It is a waste of time to argue about whether it is Microsoft Office Open XML, Ecma Office Open XML or ISO Office Open XML. At some point you may have one version in ISO while a revision is being worked on in Ecma. Just call it “Office Open XML” and it will cover all cases.
  2. Next sentence should say, “The specification was developed by Microsoft and others…”. You shouldn’t need to list them all here, but do list them under Standardization.
  3. Should say, “is the default format in Office 2007”, not just “is used”.
  4. “Microsoft maintains that its primary goal…” needs a reference to cite. Perhaps page 1 of the whitepaper.
  5. “The Microsoft Office Open XML format is Microsoft’s direct answer to the OpenDocument format” also needs a reference or should be removed.
  6. Standardization section would be better if written chronologically, start from the beginning and end at the end.
  7. Should say, “A liaison from the ISO/IEC JTC1/SC34 was appointed to help during…”
  8. Licensing — “There has been a lot of argument about…”. If there has been a lot, maybe someone should cite an example?
  9. Brian Jones is an expert in some things, but I am not aware of his legal credentials. So citing his legal analysis does not seem to be authoritative and I doubt he intended it to be taken that way. This citation should be removed.
  10. Overall, the Licensing section seems like it is missing what I’d consider the two most important links: Microsoft’s Open Specification Promise, and to the Baker & McKenzie analysis.
  11. I would move the packaging and relations text into its own article called “Open Packaging Conventions” or OPC. The basic structure here will be used in other Microsoft formats like XPS so it makes sense to centralize it in one place and reference it from here.
  12. Under Document Markup Languages, I’d drop the discussion of the 2003 formats. Move that to a different article if needed. Ditto for DataDiagrammingML.
  13. Under Criticism, there needs to be some references cited. There is no shortage of criticism and no shortage of references for that. If you want primary source material, I’d suggest GrokLaw list as the most comprehensive. It cannot simply be denied or ignored that there is a large amount of criticism out there. It will look silly for Wikipedia if OOXML is defeated in ISO and the day prior there was not even a mention of criticism on its Wikipedia page.
  14. Market Adoption — This section seems to be talking more about application support than adoption. I suggest it be renamed “Application Support” and “Adoption” be reserved for notable adoptions of the standard at the state or national level if/when they occur. OpenOffice’s support of WordProcessing 2003 doesn’t belong here, but Novell’s announcement that they will add OOXML should be here.
  15. A note throughout — this article could use some copy editing. As expected with any text written by several people over time, not all native English speakers, there are differences in levels of formality and a good number of language errors.

That’s about it. I didn’t see anything all that unusual or extreme here. If anything I found it odd that there are no links in this article to anything critical of OOXML, even though Microsoft’s stated reason for contracting with Jelliffe was to correct bias in the article. What am I missing? Where is this horrible slant? If anything, this article is living in some dream world where OOXML is not being heavily criticized for being too large, too rushed and too poorly written. If an article with no links to criticism is considered “heavily written by IBM”, I’d hate to see what Microsoft thinks an objective article reads like.

Filed Under: OOXML, Wikipedia

Linus’s Law Applied to Standards Review

2007/01/23 By Rob 1 Comment

Eric Raymond’s famous formulation in the Cathedral and the Bazaar was “given enough eyeballs, all bugs are shallow”. Since Code is text and Spec is text, so it is reasonable to ask if this same law might apply to reviewing a specification as well.

This proposition was put to the test this last weekend at GrokLaw, where a team of volunteers attempted to review the 6,000 page Ecma Office Open XML specification. Since the specification is already two-weeks into a 30-day review in ISO/IEC JTC1, a parallel approach was the indicated solution. The alternative, for each individual to review the specification in its entirety, would have required them to read at the rate of 200-pages/day for a month.

The team of around 20 contributors logged nearly 1,000 edits on the wiki they set up for their collaboration. The wiki received a further 4,000 page reads. This was done over a few days, but the bulk of the work was done just this weekend.

What they found is amazing. As you know, I have been reading the OOXML specification, on and off, for a few months now, noting in this blog the problems I’ve seen. I thought I had a good grasp of the problems. But I was wrong. I was just scratching the surface. The Microsoft guys think I have been complaining too much. But it now looks like I wasn’t complaining enough.

Take a look at the report. I’ll need a few days to read through the details and research some of the items. You can be sure I’ll follow up with some new posts to explain, in plain English, the significance of the new issues.

Also, GrokLaw has put out a call for concerned individuals to write to their nation’s JTC1 representatives, to give informed thoughts on whether OOXML should continue the process toward an ISO standard, or whether it should be taken off its current “Fast Track” because it contradicts existing standards. If you are a regular reader of this blog, you know what is at stake and you know what to do.

One final note. I’m so impressed with the results of this collaborative approach to standards review, that I’m going to investigate whether we can do the same thing at OASIS. We’ve been using a wiki internally for drafting new parts of the ODF 1.2 specification, and that has worked well. But I’d love it if the next time we had a public review period for ODF we could have the public also participate in editing content in the wiki and organize the process that way. It is a much better method than the non-interactive, linear pattern of a mailing list.

Filed Under: OOXML, Standards

Document Format Punditry

2007/01/23 By Rob 15 Comments

Rick Jelliffe, Mr. Schematron, who blogs for O’Reilly, recently announced that he had been contacted by Microsoft to see if he would be interested in a contract to edit the Office Open XML (OOXML) and Open Document Format (ODF) Wikipedia pages. As Rick says,

So I was a little surprised to receive email a couple of days ago from Microsoft saying they wanted to contract someone independent but friendly (me) for a couple of days to provide more balance on Wikipedia concerning ODF/OOXML. I am hardly the poster boy of Microsoft partisanship! Apparently they are frustrated at the amount of spin from some ODF stakeholders on Wikipedia and blogs.

I think I’ll accept it: FUD enrages me and MS certainly are not hiring me to add any pro-MS FUD, just to correct any errors I see. If anyone sees any examples of incorrect statements on Wikipedia or other similar forums in the next few weeks, please let me know: whether anti-OOXML or anti-ODF. In fact, I already had added some material to Wikipedia several months ago, so it is not something new, so I’ll spend a couple of days mythbusting and adding more information.

This immediately brought on an avalanche of commentary, on his blog, and elsewhere. As someone who also blogs on ODF/OOXML topics, I’d like to say a few words on the subject of document format punditry.

Few of my readers know me personally. They only know me via my words. Their acceptance or non-acceptance of this blog and what I say is largely determined by their perception of these two dimensions:

  • Authority — Am I an expert? Am I writing about things that I have direct knowledge of, or through education, training or direct experience would be expected to have worthwhile insights on?
  • Orientation — Do I have a bias on the subject being discussed. I’m not using the word “bias” in a pejorative sense, but to describe how far one’s views vary from a neutral, journalistic point of view, to a view that is overtly partisan on a particular issue. Bias is expected in opinion pieces, but not in Wikipedia articles.

My blog clearly comes with an expert, pro-ODF orientation. Additionally, I try to keep it light and humorous so even if a reader disagrees with me on one issue, at least they will be amused.

Looking at the range of people writing on these issues, I see the landscape something like this:

  • We have a number of highly informed experts in ODF and OOXML who aren’t really talking to each other.
  • We have the press, trying to be neutral, but having difficulty figuring out the significance of the technical issues since they are rather esoteric.
  • The General Public, who won’t even hear about the issues until the press figures it out.
  • And then we have various degrees of extremists of all varieties, not easily classifiable. Their writings are backed by ideological more than technical arguments. There are important ideological issues at stake in this debate, so these are voices are important.

What we seem to be lacking is the expert, neutral technical commentary. This is not too surprising. Many of the experts took sides a long time ago, or decided to sit this one out. That is understandable. But without this center of expert commentary, the press will continue to report the biases of whatever side they happen talk to first.

Where does Rick fit it into this chart? His expertise is undeniable. But if he takes Microsoft’s money he risks losing his reputation for neutrality. That is his choice and I am in no position to fault someone for that. He joins a crowded field of opinionated people already writing on this issue from one angle or another. He’ll likely be one of the better pro-OOXML writers out there. Nothing wrong with that. As Charles McCabe famously said, “Any clod can have the facts, having opinions is an art.”

But I do suggest that Microsoft’s money would have been better spent, and Rick’s skills better used, if they had engaged Rick earlier to help review and improve the OOXML specification. Trying to fix perceptions of the standard after the fact will be a lot harder, and more expensive, than creating a good standard in the first place.

And I will lament the fact that we continue to lack neutral experts who can digest the massive amounts of technical information out there and present it in a way that the press can reference and the public can understand. I think Rick would have served this role admirably. Instead we risk having one less voice in the middle.

Looking at this potential deal with Rick, and Microsoft’s earlier deal with Novell, I wonder if someone at Microsoft thinks that neutrality is dangerous and that their purposes are better served by eliminating it?

Filed Under: Blogging/Social, ODF, OOXML

The Parable of the Solipsistic Standard

2007/01/22 By Rob 3 Comments

Winter is finally here. It is dark and dreary, the ground hard, unyielding. I’m getting over a cold. My feet are never as warm as I’d like them to be. But still, I look forward to spring. The seed catalogs have arrived. I’m starting to review possibilities for the garden next year. It is Winter, but this is only a temporary affliction. Current misery coupled with the knowledge of eventual satisfaction — there doesn’t seem to be a single English word which captures this thought. So, I’ll coin a word, “Sperandomiseria”.

Og mil ten fit ghust lech fer ti nostu, pertents? Sperandomiseria, cuic cuic danto do quant fer nos protoblian, sed nuic, volte torma. Zherantilli, fer muc opsice inito brandu s’deko prot affti? Nek worchi fer ubir! Sperandomiseria, gher-kloj ven ter moido, ven ter zer-moidi, eggen ven ter moidisti miki-moiki.

Do you agree? I think this is a good argument and I see no practical downside. Something must be done soon, lest we experience a repeat next time.

Sorry, What is that? You have no idea what I am talking about? Oh. So you don’t speak Weirish? We’ll need to do something about that then. That’s what I’m speaking now, Ecma Weirish. See, I used to use English, but I found that the English language was missing words for some things I wanted to express, so I made up some new words for these ideas, to ensure that everyone would perfectly understand what I was saying, with no ambiguities.

Ini hag danto do abergi nec palmu, ven fec tolibissi, pert rami fer cuic cuic affti.

Pardon, you are still having problems? You want to know about the words in the English language that were already well-known, useful and descriptive, and why I didn’t just use those, and supplement them with new words as needed? Good question. Once I started making up new words, I found that none of existing words in English perfectly matched my usage of them. In fact I really couldn’t translate my thoughts perfectly into any existing language. My thoughts are so unique that no other language works well for them . A totally new language is a much more accurate way to notate my thoughts. I wonder why everyone doesn’t do it? If you use this language, you will understand me perfectly.

Og mil ven ter moidisti… What? You again? Why can’t you just speak Weirish? When you use English you just slow down my mental processing. Ah, so you want to know how to speak Weirish. Great. I’ll give you a starter word list:

  • Pertentare (v) — to walk like Rob walks.
  • Protoblia (n) — a nice person [Note: This cannot be fully defined within this word list. It is best defined by how Weir thought a nice person was back 15 years ago.]
  • Zherantillo (n) — where Rob keeps his keys, sometimes upstairs near the bedroom, sometimes by the front door, sometimes in a hidden place.

Rhodantillu, muc muc dilinorpthu, ac…

I’m a patient man. What else do you want to know? Why should Weirish be an International Standard? Because it matches my thoughts so perfectly. Everyone wants to know what I think, so it is good that they learn Weirish for that task. If you look closely, you see that there are hundreds of languages already out there. I should have one too.

How do you say, “Firefox” in Weirish? Umm… uhhh… well, you don’t. I only use Internet Explorer, so there is no word for “Firefox”. Just say “Internet Explorer 4.0” instead. That’s close enough, right? Ditto for “Linux”, “OpenOffice”, “KOffice”, “WordPerfect” or “MySQL”. Here’s a 6,000 page document on Weirish I dictated in my sleep last week. Don’t leave! Hey! I’ve given you everything you’ve asked for. A perfect language, a dictionary for understanding it, a very very long manuscript on it, everything. Please, don’t go! Amitambo n’itorno!


Change log

1/28/07 — Fixed broken link, put Weirish text in italics, fixed grammatical error in one of the Weirish passages.

Filed Under: OOXML, Standards

  • « Go to Previous Page
  • Go to page 1
  • Interim pages omitted …
  • Go to page 13
  • Go to page 14
  • Go to page 15
  • Go to page 16
  • Go to page 17
  • Go to Next Page »

Primary Sidebar

Copyright © 2006-2023 Rob Weir · Site Policies