Wednesday, March 28, 2007

The ODF Validation Service

No, this has nothing to do with getting discounted parking if you use ODF, though that is an intriguing idea...

Daniel Carrera (OpenDocument Fellowship and the OASIS ODF TC) has a new blog and with it comes news of a new ODF tool, an ODF Validator Service, written as part of the Fellowship's ODF Tools project by Alex Hudson.

It is in the spirit of the W3C's Markup Validation Service: upload a document and get an instant report of whether or not it is valid ODF, and if not, what problems were found. I tried a few documents and it seems to work well.

It would be interesting to see if something like this could be made into a flexible framework for scanning ODF documents, at various levels. Think of a SAX-like call-back parser but at multiple levels of detail. So the framework knows how to fully parse an ODF document and identify features at the Zip and XML level. Plugins to the framework can subscribe to various parse events. So, maybe a ZipListener interface that simply has methods onFile() and onDirectory(). Then a ManifestListener interface that allows you to subscribe to notifications of the data in the manifest. Then within a document, like a spreadsheet, you could have listeners at the structural and content level, so onWorksheet(), onCell(), or in a Wordprocessor document, onTable(), onImage(), etc.

A framework like this could allow you to make a range of applications that need to scan an ODF document and take some action on it.


The benefit of the framework is the reduction in code required to get directly to the info in the ODF document you want, without having to master the ODF specification or writing a lot of parsing code. Think of it as a framework for easy multi-level information extraction from ODF documents.

Change Log

4/11/2007 — Removed parenthetical comment about the need for a privacy policy, since one has now been added to the Validator page.

Labels:

Comments:
Interesting thoughts.

The validator looks like a nice project and the ODF Tools will be welcome. It looks like these can only get richer, as you surmise.
 
Not a bad idea. I know I wrote a C# library for parsing in ODF files, mainly so I could find all the paragraphs of a certain style ("Command"), process them and substituted the results of the internal program with that paragraph. It ended up making a nice little toolchain for my documents, more so when I could build arbitrary tables and styles based on a database.

That and a little tool for converting spreadsheet sheets into tab-delm files.

One reason I love ODF is that it is such a nice clean interface, fairly easy to understand, and doesn't really have excess cruft of history.
 
Why would anyone except for an ODF implementor validate ODF files on line ?

An implementor seem more likely likely to directly use the ODF XML schema files for validating his created documents.
What does this tool add to ordinary schema validation ?
 
You could ask the same question about the W3C's HTML/XHTML/CSS validators. But they seem to get a lot of use. You don't need to be an HTML editor vendor to have a need for validating HTML.

If you look around the site I linked to you can see that this is more than just schema validation. For example, ODF specifies other constraints, such as referential integrity constraints between the manifest and content XML's, that are not expressible in RELAX NG.
 
Hi Rob. I just saw your post. I talked to Cyclone3 and we just put together a privacy policy. Please drop me an email if we missed anything.

http://opendocumentfellowship.org/validator
 
Post a Comment

Links to this post:

Create a Link



<< Home

This page is powered by Blogger. Isn't yours?