xml-schema and RDFS
Roger wrote off-list:
We might have a complexType for a 'personType' and then create an element within 'bookType' called 'author' that is of type 'personType'. Is the notion of "an author" a property of a bookType or the extension of the notion of a person?
I think neither. If in any given OO-model you define a class or struct "Coordinates" with lat and long, and you use it for two properties (say TranssectStart, TransectEnd) of another class, you are not extending "Coordinates". Both xml-elements and xml-attributes are OO-properties of the current OO-class (called "Type" in xml-schema). Attributes are always simple types (OO: value- types), whereas elements can be simple or complex (or mixed, but let us forget that - all TDWG standards avoid mixed type). Confusing is that XML schema often allows you to be implicit about this, by defining "anonymous types". So you can make complex nested structures without giving the types names. Or even define global elements. But in principle, you are creating a composition of class- definitions - just that type-naming can be omitted.
In an instance document the thing encoded within the book>author element gets the meaning of person from the type hierarchy but the meaning of author from its position within a bookType and the name of the element. Should we really create a complexType called authorType that extends personType and the author element should be of type authorType. (i.e. are the semantics in the complexType hierarchy) or should we just have a author element of type person (i.e. some of the the semantics are in the document structure and some in the type hierarchy)?
I think not. Otherwise in current OO programming you would have to extend the class "string" for each property where you want to use the type string.
Simply put: Are the semantics encoded in the XML Schema or in the structure of the XML instance documents that validate against that schema? Is it possible to 'understand' an instance document without reference to the schema?
You are correct that the semantics of a property are not defined in an ontological way, but they are implied by the property that is using a type. (They are implied in the *schema*, not the "instance"). If you add several string properties to a class in any Java, C++, Python, Visual Basic, etc. this is what you do. That is why I at the moment feel that xml-schema is intuitive to users of OO languages like the ones named. RDFS seems strange - at least to me. I realize that more OO-languages exist, I just perceive the ones "mainstream" and know little about the others.
How do we map 'concepts' in two schemas to each other if one encodes meaning in the document structure and the other in the type hierarchy? Even doing it manually how do we express that authorType in one schema is equivelent to book>author(of type person) in another. This gets very complex as the Tapir team have found. Is there are concept for every possible element in every possible instance document for a given schema? What about iterative nested structures?
I have no answer other than simple xml transformation based on a one time (per schema combination) analysis. Although I do not believe we will truly achieve backward compatibility (it could be nominal, but not useful!) very soon, regardless of technology, these questions are certainly important for schema evolution. Do we have to go to the length of atomizing our models into subject-predicate- object tupels, or do we have other options? I find xml-schema a much simpler technology, despite all its shortcoming, some of which we have encountered ourself. Can we answer Roger's question about mapping concepts from one schema to another based on xml-schema standards like TDWG has produced so far? Gregor BTW: I just wonder how you do inheritance of a class with properties in RDFS. It seems to me that since in RDF/RDFS you do not know the number of properties a class may have, you can always only extend an abstract ("property-less") concept, but never communicate an agreement that two pieces of software desire to use an agreed set of properties. Is that correct?---------------------------- ------------------------------ Gregor Hagedorn (G.Hagedorn@bba.de) Institute for Plant Virology, Microbiology, and Biosafety Federal Research Center for Agriculture and Forestry (BBA) Königin-Luise-Str. 19 Tel: +49-30-8304-2220 14195 Berlin, Germany Fax: +49-30-8304-2203
participants (1)
-
Gregor Hagedorn