Content

? Raw content
Help on Raw Content :
Although the previous section was dealing with structure-only indicators, this section is entirely devoted to the content in the Xml stream, that is values enclosed by elements, and attribute values.
As content is anywhere in the Xml stream, it is suitable that indicators follow a relevant rule : this rule is that we mostly compare content values for a given element, resp. for a given attribute. That's what we call a LOV (List of values) or simply a column. Below is a sample :
  ...
  <name firstname="John" lastname="Fitzgerald"/>
  <name firstname="Matt" lastname="Kassv"/>
  <name firstname="Steven" lastname="Witcold"/>
  <name firstname="Thomas" lastname="FukSiebl"/>
  <name firstname="John" lastname="Smith"/>
  ...
In this sample, there are two elementary List Of Values : the one with all firstname attribute values, the one with all lastname attribute values. There are obvious relations inside each List Of Values. For instance, in firstname, John appears twice so we may come up with a value duplication rate. Such figures stress the need to factorize content to avoid size overhead. Of course, indicators do not go in a such high level of details such like a List Of Values. Indicators are averages or ratios over the whole Xml stream. Close this one
Help on Content minimum size :
This is the minimum size in bytes of a value in the Xml stream.
It uses the UTF8 encoding charset, so it is roughly half the size of the Unicode equivalent. If there is at least one occurence of an empty tag (either element or attribute), then this is 0. If this figure is high, say above 20, then one may question whether Xml is a suited format. Close this one
Help on Content maximum size :
This is the maximum size in bytes of a value in the Xml stream.
It uses the UTF8 encoding charset, so it is roughly half the size of the Unicode equivalent. If this figure is high, say above 20, then one may question whether Xml is a suited format. Close this one
Help on Content mean size :
This is the average size in bytes of values in the Xml stream.
This figure reveals the distribution of content size in the stream. If this figure is high, say above 20, then one may question whether Xml is a suited format. Close this one
Help on Empty element tags :
This is the ratio of empty element tags in the Xml stream.
An empty element tag is an element with no content inside. There are several syntaxes including :

First syntax for an empty element tag
  <book></book>

Second syntax for an empty element tag
  <book>
   <title>The Round Door</title>
  </book>

Third syntax for an empty element tag
  <book/>
In both cases, the associated ListOfValues of the book tag is empty.
An empty element tag is not an element missing at least one value somewhere. An empty element tag is an element with no content in all occurences.
If there is a significant amount of empty element tags, then one may question why elements are used, instead of simple attributes, thus avoiding overhead in size. However, empty element tags may add some form of value on the relationship with other element tags, especially hierarchy.Close this one
Help on Empty attribute tags :
This is the counterpart of Empty element tags. This is the ratio of empty attribute tags in the Xml stream.
An empty attribute tag commonly refers to an attribute where none of its occurences have a value. Below is a sample:

Sample of what's an empty attribute tag, here birthdate
 <person name="Smith" birthyear=""/>
 <person name="Zergov" birthyear=""/>
 <person name="Nelson" birthyear=""/>

Sample of what's NOT an empty attribute tag
 <person name="Smith" birthyear=""/>
 <person name="Zergov" birthyear="1980"/>
 <person name="Nelson" birthyear=""/>
If there are empty attribute tags, then one may question why they are used. Close this one
Help on Multiple part values :
This ratio stresses the significance of values in element tags that are splitted in several parts.
A multiple part value, is either a multiline value, or a value disseminated between more than one element. A multiple part value is always related to an element value, not an attribute value (see W3C Xml norm). Below are two samples of multiple part values :
  <book>
   The name of this book is so inadequate for a general audience
   that it has been decided not to print it.
  </book>
  ...
  <book>The Round Door
   <year>1999</year>
   <price>20$</price>Part II
  </book>
The actual value for the second sample is "The Round Door Part II". Close this one