? Content correlation
Help on Content correlation :
The content correlation is an in-depth examination of List Of Values that reveals valuables things. The first indicator is related to duplication, or how often the same values appear again and again. The second indicator is a ranking, it shows the most seen value in all List Of Values. Close this one
Help on Max Content Duplication :
This ratio extracts from the set of List Of Values what is the highest level of duplicated values.
If the ratio is high, that's because at least one List Of Values has its values highly correlated, thus candidate for factorization.
The maximum ratio works with other indicators such like Mean and Standard Deviation to show the distribution of correlation in the whole set of List Of Values. Close this one
Help on Mean Content Duplication :
This is the average duplication in the Xml stream. It works with the Standard Deviation to reveal the distribution of correlation in List Of Values.
It is based on the examination of the set of List Of Values. It is quite a good indicator of whether the content is unique and distinct, if the mean is low. On the contrary, if the mean is high, it means that several List Of Values are candidates for factorization.
Close this one
Help on Standard Deviation Content Duplication :
This is the standard deviation for correlation in the set of List Of Values, and it reveals the distribution of correlation. If this is low, say below 1, that's because only one List Of Values has correlated values. If this is high, that's because the Xml stream is not exactly optimized. Close this one
Help on Topmost Duplicated Content :
This is a value extracted from the Xml stream. It is based on the ranking of correlation in the List Of Values. The value is the first in this ranking. Close this one