Charles Explorer logo
🇬🇧

Optimization and Refinement of XML Schema Inference Approaches

Publication at Faculty of Mathematics and Physics |
2012

Abstract

XML is a widely used technology. Although in most real life applications XML data is required to conform to particular schemas, the majority of real-world XML documents does not contain any explicit declaration.

To fill the gap, the research area of automatic schema inference from XML documents has emerged. This paper refines and extends recent approaches to the automatic schema inference by exploiting an obsolete schema in the inference process, designing new MDL measures and heuristic excluding of eccentric data inputs.

It delivers a ready-to-use implementation integrated into jInfer - a framework for XML schema inference. Experimental results are a part of the paper.