sourceforge / freespace / guidod / pTA / TextArray
The XEE Project is short for "XML Query Execution Engine" being a prototype of a Query and Managment sytem for XML documents. At the heart of it there is a special special data structure called the Access Support Tree / TextArray (AST/TA). Within this AST/TA Model the TextArray manages the text content of XML documents.
I have been implementing this TextArray for block-oriented secondary storage media, called the persistant TextArray (or short "pTA"). It uses the well-known technique of a positional B-Tree for that. This allows for an implementation that handles both Queries and Updates quickly and efficiently.
In the course a study was written by me that looks at the environment of the AST/TA and the usage of XML documents being both queried and updated. The operations have strong needs beyond just scanning the text, modifications to the text content are usual and in many cases in an amount of words to phrases of natural languages.
The choice for using a positional-B-Tree turns out to be well suited to this task and the study contains some tables and diagrams that show the result of the implementation. Furthermore, a section of future work is given where the special implementation can be extended later to support specific characteristics of the AST/TA model.
Many of these pages are in German.
(C) 2003 Guido Draheim | 31-Jan-2003 |