ReadSmart™
• Language Technologies, Inc.
• Plug-in generates XML from PDF files
  (or Word, Quark, etc.)
• Formatter adjusts word placement
• Plug-in moves words
The data I have alluded to comes from a product called ReadSmart™, made by Language Technologies, Inc., for which I do consulting work.
We have a plug-in for Adobe Acrobat that converts a PDF file to an XML file containing enough information for our formatting purposes.
A formatter reads in the abstract document described in the XML file and formats it.
The plug-in then converts the XML back to PDF, applying the formatting changes by moving the words.
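
To make the middle step of that pipeline concrete, here is a rough sketch of what a formatter pass over such an XML file could look like. Everything specific in it is an assumption for illustration: the element names (`page`, `line`, `word`), the `x` and `width` attributes, and the `shift_words` helper are hypothetical, and the actual ReadSmart schema and formatting rules are not shown here. The sketch only illustrates the idea of reading word positions from XML, adjusting them, and writing them back out for the plug-in to apply.

```python
import xml.etree.ElementTree as ET

# Hypothetical XML in the spirit of what the plug-in might emit.
# The real schema is not described in this talk.
SAMPLE_XML = """<page>
  <line y="700">
    <word x="72.0" width="30.0">The</word>
    <word x="105.0" width="28.0">data</word>
    <word x="136.0" width="40.0">format</word>
  </line>
</page>"""


def shift_words(xml_text, from_index, dx):
    """Shift every word at or after from_index on each line by dx points.

    This stands in for a real formatting rule, which would decide
    per word how far to move it.
    """
    root = ET.fromstring(xml_text)
    for line in root.iter("line"):
        for i, word in enumerate(line.findall("word")):
            if i >= from_index:
                new_x = float(word.get("x")) + dx
                word.set("x", f"{new_x:.1f}")
    # Serialize back to XML text for the plug-in to read.
    return ET.tostring(root, encoding="unicode")


adjusted = shift_words(SAMPLE_XML, from_index=1, dx=4.0)
print(adjusted)
```

The point of the design is the separation: the formatter only ever touches the abstract XML description, so the same formatter could sit behind plug-ins for other source formats.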

I’ll do a quick demonstration of this.
Here is what the data looks like in a text editor.
Any questions?