jeremyf(_at_)isscorp(_dot_)com wrote:
I have been earnestly trying to find a tool that
converts a .PDF file to .FO or some close
approximation of that.
this problem is Hard in the general case. I have had some success with
pdftohtml <http://pdftohtml.sourceforge.net/> in XML mode. Last time I did
this, pdftohtml didn't created FO, and actually needed some help to generate
well-formed XML. The output was useful, though.
Beware of multicolumn output, weird character encoding and/or weird character
subsetting. Here be dragons. Big ones.
I'm told that a company here in Toronto founded by one of the SoftQuad people
is working on a commercial solution to this with good AI.
Stewart
XSL-List info and archive: http://www.mulberrytech.com/xsl/xsl-list