Re: Problem with encoding UTF-8

2004-12-15 10:24:56
David at roamware writes:
If I take the file he sent me and, in UltraEdit-32, use the UNICODE/UTF-8 ->
UTF-8 conversion option, save the file, and then pass it through my program,
all works fine. This ambiguous conversion is explained thus: "This function
will convert the complete file from Unicode or UTF-8 (ASCII representation)
to UTF-8 (with the file internally as Unicode)".

So I am at a bit of a loss to explain what the file format has to do with
this; the PDF exports the file with "encoding=UTF-8" in the XML declaration.
Any experience of this behaviour and how to get around it? I cannot change
what the PDF exports, so it will have to be a "not strict" switch or
something on the parser, I suppose (I couldn't find a reference to such a
thing, mind you).

Can you examine the differences between the two files?  (I would use GNU
Emacs and its M-x ediff-files command.)

What does GNU recode tell you about the original file when you "convert"
it with utf-8..dump (with and without the --strict option)?
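
If neither tool is at hand, the same two checks can be roughly sketched in
Python. This is only an illustration, not what recode or ediff do
internally: the function names diagnose_utf8 and first_difference are
made up for this example, and it assumes both files are small enough to
read into memory as raw bytes.

```python
import codecs


def diagnose_utf8(data: bytes) -> dict:
    """Report any byte-order mark and the first byte that is not valid UTF-8.

    Hypothetical helper for illustration only.
    """
    report = {"bom": None, "valid_utf8": True, "error_offset": None}
    if data.startswith(codecs.BOM_UTF8):
        report["bom"] = "utf-8"
    elif data.startswith((codecs.BOM_UTF16_LE, codecs.BOM_UTF16_BE)):
        report["bom"] = "utf-16"
    try:
        data.decode("utf-8")
    except UnicodeDecodeError as exc:
        # exc.start is the offset of the first byte the decoder rejected.
        report["valid_utf8"] = False
        report["error_offset"] = exc.start
    return report


def first_difference(a: bytes, b: bytes):
    """Return the offset of the first differing byte, or None if identical."""
    for offset, (x, y) in enumerate(zip(a, b)):
        if x != y:
            return offset
    # One input is a prefix of the other (or they are equal).
    return None if len(a) == len(b) else min(len(a), len(b))
```

Reading the exported file and the UltraEdit-converted file with
open(path, "rb").read() and passing the bytes to these functions would
show whether the original is really UTF-8 despite its encoding
declaration, and where the two files first diverge.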

-- 
Kevin Rodgers

