Would a character count of the output (with white space stripped) suffice?
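A minimal sketch of what such a count might look like, in Python rather than XSLT. It assumes both the source and the result are available as well-formed XML strings (SGML input would first need normalizing to XML by some other tool), and the function name and sample element names are made up for illustration:

```python
import re
import xml.etree.ElementTree as ET


def text_char_count(xml_string: str) -> int:
    """Count the non-whitespace characters in all text nodes of a document."""
    root = ET.fromstring(xml_string)
    text = "".join(root.itertext())
    # Drop every run of white space so that indentation and line
    # wrapping differences do not affect the count.
    return len(re.sub(r"\s+", "", text))


# Hypothetical sample data, not real S1000D markup.
source = "<para>Remove the <b>panel</b> and\n  inspect it.</para>"
good = "<p>Remove the <em>panel</em> and inspect it.</p>"
bad = "<p>Remove the  and inspect it.</p>"

print(text_char_count(source) == text_char_count(good))  # True
print(text_char_count(source) - text_char_count(bad))    # 5
```

A matching count only shows that nothing was lost on balance. It will not say where text went missing, and any text the stylesheet generates or moves into attributes would throw the numbers off.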
regards
On Fri, 22 Jan 2021 at 12:11, Paul Tyson phtyson(_at_)sbcglobal(_dot_)net
<xsl-list-service(_at_)lists(_dot_)mulberrytech(_dot_)com> wrote:
I had a similar challenge some years back. I met it with Schematron.
Write a second stylesheet that generates Schematron rules describing the
expected structure and content of the output files. Run these against the
output.
Of course there is a risk that your Schematron rules will be incomplete,
whether in the same way as your transformation stylesheet or in a different
way. But they will be able to check for known errors, such as the one you
have already discovered.
Regards,
--Paul
On 1/22/21 5:29 AM, ian(_dot_)proudfoot(_at_)itp-x(_dot_)co(_dot_)uk wrote:
Hi everyone,
I am working on a project to convert several thousand SGML files (S1000D 1.7)
into a more recent XML version (S1000D 4.1). My finished XSLT stylesheet
does the job that is expected. However, during development I ran
into a problem where an error in the stylesheet allowed the output to pass
schema validation while omitting some content! For me that's very bad news
and I was lucky to notice it. Ultimately the final output will be verified
by the subject matter experts, but I really don't want to give them any
reason to doubt the reliability of the conversion.
This got me thinking about ways to verify the output text content against the
input despite significantly different structure. Is there an established way
to do that? If so what is it called and how well does it work?
Perhaps it's something that I should build into the XSLT as it is written?
Or perhaps it could be run as a post process batch comparison operation?
My initial thought is to output normalized text from input and output and
compare the resulting text files...
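As a rough illustration of that idea, here is a Python sketch that extracts the text from both documents, normalizes white space by splitting into words, and reports runs of source words that have no counterpart in the output. It assumes both sides can be parsed as well-formed XML (the SGML would need converting first), and the function names and sample markup are invented for the example:

```python
import difflib
import xml.etree.ElementTree as ET


def normalized_words(xml_string):
    """Return the document's text content as a list of words."""
    root = ET.fromstring(xml_string)
    # split() with no argument collapses any run of white space.
    return "".join(root.itertext()).split()


def missing_from_output(source_xml, output_xml):
    """List runs of source words that are deleted or altered in the output."""
    src = normalized_words(source_xml)
    out = normalized_words(output_xml)
    matcher = difflib.SequenceMatcher(a=src, b=out, autojunk=False)
    missing = []
    for tag, i1, i2, _j1, _j2 in matcher.get_opcodes():
        if tag in ("delete", "replace"):
            missing.append(" ".join(src[i1:i2]))
    return missing


# Hypothetical sample data, not real S1000D markup.
source = """<proc>
  <step>Remove the panel.</step>
  <note>Wear gloves.</note>
  <step>Inspect the seal.</step>
</proc>"""

output = """<procedure>
  <para>Remove the panel.</para>
  <para>Inspect the seal.</para>
</procedure>"""

print(missing_from_output(source, output))  # ['Wear gloves.']
```

This only works well where the conversion keeps the text in the same order. Reordered content, generated text, and words whose boundaries depend on markup rather than white space would all show up as differences that need checking by hand.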
I've searched the archives, but I probably don't know the correct
terminology to get any useful results...
Thanks in advance for all responses.
Ian
Ian Proudfoot
Bembridge
Isle of Wight
United Kingdom
XSL-List info and archive
EasyUnsubscribe (by email)
--
Dave Pawson
XSLT XSL-FO FAQ.
Docbook FAQ.
--~----------------------------------------------------------------
XSL-List info and archive: http://www.mulberrytech.com/xsl/xsl-list
EasyUnsubscribe: http://lists.mulberrytech.com/unsub/xsl-list/1167547
or by email: xsl-list-unsub(_at_)lists(_dot_)mulberrytech(_dot_)com
--~--