Re: [xsl] citation processing

At 11:32 AM 10/20/2006, Andrew wrote:

If you think it's not really feasible to parse a plain text citation
into a marked up version then that's good feedback - it could well be
that a percentage need to be done by hand.

Scale is a real issue here. Real-world citation formats include
variations like "use 'pp.' on page ranges for articles in books, but
not for articles in journals." At scale, even if your process does
the correct thing with 85 of 100 citations (a very optimistic rate),
that can leave scores of incorrect ones. And if your upconversion
can't recognize where it's failing, you have to find the errors
before you can fix them.

David is right: it's ultimately an NLP problem (though a very
interesting subset of NLP). As he also says, success depends both on
handling the rules properly, and on the input actually following
those rules. (There are dozens of citation formats around, too.)
"Never say never" is good to keep in mind, but when I'm asked to look
at citations I immediately start asking questions about the scope of
the input, its validation, and acceptable strategies for exception
handling. When told there won't be any exceptions it's usually pretty
easy to find a bunch.
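
To make the exception-handling point concrete, here is a rough sketch
(in Python rather than XSLT, just to keep it short) of an upconversion
that knows when it is failing. Everything in it is an assumption for
illustration: the single citation pattern, the field names, and the
function names parse_citation and upconvert are hypothetical, not any
real citation format or library. The idea is only that anything the
rules do not recognize goes to a hand-review pile instead of being
guessed at.

```python
import re
from typing import Optional

# Hypothetical pattern for ONE made-up journal-article style:
#   Author(s) (Year). Title. Journal Volume, FirstPage-LastPage.
# Real input would need many such rules, one set per citation format.
JOURNAL_ARTICLE = re.compile(
    r"^(?P<authors>[^()]+?)\s+\((?P<year>\d{4})\)\.\s+"
    r"(?P<title>[^.]+)\.\s+"
    r"(?P<journal>[^\d,]+?)\s+(?P<volume>\d+),\s+"
    r"(?P<first>\d+)-(?P<last>\d+)\.$"
)


def parse_citation(text: str) -> Optional[dict]:
    """Return the marked-up fields, or None if the rules do not apply."""
    match = JOURNAL_ARTICLE.match(text.strip())
    if match is None:
        return None
    fields = match.groupdict()
    # Cheap validation: a page range that runs backwards is suspicious,
    # so treat it as an exception rather than trusting the match.
    if int(fields["first"]) > int(fields["last"]):
        return None
    return fields


def upconvert(citations):
    """Split the input into parsed citations and ones needing hand review."""
    parsed, exceptions = [], []
    for citation in citations:
        fields = parse_citation(citation)
        if fields is None:
            exceptions.append(citation)
        else:
            parsed.append(fields)
    return parsed, exceptions


sample = [
    "Smith, J. (2004). Parsing citations. Journal of Markup 12, 101-119.",
    "Jones, A. In: Some Book, pp. 5-9.",
    "Doe, R. (1999). Title here. Some Journal 3, 50-40.",
]
parsed, exceptions = upconvert(sample)
print(len(parsed), "parsed;", len(exceptions), "need hand review")
```

This does nothing about citations that match a rule but are tagged
wrongly, which is the harder case; it only keeps the known failures
from hiding among the successes.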


Cheers,
Wendell

