xsl-list
[Top] [All Lists]

Re: [xsl] where to look for xsl folk..

2016-06-20 22:38:52
hey

Thanks for this information, and thanks for the other post about OxG (I
am very familiar with it). I have done a reasonable amount of research
in the open source world and I dont see anything fitting the bill.
Essentially I want a good stylesheet to run against a docx file and
provide 'good enough' HTML. It will be manually cleaned up and will stay
for the duration of its life as HTML.

I'd prefer to make a direct docx-> html conversion and not go through an
intermediary format. Also, Im looking for open source solutions.

Adam

On 06/20/2016 12:30 PM, G. Ken Holman 
g(_dot_)ken(_dot_)holman(_at_)gmail(_dot_)com wrote:
Indeed hard does not mean impossible.  The Inera folks have a strong
product named eXtyles for going from Word to various JATS derivatives
including ISOSTS that I am personally interested in:

  http://www.inera.com/resources/extyles-related-technologies

I haven't heard much of any other Word-based products ... but I post
this to point out that it has been done successfully commercially.

. . . . . . . Ken

At 2016-06-20 18:58 +0000, Wendell Piez wapiez(_at_)wendellpiez(_dot_)com 
wrote:

Hi,

On Mon, Jun 20, 2016 at 10:36 AM, Christopher R. Maden 
crism(_at_)maden(_dot_)org
<xsl-list-service(_at_)lists(_dot_)mulberrytech(_dot_)com> wrote:
On 06/19/2016 04:17 PM, adam adam@coko.foundation wrote:

We are working with docx files that need to be translated into
HTML. The
docx files are chapters of scholarly content that constitute a
book. We
need to translate the docx into a tidy HTML version with direct
translation of semantic elements but with the elimination of styles.

There are a few tools to do this kind of thing.  The Public Knowledge
Project is working on integrating them into a pipeline; it's not
ready for
prime time *quite* yet, but it's getting there, and the individual
components may be useful to you on their own.  Check out <URL:
https://github.com/pkp/xmlps > for source and more info.

Indeed there are a number of different such initiatives some of them
including XSLT and so on topic. :-)

(In fact didn't Eliot recently mention his thing for a Word -> DITA
pathway?)

Whether using XSLT (and on topic) or not -- converting from Word (what
I like to call a 'paintbrush' application) into strong markup is going
to be a hard problem, largely because its boundaries are not in an
obvious place, plus they move. It will always be contested what is in
scope vs what is not, and there will be a tradeoff between generic and
specialized capabilities.

Hard doesn't mean impossible, however, and what would be nice would be
a toolkit that could be adapted for local use....

Cheers, Wendell

-- 
Wendell Piez | http://www.wendellpiez.com
XML | XSLT | electronic publishing
Eat Your Vegetables
_____oo_________o_o___ooooo____ooooooo_^



-- 
Check our site for free XML, XSLT, XSL-FO and UBL developer resources |
Streaming hands-on XSLT/XPath 2 training @US$45: http://goo.gl/Dd9qBK |
Crane Softwrights Ltd. _ _ _ _ _ _ http://www.CraneSoftwrights.com/s/ |
G Ken Holman _ _ _ _ _ _ _ _ _ _ 
mailto:gkholman(_at_)CraneSoftwrights(_dot_)com |
Google+ blog _ _ _ _ _ http://plus.google.com/+GKenHolman-Crane/posts |
Legal business disclaimers: _ _ http://www.CraneSoftwrights.com/legal |


---
This email has been checked for viruses by Avast antivirus software.
https://www.avast.com/antivirus


-- 

---
Adam Hyde
http://www.adamhyde.net/projects
--~----------------------------------------------------------------
XSL-List info and archive: http://www.mulberrytech.com/xsl/xsl-list
EasyUnsubscribe: http://lists.mulberrytech.com/unsub/xsl-list/1167547
or by email: xsl-list-unsub(_at_)lists(_dot_)mulberrytech(_dot_)com
--~--

<Prev in Thread] Current Thread [Next in Thread>