xsl-list

Re: [xsl] plea for help...

2006-03-09 09:59:18

On Wed, March 8, 2006 5:30 pm, Alexander Johannesen wrote:
Hi,


On 3/9/06, Walter Torres <walter(_at_)torres(_dot_)ws> wrote:

Well, now that I've sold the idea, I'm stumped on how to do it.

It depends quite dramatically what those HTML files contain. If
they're reasonably semantic instead of presentation based, it should
be easy. The other way is damn hard. Here's what I would do:

1. Try various ways to convert the HTML files to either Wiki notation
   or OpenDocument formats; get rid of presentation markup and focus on
   the semantics.
2. Convert the files from the Wiki notation to clean XHTML, and
   use CSS for presentation.

I found "HTML Tidy".

It looks like it's handling the files I've been throwing at it pretty well.


Also, how do you deal with display characteristics in XML, like BOLD
and ITALIC, or even BULLET LISTS (<ul><li>)?


This again depends on what you're trying to achieve with this
conversion. XHTML *is* XML, so you're already there. If you're doing
something semantically outside the scope of HTML with the files, what
is it?

I'm looking to strip these files down to a very "basic" format so that I can
display them in almost any media:
 * web pages, with various looks
 * RSS feeds for other sites to pull the data
 * text for print media
 * WML for phones and hand-helds
 * etc
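
As a rough illustration of the "one basic source, many outputs" idea in the list above, here is a minimal Python sketch (standard library only). It takes a small, made-up XHTML fragment and renders it as plain text, turning <li> items into "* " bullets and dropping <b>/<i> emphasis, then wraps that text in a bare RSS <item>. The names to_text, to_rss_item and FRAGMENT are hypothetical, and the item is not a complete feed (a real feed would also need a <channel>, links and so on).

```python
import xml.etree.ElementTree as ET


def to_text(elem):
    """Flatten an element to plain text; each <li> becomes a '* ' bullet line."""
    tag = elem.tag.split("}")[-1]  # drop a namespace prefix such as {http://...}
    parts = []
    if tag == "li":
        parts.append("* ")
    parts.append(elem.text or "")
    for child in elem:
        parts.append(to_text(child))
        parts.append(child.tail or "")  # text that follows the child element
    if tag in ("p", "li", "h1"):
        parts.append("\n")  # block-level elements end their line
    return "".join(parts)


def to_rss_item(title, body_text):
    """Build a bare RSS <item> (illustrative only, not a full feed)."""
    item = ET.Element("item")
    ET.SubElement(item, "title").text = title
    ET.SubElement(item, "description").text = body_text
    return ET.tostring(item, encoding="unicode")


# Hypothetical sample content, not taken from the real files.
FRAGMENT = (
    "<div><h1>News</h1><p>Some <b>bold</b> words.</p>"
    "<ul><li>one</li><li>two</li></ul></div>"
)

root = ET.fromstring(FRAGMENT)
text = to_text(root)
rss = to_rss_item("News", text)
print(text)
print(rss)
```

The point of the sketch is only that semantic markup (<b>, <ul>, <li>) can stay in the source and each output decides what to do with it; on this list the same mapping would more naturally be written as one XSLT stylesheet per target.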


If you just want to display them back again,

Yes I do, as I outlined above...


just leave them as XHTML. I'm sure the interesting parts of the articles
start and stop at identifiable points, so extract only those.
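
A minimal sketch of that extraction step, in Python with only the standard library. It assumes the files are already well-formed XHTML (for example after a clean-up pass) and that the article body sits in an element with a known id; the id "content", the function name extract_article and the SAMPLE page are all assumptions made for illustration.

```python
import xml.etree.ElementTree as ET


def extract_article(xhtml_text, content_id="content"):
    """Return the first element whose id equals content_id, or None."""
    root = ET.fromstring(xhtml_text)
    for elem in root.iter():
        if elem.get("id") == content_id:
            return elem
    return None


# Hypothetical page: navigation chrome plus the part worth keeping.
SAMPLE = """<html xmlns="http://www.w3.org/1999/xhtml">
  <head><title>Example</title></head>
  <body>
    <div id="nav">Home | About</div>
    <div id="content"><h1>Headline</h1><p>Body with <b>bold</b> text.</p></div>
  </body>
</html>"""

article = extract_article(SAMPLE)
# Only the text inside the content element survives; the nav is dropped.
print("".join(article.itertext()))
```

If the real files have no such id, the start and stop points would have to be found some other way (a heading, a comment marker, a table cell), so treat the lookup rule as a placeholder.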

If they are left as XHTML, won't that affect how the files are handled by
various "devices"?

Or am I over-thinking this?

walter


--~------------------------------------------------------------------
XSL-List info and archive:  http://www.mulberrytech.com/xsl/xsl-list
To unsubscribe, go to: http://lists.mulberrytech.com/xsl-list/
or e-mail: <mailto:xsl-list-unsubscribe(_at_)lists(_dot_)mulberrytech(_dot_)com>
--~--
