Re: a few short notes


At 08:16 03/02/2004, Iljitsch van Beijnum wrote:

I wrote a program in C that takes a Netscape bookmark file and stores thecontent in a database. This is just under 300 lines and it's prettystupid, it certainly can't handle all variations of HTML.
It could be lack of programming prowess on my part, but I find parsingHTML / XML syntax incredibly inconvenient. The most troubleshome part isthat you can't just work left to right, you have to look for close tagsand so on.
Also, it just doesn't make any sense.
Why is it <input type=blah> but <title>blah</title> ? Something likeinput="blah" title="blah" would be much better.

I couldn't agree more. I wrote a basic XML parser which could do just whatwe needed and no more and it was 259 lines long (that's not counting theSTL libraries it used). This built up a tree structure of the XML. Therewas even more code to grab the particular XML element I wanted from thattree structure. I can't see how you could do it in 12 lines apart from justto find a specific tag value.

OTOH, my RFC822 header email-address-aware parser is only 220 lines long.(If I didn't need to parse email addresses it would be MUCH shorter). Aparser for a better designed plain text metadata format could easily be inthe region of 50 lines or less.

I don't like RFC822 headers, but I think there are simpler alternatives toXML which I'd prefer. I wouldn't die if it did turn out to be XML, but I'dlike a good reason rather than 'it's the new way of doing things'.



Paul                            VPOP3 - Internet Email Server/Gateway
support(_at_)pscs(_dot_)co(_dot_)uk                     http://www.pscs.co.uk/

<Prev in Thread]	Current Thread	[Next in Thread>
Re: a few short notes, (continued) Re: a few short notes, Chuq Von Rospach Re: a few short notes, Paul Robinson Re: a few short notes, Martin Duerst Re: a few short notes, Paul Robinson Re: a few short notes, Paul Crowley Re: a few short notes, Paul Robinson Re: a few short notes, Paul Crowley Re: a few short notes, Hector Santos Re: a few short notes, Iljitsch van Beijnum XML consideration. [Was Re: a few short notes], Hector Santos Re: a few short notes, Paul Smith <= Re: a few short notes, Chuq Von Rospach Re: a few short notes, Martin Duerst Re: a few short notes, James Seng Re: a few short notes, Paul Smith Re: a few short notes, Chuq Von Rospach Message not available Re: a few short notes, Chuq Von Rospach Anonymity, Jacob Palme Re: a few short notes, Martin Duerst Re: a few short notes, James Craig Burley Re: a few short notes, Jari Arkko