Hi Syd,
Thank you for the links -- this is a treasure!
Cheers,
Dimitre
On Fri, Oct 31, 2014 at 11:27 AM, Syd Bauman s(_dot_)bauman(_at_)neu(_dot_)edu
<xsl-list-service(_at_)lists(_dot_)mulberrytech(_dot_)com> wrote:
Sorry to be late to the party. But Dimitre, you are of course welcome
to the entire WWP corpus, if you like.
Other thoughts:
* Just last week the TAPAS project announced they are open for
(rudimentary) business. Navigate to any document at
http://www.tapasproject.org/ and there's a "Download TEI" button.
* Any of the DocSouth collections is available in XML (TEI P4). See
http://docsouth.unc.edu/docsouthdata/
* Sticking with the "south" theme, lots of TEI P5 manuscript
descriptions are available from
http://aso.obdurodon.org/browse.php.
* The voting record of the United States Senate in XML is quite a
pain to find; e.g.:
http://www.senate.gov/legislative/LIS/roll_call_votes/vote1122/vote_112_2_00172.xml
* There's this Extremely good conference which keeps its proceedings
online, and a lot of the papers include the source XML. See
http://conferences.idealliance.org/extreme/biblio.html
and
http://www.balisage.net/Proceedings/index.html
E.g.
http://www.balisage.net/Proceedings/vol10/xml/Novatchev01/BalisageVol10-Novatchev01.xml
(Alright, that particular one is probably not helpful to you :-)
* If you'd like some pretty bad OCR-software-generated XML, each
document in the collection at http://ufdc.ufl.edu/grimm has a link
to the "TEI".
* And, of course, if you want lots of examples of really awful XML,
many of the .plist files on a Mac OS X system will be in the XML
syntax.
--
Syd Bauman, EMT-Paramedic
Senior XML Programmer/Analyst
Northeastern University Women Writers Project
s(_dot_)bauman(_at_)neu(_dot_)edu or
Syd_Bauman(_at_)alumni(_dot_)Brown(_dot_)edu
--~----------------------------------------------------------------
XSL-List info and archive: http://www.mulberrytech.com/xsl/xsl-list
--~--