[
Lists Home |
Date Index |
Thread Index
]
"Joshua Allen" <joshuaa@microsoft.com> writes:
> I think there is another big gap that HST people run into, which is
> that most of the existing HST tools for Unix are based on text files
> that are delimited by LF. Things like grep, cut, and wc are
> workhorses of HST, but totally inapplicable to XML. A set of
> similar command-line tools that operate on XPath instead of LF would
> be a huge help in encouraging adoption of XML, I think.
You might find the LT XML toolkit [1], which was designed to provide
exactly this sort of UNIX-style command line flexibility and
composibility, worth a look.
"The LT XML tool-kit includes stand-alone tools for a wide range of
processing of well-formed XML documents, including searching and
extracting, down-translation (e.g. report generation, formatting),
tokenising[, counting] and sorting."
ht
[1] http://www.ltg.ed.ac.uk/software/xml/
--
Henry S. Thompson, HCRC Language Technology Group, University of Edinburgh
W3C Fellow 1999--2001, part-time member of W3C Team
2 Buccleuch Place, Edinburgh EH8 9LW, SCOTLAND -- (44) 131 650-4440
Fax: (44) 131 650-4587, e-mail: ht@cogsci.ed.ac.uk
URL: http://www.ltg.ed.ac.uk/~ht/
|