[Date Prev]
| [Thread Prev]
| [Thread Next]
| [Date Next]
--
[Date Index]
| [Thread Index]
Re: [xml-dev] Represent Html Syntax Tree as RDF
- From: "Liam R. E. Quin" <liam@fromoldbooks.org>
- To: bryan rasmussen <rasmussen.bryan@gmail.com>, xml-dev<xml-dev@lists.xml.org>
- Date: Thu, 25 Jun 2020 00:38:05 -0400
On Wed, 2020-06-24 at 10:52 +0200, bryan rasmussen wrote:
> I'm looking for either a tool that when given a complete html
> document
> or a fragment thereof will generate an rdf description of the tree.
> If not a tool documents/studies showing how to do it.
I seem to remember Henry Thompson (or Richart Tobin) at Edinburgh
wrote a tool to convert from a post-schema-validation "infoset" into
rdf. The results are spectacularly impractical - for example, as i
recall, each c h a r a c t e r becomes a separate information item and
hence is the subject of a triple describing its location, considerably
complicating string search. A quick search found an XML infoset to RDF
script in Python.
There isn't a single tree.
Maybe you want an RDF representation of the HTML 5 DOM that would be
built?
For example, should namespaces be normalised as per HTML 5, and should
missing "tbody: elements be inserted?
What is your goal?
Liam
--
Liam Quin, https://www.delightfulcomputing.com/
Available for XML/Document/Information Architecture/XSLT/
XSL/XQuery/Web/Text Processing/A11Y training, work & consulting.
Barefoot Web-slave, antique illustrations: http://www.fromoldbooks.org
[Date Prev]
| [Thread Prev]
| [Thread Next]
| [Date Next]
--
[Date Index]
| [Thread Index]