OASIS Mailing List ArchivesView the OASIS mailing list archive below
or browse/search using MarkMail.


Help: OASIS Mailing Lists Help | MarkMail Help

[Date Prev] | [Thread Prev] | [Thread Next] | [Date Next] -- [Date Index] | [Thread Index]
Re: [xml-dev] Represent Html Syntax Tree as RDF

On Wed, 2020-06-24 at 10:52 +0200, bryan rasmussen wrote:
> I'm looking for either a tool that when given a complete html
> document
> or a fragment thereof will generate an rdf description of the tree.
> If not a tool documents/studies showing how to do it.

I seem  to remember Henry Thompson (or Richart Tobin) at Edinburgh
wrote a tool to convert from a post-schema-validation "infoset" into
rdf. The results are spectacularly impractical - for example, as i
recall, each c h a r a c t e r becomes a separate information item and
hence is the subject of a triple describing its location, considerably
complicating string search. A quick search found an XML infoset to RDF
script in Python.

There isn't a single tree.

Maybe you want an RDF representation of the HTML 5 DOM that would be

For example, should namespaces be normalised as per HTML 5, and should
missing "tbody: elements be inserted?

What is your goal?


Liam Quin, https://www.delightfulcomputing.com/
Available for XML/Document/Information Architecture/XSLT/
XSL/XQuery/Web/Text Processing/A11Y training, work & consulting.
Barefoot Web-slave, antique illustrations:  http://www.fromoldbooks.org

[Date Prev] | [Thread Prev] | [Thread Next] | [Date Next] -- [Date Index] | [Thread Index]

News | XML in Industry | Calendar | XML Registry
Marketplace | Resources | MyXML.org | Sponsors | Privacy Statement

Copyright 1993-2007 XML.org. This site is hosted by OASIS