[
Lists Home |
Date Index |
Thread Index
]
> Acrobat 5 has a plugin called XMLExtract or somethign (free from Adobe I
> think) which attempts to extract XML.
What use are plug-ins that translate (DOC, PDF, etc) to XML??? I don't
think it would be easy, as the main advantage of using XML is that you get
to define the element names to suit the context.
Of course, I understand translating (DOC, PDF, etc) to an XML-application
like DocBook, XHTML, etc...
---
Jimmy Cerra
"Challenging the status quo at the expense of my status, ego..."
> -----Original Message-----
> From: Rick Jelliffe [mailto:ricko@allette.com.au]
> Sent: Saturday, May 04, 2002 2:03 AM
> To: xml-dev@lists.xml.org
> Subject: Re: [xml-dev] PDF 2 XML
>
> From: <Kevin.Gutch@mapinfo.com>
>
> > Are there any PDF to XML utilities in existence? I assume it is
difficult
> > to re-purpose PDF to XML.
>
> Acrobat 5 has a plugin called XMLExtract or somethign (free from Adobe I
> think)
> which attempts to extract XML. If the document was created by a
> structured editor, such as FrameMaker+SGML, then there is a fighting
chance
> that the XML won't be complete crap. Does anyone have experience with it?
>
> But people who want to use PDF or XML as archiving formats should beware.
> Old PDF locks up the data, and XML can have URL dependencies. So for
> archiving, treat the XML as sub-SGML, not as super-HTML: make sure you
> archive all the resources for the document and change system IDs and links
> to refer to the local version, as snapshots. (I guess you could also use
a
> catalog system to override the system IDs, but you don't know whether
> the software in the future will cope readily with catalogs, so I don't
know
> that the extra level of indirection is worthwhile.)
>
> Cheers
> Rick
>
> -----------------------------------------------------------------
> The xml-dev list is sponsored by XML.org <http://www.xml.org>, an
> initiative of OASIS <http://www.oasis-open.org>
>
> The list archives are at http://lists.xml.org/archives/xml-dev/
>
> To subscribe or unsubscribe from this list use the subscription
> manager: <http://lists.xml.org/ob/adm.pl>
|