[Date Prev]
| [Thread Prev]
| [Thread Next]
| [Date Next]
--
[Date Index]
| [Thread Index]
Re: [xml-dev] The year is 2027, and we need to examine archived XMLdocuments from 2007 ...
- From: "Stephen D. Williams" <sdw@lig.net>
- To: Paul Kiel <paul@xmlhelpline.com>
- Date: Mon, 10 Sep 2007 10:08:23 -0400
Email seems like a special case. Mbox is, and might stay, the
definitive archival format for email. I have all of my email going back
at least since 1989 and it continues to be stored in mbox. Most email
storage on Unix/Linux is mbox and Mozilla/Thunderbird also uses mbox on
the client. The mbox format has been in use almost since RFC-822 was
published.
There's nothing wrong with an XML version. It should be easy to convert
between mbox<->XML. It should also be easy to have an XML view of mbox
or vice versa. This will be useful because there will probably be
software that can still use mbox in 2027.
Why Microsoft Outlook and Exchange use such reported-to-be fragile
formats for something so important is not clear. I have over 30GB in my
online email archive, reachable and searchable in seconds from any IMAP4
client. All mbox on ReiserFS.
It's not the text nature alone that makes it resilient. I'm all for
efficient, binary formats when done well (W3C EXI / my esXML 2.0) and
within the proper architecture.
sdw
Paul Kiel wrote:
> Very interesting thread. Before getting into the data integration space, I
> was trained as an Archivist. Indeed, the profession is has long been
> looking very closely at Xml as a long term storage format.
>
> In fact, I did some coding for the state of north Carolina to prototype the
> conversion of email messages from the governor's office into Xml for
> archival storage. So in 2027 (or 2927), all that is necessary to manage
> these files is to understand Unicode/ascii. None of today's software is
> needed.
>
> And today, archivists use Xml for metadata about primary resource materials.
> Their standard, called EAD (Encoded Archival Description) as been around
> since 1997 - and which was the spec that got me interested in Xml (or at
> least sgml until xml 1.0 came out the following year). It isn't the
> archival storage format, but it is a metadata format and shows their
> interest in Xml. It is only natural that they would see the value for long
> term storage.
>
> 0.02 from a former archivist,
> Paul Kiel
>
>
>
>
> =====================================================
> W. Paul Kiel
> xmlHelpline.com Consulting
> paul@xmlhelpline.com
> work: 919-846-0224
> cell: 919-449-8801
> website: http://www.xmlhelpline.com
> Your helpline for data integration solutions.
> =====================================================
>
>
>
> -----Original Message-----
> From: Rick Marshall [mailto:rjm@zenucom.com]
> Sent: Sunday, September 09, 2007 9:52 PM
> To: Michael Champion
> Cc: xml-dev@lists.xml.org
> Subject: Re: [xml-dev] The year is 2027, and we need to examine archived XML
> documents from 2007 ...
>
> no because the archive must be complete without external reference and
> it must not require execution of embedded code (eg javascript) - at
> least to make an old hacker like me happy that the archive is what
> someone saw/would have seen at the time it was created.
>
> Rick
>
> Michael Champion wrote:
>
>> Wouldn't (X)HTML meet these requirements?
>>
>> */Rick Marshall <rjm@zenucom.com>/* wrote:
>>
>> Hi
>>
>> So, instead of fighting over ODF/OOXML which can survive with or
>> without
>> each other, perhaps ISO/W3C/OASIS/IETF etc should focus on a
>> (possibly)
>> XML based archive format for documents and data which preserves basic
>> layout, links, binary objects (like jpg, avi, etc) as part of the
>> format. Open Office and Microsoft can conform to Government
>> requirements
>> by having a "Save Archive" option.
>>
>> All sorts of applications, not just office functions could target the
>> archive format for long term storage and searching.
>>
>> Rick
>>
>> PS This is not a request to return to SGML, although it could once
>> again
>> be the starting point.
>>
>> Richard Salz wrote:
>> >> Should entities, Xinclude statements, and links be avoided in
>> 2007 if
>> >> we wish for XML documents to be usable in 2027?
>> >>
>> >
>> > The issue behind this question is not a fundamentally new one.
>> Very few
>> > documents exist apart from any context, and the legal systems
>> know how to
>> > deal with taht.
>> >
>> > /r$
>> >
>> > --
>> > STSM, DataPower Chief Programmer
>> > Websphere DataPower SOA Appliances
>> > http://www.ibm.com/software/integration/datapower/
>> >
>> >
>> >
>>
>>
> _______________________________________________________________________
>
>> >
>> > XML-DEV is a publicly archived, unmoderated list hosted by OASIS
>> > to support XML implementation and development. To minimize
>> > spam in the archives, you must subscribe before posting.
>> >
>> > [Un]Subscribe/change address: http://www.oasis-open.org/mlmanage/
>> > Or unsubscribe: xml-dev-unsubscribe@lists.xml.org
>> > subscribe: xml-dev-subscribe@lists.xml.org
>> > List archive: http://lists.xml.org/archives/xml-dev/
>> > List Guidelines: http://www.oasis-open.org/maillists/guidelines.php
>> >
>> >
>>
>>
>>
> _______________________________________________________________________
>
>> XML-DEV is a publicly archived, unmoderated list hosted by OASIS
>> to support XML implementation and development. To minimize
>> spam in the archives, you must subscribe before posting.
>>
>> [Un]Subscribe/change address: http://www.oasis-open.org/mlmanage/
>> Or unsubscribe: xml-dev-unsubscribe@lists.xml.org
>> subscribe: xml-dev-subscribe@lists.xml.org
>> List archive: http://lists.xml.org/archives/xml-dev/
>> List Guidelines: http://www.oasis-open.org/maillists/guidelines.php
>>
>>
>>
>
> _______________________________________________________________________
>
> XML-DEV is a publicly archived, unmoderated list hosted by OASIS
> to support XML implementation and development. To minimize
> spam in the archives, you must subscribe before posting.
>
> [Un]Subscribe/change address: http://www.oasis-open.org/mlmanage/
> Or unsubscribe: xml-dev-unsubscribe@lists.xml.org
> subscribe: xml-dev-subscribe@lists.xml.org
> List archive: http://lists.xml.org/archives/xml-dev/
> List Guidelines: http://www.oasis-open.org/maillists/guidelines.php
>
>
>
>
>
> _______________________________________________________________________
>
> XML-DEV is a publicly archived, unmoderated list hosted by OASIS
> to support XML implementation and development. To minimize
> spam in the archives, you must subscribe before posting.
>
> [Un]Subscribe/change address: http://www.oasis-open.org/mlmanage/
> Or unsubscribe: xml-dev-unsubscribe@lists.xml.org
> subscribe: xml-dev-subscribe@lists.xml.org
> List archive: http://lists.xml.org/archives/xml-dev/
> List Guidelines: http://www.oasis-open.org/maillists/guidelines.php
>
--
swilliams@hpti.com http://www.hpti.com Per: sdw@lig.net http://sdw.st
Stephen D. Williams 703-371-9362C 703-995-0407Fax 20147 AIM: sdw
[Date Prev]
| [Thread Prev]
| [Thread Next]
| [Date Next]
--
[Date Index]
| [Thread Index]