xml-dev - RE: [xml-dev] Beyond Ontologies

RE: [xml-dev] Beyond Ontologies
[ Lists Home | Date Index | Thread Index ]
To: 'Didier PH Martin' <martind@netfolder.com>
Subject: RE: [xml-dev] Beyond Ontologies
From: "Bullard, Claude L (Len)" <clbullar@ingr.com>
Date: Fri, 10 Oct 2003 16:36:29 -0500
Cc: xml-dev@lists.xml.org

Going long and hazy.  Apologies, it is getting late in the day.

From: Didier PH Martin [mailto:martind@netfolder.com]

>In fact, this research started when I discovered that the actual semantic
>web technologies as recommended by W3C don't help me aggregate different
>statement about a particular concept. 

Aggregate or choose?  In other words, given a term, there may be multiple 
word senses (semioticians such as Sowa use the term, lattice of theories, 
and backs that up with mathematical description, but not here).  Systems 
such as WordNet provide these senses.

>Actually, the main source of web document classification is Google. 

An assertion, but fine.

>Let's consider that Google is offering a
>point of view or a view of the world. Some others like Teoma are proposing
a
>different point of view. If we have only one source of classification then
>no problems, we go back to the middle ages and get a single view of world.
>However, if, more than one proposition is made about a concept and if these
>sources a quite different, then, it would be useful to be able to merge
them
>in an intelligent way.

Why?  They may indeed be different word senses and not resolvable to a 
single meaning.  When offered as choices, one picks one and uses it as 
given, or alters it and publishes the new theory.  Do you need to resolve 
the difference or denote the difference?  It seems if you are trying to 
resolve all of them, you are back to the unified but medieval view, or 
you have uncovered a cosmetic difference that makes no difference.  I 
can see the value in that but nothing that takes us to a new level of 
computer science.

>Just, for a moment, go to http://www.infospace.com/home/dog/index.htm  or
>http://www.mama.com both are aggregating different point of views. We
simply
>do not know how they come to the conclusion that a particular resource is
>more closely related to a concept than another. Without considering the
>social interaction or the why people link or interact in certain ways, how
>do I reconciliate different propositions. If people start to publish marked
>up information in the form of frames (ala RDF) and if people publish
>information in the form a text, how can I conciliate different propositions
>about a certain concept?

You can't.  You have to test it in use and see if the test results match 
the predictions.  What kind of test that would be is an interesting
question.

>If I am using the topic map method, I will collect a collection of
resources
>about a particular topic. If I am using RDF I will aggregate propositions
>about a topic. Whatever the method I am using (topic map or RDF) I have to
>evaluate how related a resource (a proposition or a document) is to a
>concept/topic/class. The result needs both a notation (for presentation)
and
>an algorithm (for computation). This is operational, non philosophical and
>it does not depends on the linear or non linear nature of the web.

The operation isn't unless the algorithm behaves non-linearly.  Again, I
suspect you are 
denoting but not resolving.

>On the other hand, we have propositions
>based on formal and documented ontologies. For the latter, the problem is
to
>relate a particular statement to its ontology. 

Which is the one-world view.  So far so good.

>For example if I include the
>following statement in a web page and thus provide both text and data for
>two agents: a) a human being or automatic classification engine, b) a data
>collector.

<rdf: description about="aparticularresource">
  <property>value</property>
     ......
</rdf:description>

>The first problem is to get access to the ontology if there is one attached
>to this statement. 

Yes.  As in, does a URI act as a URL to the ontology for the namespace?
That 
sort of problem.  In that case, the resource identifies its own view of what

it means.  So far, so good.

>I may have mine but as I said earlier my definition of a
>chemist may not be the same one as the author of this description.

Certainly.  Without a meta-ontology, one that relates the definition 
of chemist to the context of its use (Intention), you are stuck in 
the classic Shannonesque choice:  equally probable so fully entropic. 
It is the hanged cat, or the Tic Tac Toe game with no winner.

>Problem 1: 
>----------
>How to get access to the ontology behind an RDF description? Where is it
>located where in an RDF fragment? How do I get the link to fetch such
>definition?

Access is easy is the URI resolves to the location of the ontology.  If 
not, you search.  If you search, you are back to the Shannon dilemma of 
having sufficient or insufficient means to choose.

>Problem 2:
>----------
>If the same document being categorized by one or several search engines
>provides an RDF description about this document and if the document also
>provides one, which algorithm can I use to resolve the conflicting
>descriptions about the same resource? 

The problem is, if the descriptions conflict, you have to choose and 
it the algorithm of choosing you have to define.  A selector.  So far 
so good.  Which definition meets your operational need most closely? 
IOW, you have your own ontology and that is similar to the metaontology 
I described above.  Turtles all the way around, but one can't get past 
the requirement of ontological commitment.  One can use 'authority' 
as a selector criteria; one can use 'favorite authors', and so on, but one
still 
requires an ontology for those selections.   There is a reason that 
classical storytelling starts in media rex and storytellers learn 
to write backstories.

>Is there one, none? Len if you think
>role playing can help; frankly I don't have a clue how. Can you publish the
>link pointing to this research so that at least I can analyze if their
>solution find its home for this type of problem. Because for the moment, I
>think I have a different ontology categorizing "role playing" than yours
:-)

Role playing is just a means of engagement to discover the source of 
a conflict or to find a selector.  I like it because given some set 
of real world events, it is a remarkably accurate means to predict 
an outcome.   What if a way to create a selector is to 
role play the conflict, observe it, and create the ontology that 
controls the selection for the choices?  What if by using simulations 
of humans engaged in conflict perhaps with real humans (agent avatars 
plus human directed avatars) one can do this online in real time? 

I don't suggest that is the only way, but it is a way.

Why do topics in mail lists tend over time to not 
reflect the actual contents of the particular emails?  Think 
about the debate some months back on the meaning of 'resource' 
in web architecture.  Even something that dominating as a 
keyword has a very nebulous meaning.  That is something 
of what interests me because it demonstrates that no matter how 
complete the ontology or how high the frequency of the term, 
the humans will drift away from it and topical based tracking will get
noisy.  

On the other hand, will they as our exchanges have, orbit given topics?   
The problemis that we can know what the attractors are, but can we predict 
where any given email will land relative to one of them without a history 
and when we have one, is it linear?

len
Follow-Ups:
- RE: [xml-dev] Beyond Ontologies
  - From: "Didier PH Martin" <martind@netfolder.com>
Prev by Date: Re: [xml-dev] The Battle for Web Services
Next by Date: RE: [xml-dev] Beyond Ontologies
Previous by thread: RE: [xml-dev] Beyond Ontologies
Next by thread: RE: [xml-dev] Beyond Ontologies
Index(es):
- Date
- Thread