[
Lists Home |
Date Index |
Thread Index
]
On Thu, 5 May 2005 1:23 am, Bullard, Claude L (Len) wrote:
> Yes, VSM is a form of document indexing and classification.
> It uses term frequency to create similarity metrics, typically
> a cosine for the angle between terms which normalizes the
> distance. There are a LOT of papers you can read freely
> available simply by entering "vector space model" into
> that ever-loving simple box that does such a good job for
> cases where SQL falls on its bum. No structure == no SQL.
> ....
oh ok. I'm clear on that now. I actually worked on
such technologies in the '90s for a while before all the
buzzwords got changed. It's interesting stuff.
> Again, the problem of XML is apriori classification.
Of course, xml needs a good classification system to
be truly effective in a business context.
but I guess as Ken says, it's all wrapped into the
operating system these days. (Pity though it is an
undocumented feature)
David
--
Computergrid : The ones with the most connections win.
|