[Date Prev] [Date Next] [Thread Prev] [Thread Next] Indexes: Main | Date | Thread | Author

Re: [ba-ohs-talk] Organic Growth of Knowledge


Malcolm Dean wrote:    (01)

> I applaud your efforts and direction, and I wish you tremendous success. I
> still see no great use for it. Google is the best thing since sliced bread
> for finding information, but it doesn't know about my personal ontology, or
> that of science fiction writers.    (02)

There is an "almost" lurking in there that bears some examination, because
it is in the nature of the "almost" that ontologies make sense.    (03)

Google is "almost" the best thing since sliced bread, in that
   a) It *is* the best thing since sliced bread if you the query you
       are asking reflects the moat popularly-asked question.
       (The odds are good, because it is in the nature of a popular
         question that many people ask it.)    (04)

   b) If the question you are asking is *different* from the popular
       one, Google isn't much help at all.    (05)

Let's see if I can construct a concrete example. (I've encountered quite
a few, but haven't recorded them.)    (06)

Let's consider a search on something like "stopping a printing process".
Since most Google users are computer geeks, if you want to know
how to cancel a print job, you'll come up with lots of great hits right
away.    (07)

But if you're trying to shut down the newspaper's printing press, you're
going to have a huge collection of false positives to get through before
you find something relevant.    (08)

That's where ontologies enter in -- to constrain the search to relevant
information.    (09)

Then there is the matter of "kind" of information. Somewhere, you
know there is a tutorial that showed a bit of code for determing
whether the text in a file was XML or HTML. What you want is    (010)

"code" that "determines" whether a file is "XML or HTML" that's
in a "tutorial".    (011)

But "code", "determines/decides", änd "tutorial" are ontological
concepts that describe the material. The only thing could search
on would be "XML HTML" -- and that's going to deliver one
whale of a bunch of false positives.    (012)

Of course, the task of ontological tagging is immense. But hopefully
that analysis is just a little bit persuasive as to how it could hold
sizable benefits. As we begin to define reasonable mechanisms to
allow it, the next step is to figure out how it will get done.    (013)