Re: SUO: linguistics and SUMO
> Dear SUMO Group:
>
> Let me first introduce myself to the group. My name is
> Scott Farrar and I'm a doctoral student at the University of
> Arizona. Our research group at the UofA is interested in
> providing input to the SUMO effort regarding issues that are
> related to &%Language. (The UofA group is developing an expert
> system to perform linguistic analysis on language data.)
Welcome to the group, I got my Master's in Computer Science, with
a specific focus on Computational Linguistics. I make my living as
a Medical Informaticist, but they are surprisingly related.
>
> In my discussion with Ian, several issues surfaced:
> First there is the status of the concept &%Language. We suggest
> expanding its status somewhat:
>
> In addition to including &%Language as a subclass of &%LinguisticExpression,
> we think &%Language should include a set-theoretic notion. Many linguists
> argue that a language is a potentially infinite set of what SUMO calls
> &%LinguisticExpressions. Therefore Ian suggested that SUMO add,
>
> (disjoint Language FiniteSet)
I certainly would understand why you would want to do this, however, if
you have a finite vocabulary, and a grammar that cannot realize an infinite
number of linguistic expressions, it would be false. I would like to
caution you to be careful not to overload the SUMO term with more than one
English word-sense. I believe Jon Awbrey is correct that there are many
ways words can be used, and SUMO needs to have a way to represent them,
but I am of the opinion that modern reasoning systems don't deal with
ambiguity well at all, and we would do well to keep our definitions
and terms usable by machines and systems currently available.
>
> Also, if we construe language as a set of &%LinguisticExpressions, then
> is it correct to say (subclass Language LinguisticExpression)? I think that
> if we adopt the set-theoretic notion of &%Language, then we need some way
>
> of disallowing Language as a member of the set that is itself &%Language.
>
> Perhaps this touches on Russell's Paradox?
What are the elements of the class &%Language ?
Is &%Language the mode or method of expressing a concept in a Linguistic
fashion? (meaning 1)
Or contrari-wise, is &%Language the set of utterances, each of which is the
Linguistic formulation of a particular concept? (meaning 2)
I would hope that you aren't advocating that the term &%Language should be
used for both of these, as they are distinct, seperable, and from the SUO
perspective, do not support the same inferences and conclusions.
> Second, &%Language should be linked ontologically to the concepts
> of &%SocialInteraction and/or &%Communication. Let me express my ideas
> in English:
> "Language is a means for communication."
> "And/or "Language is used for social interaction."
>
This is close to the meaning (1) above. Again, let me caution you that
simply because English uses the same word, you should not assume it is the
same concept. As a doctoral student, I'm sure this has been drilled into
your consciousness multiple times, so I suppose my caution is actually being
expressed to remind the other folks on the mailing list who haven't thought
as deeply about this very focused area of existence as you have.
> Third, I suggest that we give some thought to specifying the various forms
> of &%Language and &%LinguisticExpressions: perhaps a 'form' slot with the
>
> possible values, 'written,' 'spoken,' or 'signed.'
>
Perhaps Adam can clarify, but the idea of 'slots' isn't a KIF concept,
and the word has been used in so many ways in knowledge representation,
that I am afraid I must anticipate Jon Awbrey and say this Third point
is so fraught with ambiguity, that I have no idea how to crystalize it.
perhaps you should look into extra predicates that make the distinctions
you are interested in? or perhaps differing subclasses ?
> Fourth, I suggest the following subclasses of &%Language:
>
> (subclass HumanLanguage Language)
> (documentation Human Language "This is the subclass of &%Language used and
> interpreted by a &%Human. Instances include all extinct and extant human
> languages")
>
> (instance Spanish HumanLanguage)
>
Spanish as the name of a particular mode of expression is an instance of
HumanLanguage. (sub-meaning 1)
Spanish as the set of utterances understandable by a Spanish language
speaker is a subclass of HumanLanguage. (sub-meaning 2)
Since sub-meaning 1 and sub-meaning 2 are not the same, which do you mean?
> (subclass ArtificialLanguage Language)
> (documentation ArtificialLanguage "This is the subclass of &%Language
> designed by humans but intended for and interpreted by a &%Machine.
> This class is disjoint from the subclass of languages that are
> constructed to be used by humans, eg. Esperanto.")
>
I would have used the term ConstructedLanguage or ArtificialLanguage for
Esperanto. Since you are bringing interpretation and intent into your
documentation, don't be surprised if Jon picks up this ball and runs with
it. I'll let him try to take it in for a touchdown.
I probably would include some idea of a FormalLanguage as a superclass
too, since there are many things in common there.
> ;;I'm less sure about the following instance examples,
> ;;any suggestions?
> (instance Java ArtificialLanguage)
> (instance SQL ArtificialLanguage)
>
>
> (subclass AnimalLanguage Language)
> (documentation AnimalLanguage "This is the subclass of &%Language used and
> interpreted
> by an &%Animal that is not &%Human.")
>
> (instance DolphinLanguage AnimalLanguage)
> ;;Obviously, the status of AnimalLanguage as "Language" is
> ;;controversial for many linguists
>
The concept that Animals have concepts in the way that humans do is
controversial in society at large, much less among linguists.
It should be able to have an ontology that takes a consistent approach to
controversial topics though. How will a system which reasons using a
controversial ontology communicate that fact to other systems which may not
support such reasoning methods, and separately, may not agree that certain
facts should be considered to be true?
> Please share your thoughts,
> Scott
>
>
> farrar@u.arizona.edu
> 1049 N. Jacobus Ave
> Tucson AZ 85705
>
>