 |
|
Precise Understanding of
Language by Computers
|
The Kinds of Texts Considered in the PULC Project
As explained here, precise understanding of language is impossible if we
allow arbitrary texts on arbitrary topics. The PULC project therefore
concentrates on texts that can be understood precisely, where
all (or almost all) people agree about their meaning and about
the correct answers to queries about them.
The answers to the queries usually do not explicitly appear
in the text and require inference based on the information in the text.
Examples of such texts appear below.
The comprehension exams are various tasks designed to test the
computer's level of understanding of a NL sentence or text (just
as they test humans' understanding). The second list gives a few
examples of real applications that would be very valuable to
people, and which crucially rely on a high level of understanding
of text meaning.
Comprehension Exams
-
Identifying logical entailment and contradiction relations between the
meanings of two sentences.
-
A good example of such entailments is the list compiled during the
FRACAS project
-- see Deliverable 16,
pp.63-120.
-
The PASCAL RTE Challenge is NOT
a good example of such entailments because the pairs of sentences there are related by
all kinds of ways, including analogies and similarities, but mostly the problem is
that identifying the entailments requires an unrestricted amount of general world knowledge.
- Logic puzzles, such as those on
LSAT and GRE exams.
- Math puzzles, such as those on SAT
exams. (Solving such puzzles given their English description
was one of the tasks that was suggested for
the next DARPA Grand Challenge.)
- Solving Advanced Placement (AP) tests. (See e.g.
Project Halo which works on the knowledge representation of such a task,
but not yet on the language comprehension part).
Real World Applications
- NL interface to databases: expressing a query by a NL question
rather than an SQL query. (Look here for a survey.)
- NL interface to a computational law
system:
Transforming texts that describe precise regulations to
representations that the computer could reason with in order to
determine whether a given case complies with the regulations
(e.g.: which
courses a college student must take in order to fulfil the
requirements of a study program.
Example text: look here).
- Understanding manuals and other documents written in a
restricted subset of NL, like those used by the aircraft
industry (e.g. the Boeing company: see this page
and the first three papers on
this page).
- Answering questions based on information given in a text
drawn from a restricted class of texts.
Send mail to iddolev [at] cs [dot] stanford [dot] edu with
questions or comments about this web site.
Copyright © 2005-2007 by Iddo Lev
and The
PULC Project