Can computers think (now or in the future)? - The Tao of Computing

Information Technology Reference

In-Depth Information

Although many such tests might be contemplated, two general

approaches involve black-box testing and white-box testing . In

blackbox testing, we give the test subject various problems or in

puts, and we examine the results. We do not worry about how the

subject determined the answers—we just review the conclusions. A

simple example of blackbox testing might be a multiplechoice test;

questions are given, and the person being tested responds by identi

fying which answer is correct. In grading the test, we do not con

sider how the subject came up with the answer; we just mark each

answer as right or wrong.

In whitebox testing, we examine the results, but we also review

the process used for reaching the conclusion. For example, in a

longanswer test involving mathematics, a grader may check how a

solution was first set up, what approaches were used to solve vari

ous equations, what intermediate results were obtained, and how

the various pieces led to the conclusion. In whitebox testing, the fi

nal answer is only one factor among many in determining whether

the subject passes the test.

Perhaps the best known proposal for the blackbox testing of in

telligence is called the Turing Test , proposed by Alan Turing in

1950 in the early days of electrical computers. Turing proposed that

a (human) investigator be placed in one room, a computer in a sec

ond room, and a human respondent in a third room. The investiga

tor would have a teletype (similar to an old typewriter) attached to

the computer and another for communication with the human re

spondent. (See Figure 16.1.) Although the investigator would know

that one teletype communicated with a computer and the other cor

responded with a human, the investigator would not know which

was which. The investigator was charged with asking questions to

each party and analyzing the responses in whatever way seemed ap

propriate. The goal was for the investigator to identify which tele

type went to the computer and which to the human. Although the

human was to answer questions honestly, the computer was to re

spond as it wished. If the investigator was able to pick the human

only half the time (the same fraction as would be obtained by ran

dom guess), then the computer was deemed to have passed this in

telligence test.

The rationale for the Turing Test depended on the premise that

the human respondent would be intelligent in her or his behavior. If

the computer seemed to consistently behave in a similar way, then

Search WWH ::

Custom Search

Home