WO2013162652A2 - Reverse multiple-choice method for knowledge engineering and expert system implementation - Google Patents

Reverse multiple-choice method for knowledge engineering and expert system implementation Download PDF

Info

Publication number
WO2013162652A2
WO2013162652A2 PCT/US2013/000113 US2013000113W WO2013162652A2 WO 2013162652 A2 WO2013162652 A2 WO 2013162652A2 US 2013000113 W US2013000113 W US 2013000113W WO 2013162652 A2 WO2013162652 A2 WO 2013162652A2
Authority
WO
WIPO (PCT)
Prior art keywords
answer
query
choice
question
machine
Prior art date
Application number
PCT/US2013/000113
Other languages
French (fr)
Other versions
WO2013162652A3 (en
Inventor
Indu Mati ANAND
Pranav ANAND
Original Assignee
Anand Indu Mati
Anand Pranav
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Priority claimed from US13/457,440 external-priority patent/US20120288844A1/en
Application filed by Anand Indu Mati, Anand Pranav filed Critical Anand Indu Mati
Priority to CN201380033263.6A priority Critical patent/CN104620301A/en
Publication of WO2013162652A2 publication Critical patent/WO2013162652A2/en
Publication of WO2013162652A3 publication Critical patent/WO2013162652A3/en

Links

Classifications

    • GPHYSICS
    • G09EDUCATION; CRYPTOGRAPHY; DISPLAY; ADVERTISING; SEALS
    • G09BEDUCATIONAL OR DEMONSTRATION APPLIANCES; APPLIANCES FOR TEACHING, OR COMMUNICATING WITH, THE BLIND, DEAF OR MUTE; MODELS; PLANETARIA; GLOBES; MAPS; DIAGRAMS
    • G09B7/00Electrically-operated teaching apparatus or devices working with questions and answers
    • G09B7/02Electrically-operated teaching apparatus or devices working with questions and answers of the type wherein the student is expected to construct an answer to the question which is presented or wherein the machine gives an answer to the question presented by a student

Definitions

  • the present invention relates generally to machine learning and the development and use of query-based expert systems. More specifically it extends a multiple-choice based method of generating educational and testing materials to knowledge acquisition and engineering.
  • a multiple-choice question comprises three identifiable sections: a section containing a set of facts to be presumed (for instance, a narrative, a short story, a poem, an expression, an equation, or a geometric figure), an interrogative sentence (sometimes known as the "call of the question"), and a set of answer choices.
  • a multiple-choice question can also be thought of consisting of two parts—a first part, comprising a set of facts to be presumed and an interrogative sentence, and a second part, comprising a set of answer choices.
  • the first part may be referenced as a "query" herein.
  • the second part typically contains between three and five answer choices, one of which may be marked as the correct
  • RMCM Reverse Multiple-Choice Method
  • a fact object is a semantic entity which is often, though not universally, expressed in the syntactic construction of the query and it is meaningful in the context of the query and one or more of the answer choices provided.
  • a fact object may be a fact object in the context of one answer choice for a question but not for another answer choice, where that phrase is not relevant to the context depicted in the latter answer choice.
  • the answer choices in a multiple-choice question provide the context for interpretation of the query, which is exploited by RMCM.
  • RMCM Unlike traditional multiple-choice questions, the wrong or incorrect answer choices in RMCM questions are just as valuable teaching tools as the correct answers.
  • RMCM By teaching a student to deconstruct and reassemble a question, as it were, RMCM could train him or her to critically examine a given set of facts, and to recognize their relationship to the key words, phrases, concepts or facts, in order to achieve in-depth understanding of particular subject matter.
  • Machines that can "learn” must demonstrate capability to understand the meaning of natural (human) language to an extent. This could require, for example, that given a form of text (such as, a document, message, narrative, or script) the machine be able to parse the text and generate an instantiated follow-up script that would be regarded as conveying the meaning of the original text at an acceptable level.
  • a form of text such as, a document, message, narrative, or script
  • Other ways to demonstrate the understanding might be similar to the following: recall, not as a regurgitated barrage of the inputs, but as an organized presentation of the text content; high performance on a test;
  • Such machine-based expert systems or “intelligent” systems have the following identifiable components: a knowledge base which represents the compilation of known facts gleaned from several sources (generally external to the system to include “real world” knowledge), possibly including a human expert; an inference engine which includes the rules for operating on the facts either in the knowledge base or new to the system; a database on which the knowledge base and inference engine may operate; and input and output units that allow the system to communicate with a user, in particular to communicate appropriate conclusions in new situations. There may also be interpreter components that explain the reasoning behind the inferences and/or carry out actions based on the conclusions.
  • Knowledge based methods currently include: clustering of text based items according to "similarity” by compiling frequency vectors of index terms; Latent Semantic Indexing which takes advantage of "closeness” of words in a comparison of texts; adaptive techniques based on relevance feedback; preprocessing of texts by category; syntactic categorization; semantic analysis using "fuzzy” logic; or some combination of similar techniques. The more successful of these techniques involve a variation of two-pass "prediction and correction" routine.
  • RMCM Reverse Multiple-Choice techniques are compatible with these, and may additionally be used in conjunction with one or more of these and other similar approaches for developing intelligent machines.
  • RMCM puts the onus of the learning on the expert to generate well- crafted queries, whether the expert in the learning situation is the human or the machine.
  • the underlying techniques may include statistical or probabilistic measurement and analysis, either based on fuzzy sets and logic or on "crisp" sets and logic.
  • RMCM Reverse Multiple-Choice Method
  • One possible application of the extended RMCM methods is in assisted generation of suitable RMCM queries for teaching or testing.
  • Such assistance of the computer may go beyond the simple retrieve and store functions involving databases; it is envisioned as the capacity for analytical dialogue that a "trained" machine with components to "reason” can carry on with a human, combine it with its rich database, and provide new queries for question writing. Indeed, the roles of a human trainer and machine trainee can eventually reverse as the machine acquires sophistication.
  • a knowledge base which represents the compilation of known facts gleaned from several sources (generally external to the system to include "real world" knowledge), possibly including a human expert; an inference engine which includes the rules for operating on the facts either in the knowledge base or new to the system; a database on which the knowledge base and inference engine may operate, in particular to draw and communicate appropriate conclusions in new situations; and, input and output units that allow the system to communicate with a user.
  • interpreter components that explain the reasoning behind the inferences and/or carry out actions based on the conclusions.
  • RMCM can contribute to most of the functions of machine based expert systems.
  • RMCM Reverse Multiple-Choice Method
  • U.S. patent application Ser. No. 09/951,132 is a method of developing educational and testing products or materials, by utilizing multiple-choice questions.
  • RMCM represents a reversal of perspective from traditional multiple-choice approach. Starting with a given set of facts to be presumed, the method may require one to examine each answer choice, and inquire how the set of facts to be presumed could be modified to make that answer choice the "correct" or "best” answer choice. If a given answer choice is already correct, no modification is needed. If a given answer choice is not correct, various modifications may be employed, including changing some of the words or phrases, or other symbols or objects within the set of facts to be presumed.
  • RMCM provides a method for using the multiple-choice format in focusing the students on the art of closely reading the fact pattern, critically evaluating the answer choices, and learning to recognize the critical pieces of information in the fact pattern on which the answer choices turn.
  • tests based on Reverse Multiple-Choice Method may be able to measure the extent to which the examinees have learnt these critical skills.
  • the measure of the test-takers' ability to recognize critical pieces of information in the given fact pattern vis-a-vis the possible answer choices is generally the central goal of an educational testing regime as reliable predictor of their knowledge and understanding.
  • RMCM for educational testing which comprises the steps of: (a) providing one or more multiple-choice questions, each question comprising a query and a plurality of answer choices, wherein said plurality of answer choices comprises one correct answer and one or more incorrect answers; (b) prompting the examinee to select an answer choice as the correct answer; (c) maintaining a record of the examinee's selection in step (b); (d) assigning credit for the examinee's selection of an answer in step (b) according to a preset formula; (e) prompting the examinee to select at least one of said plurality of answer choices not selected in step (b); (f) prompting the examinee to provide a follow-up query to which the answer choice selected in step (e) is a correct answer; (g) maintaining a record of the examinee's answer in step (e); (h) assigning credit for the examinee's selection of an answer in step (e) according to another preset formula; (i) providing one or more multiple-choice questions, each question comprising a query and
  • This very same method may be adapted to train a machine by suitably framing the questions and judiciously compiling the answers, both from the human trainer and from the machine.
  • step (b) may be omitted, and any of the steps (d), (h), (i) and (j) related to assigning credit may be suitably modified.
  • One possible modification may be to associate weights in accordance with statistical or probabilistic models corresponding to the given answer choices or to the facts on which the answers turn (called fact objects herein).
  • steps of the same core method may be adapted in order for the machine to assist a human, for instance, by providing decision support.
  • the "trained" machine may engage a human operator or actor in a dialogue to understand, analyze and evaluate a newly presented situation, and then generate a short list of appropriate conclusions.
  • RMCM With its step of modifying a query as needed, RMCM provides a concrete mechanism for using and correcting errors and misconceptions in communication; that mechanism can be used to similarly eliminate the errors in communication between humans and machines when suitably adapted.
  • Training or machine learning is an essential requirement for a functioning expert system, whether the machine assists a human actor or operates autonomously.
  • bottlenecks generally remain in knowledge base acquisition.
  • RMCM methodology can be used to ensure that machine learning or training is reliable: as shown in Detailed Description, the method can contribute to the development of robust knowledge base and sound reasoning, and improve the art of expert system development.
  • a multiple-choice question comprises three identifiable sections: a section containing a set of facts to be presumed (for instance, a narrative, a short story, a poem, an expression, an equation, or a geometric figure), an interrogative sentence (sometimes known as the "call of the question"), and a set of answer choices.
  • the number of answer choices typically range between two (as in true/false) to five, and there usually is only one correct (or best) answer out of the multiple provided answers.
  • a well-constructed multiple-choice question is an incisive evaluation tool that requires thought and analysis for the selection of the correct answer out of the various "confounding" answer choices. To the extent that an answer choice can be rejected on cursory inspection, it represents a “lost” opportunity for discrimination and evaluation. On the other hand, if due to a "small” difference in the narrative or the call of the question, correct and incorrect answers trade places, then the "evaluative" value of the question is higher.
  • the questioner may delve deeper into a topic than in a set of k unrelated questions, since the k answer choices all relate to the same narrative. This allows for the subject matter of the question to be potentially viewed and reviewed from k perspectives. That is akin to locating a data point with k degrees of freedom.
  • the first part comprises a set of facts to be presumed and an interrogative sentence.
  • the first part may also be termed a "query.”
  • First_Part and “query” shall be synonymous, unless otherwise noted.
  • a second part of a question comprises a set of answer choices.
  • second part and “Second_Part” shall be synonymous, unless otherwise noted.
  • a correct answer is the answer choice that will or would be graded on a test as the "correct” or “best” answer choice to a given query.
  • a first part of a question typically comprises one or more fact objects (or
  • a fact object is defined to include any object, word, element, number, operator symbol, phrase, or a group of words, elements, numbers, operators, symbols, or other objects, or any other type of entity capable of holding information.
  • a fact object typically denotes a fact, datum or piece of information in the first part of a multiple-choice question that may be considered when interpreting the answer choices or choosing an answer choice to the question. For instance, in the example discussed above of the box having sides of 1 foot, 2 feet and 3 feet, the length of each side may be considered a fact object.
  • each piece of information presented (which can be thought of as a word or group of words) may be considered a fact object.
  • the interrogative sentence may ask the answerer to consider answer choices relating to the work or its creator to select the "correct" answer choice.
  • a fact object is a phrase or a group of words, elements, numbers, operators, symbols, or other objects or entities
  • the whole of such phrase or group- but no part less than the whole— of such phrase or group denotes the particular fact, datum or piece of information contained in or conveyed by the fact object.
  • the significance of a fact object may derive partly or primarily from the context of (including its placement within) the query and the set of answer choices of a particular multiple-choice question.
  • every answer choice will utilize this contextual connection between the critical information content of the fact object or fact objects in the given query and the answer choice.
  • an embodiment with minor variations can be used either for purposes of study or review, or for testing. For example, a student may be shown the fact objects corresponding to different answer choices to a question in side by side comparisons, whereas a test-taker might be called upon to match the fact objects (or their values) from a list to different answer choices in comparative displays.
  • Embodiments of RMCM may be suitably modified in case a student or teacher is a machine.
  • H creates or displays a previously created multiple-choice question Q based on case narrative - Answer Choices (ACs), Fact Objects (FOs), Fact Values (FVs)
  • M saves corrected query as new answer choice alternative ii. If no, M saves corrected query as new related case
  • M requests RMCM answer "correction" from H, records FO(s) for
  • M compares query created in step (ii) with "corrected" query c. M saves corrected query as related new case, if at least one FO match d. Else, M saves corrected query as unrelated new case.
  • (A) System may compute and assign probabilities of case scenarios via FV and FO probabilities
  • (B) M may be refined by assigning and incorporating probabilities of case scenarios
  • Such sessions used sequentially will generate the Knowledge Base, made up of clusters or families of questions, such that questions within a cluster may be related.
  • the RMCM based method outlined above includes graph-theoretic components and does not entirely depend on the tree structure. This approach ensures that all related conditionals are available for consideration and that the logic tree is not pruned prematurely losing a branch of analysis that might have proved useful.
  • RMCM may generate domain-aware query clusters that are generally not available by other methods typically used in the machine learning environments. In many machine learning applications one is faced with a high volume of information necessitating the pruning of some branches to analyze the most promising line of reasoning. If it becomes apparent that the line of reasoning being followed is not the most promising one then it becomes necessary to follow the backward chain of reasoning. However, this method of forward and backward can be inefficient if the error in reasoning is uncovered far down the stream. [0047] Other clustering methods have been proposed in the literature, but RMCM has the potential as a flexible, general purpose method of clustering that minimizes the need for backward chaining. The reason for this is that RMCM does not depend on the logical tree structure, but on clusters of queries whose mutual dependency may be related and analyzed via graph-theoretic considerations.
  • the method of this invention can make the difference between right or wrong diagnosis.
  • An "expert" machine may eventually also "write” queries for possible acceptance by a human examiner.
  • the methodology for the machine to write such queries will often rely on the clustering mechanisms built into the RMCM system.

Abstract

A system and method of communication based on the Reverse Multiple-Choice Method of teaching and testing is disclosed where at least one communicant is a machine. The method is applicable for training a machine for knowledge engineering and artificial intelligence oriented applications, as well as for a trained machine to assist a human being engaged in the activity of training, teaching or testing.

Description

REVERSE MULTIPLE-CHOICE METHOD FOR KNOWLEDGE ENGINEERING
AND EXPERT SYSTEM IMPLEMENTATION
CROSS-REFERENCE TO RELATED APPLICATIONS
[0001 ] This application is filed under Patent Cooperation Treaty from U.S. patent application Ser. No. 13/457,440, filed on April 26, 2012 as a continuation-in-part of U.S. patent application Ser. No. 1 1/350,266, filed Feb. 7, 2006, now U.S. Patent No. 8, 195,085, which claims the benefit of U.S. patent application Ser. No. 09/951 , 132, filed Sep. 12, 2001, now U.S. Patent No. 7,033, 182, which claims the benefit of U.S. Provisional Patent Application Ser. No. 60/232, 1 10, filed Sep. 1 1, 2000. The subject matter of this application is also related to PCT Application No. PCT/US2007/003127 filed on February 6, 2007 which gave rise to patent grants in Australia and Japan.
[0002] The entire contents of application Ser. No. 1 1/350,266, application Ser. No.
09/9 1, 132 and Provisional Application Ser. No. 60/232, 1 10 are hereby incorporated by reference.
FIELD OF THE INVENTION
[0003] The present invention relates generally to machine learning and the development and use of query-based expert systems. More specifically it extends a multiple-choice based method of generating educational and testing materials to knowledge acquisition and engineering.
BACKGROUND OF THE INVENTION
[0004] Multiple-choice questions are a common way of testing students in a variety of subject areas, particularly in examinations taken by large numbers of students. In its most commonly used form, a multiple-choice question comprises three identifiable sections: a section containing a set of facts to be presumed (for instance, a narrative, a short story, a poem, an expression, an equation, or a geometric figure), an interrogative sentence (sometimes known as the "call of the question"), and a set of answer choices. A multiple-choice question can also be thought of consisting of two parts—a first part, comprising a set of facts to be presumed and an interrogative sentence, and a second part, comprising a set of answer choices. The first part may be referenced as a "query" herein. The second part, typically contains between three and five answer choices, one of which may be marked as the correct
l answer, although the number of answer choices may vary below three or above five under appropriate circumstances.
[0005] Reverse Multiple-Choice Method ("RMCM") was introduced in U.S. patent application Ser. No. 09/951,132, now U.S. Patent number 7,033,182. A key part of a typical RMCM embodiment requires a student to: (i) methodically examine each answer choice in relation to the question; (ii) identify the key data or information provided in the question that make that answer choice correct or incorrect; and then, (iii) inquire how the given set of facts could be modified to make that particular answer choice the "correct" or "best" answer. The last step, a crucial and distinguishing feature of RMCM, depends on the identification of the important facts within the query on which the correctness or incorrectness of the answer choice rests. In Reverse MultipJe-Choice environment these important facts are termed "Fact Objects." A fact object is a semantic entity which is often, though not universally, expressed in the syntactic construction of the query and it is meaningful in the context of the query and one or more of the answer choices provided. Thus, for instance, it is possible for the same word to be used in a narrative twice impacting the meaning the first time but not on the second occasion; that word in that scenario would be a fact object (or a segment of a fact object) for an answer choice the first time but not the second time. Similarly, a phrase may be a fact object in the context of one answer choice for a question but not for another answer choice, where that phrase is not relevant to the context depicted in the latter answer choice. Typically, the answer choices in a multiple-choice question provide the context for interpretation of the query, which is exploited by RMCM.
[0006] Unlike traditional multiple-choice questions, the wrong or incorrect answer choices in RMCM questions are just as valuable teaching tools as the correct answers. By teaching a student to deconstruct and reassemble a question, as it were, RMCM could train him or her to critically examine a given set of facts, and to recognize their relationship to the key words, phrases, concepts or facts, in order to achieve in-depth understanding of particular subject matter.
[0007] The advantages of RMCM are not limited to interactions between human
teachers/examiners and students/examinees. The attributes of RMCM may be used effectively when the "student" is a machine, learning to imbibe the "knowledge" of a human expert; or conversely, in a situation where a machine is the expert trainer, for example of a human trainee/student; or when a machine aids a human teacher/examiner as a competent assistant to help fashion the most effective educational materials. Most of these situations require computers to go beyond their traditional, procedural roles of providing support for data management and number-crunching and be available to augment or replace human intelligence with "artificial" intelligence - hence these situations call for "machine learning" in some form or another.
[0008] Machines that can "learn" must demonstrate capability to understand the meaning of natural (human) language to an extent. This could require, for example, that given a form of text (such as, a document, message, narrative, or script) the machine be able to parse the text and generate an instantiated follow-up script that would be regarded as conveying the meaning of the original text at an acceptable level. Other ways to demonstrate the understanding might be similar to the following: recall, not as a regurgitated barrage of the inputs, but as an organized presentation of the text content; high performance on a test;
augment its lexical knowledge; consult a resource such as a dictionary to interpret a new script; adaptively respond in a coherent manner to new script.
[0009] Such machine-based expert systems or "intelligent" systems have the following identifiable components: a knowledge base which represents the compilation of known facts gleaned from several sources (generally external to the system to include "real world" knowledge), possibly including a human expert; an inference engine which includes the rules for operating on the facts either in the knowledge base or new to the system; a database on which the knowledge base and inference engine may operate; and input and output units that allow the system to communicate with a user, in particular to communicate appropriate conclusions in new situations. There may also be interpreter components that explain the reasoning behind the inferences and/or carry out actions based on the conclusions.
[0010] Knowledge based methods currently include: clustering of text based items according to "similarity" by compiling frequency vectors of index terms; Latent Semantic Indexing which takes advantage of "closeness" of words in a comparison of texts; adaptive techniques based on relevance feedback; preprocessing of texts by category; syntactic categorization; semantic analysis using "fuzzy" logic; or some combination of similar techniques. The more successful of these techniques involve a variation of two-pass "prediction and correction" routine.
[001 1 ] Reverse Multiple-Choice techniques are compatible with these, and may additionally be used in conjunction with one or more of these and other similar approaches for developing intelligent machines. RMCM puts the onus of the learning on the expert to generate well- crafted queries, whether the expert in the learning situation is the human or the machine.
[0012] As used herein, the terms knowledge systems and expert systems are used
interchangeably, as are the terms machine learning, knowledge engineering, natural language processing, artificial intelligence etc. The underlying techniques may include statistical or probabilistic measurement and analysis, either based on fuzzy sets and logic or on "crisp" sets and logic.
SUMMARY OF THE INVENTION
[0013] With suitable adaptation, the "Reverse Multiple-Choice Method" (RMCM) methodology for generation of educational and testing materials may be extended beyond academic teaching and testing. In particular, the key concepts of RMCM, viz., that of "fact object" and that of "changing a query ("query" part of the multiple-choice question) corresponding to an incorrect answer so as to make it a correct answer to the "changed query" of (changed) multiple-choice question can be advantageously used in artificial intelligence oriented applications.
[0014] One possible application of the extended RMCM methods is in assisted generation of suitable RMCM queries for teaching or testing. Such assistance of the computer may go beyond the simple retrieve and store functions involving databases; it is envisioned as the capacity for analytical dialogue that a "trained" machine with components to "reason" can carry on with a human, combine it with its rich database, and provide new queries for question writing. Indeed, the roles of a human trainer and machine trainee can eventually reverse as the machine acquires sophistication.
[0015] The field of expert systems or knowledge engineering is identified in the literature by several names - its key characteristics being the existence within the system of the following identifiable components: a knowledge base which represents the compilation of known facts gleaned from several sources (generally external to the system to include "real world" knowledge), possibly including a human expert; an inference engine which includes the rules for operating on the facts either in the knowledge base or new to the system; a database on which the knowledge base and inference engine may operate, in particular to draw and communicate appropriate conclusions in new situations; and, input and output units that allow the system to communicate with a user. There may also be interpreter components that explain the reasoning behind the inferences and/or carry out actions based on the conclusions. Most of these functions of the system depend on the machine's "learning" the facts and the rules from a "dialogue" between the machine and a human expert. In one form or another, this relies on presenting or finding answers to queries presented. As described below herein, RMCM can contribute to most of the functions of machine based expert systems.
[0016] "Reverse Multiple-Choice Method" (RMCM), introduced and defined in U.S. patent application Ser. No. 09/951,132, is a method of developing educational and testing products or materials, by utilizing multiple-choice questions. RMCM represents a reversal of perspective from traditional multiple-choice approach. Starting with a given set of facts to be presumed, the method may require one to examine each answer choice, and inquire how the set of facts to be presumed could be modified to make that answer choice the "correct" or "best" answer choice. If a given answer choice is already correct, no modification is needed. If a given answer choice is not correct, various modifications may be employed, including changing some of the words or phrases, or other symbols or objects within the set of facts to be presumed.
[0017] In the context of educational dialogue, whereas the currently available educational products based on the multiple-choice format regard the set of presumed facts to be "fixed," and "variably" examine the answer choices to pick the correct one, RMCM temporarily "fixes" an answer choice as the correct answer and "varies" the facts of the question to accommodate that assumption.
[0018] Through a process of leading the student to deconstruct and reassemble a question, RMCM provides a method for using the multiple-choice format in focusing the students on the art of closely reading the fact pattern, critically evaluating the answer choices, and learning to recognize the critical pieces of information in the fact pattern on which the answer choices turn. The skills learnt through this process, with or without the assistance of machine or human tutor, underlie the strength of RMCM as a self-study tool. Also, tests based on Reverse Multiple-Choice Method may be able to measure the extent to which the examinees have learnt these critical skills.
[0019] The measure of the test-takers' ability to recognize critical pieces of information in the given fact pattern vis-a-vis the possible answer choices is generally the central goal of an educational testing regime as reliable predictor of their knowledge and understanding.
[0020] The same paradigms for learning and applying the knowledge to new situations is the goal as well for machines which are either expected to function without human supervision or to assist the human operators in tasks not amenable to preset, procedural computations.
[0021] To elaborate, consider, for example, one illustrative embodiment of RMCM for educational testing which comprises the steps of: (a) providing one or more multiple-choice questions, each question comprising a query and a plurality of answer choices, wherein said plurality of answer choices comprises one correct answer and one or more incorrect answers; (b) prompting the examinee to select an answer choice as the correct answer; (c) maintaining a record of the examinee's selection in step (b); (d) assigning credit for the examinee's selection of an answer in step (b) according to a preset formula; (e) prompting the examinee to select at least one of said plurality of answer choices not selected in step (b); (f) prompting the examinee to provide a follow-up query to which the answer choice selected in step (e) is a correct answer; (g) maintaining a record of the examinee's answer in step (e); (h) assigning credit for the examinee's selection of an answer in step (e) according to another preset formula; (i) combining the credit generated in steps (d) and (h) into a score for the question according to a third preset formula; (j) evaluating the score, e.g., against a preset standard.
[0022] This very same method may be adapted to train a machine by suitably framing the questions and judiciously compiling the answers, both from the human trainer and from the machine. In this adaptation step (b) may be omitted, and any of the steps (d), (h), (i) and (j) related to assigning credit may be suitably modified. One possible modification may be to associate weights in accordance with statistical or probabilistic models corresponding to the given answer choices or to the facts on which the answers turn (called fact objects herein).
[0023] Furthermore, steps of the same core method may be adapted in order for the machine to assist a human, for instance, by providing decision support. For example, the "trained" machine may engage a human operator or actor in a dialogue to understand, analyze and evaluate a newly presented situation, and then generate a short list of appropriate conclusions.
[0024] With its step of modifying a query as needed, RMCM provides a concrete mechanism for using and correcting errors and misconceptions in communication; that mechanism can be used to similarly eliminate the errors in communication between humans and machines when suitably adapted.
[0025] Training or machine learning is an essential requirement for a functioning expert system, whether the machine assists a human actor or operates autonomously. However, bottlenecks generally remain in knowledge base acquisition. RMCM methodology can be used to ensure that machine learning or training is reliable: as shown in Detailed Description, the method can contribute to the development of robust knowledge base and sound reasoning, and improve the art of expert system development.
DETAILED DESCRIPTION
[0026] Multiple-choice format for questioning and answering has been used round the globe for academic testing for many decades. In spite of the drawbacks of the format, well- recognized in the educational testing industry, multiple-choice questions remain a common way of testing students in a variety of subject areas, particularly in examinations taken by large numbers of students.
[0027] In its most commonly used form, a multiple-choice question comprises three identifiable sections: a section containing a set of facts to be presumed (for instance, a narrative, a short story, a poem, an expression, an equation, or a geometric figure), an interrogative sentence (sometimes known as the "call of the question"), and a set of answer choices. The number of answer choices typically range between two (as in true/false) to five, and there usually is only one correct (or best) answer out of the multiple provided answers.
[0028] Although the advantages such as the uniformity of administering and the ease of grading multiple-choice tests are universally recognized, there are other not-frequently-cited, advantages of this user-friendly format. Those inherent and exceptional advantages make this format suitable as well for several non-academic testing applications, such as, applications involving interaction with computer.
[0029] The application environments where "Q & A" are fundamental are not limited to the testing of students. The fields such as machine learning, expert systems, knowledge engineering, artificial intelligence and decision support systems etc. all involve a discourse between the "Transferor" and the "Transferee" of knowledge; this transfer of information necessitates a dialogue which can be carried out with clarity in the form of questioning and answering. Furthermore, the roles of questioner and answerer in these applications may alternate between the transferor and the transferee of information. Multiple-choice format brings a unique set of advantages to these non-academic situations.
[0030] A well-constructed multiple-choice question is an incisive evaluation tool that requires thought and analysis for the selection of the correct answer out of the various "confounding" answer choices. To the extent that an answer choice can be rejected on cursory inspection, it represents a "lost" opportunity for discrimination and evaluation. On the other hand, if due to a "small" difference in the narrative or the call of the question, correct and incorrect answers trade places, then the "evaluative" value of the question is higher.
[0031] In a well-constructed multiple-choice question with k answer choices, the questioner may delve deeper into a topic than in a set of k unrelated questions, since the k answer choices all relate to the same narrative. This allows for the subject matter of the question to be potentially viewed and reviewed from k perspectives. That is akin to locating a data point with k degrees of freedom.
[0032] Traditional multiple-choice method of questioning, asking only for the correct answer, discards most of that dimensional freedom. Reverse Multiple-Choice Method, however, exploits the power of the incorrect answers and all k degrees of freedom. Although certain traditional multiple-choice questions may allow for a comparable level of evaluative depth by suitable follow-up questions, RMCM does more than provide for a native structure to enable appropriate clustering of such questions; RMCM's flexible format admits short, structured write-in answers by students, often in the form of modified queries. This distinction of RMCM from standard multiple-choice questions can prove more important in a dialogue with a machine than in examining a human student.
[0033] By asking the answerer to construct new queries corresponding to the incorrect answers, it is possible to generate Q & A families or clusters that can impart "real world" knowledge, similar in a way to the manner in which a baby acquires real world knowledge by repeatedly asking questions.
[0034] On the other hand, in a reverse multiple-choice format the many degrees of freedom are resolved by requiring the answer choices themselves to limit the lexical universe of the question in a way: the reason an answer choice is either correct or incorrect must depend on an identifiable part of the question, whether it is an explicit segment of the narrative or an implicit requirement inherent in the query string.
[0035] This property of multiple-choice questions makes possible the identification of the "key" concepts therein. Identification of the key concepts in a query string is precisely the objective of machine learning. Reverse Multiple-Choice Method as used in the present invention provides a method of quite naturally arriving at the key concepts by seeking "Fact Objects" (FO's) corresponding to each answer choice.
[0036] For further discussion, it may be convenient to divide a multiple-choice question into two parts. The first part comprises a set of facts to be presumed and an interrogative sentence. The first part may also be termed a "query." As used herein, the terms "first part,"
"First_Part" and "query" shall be synonymous, unless otherwise noted. A second part of a question comprises a set of answer choices. As used herein, "second part" and "Second_Part" shall be synonymous, unless otherwise noted. (In addition to the definition above, it may be convenient to think of the first part of a question as comprising the portion of a question not included in its second part.) A correct answer (or "Correct Answer") is the answer choice that will or would be graded on a test as the "correct" or "best" answer choice to a given query.
[0037] A first part of a question typically comprises one or more fact objects (or
"Fact_Objects"). A fact object is defined to include any object, word, element, number, operator symbol, phrase, or a group of words, elements, numbers, operators, symbols, or other objects, or any other type of entity capable of holding information. A fact object typically denotes a fact, datum or piece of information in the first part of a multiple-choice question that may be considered when interpreting the answer choices or choosing an answer choice to the question. For instance, in the example discussed above of the box having sides of 1 foot, 2 feet and 3 feet, the length of each side may be considered a fact object. In a short story, each piece of information presented (which can be thought of as a word or group of words) may be considered a fact object. In a question on an art history exam, there may be a single fact object—the piece of artwork presented— and the interrogative sentence may ask the answerer to consider answer choices relating to the work or its creator to select the "correct" answer choice.
[0038] In one embodiment, where a fact object is a phrase or a group of words, elements, numbers, operators, symbols, or other objects or entities, the whole of such phrase or group- but no part less than the whole— of such phrase or group, denotes the particular fact, datum or piece of information contained in or conveyed by the fact object. The significance of a fact object may derive partly or primarily from the context of (including its placement within) the query and the set of answer choices of a particular multiple-choice question.
[0039] Ideally, as mentioned above, in a carefully constructed multiple-choice question, every answer choice will utilize this contextual connection between the critical information content of the fact object or fact objects in the given query and the answer choice. Generally, an embodiment with minor variations can be used either for purposes of study or review, or for testing. For example, a student may be shown the fact objects corresponding to different answer choices to a question in side by side comparisons, whereas a test-taker might be called upon to match the fact objects (or their values) from a list to different answer choices in comparative displays. Embodiments of RMCM may be suitably modified in case a student or teacher is a machine. [0040] A method of training a machine by a human expert
The following is an illustrative session of RMCM based training dialogue between Human Expert, H, and Machine expert-under-construction, M, wherein a Fact Value (FV) is an instantiation of a Fact Object (FO), case may define or be equivalent to a question in this context and the command "scan" below may include interpret or otherwise process the question as a machine; a dialogue below is expected to loop iteratively as many times as decided by H:
(i) User U defines a case. (U may be = H)
(ii) H creates or displays a previously created multiple-choice question Q based on case narrative - Answer Choices (ACs), Fact Objects (FOs), Fact Values (FVs)
(iii) Q is stored in database if a new question
(iv) U asks M to "scan" question
(v) U asks M for an answer
(vi) M encounters a FO, recalls Q
(vii) M matches FOs sequentially
(viii) IF a match, M matches FVs sequentially
a. IF an FV match, M produces for U an answer choice
b. ELSE M requests RMCM answer "correction" from H, records FV(s) for correction
c. M compares query created in step (ii) with "corrected" query d. M asks H if corrected query should be saved as new answer choice
alternative
i. If yes, M saves corrected query as new answer choice alternative ii. If no, M saves corrected query as new related case
(ix) IF not an FO match,
a. M requests RMCM answer "correction" from H, records FO(s) for
correction
b. M compares query created in step (ii) with "corrected" query c. M saves corrected query as related new case, if at least one FO match d. Else, M saves corrected query as unrelated new case.
Additionally, one may note that
(A) System may compute and assign probabilities of case scenarios via FV and FO probabilities
(B) M may be refined by assigning and incorporating probabilities of case scenarios
(C) The roles of H and M are interchangeable;
(D) Either H or M may also be the user U
(E) For each question in the database there are: Query +{AC, FOs, FVs } where AC stands for answer choice, FO for fact object and FV for fact value; each FO has a truth value (T or F) corresponding to each AC; each FO that has the truth value T (relevant) for the AC has an associated FV; each FO with truth value F has no FV associated with it; here {AC, FOs, FVs} is the set of an answer choice and its related fact objects and corresponding fact values, and + denotes direct sum of the query and such sets for all answer choices
(F) When M gets an incorrect AC, M records that fact in database as well: Query + AC~, where AC- is the incorrect answer choice
(G) When M produces an incorrect AC as answer, rejected by H, M records that fact as well as Query + AC-
(H) When H produces an incorrect AC as answer, rejected by M, M records that fact as well as Query + AC-
(I) When new Answer is presented, M matches it against all ACs and against all AC-s to present or generate an answer
(J) When new query presented, M matches query against all stored queries to present or generate an answer
[0041] Other variations of this method of training based on the RMCM methodology are contemplated to be within the scope of this invention. For example, the training of the machine may be conducted by a group of experts, including humans and other machines with appropriate expertise.
[0042] Such sessions used sequentially will generate the Knowledge Base, made up of clusters or families of questions, such that questions within a cluster may be related.
[0043] The organization of the knowledge base as related clusters is useful when a new set of conditionals are presented to the machine.
[0044] The algorithms of artificial intelligence/expert systems typically ask the machine to return an answer by "pruning the logic tree" that the machine has constructed from the training.
[0045] The RMCM based method outlined above, includes graph-theoretic components and does not entirely depend on the tree structure. This approach ensures that all related conditionals are available for consideration and that the logic tree is not pruned prematurely losing a branch of analysis that might have proved useful.
[0046] RMCM may generate domain-aware query clusters that are generally not available by other methods typically used in the machine learning environments. In many machine learning applications one is faced with a high volume of information necessitating the pruning of some branches to analyze the most promising line of reasoning. If it becomes apparent that the line of reasoning being followed is not the most promising one then it becomes necessary to follow the backward chain of reasoning. However, this method of forward and backward can be inefficient if the error in reasoning is uncovered far down the stream. [0047] Other clustering methods have been proposed in the literature, but RMCM has the potential as a flexible, general purpose method of clustering that minimizes the need for backward chaining. The reason for this is that RMCM does not depend on the logical tree structure, but on clusters of queries whose mutual dependency may be related and analyzed via graph-theoretic considerations.
[0048] In an expert system implementing a medical diagnoses, for example, the method of this invention can make the difference between right or wrong diagnosis.
[0049] Consider the following example of medical diagnosis by Julie Herzner and Miriam ubiska (1992), reproduced here from http ://www.rpi .edu dept/chem-eng/B iotech- Environ/EXPERT/expmed.html:
Under the "IF-THEN" inference strategy ~
IF forced volume capacity is high (.625)
(high/low/medium)
AND Bronchoscopy results are positive (.250)
(positive/negative/inconclusive)
AND local symptoms are present (.125)
(present/partial/absent)
THEN surgery is probably necessary
IF Metastasis is present (.500)
(present/absent/unknown)
OR Contraindications to surgery exist (.500)
(yes/No)
THEN surgery is probably not appropriate.
[0050] This example has been reproduced here with the potential (RMCM) Fact Objects identified in bold and underlined, and with the corresponding Fact Values indicated in parentheses underneath.
[0051] In a corresponding RMCM approach, the THEN result will not be arrived at immediately. An RMCM compliant engine will expand its horizon and consider the family of related queries with other fact values before pruning the branches of the tree and arriving at a conclusion. In situations where the consequences of premature pruning can be enormous, such as in medical decision making, RMCM approach of establishing and considering expanded options can be extremely useful, even at the expense of some extra time.
[0052] It is noteworthy that the usual approaches to expert/knowledge system development do not have widely applicable algorithms to recapture the logical branches once they are pruned. [0053] Similar considerations will apply to Herzner and Kubiska 's version of the example below with a slightly different inference rules:
IF-THEN-UNLESS
IF forced volume capacity is high
AND Bronchoscopy results are positive
AND local symptoms are present
THEN Surgery is probably appropriate
UNLESS Metastasis is present
OR Contraindications to surgery exist
[0054] A computer-aided method of developing RMCM based materials for education or testing
By way of illustration, a partial collection of headings from related tables are shown below that underscore the thorough, interconnected databases maintained in an implementation of an RMCM based system. The environment of such a system is highly flexible and, due to the notions of fact object and fact value, provides richness of operational freedom down to significant segment and fact levels.
Record of ANSWER CHOICES for the Question
Figure imgf000013_0001
TABLE 1
Record of FACT OBJECTS for the Question and Associated Answer Choices
Figure imgf000013_0002
TABLE 2 Record of Fact Objects for the Question and Associated SCORE
Figure imgf000014_0001
[0055] It is anticipated that a computerized machine that has been trained to recognize the relationship between the questions and answers in a particular subject matter will acquire the capability to suggest new queries to the human examiner by intelligently searching its database of queries.
[0056] An "expert" machine may eventually also "write" queries for possible acceptance by a human examiner. The methodology for the machine to write such queries will often rely on the clustering mechanisms built into the RMCM system.
[0057] Having now described a few illustrative embodiments, it should be apparent to those skilled in the art that the foregoing is merely illustrative and not limiting, having been presented by way of example only. Numerous modifications and other embodiments are within the scope of one of ordinary skill in the art and are contemplated as falling within the scope of the invention.
What is claimed is:

Claims

1. A method of interacting with a computerized machine comprising the steps of: (a) providing one or more multiple-choice questions, each question comprising a query and a plurality of answer choices, wherein said plurality of answer choices comprises one correct answer and one or more incorrect answers; and (b) for at least one of said one or more incorrect answers, prompting said machine to produce a follow-up query to which said at least one incorrect answer is a correct answer.
2. A method of interacting with a computerized machine comprising the steps of: (a) providing by said machine one or more multiple-choice questions, each question comprising a query and a plurality of answer choices, wherein said plurality of answer choices comprises one correct answer and one or more incorrect answers; and (b) for at least one of said one or more incorrect answers producing a follow-up query to which said at least one incorrect answer is a correct answer.
3. The method of claim 1 or claim 2, wherein said follow-up query is a modification of said query.
4. The method of claim 1 or claim 2, wherein said at least one of said one or more incorrect answers is all of said incorrect answers.
5. A method of interaction between two or more communicants comprising the steps of: (a) providing by one communicant one or more multiple-choice questions, each question comprising a query and a plurality of answer choices, wherein said plurality of answer choices comprises one correct answer and one or more incorrect answers; (b) prompting another communicant to select one of said incorrect answers; and (c) prompting said another communicant to produce a follow-up query to which said one of said incorrect answers is a correct answer, wherein one or more of the communicants is a machine.
6. The method of claim 5, wherein step (c) comprises prompting said another communicant to modify fact objects in said query to produce said follow-up query.
7. The method of claim 5, wherein step (c) comprises prompting said another communicant to choose fact objects from a list to modify in producing said follow-up query.
8. The method of claim 7, further comprising the step of: (d) repeating step (c) such that said another communicant is prompted to produce a different follow-up query to which said one of said incorrect answers is a correct answer.
9. A method of interaction between two or more communicants comprising the steps of: (a) providing by one communicant one or more multiple-choice questions, each question comprising a query and a plurality of answer choices, wherein said plurality of answer choices comprises one correct answer and one or more incorrect answers; (b) providing a list of fact objects present in said query; and (c) prompting said another communicant to identify which fact objects from said list must be changed in said query for one of said one or more incorrect answers to become a correct answer, wherein one or more of the communicants is a machine.
10. The method of claim 9, further comprising the step of: (d) repeating step (c) for each of said one or more incorrect answers.
1 1. A program stored on a computer-readable medium which, when executed, performs the steps of: (a) providing one or more multiple-choice questions, each question comprising a query and a plurality of answer choices, wherein said plurality of answer choices comprises one correct answer and one or more incorrect answers; (b) providing to a user one of said one or more multiple-choice questions; (c) prompting said user to select one of said incorrect answers for said one of said one or more multiple-choice questions; (d) providing a follow-up query; (e) prompting said user to determine whether said incorrect answer is a correct answer to said follow-up query, wherein said user is a human being or a machine.
12. The program of claim 1 1, wherein steps (c), (d) and (e) are repeated for every one of said one or more incorrect answers.
PCT/US2013/000113 2012-04-26 2013-04-27 Reverse multiple-choice method for knowledge engineering and expert system implementation WO2013162652A2 (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201380033263.6A CN104620301A (en) 2012-04-26 2013-04-27 Reverse multiple-choice method for knowledge engineering and expert system implementation

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
US13/457,440 2012-04-26
US13/457,440 US20120288844A1 (en) 2000-09-11 2012-04-26 Reverse multiple-choice method for knowledge engineering and expert system implementation

Publications (2)

Publication Number Publication Date
WO2013162652A2 true WO2013162652A2 (en) 2013-10-31
WO2013162652A3 WO2013162652A3 (en) 2014-08-21

Family

ID=49484009

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/US2013/000113 WO2013162652A2 (en) 2012-04-26 2013-04-27 Reverse multiple-choice method for knowledge engineering and expert system implementation

Country Status (2)

Country Link
CN (1) CN104620301A (en)
WO (1) WO2013162652A2 (en)

Cited By (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
WO2017139726A1 (en) * 2016-02-11 2017-08-17 Indu Anand Reverse multiple-choice method for knowledge engineering and expert system implementation
US10430466B2 (en) 2018-01-25 2019-10-01 International Business Machines Corporation Streamlining support dialogues via transitive relationships between different dialogues

Families Citing this family (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JP6794992B2 (en) * 2015-10-13 2020-12-02 ソニー株式会社 Information processing equipment, information processing methods, and programs

Citations (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US6018617A (en) * 1997-07-31 2000-01-25 Advantage Learning Systems, Inc. Test generating and formatting system
JP2008197604A (en) * 2007-02-08 2008-08-28 Shoichi Watanabe English composition automatic erratum discrimination and guidance system or program
WO2011088557A2 (en) * 2010-01-22 2011-07-28 Ravindran Pulyassary Active learning system, method, board game and card games, game board and playing cards for use therewith
KR20110099977A (en) * 2010-03-03 2011-09-09 주식회사 지식과미래 System for providing incorrect answer note based on online lecture
US20120045744A1 (en) * 2010-08-23 2012-02-23 Daniel Nickolai Collaborative University Placement Exam

Patent Citations (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US6018617A (en) * 1997-07-31 2000-01-25 Advantage Learning Systems, Inc. Test generating and formatting system
JP2008197604A (en) * 2007-02-08 2008-08-28 Shoichi Watanabe English composition automatic erratum discrimination and guidance system or program
WO2011088557A2 (en) * 2010-01-22 2011-07-28 Ravindran Pulyassary Active learning system, method, board game and card games, game board and playing cards for use therewith
KR20110099977A (en) * 2010-03-03 2011-09-09 주식회사 지식과미래 System for providing incorrect answer note based on online lecture
US20120045744A1 (en) * 2010-08-23 2012-02-23 Daniel Nickolai Collaborative University Placement Exam

Cited By (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
WO2017139726A1 (en) * 2016-02-11 2017-08-17 Indu Anand Reverse multiple-choice method for knowledge engineering and expert system implementation
US10430466B2 (en) 2018-01-25 2019-10-01 International Business Machines Corporation Streamlining support dialogues via transitive relationships between different dialogues
US10783187B2 (en) 2018-01-25 2020-09-22 International Business Machines Corporation Streamlining support dialogues via transitive relationships between different dialogues

Also Published As

Publication number Publication date
CN104620301A (en) 2015-05-13
WO2013162652A3 (en) 2014-08-21

Similar Documents

Publication Publication Date Title
US20120288844A1 (en) Reverse multiple-choice method for knowledge engineering and expert system implementation
Li et al. A machine learning approach for automatic student model discovery.
Daniels Cognitive models in information retrieval—an evaluative review
LeBlanc et al. Text integration and mathematical connections: A computer model of arithmetic word problem solving
Sulmont et al. What is hard about teaching machine learning to non-majors? Insights from classifying instructors’ learning goals
Al Emran et al. A survey of intelligent language tutoring systems
CN108021703B (en) Conversation type intelligent teaching system
Wiggins et al. Do you think you can? the influence of student self-efficacy on the effectiveness of tutorial dialogue for computer science
US9542854B2 (en) Reverse-multiple choice method for knowledge engineering and expert system implementation
Huang et al. Constructing educational concept maps with multiple relationships from multi-source data
Bertacchini et al. A project based learning approach for improving students’ computational thinking skills
Stancheva et al. An environment for automatic test generation
Dragon et al. Coaching within a domain independent inquiry environment
Moiseeva et al. Multipurpose intelligent process automation via conversational assistant
Almelhes A review of artificial intelligence adoption in second-language learning
WO2013162652A2 (en) Reverse multiple-choice method for knowledge engineering and expert system implementation
Sokolnicki Towards knowledge-based tutors: a survey and appraisal of Intelligent Tutoring Systems
Tran et al. Generating multiple choice questions for computing courses using large language models
Možina et al. Identifying typical approaches and errors in Prolog programming with argument-based machine learning
Suraweera et al. Widening the knowledge acquisition bottleneck for constraint-based tutors
Linn et al. How do LISP programmers draw on previous experience to solve novel problems?
de los Angeles Chang Capturing qualitative science knowledge with multimodal instructional analogies
Almstrum Limitations in the understanding of mathematical logic by novice computer science students
Sulmont Machine learning for end-users: Exploring learning goals and pedagogical content knowledge
Castro Development of a Data-Grounded Theory of Program Design in HTDP

Legal Events

Date Code Title Description
121 Ep: the epo has been informed by wipo that ep was designated in this application

Ref document number: 13780937

Country of ref document: EP

Kind code of ref document: A2

122 Ep: pct application non-entry in european phase

Ref document number: 13780937

Country of ref document: EP

Kind code of ref document: A2