WO2003017060A2 - Consistency validation for complex classification rules - Google Patents

Consistency validation for complex classification rules Download PDF

Info

Publication number
WO2003017060A2
WO2003017060A2 PCT/US2002/026286 US0226286W WO03017060A2 WO 2003017060 A2 WO2003017060 A2 WO 2003017060A2 US 0226286 W US0226286 W US 0226286W WO 03017060 A2 WO03017060 A2 WO 03017060A2
Authority
WO
WIPO (PCT)
Prior art keywords
rules
attributes
under
lapping
user
Prior art date
Application number
PCT/US2002/026286
Other languages
French (fr)
Other versions
WO2003017060A3 (en
Inventor
Jacob Feldman
Alexander Korolov
Semen Meshcheryakov
Stanislav Shor
Original Assignee
Exigen Group
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Priority claimed from US09/953,701 external-priority patent/US20030018486A1/en
Application filed by Exigen Group filed Critical Exigen Group
Priority to EP02763468A priority Critical patent/EP1425691A2/en
Priority to AU2002327476A priority patent/AU2002327476A1/en
Publication of WO2003017060A2 publication Critical patent/WO2003017060A2/en
Publication of WO2003017060A3 publication Critical patent/WO2003017060A3/en

Links

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06QINFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES; SYSTEMS OR METHODS SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES, NOT OTHERWISE PROVIDED FOR
    • G06Q40/00Finance; Insurance; Tax strategies; Processing of corporate or income taxes
    • G06Q40/02Banking, e.g. interest calculation or account maintenance
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06QINFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES; SYSTEMS OR METHODS SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES, NOT OTHERWISE PROVIDED FOR
    • G06Q99/00Subject matter not provided for in other groups of this subclass

Definitions

  • the present invention relates generally to the field of expert systems.
  • Modern rule-based technology provides the ideal architecture for implementing flexible, easy-to-maintain business rule applications, because such applications keep the rules separate from the application code.
  • Business rules are statements of business policies and procedures that drive day-to-day business activity. Typically, the rules are presented in the form of "if ⁇ condition> then ⁇ action>", thus allowing the users to describe a numerous and various business situations.
  • business rules can rarely be presented in the form of simple "if-then" statements.
  • typical classification rules are combinations of multiple criteria, which frequently are presented as multi-column multi- row tables.
  • the insurance driver classification rule considers all possible combinations of driver age, gender, marital status, driving experience, vehicle usage, etc. Such rules result in multi-page tables, which are difficult to create and maintain. Usually, maintenance of business rules is the responsibility of (non-technical) business specialists, and the proper software tool has to provide consistency checking of rules with a complex infrastructure, and must point users to possible inconsistencies.
  • a method and apparatus for providing a practical solution for the generic problem of consistency validation of complex business rules is disclosed.
  • the method and apparatus is implemented with software tools.
  • Figure 1 is a flow diagram showing a method for validating the consistency of business rules, according to one embodiment.
  • typical classification rules deal with one prime classification object (for example, Driver) and different combinations of its attributes (such as Age, Gender, Marital Status, Driving Experience, Vehicle Usage, Violation Points, etc.).
  • Such classification rules usually state the proper object class for all possible combinations of its attributes.
  • classification rules are represented as a constraint satisfaction problem (CSP), shown in process block 101.
  • CSP constraint satisfaction problem
  • One constrained variable is associated with each attribute, shown in process block 102, and for each user-defined combination of the attributes, a constraint on these variables is defined, shown in process block 103.
  • consistency checking deals with only three possible situations:
  • Over-Lapping The classification includes contradictory (overlapping) combinations of attribute values.
  • an actual table may consist of 20 or more pages.
  • several attributes could be unrelated to the values of other attributes. For example, if the number of violation points is too high, it results in a special driver class, independent of the driver's age or gender.
  • CSP constraint satisfaction problem
  • one constrained integer variable is associated with each attribute, as seen in process block 102, and as shown in this expression:
  • the number N corresponds to the number of columns in the table above.
  • the domain of possible values has a specific size and content for each Attr(x), but without losing the generality of the definition, all values could in one embodiment be considered as integers.
  • Under-Coverage Validation In one embodiment, with the constraint defined "AllRows — 0", an attempt is made to instantiate all Attr(x). If the CSP has a solution, it means that there is a combination of attributes for which all RowExp(y) are false. In the example given in this embodiment, it means a driver exists who cannot be classified by any defined row of attributes, and the classification rule is inconsistent (due to under- coverage). This evaluation is shown in process blocks 104 and 106 of Figure 1. [0017] Over-Lapping Validation: In one embodiment, with the constraint defined "AllRows > 1", an attempt is made to instantiate all Attr(x).
  • the CSP has a solution, it means that more than one row conditions are true. In the example given in this embodiment, it means a driver exists who could be classified by more than one row of attributes, and the classification rule is inconsistent (due to over-lapping). This evaluation is shown in process blocks 104 and 106 of Figure 1.
  • the proposed solution in one embodiment goes beyond the simple “consistent” or “not consistent” diagnosis, and allows to proper program to tell the user why and where the inconsistency occurs.
  • the found solution points exactly to the combination of attributes not covered by the current classification rule, shown as process block 107.
  • the processes and embodiments as described above can be stored on a machine-readable medium as instructions.
  • the machine-readable medium includes any mechanism that provides (i.e., stores and/or transmits) information in a form readable by a machine (e.g., a computer).
  • a machine-readable medium includes read only memory (ROM); random access memory (RAM); magnetic disk storage media; optical storage media; flash memory devices; electrical, optical, acoustical or other form of propagated signals (e.g., carrier waves, infrared signals, digital signals, etc.).
  • the device or machine-readable medium may include a solid state memory device and or a rotating magnetic or optical disk.
  • the device or machine-readable medium may be distributed when partitions of instructions have been separated into different machines, such as across an interconnection of computers.

Abstract

A method and apparatus for providing a practical solution for the generic problem of consistency validation of complex business rules is disclosed. In one embodiment, the method and apparatus are implemented with software tools. The method includes providing for association of a constrained variable with an attribute of rules (102), and providing for definition of a constraint on constrained variables for a combination of attributes defined by a user (103).

Description

Consistency Validation for Complex Classification Rules
[0001] The present application claims priority to the following provisional filed applications entitled Consistency Validation for Complex Classification Rules, filed on
August 16, 2001, serial no. ; entitled Hybrid Use of Rule and Constraint
Engines, filed on June 25, 2001, serial no. 60/300,951; entitled Minimization of Business
Rule Violations, filed on August 16, serial no. , all of which are incorporated herein by reference.
FIELD OF THE INVENTION
[0002] The present invention relates generally to the field of expert systems.
BACKGROUND OF THE INVENTION
[0003] Modern rule-based technology provides the ideal architecture for implementing flexible, easy-to-maintain business rule applications, because such applications keep the rules separate from the application code. Business rules are statements of business policies and procedures that drive day-to-day business activity. Typically, the rules are presented in the form of "if <condition> then <action>", thus allowing the users to describe a numerous and various business situations. [0004] However, in real-world e-business applications, such as an Insurance Rating system or a Financial Loan Origination system, business rules can rarely be presented in the form of simple "if-then" statements. In particular, typical classification rules are combinations of multiple criteria, which frequently are presented as multi-column multi- row tables.
[0005] For example, the insurance driver classification rule considers all possible combinations of driver age, gender, marital status, driving experience, vehicle usage, etc. Such rules result in multi-page tables, which are difficult to create and maintain. Usually, maintenance of business rules is the responsibility of (non-technical) business specialists, and the proper software tool has to provide consistency checking of rules with a complex infrastructure, and must point users to possible inconsistencies. SUMMARY OF THE INVENTION
[0006] A method and apparatus for providing a practical solution for the generic problem of consistency validation of complex business rules is disclosed. In one embodiment, the method and apparatus is implemented with software tools.
BRIEF DESCRIPTION OF THE DRAWINGS
[0007] Figure 1 is a flow diagram showing a method for validating the consistency of business rules, according to one embodiment.
DETAILED DESCRIPTION
[0008] In one embodiment, typical classification rules deal with one prime classification object (for example, Driver) and different combinations of its attributes (such as Age, Gender, Marital Status, Driving Experience, Vehicle Usage, Violation Points, etc.). Such classification rules usually state the proper object class for all possible combinations of its attributes.
[0009] hi one embodiment, as exemplified in Figure 1, according to one embodiment, classification rules are represented as a constraint satisfaction problem (CSP), shown in process block 101. One constrained variable is associated with each attribute, shown in process block 102, and for each user-defined combination of the attributes, a constraint on these variables is defined, shown in process block 103. Thus, in one embodiment, consistency checking deals with only three possible situations:
1. Over-Lapping: The classification includes contradictory (overlapping) combinations of attribute values.
2. Under-Coverage: The classification rule does not cover all possible situations.
3. Consistency: The rule is consistent.
For example, consider the driver classification rule represented in the following table:
Figure imgf000003_0001
Figure imgf000004_0001
To cover all possible combination of attributes, an actual table may consist of 20 or more pages. In one embodiment, several attributes could be unrelated to the values of other attributes. For example, if the number of violation points is too high, it results in a special driver class, independent of the driver's age or gender. [0010] To define a constraint satisfaction problem (CSP), in one embodiment one constrained integer variable is associated with each attribute, as seen in process block 102, and as shown in this expression:
Attr(x), x<≡ {l,N} [0011] The number N corresponds to the number of columns in the table above. The domain of possible values has a specific size and content for each Attr(x), but without losing the generality of the definition, all values could in one embodiment be considered as integers.
[0012] Now assume that there are M different combinations of all attributes defined by the user. The number M corresponds to the number of rows in the table above. If cell (x,y) defines some values for Attr(x), a Boolean constrained expression CellExp(x,y) may in one embodiment be associated with the cell. For example, for cell (1,1) the condition like "Gender is Male" could be represented in one embodiment as
Celffixp(U): Attr(l) = 0, where 0 corresponds "Male".
[0013] Similarly, for cell (3,2) a condition such as, for example, "Age is 17 through 24", could be represented in one embodiment as
CellExp(3,2): Attr(3) >= 17 && Attr(3) <= 24. [0014] Each user-defined combination of attributes (the table row number "y" in this example) could be presented in one embodiment as a Boolean constrained expression
RowExp(y) = αxCellExp(x,y), where α is a logical "and" of all y Boolean expressions CellExp(x,y). And, finally, an integer-constrained variable may be defined, as seen in process block 103, in one embodiment as
AURows = ∑yRowExp(x,y), [0015] To validate the consistency of the classification rule in one embodiment, the following CSP(s) could be solved:
[0016] Under-Coverage Validation: In one embodiment, with the constraint defined "AllRows — 0", an attempt is made to instantiate all Attr(x). If the CSP has a solution, it means that there is a combination of attributes for which all RowExp(y) are false. In the example given in this embodiment, it means a driver exists who cannot be classified by any defined row of attributes, and the classification rule is inconsistent (due to under- coverage). This evaluation is shown in process blocks 104 and 106 of Figure 1. [0017] Over-Lapping Validation: In one embodiment, with the constraint defined "AllRows > 1", an attempt is made to instantiate all Attr(x). If the CSP has a solution, it means that more than one row conditions are true. In the example given in this embodiment, it means a driver exists who could be classified by more than one row of attributes, and the classification rule is inconsistent (due to over-lapping). This evaluation is shown in process blocks 104 and 106 of Figure 1.
[0018] The classification rule is consistent, as shown in process block 105 according to one embodiment, when the CSP has no solutions, as shown in process block 104, under the constraint "AllRows != 1".
[0019] The proposed solution in one embodiment goes beyond the simple "consistent" or "not consistent" diagnosis, and allows to proper program to tell the user why and where the inconsistency occurs. In cases of under-coverage, the found solution points exactly to the combination of attributes not covered by the current classification rule, shown as process block 107. In cases of over-lapping, all rows with RowExp(y)=true are over-lapping.
[0020] And finally, the same technique could be used in one embodiment for interactive creation (configuration) of the classification rules. Instead of only pointing to the inconsistency, interactive tools could automatically generate "under-covered" rows and not to allow the user to enter over-lapping rows, shown as process block 108, or warn the user that he creates an overlap, and force him to correct it. [0021] The described approach has been implemented in one embodiment for Auto Insurance Rating rules using Exigen Rules™ and Exigen Constrainer™. [0022] It will be clear to the person skilled in the art, that besides insurances, other applications of the described embodiments exist, such as, including but not limited to, financial services in general, governmental agencies, resource planning situations in transport and distribution, etc.
[0023] The processes and embodiments as described above can be stored on a machine-readable medium as instructions. The machine-readable medium includes any mechanism that provides (i.e., stores and/or transmits) information in a form readable by a machine (e.g., a computer). For example, a machine-readable medium includes read only memory (ROM); random access memory (RAM); magnetic disk storage media; optical storage media; flash memory devices; electrical, optical, acoustical or other form of propagated signals (e.g., carrier waves, infrared signals, digital signals, etc.). The device or machine-readable medium may include a solid state memory device and or a rotating magnetic or optical disk. The device or machine-readable medium may be distributed when partitions of instructions have been separated into different machines, such as across an interconnection of computers.
[0024] While certain exemplary embodiments have been described and shown in the accompanying drawings, it is to be understood that such embodiments are merely illustrative of and not restrictive on the broad invention, and that this invention not be limited to the specific constructions and arrangements shown and described, since various other modifications may occur to those ordinarily skilled in the art.

Claims

CLAIMSWhat is claimed is:
1. A method comprising: providing for association of a constrained variable with an attribute of rules; and providing for definition of a constraint on constrained variables for a combination of attributes defined by a user.
2. The method of claim 1 , wherein the rules are consistent only if a constraint satisfaction problem has no solutions under over-lapping and under-coverage constraints.
3. The method of claim 1 , wherein the rules are inconsistent if a constraint satisfaction problem has a solution by instantiations of all attributes when over-lapping or under-coverage constraints are defined.
4. The method of claim 1, further comprising providing for telling a user which combinations of attributes cause the rules to be under-covered or over-lapping.
5. The method of claim 1, further comprising providing for preventing a user from entering rules that are inconsistent.
6. A machine-readable medium that provides instructions which, when executed by a machine, cause the machine to perform operations comprising: associating a constrained variable with an attribute of rules; and defining a constraint on constrained variables for a combination of attributes defined by a user.
7. The machine-readable medium of claim 6, wherein the rules are consistent only if a constraint satisfaction problem has no solutions under over-lapping and under-coverage constraints.
8. The machine-readable medium of claim 6, wherein the rules are inconsistent if a constraint satisfaction problem has a solution by instantiations of all attributes when over-lapping or under-coverage constraints are defined.
9. The machine-readable medium of claim 6, wherein operations further comprise telling a user which combinations of attributes cause the rules to be under-covered or over-lapping.
10. The machine-readable medium of claim 6, wherein operations further comprise preventing a user from entering rules that are inconsistent.
11. An apparatus comprising at least one tool to provide for validation of consistency of rules by representing the rules as a constraint satisfaction problem.
12. The apparatus of claim 11, wherein the rules are consistent only if the constraint satisfaction problem has no solutions under over-lapping and under-coverage constraints.
13. The apparatus of claim 11, wherein representing the rules as a constraint satisfaction problem includes: associating a constrained variable with an attribute of the rules for each attribute; and defining a constraint on all constrained variables for a combination of attributes, for each combination, defined by a user.
14. The apparatus of claim 13, wherein the rules are inconsistent if the constraint satisfaction problem has a solution by instantiations of all attributes when over-lapping or under-coverage constraints are defined.
15. The apparatus of claim 13, wherein validation includes telling a user which combinations of attributes cause the rules to be under-covered or over-lapping.
16. The apparatus of claim 13, wherein validation includes preventing a user from entering rules that are inconsistent.
PCT/US2002/026286 2001-08-16 2002-08-16 Consistency validation for complex classification rules WO2003017060A2 (en)

Priority Applications (2)

Application Number Priority Date Filing Date Title
EP02763468A EP1425691A2 (en) 2001-08-16 2002-08-16 Consistency validation for complex classification rules
AU2002327476A AU2002327476A1 (en) 2001-08-16 2002-08-16 Consistency validation for complex classification rules

Applications Claiming Priority (6)

Application Number Priority Date Filing Date Title
US31307901P 2001-08-16 2001-08-16
US31304001P 2001-08-16 2001-08-16
US60/313,040 2001-08-16
US60/313,079 2001-08-16
US09/953,701 US20030018486A1 (en) 2001-06-25 2001-09-11 Consistency validation for complex classification rules
US09/953,701 2001-09-11

Publications (2)

Publication Number Publication Date
WO2003017060A2 true WO2003017060A2 (en) 2003-02-27
WO2003017060A3 WO2003017060A3 (en) 2003-12-24

Family

ID=27405639

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/US2002/026286 WO2003017060A2 (en) 2001-08-16 2002-08-16 Consistency validation for complex classification rules

Country Status (3)

Country Link
EP (1) EP1425691A2 (en)
AU (1) AU2002327476A1 (en)
WO (1) WO2003017060A2 (en)

Citations (7)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US4658370A (en) * 1984-06-07 1987-04-14 Teknowledge, Inc. Knowledge engineering tool
US5970464A (en) * 1997-09-10 1999-10-19 International Business Machines Corporation Data mining based underwriting profitability analysis
US6016477A (en) * 1997-12-18 2000-01-18 International Business Machines Corporation Method and apparatus for identifying applicable business rules
US6272482B1 (en) * 1998-08-14 2001-08-07 International Business Machines Corporation Managing business rules using jurisdictions
US6377932B1 (en) * 1998-07-02 2002-04-23 Ita Software, Inc. Rules validation for travel planning system
US6415275B1 (en) * 1999-08-05 2002-07-02 Unisys Corp. Method and system for processing rules using an extensible object-oriented model resident within a repository
US6473748B1 (en) * 1998-08-31 2002-10-29 Worldcom, Inc. System for implementing rules

Patent Citations (7)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US4658370A (en) * 1984-06-07 1987-04-14 Teknowledge, Inc. Knowledge engineering tool
US5970464A (en) * 1997-09-10 1999-10-19 International Business Machines Corporation Data mining based underwriting profitability analysis
US6016477A (en) * 1997-12-18 2000-01-18 International Business Machines Corporation Method and apparatus for identifying applicable business rules
US6377932B1 (en) * 1998-07-02 2002-04-23 Ita Software, Inc. Rules validation for travel planning system
US6272482B1 (en) * 1998-08-14 2001-08-07 International Business Machines Corporation Managing business rules using jurisdictions
US6473748B1 (en) * 1998-08-31 2002-10-29 Worldcom, Inc. System for implementing rules
US6415275B1 (en) * 1999-08-05 2002-07-02 Unisys Corp. Method and system for processing rules using an extensible object-oriented model resident within a repository

Also Published As

Publication number Publication date
WO2003017060A3 (en) 2003-12-24
EP1425691A2 (en) 2004-06-09
AU2002327476A1 (en) 2003-03-03

Similar Documents

Publication Publication Date Title
Veale et al. Fairness and accountability design needs for algorithmic support in high-stakes public sector decision-making
US6856980B2 (en) Hybrid use of rule and constraint engines
Vanetti et al. Content-based filtering in on-line social networks
JP5957580B2 (en) Data processing using vector fields
US10354187B2 (en) Confidentiality of files using file vectorization and machine learning
US20230004890A1 (en) Continuous and anonymous risk evaluation
US20210350211A1 (en) Distributed architecture for explainable ai models
CN103365812A (en) Method and system for data privacy engine
Bhargava et al. LimeOut: an ensemble approach to improve process fairness
Spertus et al. Sweeter than SUITE: supermartingale stratified union-intersection tests of elections
CN109783781A (en) Declaration form input method and relevant apparatus based on image recognition
US11531845B1 (en) Bias mitigating machine learning training system
US7003749B2 (en) Constraint data management for electronic design automation
US20030018486A1 (en) Consistency validation for complex classification rules
Pradhan et al. Explainable ai: Foundations, applications, opportunities for data management research
Deshmukh Complexity and chaos in manufacturing systems
Sánchez et al. Federatedtrust: A solution for trustworthy federated learning
US20020198856A1 (en) Minimization of business rules violations
WO2003017060A2 (en) Consistency validation for complex classification rules
Matzner Algorithms: Technology, Culture, Politics
Fisun et al. Knowledge management applications based on user activities feedback
Lundberg Increasing the Trustworthiness ofAI-based In-Vehicle IDS usingeXplainable AI
JP7430298B2 (en) Information processing device, information processing method, and program
US20230359890A1 (en) Bias mitigating machine learning training system with multi-class target
US11790036B2 (en) Bias mitigating machine learning training system

Legal Events

Date Code Title Description
AK Designated states

Kind code of ref document: A2

Designated state(s): AE AG AL AM AT AU AZ BA BB BG BY BZ CA CH CN CO CR CU CZ DE DM DZ EC EE ES FI GB GD GE GH HR HU ID IL IN IS JP KE KG KP KR LC LK LR LS LT LU LV MA MD MG MN MW MX MZ NO NZ OM PH PL PT RU SD SE SG SI SK SL TJ TM TN TR TZ UA UG US UZ VN YU ZA ZM

Kind code of ref document: A2

Designated state(s): AE AG AL AM AT AU AZ BA BB BG BR BY BZ CA CH CN CO CR CU CZ DE DK DM DZ EC EE ES FI GB GD GE GH GM HR HU ID IL IN IS JP KE KG KP KR KZ LC LK LR LS LT LU LV MA MD MG MK MN MW MX MZ NO NZ OM PH PL PT RO RU SD SE SG SI SK SL TJ TM TN TR TT TZ UA UG US UZ VN YU ZA ZM ZW

AL Designated countries for regional patents

Kind code of ref document: A2

Designated state(s): GH GM KE LS MW MZ SD SL SZ TZ UG ZM ZW AM AZ BY KG KZ MD RU TJ TM AT BE BG CH CY CZ DE DK EE ES FI FR GB GR IE IT LU MC NL PT SE SK TR BF BJ CF CG CI CM GA GN GQ GW ML MR NE SN TD TG

Kind code of ref document: A2

Designated state(s): GH GM KE LS MW MZ SD SL SZ UG ZM ZW AM AZ BY KG KZ RU TJ TM AT BE BG CH CY CZ DK EE ES FI FR GB GR IE IT LU MC PT SE SK TR BF BJ CF CG CI GA GN GQ GW ML MR NE SN TD TG

121 Ep: the epo has been informed by wipo that ep was designated in this application
DFPE Request for preliminary examination filed prior to expiration of 19th month from priority date (pct application filed before 20040101)
WWE Wipo information: entry into national phase

Ref document number: 2002763468

Country of ref document: EP

WWP Wipo information: published in national office

Ref document number: 2002763468

Country of ref document: EP

REG Reference to national code

Ref country code: DE

Ref legal event code: 8642

NENP Non-entry into the national phase in:

Ref country code: JP

WWW Wipo information: withdrawn in national office

Country of ref document: JP