WO2005013151A1 - Methods and systems for editing a network of interconnected concepts - Google Patents

Methods and systems for editing a network of interconnected concepts Download PDF

Info

Publication number
WO2005013151A1
WO2005013151A1 PCT/US2004/023828 US2004023828W WO2005013151A1 WO 2005013151 A1 WO2005013151 A1 WO 2005013151A1 US 2004023828 W US2004023828 W US 2004023828W WO 2005013151 A1 WO2005013151 A1 WO 2005013151A1
Authority
WO
WIPO (PCT)
Prior art keywords
concept
receiving
relationship
program code
computer
Prior art date
Application number
PCT/US2004/023828
Other languages
French (fr)
Inventor
Adam J. Weissman
Gilad Israel Elbaz
Original Assignee
Google Inc.
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Google Inc. filed Critical Google Inc.
Publication of WO2005013151A1 publication Critical patent/WO2005013151A1/en

Links

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/30Information retrieval; Database structures therefor; File system structures therefor of unstructured textual data
    • G06F16/36Creation of semantic tools, e.g. ontology or thesauri
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/20Information retrieval; Database structures therefor; File system structures therefor of structured data, e.g. relational data
    • G06F16/28Databases characterised by their database models, e.g. relational or object models
    • G06F16/284Relational databases
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/30Information retrieval; Database structures therefor; File system structures therefor of unstructured textual data
    • G06F16/31Indexing; Data structures therefor; Storage structures
    • G06F16/313Selection or weighting of terms for indexing
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/30Information retrieval; Database structures therefor; File system structures therefor of unstructured textual data
    • G06F16/36Creation of semantic tools, e.g. ontology or thesauri
    • G06F16/367Ontology
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/90Details of database functions independent of the retrieved data types
    • G06F16/95Retrieval from the web
    • G06F16/954Navigation, e.g. using categorised browsing
    • YGENERAL TAGGING OF NEW TECHNOLOGICAL DEVELOPMENTS; GENERAL TAGGING OF CROSS-SECTIONAL TECHNOLOGIES SPANNING OVER SEVERAL SECTIONS OF THE IPC; TECHNICAL SUBJECTS COVERED BY FORMER USPC CROSS-REFERENCE ART COLLECTIONS [XRACs] AND DIGESTS
    • Y10TECHNICAL SUBJECTS COVERED BY FORMER USPC
    • Y10STECHNICAL SUBJECTS COVERED BY FORMER USPC CROSS-REFERENCE ART COLLECTIONS [XRACs] AND DIGESTS
    • Y10S707/00Data processing: database and file management or data structures
    • Y10S707/953Organization of data
    • YGENERAL TAGGING OF NEW TECHNOLOGICAL DEVELOPMENTS; GENERAL TAGGING OF CROSS-SECTIONAL TECHNOLOGIES SPANNING OVER SEVERAL SECTIONS OF THE IPC; TECHNICAL SUBJECTS COVERED BY FORMER USPC CROSS-REFERENCE ART COLLECTIONS [XRACs] AND DIGESTS
    • Y10TECHNICAL SUBJECTS COVERED BY FORMER USPC
    • Y10STECHNICAL SUBJECTS COVERED BY FORMER USPC CROSS-REFERENCE ART COLLECTIONS [XRACs] AND DIGESTS
    • Y10S707/00Data processing: database and file management or data structures
    • Y10S707/953Organization of data
    • Y10S707/956Hierarchical
    • YGENERAL TAGGING OF NEW TECHNOLOGICAL DEVELOPMENTS; GENERAL TAGGING OF CROSS-SECTIONAL TECHNOLOGIES SPANNING OVER SEVERAL SECTIONS OF THE IPC; TECHNICAL SUBJECTS COVERED BY FORMER USPC CROSS-REFERENCE ART COLLECTIONS [XRACs] AND DIGESTS
    • Y10TECHNICAL SUBJECTS COVERED BY FORMER USPC
    • Y10STECHNICAL SUBJECTS COVERED BY FORMER USPC CROSS-REFERENCE ART COLLECTIONS [XRACs] AND DIGESTS
    • Y10S707/00Data processing: database and file management or data structures
    • Y10S707/953Organization of data
    • Y10S707/96Object-relational

Definitions

  • the invention generally relates to a network of interconnected concepts. More particularly, the invention relates to methods and systems for editing a network of interconnected concepts.
  • a search engine or search engine program is a widely used mechanism for allowing users to search vast numbers of documents for information.
  • search engines index the documents and the terms appearing in the documents.
  • a term can be a string of characters or groups of strings of characters representing a one or more related words.
  • a search engine provides a user interface that
  • search engine includes a query field.
  • a user can enter one or more search terms describing the desired information in the query field.
  • the search engine In response to a query, the search engine
  • the search engine attempts to locate relevant documents by matching the search terms to the indexed terms appearing in the documents. Based on these matches, the search engine ranks, sorts and then returns for display the search results, usually as a list of relevant
  • a concept can be a normalized semantic representation that can be expressed with any number of terms.
  • a search engine can utilize a
  • networks generally associate concepts in a hierarchical fashion and provide no weight to the associations. Further, such conventional networks of concepts are not easily editable.
  • SUMMARY Embodiments of the present invention comprise methods and systems for editing a network of interconnected concepts.
  • a semantic engine implements a method comprising receiving a primary term representing a first
  • semantic engine implements a method comprising receiving a request to edit a first
  • FIG. 1 illustrates a block diagram of a system in accordance with one
  • FIG. 2 illustrates an example of a portion of a relationship-based ontology in
  • FIG. 3 illustrates an example of bond strength and semantic distance in a
  • FIG. 4 illustrates an exemplary concept view in accordance with one embodiment of the present invention.
  • FIG. 5 illustrates a flow diagram of a method in accordance with one embodiment ofthe present invention.
  • FIG. 6 illustrates a flow diagram of a method in accordance with one
  • a semantic engine allows for the creation and editing of concepts in a network of interconnected concepts or ontology.
  • the concepts can be defined by a synset or a group of related terms, such as synonyms and acronyms, relationships, both hierarchical and lateral, with other
  • the semantic engine allows for the creation or editing of a concept by allowing a user to define the related terms, the relationships with other concepts, and the strength of the
  • FIG. 1 is a diagram illustrating an exemplary system in which exemplary
  • the present invention may operate.
  • the present invention may operate, and be embodied in, other systems as well.
  • the system 100 shown in FIG. 1 includes multiple client devices 102a-n, a
  • the server device 104 and a network 106.
  • the network 106 shown can be the Internet. In other embodiments, other networks, such as an intranet may be used. Moreover,
  • the client devices 102a-n shown each include a computer-readable medium, such as a random access memory (RAM) 108 in the embodiment shown, coupled to a processor
  • RAM random access memory
  • the processor 110 executes a set of computer-executable program instructions
  • processors may include a microprocessor, an ASIC, and
  • Such processors include, or may be in communication with, media, for example computer-readable media, which stores instructions that, when executed
  • Embodiments of computer-readable media include, but are not limited to, an
  • processor such as the processor in communication with a touch-sensitive input device, with computer-readable instructions.
  • a computer processor includes, but are not limited to, a floppy disk, CD-ROM, magnetic disk, memory chip, ROM, RAM, an ASIC, a configured processor, all optical media, all magnetic tape or other magnetic media, or any other medium from which a computer processor can
  • various other forms of computer-readable media may transmit or carry instructions to a computer, including a router, private or public network, or other transmission device or channel, both wired and wireless.
  • the instructions may comprise code from any computer-programming language, including, for example, C, C++, C#, Visual Basic, Java, Python, Perl, and JavaScript.
  • Client devices 102a-n may also include a number of external or internal
  • a mouse such as a mouse, a CD-ROM, a keyboard, a display, or other input or output
  • client devices 102a-n are personal computers, digital assistants, personal digital assistants, cellular phones, mobile phones, smart phones, pagers, digital tablets, laptop computers, a processor-based device and similar types of systems and devices.
  • client device 102a-n may be any type of processor-
  • the client devices 102a-n shown include personal computers
  • a user interface application program and/or a browser application program such as Internet ExplorerTM, version 6.0 from Microsoft Corporation, Netscape
  • server device 104 is also coupled to the network 106.
  • the server device 104 shown includes a server executing a semantic engine program.
  • the system 100 can also include multiple other server devices. Similar to the client devices 102a-n, the server device 104 shown includes a processor 116 coupled to a
  • the server device 104 is depicted as a single computer system, but may be implemented as a network of computer processors.
  • server device 104 examples are servers, mainframe computers, networked computers, a processor-based device and similar types of systems and devices.
  • Client processors 110 and server processor 116 can be any of a number of well known computer processors, such as processors from Intel Corporation of Santa Clara, California and Motorola Corporation of Schaumburg, Illinois.
  • Memory 118 of the server device 104 contains a semantic engine application
  • the semantic engine 120 manages
  • the semantic engine 120 also can index articles and concepts found in articles to enable the location and retrieval of information.
  • Articles include,
  • documents for example, web pages of various formats, such as HTML, XML,
  • the semantic engine 120 shown includes a concept processor 130, which, in the embodiment shown, comprises computer code residing in the memory 118.
  • the concept processor 130 can edit and create concepts in a network of interconnected concepts based on information received from a client device 102a input by a user 112a.
  • the client processor 130 can have a user interface application that allows user
  • Server device 104 also provides access to other storage elements, such as a concept storage element, in the example shown a concept database 122.
  • the concept database 120 can be used to store the ontology.
  • the concept database can be located in memory 118 as shown or can be located external to server device 104. Data
  • storage elements may include any one or combination of methods for storing data, including without limitation, arrays, hash tables, lists, and pairs. Other similar types
  • present invention may comprise systems having
  • the semantic engine 120 may not be on a single server device 104, but may be on multiple server devices.
  • the system 100 shown in FIG. 1 is merely exemplary, and is used to explain the exemplary methods
  • the concept database 122 can contain an ontology that can describe a semantic space that can be employed to facilitate concept-based location
  • the ontology consists of a plurality of concepts, which are connected to one another through many different types of relationships, such as,
  • a lateral bond relationship describes one concept's closeness to another in an average person's common understanding. For example, “skier” and “skiing” are not closely related concepts in conventional ontologies. The former is a kind of "athlete", ultimately a
  • FIG. 2 a simple example is illustrated by FIG. 2.
  • the boxed elements in FIG. 2 represent meanings within the ontology and collectively, along with the relationship connections between meanings can be viewed
  • FIG. 2 concerns skiing. Starting with the branch for "sport",
  • skiing is defined in the ontology 200 as a kind of “sport”.
  • the word “ski” typically, in its noun form, can be thought of as related to “skiing” in that it is a “part of “skiing” as shown in FIG. 2.
  • "Slalom skiing” is a type of skiing and hence a “kind of connection is shown between it and “skiing”.
  • "Bindings” are a structural attachment
  • lateral bond can be, for example, "product of. The company branch has a child
  • Target documents or nodes within a subject directory may be "placed” or “located” by human intervention or automatically into the semantic space as defined by ontology 200.
  • a value can be assigned to a connection from one concept to another that signifies how strongly the second meaning relates to
  • connections can be dependant on the direction of the bond, so that, for example, “skier” might imply a strong connection to “skiing”, whereas “skiing
  • FIG. 3 illustrates an example of bond strengths and semantic distances between the concepts illustrated in FIG. 2.
  • FIG. 3 illustrates how distance and closeness of meaning between concepts can be quantified within the semantic space.
  • FIG. 3 also shows semantic distances between concepts. "Ski” and “skiing”
  • the concept processor 130 can update and expand the ontology to include new
  • Big Ski this concept can be added to the ontology by a user.
  • the user can define the relationships of other concepts to “Big Ski” and the strength of these
  • Figure 4 illustrates an example of a concept window 400.
  • the concept window 400 shows an example of a concept definition for "The United States”.
  • the primary term is shown at 402 and is "The United States”.
  • the primary term 402 also indicates the part of speech and provides a context for the
  • the concept window 400 shown further provides a concept details section 401 and a concept relationships section 412.
  • the concept details section 401 can display various details of the concept. For example, at 404 the group of related terms
  • the part of speech is shown at 406 and
  • the refinition box can be used to provide a context for the concept
  • the concept origin is shown at 414.
  • the various relationships for the concept are shown in boxes 416, 418, and 420.
  • box 416 the parents of the concept are displayed and in box 420, children of the concept are displayed.
  • Parent concepts and children concepts are concepts with a hierarchical relationship with the origin
  • concepts having a lateral relationship with the origin concept are displayed
  • concepts having a lateral relationship with the origin concept are displayed
  • a concept view such as a concept creation window
  • a primary term representing the first concept is received
  • at least one related term associated with the primary term is received
  • the relationship can be hierarchical or lateral and can
  • a request to edit a first concept in a network of interrelated concepts is received, the first concept is displayed, at least one
  • the first concept is updated in the network.
  • information defining a part of speech of the first concept is received, and the first concept is updated in the network.
  • information defining a breadth of the first concept information that the first concept is offensive, user data, and context information, such as refinition, can also be
  • FIG. 5 illustrates an exemplary method 500 that provides a method for
  • the method 500 shown in FIG. 5 can be executed or otherwise performed by any of various systems.
  • the method 500 is described below as carried out by the system 100 shown in FIG. 1 by way of
  • a request signal from a user to create a concept term is received by the semantic engine 120.
  • the request signal can be created by a user interface
  • the concept processor 130 can cause, in 504, a
  • concept view such as a concept creation window
  • the user 112a can be prompted for the entry of a variety of information in order to define the concept by the concept creation window.
  • defines the concept can include: a primary term used to identify the concept; a group
  • the information that defines the concept can also include: the part of speech of the concept; the frequency of the primary term; the
  • a concept definition can also include user data and a refinition.
  • a user can input a primary term associated with the
  • the primary term can be the term most
  • a user can input a group of terms related to the primary term that are also associated with the concept in the concept
  • a user 112a can also input concepts related to the created concept and provide
  • relationship types for each related concept can include: "kind of;
  • the user 112a can define the related concept by typing it in a related concept box and then defining the related concept
  • relationship type by choosing a relationship type from a drop down list. For every relationship type, the user 112a can assign a strength to the
  • the strength can be representative of how strong the relationship is.
  • the strength represents the distance in semantic space the created concept is from the related concept.
  • the user 112a can also input other information to define the concept.
  • the part of speech of the concept can be provided.
  • a frequency of the primary term can be provided to represent the frequency of the primary term in
  • a probability can be provided of the likelihood that the primary term and the related terms imply the concept.
  • the user 112a can specify the breadth or focus of the concept and indicate whether the concept is offensive.
  • the user 112a can also specify whether to match terms associated with the concept literally or can define non-literal acronym and abbreviation rules.
  • the user can also provide user data and context for the concept (or refinition).
  • the user 112a After the user 112a has defined the concept, the user 112a can request through
  • the client device 102a that the concept be added to the ontology.
  • This request can cause the data input by the user 112a in the concept creation window to be sent from the client device 102a via the network 106 to the semantic engine 120.
  • the concept processor 130 can create a new concept in the ontology from the data
  • FIG. 6 illustrates an exemplary method 600 that provides a method for editing concepts in an ontology. This exemplary method is provided by way of example, as
  • the method 600 shown in FIG. 6 can be executed or otherwise performed by any of
  • the method 600 is described below as carried out by the system 100 shown in FIG. 1 by way of example, and various elements of the system 100 are referenced in explaining the example method of FIG. 6.
  • a request signal to edit a concept from a user 112a is received by the semantic engine 120.
  • the request signal can be created by a user interface application on client device 102a through input from the user 112a. This request
  • the signal can include the concept to be edited and is passed to server device 104 via the network 106.
  • the concept processor 130 can verify that the concept requested exists and can cause in 604 a concept editor window to be displayed on client device 102a.
  • the concept editor window displays the concept definition that
  • a user can edit the various information used to define a concept as described above.
  • a user can edit the group of terms related to the primary term that are associated with the concept. For example, the user can add additional terms or delete
  • a user can also input additional concepts related to the edited concept, input
  • relationship types for each related concept, and edit existing concepts and relationship types.
  • a user can define and/or edit the relationship type by choosing a relationship type from a drop down list. For the relationship types in the concept definition, the user can assign and edit a strength of the relationship.
  • user 112a can also input and edit other information in order to edit the concept as
  • the user 112a After the user 112a has edited the concept, the user 112a can request through
  • the concept processor 130 can replace the concept in the ontology with the edited concept received from the user

Abstract

Systems and methods that create and edit a network of interconnected concepts are disclosed. In one embodiment, a semantic engine implements a method comprising receiving a primary term representing a first concept to be added to a network of interrelated concepts, receiving at least one related term related to the primary term, receiving at least one relationship between the first concept and a second concept, receiving a strength value associated with the relationship, and adding the first concept to the network.

Description

METHODS AND SYSTEMS FOR EDITING A NETWORK OF INTERCONNECTED CONCEPTS
FIELD OF THE INVENTION
The invention generally relates to a network of interconnected concepts. More particularly, the invention relates to methods and systems for editing a network of interconnected concepts.
BACKGROUND OF THE INVENTION
Improving mechanisms for searching vast numbers of documents, such as those available via the World Wide Web ("web") or on large enterprise computer
networks, has increasingly been an area of focus and research. This is especially due
to the continued growth in the number of computer users, services and applications
offered for example on the web, and the amount of information being added to the web, enterprise computer networks and other databases.
A search engine or search engine program is a widely used mechanism for allowing users to search vast numbers of documents for information. Conventional
search engines index the documents and the terms appearing in the documents. A term can be a string of characters or groups of strings of characters representing a one or more related words. Generally, a search engine provides a user interface that
includes a query field. A user can enter one or more search terms describing the desired information in the query field. In response to a query, the search engine
attempts to locate relevant documents by matching the search terms to the indexed terms appearing in the documents. Based on these matches, the search engine ranks, sorts and then returns for display the search results, usually as a list of relevant
documents. Typically, the concept or concepts represented by the search terms or the terms
in the documents are not taken into account. A concept can be a normalized semantic representation that can be expressed with any number of terms. Methods of complex
searching exist, but most of these methods of complex searching are term based. For example, an advanced Boolean search or an advanced text search are both based on terms and not concepts. With a term search, documents are converted into a list of
terms with relative rankings and the document's terms are compared with the search terms. Term searches can provide poor results when faced with terms having multiple
meanings, the use of synonyms, and important terms that are not used frequently. Further, it is difficult to search for a concept using term-based searching methods.
For example, if it is desired to search all documents that mention the concept "California cities", each city must be enumerated as a search term. In order to provide a concept-based search, a search engine can utilize a
network of concepts. While such networks of concepts exist, these conventional
networks generally associate concepts in a hierarchical fashion and provide no weight to the associations. Further, such conventional networks of concepts are not easily editable.
SUMMARY Embodiments of the present invention comprise methods and systems for editing a network of interconnected concepts. In one embodiment, a semantic engine implements a method comprising receiving a primary term representing a first
concept to be added to a network of interrelated concepts, receiving at least one
related term related to the primary term, receiving at least one relationship between
the first concept and a second concept, receiving a strength value associated with the
relationship, and adding the first concept to the network. In another embodiment, a
semantic engine implements a method comprising receiving a request to edit a first
concept in a network of interrelated concepts, causing the display of the first concept,
receiving a request to add at least one relationship between the first concept and a
second concept, receiving a strength value associated with the relationship, and
updating the first concept in the network. Additional aspects of the present invention
are directed to computer systems and to computer-readable media having features
relating to the foregoing aspects.
Further details and advantages of embodiments of the present invention are set
forth below.
BRIEF DESCRIPTION OF THE DRAWINGS
These and other features, aspects, and advantages of the present invention are
better understood when the following Detailed Description is read with reference to
the accompanying drawings, wherein: FIG. 1 illustrates a block diagram of a system in accordance with one
embodiment of the present invention.
FIG. 2 illustrates an example of a portion of a relationship-based ontology in
accordance with one embodiment ofthe present invention. FIG. 3 illustrates an example of bond strength and semantic distance in a
portion of a relationship-based ontology in accordance with one embodiment of the present invention.
FIG. 4 illustrates an exemplary concept view in accordance with one embodiment of the present invention. FIG. 5 illustrates a flow diagram of a method in accordance with one embodiment ofthe present invention. FIG. 6 illustrates a flow diagram of a method in accordance with one
embodiment ofthe present invention.
DETAILED DESCRIPTION OF SPECIFIC EMBODIMENTS The present invention comprises methods and systems for editing a network of
interconnected concepts. In one embodiment, a semantic engine allows for the creation and editing of concepts in a network of interconnected concepts or ontology.
The concepts can be defined by a synset or a group of related terms, such as synonyms and acronyms, relationships, both hierarchical and lateral, with other
concepts, and the strength of the relationships with other concepts. The semantic engine allows for the creation or editing of a concept by allowing a user to define the related terms, the relationships with other concepts, and the strength of the
relationships.
Reference will now be made in detail to exemplary embodiments of the
invention as illustrated in the text and accompanying drawings. The same reference numbers are used throughout the drawings and the following description to refer to
the same or like parts. Various systems in accordance with the present invention may be constructed. FIG. 1 is a diagram illustrating an exemplary system in which exemplary
embodiments of the present invention may operate. The present invention may operate, and be embodied in, other systems as well. The system 100 shown in FIG. 1 includes multiple client devices 102a-n, a
server device 104 and a network 106. The network 106 shown can be the Internet. In other embodiments, other networks, such as an intranet may be used. Moreover,
methods according to the present invention may operate in a single computer. The client devices 102a-n shown each include a computer-readable medium, such as a random access memory (RAM) 108 in the embodiment shown, coupled to a processor
110. The processor 110 executes a set of computer-executable program instructions
stored in memory 108. Such processors may include a microprocessor, an ASIC, and
state machines. Such processors include, or may be in communication with, media, for example computer-readable media, which stores instructions that, when executed
by the processor, cause the processor to perform the steps described herein. Embodiments of computer-readable media include, but are not limited to, an
electronic, optical, magnetic, or other storage or transmission device capable of
providing a processor, such as the processor in communication with a touch-sensitive input device, with computer-readable instructions. Other examples of suitable media
include, but are not limited to, a floppy disk, CD-ROM, magnetic disk, memory chip, ROM, RAM, an ASIC, a configured processor, all optical media, all magnetic tape or other magnetic media, or any other medium from which a computer processor can
read instructions. Also, various other forms of computer-readable media may transmit or carry instructions to a computer, including a router, private or public network, or other transmission device or channel, both wired and wireless. The instructions may comprise code from any computer-programming language, including, for example, C, C++, C#, Visual Basic, Java, Python, Perl, and JavaScript.
Client devices 102a-n may also include a number of external or internal
devices such as a mouse, a CD-ROM, a keyboard, a display, or other input or output
devices. Examples of client devices 102a-n are personal computers, digital assistants, personal digital assistants, cellular phones, mobile phones, smart phones, pagers, digital tablets, laptop computers, a processor-based device and similar types of systems and devices. In general, a client device 102a-n may be any type of processor-
based platform connected to a network 106 and that interacts with one or more application programs. The client devices 102a-n shown include personal computers
executing a user interface application program and/or a browser application program such as Internet Explorer™, version 6.0 from Microsoft Corporation, Netscape
Navigator™, version 7.1 from Netscape Communications Corporation, and Safari™,
version 1.0 from Apple Computer. Through the client devices 102a-n, users 112a-n can communicate over the network 106 with each other and with other systems and
devices coupled to the network 106. As shown in FIG. 1, server device 104 is also coupled to the network 106.
The server device 104 shown includes a server executing a semantic engine program. The system 100 can also include multiple other server devices. Similar to the client devices 102a-n, the server device 104 shown includes a processor 116 coupled to a
computer readable memory 118. The server device 104 is depicted as a single computer system, but may be implemented as a network of computer processors.
Examples of server device 104 are servers, mainframe computers, networked computers, a processor-based device and similar types of systems and devices. Client processors 110 and server processor 116 can be any of a number of well known computer processors, such as processors from Intel Corporation of Santa Clara, California and Motorola Corporation of Schaumburg, Illinois.
Memory 118 of the server device 104 contains a semantic engine application
program, also known as a semantic engine 120. The semantic engine 120 manages
and provides access to a network of interconnected concepts or ontology. In one embodiment, the semantic engine 120 also can index articles and concepts found in articles to enable the location and retrieval of information. Articles include,
documents, for example, web pages of various formats, such as HTML, XML,
XHTML, Portable Document Format (PDF) files, and word processor, database, and application program document files, audio, video, or any other information of any
type whatsoever made available on a network (such as the Internet or intranet), a personal computer, or other computing or storage means. The embodiments
described herein are described generally in relation to documents, but embodiments
may operate on any type of article. The semantic engine 120 shown includes a concept processor 130, which, in the embodiment shown, comprises computer code residing in the memory 118. The concept processor 130 can edit and create concepts in a network of interconnected concepts based on information received from a client device 102a input by a user 112a. The client processor 130 can have a user interface application that allows user
112a via the client device 102a to interact with the concept processor 130 to edit
existing concepts and create new concepts in the ontology. Server device 104 also provides access to other storage elements, such as a concept storage element, in the example shown a concept database 122. The concept database 120 can be used to store the ontology. The concept database can be located in memory 118 as shown or can be located external to server device 104. Data
storage elements may include any one or combination of methods for storing data, including without limitation, arrays, hash tables, lists, and pairs. Other similar types
of data storage devices can be accessed by the server device 104.
It should be noted that the present invention may comprise systems having
different architecture than that which is shown in FIG. 1. For example, in some
systems according to the present invention, the semantic engine 120 may not be on a single server device 104, but may be on multiple server devices. The system 100 shown in FIG. 1 is merely exemplary, and is used to explain the exemplary methods
shown in FIGs. 5 and 6.
As explained above, the concept database 122 can contain an ontology that can describe a semantic space that can be employed to facilitate concept-based location
and retrieval of information. The ontology consists of a plurality of concepts, which are connected to one another through many different types of relationships, such as,
for example, "kind of, "has kind", "part of, "has part", "member of, "has member", "substance of, "has substance", "product of, and "has product". For example, the concept of "table" is connected to the concept of "furniture" through a
"kind of connection. Thus, "table" is a "kind of "furniture". Similarly, "California" is a part of the "United States", the "United States" has part "California",
the "United States" is a member of the "NATO", and "NATO" has member "United
States". The ontology according to one embodiment of the present invention also
includes bind or lateral bond relationships between concepts. A lateral bond relationship describes one concept's closeness to another in an average person's common understanding. For example, "skier" and "skiing" are not closely related concepts in conventional ontologies. The former is a kind of "athlete", ultimately a
kind of "human being"; and thus would reside within the "entity" or "living thing" tree. The latter is a kind of "sport", ultimately a kind of "activity"; it is in the
"actions" tree. Though the subjects are closely related in everyday usage, they may
be in widely separated locations within a conventional ontology. To remedy this, a bind or lateral bond can be made between the two concepts, to reflect their close
proximity in semantic space (when an average person thinks of one concept, the
person tends to think of the other). The relationships between elements may take on many forms and can become
quite complex, but for ease of illustration, a simple example is illustrated by FIG. 2. The boxed elements in FIG. 2 represent meanings within the ontology and collectively, along with the relationship connections between meanings can be viewed
as defining a semantic space. The relationship types "part of, "kind of and "lateral bond" are represented by differing line types in FIG. 2, a legend for which is drawn
thereon. The example in FIG. 2 concerns skiing. Starting with the branch for "sport",
"skiing" is defined in the ontology 200 as a kind of "sport". The word "ski" typically, in its noun form, can be thought of as related to "skiing" in that it is a "part of "skiing" as shown in FIG. 2. "Slalom skiing" is a type of skiing and hence a "kind of connection is shown between it and "skiing". "Bindings" are a structural attachment
on a ski, and hence it is assigned a "part of connection with "ski". The example of a
specific brand of ski, "K2 ski," is given to show how it is in a "kind of connection
with "ski". "K2 ski" is also assigned a lateral bond showing a conceptual commonness with the manufacturer of the ski "K2" which lies in the "company" branch. The
lateral bond can be, for example, "product of. The company branch has a child
"athletic equipment company" as a "kind of "company." "Athletic equipment
company" has as its child in turn the "K2" company. Considering "ski" once again, "ski" is also a child of the "equipment" branch
which has "athletic equipment" as a "kind of "equipment" and ski as a "kind of
"athletic equipment". "Surfboard" is related to "ski" in that it too is a "kind of
"athletic equipment". Target documents or nodes within a subject directory may be "placed" or "located" by human intervention or automatically into the semantic space as defined by ontology 200. A document containing information concerning the sale
of skis or skiing destinations would fall somewhere within the defined semantic space based upon its focus of content. The ontology according to one embodiment of the present invention also
includes bond strength between concepts. A value can be assigned to a connection from one concept to another that signifies how strongly the second meaning relates to
the first. These connections can be dependant on the direction of the bond, so that, for example, "skier" might imply a strong connection to "skiing", whereas "skiing
need not imply "skier" to the same degree.
FIG. 3 illustrates an example of bond strengths and semantic distances between the concepts illustrated in FIG. 2. FIG. 3 illustrates how distance and closeness of meaning between concepts can be quantified within the semantic space.
Distances are shown between the element "ski" and all other elements within the
semantic space. Using three classes of bond strengths the degree of closeness
between concepts may be discovered. A "strong relationship" exists between "ski" and "skiing" as does between "ski" and "athletic equipment." Between "skiing" and "sport" there is a weaker than strong relationship known as a "medium relationship".
This is because when the average person thinks of the root term "skiing" they do not typically think also of "sport". Going from "ski" to "skiing" however, the average
person would more likely associate or think "skiing" if given the term "ski". The
direction of the arrows in the bond strengths indicates the direction of association. For example, "A→B" in FIG. 3 means that given the concept A, how likely is it or
closely would an average person associate the concept B. Going the other direction
between the same two concepts may produce a different bond strength.
A "weak relationship" is displayed between "ski" and "K2 ski" (when an average person thinks of "ski," "K2 ski" does not closely come to mind). However, if an average person were to go from "K2 ski" to "ski" this might be construed as a
strong relationship since they would naturally associate "ski" if given "K2 ski".
FIG. 3 also shows semantic distances between concepts. "Ski" and "skiing"
have only a distance of two between them while "skiing" and "sport" have a distance
of five (seven minus two). The distance between "ski" and "sport" is seven. When
traveling from parent to child or vice-versa, the distances can be simply
added/subtracted but when changing the direction of travel, a penalty may be imposed
upon the distance calculation. Take, for example, the distance between "ski" and
"athletic equipment company". Judging merely on a linear basis, the distance might
be twelve. But since the path from "ski" to "athletic equipment" switches direction
twice (it starts down to "K2 ski" and then across the lateral bond to "K2" and then up
to "athletic equipment company") a penalty or scaling factor can cause the distance
between "ski" and "athletic equipment" to be much larger than just twelve especially
given their lack of connectedness. As described above, penalties may be added when
the direction of traversal is switched or when a lateral bond is crossed. Distances
between concepts may be calculated and stored for future use. Semantic distances
can be determined in a variety of ways, such as, for example, as described in U. S.
Patent No. 6,453,315, which is hereby incorporated by reference.
The concept processor 130 can update and expand the ontology to include new
concepts or update relationships and other information for concepts already present.
Concepts can be updated or added to reflect the changes in everyday culture or for a
particular enterprise. For example, if a new ski manufacturer comes into existence
called "Big Ski" this concept can be added to the ontology by a user. The user can define the relationships of other concepts to "Big Ski" and the strength of these
relationships as well as other data that defines the concept. Concepts can be presented to a user 112a on a client device 102a in a concept
view, such as a concept window. Figure 4 illustrates an example of a concept window 400. The concept window 400 shows an example of a concept definition for "The United States". The primary term is shown at 402 and is "The United States".
The primary term 402 also indicates the part of speech and provides a context for the
concept (or refinition). The concept window 400 shown further provides a concept details section 401 and a concept relationships section 412. The concept details section 401 can display various details of the concept. For example, at 404 the group of related terms
associated with the concept is shown. The part of speech is shown at 406 and
indicates that the concept shown is a noun. Refinition is showed at 408 and user data is shown at 410. The refinition box can be used to provide a context for the concept
and the user data box can be used to further describe the concept. The relationships for the concept are shown in the relationship section 412.
The concept origin is shown at 414. The various relationships for the concept are shown in boxes 416, 418, and 420. In box 416, the parents of the concept are displayed and in box 420, children of the concept are displayed. Parent concepts and children concepts are concepts with a hierarchical relationship with the origin
concept. In box 418, concepts having a lateral relationship with the origin concept are displayed Various methods in accordance with the present invention may be carried out. For example, in one embodiment a user request to create a first concept in a network of interrelated concepts is received, a concept view, such as a concept creation window, is displayed, a primary term representing the first concept is received, at least one related term associated with the primary term is received, at least one
relationship between the first concept and a second concept is received, a strength value associated with the relationship is received, and the first concept is added to the
network. In one embodiment, the relationship can be hierarchical or lateral and can
be selected from one of the following: kind of, has kind, part of, has part, member of,
has member, substance of, has substance, product of, and has product.
In another embodiment, a request to edit a first concept in a network of interrelated concepts is received, the first concept is displayed, at least one
relationship between the first concept and a second concept is received, a strength value associated with the relationship is received, an edited strength value for an
existing relationship between the first concept and another concept is received, and the first concept is updated in the network. In one embodiment, information defining a part of speech of the first concept,
information defining a frequency of the primary term, information defining a
likelihood that the primary term and the related terms imply the first concept,
information defining a breadth of the first concept, information that the first concept is offensive, user data, and context information, such as refinition, can also be
received in order to create or edit the first concept. FIG. 5 illustrates an exemplary method 500 that provides a method for
creating concepts in a network of interconnected concepts or ontology. This
exemplary method is provided by way of example, as there are a variety of ways to
carry out methods according to the present invention. The method 500 shown in FIG. 5 can be executed or otherwise performed by any of various systems. The method 500 is described below as carried out by the system 100 shown in FIG. 1 by way of
example, and various elements of the system 100 are referenced in explaining the
example method of FIG. 5. In 502, a request signal from a user to create a concept term is received by the semantic engine 120. The request signal can be created by a user interface
application on client device 102a through input from the user 112a and passed to the
server device 104 via the network 106.
In response to the signal, the concept processor 130 can cause, in 504, a
concept view, such as a concept creation window, to be displayed on the client device 102a. The user 112a can be prompted for the entry of a variety of information in order to define the concept by the concept creation window. The information that
defines the concept can include: a primary term used to identify the concept; a group
of terms related to the primary term, such as synonyms and acronyms; related concepts and the relationship of the created concept to the related concepts; and the
strength of these relationships. The information that defines the concept can also include: the part of speech of the concept; the frequency of the primary term; the
likelihood that the primary term and the related terms imply the concept; the breadth or focus of the concept; the offensive nature of the concept; and the literal nature of
the primary term. A concept definition can also include user data and a refinition. In defining a new concept, a user can input a primary term associated with the
concept in the concept creation window. The primary term can be the term most
frequently associated with the concept. Next, a user can input a group of terms related to the primary term that are also associated with the concept in the concept
creation window. For example, the related terms can be synonyms or acronyms of the primary term. A user 112a can also input concepts related to the created concept and provide
relationship types for each related concept. The relationships can include: "kind of;
"has kind"; "part of; "has part"; "member of; "has member"; "substance of; "has substance"; "product of; and "has product". Relationship types can also be
specialized or customized, such as "causes symptom" and "symptom caused by" for a pharmaceutical or medical concept. The relationship type can also be "single
concept" for concepts unrelated to others. In one embodiment, the user 112a can define the related concept by typing it in a related concept box and then defining the
relationship type by choosing a relationship type from a drop down list. For every relationship type, the user 112a can assign a strength to the
relationship. The strength can be representative of how strong the relationship is. In
one embodiment, the strength represents the distance in semantic space the created concept is from the related concept. The user 112a can also input other information to define the concept. For example, the part of speech of the concept can be provided. A frequency of the primary term can be provided to represent the frequency of the primary term in
normal usage or in the source documents. A probability can be provided of the likelihood that the primary term and the related terms imply the concept. The user
can specify the breadth or focus of the concept and indicate whether the concept is offensive. The user 112a can also specify whether to match terms associated with the concept literally or can define non-literal acronym and abbreviation rules. The user can also provide user data and context for the concept (or refinition).
After the user 112a has defined the concept, the user 112a can request through
the client device 102a that the concept be added to the ontology. This request can cause the data input by the user 112a in the concept creation window to be sent from the client device 102a via the network 106 to the semantic engine 120. In 508, the concept processor 130 can create a new concept in the ontology from the data
received from the client device 102a and can add the concept to the ontology in the
concept database 122. FIG. 6 illustrates an exemplary method 600 that provides a method for editing concepts in an ontology. This exemplary method is provided by way of example, as
there are a variety of ways to carry out methods according to the present invention.
The method 600 shown in FIG. 6 can be executed or otherwise performed by any of
various systems. The method 600 is described below as carried out by the system 100 shown in FIG. 1 by way of example, and various elements of the system 100 are referenced in explaining the example method of FIG. 6.
In 602, a request signal to edit a concept from a user 112a is received by the semantic engine 120. The request signal can be created by a user interface application on client device 102a through input from the user 112a. This request
signal can include the concept to be edited and is passed to server device 104 via the network 106. In response to the signal, the concept processor 130 can verify that the concept requested exists and can cause in 604 a concept editor window to be displayed on client device 102a. The concept editor window displays the concept definition that
the user has requested to edit. A user can edit the various information used to define a concept as described above.
A user can edit the group of terms related to the primary term that are associated with the concept. For example, the user can add additional terms or delete
related terms, such as synonyms or acronyms, of the primary term. A user can also input additional concepts related to the edited concept, input
relationship types for each related concept, and edit existing concepts and relationship types. In one embodiment, a user can define and/or edit the relationship type by choosing a relationship type from a drop down list. For the relationship types in the concept definition, the user can assign and edit a strength of the relationship. The
user 112a can also input and edit other information in order to edit the concept as
described above. After the user 112a has edited the concept, the user 112a can request through
the client device 102a that the edited concept be added to the ontology. This request
can cause the data input by the user in the concept edit window to be sent from the client device 102a via the network 106 to the semantic engine 120. In 606, the edited concept is received by the semantic engine 120. In 608, the concept processor 130 can replace the concept in the ontology with the edited concept received from the user
112a or can add the edited data received from the user 112a to the concept in the ontology.
While the above description contains many specifics, these specifics should
not be construed as limitations on the scope of the invention, but merely as exemplifications of the disclosed embodiments. Those skilled in the art will envision many other possible variations that are within the scope ofthe invention.

Claims

1. A method, comprising: receiving a primary term representing a first concept to be added to a network of interrelated concepts; receiving at least one related term associated with the primary term; receiving at least one relationship between the first concept and a second
concept; receiving a strength value associated with the relationship; and adding the first concept to the network.
2. The method of claim 1 , further comprising: receiving a user request to create the first concept; and causing a display of a concept view.
3. The method of claim 1, further comprising receiving information defining a
part of speech of the first concept.
4. The method of claim 1 , wherein the relationship can be hierarchical or lateral.
5. The method of claim 1, wherein the relationship is selected from one of the
following: kind of, has kind, part of, has part, member of, has member, substance of, has substance, product of, and has product.
6. The method of claim 1 , further comprising receiving information defining a
frequency of the primary term.
7. The method of claim 1, further comprising receiving information defining a
likelihood that the primary term and the related terms imply the first concept.
8. The method of claim 1, further comprising receiving information defining a
breadth ofthe first concept.
9. The method of claim 1, further comprising receiving information that the first
concept is offensive.
10. The method of claim 1 , further comprising receiving user data.
11. The method of claim 1 , further comprising receiving context information.
12. A method, comprising: receiving a request to edit a first concept in a network of interrelated concepts; causing a display of the first concept; receiving at least one relationship between the first concept and a second
concept; receiving a strength value associated with the relationship; and updating the first concept in the network.
13. The method of claim 12, further comprising receiving an edited strength value for an existing relationship between the first concept and another concept.
14. The method of claim 12, further comprising receiving an edited relationship for an existing relationship between the first concept and another concept.
15. The method of claim 12, wherein the relationship can be hierarchical or lateral.
16. The method of claim 12, wherein the relationship is selected from one of the
following: kind of, has kind, part of, has part, member of, has member, substance of,
has substance, product of, and has product.
17. The method of claim 12, further comprising receiving information editing a
frequency of the primary term.
18. The method of claim 12, further comprising receiving information editing a
likelihood that a primary term and related terms imply the first concept.
19. The method of claim 12, further comprising receiving information editing a
breadth of the first concept.
20. A computer-readable medium containing program code, comprising: program code for receiving a primary term representing a first concept to be added to a network of interrelated concepts; program code for receiving at least one related term associated with the primary term; program code for receiving at least one relationship between the first concept
and a second concept; program code for receiving a strength value associated with the relationship; and program code for adding the first concept to the network.
21. The computer-readable medium of claim 20, further comprising: program code for receiving a user request to create the first concept; and program code for causing a display of a concept view.
22. The computer-readable medium of claim 20, further comprising program code
for receiving information defining a part of speech ofthe first concept.
23. The computer-readable medium of claim 20, wherein the relationship can be hierarchical or lateral.
24. The computer-readable medium of claim 20, wherein the relationship is
selected from one of the following: kind of, has kind, part of, has part, member of, has member, substance of, has substance, product of, and has product.
25. The computer-readable medium of claim 20, further comprising program code for receiving information defining a frequency ofthe primary term.
26. The computer-readable medium of claim 20, further comprising program code for receiving information defining a likelihood that the primary term and the related terms imply the first concept.
27. The computer-readable medium of claim 20, further comprising program code for receiving information defining a breadth of the first concept.
28. The computer-readable medium of claim 20, further comprising program code
for receiving information that the first concept is offensive.
29. The computer-readable medium of claim 20, further comprising program code for receiving user data.
30. The computer-readable medium of claim 20, further comprising program code for receiving refinition information.
31. A computer-readable medium containing program code, comprising: program code for receiving a request to edit a first concept in a network of interrelated concepts; program code for causing a display of the first concept; program code for receiving a request to add at least one relationship between
the first concept and a second concept; program code for receiving a strength value associated with the relationship;
and program code for updating the first concept in the network.
32. The computer-readable medium of claim 31 , further comprising program code
for receiving an edited strength value for an existing relationship between the first
concept and another concept.
33. The computer-readable medium of claim 31 , further comprising program code
for receiving an edited relationship for an existing relationship between the first
concept and another concept.
34. The computer-readable medium of claim 31, wherein the relationship can be
hierarchical or lateral.
35. The computer-readable medium of claim 31, wherein the relationship is
selected from one of the following: kind of, has kind, part of, has part, member of, has
member, substance of, has substance, product of, and has product.
36. The computer-readable medium of claim 31, further comprising program code for receiving information editing a frequency of the primary term.
37. The computer-readable medium of claim 31 , further comprising program code for receiving information editing a likelihood that a primary term and related terms imply the first concept.
38. The computer-readable medium of claim 31, further comprising program code
for receiving information editing a breadth of the first concept.
PCT/US2004/023828 2003-07-30 2004-07-23 Methods and systems for editing a network of interconnected concepts WO2005013151A1 (en)

Applications Claiming Priority (4)

Application Number Priority Date Filing Date Title
US49142203P 2003-07-30 2003-07-30
US60/491,422 2003-07-30
US10/748,399 US8051104B2 (en) 1999-09-22 2003-12-30 Editing a network of interconnected concepts
US10/748,399 2003-12-30

Publications (1)

Publication Number Publication Date
WO2005013151A1 true WO2005013151A1 (en) 2005-02-10

Family

ID=34118864

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/US2004/023828 WO2005013151A1 (en) 2003-07-30 2004-07-23 Methods and systems for editing a network of interconnected concepts

Country Status (2)

Country Link
US (3) US8051104B2 (en)
WO (1) WO2005013151A1 (en)

Cited By (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
WO2009017941A1 (en) * 2007-07-31 2009-02-05 Yahoo! Inc. System and method for determining semantically related terms
US7548929B2 (en) 2005-07-29 2009-06-16 Yahoo! Inc. System and method for determining semantically related terms
US8433671B2 (en) 1999-09-22 2013-04-30 Google Inc. Determining a meaning of a knowledge item using document based information
US8521731B2 (en) 2008-07-09 2013-08-27 Yahoo! Inc. Systems and methods for query expansion in sponsored search

Families Citing this family (51)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US8051104B2 (en) 1999-09-22 2011-11-01 Google Inc. Editing a network of interconnected concepts
US8195693B2 (en) 2004-12-16 2012-06-05 International Business Machines Corporation Automatic composition of services through semantic attribute matching
US20060253476A1 (en) * 2005-05-09 2006-11-09 Roth Mary A Technique for relationship discovery in schemas using semantic name indexing
US20070248026A1 (en) * 2006-04-19 2007-10-25 Electronic Data Systems Corporation System, Method, and Computer Program Product for Concept Network Based Collaboration
US7831472B2 (en) 2006-08-22 2010-11-09 Yufik Yan M Methods and system for search engine revenue maximization in internet advertising
US9892111B2 (en) 2006-10-10 2018-02-13 Abbyy Production Llc Method and device to estimate similarity between documents having multiple segments
US9069750B2 (en) 2006-10-10 2015-06-30 Abbyy Infopoisk Llc Method and system for semantic searching of natural language texts
US9098489B2 (en) 2006-10-10 2015-08-04 Abbyy Infopoisk Llc Method and system for semantic searching
US9495358B2 (en) 2006-10-10 2016-11-15 Abbyy Infopoisk Llc Cross-language text clustering
US9075864B2 (en) 2006-10-10 2015-07-07 Abbyy Infopoisk Llc Method and system for semantic searching using syntactic and semantic analysis
US9189482B2 (en) 2012-10-10 2015-11-17 Abbyy Infopoisk Llc Similar document search
US20080242419A1 (en) * 2007-03-29 2008-10-02 Nokia Corporation Server method for supporting a game
EP2143011A4 (en) * 2007-03-30 2012-06-27 Knewco Inc Data structure, system and method for knowledge navigation and discovery
JP5295605B2 (en) * 2008-03-27 2013-09-18 株式会社東芝 SEARCH KEYWORD IMPROVING DEVICE, SERVER DEVICE, AND METHOD
US8150859B2 (en) * 2010-02-05 2012-04-03 Microsoft Corporation Semantic table of contents for search results
US8903794B2 (en) * 2010-02-05 2014-12-02 Microsoft Corporation Generating and presenting lateral concepts
US8260664B2 (en) * 2010-02-05 2012-09-04 Microsoft Corporation Semantic advertising selection from lateral concepts and topics
US8983989B2 (en) 2010-02-05 2015-03-17 Microsoft Technology Licensing, Llc Contextual queries
US8775426B2 (en) * 2010-09-14 2014-07-08 Microsoft Corporation Interface to navigate and search a concept hierarchy
US8560494B1 (en) * 2011-09-30 2013-10-15 Palantir Technologies, Inc. Visual data importer
US20130204408A1 (en) * 2012-02-06 2013-08-08 Honeywell International Inc. System for controlling home automation system using body movements
US9471606B1 (en) 2012-06-25 2016-10-18 Google Inc. Obtaining information to provide to users
US9390174B2 (en) 2012-08-08 2016-07-12 Google Inc. Search result ranking and presentation
CN103838728B (en) * 2012-11-21 2018-01-09 腾讯科技(深圳)有限公司 The processing method and browser of info web
US9256682B1 (en) 2012-12-05 2016-02-09 Google Inc. Providing search results based on sorted properties
US9278255B2 (en) 2012-12-09 2016-03-08 Arris Enterprises, Inc. System and method for activity recognition
US10212986B2 (en) 2012-12-09 2019-02-26 Arris Enterprises Llc System, apparel, and method for identifying performance of workout routines
US8935305B2 (en) 2012-12-20 2015-01-13 General Instrument Corporation Sequential semantic representations for media curation
US9576007B1 (en) 2012-12-21 2017-02-21 Google Inc. Index and query serving for low latency search of large graphs
US9224103B1 (en) 2013-03-13 2015-12-29 Google Inc. Automatic annotation for training and evaluation of semantic analysis engines
US10810193B1 (en) 2013-03-13 2020-10-20 Google Llc Querying a data graph using natural language queries
US10713261B2 (en) 2013-03-13 2020-07-14 Google Llc Generating insightful connections between graph entities
US10108700B2 (en) 2013-03-15 2018-10-23 Google Llc Question answering to populate knowledge base
US10055462B2 (en) 2013-03-15 2018-08-21 Google Llc Providing search results using augmented search queries
US9477759B2 (en) 2013-03-15 2016-10-25 Google Inc. Question answering using entity references in unstructured data
US9235653B2 (en) 2013-06-26 2016-01-12 Google Inc. Discovering entity actions for an entity graph
US9256687B2 (en) 2013-06-28 2016-02-09 International Business Machines Corporation Augmenting search results with interactive search matrix
US20150066506A1 (en) 2013-08-30 2015-03-05 Verint Systems Ltd. System and Method of Text Zoning
CN105706078B (en) 2013-10-09 2021-08-03 谷歌有限责任公司 Automatic definition of entity collections
US9659056B1 (en) 2013-12-30 2017-05-23 Google Inc. Providing an explanation of a missing fact estimate
US9336220B2 (en) * 2014-05-05 2016-05-10 Empire Technology Development Llc Ontology-based data access monitoring
US11030406B2 (en) * 2015-01-27 2021-06-08 Verint Systems Ltd. Ontology expansion using entity-association rules and abstract relations
US10691214B2 (en) 2015-10-12 2020-06-23 Honeywell International Inc. Gesture control of building automation system components during installation and/or maintenance
US10878191B2 (en) * 2016-05-10 2020-12-29 Nuance Communications, Inc. Iterative ontology discovery
CN108874819B (en) * 2017-05-11 2021-09-03 上海醇聚信息科技有限公司 Data mining method for database
US10909210B2 (en) * 2018-03-22 2021-02-02 Ovh Method and system for defining a web site development strategy
EP3870203A4 (en) 2018-10-22 2022-07-20 William D. Carlson Therapeutic combinations of tdfrps and additional agents and methods of use
US11042601B2 (en) 2018-11-15 2021-06-22 Ovh Method for attracting users to a web page and server implementing the method
US11769012B2 (en) 2019-03-27 2023-09-26 Verint Americas Inc. Automated system and method to prioritize language model and ontology expansion and pruning
US11526565B2 (en) 2019-04-05 2022-12-13 Ovh Method of and system for clustering search queries
US11875130B1 (en) * 2023-07-25 2024-01-16 Intuit Inc. Confidence generation for managing a generative artificial intelligence model

Citations (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US5404506A (en) * 1985-03-27 1995-04-04 Hitachi, Ltd. Knowledge based information retrieval system
US6453315B1 (en) * 1999-09-22 2002-09-17 Applied Semantics, Inc. Meaning-based information organization and retrieval

Family Cites Families (64)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US4839853A (en) 1988-09-15 1989-06-13 Bell Communications Research, Inc. Computer information retrieval using latent semantic structure
US5325298A (en) 1990-11-07 1994-06-28 Hnc, Inc. Methods for generating or revising context vectors for a plurality of word stems
US5659724A (en) 1992-11-06 1997-08-19 Ncr Interactive data analysis apparatus employing a knowledge base
US5619709A (en) 1993-09-20 1997-04-08 Hnc, Inc. System and method of context vector generation and retrieval
US5499360A (en) 1994-02-28 1996-03-12 Panasonic Technolgies, Inc. Method for proximity searching with range testing and range adjustment
US5887120A (en) 1995-05-31 1999-03-23 Oracle Corporation Method and apparatus for determining theme for discourse
US5694523A (en) 1995-05-31 1997-12-02 Oracle Corporation Content processing system for discourse
US5768580A (en) 1995-05-31 1998-06-16 Oracle Corporation Methods and apparatus for dynamic classification of discourse
US6061675A (en) 1995-05-31 2000-05-09 Oracle Corporation Methods and apparatus for classifying terminology utilizing a knowledge catalog
US5708822A (en) 1995-05-31 1998-01-13 Oracle Corporation Methods and apparatus for thematic parsing of discourse
US5724571A (en) 1995-07-07 1998-03-03 Sun Microsystems, Inc. Method and apparatus for generating query responses in a computer-based document retrieval system
US6067552A (en) 1995-08-21 2000-05-23 Cnet, Inc. User interface system and method for browsing a hypertext database
US6112201A (en) 1995-08-29 2000-08-29 Oracle Corporation Virtual bookshelf
US5778367A (en) 1995-12-14 1998-07-07 Network Engineering Software, Inc. Automated on-line information service and directory, particularly for the world wide web
US5778364A (en) 1996-01-02 1998-07-07 Verity, Inc. Evaluation of content of a data set using multiple and/or complex queries
US5987404A (en) 1996-01-29 1999-11-16 International Business Machines Corporation Statistical natural language understanding using hidden clumpings
US5848396A (en) 1996-04-26 1998-12-08 Freedom Of Information, Inc. Method and apparatus for determining behavioral profile of a computer user
US6101515A (en) * 1996-05-31 2000-08-08 Oracle Corporation Learning system for classification of terminology
US5918236A (en) 1996-06-28 1999-06-29 Oracle Corporation Point of view gists and generic gists in a document browsing system
US5778363A (en) 1996-12-30 1998-07-07 Intel Corporation Method for measuring thresholded relevance of a document to a specified topic
US5960383A (en) 1997-02-25 1999-09-28 Digital Equipment Corporation Extraction of key sections from texts using automatic indexing techniques
US6119164A (en) 1997-04-15 2000-09-12 Full Circle Software, Inc. Method and apparatus for distributing over a network unsolicited information to a targeted audience
US5878223A (en) 1997-05-07 1999-03-02 International Business Machines Corporation System and method for predictive caching of information pages
US6460034B1 (en) 1997-05-21 2002-10-01 Oracle Corporation Document knowledge base research and retrieval system
US6038560A (en) * 1997-05-21 2000-03-14 Oracle Corporation Concept knowledge base search and retrieval system
US5940821A (en) 1997-05-21 1999-08-17 Oracle Corporation Information presentation in a knowledge base search and retrieval system
US5930788A (en) 1997-07-17 1999-07-27 Oracle Corporation Disambiguation of themes in a document classification system
US5933822A (en) 1997-07-22 1999-08-03 Microsoft Corporation Apparatus and methods for an information retrieval system that employs natural language processing of search results to improve overall precision
US5845278A (en) 1997-09-12 1998-12-01 Inioseek Corporation Method for automatically selecting collections to search in full text searches
US5974412A (en) 1997-09-24 1999-10-26 Sapient Health Network Intelligent query system for automatically indexing information in a database and automatically categorizing users
US5953718A (en) 1997-11-12 1999-09-14 Oracle Corporation Research mode for a knowledge base search and retrieval system
US6134532A (en) 1997-11-14 2000-10-17 Aptex Software, Inc. System and method for optimal adaptive matching of users to most relevant entity and information in real-time
US6289342B1 (en) 1998-01-05 2001-09-11 Nec Research Institute, Inc. Autonomous citation indexing and literature browsing using citation context
US6421675B1 (en) 1998-03-16 2002-07-16 S. L. I. Systems, Inc. Search engine
US6044375A (en) 1998-04-30 2000-03-28 Hewlett-Packard Company Automatic extraction of metadata using a neural network
US6324519B1 (en) 1999-03-12 2001-11-27 Expanse Networks, Inc. Advertisement auction system
US6298348B1 (en) 1998-12-03 2001-10-02 Expanse Networks, Inc. Consumer profiling system
IT1303603B1 (en) 1998-12-16 2000-11-14 Giovanni Sacco DYNAMIC TAXONOMY PROCEDURE FOR FINDING INFORMATION ON LARGE HETEROGENEOUS DATABASES.
US6484161B1 (en) 1999-03-31 2002-11-19 Verizon Laboratories Inc. Method and system for performing online data queries in a distributed computer system
US6473730B1 (en) 1999-04-12 2002-10-29 The Trustees Of Columbia University In The City Of New York Method and system for topical segmentation, segment significance and segment function
US7065500B2 (en) 1999-05-28 2006-06-20 Overture Services, Inc. Automatic advertiser notification for a system for providing place and price protection in a search result list generated by a computer network search engine
US6269361B1 (en) 1999-05-28 2001-07-31 Goto.Com System and method for influencing a position on a search result list generated by a computer network search engine
US6314419B1 (en) 1999-06-04 2001-11-06 Oracle Corporation Methods and apparatus for generating query feedback based on co-occurrence patterns
US6711585B1 (en) 1999-06-15 2004-03-23 Kanisa Inc. System and method for implementing a knowledge management system
WO2000079436A2 (en) 1999-06-24 2000-12-28 Simpli.Com Search engine interface
US8051104B2 (en) 1999-09-22 2011-11-01 Google Inc. Editing a network of interconnected concepts
US6816857B1 (en) 1999-11-01 2004-11-09 Applied Semantics, Inc. Meaning-based advertising and document relevance determination
US6651058B1 (en) 1999-11-15 2003-11-18 International Business Machines Corporation System and method of automatic discovery of terms in a document that are relevant to a given target topic
US6691108B2 (en) 1999-12-14 2004-02-10 Nec Corporation Focused search engine and method
EP1275042A2 (en) 2000-03-06 2003-01-15 Kanisa Inc. A system and method for providing an intelligent multi-step dialog with a user
US20010044837A1 (en) 2000-03-30 2001-11-22 Iqbal Talib Methods and systems for searching an information directory
GB2362971B (en) 2000-05-30 2004-03-24 Com Nation Ltd A method of searching the internet and an internet search engine
KR20010113258A (en) 2000-06-19 2001-12-28 이교용 System and method of allocating profits for digital content providers
GB0016974D0 (en) 2000-07-12 2000-08-30 Univ Salford The Document retrieval system
US20020133392A1 (en) 2001-02-22 2002-09-19 Angel Mark A. Distributed customer relationship management systems and methods
US7376620B2 (en) 2001-07-23 2008-05-20 Consona Crm Inc. System and method for measuring the quality of information retrieval
US20030084066A1 (en) 2001-10-31 2003-05-01 Waterman Scott A. Device and method for assisting knowledge engineer in associating intelligence with content
US7206778B2 (en) 2001-12-17 2007-04-17 Knova Software Inc. Text search ordered along one or more dimensions
US6978264B2 (en) 2002-01-03 2005-12-20 Microsoft Corporation System and method for performing a search and a browse on a query
US7024624B2 (en) 2002-01-07 2006-04-04 Kenneth James Hintz Lexicon-based new idea detector
US7533090B2 (en) 2004-03-30 2009-05-12 Google Inc. System and method for rating electronic documents
US9002783B2 (en) 2004-09-17 2015-04-07 Go Daddy Operating Company, LLC Web page customization based on expertise level of a user
US20060069613A1 (en) 2004-09-29 2006-03-30 Microsoft Corporation System for partial automation of content review of network advertisements
US20060259357A1 (en) 2005-05-12 2006-11-16 Fu-Sheng Chiu Intelligent dynamic market data collection and advertising delivery system

Patent Citations (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US5404506A (en) * 1985-03-27 1995-04-04 Hitachi, Ltd. Knowledge based information retrieval system
US6453315B1 (en) * 1999-09-22 2002-09-17 Applied Semantics, Inc. Meaning-based information organization and retrieval

Non-Patent Citations (3)

* Cited by examiner, † Cited by third party
Title
APTED T ET AL: "Visualisation of ontological inferences for user control of personal web agents", CONFERENCE PROCEEDINGS ARTICLE, 16 July 2003 (2003-07-16), pages 306 - 311, XP010648516 *
KERSCHBERG L ET AL: "A semantic taxonomy-based personalizable meta-search agent", CONFERENCE PROCEEDINGS ARTICLE, vol. 1, 3 December 2001 (2001-12-03), pages 41 - 50, XP010589037 *
VOSS A ET AL: "Concepts as knowledge handles in collaborative document management", ENABLING TECHNOLOGIES: INFRASTRUCTURE FOR COLLABORATIVE ENTERPRISES, 1999. (WET ICE '99). PROCEEDINGS. IEEE 8TH INTERNATIONAL WORKSHOPS ON STANFORD, CA, USA 16-18 JUNE 1999, LOS ALAMITOS, CA, USA,IEEE COMPUT. SOC, US, 16 June 1999 (1999-06-16), pages 245 - 252, XP010358658, ISBN: 0-7695-0365-9 *

Cited By (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US8433671B2 (en) 1999-09-22 2013-04-30 Google Inc. Determining a meaning of a knowledge item using document based information
US9811776B2 (en) 1999-09-22 2017-11-07 Google Inc. Determining a meaning of a knowledge item using document-based information
US7548929B2 (en) 2005-07-29 2009-06-16 Yahoo! Inc. System and method for determining semantically related terms
WO2009017941A1 (en) * 2007-07-31 2009-02-05 Yahoo! Inc. System and method for determining semantically related terms
US8521731B2 (en) 2008-07-09 2013-08-27 Yahoo! Inc. Systems and methods for query expansion in sponsored search

Also Published As

Publication number Publication date
US20130232145A1 (en) 2013-09-05
US9268839B2 (en) 2016-02-23
US8051104B2 (en) 2011-11-01
US20040236737A1 (en) 2004-11-25
US8661060B2 (en) 2014-02-25
US20120041959A1 (en) 2012-02-16

Similar Documents

Publication Publication Date Title
US9268839B2 (en) Methods and systems for editing a network of interconnected concepts
US10496683B2 (en) Automatically linking text to concepts in a knowledge base
AU2011201984B2 (en) Methods and systems for understanding a meaning of a knowledge item using information associated with the knowledge item
US7739277B2 (en) System and method for incorporating anchor text into ranking search results
CN100573513C (en) Be used to arrange the document of Search Results to improve the method and system of diversity and abundant information degree
Diligenti et al. A unified probabilistic framework for web page scoring systems
US8914361B2 (en) Methods and systems for determining a meaning of a document to match the document to content
US20040039734A1 (en) Apparatus and method for region sensitive dynamically configurable document relevance ranking
US9773054B2 (en) Inverted table for storing and querying conceptual indices
US20160012057A1 (en) Computing the relevance of a document to concepts not specified in the document
US20020133483A1 (en) Systems and methods for computer based searching for relevant texts
US20050050459A1 (en) Automatic partition method and apparatus for structured document information blocks
US7801898B1 (en) Methods and systems for compressing indices
WO2016009321A1 (en) System for searching, recommending, and exploring documents through conceptual associations and inverted table for storing and querying conceptual indices
Candan et al. Reasoning for web document associations and its applications in site map construction
Berger et al. Augmenting a characterization network with semantic information
Cronin Annual review of information science and technology
Kong et al. Integrating XLink and XPath to Retrieve Structured Multimedia Documents in Digital Libraries.
Kilfeather et al. An ontological application for archaeological narratives
Barbu et al. Information Filtering Using the Dynamics of the User Profile.
Doan-Nguyen et al. Using semantic information to improve the performance of a restricted-domain question-answering system
Malakar iAGENT: A Novel and Intelligent Assistant to Personalised Search
McAllister FP-Tree motivated system for information retrieval using an abstraction path-based inverted index
John One night only</title

Legal Events

Date Code Title Description
AK Designated states

Kind code of ref document: A1

Designated state(s): AE AG AL AM AT AU AZ BA BB BG BR BW BY BZ CA CH CN CO CR CU CZ DE DK DM DZ EC EE EG ES FI GB GD GE GH GM HR HU ID IL IN IS JP KE KG KP KR KZ LC LK LR LS LT LU LV MA MD MG MK MN MW MX MZ NA NI NO NZ OM PG PH PL PT RO RU SC SD SE SG SK SL SY TJ TM TN TR TT TZ UA UG US UZ VC VN YU ZA ZM ZW

AL Designated countries for regional patents

Kind code of ref document: A1

Designated state(s): BW GH GM KE LS MW MZ NA SD SL SZ TZ UG ZM ZW AM AZ BY KG KZ MD RU TJ TM AT BE BG CH CY CZ DE DK EE ES FI FR GB GR HU IE IT LU MC NL PL PT RO SE SI SK TR BF BJ CF CG CI CM GA GN GQ GW ML MR NE SN TD TG

121 Ep: the epo has been informed by wipo that ep was designated in this application
122 Ep: pct application non-entry in european phase