US6654813B1 - Dynamically categorizing entity information - Google Patents

Info

Publication number
US6654813B1
Authority
US
United States
Prior art keywords
entity
category
search engine
user
criteria
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Expired - Lifetime
Application number
US09/374,175
Inventor
Jeffery Dean Black
Jason Harvey Titus
Ira Joseph Woodhead
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
R2 Solutions LLC
Altaba Inc
Original Assignee
AltaVista Co
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by AltaVista Co
Priority to US09/374,175
Assigned to IATLAS CORPORATION. ASSIGNMENT OF ASSIGNORS INTEREST (SEE DOCUMENT FOR DETAILS). Assignors: BLACK, JEFFERY DEAN, TITUS, JASON HARVEY, WOODHEAD, IRA JOSEPH
Assigned to ATLA VISTA COMPANY. MERGER (SEE DOCUMENT FOR DETAILS). Assignors: IATLAS CORPORATION
Assigned to OVERTURE SERVICES, INC. ASSIGNMENT OF ASSIGNORS INTEREST (SEE DOCUMENT FOR DETAILS). Assignors: ALTA VISTA COMPANY
Priority to US10/693,667 (US7398266B2)
Application granted
Publication of US6654813B1
Assigned to OVERTURE SERVICES, INC. ASSIGNMENT OF ASSIGNORS INTEREST (SEE DOCUMENT FOR DETAILS). Assignors: ALTA VISTA COMPANY
Assigned to OVERTURE SERVICES INC. ASSIGNMENT OF ASSIGNORS INTEREST (SEE DOCUMENT FOR DETAILS). Assignors: ALTA VISTA COMPANY
Assigned to OVERTURE SERVICES, INC. ASSIGNMENT OF ASSIGNORS INTEREST (SEE DOCUMENT FOR DETAILS). Assignors: ALTA VISTA COMPANY
Assigned to YAHOO! INC. MERGER (SEE DOCUMENT FOR DETAILS). Assignors: OVERTURE SERVICES, INC
Assigned to EXCALIBUR IP, LLC. ASSIGNMENT OF ASSIGNORS INTEREST (SEE DOCUMENT FOR DETAILS). Assignors: YAHOO! INC.
Assigned to YAHOO! INC. ASSIGNMENT OF ASSIGNORS INTEREST (SEE DOCUMENT FOR DETAILS). Assignors: EXCALIBUR IP, LLC
Assigned to EXCALIBUR IP, LLC. ASSIGNMENT OF ASSIGNORS INTEREST (SEE DOCUMENT FOR DETAILS). Assignors: YAHOO! INC.
Anticipated expiration
Assigned to R2 SOLUTIONS LLC. ASSIGNMENT OF ASSIGNORS INTEREST (SEE DOCUMENT FOR DETAILS). Assignors: EXCALIBUR IP, LLC
Expired - Lifetime

Classifications

    • G PHYSICS
    • G06 COMPUTING; CALCULATING OR COUNTING
    • G06F ELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00 Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/90 Details of database functions independent of the retrieved data types
    • G06F16/95 Retrieval from the web
    • G06F16/954 Navigation, e.g. using categorised browsing
    • G PHYSICS
    • G06 COMPUTING; CALCULATING OR COUNTING
    • G06F ELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00 Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/90 Details of database functions independent of the retrieved data types
    • G06F16/95 Retrieval from the web
    • G06F16/957 Browsing optimisation, e.g. caching or content distillation
    • Y GENERAL TAGGING OF NEW TECHNOLOGICAL DEVELOPMENTS; GENERAL TAGGING OF CROSS-SECTIONAL TECHNOLOGIES SPANNING OVER SEVERAL SECTIONS OF THE IPC; TECHNICAL SUBJECTS COVERED BY FORMER USPC CROSS-REFERENCE ART COLLECTIONS [XRACs] AND DIGESTS
    • Y10 TECHNICAL SUBJECTS COVERED BY FORMER USPC
    • Y10S TECHNICAL SUBJECTS COVERED BY FORMER USPC CROSS-REFERENCE ART COLLECTIONS [XRACs] AND DIGESTS
    • Y10S707/00 Data processing: database and file management or data structures
    • Y10S707/99931 Database or file accessing
    • Y10S707/99933 Query processing, i.e. searching
    • Y GENERAL TAGGING OF NEW TECHNOLOGICAL DEVELOPMENTS; GENERAL TAGGING OF CROSS-SECTIONAL TECHNOLOGIES SPANNING OVER SEVERAL SECTIONS OF THE IPC; TECHNICAL SUBJECTS COVERED BY FORMER USPC CROSS-REFERENCE ART COLLECTIONS [XRACs] AND DIGESTS
    • Y10 TECHNICAL SUBJECTS COVERED BY FORMER USPC
    • Y10S TECHNICAL SUBJECTS COVERED BY FORMER USPC CROSS-REFERENCE ART COLLECTIONS [XRACs] AND DIGESTS
    • Y10S707/00 Data processing: database and file management or data structures
    • Y10S707/99931 Database or file accessing
    • Y10S707/99933 Query processing, i.e. searching
    • Y10S707/99934 Query formulation, input preparation, or translation
    • Y GENERAL TAGGING OF NEW TECHNOLOGICAL DEVELOPMENTS; GENERAL TAGGING OF CROSS-SECTIONAL TECHNOLOGIES SPANNING OVER SEVERAL SECTIONS OF THE IPC; TECHNICAL SUBJECTS COVERED BY FORMER USPC CROSS-REFERENCE ART COLLECTIONS [XRACs] AND DIGESTS
    • Y10 TECHNICAL SUBJECTS COVERED BY FORMER USPC
    • Y10S TECHNICAL SUBJECTS COVERED BY FORMER USPC CROSS-REFERENCE ART COLLECTIONS [XRACs] AND DIGESTS
    • Y10S707/00 Data processing: database and file management or data structures
    • Y10S707/99931 Database or file accessing
    • Y10S707/99933 Query processing, i.e. searching
    • Y10S707/99935 Query augmenting and refining, e.g. inexact access

Definitions

  • This application relates to dynamically categorizing entity information.
  • services known as Internet or Web portals provide hierarchical directories of Web sites.
  • the hierarchical directories include, in Web pages organized by categories, links to Web sites and home pages that are under the control of entities such as businesses.
  • both the creation of the categories and the assignment of Web sites to the categories are performed with substantial human input, as are any changes to the categories and assignments. For example, after a medical category is created for medical entities such as hospitals, a human searches for hospitals that have Web sites and then assigns the Web sites to the medical category.
  • a human must reassign the hospital Web sites to the proper subcategories, by determining which hospitals qualify as small hospitals and large hospitals and reassigning each of the corresponding hospital Web sites accordingly.
  • Each of the Web pages is identifiable by a respective Uniform Resource Locator text string (“URL”), such as “http://www.isp321.com/frontpage.html”, that the Web browser software can use to select the page.
  • Each URL includes a domain name, such as “isp321.com”, that identifies the Web site where the corresponding Web page is stored for retrieval by Web browser software.
  • Each domain name is registered by an entity that controls the corresponding Web site and Web pages.
  • a domain name registry organization maintains the domain name registration information, which may include name, address, and other information that allows the organization to bill the entity for payment for the maintenance. (It is to be understood that the term “registry”, as used herein, also refers to a domain name registrar or any other entity that may provide assistance in registering a domain name.)
  • An Internet service provider (“ISP”) is an example of an entity that may have a registered domain name for a Web site.
  • ISP has customers such as individuals or businesses for whom the ISP stores Web pages on the Web site for retrieval by Web browser software.
  • the ISP may have a customer Maple Street Plumbing for which the ISP stores a home Web page having a URL that includes a prefix “http://www.isp321.com/~maplestplumb”.
  • a home Web page is typically the only or the primary entry point into a Web site or a set of Web pages that are under the control of an entity.
  • a Web portal is another example of an entity that may have a registered domain name.
  • a Web portal site allows another entity to create a link from the Web portal site to the other entity's Web site or home page by submitting information to the Web portal site.
  • Some information about an entity may not be available on a Web site that is under the control of the entity.
  • public financial information about a company may be stored in a database that is not linked to the company's Web site or is not directly accessible by Web browser software, such as a database under the control of a financial services firm.
  • a method and a system are provided that allow categorized directories of Web sites to be created, maintained, and reconfigured easily without excessive human intervention, and that allow the Web sites listed in the categorized directories to be associated with links to additional information about the respective entities that have control over the Web sites.
  • a set of criteria (such as geographical location or corresponding standard industry code) is acquired, from a user or elsewhere, that defines a category of entities.
  • the set of criteria is dynamically applied, to a source such as an entity information database, to identify an entity that meets the criteria. It is determined, from a domain name registration organization or an ISP or elsewhere, that the entity is registered as having control over at least a portion of a World-Wide Web address.
  • the at least a portion of a World-Wide Web address is associated with the entity in a presentation, such as a Web page, that indicates that the entity meets the set of criteria (that is, it belongs to the category of entities).
  • a link may be included to a set of computer data about the entity, such as information about the entity in the entity information database.
  • the set of computer data includes information other than information provided at the World-Wide Web address.
  • FIGS. 1-4 are block diagrams of computer-based systems.
  • FIG. 5 is a flow diagram of a computer-based procedure.
  • FIGS. 6-7 are illustrations of output produced by software.
  • FIGS. 8-9 are illustrations of database information.
  • FIG. 10 is an illustration of computer file information.
  • FIG. 1 illustrates a computer system 10 in which a mapping database 12 maps URLs or domain names 14 to entities 16 such as people, businesses, or government agencies, as described in more detail below.
  • the mapping database may indicate that any URL that begins with “http://www.uspto.gov” is for a Web page controlled by the U.S. Patent and Trademark Office, or that domain names “elmstdogs.com” and “elmstcats.com” are under the control of a company named Elm Street Pets, Inc.
  • Numerous applications, such as the directory application described below, can take advantage of the mapping database.
  • FIG. 2 illustrates a computer system 20 having the mapping database, a search engine 22, a Web page record database 24 that includes Web page records 26a-26d, and an entity information database 28 (also known as a business data database) that includes information such as geographic information about entities to which URLs or domain names are mapped in the mapping database.
  • the mapping database may use a unique identification number (“unique ID”), such as a 9-digit American Business Information (“ABI”) number, to identify an entity so that other information about the entity can be retrieved from the entity information database or elsewhere by searching under the unique ID. (ABI numbers are sponsored by info USA.) For example, unique IDs from the mapping database may be used to search the entity information database to produce a subset of the mapping database that has records only for entities having a particular characteristic, such as hospitals having a particular geographic location or more than 1000 employees.
  • each of the entities may be assigned different unique IDs, and the different unique IDs may be linked in the mapping database to note the relationship among the entities. For example, a company that has offices in different locations may be assigned a unique ID for the company itself and a respective different unique ID for each location. In another example, when two previously unrelated companies merge or one is acquired by the other, each may retain its unique ID and a new, different unique ID may be assigned to the combination of the two companies, or both companies may be assigned the same unique ID.
  • each entity may be associated with standard industry code (“SIC”) fields for SIC numbers that indicate the industry categories for the entity.
  • SIC field may be arranged to hold a number having sections to indicate broad and narrow industry categories.
  • the SIC field may hold an SIC number having six digits, of which the first two digits may indicate a broad industry category such as “service companies”, the second two digits may indicate “computer service companies” as a subcategory of “service companies”, and the third two digits may indicate “manufacturer computer service companies” as a subcategory of “computer service companies”.
  • the entity information database can be searched by industry categories or subcategories represented by SIC numbers.
  • Information in the mapping database may be derived from information submitted by or on behalf of the entity when a domain name is registered. For example, when the company Elm Street Pets, Inc. registers the domain names “elmstdogs.com” and “elmstcats.com” with a domain name registry, the company associates the domain names with at least enough information, such as name, address, and telephone number information, to allow the domain name registry to bill the company for maintenance of the registration.
  • the entity may submit information to the mapping database in other ways such as in an on-line questionnaire that feeds the mapping database.
  • Information in the mapping database may be derived from information provided by an intermediary such as an ISP or an Internet portal.
  • an ISP having a domain name “isp321.com” may have a customer Maple Street Plumbing for which the ISP hosts and administers a home page having a home page address “www.isp321.com/~maplestplumb”.
  • the ISP may have name, address, and telephone number information for the purpose of billing Maple Street Plumbing for such hosting and administration, and may allow such information along with the home page address to be used to link the home page address to Maple Street Plumbing in the mapping database.
  • an Internet portal may allow an entity such as Maple Street Plumbing to create an entry or listing named “Maple Street Plumbing” in a “plumbing” section of an on-line directory maintained by the portal, to allow a user to view home page “www.isp321.com/~maplestplumb” by selecting the entry.
  • the Internet portal may allow information in the entry, and perhaps any address and telephone number information submitted by the entity during creation of the entry, to be used to link the home page to Maple Street Plumbing in the mapping database.
  • the mapping database and applications based on the mapping database may take advantage of a hierarchical organization of Web pages, by treating similarly a mapped page and all pages below the mapped page, such as pages sharing a particular prefix with the mapped page. For example, all pages sharing the prefix “www.isp321.com” may be treated as being under the control of an ISP named Global ISP Co. Since such pages include pages sharing the prefix “www.isp321.com/~maplestplumb”, which should be treated as being under the control of Maple Street Plumbing, execution of a unique ID tagging procedure for Global ISP Co. should be followed by execution of a unique ID tagging procedure for Maple Street Plumbing so that tags referring to Global ISP Co. are changed to tags referring to Maple Street Plumbing where appropriate.
  • the mapping database may map an entity to Web pages maintained at different Web sites. For example, Maple Street Plumbing may have a first set of Web pages at the Global ISP Co. site and a second set of Web pages at another ISP's site.
  • the entity information database may include a database such as EDGAR that includes information about companies. Information derived from EDGAR may be used to allow a search of the entity information database to be limited to companies that match a specified financial profile, such as profitable companies.
  • mapping database or the entity information database may allow searches to be limited by relative size of entities, such as size in an industry.
  • One or more of the databases referenced above may be or include a relational database and may have records to which fields may be added readily to accept informational tags and Web link information.
  • FIG. 3 illustrates an example of a directory application system 400 in which information drawn from the mapping database 12 and the entity information database 28 is used to produce a categorized directory view 402 of entities and Web sites that are under the control of the entities.
  • the entity information database has information about attributes of each entity, such as the entity's location, a SIC code for the entity, and the size of the entity by the number of employees.
  • the entities and the Web sites under the control of the entities are grouped according to one or more of the attributes. For example, the grouping may be according to subject matter areas identified by SIC codes, or according to business status such as privately-held, public, or not-for-profit.
  • a directory application 404 draws information from the mapping database and the entity information database at the time a categorized directory view is produced, so that the information presented in the view is as current as the information in the mapping and entity information databases.
  • the example embodiment described below also provides a highly data storage space efficient implementation that does not require an intermediate database of directory contents; the directory application relies on a category file 406 that defines the categorical structure, but not the contents, of the view.
  • a category file 502 is maintained that defines a hierarchical categorical structure of SIC codes (step 1010). As shown in an example in FIG. 10, the category file has at least a top level of broad subject matter categories, and may also have lower levels of subcategories. Each category and subcategory is associated with a respective set of SIC codes.
  • a top level directory view is presented (step 1020).
  • the top level directory view lists each of the categories in the top level, and may also allow the user to specify geographical criteria for a geographical filter, which filter is used as described below.
  • FIG. 6 illustrates an example of a top level directory view.
  • the user is allowed to select a lowest level category or subcategory (step 1030). If there are no subcategories, a top level category may serve as the lowest level category. Otherwise, the user can burrow down through one or more levels of subcategories until one of the lowest level subcategories is reached.
  • the lowest level category is selected by one or more mouse clicks or other user entry, and hierarchical views showing categories and subcategories are presented as necessary to help the user navigate.
  • the entity information database is searched to acquire information indicating which entities match the selected SIC codes (“selected entities”) (step 1050).
  • the selected entities information may identify the selected entities by their respective unique IDs.
  • a view 504 is presented in which the selected entities are listed under the selected lowest level category (step 1060). As the selected entities are listed, the mapping database is searched to determine URLs for Web sites that are under the control of the selected entities (“selected URLs”) (step 1070), and the selected URLs are listed together with the respective selected entities (step 1080).
  • FIG. 7 illustrates an example of a view of the list of the selected entities. From the view, the user can retrieve a page from a selected entity's Web site by selecting the respective selected URL (e.g., “www.nationalparks.org”), and can retrieve information about the entity from the entity information database by selecting the name of the entity in the list (e.g., “US Interior Dept”).
  • the selected entities are filtered, e.g., to filter out selected entities that do not match geographical criteria specified by the user (step 1090), or that are indicated in the mapping database as having control over no Web sites (step 1100).
  • the geographical filter may be implemented by including the geographical criteria with the selected SIC codes when the entity information database is searched to determine selected entities. For example, if the user's geographical criteria specify a city, the only entities that are included in the view of the list of the selected entities are entities that are indicated in the entity information database as having a primary or secondary address in the city. The user's geographical criteria may require that the address in the city be a primary address, and the directory application may allow this requirement to be selected by an input selection prompt such as box 700 of FIG. 7.
  • the directory application or another application such as a search engine may keep track of the number of times a Web site is accessed (i.e., the site's popularity), and may sort the list of selected entities by the number of times the Web sites under the control of the respective entities have been accessed, or the category file may be structured to group entities by the number of times their Web sites have been accessed.
  • Multiple directory applications based on different category files may draw information from the same mapping database and the same entity information database, and thereby may be updated effectively instantaneously and simultaneously as the mapping and entity information databases are updated.
  • a first directory application may be provided to group entities by SIC codes as described above, and a second directory application may be provided to group entities by relationship to medical specialties or other professional specialties.
  • the mapping database is updated to reflect that a medical office has made a change to the Web address of the Web site that is under the control of the medical office, the change is included in any views that are subsequently presented by the first and second directory applications and that list the medical office as a selected entity.
  • Other information that may be used in a category file to group entities includes brands of goods or services that may be offered by or through entities, and information listed in FIGS. 8-9 that illustrate an example of information that may be stored in the entity information database for each entity.
  • the user may also be permitted, in accordance with an input mechanism such as box 702 of FIG. 7, to execute a Web site search limited to Web sites identified by the selected URLs.
  • a search may be accomplished by directing search engine 22 (FIG. 2) to search Web page record database 24 and then filtering out of the search results any records that do not include at least one of the selected URLs.
  • the records in the Web page record database have been tagged with unique IDs corresponding to entities
  • such a search may be accomplished by retrieving, from the entity information database, the unique IDs for the selected entities (“selected unique IDs”), and directing the search engine to return only records that are tagged with one or more of the selected unique IDs.
  • any of many different types of computer equipment may be used.
  • one or more Intel-based personal computers may be used that run an SQL database on Linux and one or more programs written in Perl or the C programming language with interfaces to the SQL database.
  • the technique may be implemented in hardware or software, or a combination of both.
  • the technique is implemented in computer programs executing on one or more programmable computers, such as a personal computer running or able to run an operating system such as Unix, Linux, Microsoft Windows 95, 98, or NT, or Macintosh OS, that each include a processor, a storage medium readable by the processor (including volatile and non-volatile memory and/or storage elements), at least one input device such as a keyboard, and at least one output device.
  • Program code is applied to data entered using the input device to perform the technique described above and to generate output information.
  • the output information is applied to one or more output devices such as a display screen of the computer.
  • each program is implemented in a high level procedural or object-oriented programming language such as Perl, C, C++, or Java to communicate with a computer system.
  • the programs can be implemented in assembly or machine language, if desired.
  • the language may be a compiled or interpreted language.
  • each such computer program is stored on a storage medium or device, such as ROM or optical or magnetic disc, that is readable by a general or special purpose programmable computer for configuring and operating the computer when the storage medium or device is read by the computer to perform the procedures described in this document.
  • the system may also be considered to be implemented as a computer-readable storage medium, configured with a computer program, where the storage medium so configured causes a computer to operate in a specific and predefined manner.
  • the user may be a human being or a non-human entity such as a computer program or an automated device that may interact with one or more of the databases or one or more of the applications via an application programming interface (“API”) or a network message.
  • An on-line information store or multiple databases may serve as the entity information database, which may take the form of any mechanism that provides automated access to information, such as a spreadsheet file or a store of email messages.
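The directory procedure described in the list above (steps 1010 through 1100) can be sketched in Python as follows. This is a minimal illustration under stated assumptions, not the patented implementation: the category names, SIC code values, unique IDs, hospital names, and ".example" addresses are all invented for the example, and in-memory dictionaries stand in for the category file, the entity information database, and the mapping database.

```python
# Hypothetical category file: defines structure only (category ->
# subcategory -> SIC codes), never the directory contents.
CATEGORY_FILE = {
    "Health": {"Hospitals": {"110101"}},
    "Home Services": {"Plumbers": {"220101"}},
}

# Stand-in for the entity information database, keyed by unique ID.
# All records and SIC values here are made up for illustration.
ENTITY_INFO = {
    "000000001": {"name": "Elm General Hospital", "sic": "110101", "cities": {"Springfield"}},
    "000000002": {"name": "Oak Valley Hospital", "sic": "110101", "cities": {"Shelbyville"}},
    "000000003": {"name": "Pine Ridge Hospital", "sic": "110101", "cities": {"Springfield"}},
    "000000004": {"name": "Maple Street Plumbing", "sic": "220101", "cities": {"Springfield"}},
}

# Stand-in for the mapping database: unique ID -> URLs under the entity's control.
MAPPING = {
    "000000001": ["www.elmgeneral.example"],
    "000000002": ["www.oakvalley.example"],
    "000000004": ["www.isp321.com/~maplestplumb"],
}


def directory_view(path, city=None):
    """Build one directory listing at request time (no stored contents)."""
    node = CATEGORY_FILE
    for name in path:                      # step 1030: walk down to the selection
        node = node[name]
    if not isinstance(node, set):
        raise ValueError("path must end at a lowest level category")
    rows = []
    for uid in sorted(ENTITY_INFO):        # step 1050: match the selected SIC codes
        info = ENTITY_INFO[uid]
        if info["sic"] not in node:
            continue
        if city is not None and city not in info["cities"]:
            continue                       # step 1090: geographical filter
        urls = MAPPING.get(uid, [])        # step 1070: look up the selected URLs
        if not urls:
            continue                       # step 1100: drop entities with no Web site
        rows.append((info["name"], urls))  # steps 1060 and 1080: list entity with URLs
    return rows


for name, urls in directory_view(["Health", "Hospitals"], city="Springfield"):
    print(name, urls)
```

Because the listing is computed from the two stand-in databases on every call, a change to either one shows up in the next view without any stored directory contents to update, which is the property the text attributes to the directory application.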

Abstract

A set of criteria is acquired that defines a category of entities. The set of criteria is dynamically applied to identify an entity that meets the criteria. It is determined that the entity is registered as having control over at least a portion of a World-Wide Web address. The at least a portion of a World-Wide Web address is associated with the entity in a presentation that indicates that the entity belongs to the category of entities. In the presentation, a link may be included to a set of computer data about the entity. The set of computer data includes information other than information provided at a World-Wide Web site corresponding to the World-Wide Web address.

Description

CROSS-REFERENCE TO RELATED APPLICATIONS
This application claims the benefit of United States Provisional Application Serial No. 60/097,029 entitled “Collecting, Combining, Analyzing, and Using Internet and Business Information” filed on Aug. 17, 1998, which is incorporated herein.
BACKGROUND OF THE INVENTION
This application relates to dynamically categorizing entity information.
On the World-Wide Web (“Web”), services known as Internet or Web portals provide hierarchical directories of Web sites. The hierarchical directories include, in Web pages organized by categories, links to Web sites and home pages that are under the control of entities such as businesses. Typically, both the creation of the categories and the assignment of Web sites to the categories are performed with substantial human input, as are any changes to the categories and assignments. For example, after a medical category is created for medical entities such as hospitals, a human searches for hospitals that have Web sites and then assigns the Web sites to the medical category. In such a case, if the medical category is then broken up into multiple subcategories such as a small hospitals subcategory and a large hospitals subcategory, a human must reassign the hospital Web sites to the proper subcategories, by determining which hospitals qualify as small hospitals and large hospitals and reassigning each of the corresponding hospital Web sites accordingly.
Much of the information available on Web sites is organized into Web pages that can be retrieved and displayed by Web browser software under the direction of a user. Each of the Web pages is identifiable by a respective Uniform Resource Locator text string (“URL”), such as “http://www.isp321.com/frontpage.html”, that the Web browser software can use to select the page. Each URL includes a domain name, such as “isp321.com”, that identifies the Web site where the corresponding Web page is stored for retrieval by Web browser software. Each domain name is registered by an entity that controls the corresponding Web site and Web pages. A domain name registry organization maintains the domain name registration information, which may include name, address, and other information that allows the organization to bill the entity for payment for the maintenance. (It is to be understood that the term “registry”, as used herein, also refers to a domain name registrar or any other entity that may provide assistance in registering a domain name.)
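As a small illustration of how a domain name can be read out of a URL, the following Python sketch uses the standard library's `urllib.parse`. The helper name `registered_domain` is hypothetical, and stripping a leading "www." is a simplifying assumption made only for this example; it is not a general rule for finding the registered part of a host name.

```python
from urllib.parse import urlsplit


def registered_domain(url: str) -> str:
    """Return the host of a URL with a leading "www." removed.

    Simplifying assumption: the registered domain is the host minus "www.".
    Hosts with other subdomains or multi-part suffixes need more care.
    """
    host = urlsplit(url).hostname or ""
    return host[4:] if host.startswith("www.") else host


print(registered_domain("http://www.isp321.com/frontpage.html"))
```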
An Internet service provider (“ISP”) is an example of an entity that may have a registered domain name for a Web site. Typically, an ISP has customers such as individuals or businesses for whom the ISP stores Web pages on the Web site for retrieval by Web browser software. For example, the ISP may have a customer Maple Street Plumbing for which the ISP stores a home Web page having a URL that includes a prefix “http://www.isp321.com/~maplestplumb”. A home Web page is typically the only or the primary entry point into a Web site or a set of Web pages that are under the control of an entity.
A Web portal is another example of an entity that may have a registered domain name. Typically, a Web portal site allows another entity to create a link from the Web portal site to the other entity's Web site or home page by submitting information to the Web portal site.
Some information about an entity may not be available on a Web site that is under the control of the entity. For example, public financial information about a company may be stored in a database that is not linked to the company's Web site or is not directly accessible by Web browser software, such as a database under the control of a financial services firm.
SUMMARY OF THE INVENTION
A method and a system are provided that allow categorized directories of Web sites to be created, maintained, and reconfigured easily without excessive human intervention, and that allow the Web sites listed in the categorized directories to be associated with links to additional information about the respective entities that have control over the Web sites. A set of criteria (such as geographical location or corresponding standard industry code) is acquired, from a user or elsewhere, that defines a category of entities. The set of criteria is dynamically applied, to a source such as an entity information database, to identify an entity that meets the criteria. It is determined, from a domain name registration organization or an ISP or elsewhere, that the entity is registered as having control over at least a portion of a World-Wide Web address. The at least a portion of a World-Wide Web address is associated with the entity in a presentation, such as a Web page, that indicates that the entity meets the set of criteria (that is, it belongs to the category of entities). In the presentation, a link may be included to a set of computer data about the entity, such as information about the entity in the entity information database. The set of computer data includes information other than information provided at the World-Wide Web address.
Other features and advantages will become apparent from the following description, including the drawings, and from the claims.
BRIEF DESCRIPTION OF THE DRAWINGS
FIGS. 1-4 are block diagrams of computer-based systems.
FIG. 5 is a flow diagram of a computer-based procedure.
FIGS. 6-7 are illustrations of output produced by software.
FIGS. 8-9 are illustrations of database information.
FIG. 10 is an illustration of computer file information.
DETAILED DESCRIPTION
FIG. 1 illustrates a computer system 10 in which a mapping database 12 maps URLs or domain names 14 to entities 16 such as people, businesses, or government agencies, as described in more detail below. For example, the mapping database may indicate that any URL that begins with “http://www.uspto.gov” is for a Web page controlled by the U.S. Patent and Trademark Office, or that domain names “elmstdogs.com” and “elmstcats.com” are under the control of a company named Elm Street Pets, Inc.
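By way of illustration only, the lookup performed against the mapping database may be sketched as follows. The sketch assumes the mapping is held in two in-memory tables, one keyed by URL prefix and one keyed by domain name; the table names and the function `entity_for_url` are hypothetical and are not drawn from the described system.

```python
from urllib.parse import urlsplit

# Hypothetical mapping records, using the examples given in the text.
URL_PREFIXES = {"http://www.uspto.gov": "U.S. Patent and Trademark Office"}
DOMAIN_NAMES = {
    "elmstdogs.com": "Elm Street Pets, Inc.",
    "elmstcats.com": "Elm Street Pets, Inc.",
}


def entity_for_url(url):
    """Return the entity mapped to a URL, or None if the URL is unmapped."""
    # First try a match on a mapped URL prefix.
    for prefix, entity in URL_PREFIXES.items():
        if url.startswith(prefix):
            return entity
    # Otherwise try the host name and each of its parent domains.
    host = urlsplit(url).hostname or ""
    labels = host.split(".")
    for i in range(len(labels)):
        candidate = ".".join(labels[i:])
        if candidate in DOMAIN_NAMES:
            return DOMAIN_NAMES[candidate]
    return None
```

In this sketch a host such as "www.elmstcats.com" resolves through its parent domain "elmstcats.com", which is one possible reading of a domain name being under the control of an entity.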
Numerous applications, such as the directory application described below, can take advantage of the mapping database.
FIG. 2 illustrates a computer system 20 having the mapping database, a search engine 22, a Web page record database 24 that includes Web page records 26a-26d, and an entity information database 28 (also known as a business data database) that includes information such as geographic information about entities to which URLs or domain names are mapped in the mapping database.
The mapping database may use a unique identification number (“unique ID”), such as a 9-digit American Business Information (“ABI”) number, to identify an entity so that other information about the entity can be retrieved from the entity information database or elsewhere by searching under the unique ID. (ABI numbers are sponsored by infoUSA.) For example, unique IDs from the mapping database may be used to search the entity information database to produce a subset of the mapping database that has records only for entities having a particular characteristic, such as hospitals having a particular geographic location or more than 1000 employees.
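As a minimal sketch of such a subset search, the two databases may be modeled as tables in an SQL database joined on the unique ID. The table layout, column names, sample rows, and the function `mapping_subset` are assumptions made for illustration; the unique IDs shown are placeholders and not actual ABI numbers.

```python
import sqlite3

conn = sqlite3.connect(":memory:")
conn.executescript("""
CREATE TABLE mapping (unique_id TEXT, address TEXT);
CREATE TABLE entity_info (
    unique_id TEXT PRIMARY KEY, name TEXT, kind TEXT, city TEXT, employees INTEGER
);
INSERT INTO mapping VALUES
    ('000000001', 'www.bighospital.example'),
    ('000000002', 'www.smallclinic.example'),
    ('000000003', 'elmstdogs.com');
INSERT INTO entity_info VALUES
    ('000000001', 'Big Hospital', 'hospital', 'Boston', 1500),
    ('000000002', 'Small Clinic', 'hospital', 'Boston', 200),
    ('000000003', 'Elm Street Pets, Inc.', 'pet store', 'Boston', 12);
""")


def mapping_subset(conn, kind, min_employees):
    """Return mapping rows only for entities of one kind above a size threshold."""
    return conn.execute(
        """SELECT m.unique_id, m.address
           FROM mapping AS m
           JOIN entity_info AS e ON e.unique_id = m.unique_id
           WHERE e.kind = ? AND e.employees > ?
           ORDER BY m.address""",
        (kind, min_employees),
    ).fetchall()
```

Calling `mapping_subset(conn, "hospital", 1000)` in this sketch yields only the mapping record for the hospital having more than 1000 employees.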
Where an entity constitutes a portion of another entity, each of the entities may be assigned different unique IDs, and the different unique IDs may be linked in the mapping database to note the relationship among the entities. For example, a company that has offices in different locations may be assigned a unique ID for the company itself and a respective different unique ID for each location. In another example, when two previously unrelated companies merge or one is acquired by the other, each may retain its unique ID and a new, different unique ID may be assigned to the combination of the two companies, or both companies may be assigned the same unique ID.
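One possible way to record such links, shown here only as a sketch, is a table that maps each subordinate unique ID to the unique ID of the entity of which it forms a portion. The table `PARENT_OF`, the placeholder IDs, and the function `related_ids` are hypothetical.

```python
# Hypothetical link table: unique ID of a portion -> unique ID of the whole.
PARENT_OF = {
    "100000002": "100000001",  # first office location of the company
    "100000003": "100000001",  # second office location of the company
}


def related_ids(root_id):
    """Return root_id plus every unique ID linked beneath it, in sorted order."""
    found = {root_id}
    changed = True
    while changed:  # repeat until no further linked IDs are discovered
        changed = False
        for child, parent in PARENT_OF.items():
            if parent in found and child not in found:
                found.add(child)
                changed = True
    return sorted(found)
```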
In the entity information database, each entity may be associated with standard industry code (“SIC”) fields for SIC numbers that indicate the industry categories for the entity. Each SIC field may be arranged to hold a number having sections to indicate broad and narrow industry categories. For example, the SIC field may hold an SIC number having six digits, of which the first two digits may indicate a broad industry category such as “service companies”, the second two digits may indicate “computer service companies” as a subcategory of “service companies”, and the third two digits may indicate “manufacturer computer service companies” as a subcategory of “computer service companies”. As a result, the entity information database can be searched by industry categories or subcategories represented by SIC numbers.
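The sectioned SIC number described above lends itself to prefix matching, as the following sketch suggests. It assumes a six-digit number stored as a string; the sample entities, their SIC numbers, and the function names are invented for illustration and do not correspond to actual SIC assignments.

```python
def sic_levels(sic):
    """Split a six-digit SIC number into broad, middle, and narrow category keys."""
    if len(sic) != 6 or not sic.isdigit():
        raise ValueError(f"expected a six-digit SIC number, got {sic!r}")
    return sic[:2], sic[:4], sic


def search_by_category(entities, prefix):
    """Return names of entities whose SIC number falls under the given prefix."""
    return sorted(name for name, sic in entities.items() if sic.startswith(prefix))


# Hypothetical entity records: name -> SIC number.
ENTITIES = {
    "Acme Computer Repair": "731205",
    "Byte Consulting": "731299",
    "Corner Laundry": "721101",
}
```

A two-digit prefix selects a broad category, a four-digit prefix a subcategory, and the full six digits the narrowest subcategory.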
Information in the mapping database may be derived from information submitted by or on behalf of the entity when a domain name is registered. For example, when the company Elm Street Pets, Inc. registers the domain names “elmstdogs.com” and “elmstcats.com” with a domain name registry, the company associates the domain names with at least enough information, such as name, address, and telephone number information, to allow the domain name registry to bill the company for maintenance of the registration.
The entity may submit information to the mapping database in other ways such as in an on-line questionnaire that feeds the mapping database.
Information in the mapping database may be derived from information provided by an intermediary such as an ISP or an Internet portal. For example, an ISP having a domain name “isp321.com” may have a customer Maple Street Plumbing for which the ISP hosts and administers a home page having a home page address “www.isp321.com/~maplestplumb”. In such a case, the ISP may have name, address, and telephone number information for the purpose of billing Maple Street Plumbing for such hosting and administration, and may allow such information along with the home page address to be used to link the home page address to Maple Street Plumbing in the mapping database.
In another example, an Internet portal may allow an entity such as Maple Street Plumbing to create an entry or listing named “Maple Street Plumbing” in a “plumbing” section of an on-line directory maintained by the portal, to allow a user to view home page “www.isp321.com/~maplestplumb” by selecting the entry. In such a case, the Internet portal may allow information in the entry, and perhaps any address and telephone number information submitted by the entity during creation of the entry, to be used to link the home page to Maple Street Plumbing in the mapping database.
The mapping database and applications based on the mapping database may take advantage of a hierarchical organization of Web pages, by treating similarly a mapped page and all pages below the mapped page, such as pages sharing a particular prefix with the mapped page. For example, all pages sharing the prefix “www.isp321.com” may be treated as being under the control of an ISP named Global ISP Co. Since such pages include pages sharing the prefix “www.isp321.com/~maplestplumb”, which should be treated as being under the control of Maple Street Plumbing, execution of a unique ID tagging procedure for Global ISP Co. should be followed by execution of a unique ID tagging procedure for Maple Street Plumbing so that tags referring to Global ISP Co. are changed to tags referring to Maple Street Plumbing where appropriate.
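The ordering of the tagging procedures may be sketched as follows, on the assumption that each procedure tags every page sharing a prefix and that a later procedure overwrites tags left by an earlier one. The function `tag_pages`, the page list, and the placeholder unique IDs are hypothetical, and the home page prefix is written with an ordinary ASCII tilde.

```python
def tag_pages(pages, passes):
    """Apply tagging passes in order; a later pass overwrites earlier tags."""
    tags = {}
    for prefix, unique_id in passes:
        for page in pages:
            if page.startswith(prefix):
                tags[page] = unique_id
    return tags


GLOBAL_ISP_ID = "500000001"    # placeholder unique ID for Global ISP Co.
MAPLE_STREET_ID = "500000002"  # placeholder unique ID for Maple Street Plumbing

pages = [
    "www.isp321.com/index.html",
    "www.isp321.com/~maplestplumb/index.html",
    "www.isp321.com/~maplestplumb/rates.html",
]

# The broader prefix is tagged first, then the narrower prefix.
passes = [
    ("www.isp321.com", GLOBAL_ISP_ID),
    ("www.isp321.com/~maplestplumb", MAPLE_STREET_ID),
]
tags = tag_pages(pages, passes)
```

If the passes were run in the opposite order in this sketch, every page would end up tagged for the ISP, which is why the narrower prefix is processed last.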
The mapping database may map an entity to Web pages maintained at different Web sites. For example, Maple Street Plumbing may have a first set of Web pages at the Global ISP Co. site and a second set of Web pages at another ISP's site.
The entity information database may include a database such as EDGAR that includes information about companies. Information derived from EDGAR may be used to allow a search of the entity information database to be limited to companies that match a specified financial profile, such as profitable companies.
Information in the mapping database or the entity information database may allow searches to be limited by relative size of entities, such as size in an industry.
One or more of the databases referenced above may be or include a relational database and may have records to which fields may be added readily to accept informational tags and Web link information.
FIG. 3 illustrates an example of a directory application system 400 in which information drawn from the mapping database 12 and the entity information database 28 is used to produce a categorized directory view 402 of entities and Web sites that are under the control of the entities. The entity information database has information about attributes of each entity, such as the entity's location, a SIC code for the entity, and the size of the entity by the number of employees. In the categorized directory view, the entities and the Web sites under the control of the entities are grouped according to one or more of the attributes. For example, the grouping may be according to subject matter areas identified by SIC codes, or according to business status such as privately-held, public, or not-for-profit.
In at least the case of an example embodiment 500 described below, a directory application 404 draws information from the mapping database and the entity information database at the time a categorized directory view is produced, so that the information presented in the view is as current as the information in the mapping and entity information databases. The example embodiment described below is also highly efficient in its use of data storage space because it does not require an intermediate database of directory contents; the directory application relies on a category file 406 that defines the categorical structure, but not the contents, of the view.
With reference to FIGS. 4-5, example embodiment 500 is now described in connection with a procedure 1000. A category file 502 is maintained that defines a hierarchical categorical structure of SIC codes (step 1010). As shown in an example in FIG. 10, the category file has at least a top level of broad subject matter categories, and may also have lower levels of subcategories. Each category and subcategory is associated with a respective set of SIC codes.
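A category file of this kind may be pictured, purely as an assumption about one workable layout, as a nested structure in which each lowest level category holds a list of SIC codes and each higher category takes the union of the codes beneath it. The category names, the codes, and the function `sic_codes_for` are invented for illustration and do not reproduce the format of FIG. 10.

```python
# Hypothetical category file: dicts are categories with subcategories,
# lists are the SIC codes attached to a lowest level category.
CATEGORY_FILE = {
    "Health": {
        "Hospitals": ["806201", "806202"],
        "Dentists": ["802101"],
    },
    "Travel": ["472401"],  # a top level category with no subcategories
}


def _collect(node):
    """Gather every SIC code at or below a node of the category file."""
    if isinstance(node, dict):
        codes = set()
        for child in node.values():
            codes |= _collect(child)
        return codes
    return set(node)


def sic_codes_for(path, tree=CATEGORY_FILE):
    """Return the sorted SIC codes for the category reached by following path."""
    node = tree
    for name in path:
        node = node[name]
    return sorted(_collect(node))
```

The structure holds only categories and codes, never entities, consistent with a category file that defines the structure but not the contents of a view.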
A top level directory view is presented (step 1020). The top level directory view lists each of the categories in the top level, and may also allow the user to specify geographical criteria for a geographical filter, which filter is used as described below. FIG. 6 illustrates an example of a top level directory view.
The user is allowed to select a lowest level category or subcategory (step 1030). If there are no subcategories, a top level category may serve as the lowest level category. Otherwise, the user can burrow down through one or more levels of subcategories until one of the lowest level subcategories is reached. The lowest level category is selected by one or more mouse clicks or other user entry, and hierarchical views showing categories and subcategories are presented as necessary to help the user navigate.
It is determined, from the category file, which SIC codes are associated with the lowest level category that was selected (“selected SIC codes”) (step 1040).
The entity information database is searched to acquire information indicating which entities match the selected SIC codes (“selected entities”) (step 1050). The selected entities information may identify the selected entities by their respective unique IDs.
A view 504 is presented in which the selected entities are listed under the selected lowest level category (step 1060). As the selected entities are listed, the mapping database is searched to determine URLs for Web sites that are under the control of the selected entities (“selected URLs”) (step 1070), and the selected URLs are listed together with the respective selected entities (step 1080). FIG. 7 illustrates an example of a view of the list of the selected entities. From the view, the user can retrieve a page from a selected entity's Web site by selecting the respective selected URL (e.g., “www.nationalparks.org”), and can retrieve information about the entity from the entity information database by selecting the name of the entity in the list (e.g., “US Interior Dept”).
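Steps 1040 through 1080 may be sketched together as a single function that builds the view at the time it is requested. The three in-memory tables stand in for the category file, the entity information database, and the mapping database; their contents, the placeholder unique IDs, the SIC codes, and the function `directory_view` are assumptions made for illustration.

```python
# Hypothetical stand-ins for the category file and the two databases.
CATEGORY_SIC = {"Parks": ["799951"]}
ENTITY_INFO = {
    "200000001": {"name": "US Interior Dept", "sic": "799951"},
    "200000002": {"name": "Elm Street Pets, Inc.", "sic": "599901"},
}
MAPPING = {
    "200000001": ["www.nationalparks.org"],
    "200000002": ["elmstdogs.com", "elmstcats.com"],
}


def directory_view(category):
    """Build the list of (entity name, URLs) pairs for a lowest level category."""
    selected_sic = set(CATEGORY_SIC[category])            # step 1040
    selected_ids = sorted(                                # step 1050
        uid for uid, rec in ENTITY_INFO.items() if rec["sic"] in selected_sic
    )
    view = []
    for uid in selected_ids:                              # step 1060
        selected_urls = MAPPING.get(uid, [])              # step 1070
        view.append((ENTITY_INFO[uid]["name"], list(selected_urls)))  # step 1080
    return view
```

Because nothing is cached between calls in this sketch, each view reflects whatever the tables contain when the function runs.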
In alternative embodiments, before or as the view of the list of the selected entities is presented, the selected entities are filtered, e.g., to filter out selected entities that do not match geographical criteria specified by the user (step 1090), or that are indicated in the mapping database as having control over no Web sites (step 1100). The geographical filter may be implemented by including the geographical criteria with the selected SIC codes when the entity information database is searched to determine selected entities. For example, if the user's geographical criteria specify a city, the only entities that are included in the view of the list of the selected entities are entities that are indicated in the entity information database as having a primary or secondary address in the city. The user's geographical criteria may further require that the address in the city be a primary address, and the directory application may allow this requirement to be selected by an input selection prompt such as box 700 of FIG. 7.
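The filters of steps 1090 and 1100 may be sketched as follows, assuming each entity record carries one primary city and a list of secondary cities. The record fields, the sample data, and the function `filter_entities` are hypothetical.

```python
def filter_entities(entities, mapping, city=None, primary_only=False,
                    require_site=False):
    """Return unique IDs of entities that pass the optional filters."""
    kept = []
    for entity in entities:
        if city is not None:                        # geographical filter (step 1090)
            cities = [entity["primary_city"]]
            if not primary_only:
                cities += entity["secondary_cities"]
            if city not in cities:
                continue
        if require_site and not mapping.get(entity["unique_id"]):
            continue                                # no Web sites (step 1100)
        kept.append(entity["unique_id"])
    return kept


# Hypothetical entity records and mapping database contents.
ENTITIES = [
    {"unique_id": "400000001", "primary_city": "Boston", "secondary_cities": ["Chicago"]},
    {"unique_id": "400000002", "primary_city": "Chicago", "secondary_cities": []},
    {"unique_id": "400000003", "primary_city": "Chicago", "secondary_cities": ["Boston"]},
]
MAPPING = {
    "400000001": ["www.one.example"],
    "400000002": ["www.two.example"],
    "400000003": [],
}
```

The `primary_only` flag in this sketch plays the role of the requirement selectable through box 700 of FIG. 7.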
The directory application or another application such as a search engine may keep track of the number of times a Web site is accessed (i.e., the site's popularity), and may sort the list of selected entities by the number of times the Web sites under the control of the respective entities have been accessed, or the category file may be structured to group entities by the number of times their Web sites have been accessed.
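Sorting by popularity may be sketched as below, assuming that access counts are kept per Web site and that an entity's popularity is the sum of the counts for its sites. The counts, the entity records, and the function `sort_by_popularity` are invented for illustration; summing across sites is one possible choice and is not specified in the description.

```python
# Hypothetical access counts per Web site.
ACCESS_COUNTS = {"www.a.example": 120, "www.b.example": 45, "www.c.example": 300}


def sort_by_popularity(entities, access_counts):
    """Order entities by total accesses of their Web sites, most accessed first."""
    def total(entity):
        return sum(access_counts.get(url, 0) for url in entity["urls"])
    return sorted(entities, key=total, reverse=True)


entities = [
    {"name": "Alpha Co.", "urls": ["www.a.example"]},
    {"name": "Beta Co.", "urls": ["www.b.example", "www.c.example"]},
    {"name": "Gamma Co.", "urls": []},
]
ranked = [e["name"] for e in sort_by_popularity(entities, ACCESS_COUNTS)]
```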
Multiple directory applications based on different category files may draw information from the same mapping database and the same entity information database, and thereby may be updated effectively instantaneously and simultaneously as the mapping and entity information databases are updated. For example, a first directory application may be provided to group entities by SIC codes as described above, and a second directory application may be provided to group entities by relationship to medical specialties or other professional specialties. In such a case, if the mapping database is updated to reflect that a medical office has made a change to the Web address of the Web site that is under the control of the medical office, the change is included in any views that are subsequently presented by the first and second directory applications and that list the medical office as a selected entity.
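The sharing of databases among directory applications may be sketched as follows. Each application object holds its own category file but only references to the shared tables, so that a change to the mapping table appears in any view produced afterward. The class `DirectoryApp`, the medical office record, and the addresses are hypothetical.

```python
class DirectoryApp:
    """A directory view generator that stores structure only, never contents."""

    def __init__(self, category_file, entity_info, mapping, attribute):
        self.category_file = category_file  # category name -> accepted values
        self.entity_info = entity_info      # shared: unique ID -> entity record
        self.mapping = mapping              # shared: unique ID -> Web addresses
        self.attribute = attribute          # entity field this directory groups by

    def view(self, category):
        """Compute the view from the shared tables at the time of the call."""
        wanted = set(self.category_file[category])
        return [
            (rec["name"], list(self.mapping.get(uid, [])))
            for uid, rec in sorted(self.entity_info.items())
            if rec.get(self.attribute) in wanted
        ]


entity_info = {
    "300000001": {"name": "Oak Street Medical Office", "sic": "801101",
                  "specialty": "pediatrics"},
}
mapping = {"300000001": ["www.oakstmedical.example"]}

by_sic = DirectoryApp({"Physicians": ["801101"]}, entity_info, mapping, "sic")
by_specialty = DirectoryApp({"Children's Health": ["pediatrics"]},
                            entity_info, mapping, "specialty")

before = by_sic.view("Physicians")
mapping["300000001"] = ["www.oakstreetmedical.example"]  # the office changes its address
after_sic = by_sic.view("Physicians")
after_specialty = by_specialty.view("Children's Health")
```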
Other information that may be used in a category file to group entities includes brands of goods or services that may be offered by or through entities, and information listed in FIGS. 8-9 that illustrate an example of information that may be stored in the entity information database for each entity.
The user may also be permitted, in accordance with an input mechanism such as box 702 of FIG. 7, to execute a Web site search limited to Web sites identified by the selected URLs. Such a search may be accomplished by directing search engine 22 (FIG. 2) to search Web page record database 24 and then filtering out of the search results any records that do not include at least one of the selected URLs. Alternatively, if the records in the Web page record database have been tagged with unique IDs corresponding to entities, such a search may be accomplished by retrieving, from the entity information database, the unique IDs for the selected entities (“selected unique IDs”), and directing the search engine to return only records that are tagged with one or more of the selected unique IDs.
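Both ways of limiting a search may be sketched as follows, with a simple substring match standing in for search engine 22. The Web page records, the placeholder unique IDs, and the function names are assumptions made for illustration.

```python
# Hypothetical Web page records, each tagged with the unique ID of its entity.
WEB_PAGE_RECORDS = [
    {"url": "www.nationalparks.org/camping.html", "text": "camping permits",
     "unique_id": "200000001"},
    {"url": "www.othersite.example/camping.html", "text": "camping gear",
     "unique_id": "200000009"},
    {"url": "www.nationalparks.org/maps.html", "text": "trail maps",
     "unique_id": "200000001"},
]


def search(term):
    """Stand-in for the search engine: return records whose text contains term."""
    return [r for r in WEB_PAGE_RECORDS if term in r["text"]]


def search_then_filter(term, selected_urls):
    """First approach: search everything, then drop records outside the selected URLs."""
    return [
        r["url"] for r in search(term)
        if any(r["url"].startswith(u) for u in selected_urls)
    ]


def search_by_tags(term, selected_ids):
    """Second approach: consider only records tagged with a selected unique ID."""
    return [
        r["url"] for r in WEB_PAGE_RECORDS
        if r["unique_id"] in selected_ids and term in r["text"]
    ]
```

The two approaches return the same pages in this sketch; the second avoids producing and discarding results for unselected entities.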
Any of many different types of computer equipment may be used. For example, one or more Intel-based personal computers may be used that run an SQL database on Linux and one or more programs written in Perl or the C programming language with interfaces to the SQL database.
The technique (i.e., the procedures described above) may be implemented in hardware or software, or a combination of both. In at least some cases, it is advantageous if the technique is implemented in computer programs executing on one or more programmable computers, such as a personal computer running or able to run an operating system such as Unix, Linux, Microsoft Windows 95, 98, or NT, or Macintosh OS, that each include a processor, a storage medium readable by the processor (including volatile and non-volatile memory and/or storage elements), at least one input device such as a keyboard, and at least one output device. Program code is applied to data entered using the input device to perform the technique described above and to generate output information. The output information is applied to one or more output devices such as a display screen of the computer.
In at least some cases, it is advantageous if each program is implemented in a high level procedural or object-oriented programming language such as Perl, C, C++, or Java to communicate with a computer system. However, the programs can be implemented in assembly or machine language, if desired. In any case, the language may be a compiled or interpreted language.
In at least some cases, it is advantageous if each such computer program is stored on a storage medium or device, such as ROM or optical or magnetic disc, that is readable by a general or special purpose programmable computer for configuring and operating the computer when the storage medium or device is read by the computer to perform the procedures described in this document. The system may also be considered to be implemented as a computer-readable storage medium, configured with a computer program, where the storage medium so configured causes a computer to operate in a specific and predefined manner.
Other embodiments are within the scope of the following claims. For example, the user may be a human being or a non-human entity such as a computer program or an automated device that may interact with one or more of the databases or one or more of the applications via an application programming interface (“API”) or a network message. An on-line information store or multiple databases may serve as the entity information database, which may take the form of any mechanism that provides automated access to information, such as a spreadsheet file or a store of email messages.

Claims (14)

What is claimed is:
1. A method of operating a search engine, comprising:
acquiring from a user of the search engine a search request, the search request including a set of criteria that defines a category of entities;
dynamically applying the set of criteria to identify an entity that meets the criteria;
determining that the entity is registered as having control over at least a portion of a World-Wide Web address; and
associating the at least a portion of a World-Wide Web address with the entity in a presentation to a user of the search engine that identifies the entity and indicates that the entity belongs to the category of entities.
2. The method of claim 1, wherein the set of criteria includes a geographical limitation.
3. The method of claim 1, further comprising:
including, in the presentation, a link to a set of computer data about the entity, the set of computer data including information other than information provided at a World-Wide Web site corresponding to the World-Wide Web address.
4. A method of operating a search engine, comprising:
maintaining a category file in the search engine; the category file defining a hierarchy of categories having a top level of categories and at least one lower level of subcategories and defining industry category codes that correspond to the subcategories;
presenting to a user of the search engine a top level view of categories that are ordered in accordance with the category file;
allowing the user to make a selection of a lowest level category of the categories, the selection of the lowest level category comprising at least part of a search request submitted by the user to the search engine;
determining an industry category code that corresponds to the lowest level category;
determining an entity that corresponds to the industry category code; and
determining a World-Wide Web address that corresponds to the entity.
5. The method of claim 4, further comprising:
presenting to the user of the search engine a listing of the entity together with the World-Wide Web address.
6. The method of claim 5, further comprising:
rejecting an entity that lacks a World-Wide Web site.
7. The method of claim 4, wherein said determining an entity step includes rejecting an entity that does not meet a set of criteria comprising a part of the search request submitted by the user.
8. The method of claim 7, wherein the set of criteria includes geographical criteria.
9. A search engine system comprising:
an acquirer that acquires from a user of the search engine system a set of criteria that defines a category of entities, the set of criteria comprising a part of a search request submitted by the user;
an applicator that dynamically applies the set of criteria to identify an entity that meets the criteria;
a determiner that determines that the entity is registered as having control over at least a portion of a World-Wide Web address; and
an associater that associates the at least a portion of a World-Wide Web address with the entity in a presentation that identifies the entity and indicates that the entity belongs to the category of entities.
10. A search engine system comprising:
a maintainer that maintains a category file in the search engine system; the category file defining a hierarchy of categories having a top level of categories and at least one lower level of subcategories and defining industry category codes that correspond to the subcategories;
a presenter that presents a top level view of categories that are ordered in accordance with the category file;
an allower that allows a selection of a lowest level category of the hierarchy of categories by a user of the search engine, the selection of the lowest level category comprising at least part of a search request submitted by the user;
a determiner that determines, in response to the user's search request, an industry category code that corresponds to the lowest level category selected by the user of the search engine;
a determiner that determines an entity that corresponds to the industry category code; and
a determiner that determines a World-Wide Web address that corresponds to the entity.
11. Computer software, residing on a computer-readable storage medium, comprising a set of instructions for use in a search engine computer system to cause the system to:
acquire from a user of the search engine a search request, the search request including a set of criteria that defines a category of entities;
dynamically apply the set of criteria to identify an entity that meets the criteria;
determine that the entity is registered as having control over at least a portion of a World-Wide Web address; and
associate the at least a portion of a World-Wide Web address with the entity in a presentation to a user of the search engine that identifies the entity and indicates that the entity belongs to the category of entities.
12. Computer software, residing on a computer-readable storage medium, comprising a set of instructions for use in a search engine computer system to cause the system to:
maintain a category file in the search engine; the category file defining a hierarchy of categories having a top level of categories and at least one lower level of subcategories, and defining industry category codes that correspond to the subcategories;
present to a user of the search engine a top level view of categories that are ordered in accordance with the category file;
allow the user to make a selection of a lowest level category of the categories, the selection of the lowest level category comprising at least part of a search request submitted by the user;
determine an industry category code that corresponds to the lowest level category;
determine an entity that corresponds to the industry category code; and
determine a World-Wide Web address that corresponds to the entity.
13. A search engine system comprising:
a computer;
a storage device for storing data on a storage medium;
a first logic system configured to acquire from a user of the search engine a search request, the search request including a set of criteria that defines a category of entities;
a second logic system configured to dynamically apply the set of criteria to identify an entity that meets the criteria;
a third logic system configured to determine that the entity is registered as having control over at least a portion of a World-Wide Web address; and
a fourth logic system configured to associate the at least a portion of a World-Wide Web address with the entity in a presentation to a user of the search engine that identifies the entity and indicates that the entity belongs to the category of entities.
14. A search engine system comprising:
a computer;
a storage device for storing data on a storage medium;
a first logic system configured to maintain a category file in the search engine system; the category file defining a hierarchy of categories having a top level of categories and at least one lower level of subcategories and defining industry category codes that correspond to the subcategories;
a second logic system configured to present a top level view of categories that are ordered in accordance with the category file;
a third logic system configured to allow a selection of a lowest level category of the hierarchy of categories by a user of the search engine, the selection of the lowest level category comprising at least part of a search request submitted by the user;
a fourth logic system configured to determine an industry category code that corresponds to the lowest level category selected by the user of the search engine;
a fifth logic system configured to determine an entity that corresponds to the industry category code; and
a sixth logic system configured to determine a World-Wide Web address that corresponds to the entity.
US09/374,175 1998-08-17 1999-08-13 Dynamically categorizing entity information Expired - Lifetime US6654813B1 (en)

Priority Applications (2)

Application Number Priority Date Filing Date Title
US09/374,175 US6654813B1 (en) 1998-08-17 1999-08-13 Dynamically categorizing entity information
US10/693,667 US7398266B2 (en) 1998-08-17 2003-10-24 Dynamically categorizing entity information

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
US9702998P 1998-08-17 1998-08-17
US09/374,175 US6654813B1 (en) 1998-08-17 1999-08-13 Dynamically categorizing entity information

Related Child Applications (1)

Application Number Title Priority Date Filing Date
US10/693,667 Continuation US7398266B2 (en) 1998-08-17 2003-10-24 Dynamically categorizing entity information

Publications (1)

Publication Number Publication Date
US6654813B1 (en) 2003-11-25

Family

ID=22260433

Family Applications (2)

Application Number Title Priority Date Filing Date
US09/374,175 Expired - Lifetime US6654813B1 (en) 1998-08-17 1999-08-13 Dynamically categorizing entity information
US10/693,667 Expired - Lifetime US7398266B2 (en) 1998-08-17 2003-10-24 Dynamically categorizing entity information

Family Applications After (1)

Application Number Title Priority Date Filing Date
US10/693,667 Expired - Lifetime US7398266B2 (en) 1998-08-17 2003-10-24 Dynamically categorizing entity information

Country Status (5)

Country Link
US (2) US6654813B1 (en)
EP (1) EP1105818A1 (en)
JP (2) JP2002522847A (en)
AU (4) AU5565999A (en)
WO (4) WO2000010106A1 (en)

Cited By (42)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20010036224A1 (en) * 2000-02-07 2001-11-01 Aaron Demello System and method for the delivery of targeted data over wireless networks
US20010047254A1 (en) * 2000-03-14 2001-11-29 Hawley Rising Application of category theory and cognitive science to design of semantic descriptions for content data
US20020027568A1 (en) * 2000-09-06 2002-03-07 Nec Corporation Access method and system using small number of identifiers
US20020031269A1 (en) * 2000-09-08 2002-03-14 Nec Corporation System, method and program for discriminating named entity
US20030061232A1 (en) * 2001-09-21 2003-03-27 Dun & Bradstreet Inc. Method and system for processing business data
US20040267727A1 (en) * 1998-08-17 2004-12-30 Black Jeffrey Dean Dynamically categorizing entity information
US20050015394A1 (en) * 2001-11-30 2005-01-20 Mckeeth Jim Method and system for updating a search engine
US6917978B1 (en) * 1999-10-26 2005-07-12 Fujitsu Limited Network system having function of retrieving information, network terminal device having function of retrieving information, and network relay device having function of retrieving information
US6968380B1 (en) 2000-05-30 2005-11-22 International Business Machines Corporation Method and system for increasing ease-of-use and bandwidth utilization in wireless devices
US6983379B1 (en) 2000-06-30 2006-01-03 Hitwise Pty. Ltd. Method and system for monitoring online behavior at a remote site and creating online behavior profiles
US6985933B1 (en) * 2000-05-30 2006-01-10 International Business Machines Corporation Method and system for increasing ease-of-use and bandwidth utilization in wireless devices
US20060015401A1 (en) * 2004-07-15 2006-01-19 Chu Barry H Efficiently spaced and used advertising in network-served multimedia documents
US7080101B1 (en) * 2000-12-01 2006-07-18 Ncr Corp. Method and apparatus for partitioning data for storage in a database
US20070100836A1 (en) * 2005-10-28 2007-05-03 Yahoo! Inc. User interface for providing third party content as an RSS feed
US20070100960A1 (en) * 2005-10-28 2007-05-03 Yahoo! Inc. Managing content for RSS alerts over a network
US20090011744A1 (en) * 2007-07-07 2009-01-08 Qualcomm Incorporated Method and system for delivery of targeted information based on a user profile in a mobile communication device
US20090094137A1 (en) * 2005-12-22 2009-04-09 Toppenberg Larry W Web Page Optimization Systems
US7523085B2 (en) 2004-09-30 2009-04-21 Buzzmetrics, Ltd An Israel Corporation Topical sentiments in electronically stored communications
US20090125462A1 (en) * 2007-11-14 2009-05-14 Qualcomm Incorporated Method and system using keyword vectors and associated metrics for learning and prediction of user correlation of targeted content messages in a mobile environment
US20090182873A1 (en) * 2000-06-30 2009-07-16 Hitwise Pty, Ltd Method and system for monitoring online computer network behavior and creating online behavior profiles
US7600017B2 (en) 2000-10-11 2009-10-06 Buzzmetrics, Ltd. System and method for scoring electronic messages
US20090327223A1 (en) * 2008-06-26 2009-12-31 Microsoft Corporation Query-driven web portals
US7660783B2 (en) 2006-09-27 2010-02-09 Buzzmetrics, Inc. System and method of ad-hoc analysis of data
US7725414B2 (en) 2004-03-16 2010-05-25 Buzzmetrics, Ltd An Israel Corporation Method for developing a classifier for classifying communications
US7747713B1 (en) * 2000-06-30 2010-06-29 Hitwise Pty. Ltd. Method and system for classifying information available on a computer network
US7844483B2 (en) 2000-10-11 2010-11-30 Buzzmetrics, Ltd. System and method for predicting external events from electronic author activity
US8271316B2 (en) 1999-12-17 2012-09-18 Buzzmetrics Ltd Consumer to business data capturing system
US8347326B2 (en) 2007-12-18 2013-01-01 The Nielsen Company (US) Identifying key media events and modeling causal relationships between key events and reported feelings
US8484186B1 (en) 2010-11-12 2013-07-09 Consumerinfo.Com, Inc. Personalized people finder
US8874727B2 (en) 2010-05-31 2014-10-28 The Nielsen Company (Us), Llc Methods, apparatus, and articles of manufacture to rank users in an online social network
US9158855B2 (en) 2005-06-16 2015-10-13 Buzzmetrics, Ltd Extracting structured data from weblogs
US20150370918A1 (en) * 2014-06-20 2015-12-24 Cotap, Inc. Directory Generation and Messaging
US9392074B2 (en) 2007-07-07 2016-07-12 Qualcomm Incorporated User profile generation architecture for mobile content-message targeting
US9391789B2 (en) 2007-12-14 2016-07-12 Qualcomm Incorporated Method and system for multi-level distribution information cache management in a mobile environment
US10191982B1 (en) * 2009-01-23 2019-01-29 Zakata, LLC Topical search portal
US10380113B2 (en) * 2000-07-20 2019-08-13 Rodney D. Johnson Information archival and retrieval system for internetworked computers
US10482513B1 (en) 2003-09-02 2019-11-19 Vinimaya, Llc Methods and systems for integrating procurement systems with electronic catalogs
US10528574B2 (en) 2009-01-23 2020-01-07 Zakta, LLC Topical trust network
US10643178B1 (en) 2017-06-16 2020-05-05 Coupa Software Incorporated Asynchronous real-time procurement system
US10861069B2 (en) 2010-12-02 2020-12-08 Coupa Software Incorporated Methods and systems to maintain, check, report, and audit contract and historical pricing in electronic procurement
US20220222277A1 (en) * 2021-01-12 2022-07-14 Tealbook Inc. System and method for data profiling
US11860954B1 (en) 2009-01-23 2024-01-02 Zakta, LLC Collaboratively finding, organizing and/or accessing information

Families Citing this family (10)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US6735585B1 (en) 1998-08-17 2004-05-11 Altavista Company Method for search engine generating supplemented search not included in conventional search result identifying entity data related to portion of located web page
NL1016379C2 (en) * 2000-07-25 2002-01-28 Alphonsus Albertus Schirris Information searching method for e.g. internet, uses synonyms or translations of inputted search terms
AUPQ920300A0 (en) * 2000-08-04 2000-08-31 Sharinga Networks Inc. Network address resolution
US7254573B2 (en) * 2002-10-02 2007-08-07 Burke Thomas R System and method for identifying alternate contact information in a database related to entity, query by identifying contact information of a different type than was in query which is related to the same entity
US7792828B2 (en) 2003-06-25 2010-09-07 Jericho Systems Corporation Method and system for selecting content items to be presented to a viewer
NO20035563D0 (en) * 2003-10-01 2003-12-12 Telenor Asa Method and system for obtaining improved subscriber information
US7536382B2 (en) * 2004-03-31 2009-05-19 Google Inc. Query rewriting with entity detection
US7624101B2 (en) 2006-01-31 2009-11-24 Google Inc. Enhanced search results
US20080313142A1 (en) * 2007-06-14 2008-12-18 Microsoft Corporation Categorization of queries
KR100930617B1 (en) * 2008-04-08 2009-12-09 한국과학기술정보연구원 Multiple object-oriented integrated search system and method

Citations (9)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
WO1997015018A1 (en) 1995-10-16 1997-04-24 Bell Communications Research, Inc. Method and system for providing uniform access to heterogeneous information
WO1997029414A2 (en) 1996-02-09 1997-08-14 At & T Corp. Method and apparatus for passively browsing the internet
US5905862A (en) * 1996-09-04 1999-05-18 Intel Corporation Automatic web site registration with multiple search engines
US5933827A (en) * 1996-09-25 1999-08-03 International Business Machines Corporation System for identifying new web pages of interest to a user
US5974455A (en) * 1995-12-13 1999-10-26 Digital Equipment Corporation System for adding new entry to web page table upon receiving web page including link to another web page not having corresponding entry in web page table
US6085229A (en) * 1998-05-14 2000-07-04 Belarc, Inc. System and method for providing client side personalization of content of web pages and the like
US6141759A (en) * 1997-12-10 2000-10-31 Bmc Software, Inc. System and architecture for distributing, monitoring, and managing information requests on a computer network
US6195657B1 (en) * 1996-09-26 2001-02-27 Imana, Inc. Software, method and apparatus for efficient categorization and recommendation of subjects according to multidimensional semantics
US6401118B1 (en) * 1998-06-30 2002-06-04 Online Monitoring Services Method and computer program product for an online monitoring search engine

Family Cites Families (7)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JPH09311873A (en) * 1996-01-11 1997-12-02 Sony Corp Information providing data structure, information providing method, and information receiving terminal
US5974572A (en) * 1996-10-15 1999-10-26 Mercury Interactive Corporation Software system and methods for generating a load test using a server access log
US5958008A (en) * 1996-10-15 1999-09-28 Mercury Interactive Corporation Software system and associated methods for scanning and mapping dynamically-generated web documents
EP1016009A4 (en) * 1996-10-25 2002-06-05 Ipf Inc System and method for managing and serving consumer product related information over the internet
US6151624A (en) * 1998-02-03 2000-11-21 Realnames Corporation Navigating network resources based on metadata
US6735585B1 (en) * 1998-08-17 2004-05-11 Altavista Company Method for search engine generating supplemented search not included in conventional search result identifying entity data related to portion of located web page
US6654813B1 (en) * 1998-08-17 2003-11-25 Alta Vista Company Dynamically categorizing entity information

Patent Citations (10)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
WO1997015018A1 (en) 1995-10-16 1997-04-24 Bell Communications Research, Inc. Method and system for providing uniform access to heterogeneous information
US5974455A (en) * 1995-12-13 1999-10-26 Digital Equipment Corporation System for adding new entry to web page table upon receiving web page including link to another web page not having corresponding entry in web page table
WO1997029414A2 (en) 1996-02-09 1997-08-14 At & T Corp. Method and apparatus for passively browsing the internet
US6119135A (en) * 1996-02-09 2000-09-12 At&T Corporation Method for passively browsing the internet using images extracted from web pages
US5905862A (en) * 1996-09-04 1999-05-18 Intel Corporation Automatic web site registration with multiple search engines
US5933827A (en) * 1996-09-25 1999-08-03 International Business Machines Corporation System for identifying new web pages of interest to a user
US6195657B1 (en) * 1996-09-26 2001-02-27 Imana, Inc. Software, method and apparatus for efficient categorization and recommendation of subjects according to multidimensional semantics
US6141759A (en) * 1997-12-10 2000-10-31 Bmc Software, Inc. System and architecture for distributing, monitoring, and managing information requests on a computer network
US6085229A (en) * 1998-05-14 2000-07-04 Belarc, Inc. System and method for providing client side personalization of content of web pages and the like
US6401118B1 (en) * 1998-06-30 2002-06-04 Online Monitoring Services Method and computer program product for an online monitoring search engine

Non-Patent Citations (1)

* Cited by examiner, † Cited by third party
Title
PCT International Search Report Corresponding To PCT International Application No. PCT/US99/18646; Authorized Officer: R. Katerbau; Date of Completion: Dec. 21, 1999; Date of Mailing: Jan. 11, 2000 (3 pages).

Cited By (76)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20040267727A1 (en) * 1998-08-17 2004-12-30 Black Jeffrey Dean Dynamically categorizing entity information
US7398266B2 (en) * 1998-08-17 2008-07-08 Overture Services, Inc. Dynamically categorizing entity information
US6917978B1 (en) * 1999-10-26 2005-07-12 Fujitsu Limited Network system having function of retrieving information, network terminal device having function of retrieving information, and network relay device having function of retrieving information
US8271316B2 (en) 1999-12-17 2012-09-18 Buzzmetrics Ltd Consumer to business data capturing system
US20010036224A1 (en) * 2000-02-07 2001-11-01 Aaron Demello System and method for the delivery of targeted data over wireless networks
US7319951B2 (en) * 2000-03-14 2008-01-15 Sony Corporation Application of category theory and cognitive science to design of semantic descriptions for content data
US20010047254A1 (en) * 2000-03-14 2001-11-29 Hawley Rising Application of category theory and cognitive science to design of semantic descriptions for content data
US6985933B1 (en) * 2000-05-30 2006-01-10 International Business Machines Corporation Method and system for increasing ease-of-use and bandwidth utilization in wireless devices
US6968380B1 (en) 2000-05-30 2005-11-22 International Business Machines Corporation Method and system for increasing ease-of-use and bandwidth utilization in wireless devices
US7996912B2 (en) 2000-06-30 2011-08-09 Hitwise Pty. Ltd. Method and system for monitoring online computer network behavior and creating online behavior profiles
US7971260B2 (en) 2000-06-30 2011-06-28 Hitwise Pty. Ltd. Matching session records of network users with corresponding transaction data
US7725944B2 (en) 2000-06-30 2010-05-25 Hitwise Pty. Ltd. Matching session records of network users with corresponding transaction data
US7747713B1 (en) * 2000-06-30 2010-06-29 Hitwise Pty. Ltd. Method and system for classifying information available on a computer network
US20060070117A1 (en) * 2000-06-30 2006-03-30 Hitwise Pty. Ltd. Method and system for monitoring online behavior at a remote site and creating online behavior profiles
US8321952B2 (en) 2000-06-30 2012-11-27 Hitwise Pty. Ltd. Method and system for monitoring online computer network behavior and creating online behavior profiles
US20100198705A1 (en) * 2000-06-30 2010-08-05 Jan-Dieter Spalink Matching session records of network users with corresponding transaction data
US6983379B1 (en) 2000-06-30 2006-01-03 Hitwise Pty. Ltd. Method and system for monitoring online behavior at a remote site and creating online behavior profiles
US20080098220A1 (en) * 2000-06-30 2008-04-24 Hitwise Pty. Ltd. Method and system for monitoring online behavior at a remote site and creating online behavior profiles
US20090182873A1 (en) * 2000-06-30 2009-07-16 Hitwise Pty, Ltd Method and system for monitoring online computer network behavior and creating online behavior profiles
US7360251B2 (en) 2000-06-30 2008-04-15 Hitwise Pty, Ltd. Method and system for monitoring online behavior at a remote site and creating online behavior profiles
US10380113B2 (en) * 2000-07-20 2019-08-13 Rodney D. Johnson Information archival and retrieval system for internetworked computers
US20020027568A1 (en) * 2000-09-06 2002-03-07 Nec Corporation Access method and system using small number of identifiers
US7093014B2 (en) * 2000-09-06 2006-08-15 Nec Corporation Access method and system with restricted number of address identifiers in domain areas for identifying server addresses
US20020031269A1 (en) * 2000-09-08 2002-03-14 Nec Corporation System, method and program for discriminating named entity
US6975766B2 (en) * 2000-09-08 2005-12-13 Nec Corporation System, method and program for discriminating named entity
US7844484B2 (en) 2000-10-11 2010-11-30 Buzzmetrics, Ltd. System and method for benchmarking electronic message activity
US7844483B2 (en) 2000-10-11 2010-11-30 Buzzmetrics, Ltd. System and method for predicting external events from electronic author activity
US7600017B2 (en) 2000-10-11 2009-10-06 Buzzmetrics, Ltd. System and method for scoring electronic messages
US20110161270A1 (en) * 2000-10-11 2011-06-30 Arnett Nicholas D System and method for analyzing electronic message activity
US7080101B1 (en) * 2000-12-01 2006-07-18 Ncr Corp. Method and apparatus for partitioning data for storage in a database
US20030061232A1 (en) * 2001-09-21 2003-03-27 Dun & Bradstreet Inc. Method and system for processing business data
US7627568B2 (en) * 2001-11-30 2009-12-01 Micron Technology, Inc. Method and system for updating a search engine database based on popularity of links
US20100057802A1 (en) * 2001-11-30 2010-03-04 Micron Technology, Inc. Method and system for updating a search engine
US20050015394A1 (en) * 2001-11-30 2005-01-20 Mckeeth Jim Method and system for updating a search engine
US8832085B2 (en) 2001-11-30 2014-09-09 Round Rock Research, Llc Method and system for updating a search engine
US7979427B2 (en) 2001-11-30 2011-07-12 Round Rock Research, Llc Method and system for updating a search engine
US10482513B1 (en) 2003-09-02 2019-11-19 Vinimaya, Llc Methods and systems for integrating procurement systems with electronic catalogs
US7725414B2 (en) 2004-03-16 2010-05-25 Buzzmetrics, Ltd An Israel Corporation Method for developing a classifier for classifying communications
US20060015401A1 (en) * 2004-07-15 2006-01-19 Chu Barry H Efficiently spaced and used advertising in network-served multimedia documents
US7877345B2 (en) 2004-09-30 2011-01-25 Buzzmetrics, Ltd. Topical sentiments in electronically stored communications
US7523085B2 (en) 2004-09-30 2009-04-21 Buzzmetrics, Ltd An Israel Corporation Topical sentiments in electronically stored communications
US8041669B2 (en) 2004-09-30 2011-10-18 Buzzmetrics, Ltd. Topical sentiments in electronically stored communications
US11556598B2 (en) 2005-06-16 2023-01-17 Buzzmetrics, Ltd. Extracting structured data from weblogs
US10180986B2 (en) 2005-06-16 2019-01-15 Buzzmetrics, Ltd. Extracting structured data from weblogs
US9158855B2 (en) 2005-06-16 2015-10-13 Buzzmetrics, Ltd Extracting structured data from weblogs
US20070100836A1 (en) * 2005-10-28 2007-05-03 Yahoo! Inc. User interface for providing third party content as an RSS feed
US20070100960A1 (en) * 2005-10-28 2007-05-03 Yahoo! Inc. Managing content for RSS alerts over a network
US20090094137A1 (en) * 2005-12-22 2009-04-09 Toppenberg Larry W Web Page Optimization Systems
US7660783B2 (en) 2006-09-27 2010-02-09 Buzzmetrics, Inc. System and method of ad-hoc analysis of data
US9596317B2 (en) 2007-07-07 2017-03-14 Qualcomm Incorporated Method and system for delivery of targeted information based on a user profile in a mobile communication device
US20090013051A1 (en) * 2007-07-07 2009-01-08 Qualcomm Incorporated Method for transfer of information related to targeted content messages through a proxy server
US20090011744A1 (en) * 2007-07-07 2009-01-08 Qualcomm Incorporated Method and system for delivery of targeted information based on a user profile in a mobile communication device
US9497286B2 (en) 2007-07-07 2016-11-15 Qualcomm Incorporated Method and system for providing targeted information based on a user profile in a mobile environment
US9485322B2 (en) 2007-07-07 2016-11-01 Qualcomm Incorporated Method and system for providing targeted information using profile attributes with variable confidence levels in a mobile environment
US9398113B2 (en) * 2007-07-07 2016-07-19 Qualcomm Incorporated Methods and systems for providing targeted information using identity masking in a wireless communications device
US9392074B2 (en) 2007-07-07 2016-07-12 Qualcomm Incorporated User profile generation architecture for mobile content-message targeting
US20090125462A1 (en) * 2007-11-14 2009-05-14 Qualcomm Incorporated Method and system using keyword vectors and associated metrics for learning and prediction of user correlation of targeted content messages in a mobile environment
US9203912B2 (en) 2007-11-14 2015-12-01 Qualcomm Incorporated Method and system for message value calculation in a mobile environment
US9705998B2 (en) 2007-11-14 2017-07-11 Qualcomm Incorporated Method and system using keyword vectors and associated metrics for learning and prediction of user correlation of targeted content messages in a mobile environment
US9203911B2 (en) 2007-11-14 2015-12-01 Qualcomm Incorporated Method and system for using a cache miss state match indicator to determine user suitability of targeted content messages in a mobile environment
US9391789B2 (en) 2007-12-14 2016-07-12 Qualcomm Incorporated Method and system for multi-level distribution information cache management in a mobile environment
US8347326B2 (en) 2007-12-18 2013-01-01 The Nielsen Company (US) Identifying key media events and modeling causal relationships between key events and reported feelings
US8793715B1 (en) 2007-12-18 2014-07-29 The Nielsen Company (Us), Llc Identifying key media events and modeling causal relationships between key events and reported feelings
US20090327223A1 (en) * 2008-06-26 2009-12-31 Microsoft Corporation Query-driven web portals
US11860954B1 (en) 2009-01-23 2024-01-02 Zakta, LLC Collaboratively finding, organizing and/or accessing information
US11250076B1 (en) 2009-01-23 2022-02-15 Zakta Llc Topical search portal
US10191982B1 (en) * 2009-01-23 2019-01-29 Zakta, LLC Topical search portal
US10528574B2 (en) 2009-01-23 2020-01-07 Zakta, LLC Topical trust network
US9455891B2 (en) 2010-05-31 2016-09-27 The Nielsen Company (Us), Llc Methods, apparatus, and articles of manufacture to determine a network efficacy
US8874727B2 (en) 2010-05-31 2014-10-28 The Nielsen Company (Us), Llc Methods, apparatus, and articles of manufacture to rank users in an online social network
US8484186B1 (en) 2010-11-12 2013-07-09 Consumerinfo.Com, Inc. Personalized people finder
US10861069B2 (en) 2010-12-02 2020-12-08 Coupa Software Incorporated Methods and systems to maintain, check, report, and audit contract and historical pricing in electronic procurement
US10015125B2 (en) * 2014-06-20 2018-07-03 Zinc, Inc. Directory generation and messaging
US20150370918A1 (en) * 2014-06-20 2015-12-24 Cotap, Inc. Directory Generation and Messaging
US10643178B1 (en) 2017-06-16 2020-05-05 Coupa Software Incorporated Asynchronous real-time procurement system
US20220222277A1 (en) * 2021-01-12 2022-07-14 Tealbook Inc. System and method for data profiling

Also Published As

Publication number Publication date
EP1105818A1 (en) 2001-06-13
US20040267727A1 (en) 2004-12-30
AU5565999A (en) 2000-03-06
WO2000010106A1 (en) 2000-02-24
WO2000010108A1 (en) 2000-02-24
AU5566199A (en) 2000-03-06
JP2002522847A (en) 2002-07-23
JP5171927B2 (en) 2013-03-27
WO2000010107A1 (en) 2000-02-24
US7398266B2 (en) 2008-07-08
WO2000010105A1 (en) 2000-02-24
AU5565899A (en) 2000-03-06
JP2011100461A (en) 2011-05-19
AU5566099A (en) 2000-03-06

Similar Documents

Publication Publication Date Title
US6654813B1 (en) Dynamically categorizing entity information
US6735585B1 (en) Method for search engine generating supplemented search not included in conventional search result identifying entity data related to portion of located web page
KR100478019B1 (en) Method and system for generating a search result list based on local information
US6604113B1 (en) Method and apparatus for providing account information
US5832497A (en) Electronic automated information exchange and management system
US7376569B2 (en) Apparatus and method for providing compensation information
US7620725B2 (en) Metadata collection within a trusted relationship to increase search relevance
CN101044478B (en) Analyzing a query log for use in managing category-specific electronic content
US8108383B2 (en) Enhanced search results
JP5378364B2 (en) Credit card transaction data classification system and method
US6324566B1 (en) Internet advertising via bookmark set based on client specific information
JP5008979B2 (en) A system with an interactive graphical interface for delivering planning and consulting materials, surveys, and compliance information on taxes or other forms
US20090119268A1 (en) Method and system for crawling, mapping and extracting information associated with a business using heuristic and semantic analysis
US20100063925A1 (en) Method and system for performing legal services over the internet
US20120246139A1 (en) System and method for resume, yearbook and report generation based on webcrawling and specialized data collection
US6405223B1 (en) System for personal storage of different web source versions
JP2009271911A (en) Method and system for symbolical linkage and intelligent categorization of information
US20090222440A1 (en) Search engine for carrying out a location-dependent search
US7630959B2 (en) System and method for processing database queries
US20090254427A1 (en) Internet directory and method
US20060178889A1 (en) Method and system for performing electronic commerce
US8561019B2 (en) System and method for data abstraction using formatted system variables
KR100909561B1 (en) System for generating a search result list based on local information
Heery et al. Metadata
KR20010094228A (en) Fitted multi-searching system for daily information

Legal Events

Date Code Title Description
AS Assignment

Owner name: IATLAS CORPORATION, MARYLAND

Free format text: ASSIGNMENT OF ASSIGNORS INTEREST;ASSIGNORS:BLACK, JEFFERY DEAN;TITUS, JASON HARVEY;WOODHEAD, IRA JOSEPH;REEL/FRAME:010219/0319

Effective date: 19990902

AS Assignment

Owner name: ALTA VISTA COMPANY, CALIFORNIA

Free format text: MERGER;ASSIGNOR:IATLAS CORPORATION;REEL/FRAME:013714/0227

Effective date: 19991022

AS Assignment

Owner name: OVERTURE SERVICES, INC., CALIFORNIA

Free format text: ASSIGNMENT OF ASSIGNORS INTEREST;ASSIGNOR:ALTA VISTA COMPANY;REEL/FRAME:014546/0785

Effective date: 20030918

STCF Information on status: patent grant

Free format text: PATENTED CASE

AS Assignment

Owner name: OVERTURE SERVICES INC., CALIFORNIA

Free format text: ASSIGNMENT OF ASSIGNORS INTEREST;ASSIGNOR:ALTA VISTA COMPANY;REEL/FRAME:015271/0635

Effective date: 20030918

Owner name: OVERTURE SERVICES, INC., CALIFORNIA

Free format text: ASSIGNMENT OF ASSIGNORS INTEREST;ASSIGNOR:ALTA VISTA COMPANY;REEL/FRAME:015271/0623

Effective date: 20030918

Owner name: OVERTURE SERVICES, INC., CALIFORNIA

Free format text: ASSIGNMENT OF ASSIGNORS INTEREST;ASSIGNOR:ALTA VISTA COMPANY;REEL/FRAME:015271/0629

Effective date: 20030918

FEPP Fee payment procedure

Free format text: PAT HOLDER NO LONGER CLAIMS SMALL ENTITY STATUS, ENTITY STATUS SET TO UNDISCOUNTED (ORIGINAL EVENT CODE: STOL); ENTITY STATUS OF PATENT OWNER: LARGE ENTITY

REFU Refund

Free format text: REFUND - SURCHARGE FOR LATE PAYMENT, SMALL ENTITY (ORIGINAL EVENT CODE: R2554); ENTITY STATUS OF PATENT OWNER: LARGE ENTITY

Free format text: REFUND - SURCHARGE, PETITION TO ACCEPT PYMT AFTER EXP, UNINTENTIONAL (ORIGINAL EVENT CODE: R2551); ENTITY STATUS OF PATENT OWNER: LARGE ENTITY

REMI Maintenance fee reminder mailed

FPAY Fee payment

Year of fee payment: 4

SULP Surcharge for late payment

AS Assignment

Owner name: YAHOO! INC, CALIFORNIA

Free format text: MERGER;ASSIGNOR:OVERTURE SERVICES, INC;REEL/FRAME:021652/0654

Effective date: 20081001

FPAY Fee payment

Year of fee payment: 8

FEPP Fee payment procedure

Free format text: PAYOR NUMBER ASSIGNED (ORIGINAL EVENT CODE: ASPN); ENTITY STATUS OF PATENT OWNER: LARGE ENTITY

Free format text: PAYER NUMBER DE-ASSIGNED (ORIGINAL EVENT CODE: RMPN); ENTITY STATUS OF PATENT OWNER: LARGE ENTITY

FPAY Fee payment

Year of fee payment: 12

AS Assignment

Owner name: EXCALIBUR IP, LLC, CALIFORNIA

Free format text: ASSIGNMENT OF ASSIGNORS INTEREST;ASSIGNOR:YAHOO! INC.;REEL/FRAME:038383/0466

Effective date: 20160418

AS Assignment

Owner name: YAHOO! INC., CALIFORNIA

Free format text: ASSIGNMENT OF ASSIGNORS INTEREST;ASSIGNOR:EXCALIBUR IP, LLC;REEL/FRAME:038951/0295

Effective date: 20160531

AS Assignment

Owner name: EXCALIBUR IP, LLC, CALIFORNIA

Free format text: ASSIGNMENT OF ASSIGNORS INTEREST;ASSIGNOR:YAHOO! INC.;REEL/FRAME:038950/0592

Effective date: 20160531

AS Assignment

Owner name: R2 SOLUTIONS LLC, TEXAS

Free format text: ASSIGNMENT OF ASSIGNORS INTEREST;ASSIGNOR:EXCALIBUR IP, LLC;REEL/FRAME:055283/0483

Effective date: 20200428