WO2005114379A2 - Personalized search engine - Google Patents

Info

Publication number
WO2005114379A2
Authority
WO
WIPO (PCT)
Prior art keywords
user
search
user profile
information
documents
Prior art date
Application number
PCT/US2005/017207
Other languages
French (fr)
Other versions
WO2005114379A3 (en)
Inventor
William Gross
Tom Mcgovern
Steven Lee Colwell
Original Assignee
Perfect Market Technologies, Inc.
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Perfect Market Technologies, Inc.
Publication of WO2005114379A2
Publication of WO2005114379A3

Classifications

    • G PHYSICS
    • G06 COMPUTING; CALCULATING OR COUNTING
    • G06F ELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00 Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/90 Details of database functions independent of the retrieved data types
    • G06F16/95 Retrieval from the web
    • G06F16/953 Querying, e.g. by the use of web search engines
    • G06F16/9535 Search customisation based on user profiles and personalisation
    • G PHYSICS
    • G06 COMPUTING; CALCULATING OR COUNTING
    • G06F ELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00 Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/30 Information retrieval; Database structures therefor; File system structures therefor of unstructured textual data
    • G06F16/33 Querying
    • G06F16/335 Filtering based on additional data, e.g. user or group profiles
    • G PHYSICS
    • G06 COMPUTING; CALCULATING OR COUNTING
    • G06F ELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00 Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/90 Details of database functions independent of the retrieved data types
    • G06F16/95 Retrieval from the web
    • G06F16/953 Querying, e.g. by the use of web search engines
    • G06F16/9538 Presentation of query results

Definitions

  • the present invention relates generally to an information retrieval application, and more specifically to a search engine for searching information on computer networks based on a combination of the user's query and information the user provides or the device discerns about the user.
  • search engines capable of searching computer networks for documents of interest, and generating a list of relevant documents ("search results") based on the search engine's determination of relationships between the user's query and characteristics of the documents.
  • search engines typically present the search results by sorting the results based on the search engines' determination of relevance of a document to the query.
  • the results are inherently limited by the specific terms provided by the user and the user's ability to accurately construct the query such that the terms specify the user's intent.
  • FIG. 1 is a flowchart illustrating the operation of an exemplary search process whereby the search engine utilizes the user's personalized profile, or digital signature, to determine relevance of documents
  • FIG. 2 is a flowchart illustrating the creation of the digital signature based on information declared by and observed of the user
  • FIG. 3 is a schematic diagram illustrating the components of the exemplary personalized search application capable of using the apparatus of FIG. 1
  • FIG. 4 is a schematic diagram illustrating select information that would be stored in the personal signature of the user;
  • FIG. 5 is a schematic diagram illustrating the processing of the search query and postprocessing results based on the signature
  • FIG. 6 is a schematic diagram illustrating the processing of the search query together with the signature to provide the user search results.
  • computer network is used to refer to a system of interconnected devices, including without limitation, user-accessible server sites, peer to peer networks, the Internet as well as intranets and local area networks.
  • site is used to refer to server sites that implement current or future World Wide Web standards for the coding and transmission of hypertext documents. These standards currently include HTML (the Hypertext Markup Language), HTTP (the Hypertext Transfer Protocol), and asynchronous protocols.
  • the term "site" is not intended to imply a single geographic location, as a web or other network site can, for example, include multiple geographically distributed computer systems that are appropriately linked together.
  • other networks or hypermedia databases such as networked interactive televisions, and other present or future protocols may be used as well.
  • HDML Handheld Device Markup Language
  • WAP Wireless Application Protocol
  • WML wireless markup language
  • XML Extensible Markup Language
  • the functions described herein are performed by programs including executable code or instructions running on one or more network-enabled devices, including, without limitation, general-purpose computers, cellular phones, PDAs, and other present or future devices.
  • the devices may include one or more central processing units for executing program code, volatile memory, such as random access memory (RAM) for temporarily storing data and data structures during program execution, non-volatile memory, such as a hard disk storage or optical storage, for storing programs and data, including databases, and a network interface for accessing an intranet and/or the Internet.
  • the functions described herein may also be implemented using special purpose computers, state machines, and/or hardwired electronic circuits.
  • search engine is defined broadly, and includes, in addition to its ordinary meaning, a local or remote information retrieval system whereby users and/or electronic agents formulate and submit a query and the system locates documents that relate to the information contained in the query.
  • the processing of those queries and identification of the related documents may occur in a number of ways including the use of an index, such as an inverted file structure, signature files or any other present or future manner to retrieve information.
  • the index is typically developed through computerized agents that access the world wide web through a process known as crawling and spidering.
  • query is defined broadly, and includes, in addition to its ordinary meaning, a user's or agent's submission of terms to a search engine. Formation of the query may occur in a number of manners including, without limitation, exact or lexical, Boolean, natural language, or any other present or future manner.
  • the term "document" is defined broadly, and includes, in addition to its ordinary meaning, any files and data, including without limitation, computer files, machine configurations, executables and websites.
  • the term “document” is not limited to computer files containing text, but also includes computer files containing graphics, audio, video, and other multimedia data.
  • search results is defined broadly, and includes, in addition to its ordinary meaning, search results based on an index of documents where a computerized algorithm searches through the index and compiles search results based on relevancy to the query.
  • Search results may also include present or future types of paid listings whereby the results have a sponsor, defined broadly, who provides incentives for the search engine to present the listing to the user.
  • Paid listings includes, in addition to its ordinary meaning, pay for placement, pay for click, pay for action and paid inclusion listings generated by a search engine in response to a user's search query.
  • an exemplary personalized search apparatus provides a method for giving a search engine additional information about the user and their search query, whereby the search engine tailors its processing to provide the user with more relevant search results.
  • FIG. 1 illustrates an exemplary arrangement where a user 100, through a user interface 110 on a computer or similar device 120, accesses the search engine through a communications network 130 and submits an information search query to either a local intranet search engine 140 or to an Internet search engine 150.
  • the user initiates a query by entry into a search engine user interface 200 for processing of the query and tailoring the search results 210.
  • the system provides to the search engine, along with the query, a user profile or digital signature.
  • the information in the digital signature allows the query to be contextualized by the user's profile. It also provides a means to weight, or scale, the importance of the terms based on the data contained in the user's files.
  • the search engine is able to recalculate the relevancy of search results 220, prior to returning the results to the user 230.
  • the apparatus separately transmits the signature information to the search application, which stores it for future use.
  • the user identifies himself or herself when submitting queries, either by logging in or other means such as a cookie on their computer, and the search application retrieves the signature from its storage device for processing with the query.
  • user profile information is maintained locally and filtering or re-sorting of search results occurs at the client side to protect against any potential unauthorized dissemination of the user's private information.
  • the apparatus provides a technique for executing an electronic agent that forms the profile, or digital signature, of the user using both declared and observed information.
  • the system is installed or downloaded by the user 310.
  • This agent may be a client on the user's computer or software from a host server that may function as a virtual client.
  • Declared information may include, but is not limited to, personal information declared by the user, such as demographic information and interests. Observed information includes, but is not limited to, an analysis of documents on the user's computer system, previous search history, and previous URL visitation history.
  • the agent uses this information to create all or part of the digital signature of the user. The frequency of update of the digital signature is configurable by the user, or predetermined by the system.
  • the user's declared information is provided during the process of installing and configuring the system 320.
  • the declared information 410 may include various demographic information such as sex, age, location as well as interests 420 (such as history, wildlife, technology etc.).
  • the declared information is stored for use in the digital signature.
  • the electronic agent also performs an analysis of information contained in the user's computer 330. This is performed as part of the process of installing the apparatus and is configurable by the user with respect to what data is analyzed and at what frequency. Examples of the data analyzed include all system and non-system files such as, but not limited to, machine configuration, e-mail, word processing documents, electronic spreadsheets, presentation and graphic package documents, instant messenger history and stored PDF documents.
  • the agent analyzes the user's data by scanning the words used in the documents and determining which words have a higher incidence of use versus a baseline 340, 350. Referring to FIG. 4, those words, and their semantic meaning, are stored for inclusion in the digital signature 430. For example, if a user has 3000 references to "intel", that count would far exceed that of an average user, and the word would be stored as a high-incidence word relative to the baseline. An example of this observed information in the signature is shown in FIG. 4. For security, the signature may be compressed and encrypted in several ways based on well-known techniques of hashing and keys. Referring once again to FIG. 3, the system creates the digital signature using the declared and observed information (collectively "user's information"). This signature may be created in multiple ways.
  • the system compares words used in the user's information to a baseline of the word use in the English, or other, language to identify interests. Further, the system may record the semantic meaning, or context, of the word in creating the signature. For instance, if the word "jaguar" is often used in the user's information in the context of computer operating systems, the system will record the word and the context of computers rather than alternative meanings such as automobiles or wildlife. If the user then searches for "jaguar manual", the normal search results for "jaguar manual" are modified such that the computer operating system documents have a higher than normal relevance ranking and those related to automobiles have a lower ranking than normal.
  • the system contributes the user's information to a network that continually updates the baseline word use 340.
  • the system then in turn provides an updated baseline for use in comparison to the user's information and for creation of the digital signature.
  • the user may review and edit any information in the user profile to highlight immediate intent.
  • the user may create multiple profiles, subprofiles or combined profiles. These profiles may be used in conjunction with a particular search to provide context for the search. By way of example, the user may set up different profiles reflecting his or her varying interests or hobbies.
  • FIG. 5 outlines how, in one embodiment, the search engine processes a query and reformulates the results based on the user's information.
  • the system receives a search query and signature from a user 500.
  • FIG. 6 outlines an alternative embodiment whereby the search engine refines the query by modifying or appending information relevant to the user based on the information in the signature.
  • the search query and signature are received from the user.
  • the query is then reformulated or refined based on the user's signature to increase the relevance of the query by incorporating information or keywords into the query relating to the user 610.
  • the index is then searched based on the modified or enhanced query 620 and the results are returned 630.
  • a user's prior web browser history may be used to improve relevance 440.
  • the personal search apparatus may track, and store a log of, web sites visited, time spent, and prior searches, and use that data to increase the relevance weighting of sites that have been visited before. This includes recording URLs visited and the number of page views as well as other actions (download, buy etc.) at the URLs. This history is stored for inclusion in the digital signature.
  • the search engine compares the signature with other users' signatures, identifying others who have similar profiles.
  • the relevance rankings of the search results would be re-ranked based on the search history of the previous user(s). For instance, if user "A" searched for "mouse" and iterated their query to "optical mice" and user "B" had a signature that resembles that of "A" and searched for "mice", then the search engine would boost the relevance ranking of documents related to optical mice over that of the other meanings of mice (sites on rodents, mice for animal testing etc.). In effect, the signatures based on the users' information form a means for collaboration between anonymous users.
  • Access to the search engine may be either direct, such as by a user accessing the engine through a URL on the Internet, or distributed, via an application contained on users' computers or via a third party web site that provides search services in a syndicated manner for the search engine.
  • the system disclosed herein enables the user to receive tailored results based upon information contained in the user profile, or digital signature. While the foregoing detailed description discloses several embodiments of the present invention, it should be understood that this disclosure is illustrative only and is not limiting of the present invention. It should be appreciated that the specific configurations and operations disclosed can differ from those described above, and that the methods described herein can be used in contexts other than use of a personalized search engine.

Abstract

A system and method for personalized searching of a computer network, such as a local area network or the world wide web, is disclosed. The method involves receiving a user search query, submitting the search query and a user profile to a search engine, processing the search query based on the user profile to calculate the relevancy of search results, and returning highly personalized search results to the user based upon the calculated relevancy. The user profile may include declared and observed information. Declared information includes information provided by the user, such as, for example, individual and demographic information. Observed information is gathered by the system by reviewing user word usage gathered from the user's documents, machine configuration, email and instant messages, and other areas. The system may compare words to a baseline to determine the relative incidence of word usage for inclusion into the user's profile. Observed information may further or alternatively include information regarding the user's historical behavior, including the types and frequency of websites visited.

Description

PERSONALIZED SEARCH ENGINE
FIELD OF THE INVENTION
The present invention relates generally to an information retrieval application, and more specifically to a search engine for searching information on computer networks based on a combination of the user's query and information the user provides or the device discerns about the user.
BACKGROUND OF THE INVENTION
There are many search engines capable of searching computer networks for documents of interest, and generating a list of relevant documents ("search results") based on the search engine's determination of relationships between the user's query and characteristics of the documents. Such search engines typically present the search results by sorting the results based on the search engine's determination of relevance of a document to the query. As such, the results are inherently limited by the specific terms provided by the user and the user's ability to accurately construct the query such that the terms specify the user's intent.
BRIEF DESCRIPTION OF THE DRAWINGS
Exemplary embodiments of the personalized search engine disclosed herein are illustrated in the accompanying drawings, which are for illustrative purposes only. The drawings comprise the following figures, in which:
FIG. 1 is a flowchart illustrating the operation of an exemplary search process whereby the search engine utilizes the user's personalized profile, or digital signature, to determine relevance of documents;
FIG. 2 is a flowchart illustrating the creation of the digital signature based on information declared by and observed of the user;
FIG. 3 is a schematic diagram illustrating the components of the exemplary personalized search application capable of using the apparatus of FIG. 1;
FIG. 4 is a schematic diagram illustrating select information that would be stored in the personal signature of the user;
FIG. 5 is a schematic diagram illustrating the processing of the search query and postprocessing results based on the signature; and
FIG. 6 is a schematic diagram illustrating the processing of the search query together with the signature to provide the user search results.
DETAILED DESCRIPTION OF THE INVENTION
Throughout the following description, the term "computer network" is used to refer to a system of interconnected devices, including without limitation, user-accessible server sites, peer to peer networks, the Internet as well as intranets and local area networks. Further, the term "site" is used to refer to server sites that implement current or future World Wide Web standards for the coding and transmission of hypertext documents. These standards currently include HTML (the Hypertext Markup Language), HTTP (the Hypertext Transfer Protocol), and asynchronous protocols. It should be understood that the term "site" is not intended to imply a single geographic location, as a web or other network site can, for example, include multiple geographically distributed computer systems that are appropriately linked together.
Furthermore, while the following description relates to an embodiment utilizing the Internet and related protocols, other networks or hypermedia databases, such as networked interactive televisions, and other present or future protocols may be used as well. For example, for use with cell phones, personal digital assistants (PDAs), and the like, HDML (Handheld Device Markup Language), WAP (Wireless Application Protocol), WML (wireless markup language), XML (Extensible Markup Language), or the like can be used.
Additionally, unless otherwise indicated, the functions described herein are performed by programs including executable code or instructions running on one or more network-enabled devices, including, without limitation, general-purpose computers, cellular phones, PDAs, and other present or future devices. The devices may include one or more central processing units for executing program code, volatile memory, such as random access memory (RAM) for temporarily storing data and data structures during program execution, non-volatile memory, such as a hard disk storage or optical storage, for storing programs and data, including databases, and a network interface for accessing an intranet and/or the Internet. However, the functions described herein may also be implemented using special purpose computers, state machines, and/or hardwired electronic circuits. The exemplary processes described herein do not necessarily have to be performed in the described sequence, and not all states have to be reached or performed.
As used herein, the term "search engine" is defined broadly, and includes, in addition to its ordinary meaning, a local or remote information retrieval system whereby users and/or electronic agents formulate and submit a query and the system locates documents that relate to the information contained in the query.
The processing of those queries and identification of the related documents may occur in a number of ways including the use of an index, such as an inverted file structure, signature files or any other present or future manner to retrieve information. The index is typically developed through computerized agents that access the world wide web through a process known as crawling and spidering.
As used herein, the term "query" is defined broadly, and includes, in addition to its ordinary meaning, a user's or agent's submission of terms to a search engine. Formation of the query may occur in a number of manners including, without limitation, exact or lexical, Boolean, natural language, or any other present or future manner.
As used herein, the term "document" is defined broadly, and includes, in addition to its ordinary meaning, any files and data, including without limitation, computer files, machine configurations, executables and websites. The term "document" is not limited to computer files containing text, but also includes computer files containing graphics, audio, video, and other multimedia data.
As used herein, the term "search results" is defined broadly, and includes, in addition to its ordinary meaning, search results based on an index of documents where a computerized algorithm searches through the index and compiles search results based on relevancy to the query. Search results may also include present or future types of paid listings whereby the results have a sponsor, defined broadly, who provides incentives for the search engine to present the listing to the user. Paid listings include, in addition to the ordinary meaning of the term, pay for placement, pay for click, pay for action and paid inclusion listings generated by a search engine in response to a user's search query.
As described in greater detail below, an exemplary personalized search apparatus provides a method for giving a search engine additional information about the user and their search query, whereby the search engine tailors its processing to provide the user with more relevant search results.
FIG. 1 illustrates an exemplary arrangement where a user 100, through a user interface 110 on a computer or similar device 120, accesses the search engine through a communications network 130 and submits an information search query to either a local intranet search engine 140 or to an Internet search engine 150. Referring to FIG. 2, the user initiates a query by entry into a search engine user interface 200 for processing of the query and tailoring the search results 210.
In one embodiment, the system provides to the search engine, along with the query, a user profile or digital signature. The information in the digital signature allows the query to be contextualized by the user's profile. It also provides a means to weight, or scale, the importance of the terms based on the data contained in the user's files. In this way, the search engine is able to recalculate the relevancy of search results 220, prior to returning the results to the user 230.
In another embodiment, the apparatus separately transmits the signature information to the search application, which stores it for future use. In this example, the user identifies himself or herself when submitting queries, either by logging in or by other means such as a cookie on the user's computer, and the search application retrieves the signature from its storage device for processing with the query.
In another embodiment, user profile information is maintained locally and filtering or re-sorting of search results occurs at the client side to protect against any potential unauthorized dissemination of the user's private information.
Referring to FIG. 3, in another embodiment, the apparatus provides a technique for executing an electronic agent that forms the profile, or digital signature, of the user using both declared and observed information. In one example, the system is installed or downloaded by the user 310. This agent may be a client on the user's computer or software from a host server that may function as a virtual client. Declared information may include, but is not limited to, personal information declared by the user, such as demographic information and interests. Observed information includes, but is not limited to, an analysis of documents on the user's computer system, previous search history, and previous URL visitation history. The agent uses this information to create all or part of the digital signature of the user.
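By way of non-limiting illustration, the digital signature and its submission alongside a query might be represented as in the following Python sketch. The class name `DigitalSignature`, its field names, and the helper `build_request` are hypothetical and are not taken from the disclosure; they merely group the declared and observed information described above.

```python
import json
from dataclasses import asdict, dataclass, field
from typing import Optional


@dataclass
class DigitalSignature:
    """Hypothetical container for a user profile ("digital signature")."""

    # Declared information: supplied directly by the user.
    demographics: dict = field(default_factory=dict)
    interests: list = field(default_factory=list)
    # Observed information: gathered by the electronic agent.
    high_incidence_words: dict = field(default_factory=dict)  # word -> context
    visited_urls: dict = field(default_factory=dict)          # URL -> page views
    prior_searches: list = field(default_factory=list)

    def to_json(self) -> str:
        """Serialize the signature so it can accompany a query."""
        return json.dumps(asdict(self), sort_keys=True)


def build_request(query: str, signature: Optional[DigitalSignature]) -> dict:
    """Bundle a query with an optional signature.

    Passing None models a search submitted without any profile.
    """
    request = {"query": query}
    if signature is not None:
        request["signature"] = json.loads(signature.to_json())
    return request
```

In this sketch a caller would build one request per search, for example `build_request("jaguar manual", signature)`, and the receiving search engine would read the `signature` entry when it is present.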
The frequency of update of the digital signature is configurable by the user, or predetermined by the system.
In one embodiment, the user's declared information is provided during the process of installing and configuring the system 320. Referring to FIG. 4, the declared information 410 may include various demographic information such as sex, age, location as well as interests 420 (such as history, wildlife, technology etc.). The declared information is stored for use in the digital signature.
Referring once again to FIG. 3, to obtain observed information, the electronic agent also performs an analysis of information contained in the user's computer 330. This is performed as part of the process of installing the apparatus and is configurable by the user with respect to what data is analyzed and at what frequency. Examples of the data analyzed include all system and non-system files such as, but not limited to, machine configuration, e-mail, word processing documents, electronic spreadsheets, presentation and graphic package documents, instant messenger history and stored PDF documents. The agent analyzes the user's data by scanning the words used in the documents and determining which words have a higher incidence of use versus a baseline 340, 350. Referring to FIG. 4, those words, and their semantic meaning, are stored for inclusion in the digital signature 430. For example, if a user has 3000 references to "intel", that count would far exceed that of an average user, and the word would be stored as a high-incidence word relative to the baseline. An example of this observed information in the signature is shown in FIG. 4. For security, the signature may be compressed and encrypted in several ways based on well-known techniques of hashing and keys.
Referring once again to FIG. 3, the system creates the digital signature using the declared and observed information (collectively "user's information"). This signature may be created in multiple ways.
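By way of non-limiting illustration, the comparison of word incidence against a baseline (340, 350) might be sketched in Python as follows. The function name `high_incidence_words`, the ratio threshold, the minimum count, and the floor used for words missing from the baseline are all assumptions chosen for the example; the disclosure does not specify how the comparison is computed.

```python
import re
from collections import Counter


def high_incidence_words(user_text, baseline_freq, ratio_threshold=5.0,
                         min_count=2, floor=1e-6):
    """Return {word: ratio} for words the user uses far more than the baseline.

    baseline_freq maps a word to its relative frequency in general usage.
    Words absent from the baseline are given a small floor frequency, so
    they are flagged readily; min_count filters out one-off words.
    """
    words = re.findall(r"[a-z']+", user_text.lower())
    counts = Counter(words)
    total = len(words)
    flagged = {}
    for word, count in counts.items():
        if count < min_count:
            continue
        user_rate = count / total
        baseline_rate = max(baseline_freq.get(word, 0.0), floor)
        ratio = user_rate / baseline_rate
        if ratio >= ratio_threshold:
            flagged[word] = ratio
    return flagged
```

With a baseline in which "intel" is rare, a body of user text that mentions "intel" often yields a large ratio for that word, while common words such as "the" stay near a ratio of one and are not flagged.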
In one embodiment, the system compares words used in the user's information to a baseline of the word use in the English, or other, language to identify interests. Further, the system may record the semantic meaning, or context, of the word in creating the signature. For instance, if the word "jaguar" is often used in the user's information in the context of computer operating systems, the system will record the word and the context of computers rather than alternative meanings such as automobiles or wildlife. If the user then searches for "jaguar manual", the normal search results for "jaguar manual" are modified such that the computer operating system documents have a higher than normal relevance ranking and those related to automobiles have a lower ranking than normal.
In another embodiment, the system contributes the user's information to a network that continually updates the baseline word use 340. The system then in turn provides an updated baseline for use in comparison to the user's information and for creation of the digital signature.
In one embodiment, the user may review and edit any information in the user profile to highlight immediate intent. In addition, the user may create multiple profiles, subprofiles or combined profiles. These profiles may be used in conjunction with a particular search to provide context for the search. By way of example, the user may set up different profiles reflecting his or her varying interests or hobbies. By way of another example, if a user is purchasing a gift for his or her elderly aunt, the user may not want to submit his or her user profile for the search, but may instead provide no profile, a new profile or a modified profile setting forth information concerning his or her aunt. In another embodiment, the user may set the period for observed behavior to coincide with the user's current online session to create a more immediate or time-restricted context for the search.
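By way of non-limiting illustration, the "jaguar manual" example above might be sketched in Python as follows. The sketch assumes that each search result already carries a topic label and a base relevance score, and that the signature stores one recorded context per word; the function name `rerank_by_context` and the boost and penalty factors are hypothetical.

```python
def rerank_by_context(query, results, word_contexts, boost=1.5, penalty=0.5):
    """Re-sort results so documents in the user's recorded context rank higher.

    results: list of dicts with 'title', 'topic' and 'score' keys.
    word_contexts: word -> context recorded in the signature,
                   for example {"jaguar": "computers"}.
    """
    query_terms = query.lower().split()
    preferred = {word_contexts[t] for t in query_terms if t in word_contexts}
    if not preferred:
        # No recorded context applies, so the normal ordering is kept.
        return sorted(results, key=lambda r: r["score"], reverse=True)
    rescored = []
    for result in results:
        factor = boost if result["topic"] in preferred else penalty
        rescored.append({**result, "score": result["score"] * factor})
    return sorted(rescored, key=lambda r: r["score"], reverse=True)
```

A profile that records "jaguar" in the context of computers therefore moves an operating system manual above an automobile manual, while a search made with no applicable profile entry keeps the normal order.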
In a further embodiment, the user may toggle the user profile on or off, restrict certain parameters, modify certain parameters, or specify additional parameters for one or more search sessions.
FIG. 5 outlines how, in one embodiment, the search engine processes a query and reformulates the results based on the user's information. The system receives a search query and signature from a user 500. The system then searches an index of documents 510 and returns results 520. The digital signature is analyzed and personal interests and information are discovered 530. The discovered information is used by the search engine to re-sort the results based on the signature 540. The results are then returned to the user.
FIG. 6 outlines an alternative embodiment whereby the search engine refines the query by modifying or appending information relevant to the user based on the information in the signature. In this embodiment, the search query and signature are received from the user. The query is then reformulated or refined based on the user's signature to increase the relevance of the query by incorporating information or keywords into the query relating to the user 610. The index is then searched based on the modified or enhanced query 620 and the results are returned 630.
Referring also to FIG. 4, in a modified embodiment, in addition to word frequency usage, a user's prior web browser history, including searches, may be used to improve relevance 440. The personal search apparatus may track, and store a log of, web sites visited, time spent, and prior searches, and use that data to increase the relevance weighting of sites that have been visited before. This includes recording URLs visited and the number of page views as well as other actions (download, buy, etc.) at the URLs. This history is stored for inclusion in the digital signature.
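The query-refinement flow of FIG. 6 (step 610), together with the profile toggle mentioned above, might look something like the following sketch. The function `refine_query`, the `signature_contexts` mapping, and the choice to append a context keyword are hypothetical; the disclosure states only that information or keywords relating to the user are incorporated into the query.

```python
def refine_query(query, signature_contexts, enabled=True):
    """Append context keywords from the signature to the user's query.

    signature_contexts -- dict mapping a word to a context keyword recorded
                          in the signature (an assumed representation)
    enabled            -- models toggling the user profile on or off
    """
    if not enabled:
        return query

    terms = query.split()
    present = {t.lower() for t in terms}
    extra = []
    for term in terms:
        context = signature_contexts.get(term.lower())
        # Add each context keyword once, and only if not already in the query.
        if context and context.lower() not in present and context not in extra:
            extra.append(context)
    return " ".join(terms + extra)


signature = {"jaguar": "computers"}
refined = refine_query("jaguar manual", signature)
untouched = refine_query("jaguar manual", signature, enabled=False)
```

The refined query would then be submitted to the index in place of the original (step 620). Whether to refine the query or re-sort the results is a design choice: refinement changes which documents are retrieved, whereas re-sorting only changes their order.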
For example, suppose one of the word pairs in the user's information that has a higher frequency than the baseline average is "pro bikes", because the user recently bought a new derailleur for a mountain bike, and the user types in the search term "bike rack". The normal search results for "bike rack" would be retrieved from the web (say the top 100 or top 1000), and the web site of the "pro bikes" company would then be raised above its normal position, because the user has done business with that company before (as indicated by its frequency on the user's hard disk being significantly higher than normal).
In a modified embodiment, in addition to using the user's signature to influence the results, the search engine compares the signature with other users' signatures, identifying others who have similar profiles. In the event that other users have utilized the search engine for the same query (or a similar one based on synonyms), the search results would be re-ranked based on the search history of the previous user(s). For instance, if user "A" searched for "mouse" and iterated their query to "optical mice", and user "B" had a signature that resembles that of "A" and searched for "mice", then the search engine would boost the relevance ranking of documents related to optical mice over that of the other meanings of mice (sites on rodents, mice for animal testing, etc.). In effect, the signatures based on the users' information form a means for collaboration between anonymous users.
Access to the search engine may be either direct, such as by a user accessing the engine through a URL on the Internet, or distributed, via an application contained on users' computers or via a third party web site that provides search services in a syndicated manner for the search engine.
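The collaboration between users "A" and "B" described above might be sketched as follows. Representing a signature as a dictionary of word weights, comparing signatures by cosine similarity, and the 0.7 similarity cutoff are all assumptions made for illustration; the disclosure does not specify how similarity between signatures is measured.

```python
import math


def cosine_similarity(a, b):
    """Cosine similarity between two {word: weight} signatures."""
    dot = sum(a[k] * b[k] for k in set(a) & set(b))
    norm_a = math.sqrt(sum(v * v for v in a.values()))
    norm_b = math.sqrt(sum(v * v for v in b.values()))
    if norm_a == 0 or norm_b == 0:
        return 0.0
    return dot / (norm_a * norm_b)


def refinements_from_similar_users(signature, query, history,
                                   synonyms=None, min_similarity=0.7):
    """Collect refined queries issued by users with similar signatures.

    history  -- list of (other_signature, original_query, refined_query)
    synonyms -- optional dict mapping a query to its canonical form
    """
    synonyms = synonyms or {}
    canonical = synonyms.get(query, query)
    hints = []
    for other_signature, original, refined in history:
        if synonyms.get(original, original) != canonical:
            continue  # a different query, not relevant here
        if cosine_similarity(signature, other_signature) >= min_similarity:
            hints.append(refined)
    return hints


user_a = {"intel": 3.0, "linux": 2.0}
user_b = {"intel": 2.5, "linux": 2.2, "python": 0.5}
user_c = {"hamster": 4.0, "cage": 2.0}
history = [(user_a, "mouse", "optical mice"), (user_c, "mouse", "pet mice")]
hints = refinements_from_similar_users(user_b, "mice", history, {"mice": "mouse"})
```

The returned refinements could then be used to boost matching documents for user "B". Because only signatures are compared, the users need not be identified to one another.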
Thus, in contrast to conventional systems, which often fail to list the items most relevant to the user first because of their inability to discern the user's intentions or interests, the system disclosed herein enables the user to receive tailored results based upon information contained in the user profile, or digital signature. While the foregoing detailed description discloses several embodiments of the present invention, it should be understood that this disclosure is illustrative only and is not limiting of the present invention. It should be appreciated that the specific configurations and operations disclosed can differ from those described above, and that the methods described herein can be used in contexts other than use of a personalized search engine.

Claims

WHAT IS CLAIMED IS:
1. A method for searching a computer network, the method comprising: generating a user profile; submitting a user search query; providing the search query to a search engine; processing the search query based on the user profile to calculate the relevancy of search results; and returning the search results to the user based upon the calculated relevancy.
2. The method of claim 1, further comprising: declaring information relating to user demographics and interests; observing information relating to the user's behavior; and processing the declared information and observed information to generate the user profile.
3. The method of claim 2, further comprising: updating the user profile based on a user-defined frequency.
4. The method of claim 2, wherein the observing step comprises one or more of: analyzing documents on the user's computer system; analyzing the user's search history; and analyzing the user's URL visitation history.
5. The method of claim 4, wherein the analyzing documents step comprises analyzing information contained in one or more documents on a user's network-enabled device.
6. The method of claim 5, further comprising: scanning words in the documents; establishing a baseline of user word usage; determining the relative incidence of words compared to the baseline; and generating a component of the user profile based on the words identified in the determining step.
7. The method of claim 6, further wherein the baseline is established by reviewing word usage from a group of users.
8. The method of claim 5, further comprising: scanning words in the documents; establishing a baseline based on average word usage in the language of the user; determining the relative incidence of words compared to the baseline; and generating a component of the user profile based on the words identified in the determining step.
9. The method of claim 2, further comprising the step of setting the period within which information is observed.
10. The method of claim 2, further comprising the step of generating a plurality of profiles for a user.
11. The method of claim 1, further comprising the step of toggling on or off processing of the user profile.
12. The method of claim 1, further comprising the step of modifying the user profile prior to the processing step.
13. The method of claim 1, wherein the step of processing the search query based on the user profile comprises resorting the search results based on information contained within the profile.
14. The method of claim 1, wherein the step of processing the search query based on the user profile comprises modifying the search query submitted to the search engine to perform the search.
15. A system for searching a computer network, the system comprising: means for generating a user profile; means for formulating a user search query; means for providing the search query and a user profile to a search engine; means for processing the search query based on the user profile to calculate the relevancy of search results; and means for returning the search results to the user based upon the calculated relevancy.
16. The system of claim 15, further comprising: means for declaring information relating to user demographics and interests; means for observing information relating to the user's historical behavior; and means for processing the declared information and observed information to generate the user profile.
17. The system of claim 16, further comprising: means for updating the user profile based on user-defined frequency.
18. The system of claim 16, wherein the means for observing comprises: means for analyzing documents on the user's computer system; means for analyzing the user's previous search history; and means for analyzing the user's previous internet visitation history.
19. The system of claim 18, wherein the means for analyzing documents comprises means for analyzing information contained in one or more of the user's documents.
20. The system of claim 19, further comprising: means for scanning words in the documents; means for establishing a baseline of user word usage; means for determining the relative incidence of words compared to the baseline; and means for generating a component of the user profile based on the words identified by the means for determining.
21. The system of claim 20, further wherein the baseline is established by reviewing word usage from a group of users.
22. The system of claim 19, further comprising: means for scanning words in the documents; means for establishing a baseline based on average word usage in the language of the user; means for determining the relative incidence of words compared to the baseline; and means for generating a component of the user profile based on the words identified by the means for determining.
23. The system of claim 16, further comprising means for setting the period within which information is observed.
24. The system of claim 16, further comprising means for generating a plurality of profiles for a user.
25. The system of claim 15, further comprising means for toggling on or off processing of the user profile.
26. The system of claim 15, further comprising means for modifying the user profile prior to the processing step.
27. The system of claim 15, wherein the means for processing the search query based on the user profile comprises means for resorting the search results based on information contained within the user profile.
28. The system of claim 15, wherein the means for processing the search query based on the user profile comprises means for modifying the search query used by the search engine to perform the search.
PCT/US2005/017207 2004-05-14 2005-05-16 Personalized search engine WO2005114379A2 (en)

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
US57145204P 2004-05-14 2004-05-14
US60/571,452 2004-05-14

Publications (2)

Publication Number Publication Date
WO2005114379A2 (en) 2005-12-01
WO2005114379A3 (en) 2008-08-28

Family

ID=35429023

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/US2005/017207 WO2005114379A2 (en) 2004-05-14 2005-05-16 Personalized search engine

Country Status (2)

Country Link
US (1) US20050278317A1 (en)
WO (1) WO2005114379A2 (en)

Citations (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20030126108A1 (en) * 2001-12-31 2003-07-03 Knoinklijke Philips Electronics N.V. Method and apparatus for access and display of content allowing users to apply multiple profiles
US20030195877A1 (en) * 1999-12-08 2003-10-16 Ford James L. Search query processing to provide category-ranked presentation of search results
US20030208486A1 (en) * 2002-04-25 2003-11-06 International Machines Business Machines Dynamic end user specific customization of an application's physical data layer through a data repository abstraction layer

Family Cites Families (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US6182068B1 (en) * 1997-08-01 2001-01-30 Ask Jeeves, Inc. Personalized search methods
US6199067B1 (en) * 1999-01-20 2001-03-06 Mightiest Logicon Unisearch, Inc. System and method for generating personalized user profiles and for utilizing the generated user profiles to perform adaptive internet searches
US6647383B1 (en) * 2000-09-01 2003-11-11 Lucent Technologies Inc. System and method for providing interactive dialogue and iterative search functions to find information
US6850934B2 (en) * 2001-03-26 2005-02-01 International Business Machines Corporation Adaptive search engine query

Cited By (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
WO2009072095A2 (en) * 2007-12-06 2009-06-11 France Telecom Page indexer
WO2009072095A3 (en) * 2007-12-06 2009-10-15 France Telecom Page indexer
US11520838B2 (en) * 2018-04-30 2022-12-06 Innoplexus Ag System and method for providing recommendations of documents
US20220103574A1 (en) * 2020-09-25 2022-03-31 International Business Machines Corporation Generating and mutually maturing a knowledge corpus
US11552966B2 (en) * 2020-09-25 2023-01-10 International Business Machines Corporation Generating and mutually maturing a knowledge corpus

Also Published As

Publication number Publication date
WO2005114379A3 (en) 2008-08-28
US20050278317A1 (en) 2005-12-15

Legal Events

Date Code Title Description
AK Designated states

Kind code of ref document: A2

Designated state(s): AE AG AL AM AT AU AZ BA BB BG BR BW BY BZ CA CH CN CO CR CU CZ DE DK DM DZ EC EE EG ES FI GB GD GE GH GM HR HU ID IL IN IS JP KE KG KM KP KR KZ LC LK LR LS LT LU LV MA MD MG MK MN MW MX MZ NA NG NI NO NZ OM PG PH PL PT RO RU SC SD SE SG SK SL SM SY TJ TM TN TR TT TZ UA UG US UZ VC VN YU ZA ZM ZW

AL Designated countries for regional patents

Kind code of ref document: A2

Designated state(s): GM KE LS MW MZ NA SD SL SZ TZ UG ZM ZW AM AZ BY KG KZ MD RU TJ TM AT BE BG CH CY CZ DE DK EE ES FI FR GB GR HU IE IS IT LT LU MC NL PL PT RO SE SI SK TR BF BJ CF CG CI CM GA GN GQ GW ML MR NE SN TD TG

121 Ep: the epo has been informed by wipo that ep was designated in this application
NENP Non-entry into the national phase

Ref country code: DE

WWW Wipo information: withdrawn in national office

Country of ref document: DE

122 Ep: pct application non-entry in european phase