Suche Bilder Maps Play YouTube News Gmail Drive Mehr »
Anmelden
Nutzer von Screenreadern: Klicke auf diesen Link, um die Bedienungshilfen zu aktivieren. Dieser Modus bietet die gleichen Grundfunktionen, funktioniert aber besser mit deinem Reader.

Patentsuche

  1. Erweiterte Patentsuche
VeröffentlichungsnummerCN100533434 C
PublikationstypErteilung
AnmeldenummerCN 200480007418
PCT-NummerPCT/KR2004/000416
Veröffentlichungsdatum26. Aug. 2009
Eingetragen27. Febr. 2004
Prioritätsdatum19. März 2003
Auch veröffentlicht unterCN1761961A, CN101388035A, WO2004084097A1
Veröffentlichungsnummer200480007418.X, CN 100533434 C, CN 100533434C, CN 200480007418, CN-C-100533434, CN100533434 C, CN100533434C, CN200480007418, CN200480007418.X, PCT/2004/416, PCT/KR/2004/000416, PCT/KR/2004/00416, PCT/KR/4/000416, PCT/KR/4/00416, PCT/KR2004/000416, PCT/KR2004/00416, PCT/KR2004000416, PCT/KR200400416, PCT/KR4/000416, PCT/KR4/00416, PCT/KR4000416, PCT/KR400416
Erfinder姜锡昊, 李宇晟, 河定秀
AntragstellerNhn株式会社
Zitat exportierenBiBTeX, EndNote, RefMan
Externe Links:  SIPO, Espacenet
Method and apparatus for detecting invalid clicks on the internet search engine
CN 100533434 C
Zusammenfassung  übersetzt aus folgender Sprache: Chinesisch
本发明涉及一种因特网搜索引擎服务器。 The present invention relates to an Internet search engine server. 更明确地说,本发明涉及用于检测搜索项的无效点击的方法和设备,搜索项被包括在一个由因特网搜索引擎服务器提供的搜索结果网页内。 More specifically, the present invention relates to a method and apparatus for detecting invalid clicks a search term, the search items are included in one provided by an Internet search engine server search results pages. 本发明涉及一种用于在因特网搜索引擎中检测无效点击的方法,包括下列步骤:响应于来自于搜索器的搜索请求产搜索结果网页;获取一对应于被产生网页的页面标识符;从搜索器接收一包括在搜索结果网页内的搜索项的点击;获取一对应于被点击的搜索项的站点标识符;并且如果页面标识符和站点标识符与预定时段内的其它点击有关的页面标识符和站点标识符一致,则确定该点击无效。 The present invention relates to a method for detecting invalid clicks in an Internet search engine, comprising the steps of: in response to a search request from the searcher produce search results page; access is generated corresponding to a web page identifier; from search receives one included in the search results page click on the search terms; obtaining a site corresponding to the clicked search item identifier; and if the page identifier and a site identifier and other click predetermined time period associated page identifiers and site identifiers match, it is determined that the click is invalid. 根据本发明提供了一个用于检测无效点击的方法和设备,其检测各种不正当地增加搜索项点击量的尝试,并且立即处理这些尝试。 According to the present invention provides a method and apparatus for detecting invalid clicks, which detect various improper search term increase traffic to try and immediately deal with these attempts.
Ansprüche(14)  übersetzt aus folgender Sprache: Chinesisch
1. 一种用于在因特网搜索引擎中检测无效点击的方法,包括下列步骤:根据日志文件中的点击时间存储页面标识符和站点标识符;响应于来自于搜索器的搜索请求产生搜索结果网页;获取一对应于所产生的网页的页面标识符;从搜索器接收一包括在搜索结果网页内的搜索项的点击;获取一对应于被点击搜索项的站点标识符;和查阅日志文件,如果页面标识符和站点标识符与预定时段内的与其它点击有关的页面标识符和站点标识符一致,则确定该点击无效。 1. A method for Internet search engines to detect invalid clicks, comprising the steps of: according to the log file to store page identifiers and-click site identifier; in response to a search request from the searcher to generate search results pages ; obtaining a corresponding to the generated web page identifier; received from a searcher included in the search results page click on the search terms; obtaining a search term corresponding to the click site identifier; and access to the log file, if page identifier and a site identifier associated with other click page identifiers and site identifier within a predetermined period of time consistent with, it is determined that the click is invalid.
2. 权利要求1的方法,其中,页面标识符和站点标识符包括一校验和。 The method of claim 1, wherein the page identifier and the site identifier comprises a checksum.
3. —种用于在因特网搜索引擎中检测无效点击的方法,包括下列步骤:根据日志文件中的点击时间存储会话标识符和站点标识符;响应于来自于搜索器的搜索请求产生搜索结果网页;获取一包括在搜索器终端中存储的会话cookie文件内的会话标识符;从搜索器接收一包括在搜索结果网页内的搜索项的点击; 获取一对应于被点击搜索项的站点标识符;和查阅日志文件,如果会话标识符和站点标识符与在预定时段内的与其它点击有关的会话标识符和站点标识符一致,则确定该点击无效。 3 - kind of an Internet search engine for detecting invalid clicks, comprising the steps of: according to the log file and click to store session identifier site identifier; in response to the search from search requests to generate search results pages ; Gets an identifier for the session includes a session cookie file is stored in the searcher's terminal within; receiving a included in the search results page click on the search terms from search engine; obtaining a search term corresponding to the click site identifier; and access to the log file, if the session identifier and the site identifier within a predetermined period and other relevant session identifier and click on the site identifiers match, it is determined that the click is invalid.
4. 权利要求3的方法,其中,获取包括在搜索器终端中存储的会话cookie文件内的会话标识符的步骤包括下列步骤:确定会话cookie文件是否被存储在终端中;和如果确定会话cookie文件没有存储在终端中,则产生一新的会话标识符然后把包括产生的会话标识符的会话cookie文件存储在终端中。 4. The method of claim 3, wherein the obtaining step includes session session cookie file is stored in the searcher's terminal within the identifier comprises the steps of: determining whether the session cookie file is stored in the terminal; and if it is determined the session cookie file It is not stored in the terminal, generating a new session identifier and the session identifier generated including a session cookie file is stored in the terminal.
5. 权利要求4的方法,还包括下列步骤:如果确定会话cookie文件被存储在终端中,则确定包括在会话cookie 文件内的会话标识符的最后更新时间是否在预定时段内•,和如果确定最后更新时间在预定时段内,则获取一包括在会话cookie文件内的会话标识符。 The method of claim 4, further comprising the steps of: determining if the session cookie file is stored in the terminal, including the last update time is determined in the session cookie file of the session identifier is within a predetermined time period •, and if it is determined Last updated within a predetermined period of time, then get a file included in a session cookie session identifier.
6. 权利要求5的方法,还包括下列步骤:如果确定最后更新时间不在预定时段内,则通过产生新的会话标识符来更新包括在会话cookie文件内的会话标识符;和把会话标识符的更新时间存储在会话cookie文件中。 The method of claim 5, further comprising the steps of: determining a last update time if not within the predetermined period of time, through the new session identifier update is included in the session cookie file generated session identifier; and the session identifier Updated stored in the session cookie file.
7. 权利要求4的方法,还包括下列步骤:如果确定会话cookie文件存储在终端中,则确定来自搜索器的搜索项的点击时间是否在与会话标识符有关的最后点击时间之后的预定时段内;如果确定搜索项的点击时间在最后点击时间之后的预定时段内,则获取一包括在会话cookie文件内的会话标识符;和用搜索项的点击时间来更新最后点击时间。 The method of claim 4, further comprising the steps of: determining if the session cookie file is stored in the terminal, it is determined from the time of the click search term searcher is within a predetermined time period associated with the session identifier time after the last click ; if it is determined-click search items within a predetermined period of time after the last click, get a file included in a session cookie session identifier; and time with the click of a search term to update the last time of the click.
8. 权利要求7的方法,还包括下列步骤:如果确定搜索项的点击时间不在最后点击时间之后的预定时段内,则通过产生新的会话标识符来更新包括在会话cookie文件内的会话标识符;禾口用搜索项的点击时间来更新最后点击时间。 The method of claim 7, further comprising the steps of: a predetermined period of time if it is determined click search term is not the last time after the click, and then update the document included in the session cookie session identifier by creating a new session identifier ; Hekou time search terms by clicking to update the final click time.
9. 权利要求3到8中任何一个的方法,其中,会话标识符和站点标识符包括一个校验和。 Of claim any one of 3-8, wherein, the session identifier and the site identifier comprises a checksum.
10. —种用于在因特网搜索引擎中检测无效点击的方法,包括下列步骤:根据日志文件中的点击时间存储客户机IP地址和站点标识符; 从搜索器接收一包括在搜索结果网页内的搜索项的点击; 获取一对应于搜索器终端的客户机IP地址; 获取一对应于被点击搜索项的站点标识符;和查阅日志文件,如果客户机IP地址和站点标识符与预定时段内的与其它点击有关的客户机IP地址和站点标识符一致,则确定该点击是无效的。 10. - kind of a method for detecting invalid clicks in an Internet search engine, comprising the steps of: according to the log file click to store client IP addresses and domain identifier; receiving a ranging from search engine in the search results page click on a search term; obtaining a terminal corresponding to the search of the client IP address; get a click on the search term corresponding to the site identifier; and access to the log file, if the client IP address and the site identifier within a predetermined period of time Client IP addresses and site and other click-related identifiers match, it is determined that the click is invalid.
11. 权利要求10的方法,其中,站点标识符用其中包括的校验和来产生。 11. The method of claim 10, wherein the site identifier comprises a checksum with which to generate.
12. —种用于在因特网搜索引擎中检测无效点击的方法,包括下列步骤:根据日志文件中的点击时间存储终端标识符和站点标识符;响应于来自于搜索器的搜索请求产生一搜索结果网页; 获取一对应于搜索器终端的终端标识符;产生一包括终端标识符的用户cookie文件,然后把用户cookie文件存储在搜索器终端中;从搜索器接收一包括在搜索结果网页内的搜索项的点击; 获取一对应于被点击搜索项的站点标识符;和查阅日志文件,如果终端标识符和站点标识符与预定时段内的与其它点击有关的终端标识符和站点标识符一致,则确定该点击是无效的。 12. - kind of an Internet search engine for detecting invalid clicks, comprising the steps of: according to the log file and click time storage terminal identifier site identifier; in response to the search from search requests to generate a search result web pages; Get a terminal corresponding to the search terminal identifier; generating a user cookie file including the terminal identifier, then the user cookie files stored in the searcher's terminal; receiving a search within the search results include pages from the Finder Click on the item; obtaining a search term corresponding to the click site identifier; and access to the log files, click if other sites related to the terminal identifier and terminal identifier and a site identifier and a predetermined time period identifier matches determine that the click is invalid.
13. 权利要求12的方法,还包括下列步骤: 确定包括终端标识符在内的cookie文件是否被存储在终端中;和如果确定包括终端标识符在内的用户cookie文件存储在终端中,则从用户cookie文件接收终端标识符。 13. The method of claim 12, further comprising the steps of: determining a cookie file including the terminal identifier is stored in the terminal; and determining if the user cookie file including the terminal identifier stored in the terminal, from user cookie file receiving terminal identifiers.
14. 权利要求12或13的方法,其中,终端标识符和站点标识符包括一校验和。 The method according to claim 12 or 13, wherein the terminal identifier and the site identifier includes a checksum.
Beschreibung  übersetzt aus folgender Sprache: Chinesisch

在因特网搜索引擎上检测无效点击的方法和设备 On Internet search engines to detect invalid clicks method and apparatus

技术领域 Technical Field

本发明涉及因特网搜索引擎服务器。 The present invention relates to an Internet search engine server. 更明确地说,本发明涉及用于检测搜索项的无效点击的方法和设备,搜索项被包括在一个由因特网搜索引擎服务器提供的搜索结果网页内。 More specifically, the present invention relates to a method and apparatus for detecting invalid clicks a search term, the search items are included in one provided by an Internet search engine server search results pages. 此外,本发明涉及用于检测无效点击的方法和设备,其可以检测不公平地增加搜索项点击量的各种尝试并可以立即应付这些尝试。 Furthermore, the present invention relates to a method and apparatus for detecting invalid clicks, which can detect attempts to unfairly increase traffic and search terms can immediately cope with these attempts.

背景技术 Background

随着因特网的使用越来越广泛,诸如可经由因特网访问的网页之类的信息源的数量已经以算术级数增长。 With more and more widespread use of the Internet, the number of pages and the like, such as a source of information accessible via the Internet has been growing at an arithmetic progression. 此外,为了在大量信息源之中发现信 In addition, in order to find the letter among a large number of information sources

息,搜索器访问诸如NAVER、 Yahoo和Lycos之类的因特网搜索引擎服务 Interest, such as search access NAVER, Yahoo and Lycos like Internet search engine service

器以请求搜索。 Is to request a search. 因特网搜索服务提供商产生一个包括搜索项在内的搜索结果网页,其包括与搜索器输入的搜索字有关的信息,然后向搜索器提供生成的搜索结果网页。 Internet search provider, including search terms, including generating a search results page, which includes information about search words and search input and then provide the generated search results page to the search engine. 例如,当搜索器访问NAVER搜索引擎服务器然后输入搜索字"Digital Camera (数码相机)"时,搜索结果网页如图2所示。 For example, when the search engine server access NAVER search and enter the search word "Digital Camera (Digital Camera)", the search results page as shown in Figure 2. 包括在搜索结果网页内的每一项都与URL(统一资源定位符)有关。 Included in the search results page for each item and URL (Uniform Resource Locator) related.

因为与单一搜索字有关的搜索项的数量不计其数,然而,这类不计其 Since the number of search terms related to a single search word countless, however, these do not count it

数的搜索项如何在搜索结果网页上显示和以什么顺序显示对因特网搜索服务提供商来说是一个非常重要的问题。 How to display the number of search terms and in what order to display an Internet search service provider for it is a very important issue on the search results pages. 因特网搜索服务提供商通过结合几个标准来确定搜索项的列出顺序。 Internet search service providers through a combination of several criteria used to determine the order of search terms listed. 已被广泛使用的其中一个标准是用户对特殊搜索项的点击量。 One of the standard has been widely used is the user clicks on the specific search term. 例如,如果用户对一个搜索项的点击量很大,则该搜索项被显示在搜索结果网页相对靠上的部分。 For example, if a user clicks on a search term is large, the search terms are displayed in the search results pages on the opposite by section. 甚至在因特网搜索服务提供商通过结合多个参数来确定搜索项的列出顺序的情况中,如果其中一个参数是用户点击量,则具有很高点击量的搜索项被显示在搜索结果网页的相对靠上的部分。 Even in the case of Internet search service provider to determine the order of search terms listed by the combination of a number of parameters, if one of the parameters is the user clicks, then a high-traffic search terms are displayed in the search results page opposite depend on the part.

5此外,因特网搜索服务器产生的搜索结果网页被显示得越高,用户可 5 In addition, the Internet search server generates the search results page is displayed higher, the user can

能点击和访问该网页的可能性就越大。 You can click and access the web page likely. 从而,web服务器的网络信息提供 Thus, the network information provider web server

商想要把与他(她)自己有关的搜索项显示在搜索结果网页的顶端。 Suppliers want to with him (her) own search-related items are displayed at the top of the search results page. 因为这个原因,为了将他(她)的网页搜索项显示在搜索结果网页的顶端,网络信息提供商可以故意地访问因特网搜索服务器来多次点击他(她)自己网页的搜索项。 For this reason, in order to him (her) web search items are displayed at the top of the search results page, the network information provider may intentionally accessing Internet search server to repeatedly click on his (her) own website search item. 有时,网络信息提供商可以用一个专门的程序不断地点击他(她) 的网页的搜索项。 Sometimes, the network information provider can use a special program continue to click on his (her) pages for the search term. 因为这类不公平的搜索项点击并不反映真实的用户搜索结果,所以因特网搜索服务提供商必须检测这类无效的点击。 Because such unfair search term does not reflect the real user click on search results, so the Internet search service provider must detect this kind of invalid clicks.

先有技术中存在这类服务,其中,与搜索项有关的网络信息提供商基于搜索结果网页中的每个搜索项的点击量被收费。 The prior art in the presence of such services, which, with the network information provider search terms based on the search results page for each search term traffic is free of charge. 因特网搜索服务提供商 Internet search provider

Overture Services ,lnc.(USA)提供这类服务,其中,当搜索器点击与网络信息提供商有关的搜索结果网页中的搜索项时,网络信息提供商支付每次点击。 Overture Services, lnc. (USA) to provide such services, which, when the searcher clicks on network information provider about the search results pages of search terms, the network information providers pay per click. 在这种情况下,如果搜索器故意多次点击一个特殊的搜索项,则与搜索项有关的网络信息提供商必须支付额外的费用。 In this case, if the search is repeated intentionally click on a specific search term, the search term related network information provider must pay an additional fee. 因此,甚至在这种情况下也必须要检测无效点击,其意图是只增加点击量而实际上没有对搜索项进行搜索。 Thus, even in this case must also detect invalid clicks, the intention is only to increase traffic without actually searching for a search term.

发明内容 DISCLOSURE

本发明被提供来解决上述的先有技术中的问题。 The present invention is provided to solve the prior art problems described above. 本发明的一个目的是提供用于检测搜索项的无效点击的方法和设备,搜索项包括在一个由因特网搜索引擎服务器提供的搜索结果网页内。 It is an object of the present invention is to provide a method and apparatus for detecting invalid clicks search term, the search term included in one provided by an Internet search engine server search results pages.

本发明的另一个目的是提供用于检测无效点击的方法和设备,其可以检测不正当增加搜索项的点击量的各种尝试,并且可以立即应付这些尝试。 Another object of the present invention is to provide a method and apparatus for detecting invalid clicks, which can increase the detection of illicit traffic of various attempts to search for items, and you can try to deal with them immediately.

本发明的另一个目的是提供一个用于检测无效点击的方法和设备,其中,为了检测无效点击而提供的几个标识符很难被仿造或伪造。 Another object of the present invention is to provide a method and apparatus for detecting invalid clicks, in which several identifier is provided in order to detect invalid clicks difficult to counterfeit or forged.

为了达到上述目的并解决先有技术中的上述问题,本发明提供了一个 In order to achieve the above object and solve the above-described prior art problems, the present invention provides a

在因特网搜索引擎中检测无效点击的方法,包括下列歩骤:响应于来自搜索器的搜索请求产生一个搜索结果网页,获取一个对应于被产生网页的页面标识符,从搜索器接收包括在搜索结果网页内的搜索项的点击,获取一个对应于被点击搜索项的站点标识符,并且如果页面标识符和站点标识符与在预定时段内的其它点击有关的页面标识符和站点标识符一致,则确定该点击是无效的。 Click way in the Internet search engine to detect invalid, ho includes the following steps of: in response to a search request from the searcher generates a search results page, get a corresponding to the generated web page identifier included in the search results received from the search unit Click on the search terms within the page, click on to get a search term corresponding to the site identifier, and if the page identifier and a site identifier within a predetermined period and click on the relevant pages of the other identifiers and site identifier matches determine that the click is invalid.

根据本发明的方面提供了一个用于在因特网搜索引擎中检测无效点击的方法,包括下列步骤:响应于来自搜索器的搜索请求产生一个搜索结 Provides a method for detecting invalid clicks in an Internet search engine in accordance with aspects of the present invention includes the steps of: generating a response to a request from the search results on search searcher

果网页,获取一个包括在搜索器终端存储的会话cookie文件内的会话标识 If the page to get an identity in the session including the session cookie file finder terminal stored

符,从搜索器接收一个包括在搜索结果网页内的搜索项点击,获取一个对应于被点击搜索项的站点标识符,并且如果会话标识符和站点标识符与预定时段内与其它点击有关的会话标识符和站点标识符一致,则确定该点击是无效的。 Symbol, the searcher receives from one included in the search results page click on the search term, get a click on the search term corresponding to the site identifier, and if the session identifier and site identifier within a predetermined period of time and other click-related sessions identifier and site identifier is consistent, it is determined that the click is invalid.

根据本发明的方面提供了一个用于在因特网搜索引擎中检测无效点击的方法,包括下列步骤:从搜索器接收包括在搜索结果网页内的搜索项的点击,获取一个对应于搜索器终端的客户机IP地址,获取一个对应于被点击的搜索项的站点标识符,并且如果客户机IP地址和站点标识符与预定时段内的其它点击有关的客户机IP地址和站点标识符一致,则确定该点击是无效的。 Provides a method for detecting invalid clicks in an Internet search engine in accordance with aspects of the present invention includes the steps of: receiving from a searcher included in the search results page click on the search term, get a terminal corresponding to the search of clients IP address, get a site corresponding to the clicked search item identifier, and if the client IP address and the site identifier and other click predetermined time period relevant to the client IP addresses and site identifiers match, it is determined that Click is invalid.

根据本发明的方面提供了一个用于在因特网搜索引擎中检测无效点击的方法,包括下列步骤:响应于来自搜索器的搜索请求产生一个搜索结果网页,获取一个对应于搜索器终端的终端标识符,产生一个包括终端标 Provides a method for detecting invalid clicks in an Internet search engine in accordance with aspects of the present invention includes the steps of: in response to a search request from the searcher generates a search results page, get a terminal corresponding to the terminal identifier search generating a includes a terminal standard

识符的用户cookie文件然后把用户cookie文件存储在搜索器终端中,从搜 User cookie file identifier and cookie files stored on the user's terminal search from search

索器接收一个包括在搜索结果网页内的搜索项点击,获取一个对应于被点击搜索项的站点标识符,并且如果终端标识符和站点标识符与预定时段内的其它点击有关的终端标识符和站点标识符一致,则确定该点击是无效的。 Receives a cable included in the search results page click on the search term, get a click on the search term corresponding to the site identifier, and if the terminal identifier and the site identifier and other click predetermined time period relevant terminal identifier and Site identifiers match, it is determined that the click is invalid.

根据本发明的另一个方面提供了一个用于检测无效点击的设备,其中,如果搜索器点击包括在由因特网搜索引擎提供的搜索结果网页内的搜索项,则至少搜索器终端的IP地址、搜索器终端所属的网络地址、与搜索 Provided in accordance with another aspect of the present invention, an apparatus for detecting invalid clicks, which, if the searcher clicks include in provided by an Internet search engine search results page search term, at least the IP address of the searcher terminal, search terminal network address belongs, and search

结果网页有关的搜索字、搜索器的web浏览器的相关信息、与存储在搜索器终端中的点击和cookie文件信息有关的点击时间、与搜索项有关的URL 信息的其中一个被接收,并且基于一个根据被接收信息预定的标准(reference)来确定该点击是否无效。 Results web browser page related to a search word, search the relevant information, with one click and cookie file information about the time of the click, and search for items stored in the searcher's terminal in the relevant URL information one is received, and based on one to determine whether the hit is invalid based on the received information of a predetermined standard (reference).

根据本发明的另一个方面提供了一个用于检测无效点击的设备,包括(1)一个日志存储单元,其响应于搜索器点击包括在由因特网搜索引擎提供的搜索结果网页内的搜索项,来存储一个至少与下列两项有关的日志: 搜索器终端的IP地址,搜索器终端所属的网络地址,与搜索结果网页有关的搜索字,搜索器的web浏览器的相关信息,与点击有关的点击时间、存储在搜索器终端中cookie文件信息和与搜索项有关的URL信息,(2)—个无效点击模型存储单元,其存储与至少下列中两个有关的无效点击模型:搜索器终端的IP地址、搜索器终端所属的网络地址、与搜索结果网页有关的搜索字、搜索器的web浏览器的相关信息、与点击有关的点击时间、存储在搜索器终端中的cookie文件信息、和与搜索项有关的URL信息,和(3) 一个无效点击决定单元,其基于日志存储单元中存储的日志和无效点击模型存储单元中存储的无效点击模型来确定搜索点击是否是一个无效点击。 According to another aspect of the present invention to provide a detection of invalid clicks, comprising (1) a log storage unit in response to the searcher clicks include in provided by an Internet search engine search results pages search terms to At least two stores a log related to the following: IP address of the searcher's terminal, network address searcher's terminal belongs, and search results pages relevant to the search word, web browser, search the relevant information, and click on the relevant Click time, cookie information and documents related to the search term URL information stored in the searcher's terminal, (2) - invalid click pattern storage unit that stores associated with at least two of the following invalid click pattern: the searcher terminal IP cookie file information address, network address searcher's terminal belongs, and search results pages relevant to the search word, web browser, search the relevant information, and click on the relevant time of the click, the search is stored in the terminal, and with the search URL information about the item, and (3) an invalid click decision unit, based on the log storage unit to store logs and invalid Invalid Clicks model click model storing unit to determine whether a search clicks invalid clicks.

根据本发明的另一个方面提供了一个用于检测无效点击的设备,包括一个点击计数器装置,用于针对包括在由因特网搜索引擎提供的搜索结果网页内的搜索项,计数预定时段内每个搜索项的搜索器点击量, 一个平均点击量计算装置,用于在预定时段内计算属于搜索项所属类别的搜索项的平均点击量,和一个决定装置,用于确定每个搜索项的点击量是否比平均点击量大一个预定的差。 Provided in accordance with another aspect of the present invention, an apparatus for detecting invalid clicks, including a hit counter means for each of the search for a predetermined period of time is included in the provided by an Internet search engine search results page search term, count The searcher clicks items, an average amount of traffic calculating means for average amount of traffic a search term is calculated within a predetermined period belong to the category for the search term, and a decision means for determining for each of your search term is a predetermined difference than the average click volume.

根据本发明的另一个方面提供了一个用于检测无效点击的设备,包括一个点击计数器装置,用于针对包括在由因特网搜索引擎提供的搜索结果网页内的搜索项,计数预定时段内每个搜索项的搜索器点击量, 一个平均点击量计算装置,用于在搜索结果网页中在预定时段内计算位于搜索项较高端的搜索项的预定第一数量和位于搜索项较低端的搜索结果的预定第二数量的平均点击量,和决定装置,用于确定每个搜索项的点击量是否比平均点击量大一个预定的差。 Provided in accordance with another aspect of the present invention, an apparatus for detecting invalid clicks, including a hit counter means for each of the search for a predetermined period of time is included in the provided by an Internet search engine search results page search term, count The searcher clicks items, an average traffic calculation means for a predetermined first number of the search results page within a predetermined time period to calculate the higher end in the search term search terms in the search term and the lower end of the search results Book the second number of average traffic, and determining means for determining whether each of your search terms greater than the average number of clicks a predetermined difference.

无效点击很难精确地定义,并且无效点击的范围应该取决于实施例和应用来不同地定义。 Invalid clicks is difficult to define precisely, and should depend on the scope of invalid clicks embodiments and applications to be defined differently. 然而,无效点击可能指的是以只增加点击量而不以实际搜索为目的而做出的点击。 However, invalid clicks may refer is not only increase traffic to the actual search for the purpose of making a click. 附图说明 Brief Description

图1是一个示意图,说明因特网搜索服务器的一个网络连接,包括用于检测无效点击的设备和根据本发明的客户机终端。 Figure 1 is a schematic diagram illustrating a network like the Internet search server connections, including apparatus for detecting invalid clicks and according to the client terminal of the invention.

图2是一个说明由因特网搜索引擎产生的搜索结果网页的示意图。 Figure 2 is a schematic diagram generated by an Internet search engine search results page description. 图3是一个说明根据本发明实施例来检测无效点击的设备结构的框图。 Figure 3 is an illustrative embodiment of the present invention according to a block diagram for detecting invalid clicks structure.

图4是一个根据本发明实施例来检测无效点击的方法流程图。 Figure 4 is an embodiment of the present invention, a method for detecting invalid clicks flowchart. 图5显示了根据本发明实施例的示例的日志文件。 Figure 5 shows the log file example according to the present invention implementation. 图6a和6b是一个根据本发明实施例来检测无效点击的方法流程图。 Figure 6a and 6b is a flowchart of a method embodiment of the present invention to detect invalid clicks. 图7显示了一个根据本发明实施例的示例的日志文件。 Figure 7 shows an example of the log file in accordance with this embodiment of the invention. 图8是一个根据本发明实施例来产生会话标识符的方法流程图。 Figure 8 is a flowchart of a method embodiment of the present invention to generate the session identifier. 图9是一个根据本发明实施例来检测无效点击的方法流程图。 FIG. 9 is a flowchart of a method embodiment of the present invention to detect invalid clicks. 图10显示了一个根据本发明实施例的示例的日志文件。 Figure 10 shows an example of the log file in accordance with this embodiment of the invention. 图11是一个根据本发明实施例来检测无效点击的方法流程图。 Figure 11 is an embodiment according to the present invention a method for detecting invalid clicks flowchart. 图12是一个说明通用计算机系统的结构的框图,该系统可用于创立一个搜索引擎服务器和一个用于根据本发明检测无效点击的设备。 FIG. 12 is a block diagram illustrating the structure of a general purpose computer system, the system can be used to create a search engine server and one for invalid clicks according to the present invention detects the device.

具体实施方式 DETAILED DESCRIPTION

在下文中,本发明的优选实施例将参考附图被详细描述。 Hereinafter, the preferred embodiments of the present invention will be described in detail with reference to the accompanying drawings. 图1是一个示意图,说明包括用于检测无效点击的设备和根据本发明的客户机终端的因特网搜索服务器的网络连接。 Figure 1 is a schematic diagram illustrating comprises means for detecting invalid clicks and connections based on network client terminal of the present invention, the Internet search server.

尝试不公平点击的搜索器或作弊器经由连接到因特网103的客户机终端101来访问因特网搜索服务器104。 Try clicking the search unfair or cheating via connection to the Internet 103, the client terminal 101 to access the Internet search server 104. 作弊器通过多次点击由因特网搜索服务器104提供的搜索结果网页中的搜索项来增加点击量。 Cheater to increase traffic through multiple clicks provided by an Internet search server 104 search results pages of search terms. 例如在图2 中,假定搜索项202是一个与http:〃www.invalidclick.com有关的搜索项, 并且作弊器不断地点击搜索项202以便于搜索项202被显示在搜索结果网页的顶端。 For example, in Figure 2, assuming that the search term 202 is a http: 〃www.invalidclick.com related search terms, and cheating is constantly click on the search term in order to search for items 202 202 is displayed at the top of the search results page.

当客户机终端101被连接到搜索引擎服务器104或其它网络站点时, cookie文件102是一个由搜索引擎服务器104或其它网络站点存储在客户机终端101的硬盘中的特殊的文本文件。 When the client terminal 101 is connected to the search engine server 104 or other network sites, cookie file 102 is a special text file by the search engine server 104 or other network sites stored on the hard disk of the client terminal 101 in. 在用于连接网络站点的HTTP协 Association for the HTTP connection network sites

9议中,每个对网页的请求都与其它请求无关。 9 meetings, each request for a Web page are independent of other requests. 因此,网络服务器不具这样 Thus, the network server does not have such a

的信息,即哪个页面先前已经被发送到客户机终端101或者客户机终端 The message that the page which has been previously sent to the client terminal 101 or the client terminal

101先前已经执行了什么工作。 101 has been previously performed any work. 因此,为了关联像这样独立处理的各个请 Therefore, each request to be processed independently associated with something like this

求, 一个cookie文件被提供。 Seeking, a cookie file is provided. 这类cookie文件服务允许网络服务器把用户信息存储在用户的计算机中。 Such cookie file services allows the network server to the user's computer, the user information storage. 为了在本发明中检测无效点击,甚至可以使用几个cookie文件。 In the present invention, in order to detect invalid clicks, you can even use several cookie files. 这将在后面被详细描述。 This will be described in detail later.

日志文件105是一个用于存储与用户点击模型相关的几个日志的文件。 105 log file is stored in one file associated with a user clicks on the model for several logs. 在本发明中,为了检测无效点击而使用几个参数。 In the present invention, in order to detect invalid clicks using several parameters. 在与各个点击有关的参数被存储在日志文件中之后,基于预定的规则和模型来确定输入点击是否无效。 With each click on the relevant parameters are stored in the following log files based on predefined rules and models to determine whether the invalid input click. - -

根据本发明实施例的日志文件的例子如图5、 7和10中所示。 According to an example embodiment of the log file of the present invention embodiment of FIG. 5, 7 and 10 in FIG. 图3是一个说明根据本发明实施例来捡测无效点击的设备结构的框图。 Figure 3 is an illustrative embodiment of the present invention according to pick a block diagram of the measured invalid clicks structure.

根据本发明实施例来检测无效点击301的设备包括参数输入单元304、日志存储单元305、无效点击模型存储单元306、无效点击验证单元307、无效点击报告单元308和无效点击决定单元309。 According to an embodiment of the present invention to detect invalid clicks device 301 includes a parameter input unit 304, a log storage unit 305, the invalid click pattern storage unit 306, invalid click verification unit 307, the invalid click report unit 308 and the invalid click decision unit 309.

如果搜索器点击包括在由因特网搜索引擎提供的搜索结果网页内的搜索项,则与该点击有关的几个参数302被输入到参数输入单元304。 If the searcher clicks include in provided by an Internet search engine search results page search item, then click a few parameters, and the 302 is input to the parameter input unit 304. 这些参数是用于确定无效点击的基本信息,并且包括搜索器终端的IP地址、 搜索器终端所属的网络地址、与搜索结果网页有关的搜索字、搜索器的web浏览器的相关信息、与点击有关的点击时间、存储在搜索器终端中的cookie文件信息、与搜索项有关的URL信息等等。 These parameters are used to determine basic information about invalid clicks, and includes the IP address of the searcher's terminal, network address searcher's terminal belongs, and search results pages relevant to the search word, web browser, search the relevant information, and click Click on the relevant time, cookie files stored in the searcher's terminal information, and search terms related URL information and so on.

如果搜索器向因特网搜索引擎服务器104请求一个搜索,则搜索请求分组从客户机终端101被传递到因特网搜索引擎服务器104。 If the searcher to the Internet search engine server 104 requests a search, the search request packet from the client terminal 101 is transmitted to the Internet search engine server 104. 搜索请求分组包括一个根据HTTP协议的分组配置并且还被包含在因特网(IP:网际协议)分组内。 Search request packet includes a packet according to the configuration of the HTTP protocol and is also contained in the Internet (IP: Internet Protocol) within the packet. 因为源IP地址字段被包括在因特网协议分组的配置内,所以因特网搜索引擎服务器104从点击所请求的搜索请求分组提取一个源IP地址,从而提取搜索器终端的IP地址。 Because the source IP address field is included in the configuration of Internet Protocol packets, so the Internet search engine server 104. Click the requested search request packet extracting a source IP address, thereby extracting the IP address of the searcher's terminal.

源IP地址的前部分是搜索器终端所属的网络地址。 Front part of the source IP address is the network address of the searcher's terminal belongs. IP地址由4个字节组成。 IP address consists of 4 bytes. IP地址的前部分是一个用于识别搜索器终端所属网络的网络地址,而其剩余部分是用于识别网络内的搜索器终端的地址。 The front part of the IP address is a network address for identifying the searcher's terminal belongs to a network, while the remaining portion is used for the searcher terminal identification address within the network. 因此,网络地址从源1P地址中被提取。 Therefore, the network address is extracted from the source 1P addresses. 根据本发明的实施例,IP地址前部分的3个字节被认为是一个网络地址并且该网络地址从源IP地址被获得。 According to an embodiment of the present invention, the three bytes of the IP address of the front portion is considered a network address and the network address is obtained from the source IP addresses. 例如,如果源IP地 For example, if the source IP

址是123.45.67.89,则123.45.67被提取为一个网络地址。 Address is 123.45.67.89, then 123.45.67 is extracted as a network address.

与搜索结果网页有关的搜索字是一个由搜索器输入因特网搜索服务器104的值。 And search results pages relevant to the search word is a value Internet search server 104 entered by the searcher.

搜索器的web浏览器的相关信息是web浏览器上的信息,所述web浏览器被装载在搜索器的客户机终端101中并被用来访问因特网搜索服务器104。 Web browser search the relevant information is the information on the web browser, the web browser is loaded searcher client terminal 101 and is used to access the Internet search server 104. web浏览器的相关信息包括web浏览器的类型、web浏览器的版本、web浏览器的产品ID等等。 web browser-related information includes the type of web browser, web browser version, web browser product ID and so on. 特别地,即使当多个搜索器具有相同类型和相同版本的web浏览器时,它们的web浏览器的产品ID也可能不同。 In particular, even when a plurality of search that has the same type and the same version of the web browser, their web browser product ID may be different. 从而,它变成了用于识别一个搜索器终端的有用信息。 Thus, it becomes useful information for identifying a searcher terminal.

根据被用于连接到网络的HTTP协议,客户机的一部分环境参数被包括在HTTP分组内来传送到网络服务器。 According to the HTTP protocol is used to connect to the network, the client part of the environmental parameters are included in the HTTP packet is transmitted to the network server. 网络服务器的程序(搜索弓i擎程序) 可以接收环境参数并且可以使用这些参数来检测无效点击。 Network server programs (search engine procedures bow i) may receive environmental parameters and can use these parameters to detect invalid clicks.

这类环境参数包括下列信息: Such environmental parameters include the following information:

REMOTEJHOST:被连接者的域名 REMOTEJHOST: connected by name

REMOTE—ADDR:被连接客户机主机的IP地址 REMOTE-ADDR: IP address to be connected to the client hosts

REMOTE—USER:被连接者的名字(在网络服务器设置了用户验证的情况下显示) REMOTE-USER: the names are connected (in network server set up user authentication display case)

REMOTE一USER:被连接者的! A REMOTE USER: connected persons! D(在网络服务器设置了用户验证的情况下被显示) D (on a network server set up user authentication condition is shown)

HTTP—USER—AGENT:被连接者驱动的程序的相关注册信息, 一般来说是浏览器的名称 HTTP-USER-AGENT: be connected relevant registration information-driven program, in general is the browser name

HTTP_ACCEPT—LANGUAGE:被连接者使用的语言HTTP—REFERER:呼叫对应CGI程序的文档名称REQUEST—METHOD:向服务器传输数据的方法(GET,POST) QUERY—STRING:当数据以GET模式发送时,发送数据的被存储 HTTP_ACCEPT-LANGUAGE: Language HTTP-REFERER be connected by use of: CGI programs call the corresponding document name REQUEST-METHOD: data transfer to the server method (GET, POST) QUERY-STRING: When data is sent in GET mode, the transmission data It is stored

参数 Parameters

CONTENT—LENGTH:当数据以POST模式被发送时,被发射数据的总长度(字节数) CONTENT-LENGTH: POST mode when the data is transmitted, the total length of the data to be transmitted (in bytes)

CONTENT—TYPE:当数据以POST模式被发射时,数据的MIME CONTENT-TYPE: POST mode when the data is transmitted, the data MIME

类型 Type

AUTH一TYPE:用于确认用户授权的参数SERVER—NAME:当前服务器的域名 AUTH a TYPE: for confirming user authorization parameters SERVER-NAME: The current domain name server

SERVER—SOFTWARE:当前安装在服务器上的网络服务器程序的 SERVER-SOFTWARE: Current installed on the server's network server program

名称 Name

SERVER一PROTOCOL:服务器当前使用的网络协议的名称和版本SERVER_PORT:服务器当前所使用的端口数(在HTTP的情况下一般是80) SERVER a PROTOCOL: server currently used network protocol name and version SERVER_PORT: port number currently used by the server (in the case of HTTP is generally 80)

PATH—INFO:被呼叫的CGI程序的当前路径的信息PATH—TRANSLATED:网络要求的网络服务器中的当前资源路径的相关信息 Related information network server network resources required by the current path: PATH-INFO: Information PATH-TRANSLATED current path to be called CGI programs

SCRIPT—NAME:当前正在被呼叫的CGI程序的名称HTTP_ACCEPT:当前可以以HTTP接收的资源的类型与搜索器的点击有关的点击时间是来自搜索器的点击输入被接收的时间。 SCRIPT-NAME: the current CGI program is being called the name of HTTP_ACCEPT: current type and searchers can click HTTP resources received related to time click-click input from the searcher is received. 根据本发明的另一个实施例,与搜索器的点击时间有关的其它时间可以被使用。 According to another embodiment of the present invention, the other time a searcher clicks and time-dependent may be used. 例如,可以使用搜索器实际上将点击输入客户机的时间。 For example, you can use the search is actually a click input time client.

存储在搜索器终端中的cookie文件上的信息被因特网搜索服务器104 获得,其中因特网搜索服务器104访问存储在客户机终端101中的cookie 文件102。 Information stored on the searcher terminal cookie file is an Internet search server 104 obtains, cookie file which Internet search server 104 access data stored in the client terminal 101 102. 在本发明中,cookie文件102可以被用于多种用途。 In the present invention, cookie file 102 may be used for various purposes. 这将参考 This reference

其它实施例被详细描述。 Other embodiments are described in detail.

与搜索器点击的搜索项有关的URL信息可以通过查阅搜索数据库而获得,因为它被存储在与搜索引擎服务器104有关的搜索数据库(未示出) 中。 And a searcher clicked URL information about a search term can be obtained by referring to the search database, because it is stored in the search engine server 104 related searches database (not shown). URL信息可以是网络服务器的域名或包括域名、目录和文件名的信息。 URL information may be the network server domain name or domain name, directory and filename information. 例如,http:〃www. naver.com和http:〃www.naver.com/download是相同的,因为它们是鉴于域名的www.naver.com,但是具有不同的URL。 For example, http: 〃www naver.com and http:. 〃www.naver.com / download are the same since they are in view of the domain name www.naver.com, but having a different URL. 在本发明中,使用URL及至域名的实施例已经为了解释起见进行了说明。 In the present invention, the embodiment uses a URL domain name of up to explain the reasons already described. 然而, 本发明覆盖了所有的实施例,其中,如果URL尽管其域名相同但是具有不同的目录(因为它们包括了域名、目录和文件名全部),则URL被认为是不同的搜索项。 However, the present invention covers all of the examples, which, despite its domain name if the same URL but with a different directory (because they include a domain name, directory and file name of all), the URL is considered to be a different search term. 此外应当理解,在本发明中,URL信息包括根据这个说明书的所有实施例。 Moreover, it should be understood that in the present invention, URL information includes all the embodiments of this specification.

此外,除了上述的参数之外,在本发明的精神内,被用于检测无效点击的其它参数也可以被用来检测无效点击。 Furthermore, in addition to the above parameters, within the spirit of the invention, it is used to detect invalid clicks other parameters can also be used to detect invalid clicks.

上述种类的参数302被输入到参数输入单元304。 The above-mentioned types of parameters 302 is input to the parameter input unit 304. 这些参数又被存储在日志存储单元305中。 These parameters are then stored in the log storage unit 305. 根据本发明,存储在日志存储单元中的日志的例子如图5、 7和10中所示。 According to the present invention, an example of a log stored in the log storage unit in Fig. 5, 7, and 10 in FIG. 在这些附图中,只包括一部分参数的日志被显示以用于解释。 In these drawings, logs including only some parameters are shown for explanation. 然而,根据本发明的另一个实施例,包括全部或一部分参数302的日志可以被存储在日志存储单元305中。 However, according to another embodiment of the present invention, including all or part of the log parameters 302 may be stored in the log storage unit 305.

根据本发明的一个实施例,日志存储单元305在其中存储关于至少下列两项的日志:搜索器终端的IP地址、搜索器终端所属的网络地址、与搜索结果网页有关的搜索字、搜索器的web浏览器的相关信息、与点击有关的点击时间、存储在搜索器终端中的cookie文件信息和与搜索项有关的URL信息。 305 in which to store a log on at least the following two cases, the log storage unit in accordance with one embodiment of the invention: IP address of the searcher's terminal, network address searcher's terminal belongs, and search results pages relevant to the search word, search the web browser-related information, with the click click on the relevant time, cookie files stored in the searcher's terminal information and search terms related URL information. 根据本发明的一个优选实施例,日志存储单元305在其中存储一个关于至少下列一项的日志:搜索器终端的IP地址、搜索器终端所属的网络地址、与搜索结果网页有关的搜索字、搜索器的web浏览器的相关信息、与点击有关的点击时间、存储在搜索器终端中的cookie文件信息和与搜索项有关的URL信息。 According to a preferred embodiment of the present invention, the log storage unit 305 stores therein a log regarding at least one of the following: IP address of the searcher's terminal, network address searcher's terminal belongs, and search results pages relevant to the search word, search the web browser-related information, and click the relevant time of the click, cookie files stored in the searcher's terminal information and search terms related URL information.

无效点击型式存储单元306在其中存储一个与至少下列两项的一对有关的无效点击模型或规则:搜索器终端的IP地址、搜索器终端所属的网络地址、与搜索结果网页有关的搜索字、搜索器的web浏览器的相关信息、 与点击有关的点击时间、存储在搜索器终端中的cookie文件信息和与搜索项有关的URL信息。 Invalid click pattern storage unit 306 stores therein a pair of at least the following two related invalid clicks model or rule: IP address of the searcher's terminal, network address searcher's terminal belongs, and search results pages relevant to the search word, web browser searcher information, cookie files associated with one click-click, stored in the terminal information search and search terms related URL information. 例如,搜索器终端的IP地址和与搜索项有关的URL 信息型在10分钟内的点击输入中彼此一致的规则或模型可以被存储在无效点击模型存储单元306中。 For example, IP addresses and search terms relevant to the searcher terminal type URL information in the click input within 10 minutes of each other and consistent rules or models can be stored in the invalid click pattern storage unit 306. 同样地,用于确定无效点击的被存储在无效点击模型存储单元306中的规则等等可以用文件的形式存储,该文件使用根据预定规则的预定语言。 Similarly, for determining invalid clicks are invalid click pattern stored in the storage unit 306 rules, etc. can be stored as a file, the file using a predetermined language according to a predetermined rule. 或者,在上述规则或模型的情况下,它可以用程序的形式被存储以便于它被确定是一无效点击。 Alternatively, in the case of the above rules or models, which can be stored in the form of a program so that it is determined to be an invalid click.

无效点击决定单元309基于日志存储单元305中存储的日志和无效点击模型存储单元306中存储的无效点击模型来确定搜索器点击是否是无无效点击报告单元308向因特网搜索引擎的管理员303报告与点击中的预定标准一致的点击,其被无效点击决定单元309确定无效。 Invalid click decision unit 309 based on the log storage unit 305 stores logs and invalid click pattern storage unit 306 stores invalid clicks models to determine whether searchers click no invalid click report unit 308 to the Internet search engine administrator 303 report Click on predetermined criteria consistent clicks, which are invalid click decision unit 309 determines invalid. 根据本发明的一个实施例,无效点击报告单元308向因特网搜索引擎的管理员报告所有被无效点击决定单元309确定为无效的点击。 According to one embodiment of the present invention, the invalid click report unit 308 Internet search engine administrator reports to all invalid click decision unit 309 is determined to be invalid clicks. 在这种情况下,预定标准是已经被无效点击决定单元309确定为无效的所有点击。 In this case, the predetermined criteria have been invalid click decision unit 309 determined to be invalid all clicks. 根据本发明的另一个实施例,指示是否向管理员303报告对应于规则或模型的情况的字段被存储在无效点击模型存储单元306中储存的每个规则或者模型中。 According to field another embodiment of the present invention, indicating whether the administrator 303 reports to correspond to the rules or models is stored in the invalid click pattern storage unit 306 to store each rule or model. 在这种情况下,在对应于管理员303必须被通知的规则的情况下,无效点击报告单元308将其报告给管理员303。 In this case, in the case of 303 corresponds to the rule managers must be notified, invalid click report unit 308 reports it to the administrator 303.

无效点击验证单元307允许管理员303把已经被无效点击决定单元309确定为无效的点击改变成有效点击。 Invalid click verification unit 307 allows the administrator 303 has been invalid click decision unit 309 determined to be invalid clicks changed to valid clicks. 因为无效点击验证单元307可以把误定为无效点击的点击改变成有效点击,所以无效点击可以被更精确地确定。 Because invalid click verification unit 307 can be mistaken as invalid clicks clicks changed to valid clicks, so invalid clicks can be determined more accurately.

图4是一个根据本发明实施例来检测无效点击的方法流程图。 Figure 4 is an embodiment of the present invention, a method for detecting invalid clicks flowchart.

因特网搜索服务器104从搜索器接收一个搜索请求(步骤401)。 The Internet search server 104 receives a search request (step 401) from the Finder. 如果搜索器访问因特网搜索服务器104然后输入搜索字,则该搜索字作为搜索请求分组被传送到因特网搜索服务器104。 If the search is access to the Internet search server 104 and enter the search word, the search word as a search request packet is transmitted to the Internet search server 104.

因特网搜索服务器104响应于该搜索请求产生一个搜索结果网页(步骤402)。 The Internet search server 104 in response to the search request to produce a search results page (step 402). 例如图2中所示,包括多个对应于搜索器输入搜索字的搜索项的搜索结果网页被提供给搜索器。 For example in Figure 2, including a plurality of corresponding to the search input search word search terms search results page is provided to the searcher.

对应于产生的搜索结果网页的页面标识符被获取(步骤403)。 Search results page page identifier is generated corresponding to the acquisition (step 403). 每当产 Whenever production

生搜索结果网页的时候就产生一个页面标识符。 Health search results page when generating a page identifier. 页面标识符是一个用于识别搜索结果网页的标识符。 Page identifier is a recognition of an identifier for the search results page. 因此,如果相同的搜索器通过重复地向因特网搜索服务器104的搜索窗中输入相同的搜索字,则每次都分配一个新的页面标识符。 Therefore, if the same search repeatedly entering the same through the Internet search server 104 search word search window, each time a new page is assigned an identifier. 同样地,如果搜索器点击显示搜索结果网页的web浏览器中的"reload (重新加载)",则因特网搜索服务器104向搜索结果网页分配一个新的页面标识符,因为搜索请求分组从客户机终端101传送到因特网搜索服务器104。 Likewise, if the searcher clicking displays the search results page web browser "reload (reloaded)", the Internet search server 104 assigns a new identifier to the search results page page because search request packet from the client terminal 101 transferred to the Internet search server 104. 不同的页面标识符被分配给乍一看相同的搜索结果网页是可能的。 Different page identifier is assigned to the same search results page at first sight is possible. 然而,如果新的搜索请求从客户机终端101被接收,则搜索结果网页在那时被重新产生。 However, if the new search request is received from the client terminal 101, the search results page is regenerated at that time. 不同于先前的搜索结果网页的搜索结果网页从而可以被提供。 Unlike previous search result pages for the search results page which can be provided.

在步骤404中,因特网搜索服务器104从搜索器接收一个包括在搜索结果网页内的搜索项的点击。 In step 404, the Internet search server 104 receives from the search includes in the search results page click on the search term. 如果点击被接收,则因特网搜索服务器104 允许用于搜索项的超链接来连接因特网搜索服务器104,允许因特网搜索服务器104执行必要的处理,然后允许客户机终端访问对应于该搜索项的网络站点。 If the click is received, the Internet search server 104 allows a hyperlink for the search item to connect to the Internet search server 104, allows the Internet search server 104 performs the necessary processing, and then allows a client terminal to access the search term corresponding to the network site. 例如, 在 For example, in

http:〃www.naver.com/abc^http:〃www.invalidclick.com/被准备作为对应于"http〃www.invalidclick.com/"的搜索项超链接的情况下,如果搜索器点击该搜索项,则搜索被允许以访问称作^口://,\^.naver.com的搜索服务器。 http: 〃www.naver.com / abc ^ http: under 〃www.invalidclick.com / is prepared as corresponding to "http〃www.invalidclick.com /" search term hyperlinks, if the searcher click on the search item, the search has been allowed to access is called ^ mouth: //, \ ^ .naver.com search server. 搜索服务器允许客户机终端根据位于超链接后侧的URL来访问http:〃www.invalidclick.com。 Search server allows a client terminal according to the rear URL hyperlink located to visit http: 〃www.invalidclick.com.

因特网搜索服务器104获取一个对应于被点击搜索项的站点标识符(步骤405)。 The Internet search server 104 acquires a search term corresponding to the click site identifier (step 405). 站点标识符是一个用于识别搜索项的标识符,并且基于对应于搜索项的URL信息来产生。 Site identifier is an identifier for identifying the search terms, and to generate corresponding to the search term based on the URL information. 根据本发明的另一个实施例,站点标识符使用对应于搜索项的原URL信息。 According to another embodiment of the present invention, the site identifiers corresponding to the search key from the original URL information. 用作产生站点标识符的基本信息的URL URL used as the basic information generating station identifiers

信息可以是网络服务器的域名或包括域名、目录和文件名在内的信息。 Information can be a network server name or domain name, including the directory and file name information. 例如,http:〃www.naver.com禾口http:〃www.naver.com/download是相同的, For example, http: 〃www.naver.com Hekou http: 〃www.naver.com / download is the same,

因为它们从域名的观点来看都是www.naver.com,但是从URL的观点来看则不相同。 Because they are from the standpoint of the domain name www.naver.com, but from the point of view is not the same URL. 在本发明中, 一个使用URL及至域名的实施例已经为了解释方便起见而进行了说明。 In the present invention, an Example URL domain name has been up for convenience of explanation and described. 然而,本发明覆盖了所有的实施例,其中,如果URL尽管其域名相同但是具有不同的目录(因为它们不仅包括了域名,而且还包括了目录和文件名),则URL被认为是不同的搜索项。 However, the present invention covers all of the examples, which, despite its domain name if the same URL but with a different directory (because they include not only the domain name, but also includes a directory and file name), the URL is considered to be a different search items. 此外应当理解,在本发明中,URL信息包括根据这个说明书的所有实施例。 Moreover, it should be understood that in the present invention, URL information includes all the embodiments of this specification.

在步骤406中,如果页面标识符和站点标识符与预定时段内的其它点击相关的页面标识符和站点标识符一致,则用于检测无效点击的设备确定点击是无效的。 In step 406, if the page identifier and the site identifier page identifiers and other sites with one click predetermined time period associated identifier matches for detecting invalid clicks OK clicking is invalid.

图5显示了根据本发明实施例的示例的日志文件。 Figure 5 shows the log file example according to the present invention implementation. 图4的实施例将参考图5来说明。 4 embodiment will be described with reference to FIG.

根据本发明,每当从用户接收一个搜索项的点击,页面标识符509和站点标识符510就被存储在日志文件500中。 According to the present invention, each time a search is received from the user clicks, page identifiers 509 and 510 items of site identifier is stored in a log file 500. 附图标记501到508指出被存储的各个点击输入的日志。 Reference numerals 501-508 indicate each click is stored input logs.

作弊器访问因特网搜索服务器104以请求一搜索。 Cheating device to access the Internet search server 104 to request a search. 因特网搜索服务器104产生搜索结果网页并产生一个对应于搜索结果网页的页面标识符"nCe249sisnO"。 The Internet search server 104 generates search results pages and generate a corresponding to the search results page page identifiers "nCe249sisnO". 作弊器不断地点击包括在搜索结果网页内的一个特定的搜索项。 Cheater keep clicking included in the search results page for a particular search term. 即使一旦所产生的搜索结果网页中的特定搜索项被不断地点击, 页面标识符也不会被重新产生。 Even if the resulting search results page specific search term is keep clicking, page identifiers will not be regenerated. 从而,页面标识符保留了相同的值。 Thus, the page identifier retains the same value.

从而在预定时段内的点击输入日志中,确定具有相同的页面标识符和相同的站点标识符的日志501、日志502和日志504是无效点击。 Click to enter a log so that within a predetermined period of time, it is determined the same page identifier and the same site identifier log 501, 502, and log log 504 is invalid clicks. 根据本发明的一个实施例,确定一致的日志中的一个是无效点击,则剩余的日志是无效点击。 According to one embodiment of the present invention, the log cases, to determine a consistent invalid clicks, the remaining logs are invalid clicks.

作弊器可以通过点击web浏览器中的"reload"来更新搜索结果网页。 Cheating can be updated search results page by clicking on the web browser "reload". 在这种情况下,页面标识符被重新分配并且关于页面标识符的日志是日志505。 In this case, the page is re-assigned identifier and the identifier of the log on the log page 505. 其后,作弊器点击相同搜索项的情况对应于日志506。 Thereafter, the cheater clicks on the same search item corresponding to the log 506.

因此,根据这个实施例,如果作弊器点击"reloads"然后点击相同的搜索项(在日志506的情况下),则它不被确定是一个无效点击。 Therefore, according to this embodiment, if the cheater click "reloads" and then click on the same search term (in the case log 506), it is not determined to be an invalid click. 同样地,用于确定"reload"是无效点击的情况的方法将参考图6在下列实施例中被说明。 Similarly, for determining the "reload" is a method for invalid clicks in reference to FIG. 6 in the following examples are described below.

图6a和6b是一个根据本发明实施例来检测无效点击的方法流程图。 Figure 6a and 6b is a flowchart of a method embodiment of the present invention to detect invalid clicks. 因特网搜索服务器104从搜索器接收搜索请求(步骤601)。 The Internet search server 104 receives a search request (step 601) from the Finder. 因特网搜索 Internet search

服务器104响应于该搜索请求产生一搜索结果网页(步骤602)。 Server 104 in response to the search request to produce a search results page (step 602).

用于确定无效点击的设备确定会话cookie文件是否被存储在请求搜 Request Search for determining invalid clicks determines whether the session cookie file is stored in the

索的客户机终端101中(步骤603)。 Cable client terminal 101 (step 603). 步骤603到步骤611被处理以获得一个 Step 603 to step 611 is processed to obtain a

会话标识符。 The session identifier.

如果确定会话cookie文件没有存储在客户机终端101中,则用于确定无效点击的设备产生一个新的会话标识符(步骤604)。 If it is determined the session cookie file is not stored in the client terminal 101, is used to determine invalid clicks generates a new session identifier (step 604). 在步骤605中,包括会话标识符在内的会话cookie文件被存储在客户机终端101中。 In step 605, the session cookie file includes a session identifier to be stored in the client terminal 101. 会话标识符的更新时间还被存储在会话cookie文件中。 Updated session identifier is also stored in the session cookie file. 更新时间被存储在会话cookie文件中(步骤609)。 Update time is stored in the session cookie file (step 609).

如果确定会话cookie文件在步骤602中存储在客户机终端101中,则用于确定无效点击的设备确定包括会话cookie文件在内的会话标识符的最后更新时间是否在预定时段内(步骤606)。 If it is determined the session cookie file stored in step 602 in the client terminal 101, is used to determine invalid clicks OK last update include the session, including the session identifier cookie file is within a predetermined period of time (step 606).

作为步骤606中的确定结果,如果包括在会话cookie文件内的会话标识符的最后更新时间在预定时段内,则用于确定无效点击的设备提取一个包括在会话cookie文件内的会话标识符(步骤607)。 As a result of the determination in step 606, if the last update is included in the session cookie file of the session identifier within a predetermined period, the apparatus for determining invalid clicks extract a file included in a session cookie session identifier (step 607).

作为步骤606中的确定结果,如果包括没有会话cookie文件内的会话标识符的最后更新时间不在预定时段内,则用于确定无效点击的设备产生一个新的会话标识符(步骤608)。 As a result of the determination in step 606, if you include the last update time of the session identifier no session cookie file that is not within a predetermined period of time, it is used to determine invalid clicks generates a new session identifier (step 608). 包括在会话cookie文件内的会话标识符用重新创建的会话标识符来更新(步骤610)。 Included in the session cookie file of the session identifier with the session identifier is re-created to update (step 610). 会话标识符的更新时间被存储在会话cookie文件中(步骤611)。 Updated session identifier is stored in the session cookie file (step 611).

因特网搜索服务器104从搜索器接收一个包括在搜索结果网页内的搜索项的点击(步骤612)。 The Internet search server 104 receives from a searcher included in the search results page click on the search term (step 612).

因特网搜索服务器104获取一个对应于被点击搜索项的站点标识符(步骤613)。 The Internet search server 104 acquires a search term corresponding to the click site identifier (step 613).

如果会话标识符和站点标识符与在预定时段内与其它点击有关的会话标识符和站点标识符一致,则用于检测无效点击的设备确定该点击是无效点击(步骤614)。 If the session identifier and site identifier within a predetermined period of time with the other click on the relevant session identifier and site identifier matches for detecting invalid clicks determines that the clicks to be invalid (step 614).

图7显示了根据本发明实施例的示例的日志文件。 Figure 7 shows an exemplary log file according to an embodiment of the present invention.

在这个实施例中,每当从用户接收一个搜索项的点击,点击时间710、 会话标识符的更新时间711 、会话标识符712和站点标识符713被存储在日志文件700中。 In this embodiment, each time receiving a search term from a user click, click time 710, 711 update session identifier, the session identifier site identifier 712 and 700 713 are stored in a log file. 附图标记701到708指出对应于各个点击输入存储的日志。 Reference numerals 701-708 indicate input corresponding to respective click logs stored.

作弊器访问因特网搜索服务器104以请求一个搜索请求。 Cheating device to access the Internet search server 104 to request a search request. 因特网搜索服务器104产生一个搜索结果网页。 The Internet search server 104 generates a search results page. 因特网搜索服务器104接收一个包括 Internet search server 104 receives a comprises

在搜索结果网页内包括在内搜索项的点击。 Within the search results page, click on the search terms included.

因特网搜索服务器104确定会话cookie文件是否被存储在客户机终端101中。 Internet search server 104 determines whether a session cookie file is stored in the client terminal 101. 如果确定会话cookie文件没有存储在客户机终端101中,则因特网搜索服务器104产生一个新的会话标识符,并且将其更新时间和包括会话标识符在内的会话cookie文件存储在客户机终端101中。 If it is determined the session cookie file is not stored in the client terminal 101, the Internet search server 104 generates a new session identifier and will update it 101 times and the session cookie file is stored including the session identifier included in the client terminal . 在这个实施例中,会话标识符"xigw9492"和更新时间"10:50:14"被记录。 In this embodiment, the session identifier "xigw9492" and update time "10:50:14" is recorded. 此外,对应于搜索项的点击时间、更新时间、会话标识符和站点标识符作为日志701被存储在日志文件700中。 In addition, corresponding to the time of the click search item, update time, the session identifier and site identifier as the log 701 is stored in the log file 700. 在第一次产生会话cookie文件的情况中,只要在那时还产生点击和会话标识符,会话cookie文件就被产生。 In the case of the first generation session cookie file, just in time still generating clicks and a session identifier, a session cookie file is generated. 从而,点击时间和会话标识符更新时间是相同的。 Thus, the time of the click and the session identifier update times are the same.

作弊器在相同的搜索结果页面中点击相同的搜索项。 Click to cheat the same search terms in the same search results page. 因特网搜索服务器104确定会话cookie文件是否被存储在客户机终端101中。 Internet search server 104 determines whether a session cookie file is stored in the client terminal 101. 因为上述产生的会话cookie文件已经被存储在客户机终端101中,因特网搜索服务器104访问存储在客户机终端101中的会话cookie文件。 Since the above generated session cookie file has been stored in the client terminal 101, the session cookie file access Internet search server 104 stored in the client terminal 101. 会话cookie文件在其中存储一个会话标识符和会话标识符的最后更新时间。 Session cookie file which was last updated in a session identifier and storing the session identifier. 在这个实施例中, 会话标识符"xigw9492"和更新时间"10:50:14"被存储在会话cookie文件中。 In this embodiment, the session identifier "xigw9492" and update time "10:50:14" is stored in the session cookie file.

因特网搜索服务器104确定来自搜索器的搜索项的点击时间是否在从与会话标识符有关的最后更新时间开始的预定时段内。 The Internet search server 104 determines click to search items from the searcher is within a predetermined time period associated with the session identifier from the last update time began. 在这个实施例中,第二点击的点击时间是"10:50:18"。 In this embodiment, the second click-click the "10:50:18." 如果预定时段是5秒,则点击时间"10:50:18"在从最后更新时间"10:50:14"开始的预定时段内。 If the predetermined period of time is 5 seconds, then click on the time "10:50:18" from the last update time "10:50:14" predetermined period began. 同样地, 在这种情况下,存储在会话cookie文件中的会话标识符被用作一个当前的会话标识符并且该会话cookie文件的会话标识符没有被更新。 Likewise, in this case, the session identifier is stored in the session cookie file is used as an identifier for the current session and the session identifier of the session cookie file is not updated. 从而在这种情况下,例如日志702被记录。 Whereby in this case, for example, the log 702 is recorded.

从而,确定日志702是一个无效点击,因为它具有与日志701相同的会话标识符和站点标识符。 Thus, determine the log 702 is an invalid clicks, because it has the same log session identifier and site identifier 701.

日志704对应于其中作弊器请求"reload"的情况。 Log 704 corresponds to a case in which cheating requests "reload" in. 同样地,结果作弊器请求"reload",制定出存储在客户机终端101中的会话cookie文件的标准,并且会话标识符没有被更新,因为存储在会话cookie文件中的最后更新时间在预定时段内。 Similarly, the results of cheating requests "reload", to develop a standard session cookie file is stored in the client terminal 101, and the session identifier has not been updated since the last update time is stored in the session cookie file within a predetermined period . 因此,例如日志704被记录。 Thus, for example, the log 704 is recorded. 因为它和日志701 — 样,所以确定日志704是一个无效点击。 Because it and log 701-- like, so make sure the log 704 is an invalid click. 即,根据这个实施例,有可能检测作弊器在短时间间隔内在点击"reload"之后点击相同的搜索项的情况。 That is, according to this embodiment, it is possible to detect cheating after a short interval inherent click "reload" clicks on the same search term.

日志705对应于这种情况,即相同搜索项的点击从不同于日志701 、 日志702和日志704的搜索器被接收。 Log 705 corresponds to the case that click on the same search item 701 is different from the log, the log 702 and 704 logs are received from the searcher. 在这种情况下,因为新的会话标识符被分配,所以它不被确定为一个无效点击。 In this case, since the new session identifier is assigned, so it is not determined to be an invalid click.

日志709对应于这种情况,即与日志701相同的搜索器在相当多时间之后点击相同的搜索项。 Log 709 corresponds to the case that the log 701 in the same search after considerable time click on the same search term. 在这种情况下,因为点击在相当长时间之后才被接收,所以它不被确定为一个无效点击。 In this case, since the click was only received after quite a long time, so it is not determined to be an invalid click.

根据这个实施例,作弊器在预定时段之后点击相同的搜索项的情况, 因为一个会话标识符被产生,所以它被确定是一个无效点击。 According to this embodiment, after a predetermined period cheat clicks on the same search term, since a session identifier is generated, it is determined to be an invalid click.

同样地,根据本发明的另一个实施例基于无效点击决定来确定这样的情况可能是一个无效点击,即在从相同搜索项的最后点击时间开始的预定时段内做出点击。 Also, according to another embodiment of the present invention based on invalid clicks decision to determine such a situation could be an invalid click, or click to make within a predetermined period of time from the last click on the same search term began. 这将被简单地说明。 This will be briefly described.

如果点击从搜索器被接收,则确定会话cookie文件是否被存储在终端中。 If you click on is received from the searcher, it is determined whether or not the session cookie file is stored in the terminal. 如果确定会话cookie文件被存储在终端中,则确定来自搜索器的搜索项的点击时间是否在从与会话标识符有关的最后点击时间开始的预定时段内。 If it is determined the session cookie file is stored in the terminal, it is determined from the time of the click search term searcher is within a predetermined period of time from the session identifier associated with the last click start time.

如果确定搜索项的点击时间在预定时段内,则包括在会话cookie文件内的会话标识符被获取并且最后点击时间用搜索项的点击时间来更新。 If it is determined click to search items within a predetermined period, the documents included in the session cookie session identifier is acquired and the final time of the click-click the search term used to update.

如果确定搜索项的点击时间不在预定时段内,则新的会话标识符被产生以更新包括在会话cookie文件内的会话标识符。 Click on time if it is determined the search term is not a predetermined period of time, the new session identifier is generated to update the document included in the session cookie session identifier. 此外,最后点击时间用搜索项的点击时间来更新。 In addition, the last time with the click-click search items to update.

例如在图7中,在存在来自于相同客户机终端的相同搜索项的多个点击的情况下,如果确定从最后的点击已经过去了5秒的情况是有效的,则与日志704有关的点击被确定是有效的,因为它在先前的最后点击时间"10:50:18"的13秒后被做出"10:50:31"。 For example, in Figure 7, in the presence of multiple clicks from the same under the same search term client terminal case, if it is determined from the last case of five seconds have elapsed click is valid, then the log 704 clicks related It is determined to be valid, because it finally click in the previous time "10:50:18" 13 seconds later to make "10:50:31."

根据本发明的优选实施例,时间参考根据无效点击的检测目的来决定。 According to a preferred embodiment of the present invention, with reference to time in accordance with the purpose of detecting invalid clicks determined.

图8是一个根据本发明实施例来产生会话标识符的方法流程图。 Figure 8 is a flowchart of a method embodiment of the present invention to generate the session identifier. 会话标识符必须被唯一地分配以便它能与其它的会话标识符区分并且必须很难被仿造或伪造。 The session identifier must be assigned uniquely to it with other session identifiers are and must be difficult to counterfeit or forged. 在会话标识符只被唯一地分配的情况下,存在一个可能性,即作弊器实际上可能产生一个会话标识符然后把会话标识符存储在会话cookie中,或者可能用一个程序不正当地增加点击量,这个程 In the case where the session identifier uniquely allocated only, there is a possibility that a cheater may virtually generate a session identifier and the session identifier stored in the session cookie, or may unduly increase with a traffic program This process

序被驱动来不断地点击搜索项而同时改变会话标识符。 Sequence is driven to constantly click on the search term and also change the session identifier.

源数据801是用于产生会话标识符805的基本数据。 Source data 801 is used to generate the basic data session identifier 805. 源数据可以是当 When the source data can be

前的时间信息、搜索字、搜索器的web浏览器的产品ID等等。 Time information before the search word, web browsers, search the product ID and the like. 源数据可以是随机选择的数量。 Source data may be randomly selected number. 散列函数802被应用到源数据801以产生一个编码串 Hash function 802 is applied to the source data to generate a code string 801

19803。 19803. 然后,校验和被添加到编码串803以产生会话标识符805。 Then, the checksum is added to the code string 803 to generate a session identifier 805. 校验和用来防止作弊器伪造会话标识符。 The checksum is used to prevent cheating forgery session identifier.

用于根据这个实施例产生会话标识符的方法可以被应用来产生一个随后将被说明的页面标识符、站点标识符、终端标识符等等。 A method for generating a session identifier according to this embodiment can be applied to generate a page identifier will be explained subsequently, the site identifier, terminal identifier, and so on.

图9是一个根据本发明实施例来检测无效点击的方法流程图。 FIG. 9 is a flowchart of a method embodiment of the present invention to detect invalid clicks.

因特网搜索服务器104从搜索器接收一个包括在搜索结果网页内的搜索项的点击(步骤901)。 The Internet search server 104 receives from a searcher included in the search results page click on the search term (step 901). 因特网搜索服务器104获取一个对应于搜索器的终端101的客户机IP地址(步骤902)。 The Internet search server 104 acquires a corresponding to the search the IP address of the client terminal 101 (step 902). 客户机的IP地址可以从被接收的IP分组的源IP地址字段中提取。 IP address of the client can be extracted from the source IP address field of the received IP packet.

因特网搜索服务器104获取对应于被点击搜索项的站点标识符(步骤903)。 The Internet search server 104 acquires click on the search term corresponding to the site identifier (step 903).

在步骤904中,如果客户机IP地址和站点标识符与预定时段内其它点击相关的客户机IP地址和站点客户机IP地址一致,则用于搜索无效点击的设备确定该点击无效。 In step 904, if the client IP address and the site identifier within a predetermined period of time other click on the relevant client IP address and client IP address matches the site, is used to search for invalid clicks determines that the click is invalid.

图10显示了根据本发明实施例的示例的日志文件。 Figure 10 shows the log file example according to the present invention implementation.

在这个实施例中,每当从用户接收一个搜索项的点击,点击时间1010、客户机IP地址1011和站点标识符1012就被存储在日志文件1000 中。 In this embodiment, each time a search is received from a user click on the item, click on the time 1010, the client IP address 1011 and 1012 the site identifier is stored in a log file 1000. 附图标记1001到1009指定对应于各个点击输入的所存储的日志。 Numeral 1001-1009 specified input corresponding to the respective click the stored log.

如果相同的客户机终端不断地点击相同的搜索项,则如果点击在预定时段内被重复,则该点击无效的可能性很高。 If the same client terminals continue to click on the same search term, then if clicking is repeated within a predetermined period of time, the possibility of invalid click is high. 然而,往往是这样的情况, 即相同客户机终端的用户在相当长时间之后点击相同的搜索项。 However, this is often the case that the same user client terminals in quite a long time after clicking the same search term. 换言之, 存在一个趋势,即用户往往访问一个它很感兴趣的网络站点。 In other words, there is a tendency that often visit the Web sites a user it is very interesting. 如果用户在短时间内不断地访问一个网络站点,则很难把它看作是一个普通的点击。 If you continue to visit in a short time a network site, it is difficult to see it as a normal click. 从而,这个情况被确定是一个无效点击。 Thus, this case is determined to be an invalid click. 例如,如果时间标准是5分钟, 则具有与日志1001相同的客户机IP地址和相同的站点标识符的日志1002、日志1004和日志1005被确定是无效点击。 For example, if the time standard is 5 minutes, then log 1001 having the same client IP address and the same logging site identifiers 1002, 1004 and logs logs 1005 is determined to be invalid clicks. 确定在大约20分钟中与被点击日志1009相关的点击是有效点击。 Determined in about 20 minutes in 1009 with the click log relevant clicks are valid clicks.

如果基于客户机IP地址来确定无效点击,那么存在一些需要谨慎的点。 If based on the client IP address to identify invalid clicks, then there are some points need to be careful. 在客户机终端使用代理服务器或IP网关的情况中,存在一个危险,即使作弊器点击与其它的客户机终端相同的搜索项,它也可能被确定为一个无效点击。 In using a proxy server or IP gateway client terminal situation, there is a danger, even if cheating click to other client terminals same search term, it may also be identified as an invalid click. 因此,优选地,这个实施例与使用诸如会话标识符之类的其它参数的一个实施例一起联合构造。 Therefore, preferably, this embodiment is the use of such an embodiment of a session identifier such cases jointly with other configuration parameters.

相反地,存在这样一种情况,即点击相同搜索项的客户机终端的客户机1P地址是不同的,而它们的网络地址是相同的。 On the contrary, there is a case that the same client terminals click on a search term client 1P addresses are different, but their network addresses are the same. 这对应于这样一种情况, 即几个人不断地尝试用一个程序来不公平的点击一处或点击相同的搜索项,而同时改变它们的源IP地址。 This corresponds to a case that a few people continue to try to use a program to unfair one click or click on the same search term, while changing their source IP address. 在这种情况下,如果点击相同搜索项的客户机终端的网络地址是相同的并且其它情况(例如,在搜索项所属的目录内,点击量大于平均点击量的情况)被满足,则这可以被确定是一个无效点击。 In this case, if you click on the network address of the client terminal is the same search terms (the case, for example, belongs in the directory search term, click the greater than average amount of traffic) is the same and other conditions are met, then this can be It is determined to be an invalid click.

图11是一个根据本发明实施例来检测无效点击的方法流程图。 Figure 11 is an embodiment according to the present invention a method for detecting invalid clicks flowchart. 因特网搜索服务器104从搜索器接收搜索请求(步骤1101)并且产生一 The Internet search server 104 receives a search request (step 1101) from the Finder and generating a

个搜索结果网页(步骤1102)。 Search results page (step 1102).

因特网搜索服务器104确定包括终端标识符在内的用户cookie文件是否被存储在终端中(步骤1103)。 Internet search server 104 determines whether the terminal identifier comprises a user cookie file is stored in the terminal (step 1103).

由于步骤1103中的确定结果,如果包括终端标识符在内的用户cookie 文件没有被存储在终端中,则因特网搜索服务器104产生一个终端标识符(步骤1104)。 Since the result of the determination in step 1103, if the user cookie file including the terminal identifier is not stored in the terminal, the Internet search server 104 generates a terminal identifier (step 1104).

因特网搜索服务器104产生包括终端标识符在内的用户cookie文件并把它存储在搜索器终端中(步骤1105)。 The Internet search server 104 generates the user cookie file including the terminal identifier and store it in the searcher's terminal (step 1105).

由于步骤1103中的确定结果,如果包括终端标识符在内的用户cookie 文件被存储在终端中,则因特网搜索服务器104从用户cookie文件中提取终端标识符(步骤1106)。 Since the result of the determination in step 1103, if the user cookie file including the terminal identifier is stored in the terminal, the Internet search server 104 extracts the terminal identifier from the user cookie file (step 1106).

因特网搜索服务器104从搜索器接收包括在搜索结果网页内的搜索项的点击(步骤1107),然后获取一个对应于被点击搜索项的站点标识符(步骤1108)。 The Internet search server 104 receives from the searcher is included in the search results page click on the search term (step 1107), and then obtain a search term corresponding to the click site identifier (step 1108).

最后,在步骤1109中,用于确定如果无效点击的设备确定终端标识符和站点标识符与与预定时段内其它点击有关的终端标识符和站点标识符一致,则该点击是无效的。 Finally, in step 1109, to determine if the invalid clicks OK terminal identifier and a site identifier and the other within a predetermined period click the relevant terminal identifier and site identifier consistent with the click is invalid.

根据这个实施例,即使客户机终端使用一个代理服务器或IP网关,也有可能用终端标识符来判别客户机的终端。 According to this embodiment, even when the client terminal uses a proxy server or an IP gateway, it is possible to use a terminal identifier to determine the client's terminal. 从而,即使不同的客户机终端使用代理服务器或IP网关,也可能正确地识别来自于不同客户机的点击。 Thus, even if different client terminals using a proxy server or IP gateway, it may correctly recognize clicks from different clients. 在本发明的另一个实施例中,如果对于包括在由因特网搜索引擎提供的搜索结果网页内的搜索项,预定时段内每个搜索项的搜索器的点击量大于属于搜索项所属类别的搜索项的平均点击量,则它被认为是一个无效点击并从而将其报告给管理员。 In another embodiment of the present invention, if the search term for belonging to the category for the search term is included in the provided by an Internet search engine search results page search term, within a predetermined period searcher clicks per search item is greater than The average amount of traffic, it is considered to be an invalid click and thus its report to the administrator.

根据本实施例的用于检测无效点击的设备包括点击计数器装置,用于针对包括在由因特网搜索引擎提供的搜索结果网页内的搜索项计数预定时段内每个搜索项的搜索器点击量,,平均点击量计算装置,用于计算预定时段内属于.搜索项所属类别的搜索项的平均点击量,和决定装置,用于确定每个搜索项的点击量是否比平均点击量大一个预定的差。 According to the present embodiment is used to detect invalid clicks include hit counter means for the searcher clicks a predetermined time period for each search term includes a search term in the count provided by an Internet search engine search results page ,, The average traffic calculation means for calculating a predetermined period of time belongs to the average amount of traffic a search term category for the search term, and the decision means for determining whether each of your search terms greater than the average number of clicks a predetermined difference . 如果每个搜索项的点击量比平均点击量大一个预定的差,则这个事实经由无效点击报告单元308被报告给管理员。 If you click on greater than average number of clicks per search item is a predetermined difference, then this fact via invalid click report unit 308 is reported to the administrator.

根据本发明的另一个实施例,针对包括在由因特网搜索引擎提供的搜索结果网页内的搜索项,在预定时间段内,将每个搜索项的搜索器的点击量与预定时段内在搜索结果网页中的位于搜索项上端的搜索项预定第一数量和位于搜索项下端的搜索项的预定第二数量的平均点击量相比较。 According to another embodiment of the present invention, for included in the provided by an Internet search engine search results page the search term, in a predetermined period of time, the searcher clicks each search term with a predetermined period of internal search results page The upper end of the search term in the search term predetermined first number in the search term and the lower end of a predetermined number of second average amount of traffic a search term compared. 例如,在相同的周期中,特殊的搜索项的点击量与紧接位于特殊搜索项上的两个搜索项和紧接位于特殊搜索项下的两个搜索项的点击量相比较。 For example, in the same period, the special search term hits two search terms and compared with the immediately preceding search terms located on particular traffic immediately located two search terms under specific search terms. 作为比较的结果,如果特殊搜索项的点击量比围绕其它搜索项的点击量大5倍, As a result of the comparison, if a particular search term Click Click greater than around five times the other search terms,

则它是无效点击的可能性很高并且从而同样地被报告给管理员。 It is a high possibility of invalid clicks and thus likewise be reported to the administrator.

用于确定无效点击的各种方法已经在上面被说明。 For determining invalid clicks various methods have been described above. 用于确定无效点击的方法可以被独立地使用或者可以与用于确定无效点击的方法联合使用。 The method for determining invalid clicks may be used independently or may be used to determine the method used in combination invalid clicks.

例如, 一个规则可以被存储在无效点击模型存储单元306中,其中,对应 For example, a rule may be stored in the invalid click pattern storage unit 306, which corresponds to

于搜索项的客户机IP地址、页面标识符和站点标识符在从搜索项的最后点 To search for items on the client IP address, the page identifier and a site identifier from the last point of the search terms

击开始的5分钟内被重复的情况是无效的。 Be repeated within five minutes the situation began to strike is invalid.

在本发明中,因特网搜索服务器和用于识别不公平点击的设备已经被混乱地描述为单个单元。 In the present invention, the Internet search server and used to identify unfair clicked devices have been described chaos as a single unit. 然而,根据本发明的另一个实施例,应当注意它们可以根据它们的功能被分开执行并且可以由不同的管理员来管理。 However, according to another embodiment of the present invention, it should be noted that they can be separately performed according to their functions and can be managed by different administrators.

此外,在本发明中,被显示并被描述为分开元件的元件可以物理上被创建在单个系统中并且可以物理上被创建在一个单独的系统中。 Further, in the present invention, it is shown and described as separate elements elements may be physically created and may be created in a physically separate system in a single system.

22此外,尽管几个实施例已经在本发明中被说明,对于所属领域技术人员来说显而易见的是,多个实施例的一部分或剩余的实施例也属于本发明的精神。 22 In addition, although a few embodiments have been described in the present invention, the apparent to the skilled artisan that the remaining part or a plurality of embodiments are also within the spirit of the embodiments of the present invention.

另外,本发明的实施例还涉及包括用于执纟亍不同的计算机执行操作的程序指令的计算机可读媒介。 Further, embodiments of the present invention further relates Si right foot for performing different operations of the computer to execute computer program instructions readable medium. 该媒介还可以单独(或与程序指令相结合)包括数据文件、数据结构、数据表等等。 The media also can be used alone (or with a combination of program instructions), including data files, data structures, data tables, and so on. 媒介和程序指令可以被特别ftfe设计并构造以用于本发明目的,或它们可能是众所周知的类型并是计算机软件领域的技术人员可用的。 Media and program instructions may be specifically designed and constructed to ftfe for the purposes of the present invention, or they may be well-known type and is skilled in the art of computer software available. 计算机可读媒介的例子包括诸如硬盘、软盘和磁带之类的磁性媒介;诸如CD-ROM磁盘之类的光媒介;诸如可光读磁盘之类的磁光媒介;.和被特别配置来存储和执行程序指令的硬件装置,比如只读存储器装置(ROM)和随机存取存储器(RAM)。 Examples include computer-readable media such as hard disks, floppy disks and magnetic media like tape; optical media such as CD-ROM disks and the like; magneto-optical media such as optically readable disks and the like;. And that are specially configured to store and Hardware devices perform program instructions, such as read-only memory device (ROM) and random access memory (RAM). 媒介还可能是诸如光或金 Media may also be such as a light or gold

属线路、导波器等等之类的传输媒介,包括发射规定程序指令、数据结构等等的信号的载波。 Genus lines, wave guides, etc. like transmission medium, comprising a carrier transmitting a predetermined program instructions, data structures, etc. of the signal. 程序指令的例子包括两个诸如由编译器产生的之类的 Examples of program instructions include two by the compiler, such as the type of

机器代码,和包括可以由计算机使用解释器来执行的高级代码在内的文件。 Machine code, and includes an interpreter can be used by a computer to perform a higher level code, including file.

图12是一个说明通用计算机系统的结构的框图,该系统可用于创立搜索引擎服务器和用于根据本发明检测无效点击的设备。 FIG. 12 is a block diagram illustrating the structure of a general purpose computer system, the system can be used for the creation of a search engine server and according to the present invention detects invalid clicks.

计算机系统包括任意数量的处理器1240(也被称为中央处理器或CPUs),它们被耦合到包括主存储器1260(—般来说是随机存取存储器或"RAM")、主存储器1270(—般来说是只读存储器或"ROM")的存储装置。 The computer system includes any number of processors 1240 (also referred to as central processors or CPUs), which includes a main memory coupled to 1260 (- as it is a random access memory, or "RAM"), a main memory 1270 (- Generally a memory or "ROM") read-only memory device. 在本领域中众所周知的是,主存储器1260把数据和指令单向传送到CPU, 并且主存储器1260—般被用来以双向方式传送数据和指令。 Is well known in the art, the main memory 1260 one-way transmission of data and commands to CPU, and is used as the main memory 1260- bidirectional transmission of data and instructions. 这两个主存储器装置都可以包括如上所述的任何适当的类型的计算机可读媒介。 Both primary storage devices may include any suitable type of computer-readable media described above. 大容量存储装置1210还被双向耦合到CPU1240和提供附加的数据存储量并且可以包括如上所述的任何计算机可读媒介。 1210 was also a two-way mass storage device coupled to CPU1240 and provides additional data storage capacity and may include any computer-readable media described above. 大容量存储装置1210可以被用来存储程序、数据等等,并且一般是一个诸如比主存储器慢的硬盘之类的辅助存储器媒介。 The mass storage device 1210 may be used to store programs, data and the like, and is generally slower than primary storage such as a hard disk or the like of the secondary storage medium. 诸如光盘1220之类的特殊大容量存储装置还可以把数据单向传递给CPU。 Such as special large-capacity storage device like CD 1220 can also be a one-way data transfer to the CPU. 处理器1240还被耦合到一个接口1230,其包括一个或多个输入输出设备,比如视频监视器、跟踪球、鼠标、键盘、扩音器、 触控式显示器、换能器读卡机、磁或纸带读取器、写字板、触针、音频或手写识别器或诸如当然包括其它计算机之类的其它众所周知的输入装置。 Processor 1240 is also coupled to an interface 1230 that includes one or more input and output devices, such as video monitors, track balls, mice, keyboards, microphones, touch-sensitive displays, transducer card readers, magnetic or paper tape reader, tablet, stylus, audio, or handwriting recognition, or other input devices such as, of course include other well-known computer and the like.

最后,如通常在1250所示,处理器1240可以选择性地使用网络连接被耦合到计算机或电信网。 Finally, as is often using a network connection can be selectively coupled to a computer or telecommunications network in 1250, the processor 1240. 有了这类网络连接,CPU可以在执行上述方法步骤 With such a network connection, CPU can step in the implementation of the method

的过程中从网络接收信息或者可以向网络输出信息是可期望的。 The process of receiving information from a network or network information can be output to be expected. 上述装置和材料对于计算机硬件和软件领域中的技术人员来说是很熟悉的。 The above-mentioned devices and materials for computer hardware and software in the field of art is very familiar with.

如上所述的硬件元件可以被配置(一般暂时)来充当一个或多个执行本发明操作的软件模块。 Hardware elements described above may be configured (usually temporarily) to act as one or more execution operations of this invention software modules. 工业实用性 Industrial Applicability

根据上述的本发明, 一个用于检测包括在由因特网搜索引擎服务器提供的搜索结果网页内的搜索项的无效点击的方法和设备被提供。 According to the present invention described above, a method and apparatus for detecting includes invalid in provided by an Internet search engine server search results pages of search terms clicks are available.

根据本发明, 一个用于检测无效点击的方法和设备,其可以检测各种不正当地增加搜索项点击量的尝试,并且立即处理这些尝试。 According to the present invention, a method and apparatus for detecting invalid clicks, which can detect a variety of improper search term increase traffic to try and immediately deal with these attempts. 即,如果新模型的不公平的点击尝试被发现,则该模型或规则被存储在一个根据本发明的无效点击模型存储单元中。 That is, if unfair clicks to try new model is found, then the model or rule is stored in a accordance with the present invention invalid click pattern storage unit. 从而,立即处理这个遵循新模型的不公平点击尝试是可能的。 Accordingly, immediately following the new model to deal with this unfair clicking attempt is possible.

此外,根据本发明提供了一个用于检测无效点击的方法和设备,其可以防止为了检测无效点击而提供的几个标识符被仿造或伪造。 Furthermore, according to the present invention provides a method and apparatus for detecting invalid clicks, which can prevent several identifiers provided in order to detect invalid clicks are fake or counterfeit.

尽管本发明已经关于附图中说明的本发明实施例而被说明,然而它并没有被限制在其中,因为对于所属领域技术人员来说,显然可以在其中做出不同的置换、修改和改变。 Although the embodiments of the present invention, the present invention has been illustrated in the drawings and are described, but it is not limited and in which, as those of ordinary skill in the art, can be made clearly different substitutions, modifications and changes therein. 本发明的范围由附加的权利要求来定义。 The scope of the invention defined by the appended claims. 所有在权利要求的意义和范围内做出的改变或修改或其等效物应该被看作是属于本发明的范围。 All within the meaning and scope of the claims made to change or modify or its equivalent should be seen as part of this invention.

Patentzitate
Zitiertes PatentEingetragen Veröffentlichungsdatum Antragsteller Titel
US626936128. Mai 199931. Juli 2001Goto.ComSystem and method for influencing a position on a search result list generated by a computer network search engine
Klassifizierungen
Internationale KlassifikationG06F17/30
UnternehmensklassifikationG06F17/30887, G06F17/30867
Europäische KlassifikationG06F17/30W1F, G06F17/30W5L
Juristische Ereignisse
DatumCodeEreignisBeschreibung
19. Apr. 2006C06Publication
7. Juni 2006C10Request of examination as to substance
26. Aug. 2009C14Granted
30. Dez. 2009C56Change in the name or address of the patentee
Owner name: NHN BUSINESS PLATFORM CO., LTD.)
Free format text: FORMER NAME: NHN CO., LTD.
3. Dez. 2014ASSSuccession or assignment of patent right
Owner name: NABAO CO., LTD.
Free format text: FORMER OWNER: NHN CORP.
Effective date: 20141114
3. Dez. 2014C41Transfer of the right of patent application or the patent right