CN100533434C - Method and apparatus for detecting invalid clicks on the internet search engine - Google Patents

Method and apparatus for detecting invalid clicks on the internet search engine Download PDF

Info

Publication number
CN100533434C
CN100533434C CNB200480007418XA CN200480007418A CN100533434C CN 100533434 C CN100533434 C CN 100533434C CN B200480007418X A CNB200480007418X A CN B200480007418XA CN 200480007418 A CN200480007418 A CN 200480007418A CN 100533434 C CN100533434 C CN 100533434C
Authority
CN
China
Prior art keywords
click
session
search
identifier
terminal
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Expired - Lifetime
Application number
CNB200480007418XA
Other languages
Chinese (zh)
Other versions
CN1761961A (en
Inventor
河定秀
姜锡昊
李宇晟
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Naver Corp
Original Assignee
NHN Corp
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by NHN Corp filed Critical NHN Corp
Publication of CN1761961A publication Critical patent/CN1761961A/en
Application granted granted Critical
Publication of CN100533434C publication Critical patent/CN100533434C/en
Anticipated expiration legal-status Critical
Expired - Lifetime legal-status Critical Current

Links

Images

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/90Details of database functions independent of the retrieved data types
    • G06F16/95Retrieval from the web
    • G06F16/953Querying, e.g. by the use of web search engines
    • G06F16/9535Search customisation based on user profiles and personalisation
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/90Details of database functions independent of the retrieved data types
    • G06F16/95Retrieval from the web
    • G06F16/955Retrieval from the web using information identifiers, e.g. uniform resource locators [URL]
    • G06F16/9566URL specific, e.g. using aliases, detecting broken or misspelled links

Abstract

The present invention relates to an Internet search engine server. More particularly, the present invention relates to a method and apparatus for detecting invalid clicks for a search item included in a search result web page that is provided by an Internet search engine server. The present invention relates to a method for detecting invalid clicks in an Internet search engine, comprising the steps of generating a search result web page in response to a search request from a searcher; acquiring a page identifier corresponding to the generated web page; receiving a click for a search item included in the search result web page from the searcher; acquiring a site identifier corresponding to the clicked search item; and if the page identifier and the site identifier are coincident with a page identifier and a site identifier associated with other clicks within a predetermined time interval, determining the click to be invalid. According to the present invention, a method and apparatus for detecting invalid clicks that detects a variety of attempts to unduly increase the number of clicks for a search item and immediately coping with these attempts are provided.

Description

On internet search engine, detect the method and apparatus of invalid clicks
Technical field
The present invention relates to Internet search engine server.More particularly, the present invention relates to be used to detect the method and apparatus of the invalid clicks of search terms, search terms is included in the search result web page that is provided by Internet search engine server.In addition, the present invention relates to be used to detect the method and apparatus of invalid clicks, it can detect the various trials that increase the search terms click volume partially and can deal with these trials immediately.
Background technology
Along with the use of the Internet more and more widely, such as can increasing via the quantity of the information source the webpage of access to the Internet with arithmetical series.In addition, for discovery information among the bulk information source, the Internet search engine server of searcher visit such as NAVER, Yahoo and Lycos is with the request search.The Internet search service provider produces a search result web page that comprises search terms, and it comprises the relevant information of importing with searcher of search, and the search result web page of generation is provided to searcher then.For example, when searcher visit NAVER search engine server then during inputted search word " Digital Camera (digital camera) ", search result web page as shown in Figure 2.Each that is included in the search result web page is all relevant with URL (URL(uniform resource locator)).
Because the quantity of the search terms relevant with single search is countless, yet how the countless search terms of this class shows on search result web page and what shows that in proper order to the Internet search service provider be a very important problem with.The Internet search service provider is by determining the order of listing of search terms in conjunction with several standards.One of them standard that has been widely used is the click volume of user to special search terms.For example, if the user is very big to the click volume of a search terms, then this search terms is displayed on search result web page relatively by last part.Even the Internet search service provider by determine in conjunction with a plurality of parameters search terms list the order situation in, if one of them parameter is user's click volume, the search terms that then has very high click volume is displayed on the relative to last part of search result web page.
In addition, the search result web page that the Internet search server produces is shown highly more, and the possibility that the user might click and visit this webpage is just big more.Thereby the network information provider of web server wants a search terms own relevant with he (she) to be presented at the top of search result web page.Because this reason, for his the Webpage search item of (she) being presented at the top of search result web page, network information provider intentionally access the Internet search server repeatedly clicks the search terms of his (she) own webpage.Sometimes, network information provider can constantly click the search terms of webpage of he (she) with a special program.Because clicking, the inequitable search terms of this class do not reflect real user search result, so the Internet search service provider must detect the invalid click of this class.
Have this class service in the prior art, wherein, the network information provider relevant with search terms charged based on the click volume of each search terms in the search result web page.Internet search service provider Overture Services, Inc. (U.S.A) provide this class service, and wherein, when the search terms in the searcher click search result web page relevant with network information provider, the payment of network information provider is each to be clicked.In this case, if searcher is deliberately repeatedly clicked a special search terms, then relevant with search terms network information provider must pay extra-pay.Therefore, even also must detect invalid clicks in this case, it is intended that and only increases click volume and in fact search terms is not searched for.
Summary of the invention
The present invention is provided to solve the problem in the above-mentioned prior art.An object of the present invention is to provide the method and apparatus of the invalid clicks that is used to detect search terms, search terms is included in the search result web page that is provided by Internet search engine server.
Another object of the present invention provides the method and apparatus that is used to detect invalid clicks, and it can detect the various trials of the click volume of improper increase search terms, and can deal with these trials immediately.
Another object of the present invention provides a method and apparatus that is used to detect invalid clicks, and wherein, the several identifiers that provide in order to detect invalid clicks are difficult to be copied or forge.
In order to achieve the above object and solve the problems referred to above in the prior art, the invention provides a method that in internet search engine, detects invalid clicks, comprise the following steps: to produce a search result web page in response to searching request from searcher, obtain one corresponding to the page identifier that is produced webpage, receive the click that is included in the search terms in the search result web page from searcher, obtain a site identifier corresponding to clicked search terms, if and page identifier and site identifier click relevant page identifier and site identifier unanimity with in scheduled time slot other, determine that then this click is invalid.
A method that is used for detecting at internet search engine invalid clicks is provided according to aspects of the present invention, comprise the following steps: to produce a search result web page in response to searching request from searcher, obtain an interior Session ID of session cookie file that is included in the searcher terminal storage, receiving a search terms that is included in the search result web page from searcher clicks, obtain a site identifier corresponding to clicked search terms, if and Session ID and site identifier and scheduled time slot interior Session ID and the site identifier unanimity relevant with other click, determine that then this click is invalid.
A method that is used for detecting at internet search engine invalid clicks is provided according to aspects of the present invention, comprise the following steps: to receive the click that is included in the search terms in the search result web page from searcher, obtain a client computer IP address corresponding to the searcher terminal, obtain a site identifier corresponding to clicked search terms, if and client computer IP address and site identifier and interior relevant client computer IP address and the site identifier unanimity of other click of scheduled time slot, determine that then this click is invalid.
A method that is used for detecting at internet search engine invalid clicks is provided according to aspects of the present invention, comprise the following steps: to produce a search result web page in response to searching request from searcher, obtain a terminal identifier corresponding to the searcher terminal, producing user's cookie file that comprises terminal identifier is stored in user's cookie file in the searcher terminal then, receiving a search terms that is included in the search result web page from searcher clicks, obtain a site identifier corresponding to clicked search terms, if and terminal identifier and site identifier and interior relevant terminal identifier and the site identifier unanimity of other click of scheduled time slot, determine that then this click is invalid.
An equipment that is used to detect invalid clicks is provided according to another aspect of the present invention, wherein, if searcher is provided by the search terms that is included in the search result web page that is provided by internet search engine, the IP address of searcher terminal at least then, the network address under the searcher terminal, the search relevant with search result web page, the relevant information of the web browser of searcher, with the click and the information-related click time of cookie file that are stored in the searcher terminal, one of them of the URL information relevant with search terms is received, and is received the predetermined standard (reference) of information based on a basis and determines whether this click is invalid.
An equipment that is used to detect invalid clicks is provided according to another aspect of the present invention, comprise (1) log store unit, search terms that is included in the search result web page that is provided by internet search engine is provided in response to searcher for it, store one at least with following two relevant daily records: the IP address of searcher terminal, the network address under the searcher terminal, the search relevant with search result web page, the relevant information of the web browser of searcher, the click time relevant with click, be stored in cookie file information and the URL information relevant in the searcher terminal with search terms, (2) invalid clicks model storage unit, its the storage with following at least in two relevant invalid clicks models: the IP address of searcher terminal, the network address under the searcher terminal, the search relevant with search result web page, the relevant information of the web browser of searcher, the click time relevant with click, be stored in the cookie file information in the searcher terminal, with the URL information relevant with search terms, (3) invalid clicks decision unit, it determines based on the invalid clicks model of storing in stored log in the log store unit and the invalid clicks model storage unit whether search is clicked is an invalid clicks.
An equipment that is used to detect invalid clicks is provided according to another aspect of the present invention, comprise a click-through count apparatus, be used at the search terms that is included in the search result web page that provides by internet search engine, the searcher click amount of each search terms in the counting scheduled time slot, the average device for calculating of clicking, be used in scheduled time slot calculating the average click volume that belongs to the search terms of classification under the search terms, with a determination device, the click volume that is used for determining each search terms whether than average click volume big one predetermined poor.
An equipment that is used to detect invalid clicks is provided according to another aspect of the present invention, comprise a click-through count apparatus, be used at the search terms that is included in the search result web page that provides by internet search engine, the searcher click amount of each search terms in the counting scheduled time slot, the average device for calculating of clicking, be used in scheduled time slot, calculating predetermined first quantity of the search terms that is positioned at the search terms higher-end and being positioned at the average click volume of search terms than predetermined second quantity of the Search Results of low side in search result web page, and determination device, the click volume that is used for determining each search terms whether than average click volume big one predetermined poor.
Invalid clicks is difficult to accurately definition, and the scope of invalid clicks should depend on embodiment and should be used for differently defining.Yet it is not the click that purpose is made with the actual search that invalid clicks may refer to only to increase click volume.
Description of drawings
Fig. 1 is a synoptic diagram, illustrates that network of Internet search server connects, and comprises being used to detect the equipment of invalid clicks and according to client terminal of the present invention.
Fig. 2 is the synoptic diagram of an explanation by the search result web page of internet search engine generation.
Fig. 3 is an explanation detects the device structure of invalid clicks according to the embodiment of the invention a block diagram.
Fig. 4 is a method flow diagram that detects invalid clicks according to the embodiment of the invention.
Fig. 5 has shown the journal file according to the example of the embodiment of the invention.
Fig. 6 a and 6b are method flow diagrams that detects invalid clicks according to the embodiment of the invention.
Fig. 7 has shown a journal file according to the example of the embodiment of the invention.
Fig. 8 is a method flow diagram that produces Session ID according to the embodiment of the invention.
Fig. 9 is a method flow diagram that detects invalid clicks according to the embodiment of the invention.
Figure 10 has shown a journal file according to the example of the embodiment of the invention.
Figure 11 is a method flow diagram that detects invalid clicks according to the embodiment of the invention.
Figure 12 is the block diagram of the structure of an explanation general-purpose computing system, and this system can be used for founding a search engine server and an equipment that is used for detecting according to the present invention invalid clicks.
Embodiment
Hereinafter, the preferred embodiments of the present invention will be described in detail with reference to the accompanying drawings.
Fig. 1 is a synoptic diagram, illustrates to comprise that the equipment that is used to detect invalid clicks is connected with network according to the Internet search server of client terminal of the present invention.
Searcher or the cheating device of attempting unfair click visit Internet search server 104 via the client terminal 101 that is connected to the Internet 103.The cheating device increases click volume by the search terms of repeatedly clicking in the search result web page that is provided by Internet search server 104.For example in Fig. 2, suppose that search terms 202 is search termses relevant with http://www.invalidclick.com, and the cheating device is constantly clicked search terms 202 so that search terms 202 is displayed on the top of search result web page.
When client terminal 101 was connected to search engine server 104 or other website, cookie file 102 was the special texts in the hard disk that is stored in client terminal 101 by search engine server 104 or other website.At the http protocol that is used for connecting website, each is all irrelevant with other request to the request of webpage.Therefore, the webserver is the such information of tool not, and promptly which page before had been sent to client terminal 101 or what work client terminal 101 had before carried out.Therefore, for association each request of independent processing like this, a cookie file is provided.This class cookie file service allows the webserver that user profile is stored in the user's computer.In order to detect invalid clicks in the present invention, even can use several cookie files.This will be described in detail in the back.
Journal file 105 is one and is used to store the file of clicking the relevant several daily records of model with the user.In the present invention, use Several Parameters in order to detect invalid clicks.After the parameter relevant with each click is stored in the journal file, determine based on predetermined rule and model whether the input click is invalid.
According to the example of the journal file of the embodiment of the invention as shown in Fig. 5,7 and 10.
Fig. 3 is an explanation detects the device structure of invalid clicks according to the embodiment of the invention a block diagram.
The equipment that detects invalid clicks 301 according to the embodiment of the invention comprises parameter input unit 304, log store unit 305, invalid clicks model storage unit 306, invalid clicks authentication unit 307, invalid clicks report unit 308 and invalid clicks decision unit 309.
If searcher is provided by the search terms that is included in the search result web page that is provided by internet search engine, then relevant with this click Several Parameters 302 is imported into parameter input unit 304.These parameters are the essential informations that are used for determining invalid clicks, and comprise the web browser of the IP address, the network address under the searcher terminal of searcher terminal, the search relevant, searcher with search result web page relevant information, click time relevant with click, be stored in the cookie file information in the searcher terminal, URL information relevant or the like with search terms.
If searcher is to search of Internet search engine server 104 requests, then the searching request grouping is passed to Internet search engine server 104 from client terminal 101.Searching request grouping comprises one according to the packet configuration of http protocol and be comprised in the Internet (IP: Internet protocol) in the grouping.Because the source IP address field is included in the configuration of Internet protocol packets,, thereby extract the IP address of searcher terminal so Internet search engine server 104 is extracted a source IP address from clicking the searching request grouping of being asked.
The forward part of source IP address is the network address under the searcher terminal.The IP address is made up of 4 bytes.The forward part of IP address is a network address that is used to discern searcher terminal belonging network, and its remainder is the address that is used for the searcher terminal in the recognition network.Therefore, the network address is extracted from source IP address.According to embodiments of the invention, 3 bytes of IP address forward part be considered to a network address and this network address obtained from source IP address.For example, if source IP address is 123.45.67.89, then 123.45.67 is extracted as a network address.
The search relevant with search result web page is a value by searcher inputting Internet search server 104.
The relevant information of the web browser of searcher is the information on the web browser, and described web browser is loaded in the client terminal 101 of searcher and is used to access the Internet search server 104.The relevant information of web browser comprises the type of web browser, the version of web browser, product IDs of web browser or the like.Especially, even when a plurality of searchers have the web browser of same type and identical version, the product IDs of their web browser also may be different.Thereby it has become the useful information that is used to discern a searcher terminal.
According to the http protocol that is used to be connected to network, a part of environmental parameter of client computer is included in the HTTP grouping and is sent to the webserver.The program of the webserver (search engine program) can the reception environment parameter and can be used these parameters to detect invalid clicks.
This class environmental parameter comprises following message:
REMOTE_HOST: the person's of being connected domain name
REMOTE_ADDR: the IP address that is connected Client Host
REMOTE_USER: the person's of being connected name (be provided with at the webserver under the situation of user rs authentication and show)
REMOTE_USER: the person's of being connected ID (be provided with at the webserver under the situation of user rs authentication and be shown)
HTTP_USER_AGENT: in general the related registration information of the program that the person of being connected drives is the title of browser
HTTP_ACCEPT_LANGUAGE: the language that the person of being connected uses
HTTP_REFERER: the document title of calling out corresponding cgi script
REQUEST_METHOD: to the method for Server Transport data (GET, POST)
QUERY_STRING: when data send with the GET pattern, send the parameter that is stored of data
CONTENT_LENGTH: when data are sent out with the POST pattern, be launched the total length (byte number) of data
CONTENT_TYPE: when data are launched with the POST pattern, the mime type of data
AUTH_TYPE: the parameter that is used to confirm subscriber authorisation
SERVER_NAME: the domain name of current server
SERVER_SOFTWARE: the current title that is installed in the network server program on the server
SERVER_PROTOCOL: the title of the procotol of the current use of server and version
SERVER_PORT: the presently used port number of server (generally being 80 under the situation of HTTP)
PATH_INFO: the information of the current path of called cgi script
PATH_RANSLATED: the relevant information of the current resource path in the webserver of network requirement
SCRIPT_NAME: current just in the title of called cgi script
HTTP_ACCEPT: the type of the current resource that can receive with HTTP
The click time relevant with the click of searcher is to import the received time from the click of searcher.According to another embodiment of the invention, relevantly with the click time of searcher can be used At All Other Times.For example, can use searcher in fact will click the time of input client computer.
The information that is stored on the cookie file in the searcher terminal is obtained by Internet search server 104, and wherein 104 visits of Internet search server are stored in the cookie file 102 in the client terminal 101.In the present invention, cookie file 102 can be used to multiple use.This will be described in detail with reference to other embodiment.
The relevant URL information of the search terms of clicking with searcher can obtain by consulting search database, because it is stored in the search database (not shown) relevant with search engine server 104.URL information can be the domain name of the webserver or the information that comprises domain name, catalogue and filename.For example, http://www.naver.com is identical with http://www.naver.com/download, because they are the www.naver.com in view of domain name, but has different URL.In the present invention, use URL and for the purpose of explaining, be illustrated to the embodiment of domain name.Yet the present invention has covered all embodiment, and wherein, if having a different catalogue (because they comprised that domain name, catalogue and filename are whole) although its domain name of URL is identical, URL is considered to different search termses.Should be appreciated that in addition that in the present invention URL information comprises all embodiment according to this instructions.
In addition, except above-mentioned parameter, in spirit of the present invention, other parameter that is used to detect invalid clicks also can be used to detect invalid clicks.
The parameter 302 of mentioned kind is imported into parameter input unit 304.These parameters are stored in again in the log store unit 305.According to the present invention, the example that is stored in the daily record in the log store unit is as shown in Fig. 5,7 and 10.In these accompanying drawings, the daily record that includes only a part of parameter is shown to be used for explanation.Yet, according to another embodiment of the invention, comprise that the daily record of all or part of parameter 302 can be stored in the log store unit 305.
According to one embodiment of present invention, log store unit 305 is stored therein about following at least two daily record: the relevant information of the web browser of the network address under the IP address of searcher terminal, searcher terminal, the search relevant, searcher, the click time relevant with click with search result web page, be stored in the searcher terminal cookie file information and with search terms relevant URL information.According to a preferred embodiment of the present invention, log store unit 305 is stored one therein about following at least one daily record: the relevant information of the web browser of the network address under the IP address of searcher terminal, the searcher terminal, the search relevant, searcher, the click time relevant with click with search result web page, be stored in the searcher terminal cookie file information and with search terms relevant URL information.
Invalid clicks pattern storage unit 306 store therein one with following at least two a pair of relevant invalid clicks model or rule: the relevant information of the web browser of the network address under the IP address of searcher terminal, searcher terminal, the search relevant, searcher, the click time relevant with click with search result web page, be stored in the searcher terminal cookie file information and with search terms relevant URL information.For example, consistent each other rule or model can be stored in the invalid clicks model storage unit 306 in the IP address of searcher terminal and the URL information relevant with the search terms click input in 10 minutes.Similarly, rule that is stored in invalid clicks model storage unit 306 that is used for definite invalid clicks or the like can be stored with the form of file, and this document uses the scheduled instruction according to pre-defined rule.Perhaps, under the situation of above-mentioned rule or model, it can be stored with the form of program so that it is determined is an invalid clicks.
Invalid clicks decision unit 309 determines based on the invalid clicks model of storage in stored log in the log store unit 305 and the invalid clicks model storage unit 306 whether searcher is clicked is invalid clicks.
Invalid clicks is reported keeper 303 report and point hit the preassigned consistent click of unit 308 to internet search engine, and it is disabled to click and determines that unit 309 is definite invalid.According to one embodiment of present invention, invalid clicks report unit 308 reports that to the keeper of internet search engine all are disabled click decision unit 309 and are defined as invalid click.In this case, preassigned is to be disabled to click to determine that unit 309 is defined as all invalid clicks.According to another embodiment of the invention, indicate whether to be stored in each rule or model that stores in the invalid clicks model storage unit 306 to the field of keeper's 303 reports corresponding to the situation of rule or model.In this case, under the situation of rule that must be notified corresponding to keeper 303, invalid clicks report unit 308 reports to keeper 303 with it.
Invalid clicks authentication unit 307 allows keepers 303 to determine unit 309 to be defined as invalid click to change over effective click being disabled to click.Because invalid clicks authentication unit 307 can change over effective click to the click that is decided to be invalid clicks by mistake, so invalid clicks can be determined more accurately.
Fig. 4 is a method flow diagram that detects invalid clicks according to the embodiment of the invention.
Internet search server 104 receives a searching request (step 401) from searcher.If searcher access the Internet search server 104 is the inputted search word then, then grouping is sent to Internet search server 104 to this search as searching request.
Internet search server 104 produces a search result web page (step 402) in response to this searching request.For example shown in Fig. 2, comprise that the search result web page of a plurality of search termses corresponding to searcher inputted search word is provided for searcher.
Page identifier corresponding to the search result web page that produces is acquired (step 403).When producing search result web page, just produce a page identifier.Page identifier is an identifier that is used to discern search result web page.Therefore, if identical searcher then all distributes a new page identifier by repeatedly import identical search in the search window of Internet search server 104 at every turn.Similarly, if " reload (reloading) " in the web browser of searcher click display of search results webpage, then Internet search server 104 distributes a new page identifier to search result web page, because the searching request grouping is sent to Internet search server 104 from client terminal 101.It is possible that different page identifiers is assigned at first sight identical search result web page.Yet if new searching request is received from client terminal 101, search result web page is produced at that time again.Thereby the search result web page that is different from previous search result web page can be provided.
In step 404, Internet search server 104 receives a click that is included in the search terms in the search result web page from searcher.Be received if click, then Internet search server 104 hyperlink that allows to be used for search terms connects Internet search server 104, allow Internet search server 104 to carry out necessary processing, allow the website of client terminal visit then corresponding to this search terms.For example, at http://www.naver.com/abc/*http: //www.invalidclick.com/ is prepared as under the situation corresponding to the search terms hyperlink of " http//www.invalidclick.com/ ", if searcher is clicked this search terms, then search is allowed to be called with visit the search server of http://www.naver.com.Search server allows client terminal to visit http://www.invalidclick.com according to the URL that is positioned at the hyperlink rear side.
Internet search server 104 obtains a site identifier (step 405) corresponding to clicked search terms.Site identifier is an identifier that is used to discern search terms, and produces based on the URL information corresponding to search terms.According to another embodiment of the invention, site identifier is used the former URL information corresponding to search terms.URL information as the essential information that produces site identifier can be the domain name of the webserver or the information that comprises domain name, catalogue and filename.For example, http://www.naver.com is identical with http://www.naver.com/download, because they all are www.naver.com from the viewpoint of domain name, but then inequality from the viewpoint of URL.In the present invention, one use URL and to the embodiment of domain name for convenience of explanation for the purpose of and be illustrated.Yet the present invention has covered all embodiment, and wherein, if having a different catalogue (because they have not only comprised domain name, but also having comprised catalogue and filename) although its domain name of URL is identical, URL is considered to different search termses.Should be appreciated that in addition that in the present invention URL information comprises all embodiment according to this instructions.
In step 406, if other in page identifier and site identifier and the scheduled time slot clicked relevant page identifier and site identifier unanimity, the equipment that then is used to detect invalid clicks determines that click is invalid.
Fig. 5 has shown the journal file according to the example of the embodiment of the invention.The embodiment of Fig. 4 will illustrate with reference to figure 5.
According to the present invention, whenever the click that receives a search terms from the user, page identifier 509 and site identifier 510 just are stored in the journal file 500.Reference numeral 501 to 508 is pointed out the daily record of stored each click input.
Cheating device access the Internet search server 104 is to ask a search.Internet search server 104 produces search result web page and produces a page identifier " nCe249sisnO " corresponding to search result web page.The cheating device is constantly clicked a specific search terms that is included in the search result web page.Even in case the specific search term in the search result web page that is produced is constantly clicked, page identifier can not produced again yet.Thereby page identifier has kept identical value.
Thereby in the click input journal in scheduled time slot, daily record 501, daily record 502 and the daily record 504 of determining to have identical page identifier and identical site identifier are invalid clicks.According to one embodiment of present invention, determine that in the consistent daily record is an invalid clicks, then remaining daily record is an invalid clicks.
The cheating device can upgrade search result web page by the " reload " that clicks in the web browser.In this case, page identifier is reallocated and is daily record 505 about the daily record of page identifier.Thereafter, the cheating device is clicked the situation of identical search term corresponding to daily record 506.
Therefore,, click identical search terms (under the situation of daily record 506) then if the cheating device is clicked " reloads " according to this embodiment, then it not to be determined be an invalid clicks.Similarly, be used for determining that " reload " is that the method for the situation of invalid clicks will be illustrated at the following example with reference to figure 6.
Fig. 6 a and 6b are method flow diagrams that detects invalid clicks according to the embodiment of the invention.
Internet search server 104 receives searching request (step 601) from searcher.Internet search server 104 produces a search result web page (step 602) in response to this searching request.
The equipment that is used for definite invalid clicks determines whether the session cookie file is stored in the client terminal 101 (step 603) of request search.Step 603 is processed to obtain a Session ID to step 611.
If determine that the session cookie file is not stored in the client terminal 101, be used for then determining that the equipment of invalid clicks produces a new Session ID (step 604).In step 605, comprise that the session cookie file of Session ID is stored in the client terminal 101.Also be stored in the session cookie file update time of Session ID.Be stored in update time in the session cookie file (step 609).
Be stored in step 602 in the client terminal 101 if determine the session cookie file, the equipment that then is used for determining invalid clicks determines to comprise that final updating time of Session ID of session cookie file is whether in scheduled time slot (step 606).
As the definite result in the step 606, if the final updating time that is included in the Session ID in the session cookie file in scheduled time slot, is used for then determining that the equipment of invalid clicks extracts a Session ID (step 607) that is included in the session cookie file.
As the definite result in the step 606,, be used for then determining that the equipment of invalid clicks produces a new Session ID (step 608) if comprise the final updating time that does not have the Session ID in the session cookie file not in scheduled time slot.The Session ID that is included in the session cookie file upgrades (step 610) with the Session ID of creating again.Be stored in the session cookie file (step 611) update time of Session ID.
Internet search server 104 receives a click (step 612) that is included in the search terms in the search result web page from searcher.
Internet search server 104 obtains a site identifier (step 613) corresponding to clicked search terms.
If Session ID and site identifier and click relevant Session ID and site identifier unanimity with other in scheduled time slot, the equipment that then is used to detect invalid clicks determines that this click is invalid clicks (step 614).
Fig. 7 has shown the journal file according to the example of the embodiment of the invention.
In this embodiment, whenever the click that receives a search terms from the user, the update time 711 of click time 710, Session ID, Session ID 712 and site identifier 713 are stored in the journal file 700.Reference numeral 701 to 708 is pointed out to click the input stored log corresponding to each.
Cheating device access the Internet search server 104 is to ask a searching request.Internet search server 104 produces a search result web page.Internet search server 104 receives a click that is included in the search terms that included in the search result web page.
Internet search server 104 determines whether the session cookie file is stored in the client terminal 101.If determine that the session cookie file is not stored in the client terminal 101, then Internet search server 104 produces a new Session ID, and with its update time with comprise that the session cookie file of Session ID is stored in the client terminal 101.In this embodiment, Session ID " xigw9492 " and update time " 10:50:14 " be recorded.In addition, click time, update time, Session ID and the site identifier corresponding to search terms is stored in the journal file 700 as daily record 701.Producing in the situation of session cookie file for the first time, clicking and Session ID as long as also produce at that time, the session cookie file is just produced.Thereby the click time is identical with Session ID update time.
The cheating device is clicked identical search terms in identical result of page searching.Internet search server 104 determines whether the session cookie file is stored in the client terminal 101.Because the session cookie file of above-mentioned generation has been stored in the client terminal 101,104 visits of Internet search server are stored in the session cookie file in the client terminal 101.The session cookie file is stored the final updating time of a Session ID and Session ID therein.In this embodiment, Session ID " xigw9492 " and update time " 10:50:14 " be stored in the session cookie file.
Internet search server 104 determine from click time of the search terms of searcher whether in the scheduled time slot that begins from the final updating time relevant with Session ID.In this embodiment, the second click time of clicking was " 10:50:18 ".If scheduled time slot is 5 seconds, then click time " 10:50:18 " in the scheduled time slot that begins from final updating time " 10:50:14 ".Similarly, in this case, be stored in the Session ID that Session ID in the session cookie file is used as a current Session ID and this session cookie file and be not updated.Thereby in this case, for example daily record 702 is recorded.
Thereby, determine that daily record 702 is invalid clicks, because it has Session ID and the site identifier identical with daily record 701.
Daily record 704 is corresponding to the situation of the device request " reload " that wherein practises fraud.Similarly, the result device request " reload " that practises fraud, make the standard of the session cookie file that is stored in the client terminal 101, and Session ID is not updated, because the final updating time in the session cookie file of being stored in is in scheduled time slot.Therefore, for example daily record 704 is recorded.Because it is the same with daily record 701, so determine that daily record 704 is invalid clicks.That is,, might detect the cheating device is clicked identical search terms after clicking " reload " in short time interval situation according to this embodiment.
Daily record 705 is corresponding to this situation, i.e. the click of identical search term is received from the searcher that is different from daily record 701, daily record 702 and daily record 704.In this case, because new Session ID is assigned with, so it is not confirmed as an invalid clicks.
Daily record 709 is corresponding to this situation, and promptly identical with daily record 701 searcher is clicked identical search terms after quite a lot of time.In this case, just be received after the long duration because click, so it is not confirmed as an invalid clicks.
According to this embodiment, the cheating device is clicked the situation of identical search terms after scheduled time slot, because a Session ID is produced, be an invalid clicks so it is determined.
Similarly, determine to determine that based on invalid clicks such situation may be an invalid clicks according to another embodiment of the invention, promptly make click in the scheduled time slot that begins in the last click time from identical search term.This will be illustrated simply.
Be received from searcher if click, determine then whether the session cookie file is stored in the terminal.Be stored in the terminal if determine the session cookie file, then determine from click time of the search terms of searcher whether in the scheduled time slot that begins from the last click time relevant with Session ID.
The interior Session ID of session cookie file is acquired and the time of click at last upgraded with the click time of search terms if the click time of determining search terms in scheduled time slot, then is included in.
If the click time of determining search terms, then new Session ID is produced with renewal was included in Session ID in the session cookie file not in scheduled time slot.In addition, the time of click at last upgraded with the click time of search terms.
For example in Fig. 7, come from existence under the situation of a plurality of clicks of identical search term of same client machine terminal, if determine from last click over and done with 5 seconds situation be effective, it is effectively that then relevant with daily record 704 click is determined, because made " 10:50:31 " behind 13 seconds of its last click time " 10:50:18 " formerly.
According to a preferred embodiment of the invention, time reference decides according to the testing goal of invalid clicks.
Fig. 8 is a method flow diagram that produces Session ID according to the embodiment of the invention.
Session ID must be distributed uniquely so that it can distinguish and must be difficult to be copied or forge with other Session ID.Under the situation that Session ID is only distributed uniquely, there is a possibility, in fact the device of promptly practising fraud may produce a Session ID and then Session ID is stored among the session cookie, perhaps may increase click volume wrongly with a program, this program is driven constantly to be clicked search terms and changes Session ID simultaneously.
Source data 801 is the master datas that are used to produce Session ID 805.Source data can be the product IDs or the like of the web browser of current information of time, search, searcher.Source data can be the quantity of selecting at random.Hash function 802 is applied to source data 801 to produce a coded strings 803.Then, verification and be added to coded strings 803 to produce Session ID 805.The verification and the device that is used for preventing to practise fraud are forged Session ID.
Be used for to be used to and produce one subsequently with the page identifier that is illustrated, site identifier, terminal identifier or the like according to the method that this embodiment produces Session ID.
Fig. 9 is a method flow diagram that detects invalid clicks according to the embodiment of the invention.
Internet search server 104 receives a click (step 901) that is included in the search terms in the search result web page from searcher.Internet search server 104 obtains a client computer IP address (step 902) corresponding to the terminal 101 of searcher.The IP address of client computer can be extracted from the source IP address field of received IP grouping.
Internet search server 104 obtains the site identifier (step 903) corresponding to clicked search terms.
In step 904, if relevant client computer IP address and site customers machine IP address unanimity are clicked with site identifier and interior other of scheduled time slot in client computer IP address, the equipment that then is used for the search invalid click determines that this click is invalid.
Figure 10 has shown the journal file according to the example of the embodiment of the invention.
In this embodiment, whenever the click that receives a search terms from the user, click time 1010, client computer IP address 1011 and site identifier 1012 just are stored in the journal file 1000.Reference numeral 1001 to 1009 is specified institute's stored log of clicking input corresponding to each.
If identical client terminal is constantly clicked identical search terms, be repeated in scheduled time slot if then click, it is very high then should to click invalid possibility.Yet, so often situation, promptly the user of same client machine terminal clicks identical search terms after the long duration.In other words, have a trend, promptly the user often visits its very interested website.If the user constantly visits a website at short notice, then be difficult to it is regarded as a common click.Thereby it is an invalid clicks that this situation is determined.For example, if time standard is 5 minutes, it is invalid clicks that daily record 1002, daily record 1004 and the daily record 1005 that then has client computer IP address identical with daily record 1001 and an identical site identifier is determined.Determine that in about 20 minutes the click relevant with clicked daily record 1009 is effective click.
If determine invalid clicks, there is the careful point of some needs so based on client computer IP address.Use in the situation of acting server or IP gateway at client terminal, have a danger, even the cheating device is clicked the search terms identical with other client terminal, it also may be confirmed as an invalid clicks.Therefore, preferably, this embodiment is with an embodiment combined structure that uses other parameter such as Session ID.
On the contrary, have such a case, the client computer IP address of promptly clicking the client terminal of identical search term is different, and their network address is identical.This is corresponding to such a case, and promptly the several people constantly attempts coming inequitable click one place or clicking identical search terms with a program, and changes their source IP address simultaneously.In this case, if to click the network address of the client terminal of identical search term be identical and other situation (for example, in the catalogue under search terms, click volume is greater than the situation of average click volume) be satisfied, then this can be determined be an invalid clicks.
Figure 11 is a method flow diagram that detects invalid clicks according to the embodiment of the invention.
Internet search server 104 receives searching request (step 1101) and produces a search result web page (step 1102) from searcher.
Internet search server 104 determines whether the user's cookie file that comprises terminal identifier is stored in (step 1103) in the terminal.
Because the definite result in the step 1103 is not stored in the terminal if comprise user's cookie file of terminal identifier, then Internet search server 104 produces a terminal identifier (step 1104).
104 generations of Internet search server comprise user's cookie file of terminal identifier and it are stored in the searcher terminal (step 1105).
Because the definite result in the step 1103 is stored in the terminal if comprise user's cookie file of terminal identifier, then Internet search server 104 extracts terminal identifier (step 1106) from user's cookie file.
Internet search server 104 receives the click (step 1107) that is included in the search terms in the search result web page from searcher, obtains a site identifier (step 1108) corresponding to clicked search terms then.
At last, in step 1109, if be used for determining the equipment of invalid clicks determine terminal identifier and site identifier with scheduled time slot in relevant terminal identifier and the site identifier unanimity of other click, then this click is invalid.
According to this embodiment,, also might differentiate the terminal of client computer with terminal identifier even client terminal uses an acting server or IP gateway.Thereby,, also may correctly discern the click that comes from different client computer even different client terminals uses acting server or IP gateway.
In another embodiment of the present invention, if for the search terms that is included in the search result web page that provides by internet search engine, the click volume of the searcher of each search terms is greater than the average click volume that belongs to the search terms of classification under the search terms in the scheduled time slot, thereby then it is considered to an invalid clicks and it is reported to the keeper.
The equipment that is used to detect invalid clicks according to present embodiment comprises the click-through count apparatus, searcher click amount at each search terms in the search terms counting scheduled time slot that is included in the search result web page that provides by internet search engine is provided, the average device for calculating of clicking, be used to calculate the average click volume of the search terms that belongs to the affiliated classification of search terms in the scheduled time slot, and determination device, the click volume that is used for determining each search terms whether than average click volume big one predetermined poor.If the click volume of each search terms than average click volume big one predetermined poor, then this fact is reported to the keeper via invalid clicks report unit 308.
According to another embodiment of the invention, at the search terms that is included in the search result web page that provides by internet search engine, at the fixed time in the section, predetermined first quantity of the search terms that is positioned at the search terms upper end in the inherent search result web page of click volume and scheduled time slot of the searcher of each search terms and the average click volume of being scheduled to second quantity that is positioned at the search terms of search terms lower end are compared.For example, in the identical cycle, the click volume of special search terms is compared with being right after two search termses that are positioned on the special search terms and being right after the click volume that is positioned at two search termses under the special search terms.Result as a comparison, if the click volume of special search terms is than big 5 times around the click volume of other search terms, thereby then it is that the possibility of invalid clicks is very high and similarly be reported to the keeper.
Be used for determining that the whole bag of tricks of invalid clicks is illustrated in the above.The method that is used for determining invalid clicks can be used independently or can be united use with being used for determining the method for invalid clicks.For example, a rule can be stored in the invalid clicks model storage unit 306, situation about being repeated in wherein, begin in the last click from search terms corresponding to client computer IP address, page identifier and the site identifier of search terms 5 minutes is invalid.
In the present invention, Internet search server and being used to is discerned the unfair equipment of clicking and is described as individual unit confusedly.Yet, according to another embodiment of the invention, should be noted that they can be separated according to their function to carry out and can be managed by different keepers.
In addition, in the present invention, the element that is shown and is described to separate elements can physically be created in the individual system and can physically be created in the independent system.
In addition,, it is evident that for one of ordinary skill in the art that the part of a plurality of embodiment or remaining embodiment also belong to spirit of the present invention although several embodiment is illustrated in the present invention.
In addition, embodiments of the invention also relate to the computer readable medium that comprises the programmed instruction that is used to carry out different computing machine executable operations.This media can also be separately (or combine with programmed instruction) comprises data file, data structure, tables of data or the like.Media and programmed instruction can be designed especially and be constructed being used for the object of the invention, or they may be well-known types and be that the technician of computer software fields can use.The example of computer readable medium comprises the magnetic media such as hard disk, floppy disk and tape; Light media such as the CD-ROM disk; The magneto-optic media such as but light is read disk; With the special hardware unit of storing with execution of program instructions that disposes of quilt, such as ROM device (ROM) and random-access memory (ram).Media also may be the transmission medium such as light or metallic circuit, wave guide or the like, comprises the carrier wave of the signal of the instruction of emission established procedure, data structure or the like.The example of programmed instruction comprise two such as producing by compiler machine code and comprise the file that can use the high-level code that interpreter carries out by computing machine.
Figure 12 is the block diagram of the structure of an explanation general-purpose computing system, the equipment that this system can be used for founding search engine server and is used for detecting according to the present invention invalid clicks.
Computer system comprises the processor 1240 (being also referred to as central processing unit or CPUs) of any amount, and they are coupled to the memory storage that comprises primary memory 1260 (in general being random access memory or " RAM "), primary memory 1270 (in general being ROM (read-only memory) or " ROM ").It is well known in the art that primary memory 1260 data and instruction sheet to being sent to CPU, and primary memory 1260 generally is used to transmit in a bi-directional way data and instruction.These two host memory device can comprise the computer readable medium of aforesaid any suitable type.Mass storage device 1210 also by bidirectional coupled to CPU1240 and additional memory data output is provided and can comprises aforesaid any computer readable medium.Mass storage device 1210 can be used to stored programme, data or the like, and generally be one such as the secondary storage medium than the slow hard disk of primary memory.Specific mass storage device such as CD 1220 can also be data sheet to passing to CPU.Processor 1240 also is coupled to an interface 1230, it comprises one or more input-output device, such as video monitor, tracking ball, mouse, keyboard, loudspeaker, touch-sensitive display, transducer card reader, magnetic or paper tape reader, board, contact pilotage, audio frequency or handwriting recognizer or other the well-known input media such as comprising other computing machine certainly.At last, shown in 1250, processor 1240 can optionally use network to connect and be coupled to computing machine or telecommunications network as usually.Had this class network to connect, CPU can be in the process of carrying out the said method step from network receiving information or can expect to network output information.Said apparatus and material are very familiar for the technician in computer hardware and the software field.
Aforesaid hardware element can be configured the software module that (generally temporary transient) serves as one or more execution the present invention operations.
Industrial applicibility
According to above-mentioned the present invention, carry for detection of being included in by Internet search engine server for one The method and apparatus of the invalid clicks of the search terms in the search result web page of confession is provided.
According to the present invention, the method and apparatus for detection of invalid clicks, it can detect various Increase wrongly the trial of search terms click volume, and process immediately these trials. That is, if new Inequitable click of model is attempted being found, and then this model or rule are stored in one according to this In the bright invalid clicks model memory cell. Thereby, process immediately the unfairness that this follows new model It is possible clicking trial.
In addition, according to the invention provides one for detection of the method and apparatus of invalid clicks, it can Copied or forged with the several identifiers that prevent from providing in order to detect invalid clicks.
Although the present invention is about the embodiment of the invention that illustrates in the accompanying drawing and be illustrated, yet it also Be not limited in wherein, because for one of ordinary skill in the art, obviously can do therein Go out different displacements, modification and change. Scope of the present invention is defined by additional claim. The institute There are the change in the meaning of claim and scope, made or modification or its equivalent to be counted as To belong to scope of the present invention.

Claims (14)

1. a method that is used for detecting at internet search engine invalid clicks comprises the following steps:
According to click time memory page identifier and the site identifier in the journal file;
Produce search result web page in response to the searching request that comes from searcher;
Obtain a page identifier corresponding to the webpage that is produced;
Be included in the click of the search terms in the search result web page from searcher reception one;
Obtain a site identifier corresponding to clicked search terms; With
Consult journal file, if page identifier and site identifier and interior page identifier and the site identifier unanimity relevant with other click of scheduled time slot determine that then this click is invalid.
2. the process of claim 1 wherein, page identifier and site identifier comprise a verification and.
3. a method that is used for detecting at internet search engine invalid clicks comprises the following steps:
According to click time store session identifier and the site identifier in the journal file;
Produce search result web page in response to the searching request that comes from searcher;
Obtain a Session ID that is included in the session cookie file of storing in the searcher terminal;
Be included in the click of the search terms in the search result web page from searcher reception one;
Obtain a site identifier corresponding to clicked search terms; With
Consult journal file, if Session ID and site identifier with in scheduled time slot, click relevant Session ID and site identifier unanimity with other, determine that then this click is invalid.
4. the method for claim 3 wherein, is obtained the step that is included in the Session ID in the session cookie file of storing in the searcher terminal and is comprised the following steps:
Determine whether the session cookie file is stored in the terminal; With
If determine that the session cookie file is not stored in the terminal, then produce a new Session ID and then the session cookie file of the Session ID that comprises generation is stored in the terminal.
5. the method for claim 4 also comprises the following steps:
Be stored in the terminal if determine the session cookie file, whether the final updating time of then determining to be included in the Session ID in the session cookie file is in scheduled time slot; With
If determine the final updating time in scheduled time slot, then obtain a Session ID that is included in the session cookie file.
6. the method for claim 5 also comprises the following steps:
Not in scheduled time slot, then upgrade the Session ID that is included in the session cookie file if determine the final updating time by producing new Session ID; With
Be stored in the session cookie file update time of Session ID.
7. the method for claim 4 also comprises the following steps:
Be stored in the terminal if determine the session cookie file, then determine from click time of the search terms of searcher whether in the scheduled time slot after the last click time relevant with Session ID;
In the end click in the time scheduled time slot afterwards if determine the click time of search terms, then obtain a Session ID that is included in the session cookie file; With
The click time with search terms is upgraded the last click time.
8. the method for claim 7 also comprises the following steps:
In the end do not click in the time scheduled time slot afterwards if determine the click time of search terms, then upgrade the Session ID that is included in the session cookie file by producing new Session ID; With
The click time with search terms is upgraded the last click time.
9. any one method in the claim 3 to 8, wherein, Session ID and site identifier comprise a verification and.
10. a method that is used for detecting at internet search engine invalid clicks comprises the following steps:
According to click time storage client IP address and the site identifier in the journal file;
Be included in the click of the search terms in the search result web page from searcher reception one;
Obtain a client computer IP address corresponding to the searcher terminal;
Obtain a site identifier corresponding to clicked search terms; With
Consult journal file, if client computer IP address and site identifier and interior client computer IP address and the site identifier unanimity relevant with other click of scheduled time slot determine that then this click is invalid.
11. the method for claim 10, wherein, site identifier use comprising verification and produce.
12. a method that is used for detecting at internet search engine invalid clicks comprises the following steps:
According to click time storage terminal identifier and the site identifier in the journal file;
Produce a search result web page in response to the searching request that comes from searcher;
Obtain a terminal identifier corresponding to the searcher terminal;
Generation one comprises user's cookie file of terminal identifier, then user's cookie file is stored in the searcher terminal;
Be included in the click of the search terms in the search result web page from searcher reception one;
Obtain a site identifier corresponding to clicked search terms; With
Consult journal file, if terminal identifier and site identifier and interior terminal identifier and the site identifier unanimity relevant with other click of scheduled time slot determine that then this click is invalid.
13. the method for claim 12 also comprises the following steps:
Determine whether the cookie file that comprises terminal identifier is stored in the terminal; With
Be stored in the terminal if determine the user's cookie file that comprises terminal identifier, then from user's cookie file receiving terminal identifier.
14. the method for claim 12 or 13, wherein, terminal identifier and site identifier comprise a verification and.
CNB200480007418XA 2003-03-19 2004-02-27 Method and apparatus for detecting invalid clicks on the internet search engine Expired - Lifetime CN100533434C (en)

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
KR1020030017233 2003-03-19
KR1020030017233A KR100619178B1 (en) 2003-03-19 2003-03-19 Method and apparatus for detecting invalid clicks on the internet search engine

Related Child Applications (1)

Application Number Title Priority Date Filing Date
CNA2008101610324A Division CN101388035A (en) 2003-03-19 2004-02-27 Method and device for detecting invalid click on internet search engine server

Publications (2)

Publication Number Publication Date
CN1761961A CN1761961A (en) 2006-04-19
CN100533434C true CN100533434C (en) 2009-08-26

Family

ID=36707372

Family Applications (2)

Application Number Title Priority Date Filing Date
CNB200480007418XA Expired - Lifetime CN100533434C (en) 2003-03-19 2004-02-27 Method and apparatus for detecting invalid clicks on the internet search engine
CNA2008101610324A Pending CN101388035A (en) 2003-03-19 2004-02-27 Method and device for detecting invalid click on internet search engine server

Family Applications After (1)

Application Number Title Priority Date Filing Date
CNA2008101610324A Pending CN101388035A (en) 2003-03-19 2004-02-27 Method and device for detecting invalid click on internet search engine server

Country Status (4)

Country Link
JP (1) JP4358188B2 (en)
KR (1) KR100619178B1 (en)
CN (2) CN100533434C (en)
WO (1) WO2004084097A1 (en)

Families Citing this family (35)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US8706551B2 (en) * 2003-09-04 2014-04-22 Google Inc. Systems and methods for determining user actions
US11042886B2 (en) 2003-09-04 2021-06-22 Google Llc Systems and methods for determining user actions
KR100786796B1 (en) * 2005-03-25 2007-12-18 주식회사 다음커뮤니케이션 Method and system for billing of internet advertising
WO2006121269A1 (en) * 2005-05-06 2006-11-16 Nhn Corporation Personalized search method and system for enabling the method
KR20060028463A (en) * 2006-03-09 2006-03-29 정성욱 Click tracking and management system for online advertisement service
KR100777659B1 (en) * 2006-04-10 2007-11-19 (주)소만사 Device of detecting invalid use of keyword advertisement
KR100777660B1 (en) * 2006-04-10 2007-11-19 (주)소만사 Method of detecting robot-based invalid use of keyword advertisement and computer-readable medium having thereon program performing function embodying the same
WO2008030670A1 (en) * 2006-09-08 2008-03-13 Microsoft Corporation Detecting and adjudicating click fraud
CN101075908B (en) * 2006-11-08 2011-04-20 腾讯科技(深圳)有限公司 Method and system for accounting network click numbers
KR100857148B1 (en) 2007-04-26 2008-09-05 엔에이치엔(주) Method for processing invalid click and system for executing the method
KR100841348B1 (en) * 2007-08-16 2008-06-25 방용정 Non-cost internet advertisement system each time unfairness click of cost-per-click-view and method thereof
KR100902466B1 (en) * 2007-10-30 2009-06-11 엔에이치엔비즈니스플랫폼 주식회사 System and Method for Tracking a Keyword Search Abuser
KR100914600B1 (en) * 2007-11-14 2009-08-31 엔에이치엔(주) System and Method for Determining Invalid Clicks
KR101020949B1 (en) * 2008-11-18 2011-03-09 주식회사 데이타웨이브 시스템 Method and server for detecting unfair click of keyword advertisement
KR20110116562A (en) 2010-04-19 2011-10-26 서울대학교산학협력단 Method and system for detecting bot scum in massive multiplayer online role playing game
CN102289756A (en) * 2010-06-18 2011-12-21 百度在线网络技术(北京)有限公司 Method and system for judging click validation
KR101158464B1 (en) * 2010-11-26 2012-06-20 고려대학교 산학협력단 Method and apparatus for detecting bot process
CN103368857B (en) * 2012-03-26 2016-09-21 北大方正集团有限公司 A kind of method and system sending data message
CN102663062B (en) * 2012-03-30 2015-01-14 北京奇虎科技有限公司 Method and device for processing invalid links in search result
JP2014026528A (en) * 2012-07-27 2014-02-06 Nippon Telegr & Teleph Corp <Ntt> Effective click counter, method and program
KR101919137B1 (en) * 2012-11-08 2018-11-15 네이버 주식회사 Display advertising rate calculating method and system acording to value index of advertisement slot
WO2015012865A1 (en) 2013-07-26 2015-01-29 Empire Technology Development, Llc Device and session identification
CN103475543A (en) * 2013-09-11 2013-12-25 北京思特奇信息技术股份有限公司 Abnormal system service call detection method and system
CN107077455A (en) * 2014-06-03 2017-08-18 埃克斯凯利博Ip有限责任公司 Flow mass is determined using the score traffic based on event
CN104331306B (en) * 2014-10-14 2017-05-10 北京齐尔布莱特科技有限公司 Content updating method, equipment and system
CN104580244B (en) * 2015-01-26 2018-03-13 百度在线网络技术(北京)有限公司 The defence method and device clicked maliciously
KR101639752B1 (en) * 2015-02-13 2016-07-15 네이버 주식회사 System and method for aggregating view of contents using filter logic
CN105069061B (en) * 2015-07-28 2019-03-12 安一恒通(北京)科技有限公司 Loading method, system, the browser and server of webpage in historical viewings record
CN105677869A (en) * 2016-01-06 2016-06-15 广州神马移动信息科技有限公司 Multidimensional search log anti-cheating method, system and computing equipment
CN107526748B (en) * 2016-06-22 2021-08-03 华为技术有限公司 Method and equipment for identifying user click behavior
CN108255885B (en) * 2016-12-29 2020-11-06 北京酷我科技有限公司 Song recommendation method and system
CN110020206B (en) * 2019-04-12 2021-10-15 北京搜狗科技发展有限公司 Search result ordering method and device
CN110069691B (en) * 2019-04-29 2021-05-28 百度在线网络技术(北京)有限公司 Method and device for processing click behavior data
CN111444408B (en) * 2020-03-26 2021-09-14 腾讯科技(深圳)有限公司 Network search processing method and device and electronic equipment
WO2022049776A1 (en) * 2020-09-07 2022-03-10 シエンプレ株式会社 Click fraud prevention system, click fraud prevention method, and program

Family Cites Families (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US6269361B1 (en) * 1999-05-28 2001-07-31 Goto.Com System and method for influencing a position on a search result list generated by a computer network search engine
KR20020020584A (en) * 2000-09-09 2002-03-15 맹진기 Internet survey system and method and media for storing program source thereof

Also Published As

Publication number Publication date
WO2004084097A1 (en) 2004-09-30
CN101388035A (en) 2009-03-18
KR100619178B1 (en) 2006-09-05
JP2006520940A (en) 2006-09-14
JP4358188B2 (en) 2009-11-04
KR20040082633A (en) 2004-09-30
CN1761961A (en) 2006-04-19

Similar Documents

Publication Publication Date Title
CN100533434C (en) Method and apparatus for detecting invalid clicks on the internet search engine
US8131799B2 (en) User-transparent system for uniquely identifying network-distributed devices without explicitly provided device or user identifying information
US9479343B2 (en) Engine for processing content rules associated with locations in a page
US8706551B2 (en) Systems and methods for determining user actions
KR100341110B1 (en) Method and apparatus for tracking client interaction with a network resource and creating client profiles and resource database
JP4470069B2 (en) Input assist device, input assist system, input assist method, and input assist program
KR100692209B1 (en) Method and system for providing information of commodity customized to each user
US20040015714A1 (en) Systems and methods for user identification, user demographic reporting and collecting usage data using biometrics
US20110185016A1 (en) Enhanced website tracking system and mehod
US20090112703A1 (en) User identification in the header of data packets
KR100377515B1 (en) Method for managing advertisements on Internet and System therefor
KR20080031276A (en) Method and system for obtaining information
CN103401835A (en) Method and device for presenting safety detection results of microblog page
WO2007038389A2 (en) Method and apparatus for identifying and classifying network documents as spam
US7032017B2 (en) Identifying unique web visitors behind proxy servers
CN105450462A (en) On-line state monitoring method and system
CN115757995A (en) Method and device for processing characteristic-free data label, computer equipment and storage medium
US8909795B2 (en) Method for determining validity of command and system thereof
KR100619179B1 (en) Method and apparatus for detecting invalid clicks on the internet search engine
KR20030063253A (en) The method and system that provide with advanced keyword retrieval advertising service
CN110795616B (en) Data collection method and device
KR102215479B1 (en) Server, system, method for artist id integrated management
KR20040083340A (en) Method and apparatus for detecting invalid clicks on the internet search engine
KR20120006958A (en) Method for providing multimedia except for communication load
JP2003167974A (en) Prize competition service method and prize competition service system

Legal Events

Date Code Title Description
C06 Publication
PB01 Publication
C10 Entry into substantive examination
SE01 Entry into force of request for substantive examination
C14 Grant of patent or utility model
GR01 Patent grant
C56 Change in the name or address of the patentee

Owner name: NHN BUSINESS PLATFORM CO., LTD.)

Free format text: FORMER NAME: NHN CO., LTD.

CP03 Change of name, title or address

Address after: Gyeonggi Do, South Korea

Patentee after: NHN BUSINESS PLATFORM Corp.

Address before: Seoul, South Kerean

Patentee before: NHN Corp.

ASS Succession or assignment of patent right

Owner name: NABAO CO., LTD.

Free format text: FORMER OWNER: NHN CORP.

Effective date: 20141114

C41 Transfer of patent application or patent right or utility model
TR01 Transfer of patent right

Effective date of registration: 20141114

Address after: Gyeonggi Do city of South Korea

Patentee after: NAVER Corp.

Address before: Gyeonggi Do, South Korea

Patentee before: NHN BUSINESS PLATFORM Corp.

CX01 Expiry of patent term

Granted publication date: 20090826

CX01 Expiry of patent term