US20110153550A1 - System and method for determining an event occurrence rate - Google Patents

System and method for determining an event occurrence rate Download PDF

Info

Publication number
US20110153550A1
US20110153550A1 US13/034,835 US201113034835A US2011153550A1 US 20110153550 A1 US20110153550 A1 US 20110153550A1 US 201113034835 A US201113034835 A US 201113034835A US 2011153550 A1 US2011153550 A1 US 2011153550A1
Authority
US
United States
Prior art keywords
content items
region
impression volume
ctr
impression
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Abandoned
Application number
US13/034,835
Inventor
Deepak Agarwal
Dejan Diklic
Deepayan Chakrabarti
Andrei Zary Broder
Vanja Josifovski
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Yahoo Inc
Original Assignee
Yahoo Inc until 2017
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Yahoo Inc until 2017 filed Critical Yahoo Inc until 2017
Priority to US13/034,835 priority Critical patent/US20110153550A1/en
Publication of US20110153550A1 publication Critical patent/US20110153550A1/en
Assigned to YAHOO HOLDINGS, INC. reassignment YAHOO HOLDINGS, INC. ASSIGNMENT OF ASSIGNORS INTEREST (SEE DOCUMENT FOR DETAILS). Assignors: YAHOO! INC.
Assigned to OATH INC. reassignment OATH INC. ASSIGNMENT OF ASSIGNORS INTEREST (SEE DOCUMENT FOR DETAILS). Assignors: YAHOO HOLDINGS, INC.
Abandoned legal-status Critical Current

Links

Images

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06QINFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES; SYSTEMS OR METHODS SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES, NOT OTHERWISE PROVIDED FOR
    • G06Q30/00Commerce
    • G06Q30/02Marketing; Price estimation or determination; Fundraising

Definitions

  • the invention disclosed herein relates generally to determining an event occurrence rate. More specifically, the present invention relates to estimating an occurrence rate for events aggregated at multiple resolutions through hierarchical data structures.
  • Web advertising is typically implemented according to two general schemes: content match and sponsored search.
  • Content match refers to placement of advertisements (“ads”) within a webpage on the basis of the content of the web page.
  • Sponsored search refers to placing ads on a search results page generated by a web search engine, the ads being responsive to a query that a given user submits to the web search engine.
  • the ads placed on the search results page are selected via analysis of a query string entered into the web search engine.
  • Those of skill in the art recognize that other factors or parameters beyond the query string may influence the selection of ads for placement on a search results page that the web search engine generates including a score that indicates the quality of the ad, a time zone of the user, user browsing history, demographic information, etc.
  • a content match system can generate data indicating each instance that an ad is displayed on a webpage (an “impression”).
  • An ad network an intermediary entity that selects the ad in the content match system, determines a most relevant ad to place on the webpage to entice a user to click on that ad. For example, on a webpage related to sports, the ad network may select ads for soft drinks, because a demographic of visitors interested in sports may be substantially similar to a demographic likely to buy soft drinks. By computing a ratio of a number of clicks on the ads to a number of impressions, the ad network can determine a click-through-rate (CTR) indicative of, inter alia, the relevancy of the ads that are selected. Thus, the CTR becomes a valuable indicator for ad networks seeking to attract business from advertisers.
  • CTR click-through-rate
  • the number of clicks is typically very low compared to the number of impressions.
  • Conventional estimation algorithms based on frequencies of event occurrences incur high statistical variance and fail to provide satisfactory predictions of the CTR because the number of clicks appears negligible in view of the large amount of impressions.
  • estimating CTR from entire corpus of data might involve storing information for each impression. In a content matching system, however, this might involve crawling pages and storing the entire page content, which is expensive both in terms of storage and bandwidth requirements.
  • the present invention generally relates to systems and methods for determining an event occurrence rate.
  • a sample set of content items may be obtained.
  • Each of the content items may be associated with at least one region in a hierarchical data structure.
  • a hierarchical data structure comprises nodes in an advertisement taxonomy hierarchy and nodes in a page taxonomy hierarchy, with a given region characterized or otherwise identified by a combination of nodes from the advertisement taxonomy hierarchy and nodes from the page taxonomy hierarchy.
  • a first impression volume may be determined for the at least one region as a function of a number of impressions registered for the content items associated with the at least one region.
  • a scale factor may be applied to the first impression volume to generate a second impression volume. The scale factor may be selected so that the second impression volume is within a predefined range of a third impression volume.
  • a click-through-rate (CTR) may be estimated as a function of the second impression volume and a number of clicks on the content item.
  • CTR click-through
  • the content items may include at least one of webpages and ads.
  • the obtaining of the sample set may include identifying first content items that have been clicked, identifying a predetermined number of second content items that have not been clicked, and generating the sample set as a function of the first and second content items.
  • the first impression volume may be calculated as a function of the impressions for the first and second content items.
  • the third impression volume may be a total number of impressions associated within a pre-selected level in the hierarchical data structure.
  • a difference impression volume may be calculated as a difference between the first impression volume and the third impression volume, and the difference impression volume may be distributed to the at least one region as a function of the first impression volume.
  • the distributing may include determining a sum of the first impression volumes for each region across a level of the hierarchical data structure, computing a ratio of the first impression volume for a given region to the sum, multiplying the difference impression volume by the ratio to determine an impression addition for the given region, and adding the impression addition to the first impression volume of the given region to generate a fourth impression volume.
  • Estimating the CTR may include assigning a state variable to each of the at least one region, and applying a Markovian model to the state variable to estimate the CTR.
  • the Markovian model may compute a posterior for the state using a Kalman filter, propagate the posterior to the at least one region, and repeat the computing and the propagating until convergence. Upon the convergence, the CTR for the at least one region may be identified and stored on a storage medium.
  • FIG. 1 shows an exemplary embodiment of a system for determining an event occurrence rate according to one embodiment of the present invention
  • FIG. 2 shows an exemplary embodiment of a method for determining an event occurrence rate according to one embodiment of the present invention
  • FIG. 3 shows an exemplary embodiment of a method for generating a sample set of webpages/ads according to one embodiment of the present invention
  • FIG. 4 shows an exemplary embodiment of a method for determining impression volumes at a predetermined node(s) in a webpage/ad hierarchy according to one embodiment of the present invention
  • FIG. 5 shows an exemplary embodiment of a method for estimating a click-through-rate in one or more regions of a webpage/ad hierarchy according to one embodiment of the present invention
  • FIG. 6 shows an exemplary embodiment of a generative model for a two level hierarchy according to one embodiment of the present invention.
  • FIG. 1 shows an exemplary embodiment of a system 100 for determining an event occurrence rate according to the present invention.
  • the system 100 may comprise a publisher server 102 , an ad network server 104 and a client device 106 which are communicatively interconnected via a communications network 108 (e.g., a wired/wireless LAN/WAN, a cellular network, the Internet, an intranet, a VPN, a PSTN, etc.).
  • the publisher and ad network servers 102 , 104 and the client device 106 may be processor-based computing devices which include memory and network connection ports for communicating data on the network 108 .
  • the client device 106 may be a PC, laptop, mobile phone, PDA, tablet computer, handheld computer, smart appliance (e.g., scanner, copier, facsimile machine), etc. which utilizes a web browser (or command-line interface) for allowing a user to interface with devices on the network 108 and view content items (e.g., webpages, ads, videos, audio files, etc.).
  • client devices 106 may be connected to the network 108 and that the servers 102 may comprise any number of servers and/or databases.
  • the publisher server 102 may host one or more webpages that include text, audio, video and/or interactive content (e.g., games, Flash programs, etc.).
  • the webpages may also include ad space (e.g., blank space on the webpage in which an ad may be displayed).
  • a company operating the publisher server 102 may generate revenue by displaying the ads on the webpages.
  • the ads may be hosted by the ad network server 104 or an ad company server 110 (e.g., a repository with company/product-specific ads).
  • the ad network server 104 selects an ad (usually based on an agreement with the website owner and the advertiser) from its own database (or retrieves the selected ad from the ad company server 110 ) and transmits the selected ad to the client device 106 . Displaying the ad on the webpage is typically referred to as an “impression.” The user then sees the selected ad as a part of the webpage that was requested.
  • the ad network server 104 may also implement a content match application.
  • the content match application may include a crawler module which indexes content on various webpages and ads available to be served by the ad network server 104 . Using the indices, the ad network server 104 may select an ad that is most likely to be clicked by the user. The ad network server 104 may generate data recording the impressions and the clicks on served ads for calculating a click-through-rate (CTR), e.g., a percentage of ads that were served and clicked. The CTR may be a valuable statistic for the ad network to demonstrate to advertisers the efficacy of the content match application.
  • CTR click-through-rate
  • the CTR may be estimated at one or more resolutions of webpage/ad hierarchy. That is, the webpages and ads may be classified (manually or automatically) into a pre-existing hierarchy in which nodes in the hierarchy are associated with contextual themes (e.g., skiing winter sports 4 sports).
  • the web pages/ads may be associated with a give node based on the resolution thereof. That is, the more themes used to describe a webpage/ad, the further to the fringe the webpage/ad will be in the hierarchy.
  • the hierarchy may be a tree comprising a single root node that extends into a plurality of leaf nodes.
  • One or more of the leaf nodes may be identified as comprising a region of the tree.
  • a parent node and its children nodes, a plurality of nodes with a common ancestor node or sharing a common theme may be considered a region, or a region may be identified by the contextual theme (e.g., swimming ⁇ summer sports ⁇ sports).
  • FIG. 2 shows an exemplary embodiment of a method 200 for determining an event occurrence rate according to the present invention.
  • the method 200 provides an overview of the exemplary steps for determining an event occurrence rate according to the present invention, and, as such, implementation of each of the steps will be described in further detail below.
  • a sample set of webpages is identified.
  • the sample set may include a predetermined number of webpages on which ads have been served by the ad network server 104 , including the webpages/ads that have been clicked.
  • the webpages in the sample set may be gleaned from, for example, a log maintained by the ad network server 104 .
  • impression volumes are determined, based on true impression data for the webpages in the sample set, for regions in the hierarchy.
  • the determined impression volumes and actual numbers of clicks in a given region are used to determine the CTR(s) for the region.
  • the CTR computed may be computed for any resolution within the hierarchy, allowing for discrimination between regions which truly have negligible CTR from those which may obtain more clicks if provided with more impressions, as will be explained further below.
  • FIG. 3 shows an exemplary embodiment of a method 300 for generating the sample set of webpages/ads.
  • a webpage is identified.
  • the webpage may be one of the webpages that was indicated on the log at the ad network server 104 as registering as impression.
  • a number of the non-clicked webpages included in the sample set may be predetermined or determined automatically when harvesting the pages (e.g., as a function of a total number of webpages in the log, a total number of webpages clicked, a total number of impressions, etc.).
  • the page is crawled to obtain features thereof for classification into a region of the hierarchy.
  • the features on a webpage include, but are not limited to, a URL, an HTML tag(s), words, images, scripts, etc.
  • features on the ads may be available from the log or other pre-recorded data identifying (or providing data for identifying) the features.
  • step 310 the impressions associated with the webpage are mapped onto regions in the hierarchy corresponding to the features of the webpage. This yields the number of sampled impressions in each of the regions.
  • the method 300 may be iterated over all of the webpages in the sample set, resulting in a hierarchy which reflects all of the sampled impressions in each of the regions. Because the impressions associated with the sample set of webpages are relatively small (as compared to the total number of impressions recorded), the hierarchy may not fully reflect true impression volumes for all of the regions.
  • FIG. 4 shows an exemplary embodiment of a method 400 for determining impression volumes at a predetermined node(s) in the hierarchy using the sample impression at another region(s).
  • a scale factor is determined, and, in step 404 , the scale factor is applied to all nodes across a level of the hierarchy.
  • a lower bound on impression volume is computed for each of the regions.
  • the lower bound may be, for example, the total number of sampled impressions in each of the respective regions.
  • excess impressions e.g., the total number of scaled impressions in a region minus the lower bound of sampled impressions in the region
  • a variance of the estimated impression volumes may be reduced.
  • a sum of the estimated impression volumes for children regions nested within a parent region should correlate to the estimated impression volume of the parent region.
  • the excess impressions may be imputed to some (or all of) the nodes using a maximum entropy formulation.
  • FIG. 5 shows an exemplary embodiment of a method 500 for estimating the CTR in one or more regions of the hierarchy using the estimated impression volumes.
  • a state variable is assigned to each node in the hierarchy.
  • a posterior of each of the state variables is computed for each node from fringe leaf nodes to the root node.
  • the posterior may be computed by, for example, a Kalman filter algorithm that “filters” the leave nodes in a bottom-up fashion to the root node.
  • a smoothing effect may be applied to modify the state variables.
  • the smoothing effect may be the result of applying a Markovian model on the state variables. That is, since the state variables of child nodes sharing a common parent node are drawn from a distribution centered around the state variable of the parent, the Markovian model may specify a joint distribution on an entire state space of CTR values.
  • step 508 variance components of the Markovian model may be estimated using, for example, an Expectation-Maximization (EM) algorithm.
  • the EM algorithm may repeat steps 504 (filtering) and 506 (smoothing) for several iterations until convergence (step 510 ).
  • the resulting CTR values may be stored on a storage device for output and/or additional processing.
  • the resulting CTR values may be stored on a storage medium.
  • a set of regions Z may consist of two successive levels of nested regions corresponding to depths 1 and 2 , respectively.
  • Generalization to all regions formed by the page and ad hierarchies may follow as: let IJ and ij denote regions in Z (1) and Z (2) , respectively.
  • the actual impressions in region r from the clicked and non-clicked pages (e.g., as described with reference to the method 300 ) may be denoted as n r and m r , respectively.
  • N r the true impression volume in region r that is to be estimated may be denoted as N r .
  • the x r 's may be interpreted as excess impressions that may be allocated to adjust for a sampling bias.
  • a page (or ad) classified to a node i in the tree may belong to the entire path from a node i to the root node. Also, the page (or ad) may be classified to a node at a depth other than leaf node L—leaf level. As understood by those of skill in the art, this classification scheme has the potential to create inconsistencies in a total number of impressions and clicks obtained at different levels in the tree. For instance, the total number of impressions (or clicks) for a group of children regions may be strictly smaller than the number of impressions (or clicks) of the parent region they are nested within. To ensure consistency, the excess impressions and clicks in a parent node are distributed among the children nodes associated therewith. The steps are repeated at every level in a top-down fashion. Thus, each impression in a non-leaf region is guaranteed to come from some smaller region nested within it.
  • constraints may be imposed while imputing the impression volumes as described in the method 400 .
  • a first set of constraints e.g., column constraints
  • a j (a J ) is the total impression volume for node j(J) in the ad hierarchy
  • CS. (.) ) represents the excess impressions in the column that were missed by the sampling process.
  • a second set of constraints may preserve the impression volumes at nodes in the page hierarchy as follows:
  • RS. (.) represents the excess impressions aggregated for each node in the page hierarchy
  • K (1) and K (2) are constants for levels 1 and 2 .
  • the underlying assumption is that for each sampled impression, there are K (.) times as many excess impressions from the non-clicked pool that did not appear in the sample. Since pages may be randomly sampled from the non-clicked pool, this simple adjustment is reasonable.
  • a third set of constraints may ensure that the excess impressions allocated to a region at level 1 equals the sum of excess impression allocated to regions nested within it at level 2 as follows:
  • true impression volumes may satisfy the block contracts.
  • the block constraints may be imposed during the imputation of impression volumes.
  • a set of positive initial prior values ⁇ x r (0) ⁇ may be identified for all regions r E Z.
  • An aim of the exemplary embodiments of the present invention is to determine a solution ⁇ x r ⁇ which is as close as possible to the prior initial value ⁇ x r (0) ⁇ but satisfies all the row, column and block constraints.
  • this process may be equivalent to finding a solution having a smallest discrepancy from the prior distribution in terms of Kullback-Leibler divergence, subject to the constraints. It may also be referred to as a Maximum Entropy model, because, when the prior initial value ⁇ x r (0) ⁇ is uniform, the solution may maximize Shannon entropy.
  • IPF Iterative Proportional Fitting
  • the exemplary embodiment of the present invention may jointly estimate all x r 's by iterating through a series of top-down and bottom-up scalings.
  • ⁇ x IJ (t) ⁇ For a two level tree, at the t th iteration, start with level 1 , and modify ⁇ x IJ (t) ⁇ to ⁇ x IJ (t+1) ⁇ after adjusting for the row and column constraints.
  • level 2 change the ⁇ x* ij (t) ⁇ 's to ⁇ x ij (t+1) ⁇ 's by adjusting for row and column constraints. This completes the top-down step.
  • the top-down and bottom-up steps may be iterated until convergence. The algorithm may converge rapidly, requiring, for example, 156 iterations for an error tolerance of 1%.
  • One exemplary variable in the exemplary imputation algorithm is the choice of the prior.
  • Setting x r (0) is proportional to lb r may ensure that the excess impressions are distributed in proportion to the lower bounds obtained from the crawled sample as closely as possible subject to the constraints.
  • An alternative is to simply use the traditional IPF algorithm, which starts with a prior of x r (0) that is proportional to 1, and computes the x r values for each level separately, using only the row and column constraints. It can be shown that this automatically satisfies the block constraints as well, due to the relationships between the row and column sums at different levels.
  • the prior distributes the excess impressions using an independence model and does not incorporate the a priori interaction information in the lower bounds.
  • the CTRs are estimated for all (or selected ones) of the nodes therein.
  • the distribution of raw CTRs may be skewed and the variance may depend on the mean (roughly, Var proportional to mean/N r ).
  • the count data may be modeled on a transformed scale using the Freeman-Tukey transformation:
  • c r is the number of clicks in the region r and N r is the imputed number of impressions, determined from the imputation algorithm described above.
  • the second term in the transformation distinguishes between zeros on the basis of the number of impressions, e.g., zero clicks from 100 impressions corresponds to a smaller transformed CTR than zero clicks from only 10 impressions.
  • the transformation may also provide symmetry to an otherwise skewed rate distribution and provide a variance stabilization property, making the variance of the distribution independent of the mean (roughly, Var proportional to 1/N r ).
  • a squared-root transform may be utilized to model the data on a transformed scale.
  • the Markov model may be used as a generative model to calculate the CTRs from the imputed impression volumes.
  • u T r 1 for all r which corresponds to one covariate for each level in the region hierarchy.
  • ⁇ S r ⁇ the observations y r to be independently distributed as a Gaussian:
  • the ⁇ (d(r)) is the unknown coefficient vector attached to covariates at level d(r), and V r is the unknown variance parameter.
  • the latent S r variables are adjusting for effects that are not accounted for by the covariates. However, estimating one S r per region leads to severe overfitting; hence smoothing on S r 's is necessary.
  • the smoothing step is performed by exploiting dependencies induced by the tree structure of regions:
  • FIG. 6 shows an exemplary embodiment of the generative model for two levels.
  • the ratios W r /V r determine the amount of smoothing that takes place in the Markovian model. If W r is large relative to V r , the sibling S r 's are drawn from a distribution that has high variance and hence little smoothing. According to one embodiment, if W r /V r is proportional to infinity, then S r 43 (y r u T r ⁇ (d(r)) ) and the training data is perfectly fit. On the other extreme, if W r /V r ⁇ 0, then S r ⁇ 0 and the fit is a regression model given by the covariates, with the maximum possible smoothing.
  • the variance in the states S r depends only on the depth of region r, and increases when moving from coarser to finer resolutions.
  • the correlation coefficient Corr (l,l′) depends only on the level of the regions and the distance to their least common ancestor.
  • the y r 's may be independent conditional on S r 's, but the dependencies in S r 's impose dependencies in the marginal distribution of y r 's.
  • the EM algorithm may be used to estimate the posterior distribution of ⁇ S r ⁇ 's and ⁇ (d(r)) ⁇ 's and provide point estimates of the variance components ⁇ W(l) ⁇ and V.
  • Implementation of the EM algorithm may utilize a Kalman filtering step for efficiently estimating the posterior distributions of ⁇ S r ⁇ 's for fixed values of the variance components.
  • the Kalman filtering algorithm itself consists of two steps, namely, a filtering step that aggregates information from the leaves up to the root, followed by a smoothing step that propagates the aggregated information in the root downwards to the leaves.
  • the EM algorithm may use these in the Kalman filtering and smoothing steps, recomputing the variance and covariate components, and repeating the process until convergence.
  • the EM algorithm first computes the expected log-likelihood of the conditional distribution of all the state variables ⁇ S r ⁇ given the current estimates of all variance and covariate components ⁇ W (1) (t) ⁇ , V(t), ⁇ (t) ⁇ and the data ⁇ y r ⁇ . This step uses the posterior distributions of the state variables from the Kalman filtering and smoothing steps.
  • the Kalman filtering step may be implemented as follows:
  • FIGS. 1 through 6 are conceptual illustrations allowing for an explanation of the present invention. It should be understood that various aspects of the embodiments of the present invention could be implemented in hardware, firmware, software, or combinations thereof. In such embodiments, the various components and/or steps would be implemented in hardware, firmware, and/or software to perform the functions of the present invention. That is, the same piece of hardware, firmware, or module of software could perform one or more of the illustrated blocks (e.g., components or steps).
  • computer software e.g., programs or other instructions
  • data is stored on a machine readable medium as part of a computer program product, and is loaded into a computer system or other device or machine via a removable storage drive, hard drive, or communications interface.
  • Computer programs also called computer control logic or computer readable program code
  • processors controllers, or the like
  • machine readable medium “computer program medium” and “computer usable medium” are used to generally refer to media such as a random access memory (RAM); a read only memory (ROM); a removable storage unit (e.g., a magnetic or optical disc, flash memory device, or the like); a hard disk; electronic, electromagnetic, optical, acoustical, or other form of propagated signals (e.g., carrier waves, infrared signals, digital signals, etc.); or the like.
  • RAM random access memory
  • ROM read only memory
  • removable storage unit e.g., a magnetic or optical disc, flash memory device, or the like
  • hard disk e.g., a hard disk
  • electronic, electromagnetic, optical, acoustical, or other form of propagated signals e.g., carrier waves, infrared signals, digital signals, etc.

Abstract

Described are a system and method for determined an event occurrence rate. A sample set of content items may be obtained. Each of the content items may be associated with at least one region in a hierarchical data structure. A first impression volume may be determined for the at least one region as a function of a number of impressions registered for the content items associated with the at least one region. A scale factor may be applied to the first impression volume to generate a second impression volume. The scale factor may be selected so that the second impression volume is within a predefined range of a third impression volume. A click-through-rate (CTR) may be estimated as a function of the second impression volume and a number of clicks on the content item.

Description

    CLAIM OF PRIORITY
  • This application is a continuation of and claims priority to U.S. Ser. No. 11/696,944, filed Apr. 5, 2007, which is hereby incorporated by reference in its entirety.
  • COPYRIGHT NOTICE
  • A portion of the disclosure of this patent document contains material which is subject to copyright protection. The copyright owner has no objection to the facsimile reproduction by anyone of the patent document or the patent disclosure, as it appears in the Patent and Trademark Office patent files or records, but otherwise reserves all copyright rights whatsoever.
  • CROSS REFERENCE TO RELATED APPLICATION
  • The present application is related to co-pending U.S. patent application Ser. No. 11/637,524, entitled “SYSTEM AND METHOD FOR MATCHING OBJECTS BELONGING TO HIERARCHIES,” filed on Dec. 12, 2006, and published under US Publication No. 2008/0140591, the disclosure of which is hereby incorporated by reference in its entirety.
  • FIELD OF THE INVENTION
  • The invention disclosed herein relates generally to determining an event occurrence rate. More specifically, the present invention relates to estimating an occurrence rate for events aggregated at multiple resolutions through hierarchical data structures.
  • BACKGROUND OF THE INVENTION
  • Web advertising is typically implemented according to two general schemes: content match and sponsored search. Content match refers to placement of advertisements (“ads”) within a webpage on the basis of the content of the web page. Sponsored search refers to placing ads on a search results page generated by a web search engine, the ads being responsive to a query that a given user submits to the web search engine. The ads placed on the search results page are selected via analysis of a query string entered into the web search engine. Those of skill in the art recognize that other factors or parameters beyond the query string may influence the selection of ads for placement on a search results page that the web search engine generates including a score that indicates the quality of the ad, a time zone of the user, user browsing history, demographic information, etc. A content match system can generate data indicating each instance that an ad is displayed on a webpage (an “impression”).
  • An ad network, an intermediary entity that selects the ad in the content match system, determines a most relevant ad to place on the webpage to entice a user to click on that ad. For example, on a webpage related to sports, the ad network may select ads for soft drinks, because a demographic of visitors interested in sports may be substantially similar to a demographic likely to buy soft drinks. By computing a ratio of a number of clicks on the ads to a number of impressions, the ad network can determine a click-through-rate (CTR) indicative of, inter alia, the relevancy of the ads that are selected. Thus, the CTR becomes a valuable indicator for ad networks seeking to attract business from advertisers. However, the number of clicks is typically very low compared to the number of impressions. Conventional estimation algorithms based on frequencies of event occurrences incur high statistical variance and fail to provide satisfactory predictions of the CTR because the number of clicks appears negligible in view of the large amount of impressions. Furthermore, estimating CTR from entire corpus of data might involve storing information for each impression. In a content matching system, however, this might involve crawling pages and storing the entire page content, which is expensive both in terms of storage and bandwidth requirements.
  • Therefore, there exists a need for a reliable sampling model for determining an occurrence of a rare event within large volumes of data.
  • SUMMARY OF THE INVENTION
  • The present invention generally relates to systems and methods for determining an event occurrence rate. A sample set of content items may be obtained. Each of the content items may be associated with at least one region in a hierarchical data structure. According to one embodiment, a hierarchical data structure comprises nodes in an advertisement taxonomy hierarchy and nodes in a page taxonomy hierarchy, with a given region characterized or otherwise identified by a combination of nodes from the advertisement taxonomy hierarchy and nodes from the page taxonomy hierarchy. A first impression volume may be determined for the at least one region as a function of a number of impressions registered for the content items associated with the at least one region. A scale factor may be applied to the first impression volume to generate a second impression volume. The scale factor may be selected so that the second impression volume is within a predefined range of a third impression volume. A click-through-rate (CTR) may be estimated as a function of the second impression volume and a number of clicks on the content item.
  • The content items may include at least one of webpages and ads. The obtaining of the sample set may include identifying first content items that have been clicked, identifying a predetermined number of second content items that have not been clicked, and generating the sample set as a function of the first and second content items. The first impression volume may be calculated as a function of the impressions for the first and second content items. The third impression volume may be a total number of impressions associated within a pre-selected level in the hierarchical data structure. A difference impression volume may be calculated as a difference between the first impression volume and the third impression volume, and the difference impression volume may be distributed to the at least one region as a function of the first impression volume. The distributing may include determining a sum of the first impression volumes for each region across a level of the hierarchical data structure, computing a ratio of the first impression volume for a given region to the sum, multiplying the difference impression volume by the ratio to determine an impression addition for the given region, and adding the impression addition to the first impression volume of the given region to generate a fourth impression volume. Estimating the CTR may include assigning a state variable to each of the at least one region, and applying a Markovian model to the state variable to estimate the CTR. The Markovian model may compute a posterior for the state using a Kalman filter, propagate the posterior to the at least one region, and repeat the computing and the propagating until convergence. Upon the convergence, the CTR for the at least one region may be identified and stored on a storage medium.
  • BRIEF DESCRIPTION OF THE DRAWINGS
  • The invention is illustrated in the figures of the accompanying drawings which are meant to be exemplary and not limiting, in which like references are intended to refer to like or corresponding parts, and in which:
  • FIG. 1 shows an exemplary embodiment of a system for determining an event occurrence rate according to one embodiment of the present invention;
  • FIG. 2 shows an exemplary embodiment of a method for determining an event occurrence rate according to one embodiment of the present invention;
  • FIG. 3 shows an exemplary embodiment of a method for generating a sample set of webpages/ads according to one embodiment of the present invention;
  • FIG. 4 shows an exemplary embodiment of a method for determining impression volumes at a predetermined node(s) in a webpage/ad hierarchy according to one embodiment of the present invention;
  • FIG. 5 shows an exemplary embodiment of a method for estimating a click-through-rate in one or more regions of a webpage/ad hierarchy according to one embodiment of the present invention; and
  • FIG. 6 shows an exemplary embodiment of a generative model for a two level hierarchy according to one embodiment of the present invention.
  • DETAILED DESCRIPTION OF THE PREFERRED EMBODIMENTS
  • In the following description of the preferred embodiments, reference is made to the accompanying drawings that form a part hereof, and in which is shown by way of illustration a specific embodiment in which the invention may be practiced. It is to be understood that other embodiments may be utilized and structural changes may be made without departing from the scope of the present invention.
  • FIG. 1 shows an exemplary embodiment of a system 100 for determining an event occurrence rate according to the present invention. The system 100 may comprise a publisher server 102, an ad network server 104 and a client device 106 which are communicatively interconnected via a communications network 108 (e.g., a wired/wireless LAN/WAN, a cellular network, the Internet, an intranet, a VPN, a PSTN, etc.). The publisher and ad network servers 102, 104 and the client device 106 may be processor-based computing devices which include memory and network connection ports for communicating data on the network 108. For example, the client device 106 may be a PC, laptop, mobile phone, PDA, tablet computer, handheld computer, smart appliance (e.g., scanner, copier, facsimile machine), etc. which utilizes a web browser (or command-line interface) for allowing a user to interface with devices on the network 108 and view content items (e.g., webpages, ads, videos, audio files, etc.). Those of skill in the art understand that any number of client devices 106 may be connected to the network 108 and that the servers 102 may comprise any number of servers and/or databases.
  • The publisher server 102 may host one or more webpages that include text, audio, video and/or interactive content (e.g., games, Flash programs, etc.). The webpages may also include ad space (e.g., blank space on the webpage in which an ad may be displayed). A company operating the publisher server 102 may generate revenue by displaying the ads on the webpages. The ads may be hosted by the ad network server 104 or an ad company server 110 (e.g., a repository with company/product-specific ads). When the browser on the client device 106 requests the webpage from the publisher server 102, the ad network server 104 selects an ad (usually based on an agreement with the website owner and the advertiser) from its own database (or retrieves the selected ad from the ad company server 110) and transmits the selected ad to the client device 106. Displaying the ad on the webpage is typically referred to as an “impression.” The user then sees the selected ad as a part of the webpage that was requested.
  • Along with using rules defined in website owner-advertiser agreements to select ads, the ad network server 104 may also implement a content match application. The content match application may include a crawler module which indexes content on various webpages and ads available to be served by the ad network server 104. Using the indices, the ad network server 104 may select an ad that is most likely to be clicked by the user. The ad network server 104 may generate data recording the impressions and the clicks on served ads for calculating a click-through-rate (CTR), e.g., a percentage of ads that were served and clicked. The CTR may be a valuable statistic for the ad network to demonstrate to advertisers the efficacy of the content match application.
  • In an exemplary embodiment of the present invention, the CTR may be estimated at one or more resolutions of webpage/ad hierarchy. That is, the webpages and ads may be classified (manually or automatically) into a pre-existing hierarchy in which nodes in the hierarchy are associated with contextual themes (e.g., skiing winter sports 4 sports). The web pages/ads may be associated with a give node based on the resolution thereof. That is, the more themes used to describe a webpage/ad, the further to the fringe the webpage/ad will be in the hierarchy.
  • While the exemplary embodiments will be described with reference to a single hierarchy used by both the webpages and the ads, those of skill in the art will understand that the webpages and the ads may utilize mutually exclusive and/or overlapping hierarchies. The hierarchy may be a tree comprising a single root node that extends into a plurality of leaf nodes. One or more of the leaf nodes may be identified as comprising a region of the tree. For example, a parent node and its children nodes, a plurality of nodes with a common ancestor node or sharing a common theme may be considered a region, or a region may be identified by the contextual theme (e.g., swimming→summer sports→sports).
  • FIG. 2 shows an exemplary embodiment of a method 200 for determining an event occurrence rate according to the present invention. The method 200 provides an overview of the exemplary steps for determining an event occurrence rate according to the present invention, and, as such, implementation of each of the steps will be described in further detail below. In step 202, a sample set of webpages is identified. The sample set may include a predetermined number of webpages on which ads have been served by the ad network server 104, including the webpages/ads that have been clicked. The webpages in the sample set may be gleaned from, for example, a log maintained by the ad network server 104. In step 204, impression volumes are determined, based on true impression data for the webpages in the sample set, for regions in the hierarchy. In step 206, the determined impression volumes and actual numbers of clicks in a given region are used to determine the CTR(s) for the region. The CTR computed may be computed for any resolution within the hierarchy, allowing for discrimination between regions which truly have negligible CTR from those which may obtain more clicks if provided with more impressions, as will be explained further below.
  • FIG. 3 shows an exemplary embodiment of a method 300 for generating the sample set of webpages/ads. In step 302, a webpage is identified. The webpage may be one of the webpages that was indicated on the log at the ad network server 104 as registering as impression. In step 304, it is determined whether the webpage was clicked. If the webpage was not clicked, it is determined whether the webpage should be included in the sample set as a non-clicked webpage (step 306). A number of the non-clicked webpages included in the sample set may be predetermined or determined automatically when harvesting the pages (e.g., as a function of a total number of webpages in the log, a total number of webpages clicked, a total number of impressions, etc.).
  • In step 308, the page is crawled to obtain features thereof for classification into a region of the hierarchy. The features on a webpage include, but are not limited to, a URL, an HTML tag(s), words, images, scripts, etc. As understood by those of skill in the art, features on the ads may be available from the log or other pre-recorded data identifying (or providing data for identifying) the features.
  • In step 310, the impressions associated with the webpage are mapped onto regions in the hierarchy corresponding to the features of the webpage. This yields the number of sampled impressions in each of the regions. The method 300 may be iterated over all of the webpages in the sample set, resulting in a hierarchy which reflects all of the sampled impressions in each of the regions. Because the impressions associated with the sample set of webpages are relatively small (as compared to the total number of impressions recorded), the hierarchy may not fully reflect true impression volumes for all of the regions.
  • FIG. 4 shows an exemplary embodiment of a method 400 for determining impression volumes at a predetermined node(s) in the hierarchy using the sample impression at another region(s). In step 402, a scale factor is determined, and, in step 404, the scale factor is applied to all nodes across a level of the hierarchy. In step 406, it is determined whether the total number of true impressions indicated in the log is substantially equal to (e.g., within a predefined error bound) the sampled impressions for the predetermined node multiplied by the scale factor. If the totals are incongruous, the scale factor may be modified.
  • In step 408, a lower bound on impression volume is computed for each of the regions. The lower bound may be, for example, the total number of sampled impressions in each of the respective regions. In step 410, excess impressions (e.g., the total number of scaled impressions in a region minus the lower bound of sampled impressions in the region), may be distributed among the respective regions. That is, by conforming estimated impression volumes to the scaled impression totals at each node in the page and ad hierarchies, a variance of the estimated impression volumes may be reduced. Additionally, a sum of the estimated impression volumes for children regions nested within a parent region should correlate to the estimated impression volume of the parent region. As will be explained further below, the excess impressions may be imputed to some (or all of) the nodes using a maximum entropy formulation.
  • FIG. 5 shows an exemplary embodiment of a method 500 for estimating the CTR in one or more regions of the hierarchy using the estimated impression volumes. In step 502, a state variable is assigned to each node in the hierarchy. In step 504, a posterior of each of the state variables is computed for each node from fringe leaf nodes to the root node. The posterior may be computed by, for example, a Kalman filter algorithm that “filters” the leave nodes in a bottom-up fashion to the root node.
  • In step 506, a smoothing effect may be applied to modify the state variables. The smoothing effect may be the result of applying a Markovian model on the state variables. That is, since the state variables of child nodes sharing a common parent node are drawn from a distribution centered around the state variable of the parent, the Markovian model may specify a joint distribution on an entire state space of CTR values.
  • In step 508, variance components of the Markovian model may be estimated using, for example, an Expectation-Maximization (EM) algorithm. The EM algorithm may repeat steps 504 (filtering) and 506 (smoothing) for several iterations until convergence (step 510). When convergence is reached, the resulting CTR values may be stored on a storage device for output and/or additional processing. In step 512, the resulting CTR values may be stored on a storage medium.
  • A more detailed exemplary embodiment of determining and imputing impression volumes is described below. A set of regions Z may consist of two successive levels of nested regions corresponding to depths 1 and 2, respectively. Generalization to all regions formed by the page and ad hierarchies may follow as: let IJ and ij denote regions in Z(1) and Z(2), respectively. The actual impressions in region r from the clicked and non-clicked pages (e.g., as described with reference to the method 300) may be denoted as nr and mr, respectively. Thus, lbr=nr+mr may provide a lower bound on the impression volume for the region r. Let Nrdenote the true impression volume in region r that is to be estimated may be denoted as Nr. Using a linear transformation xr=Nrlbr, the estimation problem may be written in terms of xr and derive estimates of Nr as Nr=xr+lbr where xris our estimate of xr. In fact, the xr's may be interpreted as excess impressions that may be allocated to adjust for a sampling bias.
  • A page (or ad) classified to a node i in the tree may belong to the entire path from a node i to the root node. Also, the page (or ad) may be classified to a node at a depth other than leaf node L—leaf level. As understood by those of skill in the art, this classification scheme has the potential to create inconsistencies in a total number of impressions and clicks obtained at different levels in the tree. For instance, the total number of impressions (or clicks) for a group of children regions may be strictly smaller than the number of impressions (or clicks) of the parent region they are nested within. To ensure consistency, the excess impressions and clicks in a parent node are distributed among the children nodes associated therewith. The steps are repeated at every level in a top-down fashion. Thus, each impression in a non-leaf region is guaranteed to come from some smaller region nested within it.
  • One or more constraints may be imposed while imputing the impression volumes as described in the method 400. A first set of constraints (e.g., column constraints) may ensure that a sum of the impressions along a column is substantially equal to a total number of impressions for a corresponding node in the ad hierarchy:

  • Σx ij =a j Σlb ij =CS j (2); for all j in Level 2  (1)

  • Σx IJ =a J Σlb IJ =CS J (1); for all J in Level 1  (2)
  • In the exemplary column constraint, aj(aJ) is the total impression volume for node j(J) in the ad hierarchy, and CS.(.)) represents the excess impressions in the column that were missed by the sampling process. For a node J at level 1 in the ad hierarchy, ajjpa(j)=Jaj, where pa(j) denotes the parent node of node j, e.g., the column impressions total for a level 1 node is the sum of the column totals of its children in level 2. Also, ΣCSj (2)=ΣCSJ (1))=TotExcess, where TotExcess is the total number of excess impressions in the data.
  • A second set of constraints (e.g., row constraints) may preserve the impression volumes at nodes in the page hierarchy as follows:

  • Σxij=K(2)Σmij=RSi (2); Vi

  • ΣxIJ=K(1)ΣmIJ=RSI (1);VI  (3)
  • In the second set of constraints, RS.(.) represents the excess impressions aggregated for each node in the page hierarchy, and K(1) and K(2) are constants for levels 1 and 2. The underlying assumption is that for each sampled impression, there are K(.) times as many excess impressions from the non-clicked pool that did not appear in the sample. Since pages may be randomly sampled from the non-clicked pool, this simple adjustment is reasonable. The constants K(.) are chosen to preserve total impression volume, e.g., so that ΣRSi (2)=ΣRSI (1)=TotExcess.
  • A third set of constraints (e.g., block constraints) may ensure that the excess impressions allocated to a region at level 1 equals the sum of excess impression allocated to regions nested within it at level 2 as follows:

  • Σi:j:pa(ij)=IJ=xIJ; for all IJ  (4)
  • As understood by those of skill in the art, true impression volumes may satisfy the block contracts. Thus, the block constraints may be imposed during the imputation of impression volumes. Additionally, analogous row, column and block constraints may be imposed at all other levels l(l=0, . . . , L).
  • In estimating the impression volumes, a set of positive initial prior values {xr(0)} may be identified for all regions r E Z. An aim of the exemplary embodiments of the present invention is to determine a solution {xr} which is as close as possible to the prior initial value {xr(0)} but satisfies all the row, column and block constraints. As understood by those of skill in the art, this process may be equivalent to finding a solution having a smallest discrepancy from the prior distribution in terms of Kullback-Leibler divergence, subject to the constraints. It may also be referred to as a Maximum Entropy model, because, when the prior initial value {xr(0)} is uniform, the solution may maximize Shannon entropy.
  • In one exemplary embodiment, the Maximum Entropy model may be solved using an Iterative Proportional Fitting (IPF) algorithm, which iterates cyclically over all of the constraints and updates the xr values to match the constraints as closely as possible. Specifically, at the tth iteration, if: a constraint of the form Σrkrxr=C is being violated (kr=0 or 1 for all of the constraints); the current value C(t) of the LHS is C(t)=Σrkrxr(t), where Ct≠C; then, the IPF algorithm adjusts each element xr involved in the constraint by a constant factor C/C(t) to get the new values xr(t+1)=xr(t)·C/C(t). Updating in this manner may ensure non-negativity of a final solution. The updates may be performed for all constraints until convergence.
  • The exemplary embodiment of the present invention may jointly estimate all xr's by iterating through a series of top-down and bottom-up scalings. For a two level tree, at the tth iteration, start with level 1, and modify {xIJ(t)} to {xIJ(t+1)} after adjusting for the row and column constraints. This changes the values of {xij(t)}'s at level 2 to {x*ij(t)}'s by adjusting for the corresponding block constraints. At level 2, change the {x*ij(t)}'s to {xij(t+1)}'s by adjusting for row and column constraints. This completes the top-down step. In the bottom-up step, the leaf regions (in the exemplary embodiment, the regions at level 2 do not change, e.g., xij(t+2)=xij (t+1). Using the block constraints, the values at level 1 change to {x*ij(t+1)}=Σi:l:pa(ij)=IJXij(t+2) followed by row and column scalings to satisfy the level 1 constraints, ending with xIJ(t+2). The top-down and bottom-up steps may be iterated until convergence. The algorithm may converge rapidly, requiring, for example, 156 iterations for an error tolerance of 1%.
  • The exemplary algorithm described above with reference to a two-level tree may be extended to a tree with/levels as follows:
  • Initialization:
    Begin with a prior {xr(0)}for regions r E Z(1) of level 1
    From iteration t to t + 2:
    Begin Top-down: A
    For all r E Z(1), xr(t) → row constraints → column constraints → xr(t + 1)
    For levels 1 = 2, ... , L
    For all r E .Z(1): xr(t) → block constraints with xpa(r) (t + 1) on the
    RHS → x*r(t),
    where pa(r) is the parent region subsuming r
    x*r(t) → row constraints → column constraints → xr(t + 1)
    Begin bottom-up:
    For all r E Z(L), xr(t + 2) = xr(t + 1)
    For levels 1 = L,..., 1
    For all r E Z(e): x;.(t + 1) = ΣkEch(r) xk(t + 1), where ch(r) are all children
    regions nested within r
     x*r(t + 1) → row constraints → column constraints → xr(t + 2)
     Iterate until all constraints are substantially satisfied up to a predefined
     accuracy factor
  • One exemplary variable in the exemplary imputation algorithm is the choice of the prior. Setting xr(0) is proportional to lbr may ensure that the excess impressions are distributed in proportion to the lower bounds obtained from the crawled sample as closely as possible subject to the constraints. An alternative is to simply use the traditional IPF algorithm, which starts with a prior of xr(0) that is proportional to 1, and computes the xrvalues for each level separately, using only the row and column constraints. It can be shown that this automatically satisfies the block constraints as well, due to the relationships between the row and column sums at different levels. However, the prior distributes the excess impressions using an independence model and does not incorporate the a priori interaction information in the lower bounds.
  • After the impression volumes have been imputed to the hierarchy, the CTRs are estimated for all (or selected ones) of the nodes therein. The distribution of raw CTRs may be skewed and the variance may depend on the mean (roughly, Var proportional to mean/Nr). In the exemplary embodiment, the count data may be modeled on a transformed scale using the Freeman-Tukey transformation:
  • y r = 1 2 ( c r N ^ r + c r + 1 N ^ r ) , ( 5 )
  • In the above transformation, cr is the number of clicks in the region r and Nr is the imputed number of impressions, determined from the imputation algorithm described above. The second term in the transformation distinguishes between zeros on the basis of the number of impressions, e.g., zero clicks from 100 impressions corresponds to a smaller transformed CTR than zero clicks from only 10 impressions. The transformation may also provide symmetry to an otherwise skewed rate distribution and provide a variance stabilization property, making the variance of the distribution independent of the mean (roughly, Var proportional to 1/Nr). In an alternative exemplary embodiment, a squared-root transform may be utilized to model the data on a transformed scale.
  • As stated above in the description of method 500, the Markov model may be used as a generative model to calculate the CTRs from the imputed impression volumes. In the exemplary dataset, uT r=1 for all r which corresponds to one covariate for each level in the region hierarchy. Conditional on the states {Sr} being known, assume the observations yr to be independently distributed as a Gaussian:

  • yr|Sr(d(r))˜N(ur™β (d(r))+Sr, Vr),  (6)
  • The β(d(r)) is the unknown coefficient vector attached to covariates at level d(r), and Vr is the unknown variance parameter. The latent Sr variables are adjusting for effects that are not accounted for by the covariates. However, estimating one Sr per region leads to severe overfitting; hence smoothing on Sr's is necessary. The smoothing step is performed by exploiting dependencies induced by the tree structure of regions:

  • S r =S pa(r) +w r,  (7)
  • The wr is computationally similar to N(0, Wr) for all r E Z \Z(0). Also, wr is independent of Spa(r) and SRoot=WRaot=0. FIG. 6 shows an exemplary embodiment of the generative model for two levels.
  • In the exemplary embodiment, estimating a separate Wr and Vr for each region may require assuming that all regions at the same level have the same Wr value: Wr=W(l) for all r E S(l). Modeling assumptions on Vr depend on the data and the tree structure of regions. In the present example, Var(yr) is proportional to 1/N, (from Equation 5). Thus, assume that there is a V such that Vr=V/Nr for all r E S(l).
  • The ratios Wr/Vr, determine the amount of smoothing that takes place in the Markovian model. If Wr is large relative to Vr, the sibling Sr's are drawn from a distribution that has high variance and hence little smoothing. According to one embodiment, if Wr/Vr is proportional to infinity, then Sr 43 (yr uT rβ(d(r))) and the training data is perfectly fit. On the other extreme, if Wr/Vr→0, then Sr→0 and the fit is a regression model given by the covariates, with the maximum possible smoothing.
  • From the above description, one or more correlations may be implied by the Markovian model. For example, from Equation 7 and the independence of wr and Spa(r), it follows that:
  • Var ( S r ) = t = 1 d ( r ) W ( i ) ( 8 )
  • Thus, the variance in the states Sr depends only on the depth of region r, and increases when moving from coarser to finer resolutions.
  • For any two regions r1 and r2 at depth/sharing a common ancestor g at depth l′<1, the covariance between the state values is given by Cov(Sr1, Sr2)=Var(Sq), which depends only on l′. Thus, the correlation coefficient of nodes at level 1 whose least common ancestor is at level l′ is given by
  • Corr ( l , l ) = i = i l W ( i ) i = 1 l W ( i ) ( 9 )
  • The correlation coefficient Corr (l,l′) depends only on the level of the regions and the distance to their least common ancestor. The yr's may be independent conditional on Sr's, but the dependencies in Sr's impose dependencies in the marginal distribution of yr's.
  • As explained and described above, the EM algorithm may be used to estimate the posterior distribution of {Sr}'s and {β(d(r))}'s and provide point estimates of the variance components {W(l)} and V. Implementation of the EM algorithm may utilize a Kalman filtering step for efficiently estimating the posterior distributions of {Sr}'s for fixed values of the variance components. The Kalman filtering algorithm itself consists of two steps, namely, a filtering step that aggregates information from the leaves up to the root, followed by a smoothing step that propagates the aggregated information in the root downwards to the leaves. To provide intuition on the filtering step, note that the state equations may be inverted to express parent states in terms of their children's states:
  • S pa ( r ) = E ( S pa ( r ) S r ) + ( S pa ( r ) E ( S pa ( r ) S r ) ) = B r S r + ψ r ( 10 ) where B r = i = 1 d ( r ) - 1 W ( i ) / i = 1 d ( r ) W ( i ) , E [ ψ r ] = 0 and V a ^ r ( ψ r ) = W ( d ( r ) ) B r
  • Beginning with initial estimates for {W(l)(0)}, V, and {β(d(r))(0)}, the EM algorithm may use these in the Kalman filtering and smoothing steps, recomputing the variance and covariate components, and repeating the process until convergence. At step l+1, the EM algorithm first computes the expected log-likelihood of the conditional distribution of all the state variables {Sr} given the current estimates of all variance and covariate components {W(1)(t)}, V(t), {β(t)} and the data {yr}. This step uses the posterior distributions of the state variables from the Kalman filtering and smoothing steps. Subsequently, the parameters {W(1)(t+1)}, V(t+1), {β1(t+1)} are determined which maximize the conditional distribution of {Sr}. The new estimates are used at the next timestep of the EM algorithm.
  • The Kalman filtering step may be implemented as follows:
  • Filtering: Define, for all r ε Z, the following quantities:
    e r = y r · u r T β _ ( d ( r ) ) ; B _ r = i = 1 d ( r ) - 1 W ( i ) i = 1 d ( r ) W ( i )
    σ r = i = 1 d ( r ) W ( i ) ; R T = B _ r W r = B _ r W ( d ( r ) )
    For the leaf regions r ε Z(L), compute:
    Ŝr|r = σrer/(σr + Vr); Γr|r = σrVr/(σr + Vr)
    For non-leaf nodes r ε Z\Z(L), let kr denote the number of children
    regions under r, and let ci(r) denote the ith such child. Then, compute:
    Ŝr|c i (r) = B c i (r)Ŝc i (r)|c i (r)
    Γr|c i (r) = B c i (r)Γc i (r)|c i (r) B c i (r) + Rc i (r)
    S ^ r | r * = Γ r | r * ( i = 1 k r Γ r | c i ( r ) - 1 S ^ r | c i ( r ) )
    Γ r | r * = { r - 1 + i = 1 k r ( Γ r | c i ( r ) - 1 · r - 1 ) } - 1
    Ŝr|r = Γr|r(Vr −1er + (Γr|r *)−1Ŝr|r *)
    Γr|r = Γr|r * · Γr|r *r|r * + Vr)−1Γr|r *
    Smoothing: Set the values Ŝr = Ŝr|r and Γr = Γr|r for all r ε Z(l).
    for all other levels r ε Z\Z(l), compute:
    Ŝr = Ŝr|r + Γr|r B rΓpa(r)|r −1pa(r) · Ŝpa(r)|r)
    Γr = Γr|r + Γr|r B r 2Γpa(r)|r −1pa(r) · Γpa(r)|rpa(r)|r −1Γr|r
    Γr|pa(r) = Γr|r B rΓpa(r)|r −1Γpa(r)
    Expectation Maximization: Define the following:
    er(t) = yr · ur T{circumflex over (B)}(d(r))(t)
    Q ( l ) ( t + 1 ) = r ε Z ( l ) ( Γ r + ( S ^ r · e r t ) 2 ) N ^ r | Z ( l ) |
    Then, compute:
    V ( t + 1 ) = l | Z ( l ) | · Q ( l ) ( t + 1 ) l | Z ( l ) |
    W ( l ) ( t + 1 ) = r ε Z ( l ) ( Γ r + Γ p α ( r ) 2 Γ r , p α ( r ) + ( S ^ r · S ^ p α ( r ) ) 2 ) | Z ( l ) |
    The value of {circumflex over (β)}(l)(t + 1) at each level l is obtained by performing a
    weighted least squares at level l with V(t + 1) as estimate of V.
  • FIGS. 1 through 6 are conceptual illustrations allowing for an explanation of the present invention. It should be understood that various aspects of the embodiments of the present invention could be implemented in hardware, firmware, software, or combinations thereof. In such embodiments, the various components and/or steps would be implemented in hardware, firmware, and/or software to perform the functions of the present invention. That is, the same piece of hardware, firmware, or module of software could perform one or more of the illustrated blocks (e.g., components or steps).
  • In software implementations, computer software (e.g., programs or other instructions) and/or data is stored on a machine readable medium as part of a computer program product, and is loaded into a computer system or other device or machine via a removable storage drive, hard drive, or communications interface. Computer programs (also called computer control logic or computer readable program code) are stored in a main and/or secondary memory, and executed by one or more processors (controllers, or the like) to cause the one or more processors to perform the functions of the invention as described herein. In this document, the terms “machine readable medium,” “computer program medium” and “computer usable medium” are used to generally refer to media such as a random access memory (RAM); a read only memory (ROM); a removable storage unit (e.g., a magnetic or optical disc, flash memory device, or the like); a hard disk; electronic, electromagnetic, optical, acoustical, or other form of propagated signals (e.g., carrier waves, infrared signals, digital signals, etc.); or the like.
  • Notably, the figures and examples above are not meant to limit the scope of the present invention to a single embodiment, as other embodiments are possible by way of interchange of some or all of the described or illustrated elements. Moreover, where certain elements of the present invention can be partially or fully implemented using known components, only those portions of such known components that are necessary for an understanding of the present invention are described, and detailed descriptions of other portions of such known components are omitted so as not to obscure the invention. In the present specification, an embodiment showing a singular component should not necessarily be limited to other embodiments including a plurality of the same component, and vice-versa, unless explicitly stated otherwise herein. Moreover, applicants do not intend for any term in the specification or claims to be ascribed an uncommon or special meaning unless explicitly set forth as such. Further, the present invention encompasses present and future known equivalents to the known components referred to herein by way of illustration.
  • The foregoing description of the specific embodiments will so fully reveal the general nature of the invention that others can, by applying knowledge within the skill of the relevant art(s) (including the contents of the documents cited and incorporated by reference herein), readily modify and/or adapt for various applications such specific embodiments, without undue experimentation, without departing from the general concept of the present invention. Such adaptations and modifications are therefore intended to be within the meaning and range of equivalents of the disclosed embodiments, based on the teaching and guidance presented herein. It is to be understood that the phraseology or terminology herein is for the purpose of description and not of limitation, such that the terminology or phraseology of the present specification is to be interpreted by the skilled artisan in light of the teachings and guidance presented herein, in combination with the knowledge of one skilled in the relevant art(s).
  • While various embodiments of the present invention have been described above, it should be understood that they have been presented by way of example, and not limitation. It would be apparent to one skilled in the relevant art(s) that various changes in form and detail could be made therein without departing from the spirit and scope of the invention. Thus, the present invention should not be limited by any of the above-described exemplary embodiments, but should be defined only in accordance with the following claims and their equivalents.

Claims (25)

1. A computer-implemented method, comprising:
electronically, obtaining, via a processing device, a sample set of content items, each of the content items including a plurality of features associated with at least one region in a hierarchical data structure, the hierarchical data structure comprising nodes in an advertisement taxonomy hierarchy and nodes in a page taxonomy hierarchy, with the at least one region identified by a combination of nodes from the advertisement taxonomy hierarchy and nodes from the page taxonomy hierarchy, wherein the sample set is representation of a whole set of content items including features associated with the at least one region;
determining a first impression volume for each of the features corresponding to at least one region as a function of a number of impressions registered for a given content item from the sample set of content items;
applying a scale factor to the first impression volume to generate a second impression volume, the scale factor being selected so that the second impression volume is within a predefined range of a third impression volume;
electronically, estimating, via the processing device, a click-through-rate (CTR) as a function of the second impression volume and a number of clicks on the content item.
2. The method according to claim 1, wherein the content items include at least one of webpages and ads.
3. The method according to claim 1, wherein the obtaining includes:
identifying first content items that have been clicked;
identifying a predetermined number of second content items that have not been clicked; and
generating the sample set as a function of the first and second content items.
4. The method according to claim 3, further comprising:
calculating the first impression volume as a function of the impressions for the first and second content items.
5. The method according to claim 1, wherein the third impression volume is a total number of impressions associated within a preselected level in the hierarchical data structure.
6. The method according to claim 1, wherein the estimating includes:
assigning a state variable to each of the at least one region; and
applying a Markovian model to the state variable to estimate the CTR.
7. The method according to claim 6, wherein the applying includes:
computing a posterior for the state variable using a Kalman filter; and
propagating the posterior to the at least one region; and
repeating the computing and the propagating until convergence of the state variable to the CTR.
8. The method according to claim 7, further comprising:
upon the convergence, identifying the CTR for the at least one region.
9. The method according to claim 1, further comprising:
storing the CTR on a storage medium.
10. Computer readable media comprising program code that when executed by a programmable processor causes the processor to execute a method, the method comprising:
obtaining a sample set of content items, each of the content items including a plurality of features associated with at least one region in a hierarchical data structure, the hierarchical data structure comprising nodes in an advertisement taxonomy hierarchy and nodes in a page taxonomy hierarchy, with the at least one region identified by a combination of nodes from the advertisement taxonomy hierarchy and nodes from content items including features associated with the at least one region;
determining a first impression volume for each of the features corresponding to at least one region as a function of a number of impressions registered for a given content item from the sample set of content items;
applying a scale factor to the first impression volume to generate a second impression volume, the scale factor being selected so that the second impression volume is within a predefined range of a third impression volume;
estimating a click-through-rate (CTR) as a function of the second impression volume and a number of clicks on the content item.
11. The computer readable media of claim 10, wherein the content items include at least one of webpages and ads.
12. The computer readable media of claim 10, wherein the obtaining includes:
identifying first content items that have been clicked;
identifying a predetermined number of second content items that have not been clicked; and
generating the sample set as a function of the first and second content items.
13. The computer readable media of claim 12, further comprising:
calculating the first impression volume as a function of the impressions for the first and second content items.
14. The computer readable media of claim 10, wherein the third impression volume is a total number of impressions associated within a preselected level in the hierarchical data structure.
15. The computer readable media of claim 10, wherein the estimating includes:
assigning a state variable to each of the at least one region; and
applying a Markovian model to the state variable to estimate the CTR.
16. The computer readable media of claim 15, wherein the applying includes:
computing a posterior for the state variable using a Kalman filter; and
propagating the posterior to the at least one region; and
repeating the computing and the propagating until convergence of the state variable to the CTR.
17. The computer readable media of claim 16, further comprising:
upon the convergence, identifying the CTR for the at least one region.
18. A system comprising a processor and a memory device storing executable instructions thereon that when executed causes the processor to perform a method comprising:
obtaining a sample set of content items, each of the content items including a plurality of features associated with at least one region in a hierarchical data structure, the hierarchical data structure comprising nodes in an advertisement taxonomy hierarchy and nodes in a page taxonomy hierarchy, with the at least one region identified by a combination of nodes from the advertisement taxonomy hierarchy and nodes from the page taxonomy hierarchy, wherein the sample set is representation of a whole set of content items including features associated with the at least one region;
determining a first impression volume for each of the features corresponding to at least one region as a function of a number of impressions registered for a given content item from the sample set of content items;
applying a scale factor to the first impression volume to generate a second impression volume, the scale factor being selected so that the second impression volume is within a predefined range of a third impression volume;
estimating a click-through-rate (CTR) as a function of the second impression volume and a number of clicks on the content item.
19. The system of claim 18, wherein the content items include at least one of webpages and ads.
20. The system of claim 18, wherein the obtaining includes:
identifying first content items that have been clicked;
identifying a predetermined number of second content items that have not been clicked; and
generating the sample set as a function of the first and second content items.
21. The system of claim 20, further comprising:
calculating the first impression volume as a function of the impressions for the first and second content items.
22. The system of claim 18, wherein the third impression volume is a total number of impressions associated within a preselected level in the hierarchical data structure.
23. The system of claim 18, wherein the estimating includes:
assigning a state variable to each of the at least one region; and
applying a Markovian model to the state variable to estimate the CTR.
24. The system of claim 23, wherein the applying includes:
computing a posterior for the state variable using a Kalman filter; and
propagating the posterior to the at least one region; and
repeating the computing and the propagating until convergence of the state variable to the CTR.
25. The system of claim 24, further comprising:
upon the convergence, identifying the CTR for the at least one region.
US13/034,835 2007-04-05 2011-02-25 System and method for determining an event occurrence rate Abandoned US20110153550A1 (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
US13/034,835 US20110153550A1 (en) 2007-04-05 2011-02-25 System and method for determining an event occurrence rate

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
US11/696,944 US7921073B2 (en) 2007-04-05 2007-04-05 System and method for determining impression volumes of content items in a taxonomy hierarchy
US13/034,835 US20110153550A1 (en) 2007-04-05 2011-02-25 System and method for determining an event occurrence rate

Related Parent Applications (1)

Application Number Title Priority Date Filing Date
US11/696,944 Continuation US7921073B2 (en) 2007-04-05 2007-04-05 System and method for determining impression volumes of content items in a taxonomy hierarchy

Publications (1)

Publication Number Publication Date
US20110153550A1 true US20110153550A1 (en) 2011-06-23

Family

ID=39827885

Family Applications (2)

Application Number Title Priority Date Filing Date
US11/696,944 Expired - Fee Related US7921073B2 (en) 2007-04-05 2007-04-05 System and method for determining impression volumes of content items in a taxonomy hierarchy
US13/034,835 Abandoned US20110153550A1 (en) 2007-04-05 2011-02-25 System and method for determining an event occurrence rate

Family Applications Before (1)

Application Number Title Priority Date Filing Date
US11/696,944 Expired - Fee Related US7921073B2 (en) 2007-04-05 2007-04-05 System and method for determining impression volumes of content items in a taxonomy hierarchy

Country Status (1)

Country Link
US (2) US7921073B2 (en)

Cited By (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US9053424B1 (en) * 2012-10-26 2015-06-09 Google Inc. Learning mechanism for recommended reordering of elements based on demographic information

Families Citing this family (10)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US7844605B2 (en) * 2007-04-20 2010-11-30 Yahoo! Inc. Using natural search click events to optimize online advertising campaigns
US8719298B2 (en) * 2009-05-21 2014-05-06 Microsoft Corporation Click-through prediction for news queries
US20110251878A1 (en) * 2010-04-13 2011-10-13 Yahoo! Inc. System for processing large amounts of data
US20120158456A1 (en) * 2010-12-20 2012-06-21 Xuerui Wang Forecasting Ad Traffic Based on Business Metrics in Performance-based Display Advertising
US8666916B2 (en) * 2011-07-07 2014-03-04 Yahoo! Inc. Method for summarizing event-related texts to answer search queries
US8666810B1 (en) * 2012-02-17 2014-03-04 Google Inc. System and method for online unique users and frequency estimation for advertisements on a video sharing website based on auction history results
US20140096146A1 (en) * 2012-09-28 2014-04-03 Hewlett-Packard Development Company, L.P. Translating time-stamped events to performance indicators
US20140304063A1 (en) * 2013-04-04 2014-10-09 Google Inc. Determining resource allocation for content distrubution
WO2018218058A1 (en) * 2017-05-25 2018-11-29 Collective, Inc. Systems and methods for providing real-time discrepancies between disparate execution platforms
US20200007919A1 (en) 2018-04-02 2020-01-02 The Nielsen Company (Us), Llc Processor systems to estimate audience sizes and impression counts for different frequency intervals

Citations (26)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US6466918B1 (en) * 1999-11-18 2002-10-15 Amazon. Com, Inc. System and method for exposing popular nodes within a browse tree
US20030088562A1 (en) * 2000-12-28 2003-05-08 Craig Dillon System and method for obtaining keyword descriptions of records from a large database
US6640218B1 (en) * 2000-06-02 2003-10-28 Lycos, Inc. Estimating the usefulness of an item in a collection of information
US20040044658A1 (en) * 2000-11-20 2004-03-04 Crabtree Ian B Information provider
US20040059708A1 (en) * 2002-09-24 2004-03-25 Google, Inc. Methods and apparatus for serving relevant advertisements
US20050021397A1 (en) * 2003-07-22 2005-01-27 Cui Yingwei Claire Content-targeted advertising using collected user behavior data
US20050033803A1 (en) * 2003-07-02 2005-02-10 Vleet Taylor N. Van Server architecture and methods for persistently storing and serving event data
US20050144067A1 (en) * 2003-12-19 2005-06-30 Palo Alto Research Center Incorporated Identifying and reporting unexpected behavior in targeted advertising environment
US6978263B2 (en) * 1999-05-28 2005-12-20 Overture Services, Inc. System and method for influencing a position on a search result list generated by a computer network search engine
US20060143084A1 (en) * 2004-12-28 2006-06-29 Boloto, Inc. Software and method for advertisor sponsored events within a private centrally managed local or distributed network of users and an optional associated private network card for specialty marketing identification or banking
US7136875B2 (en) * 2002-09-24 2006-11-14 Google, Inc. Serving advertisements based on content
US20070022005A1 (en) * 2005-07-21 2007-01-25 Hanna Nader G Method for requesting, displaying, and facilitating placement of an advertisement in a computer network
US20070027901A1 (en) * 2005-08-01 2007-02-01 John Chan Method and System for Developing and Managing A Computer-Based Marketing Campaign
US20070094073A1 (en) * 2005-10-24 2007-04-26 Rohit Dhawan Advertisements for initiating and/or establishing user-advertiser telephone calls
US20070124425A1 (en) * 2005-11-30 2007-05-31 Gross John N System & Method of Delivering Content Based Advertising
US20070150353A1 (en) * 2005-12-24 2007-06-28 Rich Media Club, Llc System and method for creation, distribution and tracking of advertising via electronic networks
US20070156677A1 (en) * 1999-07-21 2007-07-05 Alberti Anemometer Llc Database access system
US20070174118A1 (en) * 2006-01-24 2007-07-26 Elan Dekel Facilitating client-side management of online advertising information, such as advertising account information
US20070214132A1 (en) * 2005-09-27 2007-09-13 Grubb Michael L Collection and delivery of internet ads
US20070288514A1 (en) * 2006-06-09 2007-12-13 Ebay Inc. System and method for keyword extraction
US20080046411A1 (en) * 2004-05-29 2008-02-21 Nhn Corporation Method and System for Managing the Impressing of the Search Listing Based on Advertisement Group
US20080077504A1 (en) * 2006-09-20 2008-03-27 Ebay Inc. Listing generation utilizing catalog information
US20080120278A1 (en) * 2006-11-16 2008-05-22 Miva, Inc. System and method for managing search results and delivering advertising and enhanced effectiveness
US20080140591A1 (en) * 2006-12-12 2008-06-12 Yahoo! Inc. System and method for matching objects belonging to hierarchies
US20080215623A1 (en) * 2005-09-14 2008-09-04 Jorey Ramer Mobile communication facility usage and social network creation
US7788245B1 (en) * 2005-06-16 2010-08-31 Google Inc. Method and system for dynamically generating search links embedded in content

Family Cites Families (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20040225562A1 (en) * 2003-05-09 2004-11-11 Aquantive, Inc. Method of maximizing revenue from performance-based internet advertising agreements

Patent Citations (29)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US7464079B2 (en) * 1999-05-28 2008-12-09 Yahoo! Inc. System and method for influencing a position on a search result list generated by a computer network search engine
US6978263B2 (en) * 1999-05-28 2005-12-20 Overture Services, Inc. System and method for influencing a position on a search result list generated by a computer network search engine
US20070156677A1 (en) * 1999-07-21 2007-07-05 Alberti Anemometer Llc Database access system
US6466918B1 (en) * 1999-11-18 2002-10-15 Amazon. Com, Inc. System and method for exposing popular nodes within a browse tree
US6640218B1 (en) * 2000-06-02 2003-10-28 Lycos, Inc. Estimating the usefulness of an item in a collection of information
US20040044658A1 (en) * 2000-11-20 2004-03-04 Crabtree Ian B Information provider
US20030088562A1 (en) * 2000-12-28 2003-05-08 Craig Dillon System and method for obtaining keyword descriptions of records from a large database
US7136875B2 (en) * 2002-09-24 2006-11-14 Google, Inc. Serving advertisements based on content
US20040059708A1 (en) * 2002-09-24 2004-03-25 Google, Inc. Methods and apparatus for serving relevant advertisements
US20060259455A1 (en) * 2002-09-24 2006-11-16 Darrell Anderson Serving advertisements based on content
US20050033803A1 (en) * 2003-07-02 2005-02-10 Vleet Taylor N. Van Server architecture and methods for persistently storing and serving event data
US20090276407A1 (en) * 2003-07-02 2009-11-05 Van Vleet Taylor N Server architecture and methods for storing and serving event data
US20050021397A1 (en) * 2003-07-22 2005-01-27 Cui Yingwei Claire Content-targeted advertising using collected user behavior data
US20050144067A1 (en) * 2003-12-19 2005-06-30 Palo Alto Research Center Incorporated Identifying and reporting unexpected behavior in targeted advertising environment
US20080046411A1 (en) * 2004-05-29 2008-02-21 Nhn Corporation Method and System for Managing the Impressing of the Search Listing Based on Advertisement Group
US20060143084A1 (en) * 2004-12-28 2006-06-29 Boloto, Inc. Software and method for advertisor sponsored events within a private centrally managed local or distributed network of users and an optional associated private network card for specialty marketing identification or banking
US7788245B1 (en) * 2005-06-16 2010-08-31 Google Inc. Method and system for dynamically generating search links embedded in content
US20070022005A1 (en) * 2005-07-21 2007-01-25 Hanna Nader G Method for requesting, displaying, and facilitating placement of an advertisement in a computer network
US20070027901A1 (en) * 2005-08-01 2007-02-01 John Chan Method and System for Developing and Managing A Computer-Based Marketing Campaign
US20080215623A1 (en) * 2005-09-14 2008-09-04 Jorey Ramer Mobile communication facility usage and social network creation
US20070214132A1 (en) * 2005-09-27 2007-09-13 Grubb Michael L Collection and delivery of internet ads
US20070094073A1 (en) * 2005-10-24 2007-04-26 Rohit Dhawan Advertisements for initiating and/or establishing user-advertiser telephone calls
US20070124425A1 (en) * 2005-11-30 2007-05-31 Gross John N System & Method of Delivering Content Based Advertising
US20070150353A1 (en) * 2005-12-24 2007-06-28 Rich Media Club, Llc System and method for creation, distribution and tracking of advertising via electronic networks
US20070174118A1 (en) * 2006-01-24 2007-07-26 Elan Dekel Facilitating client-side management of online advertising information, such as advertising account information
US20070288514A1 (en) * 2006-06-09 2007-12-13 Ebay Inc. System and method for keyword extraction
US20080077504A1 (en) * 2006-09-20 2008-03-27 Ebay Inc. Listing generation utilizing catalog information
US20080120278A1 (en) * 2006-11-16 2008-05-22 Miva, Inc. System and method for managing search results and delivering advertising and enhanced effectiveness
US20080140591A1 (en) * 2006-12-12 2008-06-12 Yahoo! Inc. System and method for matching objects belonging to hierarchies

Cited By (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US9053424B1 (en) * 2012-10-26 2015-06-09 Google Inc. Learning mechanism for recommended reordering of elements based on demographic information

Also Published As

Publication number Publication date
US20080250033A1 (en) 2008-10-09
US7921073B2 (en) 2011-04-05

Similar Documents

Publication Publication Date Title
US7921073B2 (en) System and method for determining impression volumes of content items in a taxonomy hierarchy
US7672894B2 (en) Automated bidding system for use with online auctions
US8572011B1 (en) Outcome estimation models trained using regression and ranking techniques
US10580035B2 (en) Promotion selection for online customers using Bayesian bandits
Nekipelov et al. Econometrics for learning agents
US8738436B2 (en) Click through rate prediction system and method
US8255563B2 (en) Method and system for determining overall content values for content elements in a web network and for optimizing internet traffic flow through the web network
US8417559B2 (en) Assortment planning based on demand transfer between products
US7287000B2 (en) Configurable pricing optimization system
US7594189B1 (en) Systems and methods for statistically selecting content items to be used in a dynamically-generated display
US8650084B2 (en) Tool for analysis of advertising auctions
US7680746B2 (en) Prediction of click through rates using hybrid kalman filter-tree structured markov model classifiers
US20110035272A1 (en) Feature-value recommendations for advertisement campaign performance improvement
US20100010895A1 (en) Prediction of a degree of relevance between query rewrites and a search query
Paulson et al. Efficient large-scale internet media selection optimization for online display advertising
US20160132935A1 (en) Systems, methods, and apparatus for flexible extension of an audience segment
US20100082433A1 (en) Using A Threshold Function For Bidding In Online Auctions
CN104317822B (en) The ascribed characteristics of population Forecasting Methodology and device of the network user
US8880518B2 (en) Ranking products using purchase day based time windows
Yang et al. Large scale CVR prediction through dynamic transfer learning of global and local features
Hummel et al. Loss functions for predicted click‐through rates in auctions for online advertising
US20180300738A1 (en) Method and system for forecasting product sales on model-free prediction basis
US10937087B2 (en) Systems and methods for optimal bidding in a business to business environment
Kim et al. RETRACTED: An empirical test to measure the effectiveness of online advertising in online marketplaces using a hierarchical Bayes model
Danthurebandara et al. Designing choice experiments by optimizing the complexity level to individual abilities

Legal Events

Date Code Title Description
STCB Information on status: application discontinuation

Free format text: ABANDONED -- FAILURE TO RESPOND TO AN OFFICE ACTION

AS Assignment

Owner name: YAHOO HOLDINGS, INC., CALIFORNIA

Free format text: ASSIGNMENT OF ASSIGNORS INTEREST;ASSIGNOR:YAHOO| INC.;REEL/FRAME:042963/0211

Effective date: 20170613

AS Assignment

Owner name: OATH INC., NEW YORK

Free format text: ASSIGNMENT OF ASSIGNORS INTEREST;ASSIGNOR:YAHOO HOLDINGS, INC.;REEL/FRAME:045240/0310

Effective date: 20171231